1
|
Reyna-Blanco CS, Caduff M, Galimberti M, Leuenberger C, Wegmann D. Inference of Locus-Specific Population Mixtures from Linked Genome-Wide Allele Frequencies. Mol Biol Evol 2024; 41:msae137. [PMID: 38958167 PMCID: PMC11255385 DOI: 10.1093/molbev/msae137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 06/26/2024] [Accepted: 06/27/2024] [Indexed: 07/04/2024] Open
Abstract
Admixture between populations and species is common in nature. Since the influx of new genetic material might be either facilitated or hindered by selection, variation in mixture proportions along the genome is expected in organisms undergoing recombination. Various graph-based models have been developed to better understand these evolutionary dynamics of population splits and mixtures. However, current models assume a single mixture rate for the entire genome and do not explicitly account for linkage. Here, we introduce TreeSwirl, a novel method for inferring branch lengths and locus-specific mixture proportions by using genome-wide allele frequency data, assuming that the admixture graph is known or has been inferred. TreeSwirl builds upon TreeMix that uses Gaussian processes to estimate the presence of gene flow between diverged populations. However, in contrast to TreeMix, our model infers locus-specific mixture proportions employing a hidden Markov model that accounts for linkage. Through simulated data, we demonstrate that TreeSwirl can accurately estimate locus-specific mixture proportions and handle complex demographic scenarios. It also outperforms related D- and f-statistics in terms of accuracy and sensitivity to detect introgressed loci.
Collapse
Affiliation(s)
- Carlos S Reyna-Blanco
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| | - Madleina Caduff
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| | - Marco Galimberti
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
- Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
| | | | - Daniel Wegmann
- Department of Biology, University of Fribourg, Fribourg 1700, Switzerland
- Swiss Institute of Bioinformatics, Fribourg 1700, Switzerland
| |
Collapse
|
2
|
Ray DD, Flagel L, Schrider DR. IntroUNET: Identifying introgressed alleles via semantic segmentation. PLoS Genet 2024; 20:e1010657. [PMID: 38377104 PMCID: PMC10906877 DOI: 10.1371/journal.pgen.1010657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 03/01/2024] [Accepted: 01/29/2024] [Indexed: 02/22/2024] Open
Abstract
A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient-ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila, showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.
Collapse
Affiliation(s)
- Dylan D. Ray
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Lex Flagel
- Division of Data Science, Gencove Inc., New York, New York, United States of America
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Daniel R. Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| |
Collapse
|
3
|
Ray DD, Flagel L, Schrider DR. IntroUNET: identifying introgressed alleles via semantic segmentation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.07.527435. [PMID: 36865105 PMCID: PMC9979274 DOI: 10.1101/2023.02.07.527435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient-ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila, showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.
Collapse
Affiliation(s)
- Dylan D. Ray
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Lex Flagel
- Division of Data Science, Gencove Inc., New York, NY 11101, USA
- Department of Plant and Microbial Biology, University of Minnesota, St Paul MN, 55108, USA
| | - Daniel R. Schrider
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
4
|
Zhang Y, Zhu Q, Shao Y, Jiang Y, Ouyang Y, Zhang L, Zhang W. Inferring Historical Introgression with Deep Learning. Syst Biol 2023; 72:1013-1038. [PMID: 37257491 DOI: 10.1093/sysbio/syad033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 05/28/2023] [Accepted: 05/30/2023] [Indexed: 06/02/2023] Open
Abstract
Resolving phylogenetic relationships among taxa remains a challenge in the era of big data due to the presence of genetic admixture in a wide range of organisms. Rapidly developing sequencing technologies and statistical tests enable evolutionary relationships to be disentangled at a genome-wide level, yet many of these tests are computationally intensive and rely on phased genotypes, large sample sizes, restricted phylogenetic topologies, or hypothesis testing. To overcome these difficulties, we developed a deep learning-based approach, named ERICA, for inferring genome-wide evolutionary relationships and local introgressed regions from sequence data. ERICA accepts sequence alignments of both population genomic data and multiple genome assemblies, and efficiently identifies discordant genealogy patterns and exchanged regions across genomes when compared with other methods. We further tested ERICA using real population genomic data from Heliconius butterflies that have undergone adaptive radiation and frequent hybridization. Finally, we applied ERICA to characterize hybridization and introgression in wild and cultivated rice, revealing the important role of introgression in rice domestication and adaptation. Taken together, our findings demonstrate that ERICA provides an effective method for teasing apart evolutionary relationships using whole genome data, which can ultimately facilitate evolutionary studies on hybridization and introgression.
Collapse
Affiliation(s)
- Yubo Zhang
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Qingjie Zhu
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Yi Shao
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Yanchen Jiang
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China
| | - Yidan Ouyang
- National Key Laboratory of Crop Genetic Improvement and National Centre of Plant Gene Research (Wuhan), Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan 430070, China
| | - Li Zhang
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Wei Zhang
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
5
|
Pawar H, Rymbekova A, Cuadros-Espinoza S, Huang X, de Manuel M, van der Valk T, Lobon I, Alvarez-Estape M, Haber M, Dolgova O, Han S, Esteller-Cucala P, Juan D, Ayub Q, Bautista R, Kelley JL, Cornejo OE, Lao O, Andrés AM, Guschanski K, Ssebide B, Cranfield M, Tyler-Smith C, Xue Y, Prado-Martinez J, Marques-Bonet T, Kuhlwilm M. Ghost admixture in eastern gorillas. Nat Ecol Evol 2023; 7:1503-1514. [PMID: 37500909 PMCID: PMC10482688 DOI: 10.1038/s41559-023-02145-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 06/30/2023] [Indexed: 07/29/2023]
Abstract
Archaic admixture has had a substantial impact on human evolution with multiple events across different clades, including from extinct hominins such as Neanderthals and Denisovans into modern humans. In great apes, archaic admixture has been identified in chimpanzees and bonobos but the possibility of such events has not been explored in other species. Here, we address this question using high-coverage whole-genome sequences from all four extant gorilla subspecies, including six newly sequenced eastern gorillas from previously unsampled geographic regions. Using approximate Bayesian computation with neural networks to model the demographic history of gorillas, we find a signature of admixture from an archaic 'ghost' lineage into the common ancestor of eastern gorillas but not western gorillas. We infer that up to 3% of the genome of these individuals is introgressed from an archaic lineage that diverged more than 3 million years ago from the common ancestor of all extant gorillas. This introgression event took place before the split of mountain and eastern lowland gorillas, probably more than 40 thousand years ago and may have influenced perception of bitter taste in eastern gorillas. When comparing the introgression landscapes of gorillas, humans and bonobos, we find a consistent depletion of introgressed fragments on the X chromosome across these species. However, depletion in protein-coding content is not detectable in eastern gorillas, possibly as a consequence of stronger genetic drift in this species.
Collapse
Affiliation(s)
- Harvinder Pawar
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
| | - Aigerim Rymbekova
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Wien, Austria
| | | | - Xin Huang
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Wien, Austria
| | - Marc de Manuel
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
| | - Tom van der Valk
- Department of Bioinformatics and Genetics, Scilifelab, Swedish Museum of Natural History, Stockholm, Sweden
- Centre for Palaeogenetics, Stockholm, Sweden
| | - Irene Lobon
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
| | | | - Marc Haber
- Institute of Cancer and Genomic Sciences, University of Birmingham, Dubai, United Arab Emirates
| | - Olga Dolgova
- Integrative Genomics Lab, CIC bioGUNE-Centro de Investigación Cooperativa en Biociencias, Parque Científico Tecnológico de Bizkaia building 801A, Derio, Spain
| | - Sojung Han
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Wien, Austria
| | | | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
| | - Qasim Ayub
- Wellcome Sanger Institute, Hinxton, UK
- Monash University Malaysia Genomics Facility, School of Science, Monash University Malaysia, Selangor Darul Ehsan, Malaysia
| | | | - Joanna L Kelley
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Omar E Cornejo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Oscar Lao
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
| | - Aida M Andrés
- UCL Genetics Institute, Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Katerina Guschanski
- Animal Ecology, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
- Science for Life Laboratory, Uppsala, Sweden
| | | | - Mike Cranfield
- Gorilla Doctors, Karen C. Drayer Wildlife Health Center, One Health Institute, University of California Davis, School of Veterinary Medicine, Davis, CA, USA
| | | | - Yali Xue
- Wellcome Sanger Institute, Hinxton, UK
| | - Javier Prado-Martinez
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
- Wellcome Sanger Institute, Hinxton, UK
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain.
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, Barcelona, Spain.
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, Barcelona, Spain.
| | - Martin Kuhlwilm
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain.
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria.
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Wien, Austria.
| |
Collapse
|
6
|
Brand CM, Colbran LL, Capra JA. Resurrecting the alternative splicing landscape of archaic hominins using machine learning. Nat Ecol Evol 2023; 7:939-953. [PMID: 37142741 PMCID: PMC11440953 DOI: 10.1038/s41559-023-02053-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 03/29/2023] [Indexed: 05/06/2023]
Abstract
Alternative splicing contributes to adaptation and divergence in many species. However, it has not been possible to directly compare splicing between modern and archaic hominins. Here, we unmask the recent evolution of this previously unobservable regulatory mechanism by applying SpliceAI, a machine-learning algorithm that identifies splice-altering variants (SAVs), to high-coverage genomes from three Neanderthals and a Denisovan. We discover 5,950 putative archaic SAVs, of which 2,186 are archaic-specific and 3,607 also occur in modern humans via introgression (244) or shared ancestry (3,520). Archaic-specific SAVs are enriched in genes that contribute to traits potentially relevant to hominin phenotypic divergence, such as the epidermis, respiration and spinal rigidity. Compared to shared SAVs, archaic-specific SAVs occur in sites under weaker selection and are more common in genes with tissue-specific expression. Further underscoring the importance of negative selection on SAVs, Neanderthal lineages with low effective population sizes are enriched for SAVs compared to Denisovan and shared SAVs. Finally, we find that nearly all introgressed SAVs in humans were shared across the three Neanderthals, suggesting that older SAVs were more tolerated in human genomes. Our results reveal the splicing landscape of archaic hominins and identify potential contributions of splicing to phenotypic differences among hominins.
Collapse
Affiliation(s)
- Colin M Brand
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Laura L Colbran
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - John A Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA.
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA.
| |
Collapse
|
7
|
Ragsdale AP, Weaver TD, Atkinson EG, Hoal EG, Möller M, Henn BM, Gravel S. A weakly structured stem for human origins in Africa. Nature 2023; 617:755-763. [PMID: 37198480 PMCID: PMC10208968 DOI: 10.1038/s41586-023-06055-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 04/05/2023] [Indexed: 05/19/2023]
Abstract
Despite broad agreement that Homo sapiens originated in Africa, considerable uncertainty surrounds specific models of divergence and migration across the continent1. Progress is hampered by a shortage of fossil and genomic data, as well as variability in previous estimates of divergence times1. Here we seek to discriminate among such models by considering linkage disequilibrium and diversity-based statistics, optimized for rapid, complex demographic inference2. We infer detailed demographic models for populations across Africa, including eastern and western representatives, and newly sequenced whole genomes from 44 Nama (Khoe-San) individuals from southern Africa. We infer a reticulated African population history in which present-day population structure dates back to Marine Isotope Stage 5. The earliest population divergence among contemporary populations occurred 120,000 to 135,000 years ago and was preceded by links between two or more weakly differentiated ancestral Homo populations connected by gene flow over hundreds of thousands of years. Such weakly structured stem models explain patterns of polymorphism that had previously been attributed to contributions from archaic hominins in Africa2-7. In contrast to models with archaic introgression, we predict that fossil remains from coexisting ancestral populations should be genetically and morphologically similar, and that only an inferred 1-4% of genetic differentiation among contemporary human populations can be attributed to genetic drift between stem populations. We show that model misspecification explains the variation in previous estimates of divergence times, and argue that studying a range of models is key to making robust inferences about deep history.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | - Timothy D Weaver
- Department of Anthropology, University of California, Davis, CA, USA
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Eileen G Hoal
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Marlo Möller
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Brenna M Henn
- Department of Anthropology, University of California, Davis, CA, USA.
- Genome Center, University of California, Davis, CA, USA.
| | - Simon Gravel
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
8
|
Pfennig A, Petersen LN, Kachambwa P, Lachance J. Evolutionary Genetics and Admixture in African Populations. Genome Biol Evol 2023; 15:evad054. [PMID: 36987563 PMCID: PMC10118306 DOI: 10.1093/gbe/evad054] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 03/15/2023] [Accepted: 03/22/2023] [Indexed: 03/30/2023] Open
Abstract
As the ancestral homeland of our species, Africa contains elevated levels of genetic diversity and substantial population structure. Importantly, African genomes are heterogeneous: They contain mixtures of multiple ancestries, each of which have experienced different evolutionary histories. In this review, we view population genetics through the lens of admixture, highlighting how multiple demographic events have shaped African genomes. Each of these historical vignettes paints a recurring picture of population divergence followed by secondary contact. First, we give a brief overview of genetic variation in Africa and examine deep population structure within Africa, including the evidence of ancient introgression from archaic "ghost" populations. Second, we describe the genetic legacies of admixture events that have occurred during the past 10,000 years. This includes gene flow between different click-speaking Khoe-San populations, the stepwise spread of pastoralism from eastern to southern Africa, multiple migrations of Bantu speakers across the continent, as well as admixture from the Middle East and Europe into the Sahel region and North Africa. Furthermore, the genomic signatures of more recent admixture can be found in the Cape Peninsula and throughout the African diaspora. Third, we highlight how natural selection has shaped patterns of genetic variation across the continent, noting that gene flow provides a potent source of adaptive variation and that selective pressures vary across Africa. Finally, we explore the biomedical implications of population structure in Africa on health and disease and call for more ethically conducted studies of genetic variation in Africa.
Collapse
Affiliation(s)
- Aaron Pfennig
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | | | | | - Joseph Lachance
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| |
Collapse
|
9
|
Watanabe Y, Ohashi J. Modern Japanese ancestry-derived variants reveal the formation process of the current Japanese regional gradations. iScience 2023; 26:106130. [PMID: 36879818 PMCID: PMC9984562 DOI: 10.1016/j.isci.2023.106130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/02/2022] [Accepted: 01/31/2023] [Indexed: 02/05/2023] Open
Abstract
Modern Japanese people have two major ancestral populations: indigenous Jomon hunter-gatherers and continental East Asian farmers. To determine the formation process of the current Japanese population, we developed a detection method for variants derived from ancestral populations using a summary statistic, the ancestry marker index (AMI). We applied AMI to modern Japanese population samples and identified 208,648 single nucleotide polymorphisms (SNPs) that were likely derived from the Jomon people (Jomon-derived variants). Analysis of Jomon-derived variants in 10,842 modern Japanese individuals recruited from all over Japan revealed that the admixture proportions of the Jomon people varied between prefectures, probably owing to the prehistoric population size difference. The estimated allele frequencies of genome-wide SNPs in the ancestral populations of the modern Japanese suggested their adaptive phenotypic characteristics to their respective livelihoods. Based on our findings, we propose a formation model for the genotypic and phenotypic gradations of the current Japanese archipelago populations.
Collapse
Affiliation(s)
- Yusuke Watanabe
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo 113-0033, Japan.,Genome Medical Science Project Toyama Project, National Center for Global Health and Medicine, Tokyo 162-8655, Japan
| | - Jun Ohashi
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo 113-0033, Japan
| |
Collapse
|
10
|
Estimating bonobo ( Pan paniscus) and chimpanzee ( Pan troglodytes) evolutionary history from nucleotide site patterns. Proc Natl Acad Sci U S A 2022; 119:e2200858119. [PMID: 35452306 PMCID: PMC9170072 DOI: 10.1073/pnas.2200858119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
There is genomic evidence of widespread admixture in deep time between many closely related species, including humans. Our closest living relatives, bonobos and chimpanzees, may also exhibit such patterns. However, assessing the exact degree of interbreeding remains challenging because previous studies have resulted in multiple inconsistent demographic models. We use an approach that addresses these gaps by analyzing all lineages, simultaneously estimating parameters, and comparing previously models. We find evidence of considerable introgression from western into eastern chimpanzees. We also show more breeding females than males and evidence of male-biased dispersal in western chimpanzees. These findings highlight the extent of admixture in bonobo and chimpanzee evolutionary history and are consistent with substantial differences between past and present chimpanzee biogeography. Admixture appears increasingly ubiquitous in the evolutionary history of various taxa, including humans. Such gene flow likely also occurred among our closest living relatives: bonobos (Pan paniscus) and chimpanzees (Pan troglodytes). However, our understanding of their evolutionary history has been limited by studies that do not consider all Pan lineages or do not analyze all lineages simultaneously, resulting in conflicting demographic models. Here, we investigate this gap in knowledge using nucleotide site patterns calculated from whole-genome sequences from the autosomes of 71 bonobos and chimpanzees, representing all five extant Pan lineages. We estimated demographic parameters and compared all previously proposed demographic models for this clade. We further considered sex bias in Pan evolutionary history by analyzing the site patterns from the X chromosome. We show that 1) 21% of autosomal DNA in eastern chimpanzees derives from western chimpanzee introgression and that 2) all four chimpanzee lineages share a common ancestor about 987,000 y ago, much earlier than previous estimates. In addition, we suggest that 3) there was male reproductive skew throughout Pan evolutionary history and find evidence of 4) male-biased dispersal from western to eastern chimpanzees. Collectively, these results offer insight into bonobo and chimpanzee evolutionary history and suggest considerable differences between current and historic chimpanzee biogeography.
Collapse
|
11
|
Weasel L. How Neanderthals became White: The introgression of race into contemporary human evolutionary genomics. Am Nat 2022; 200:129-139. [DOI: 10.1086/720130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
|
12
|
Hayakawa T, Terahara M, Fujito NT, Matsunaga T, Teshima KM, Hane M, Kitajima K, Sato C, Takahata N, Satta Y. Lower promoter activity of the ST8SIA2 gene has been favored in evolving human collective brains. PLoS One 2021; 16:e0259897. [PMID: 34914745 PMCID: PMC8675693 DOI: 10.1371/journal.pone.0259897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 10/28/2021] [Indexed: 11/18/2022] Open
Abstract
ST8SIA2 is an important molecule regulating expression of the phenotype involved in schizophrenia. Lowered promoter activity of the ST8SIA2 gene is considered to be protective against schizophrenia by conferring tolerance to psychosocial stress. Here, we examined the promoter-type composition of anatomically modern humans (AMHs) and archaic humans (AHs; Neanderthals and Denisovans), and compared the promoter activity at the population level (population promoter activity; PPA) between them. In AMHs, the TCT-type, showing the second lowest promoter activity, was most prevalent in the ancestral population of non-Africans. However, the detection of only the CGT-type from AH samples and recombination tracts in AH sequences showed that the CGT- and TGT-types, exhibiting the two highest promoter activities, were common in AH populations. Furthermore, interspecies gene flow occurred into AMHs from AHs and into Denisovans from Neanderthals, influencing promoter-type compositions independently in both AMHs and AHs. The difference of promoter-type composition makes PPA unique in each population. East and Southeast Asian populations show the lowest PPA. This results from the selective increase of the CGC-type, showing the lowest promoter activity, in these populations. Every non-African population shows significantly lower PPA than African populations, resulting from the TCT-type having the highest prevalence in the ancestral population of non-Africans. In addition, PPA reduction is also found among subpopulations within Africa via a slight increase of the TCT-type. These findings indicate a trend toward lower PPA in the spread of AMHs, interpreted as a continuous adaptation to psychosocial stress arising in migration. This trend is considered as genetic tuning for the evolution of collective brains. The inferred promoter-type composition of AHs differed markedly from that of AMHs, resulting in higher PPA in AHs than in AMHs. This suggests that the trend toward lower PPA is a unique feature in AMH spread.
Collapse
Affiliation(s)
- Toshiyuki Hayakawa
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
- Faculty of Arts and Science, Kyushu University, Fukuoka, Japan
- * E-mail:
| | - Masahiro Terahara
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
| | - Naoko T. Fujito
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| | - Takumi Matsunaga
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
| | | | - Masaya Hane
- Bioscience and Biotechnology Center, Nagoya University, Nagoya, Aichi, Japan
| | - Ken Kitajima
- Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Aichi, Japan
| | - Chihiro Sato
- Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Aichi, Japan
| | - Naoyuki Takahata
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| | - Yoko Satta
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| |
Collapse
|
13
|
Refining models of archaic admixture in Eurasia with ArchaicSeeker 2.0. Nat Commun 2021; 12:6232. [PMID: 34716342 PMCID: PMC8556419 DOI: 10.1038/s41467-021-26503-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Accepted: 10/06/2021] [Indexed: 12/30/2022] Open
Abstract
We developed a method, ArchaicSeeker 2.0, to identify introgressed hominin sequences and model multiple-wave admixture. The new method enabled us to discern two waves of introgression from both Denisovan-like and Neanderthal-like hominins in present-day Eurasian populations and an ancient Siberian individual. We estimated that an early Denisovan-like introgression occurred in Eurasia around 118.8-94.0 thousand years ago (kya). In contrast, we detected only one single episode of Denisovan-like admixture in indigenous peoples eastern to the Wallace-Line. Modeling ancient admixtures suggested an early dispersal of modern humans throughout Asia before the Toba volcanic super-eruption 74 kya, predating the initial peopling of Asia as proposed by the traditional Out-of-Africa model. Survived archaic sequences are involved in various phenotypes including immune and body mass (e.g., ZNF169), cardiovascular and lung function (e.g., HHAT), UV response and carbohydrate metabolism (e.g., HYAL1/HYAL2/HYAL3), while "archaic deserts" are enriched with genes associated with skin development and keratinization.
Collapse
|
14
|
Li Y, Wu DD. Finding unknown species in the genomes of extant species. J Genet Genomics 2021; 48:867-871. [PMID: 34509382 DOI: 10.1016/j.jgg.2021.05.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 05/21/2021] [Accepted: 05/23/2021] [Indexed: 11/18/2022]
Abstract
Although many species have gone extinct, their genetic components might exist in extant species because of ancient hybridization. Via advances in genome sequencing and development of modern population genetics, one can find the legacy of unknown or extinct species in the context of available genomes from extant species. Such discovery can be used as a strategy to search for hidden species or fossils in conservation biology and archeology, gain novel insight into complex evolutionary history, and provide the new sources of genetic variation for breeding and trait improvement in agriculture.
Collapse
Affiliation(s)
- Yan Li
- State Key Laboratory for Conservation and Utilization of Bio-resource in Yunnan and School of Life Science & School of Ecology and Environmental Science, Yunnan University, Kunming 650091, China.
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
15
|
Ahlquist KD, Bañuelos MM, Funk A, Lai J, Rong S, Villanea FA, Witt KE. Our Tangled Family Tree: New Genomic Methods Offer Insight into the Legacy of Archaic Admixture. Genome Biol Evol 2021; 13:evab115. [PMID: 34028527 PMCID: PMC8480178 DOI: 10.1093/gbe/evab115] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/07/2021] [Accepted: 05/22/2021] [Indexed: 11/30/2022] Open
Abstract
The archaic ancestry present in the human genome has captured the imagination of both scientists and the wider public in recent years. This excitement is the result of new studies pushing the envelope of what we can learn from the archaic genetic information that has survived for over 50,000 years in the human genome. Here, we review the most recent ten years of literature on the topic of archaic introgression, including the current state of knowledge on Neanderthal and Denisovan introgression, as well as introgression from other as-yet unidentified archaic populations. We focus this review on four topics: 1) a reimagining of human demographic history, including evidence for multiple admixture events between modern humans, Neanderthals, Denisovans, and other archaic populations; 2) state-of-the-art methods for detecting archaic ancestry in population-level genomic data; 3) how these novel methods can detect archaic introgression in modern African populations; and 4) the functional consequences of archaic gene variants, including how those variants were co-opted into novel function in modern human populations. The goal of this review is to provide a simple-to-access reference for the relevant methods and novel data, which has changed our understanding of the relationship between our species and its siblings. This body of literature reveals the large degree to which the genetic legacy of these extinct hominins has been integrated into the human populations of today.
Collapse
Affiliation(s)
- K D Ahlquist
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Mayra M Bañuelos
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Alyssa Funk
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Jiaying Lai
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Brown Center for Biomedical Informatics, Brown University, Providence, Rhode Island, USA
| | - Stephen Rong
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, USA
| | - Fernando A Villanea
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Anthropology, University of Colorado Boulder, Colorado, USA
| | - Kelsey E Witt
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, USA
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island, USA
| |
Collapse
|
16
|
Introgression is widespread in the radiation of carnivorous Nepenthes pitcher plants. Mol Phylogenet Evol 2021; 163:107214. [PMID: 34052438 DOI: 10.1016/j.ympev.2021.107214] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Revised: 05/14/2021] [Accepted: 05/25/2021] [Indexed: 11/23/2022]
Abstract
Introgression and hybridization are important processes in plant evolution, but they are difficult to study from a phylogenetic perspective, because they conflict with the bifurcating evolutionary history typically depicted in phylogenetic models. The role of hybridization in plant evolution is best documented in the form of allo-polyploidizations. In contrast, homoploid hybridization and introgression are less explored, although they may be crucial in adaptive radiations. Here we employ genome-wide data (ddRAD-seq, transcriptomes) to investigate the evolutionary history of Nepenthes, a radiation of c. 160 species of iconic carnivorous plants mainly from tropical Asia. Our data indicates that the main radiation is only c. 5 million years old, and confirms previous bifurcating phylogenies. However, due to a greatly expanded number of loci, we were able test for the first time the long-standing hypotheses of introgression and historical hybridization. The genus presents one very clear case of organellar capture between two distantly related but sympatric groups. Furthermore, all Nepenthes species show introgression signals in their nuclear genomes, as uncovered by a general survey of ABBA-BABA-like statistics. The ancestor of the rapid main radiation shows ancestry from two deeply diverged lineages, as indicated by phylogenetic network analyses. All major clades of the main radiation show further introgression both within and between each other, as suggested by admixture graphs. Our study supports the hypothesis that rapid adaptive radiations are hotspots of introgression in the tree of life, and highlights the need to consider non-treelike processes in evolutionary studies of Nepenthes in particular.
Collapse
|
17
|
Torres JB. A history of you, me, and humanity: mitochondrial DNA in anthropological research. AIMS GENETICS 2021. [DOI: 10.3934/genet.2016.2.146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
AbstractWithin genetic anthropology, mitochondrial DNA (mtDNA) has garnered a prominent if not enduring place within the anthropological toolkit. MtDNA has provided new and innovative perspectives on the emergence and dispersal of our species, interactions with extinct human species, and illuminated relationships between human groups. In this paper, I provide a brief overview of the major findings ascertained from mtDNA about human origins, human dispersal across the globe, interactions with other hominin species, and the more recent uses of mtDNA in direct to consumer ancestry tests. Relative to nuclear DNA, mtDNA is a small section of the genome and due to its inheritance pattern provides a limited resolution of population history and an individual's genetic ancestry. Consequently, some scholars dismiss mtDNA as insignificant due to the limited inferences that may be made using the locus. Regardless, mtDNA provides some useful insights to understanding how social, cultural, and environmental factors have shaped patterns of genetic variability. Furthermore, with regard to the experiences of historically marginalized groups, in particular those of African descent throughout the Americas, mtDNA has the potential to fill gaps in knowledge that would otherwise remain unknown. Within anthropological sciences, the value of this locus for understanding human experience is maximized when contextualized with complementary lines of evidence.
Collapse
Affiliation(s)
- Jada Benn Torres
- Laboratory of Genetic Anthropology, Department of Anthropology, Vanderbilt University, Nashville, TN 37325, USA
| |
Collapse
|
18
|
Gopalan S, Atkinson EG, Buck LT, Weaver TD, Henn BM. Inferring archaic introgression from hominin genetic data. Evol Anthropol 2021; 30:199-220. [PMID: 33951239 PMCID: PMC8360192 DOI: 10.1002/evan.21895] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 08/03/2020] [Accepted: 03/29/2021] [Indexed: 01/05/2023]
Abstract
Questions surrounding the timing, extent, and evolutionary consequences of archaic admixture into human populations have a long history in evolutionary anthropology. More recently, advances in human genetics, particularly in the field of ancient DNA, have shed new light on the question of whether or not Homo sapiens interbred with other hominin groups. By the late 1990s, published genetic work had largely concluded that archaic groups made no lasting genetic contribution to modern humans; less than a decade later, this conclusion was reversed following the successful DNA sequencing of an ancient Neanderthal. This reversal of consensus is noteworthy, but the reasoning behind it is not widely understood across all academic communities. There remains a communication gap between population geneticists and paleoanthropologists. In this review, we endeavor to bridge this gap by outlining how technological advancements, new statistical methods, and notable controversies ultimately led to the current consensus.
Collapse
Affiliation(s)
- Shyamalika Gopalan
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, USA
| | - Elizabeth G Atkinson
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital and Stanley Center for Psychiatric Research, Broad Institute, Boston, Massachusetts, USA
| | - Laura T Buck
- Research Centre in Evolutionary Anthropology and Palaeoecology, Liverpool John Moores University, Liverpool, UK
| | - Timothy D Weaver
- Department of Anthropology, University of California, Davis, California, USA
| | - Brenna M Henn
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA.,Department of Anthropology, University of California, Davis, California, USA.,UC Davis Genome Center, University of California, Davis, California, USA
| |
Collapse
|
19
|
Hollfelder N, Breton G, Sjödin P, Jakobsson M. The deep population history in Africa. Hum Mol Genet 2021; 30:R2-R10. [PMID: 33438014 PMCID: PMC8117439 DOI: 10.1093/hmg/ddab005] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/22/2020] [Accepted: 01/05/2021] [Indexed: 12/28/2022] Open
Abstract
Africa is the continent with the greatest genetic diversity among humans and the level of diversity is further enhanced by incorporating non-majority groups, which are often understudied. Many of today's minority populations historically practiced foraging lifestyles, which were the only subsistence strategies prior to the rise of agriculture and pastoralism, but only a few groups practicing these strategies remain today. Genomic investigations of Holocene human remains excavated across the African continent show that the genetic landscape was vastly different compared to today's genetic landscape and that many groups that today are population isolate inhabited larger regions in the past. It is becoming clear that there are periods of isolation among groups and geographic areas, but also genetic contact over large distances throughout human history in Africa. Genomic information from minority populations and from prehistoric remains provide an invaluable source of information on the human past, in particular deep human population history, as Holocene large-scale population movements obscure past patterns of population structure. Here we revisit questions on the nature and time of the radiation of early humans in Africa, the extent of gene-flow among human populations as well as introgression from archaic and extinct lineages on the continent.
Collapse
Affiliation(s)
- Nina Hollfelder
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Gwenna Breton
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Per Sjödin
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Mattias Jakobsson
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
- Palaeo-Research Institute, University of Johannesburg, Physical, Cnr Kingsway & University Roads, Auckland Park, Johannesburg 2092, South Africa
- SciLifeLab, Stockholm and Uppsala, Entrance C11, BMC, Husargatan 3, 752 37 Uppsala, Sweden
| |
Collapse
|
20
|
Bergström A, Stringer C, Hajdinjak M, Scerri EML, Skoglund P. Origins of modern human ancestry. Nature 2021; 590:229-237. [PMID: 33568824 DOI: 10.1038/s41586-021-03244-5] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 12/14/2020] [Indexed: 01/30/2023]
Abstract
New finds in the palaeoanthropological and genomic records have changed our view of the origins of modern human ancestry. Here we review our current understanding of how the ancestry of modern humans around the globe can be traced into the deep past, and which ancestors it passes through during our journey back in time. We identify three key phases that are surrounded by major questions, and which will be at the frontiers of future research. The most recent phase comprises the worldwide expansion of modern humans between 40 and 60 thousand years ago (ka) and their last known contacts with archaic groups such as Neanderthals and Denisovans. The second phase is associated with a broadly construed African origin of modern human diversity between 60 and 300 ka. The oldest phase comprises the complex separation of modern human ancestors from archaic human groups from 0.3 to 1 million years ago. We argue that no specific point in time can currently be identified at which modern human ancestry was confined to a limited birthplace, and that patterns of the first appearance of anatomical or behavioural traits that are used to define Homo sapiens are consistent with a range of evolutionary histories.
Collapse
Affiliation(s)
- Anders Bergström
- Ancient Genomics Laboratory, Francis Crick Institute, London, UK
| | - Chris Stringer
- Department of Earth Sciences, Natural History Museum, London, UK.
| | - Mateja Hajdinjak
- Ancient Genomics Laboratory, Francis Crick Institute, London, UK
| | - Eleanor M L Scerri
- Pan-African Evolution Research Group, Max Planck Institute for Science of Human History, Jena, Germany.,Department of Classics and Archaeology, University of Malta, Msida, Malta.,Institute of Prehistoric Archaeology, University of Cologne, Cologne, Germany
| | - Pontus Skoglund
- Ancient Genomics Laboratory, Francis Crick Institute, London, UK.
| |
Collapse
|
21
|
KOGANEBUCHI KAE, OOTA HIROKI. Paleogenomics of human remains in East Asia and Yaponesia focusing on current advances and future directions. ANTHROPOL SCI 2021. [DOI: 10.1537/ase.2011302] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
- KAE KOGANEBUCHI
- Laboratory of Genome Anthropology, Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo
- Advanced Medical Research Center, Faculty of Medicine, University of the Ryukyus, Nishihara
- Department of Human Biology and Anatomy, Graduate School of Medicine, University of the Ryukyus, Nishihara
| | - HIROKI OOTA
- Laboratory of Genome Anthropology, Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo
| |
Collapse
|
22
|
Mapping gene flow between ancient hominins through demography-aware inference of the ancestral recombination graph. PLoS Genet 2020; 16:e1008895. [PMID: 32760067 PMCID: PMC7410169 DOI: 10.1371/journal.pgen.1008895] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 05/29/2020] [Indexed: 01/09/2023] Open
Abstract
The sequencing of Neanderthal and Denisovan genomes has yielded many new insights about interbreeding events between extinct hominins and the ancestors of modern humans. While much attention has been paid to the relatively recent gene flow from Neanderthals and Denisovans into modern humans, other instances of introgression leave more subtle genomic evidence and have received less attention. Here, we present a major extension of the ARGweaver algorithm, called ARGweaver-D, which can infer local genetic relationships under a user-defined demographic model that includes population splits and migration events. This Bayesian algorithm probabilistically samples ancestral recombination graphs (ARGs) that specify not only tree topologies and branch lengths along the genome, but also indicate migrant lineages. The sampled ARGs can therefore be parsed to produce probabilities of introgression along the genome. We show that this method is well powered to detect the archaic migration into modern humans, even with only a few samples. We then show that the method can also detect introgressed regions stemming from older migration events, or from unsampled populations. We apply it to human, Neanderthal, and Denisovan genomes, looking for signatures of older proposed migration events, including ancient humans into Neanderthal, and unknown archaic hominins into Denisovans. We identify 3% of the Neanderthal genome that is putatively introgressed from ancient humans, and estimate that the gene flow occurred between 200-300kya. We find no convincing evidence that negative selection acted against these regions. Finally, we predict that 1% of the Denisovan genome was introgressed from an unsequenced, but highly diverged, archaic hominin ancestor. About 15% of these “super-archaic” regions—comprising at least about 4Mb—were, in turn, introgressed into modern humans and continue to exist in the genomes of people alive today. We present ARGweaver-D, an extension of the ARGweaver algorithm which can be applied under a user-defined demographic model including population splits and migration events. Given genome sequence data from a collection of individuals across multiple closely related populations or subspecies, ARGweaver-D can infer trees describing the genetic relationships among these individuals at every location along the genome, conditional on the demographic model. Like ARGweaver, ARGweaver-D is a Bayesian method, sampling trees from the posterior distribution in order to account for uncertainty. Using simulations, we show that ARGweaver-D can successfully identify regions introgressed from Neanderthals and Denisovans into modern humans. It is also well-powered to detect introgressed regions stemming from older gene-flow events. We apply ARGweaver-D to the genomes of two Neanderthals, a Denisovan, and two African humans. We identify 3% of the Neanderthal genome which is likely derived from gene flow from ancient humans. We also identify about 1% of the Denisovan genome that may be traced to an unsequenced archaic hominin; 15% of these regions were subsequently passed to modern humans. We find no convincing evidence that selection acted against any of these introgressed regions.
Collapse
|
23
|
Mughal MR, Koch H, Huang J, Chiaromonte F, DeGiorgio M. Learning the properties of adaptive regions with functional data analysis. PLoS Genet 2020; 16:e1008896. [PMID: 32853200 PMCID: PMC7480868 DOI: 10.1371/journal.pgen.1008896] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 09/09/2020] [Accepted: 05/29/2020] [Indexed: 12/12/2022] Open
Abstract
Identifying regions of positive selection in genomic data remains a challenge in population genetics. Most current approaches rely on comparing values of summary statistics calculated in windows. We present an approach termed SURFDAWave, which translates measures of genetic diversity calculated in genomic windows to functional data. By transforming our discrete data points to be outputs of continuous functions defined over genomic space, we are able to learn the features of these functions that signify selection. This enables us to confidently identify complex modes of natural selection, including adaptive introgression. We are also able to predict important selection parameters that are responsible for shaping the inferred selection events. By applying our model to human population-genomic data, we recapitulate previously identified regions of selective sweeps, such as OCA2 in Europeans, and predict that its beneficial mutation reached a frequency of 0.02 before it swept 1,802 generations ago, a time when humans were relatively new to Europe. In addition, we identify BNC2 in Europeans as a target of adaptive introgression, and predict that it harbors a beneficial mutation that arose in an archaic human population that split from modern humans within the hypothesized modern human-Neanderthal divergence range.
Collapse
Affiliation(s)
- Mehreen R. Mughal
- Bioinformatics and Genomics at the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Hillary Koch
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Jinguo Huang
- Bioinformatics and Genomics at the Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Francesca Chiaromonte
- Department of Statistics, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, United States of America
| |
Collapse
|
24
|
Carlson J, DeWitt WS, Harris K. Inferring evolutionary dynamics of mutation rates through the lens of mutation spectrum variation. Curr Opin Genet Dev 2020; 62:50-57. [PMID: 32619789 DOI: 10.1016/j.gde.2020.05.024] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/13/2020] [Accepted: 05/22/2020] [Indexed: 01/04/2023]
Abstract
There are many possible failure points in the transmission of genetic information that can produce heritable germline mutations. Once a mutation has been passed from parents to offspring for several generations, it can be difficult or impossible to identify its root cause; however, sometimes the nature of the ancestral and derived DNA sequences can provide mechanistic clues about a genetic change that happened hundreds or thousands of generations ago. Here, we review evidence that the sequence context 'spectrum' of germline mutagenesis has been evolving surprisingly rapidly over the history of humans and other species. We go on to discuss possible causal factors that might underlie rapid mutation spectrum evolution.
Collapse
Affiliation(s)
- Jedidiah Carlson
- Department of Genome Sciences, Foege Hall, University of Washington, Seattle, WA 98105, United States
| | - William S DeWitt
- Department of Genome Sciences, Foege Hall, University of Washington, Seattle, WA 98105, United States; Computational Biology Program, Fred Hutchinson Cancer Research Center, 1100 Eastlake Ave E, Seattle, WA 98109, United States
| | - Kelley Harris
- Department of Genome Sciences, Foege Hall, University of Washington, Seattle, WA 98105, United States; Computational Biology Program, Fred Hutchinson Cancer Research Center, 1100 Eastlake Ave E, Seattle, WA 98109, United States.
| |
Collapse
|
25
|
VolcanoFinder: Genomic scans for adaptive introgression. PLoS Genet 2020; 16:e1008867. [PMID: 32555579 PMCID: PMC7326285 DOI: 10.1371/journal.pgen.1008867] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Revised: 06/30/2020] [Accepted: 05/18/2020] [Indexed: 12/16/2022] Open
Abstract
Recent research shows that introgression between closely-related species is an important source of adaptive alleles for a wide range of taxa. Typically, detection of adaptive introgression from genomic data relies on comparative analyses that require sequence data from both the recipient and the donor species. However, in many cases, the donor is unknown or the data is not currently available. Here, we introduce a genome-scan method—VolcanoFinder—to detect recent events of adaptive introgression using polymorphism data from the recipient species only. VolcanoFinder detects adaptive introgression sweeps from the pattern of excess intermediate-frequency polymorphism they produce in the flanking region of the genome, a pattern which appears as a volcano-shape in pairwise genetic diversity. Using coalescent theory, we derive analytical predictions for these patterns. Based on these results, we develop a composite-likelihood test to detect signatures of adaptive introgression relative to the genomic background. Simulation results show that VolcanoFinder has high statistical power to detect these signatures, even for older sweeps and for soft sweeps initiated by multiple migrant haplotypes. Finally, we implement VolcanoFinder to detect archaic introgression in European and sub-Saharan African human populations, and uncovered interesting candidates in both populations, such as TSHR in Europeans and TCHH-RPTN in Africans. We discuss their biological implications and provide guidelines for identifying and circumventing artifactual signals during empirical applications of VolcanoFinder. The process by which beneficial alleles are introduced into a species from a closely-related species is termed adaptive introgression. We present an analytically-tractable model for the effects of adaptive introgression on non-adaptive genetic variation in the genomic region surrounding the beneficial allele. The result we describe is a characteristic volcano-shaped pattern of increased variability that arises around the positively-selected site, and we introduce an open-source method VolcanoFinder to detect this signal in genomic data. Importantly, VolcanoFinder is a population-genetic likelihood-based approach, rather than a comparative-genomic approach, and can therefore probe genomic variation data from a single population for footprints of adaptive introgression, even from a priori unknown and possibly extinct donor species.
Collapse
|
26
|
The Impact of Recessive Deleterious Variation on Signals of Adaptive Introgression in Human Populations. Genetics 2020; 215:799-812. [PMID: 32487519 PMCID: PMC7337073 DOI: 10.1534/genetics.120.303081] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 05/01/2020] [Indexed: 12/19/2022] Open
Abstract
Admixture with archaic hominins has altered the landscape of genomic variation in modern human populations. Several gene regions have been identified previously as candidates of adaptive introgression (AI) that facilitated human adaptation to specific environments. However, simulation-based studies have suggested that population genetic processes other than adaptive mutations, such as heterosis from recessive deleterious variants private to populations before admixture, can also lead to patterns in genomic data that resemble AI. The extent to which the presence of deleterious variants affect the false-positive rate and the power of current methods to detect AI has not been fully assessed. Here, we used extensive simulations under parameters relevant for human evolution to show that recessive deleterious mutations can increase the false positive rates of tests for AI compared to models without deleterious variants, especially when the recombination rates are low. We next examined candidates of AI in modern humans identified from previous studies, and show that 24 out of 26 candidate regions remain significant, even when deleterious variants are included in the null model. However, two AI candidate genes, HYAL2 and HLA, are particularly susceptible to high false positive signals of AI due to recessive deleterious mutations. These genes are located in regions of the human genome with high exon density together with low recombination rate, factors that we show increase the rate of false-positives due to recessive deleterious mutations. Although the combination of such parameters is rare in the human genome, caution is warranted in such regions, as well as in other species with more compact genomes and/or lower recombination rates. In sum, our results suggest that recessive deleterious mutations cannot account for the signals of AI in most, but not all, of the top candidates for AI in humans, suggesting they may be genuine signals of adaptation.
Collapse
|
27
|
Sankararaman S. Methods for detecting introgressed archaic sequences. Curr Opin Genet Dev 2020; 62:85-90. [PMID: 32717667 PMCID: PMC7484293 DOI: 10.1016/j.gde.2020.05.026] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 05/12/2020] [Accepted: 05/22/2020] [Indexed: 11/16/2022]
Abstract
Analysis of genome sequences from archaic and modern humans have revealed multiple episodes of admixture between highly-diverged population groups. Statistical methods that attempt to localize DNA segments introduced by these events offer a powerful tool to investigate recent human evolution. We review recent advances in methods for detecting introgressed sequences.
Collapse
Affiliation(s)
- Sriram Sankararaman
- Department of Computer Science, University of California, Los Angeles, CA 90095, United States; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, United States; Department of Computational Medicine, University of California, Los Angeles, CA 90095, United States.
| |
Collapse
|
28
|
Yan SM, McCoy RC. Archaic hominin genomics provides a window into gene expression evolution. Curr Opin Genet Dev 2020; 62:44-49. [PMID: 32615344 PMCID: PMC7483639 DOI: 10.1016/j.gde.2020.05.014] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2020] [Revised: 05/06/2020] [Accepted: 05/08/2020] [Indexed: 02/08/2023]
Abstract
Differences in gene expression are thought to account for most phenotypic differences within and between species. Consequently, gene expression is a powerful lens through which to study divergence between modern humans and our closest evolutionary relatives, the Neanderthals and Denisovans. Such insights complement biological knowledge gleaned from the fossil record, while also revealing general features of the mode and tempo of regulatory evolution. Because of the degradation of ancient RNA, gene expression profiles of archaic hominins must be studied by indirect means. As such, conclusions drawn from these studies are often laden with assumptions about the genetic architecture of gene expression, the complexity of which is increasingly apparent. Despite these challenges, rapid technical and conceptual advances in the fields of ancient genomics, functional genomics, statistical genomics, and genome engineering are revolutionizing understanding of hominin gene expression evolution.
Collapse
Affiliation(s)
- Stephanie M Yan
- Department of Biology, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rajiv C McCoy
- Department of Biology, Johns Hopkins University, Baltimore, MD, 21218, USA.
| |
Collapse
|
29
|
Hsieh P, Vollger MR, Dang V, Porubsky D, Baker C, Cantsilieris S, Hoekzema K, Lewis AP, Munson KM, Sorensen M, Kronenberg ZN, Murali S, Nelson BJ, Chiatante G, Maggiolini FAM, Blanché H, Underwood JG, Antonacci F, Deleuze JF, Eichler EE. Adaptive archaic introgression of copy number variants and the discovery of previously unknown human genes. Science 2020; 366:366/6463/eaax2083. [PMID: 31624180 DOI: 10.1126/science.aax2083] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2019] [Revised: 07/05/2019] [Accepted: 09/12/2019] [Indexed: 01/01/2023]
Abstract
Copy number variants (CNVs) are subject to stronger selective pressure than single-nucleotide variants, but their roles in archaic introgression and adaptation have not been systematically investigated. We show that stratified CNVs are significantly associated with signatures of positive selection in Melanesians and provide evidence for adaptive introgression of large CNVs at chromosomes 16p11.2 and 8p21.3 from Denisovans and Neanderthals, respectively. Using long-read sequence data, we reconstruct the structure and complex evolutionary history of these polymorphisms and show that both encode positively selected genes absent from most human populations. Our results collectively suggest that large CNVs originating in archaic hominins and introgressed into modern humans have played an important role in local population adaptation and represent an insufficiently studied source of large-scale genetic variation.
Collapse
Affiliation(s)
- PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Vy Dang
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Stuart Cantsilieris
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Melanie Sorensen
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Zev N Kronenberg
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Shwetha Murali
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Bradley J Nelson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Giorgia Chiatante
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari, Italy
| | | | - Hélène Blanché
- Fondation Jean Dausset-Centre d'Etude du Polymorphisme Humain, Paris, France
| | - Jason G Underwood
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.,Pacific Biosciences (PacBio) of California, Inc., Menlo Park, CA, USA
| | - Francesca Antonacci
- Dipartimento di Biologia, Università degli Studi di Bari "Aldo Moro," Bari, Italy
| | | | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA. .,Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| |
Collapse
|
30
|
Wang K, Mathieson I, O’Connell J, Schiffels S. Tracking human population structure through time from whole genome sequences. PLoS Genet 2020; 16:e1008552. [PMID: 32150539 PMCID: PMC7082067 DOI: 10.1371/journal.pgen.1008552] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 03/19/2020] [Accepted: 12/04/2019] [Indexed: 11/18/2022] Open
Abstract
The genetic diversity of humans, like many species, has been shaped by a complex pattern of population separations followed by isolation and subsequent admixture. This pattern, reaching at least as far back as the appearance of our species in the paleontological record, has left its traces in our genomes. Reconstructing a population's history from these traces is a challenging problem. Here we present a novel approach based on the Multiple Sequentially Markovian Coalescent (MSMC) to analyze the separation history between populations. Our approach, called MSMC-IM, uses an improved implementation of the MSMC (MSMC2) to estimate coalescence rates within and across pairs of populations, and then fits a continuous Isolation-Migration model to these rates to obtain a time-dependent estimate of gene flow. We show, using simulations, that our method can identify complex demographic scenarios involving post-split admixture or archaic introgression. We apply MSMC-IM to whole genome sequences from 15 worldwide populations, tracking the process of human genetic diversification. We detect traces of extremely deep ancestry between some African populations, with around 1% of ancestry dating to divergences older than a million years ago.
Collapse
Affiliation(s)
- Ke Wang
- Department of Archaeogenetics, Max Planck Institute for the Science of Human History, Jena, Germany
| | - Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Jared O’Connell
- 23andMe Inc., Mountain View, California, United States of America
| | - Stephan Schiffels
- Department of Archaeogenetics, Max Planck Institute for the Science of Human History, Jena, Germany
| |
Collapse
|
31
|
Durvasula A, Sankararaman S. Recovering signals of ghost archaic introgression in African populations. SCIENCE ADVANCES 2020; 6:eaax5097. [PMID: 32095519 PMCID: PMC7015685 DOI: 10.1126/sciadv.aax5097] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 12/03/2019] [Indexed: 05/18/2023]
Abstract
While introgression from Neanderthals and Denisovans has been documented in modern humans outside Africa, the contribution of archaic hominins to the genetic variation of present-day Africans remains poorly understood. We provide complementary lines of evidence for archaic introgression into four West African populations. Our analyses of site frequency spectra indicate that these populations derive 2 to 19% of their genetic ancestry from an archaic population that diverged before the split of Neanderthals and modern humans. Using a method that can identify segments of archaic ancestry without the need for reference archaic genomes, we built genome-wide maps of archaic ancestry in the Yoruba and the Mende populations. Analyses of these maps reveal segments of archaic ancestry at high frequency in these populations that represent potential targets of adaptive introgression. Our results reveal the substantial contribution of archaic ancestry in shaping the gene pool of present-day West African populations.
Collapse
Affiliation(s)
- Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Sriram Sankararaman
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
32
|
Identifying and Interpreting Apparent Neanderthal Ancestry in African Individuals. Cell 2020; 180:677-687.e16. [PMID: 32004458 DOI: 10.1016/j.cell.2020.01.012] [Citation(s) in RCA: 95] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Revised: 11/13/2019] [Accepted: 01/07/2020] [Indexed: 01/27/2023]
Abstract
Admixture has played a prominent role in shaping patterns of human genomic variation, including gene flow with now-extinct hominins like Neanderthals and Denisovans. Here, we describe a novel probabilistic method called IBDmix to identify introgressed hominin sequences, which, unlike existing approaches, does not use a modern reference population. We applied IBDmix to 2,504 individuals from geographically diverse populations to identify and analyze Neanderthal sequences segregating in modern humans. Strikingly, we find that African individuals carry a stronger signal of Neanderthal ancestry than previously thought. We show that this can be explained by genuine Neanderthal ancestry due to migrations back to Africa, predominately from ancestral Europeans, and gene flow into Neanderthals from an early dispersing group of humans out of Africa. Our results refine our understanding of Neanderthal ancestry in African and non-African populations and demonstrate that remnants of Neanderthal genomes survive in every modern human population studied to date.
Collapse
|
33
|
Lipson M, Ribot I, Mallick S, Rohland N, Olalde I, Adamski N, Broomandkhoshbacht N, Lawson AM, López S, Oppenheimer J, Stewardson K, Asombang RN, Bocherens H, Bradman N, Culleton BJ, Cornelissen E, Crevecoeur I, de Maret P, Fomine FLM, Lavachery P, Mindzie CM, Orban R, Sawchuk E, Semal P, Thomas MG, Van Neer W, Veeramah KR, Kennett DJ, Patterson N, Hellenthal G, Lalueza-Fox C, MacEachern S, Prendergast ME, Reich D. Ancient West African foragers in the context of African population history. Nature 2020; 577:665-670. [PMID: 31969706 PMCID: PMC8386425 DOI: 10.1038/s41586-020-1929-1] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Accepted: 11/29/2019] [Indexed: 12/31/2022]
Abstract
Our knowledge of ancient human population structure in sub-Saharan Africa, particularly prior to the advent of food production, remains limited. Here we report genome-wide DNA data from four children-two of whom were buried approximately 8,000 years ago and two 3,000 years ago-from Shum Laka (Cameroon), one of the earliest known archaeological sites within the probable homeland of the Bantu language group1-11. One individual carried the deeply divergent Y chromosome haplogroup A00, which today is found almost exclusively in the same region12,13. However, the genome-wide ancestry profiles of all four individuals are most similar to those of present-day hunter-gatherers from western Central Africa, which implies that populations in western Cameroon today-as well as speakers of Bantu languages from across the continent-are not descended substantially from the population represented by these four people. We infer an Africa-wide phylogeny that features widespread admixture and three prominent radiations, including one that gave rise to at least four major lineages deep in the history of modern humans.
Collapse
Affiliation(s)
- Mark Lipson
- Department of Genetics, Harvard Medical School, Boston, MA, USA.
| | - Isabelle Ribot
- Département d'Anthropologie, Université de Montréal, Montreal, Quebec, Canada
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nadin Rohland
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Iñigo Olalde
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Institute of Evolutionary Biology (CSIC-UPF), Barcelona, Spain
| | - Nicole Adamski
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nasreen Broomandkhoshbacht
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Anthropology, University of California, Santa Cruz, CA, USA
| | - Ann Marie Lawson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Saioa López
- UCL Genetics Institute, University College London, London, UK
| | - Jonas Oppenheimer
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Kristin Stewardson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | | | - Hervé Bocherens
- Department of Geosciences, Biogeology, University of Tübingen, Tübingen, Germany
- Senckenberg Research Centre for Human Evolution and Palaeoenvironment, University of Tübingen, Tübingen, Germany
| | - Neil Bradman
- UCL Genetics Institute, University College London, London, UK
- The Henry Stewart Group, London, UK
| | - Brendan J Culleton
- Institutes of Energy and the Environment, Pennsylvania State University, University Park, PA, USA
| | - Els Cornelissen
- Department of Cultural Anthropology and History, Royal Museum for Central Africa, Tervuren, Belgium
| | | | - Pierre de Maret
- Faculté de Philosophie et Sciences Sociales, Université Libre de Bruxelles, Brussels, Belgium
| | | | - Philippe Lavachery
- Agence Wallonne du Patrimoine, Service Public de Wallonie, Namur, Belgium
| | | | - Rosine Orban
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
| | - Elizabeth Sawchuk
- Department of Anthropology, Stony Brook University, Stony Brook, NY, USA
| | - Patrick Semal
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
| | - Mark G Thomas
- UCL Genetics Institute, University College London, London, UK
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Wim Van Neer
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
- Department of Biology, University of Leuven, Leuven, Belgium
| | - Krishna R Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
| | - Douglas J Kennett
- Department of Anthropology, University of California, Santa Barbara, CA, USA
| | - Nick Patterson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Garrett Hellenthal
- UCL Genetics Institute, University College London, London, UK
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | | | - Scott MacEachern
- Division of Social Science, Duke Kunshan University, Kunshan, China
| | - Mary E Prendergast
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Sociology and Anthropology, Saint Louis University, Madrid, Spain
| | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| |
Collapse
|
34
|
Identification of African-Specific Admixture between Modern and Archaic Humans. Am J Hum Genet 2019; 105:1254-1261. [PMID: 31809748 DOI: 10.1016/j.ajhg.2019.11.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2018] [Accepted: 11/03/2019] [Indexed: 11/21/2022] Open
Abstract
Recent work has demonstrated that two archaic human groups (Neanderthals and Denisovans) interbred with modern humans and contributed to the contemporary human gene pool. These findings relied on the availability of high-coverage genomes from both Neanderthals and Denisovans. Here we search for evidence of archaic admixture from a worldwide panel of 1,667 individuals using an approach that does not require the presence of an archaic human reference genome. We find no evidence for archaic admixture in the Andaman Islands, as previously claimed, or on the island of Flores, where Homo floresiensis fossils have been found. However, we do find evidence for at least one archaic admixture event in sub-Saharan Africa, with the strongest signal in Khoesan and Pygmy individuals from Southern and Central Africa. The locations of these putative archaic admixture tracts are weighted against functional regions of the genome, consistent with the long-term effects of purifying selection against introgressed genetic material.
Collapse
|
35
|
Gokcumen O. Archaic hominin introgression into modern human genomes. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2019; 171 Suppl 70:60-73. [PMID: 31702050 DOI: 10.1002/ajpa.23951] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 10/04/2019] [Accepted: 10/08/2019] [Indexed: 01/01/2023]
Abstract
Ancient genomes from multiple Neanderthal and the Denisovan individuals, along with DNA sequence data from diverse contemporary human populations strongly support the prevalence of gene flow among different hominins. Recent studies now provide evidence for multiple gene flow events that leave genetic signatures in extant and ancient human populations. These events include older gene flow from an unknown hominin in Africa predating out-of-Africa migrations, and in the last 50,000-100,000 years, multiple gene flow events from Neanderthals into ancestral Eurasian human populations, and at least three distinct introgression events from a lineage close to Denisovans into ancestors of extant Southeast Asian and Oceanic populations. Some of these introgression events may have happened as late as 20,000 years before present and reshaped the way in which we think about human evolution. In this review, I aim to answer anthropologically relevant questions with regard to recent research on ancient hominin introgression in the human lineage. How have genomic data from archaic hominins changed our view of human evolution? Is there any doubt about whether introgression from ancient hominins to the ancestors of present-day humans occurred? What is the current view of human evolutionary history from the genomics perspective? What is the impact of introgression on human phenotypes?
Collapse
Affiliation(s)
- Omer Gokcumen
- Department of Biological Sciences, North Campus, University at Buffalo, Buffalo, New York
| |
Collapse
|
36
|
Deng L, Zhang C, Yuan K, Gao Y, Pan Y, Ge X, He Y, Yuan Y, Lu Y, Zhang X, Chen H, Lou H, Wang X, Lu D, Liu J, Tian L, Feng Q, Khan A, Yang Y, Jin ZB, Yang J, Lu F, Qu J, Kang L, Su B, Xu S. Prioritizing natural-selection signals from the deep-sequencing genomic data suggests multi-variant adaptation in Tibetan highlanders. Natl Sci Rev 2019; 6:1201-1222. [PMID: 34691999 PMCID: PMC8291452 DOI: 10.1093/nsr/nwz108] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 06/17/2019] [Accepted: 06/18/2019] [Indexed: 12/13/2022] Open
Abstract
Human genetic adaptation to high altitudes (>2500 m) has been extensively studied over the last few years, but few functional adaptive genetic variants have been identified, largely owing to the lack of deep-genome sequencing data available to previous studies. Here, we build a list of putative adaptive variants, including 63 missense, 7 loss-of-function, 1,298 evolutionarily conserved variants and 509 expression quantitative traits loci. Notably, the top signal of selection is located in TMEM247, a transmembrane protein-coding gene. The Tibetan version of TMEM247 harbors one high-frequency (76.3%) missense variant, rs116983452 (c.248C > T; p.Ala83Val), with the T allele derived from archaic ancestry and carried by >94% of Tibetans but absent or in low frequencies (<3%) in non-Tibetan populations. The rs116983452-T is strongly and positively correlated with altitude and significantly associated with reduced hemoglobin concentration (p = 5.78 × 10-5), red blood cell count (p = 5.72 × 10-7) and hematocrit (p = 2.57 × 10-6). In particular, TMEM247-rs116983452 shows greater effect size and better predicts the phenotypic outcome than any EPAS1 variants in association with adaptive traits in Tibetans. Modeling the interaction between TMEM247-rs116983452 and EPAS1 variants indicates weak but statistically significant epistatic effects. Our results support that multiple variants may jointly deliver the fitness of the Tibetans on the plateau, where a complex model is needed to elucidate the adaptive evolution mechanism.
Collapse
Affiliation(s)
- Lian Deng
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Chao Zhang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Kai Yuan
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yang Gao
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Yuwen Pan
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xueling Ge
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yaoxi He
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Yuan Yuan
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yan Lu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xiaoxi Zhang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Hao Chen
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Haiyi Lou
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Xiaoji Wang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Dongsheng Lu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Jiaojiao Liu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Lei Tian
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Qidi Feng
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Asifullah Khan
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yajun Yang
- State Key Laboratory of Genetic Engineering and Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Zi-Bing Jin
- The Eye Hospital, School of Ophthalmology & Optometry, Wenzhou Medical University, China National Center for International Research in Regenerative Medicine and Neurogenetics, State Key Laboratory of Ophthalmology, Optometry and Visual Science, Wenzhou 325027, China
| | - Jian Yang
- The Eye Hospital, School of Ophthalmology & Optometry, Wenzhou Medical University, China National Center for International Research in Regenerative Medicine and Neurogenetics, State Key Laboratory of Ophthalmology, Optometry and Visual Science, Wenzhou 325027, China
- Institute for Molecular Bioscience, Queensland Brain Institute, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Fan Lu
- The Eye Hospital, School of Ophthalmology & Optometry, Wenzhou Medical University, China National Center for International Research in Regenerative Medicine and Neurogenetics, State Key Laboratory of Ophthalmology, Optometry and Visual Science, Wenzhou 325027, China
| | - Jia Qu
- The Eye Hospital, School of Ophthalmology & Optometry, Wenzhou Medical University, China National Center for International Research in Regenerative Medicine and Neurogenetics, State Key Laboratory of Ophthalmology, Optometry and Visual Science, Wenzhou 325027, China
| | - Longli Kang
- Key Laboratory for Molecular Genetic Mechanisms and Intervention Research on High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang 712082, China
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| | - Shuhua Xu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nu-trition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
- Collaborative Innovation Center of Genetics and Development, Shanghai 200438, China
- Human Phenome Institute, Fudan University, Shanghai 201203, China
| |
Collapse
|
37
|
Abstract
The dispersal of anatomically modern human populations out of Africa and across much of the rest of the world around 55 to 50 thousand years before present (ka) is recorded genetically by the multiple hominin groups they met and interbred with along the way, including the Neandertals and Denisovans. The signatures of these introgression events remain preserved in the genomes of modern-day populations, and provide a powerful record of the sequence and timing of these early migrations, with Asia proving a particularly complex area. At least 3 different hominin groups appear to have been involved in Asia, of which only the Denisovans are currently known. Several interbreeding events are inferred to have taken place east of Wallace's Line, consistent with archaeological evidence of widespread and early hominin presence in the area. However, archaeological and fossil evidence indicates archaic hominins had not spread as far as the Sahul continent (New Guinea, Australia, and Tasmania), where recent genetic evidence remains enigmatic.
Collapse
|
38
|
Abstract
Context: Africa's role in the narrative of human evolution is indisputably emphasised in the emergence of Homo sapiens. However, once humans dispersed beyond Africa, the history of those who stayed remains vastly under-studied, lacking the proper attention the birthplace of both modern and archaic humans deserves. The sequencing of Neanderthal and Denisovan genomes has elucidated evidence of admixture between archaic and modern humans outside of Africa, but has not aided efforts in answering whether archaic admixture happened within Africa. Objectives: This article reviews the state of research for archaic introgression in African populations and discusses recent insights into this topic. Methods: Gathering published sources and recently released preprints, this review reports on the different methods developed for detecting archaic introgression. Particularly it discusses how relevant these are when implemented on African populations and what findings these studies have shown so far. Results: Methods for detecting archaic introgression have been predominantly developed and implemented on non-African populations. Recent preprints present new methods considering African populations. While a number of studies using these methods suggest archaic introgression in Africa, without an African archaic genome to validate these results, such findings remain as putative archaic introgression. Conclusion: In light of the caveats with implementing current archaic introgression detection methods in Africa, we recommend future studies to concentrate on unravelling the complicated demographic history of Africa through means of ancient DNA where possible and through more focused efforts to sequence modern DNA from more representative populations across the African continent.
Collapse
Affiliation(s)
- Cindy Santander
- a Department of Zoology , University of Oxford , Oxford , UK
| | - Francesco Montinaro
- a Department of Zoology , University of Oxford , Oxford , UK.,b Estonian Biocentre , University of Tartu , Tartu , Estonia
| | | |
Collapse
|
39
|
Ragsdale AP, Gravel S. Models of archaic admixture and recent history from two-locus statistics. PLoS Genet 2019; 15:e1008204. [PMID: 31181058 PMCID: PMC6586359 DOI: 10.1371/journal.pgen.1008204] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 06/20/2019] [Accepted: 05/17/2019] [Indexed: 11/18/2022] Open
Abstract
We learn about population history and underlying evolutionary biology through patterns of genetic polymorphism. Many approaches to reconstruct evolutionary histories focus on a limited number of informative statistics describing distributions of allele frequencies or patterns of linkage disequilibrium. We show that many commonly used statistics are part of a broad family of two-locus moments whose expectation can be computed jointly and rapidly under a wide range of scenarios, including complex multi-population demographies with continuous migration and admixture events. A full inspection of these statistics reveals that widely used models of human history fail to predict simple patterns of linkage disequilibrium. To jointly capture the information contained in classical and novel statistics, we implemented a tractable likelihood-based inference framework for demographic history. Using this approach, we show that human evolutionary models that include archaic admixture in Africa, Asia, and Europe provide a much better description of patterns of genetic diversity across the human genome. We estimate that an unidentified, deeply diverged population admixed with modern humans within Africa both before and after the split of African and Eurasian populations, contributing 4 - 8% genetic ancestry to individuals in world-wide populations.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Human Genetics, McGill University, Montreal, QC, Canada
| | - Simon Gravel
- Department of Human Genetics, McGill University, Montreal, QC, Canada
| |
Collapse
|
40
|
Durvasula A, Sankararaman S. A statistical model for reference-free inference of archaic local ancestry. PLoS Genet 2019; 15:e1008175. [PMID: 31136573 PMCID: PMC6555542 DOI: 10.1371/journal.pgen.1008175] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 06/07/2019] [Accepted: 05/03/2019] [Indexed: 01/01/2023] Open
Abstract
Statistical analyses of genomic data from diverse human populations have demonstrated that archaic hominins, such as Neanderthals and Denisovans, interbred or admixed with the ancestors of present-day humans. Central to these analyses are methods for inferring archaic ancestry along the genomes of present-day individuals (archaic local ancestry). Methods for archaic local ancestry inference rely on the availability of reference genomes from the ancestral archaic populations for accurate inference. However, several instances of archaic admixture lack reference archaic genomes, making it difficult to characterize these events. We present a statistical method that combines diverse population genetic summary statistics to infer archaic local ancestry without access to an archaic reference genome. We validate the accuracy and robustness of our method in simulations. When applied to genomes of European individuals, our method recovers segments that are substantially enriched for Neanderthal ancestry, even though our method did not have access to any Neanderthal reference genomes.
Collapse
Affiliation(s)
- Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
| | - Sriram Sankararaman
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California
- Department of Computer Science, University of California, Los Angeles, Los Angeles, California
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, California
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, California
| |
Collapse
|
41
|
Jacobs GS, Hudjashov G, Saag L, Kusuma P, Darusallam CC, Lawson DJ, Mondal M, Pagani L, Ricaut FX, Stoneking M, Metspalu M, Sudoyo H, Lansing JS, Cox MP. Multiple Deeply Divergent Denisovan Ancestries in Papuans. Cell 2019; 177:1010-1021.e32. [DOI: 10.1016/j.cell.2019.02.035] [Citation(s) in RCA: 137] [Impact Index Per Article: 27.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 01/07/2019] [Accepted: 02/21/2019] [Indexed: 12/29/2022]
|
42
|
Ancient admixture from an extinct ape lineage into bonobos. Nat Ecol Evol 2019; 3:957-965. [PMID: 31036897 DOI: 10.1038/s41559-019-0881-7] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 03/21/2019] [Indexed: 01/28/2023]
Abstract
Admixture is a recurrent phenomenon in humans and other great ape populations. Genetic information from extinct hominins allows us to study historical interactions with modern humans and discover adaptive functions of gene flow. Here, we investigate whole genomes from bonobo and chimpanzee populations for signatures of gene flow from unknown archaic populations, finding evidence for an ancient admixture event between bonobos and a divergent lineage. This result reveals a complex population history in our closest living relatives, probably several hundred thousand years ago. We reconstruct up to 4.8% of the genome of this 'ghost' ape, which represents genomic data of an extinct great ape population. Genes contained in archaic fragments might confer functional consequences for the immunity, behaviour and physiology of bonobos. Finally, comparing the landscapes of introgressed regions in humans and bonobos, we find that a recurrent depletion of introgression is rare, suggesting that genomic incompatibilities arose seldom in these lineages.
Collapse
|
43
|
Lorente-Galdos B, Lao O, Serra-Vidal G, Santpere G, Kuderna LFK, Arauna LR, Fadhlaoui-Zid K, Pimenoff VN, Soodyall H, Zalloua P, Marques-Bonet T, Comas D. Whole-genome sequence analysis of a Pan African set of samples reveals archaic gene flow from an extinct basal population of modern humans into sub-Saharan populations. Genome Biol 2019; 20:77. [PMID: 31023378 PMCID: PMC6485163 DOI: 10.1186/s13059-019-1684-5] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 03/28/2019] [Indexed: 12/30/2022] Open
Abstract
Background Population demography and gene flow among African groups, as well as the putative archaic introgression of ancient hominins, have been poorly explored at the genome level. Results Here, we examine 15 African populations covering all major continental linguistic groups, ecosystems, and lifestyles within Africa through analysis of whole-genome sequence data of 21 individuals sequenced at deep coverage. We observe a remarkable correlation among genetic diversity and geographic distance, with the hunter-gatherer groups being more genetically differentiated and having larger effective population sizes throughout most modern-human history. Admixture signals are found between neighbor populations from both hunter-gatherer and agriculturalists groups, whereas North African individuals are closely related to Eurasian populations. Regarding archaic gene flow, we test six complex demographic models that consider recent admixture as well as archaic introgression. We identify the fingerprint of an archaic introgression event in the sub-Saharan populations included in the models (~ 4.0% in Khoisan, ~ 4.3% in Mbuti Pygmies, and ~ 5.8% in Mandenka) from an early divergent and currently extinct ghost modern human lineage. Conclusion The present study represents an in-depth genomic analysis of a Pan African set of individuals, which emphasizes their complex relationships and demographic history at population level. Electronic supplementary material The online version of this article (10.1186/s13059-019-1684-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Belen Lorente-Galdos
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain.,Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
| | - Oscar Lao
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, 08028, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Gerard Serra-Vidal
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Gabriel Santpere
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain.,Department of Neuroscience, Yale School of Medicine, New Haven, CT, USA
| | - Lukas F K Kuderna
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Lara R Arauna
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Karima Fadhlaoui-Zid
- College of Science, Department of Biology, Taibah University, Al Madinah, Al Monawarah, Saudi Arabia.,Higher Institute of Biotechnology of Beja, University of Jendouba, Avenue Habib Bourguiba, BP, 382, 9000, Beja, Tunisia
| | - Ville N Pimenoff
- Oncology Data Analytics Program, Bellvitge Biomedical Research Institute (ICO-IDIBELL), Consortium for Biomedical Research in Epidemiology and Public Health, Hospitalet de Llobregat, Barcelona, Spain.,Department of Archaeology, University of Helsinki, Helsinki, Finland
| | - Himla Soodyall
- Division of Human Genetics, School of Pathology, Faculty of Health Sciences, University of the Witwatersrand and National Health Laboratory Service, Johannesburg, South Africa
| | - Pierre Zalloua
- School of Medicine, The Lebanese American University, Beirut, 1102-2801, Lebanon
| | - Tomas Marques-Bonet
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, 08028, Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats, ICREA, 08003, Barcelona, Spain
| | - David Comas
- Departament de Ciències Experimentals i de la Salut, Institut de Biologia Evolutiva (UPF/CSIC), Universitat Pompeu Fabra, 08003, Barcelona, Spain.
| |
Collapse
|
44
|
Beichman AC, Huerta-Sanchez E, Lohmueller KE. Using Genomic Data to Infer Historic Population Dynamics of Nonmodel Organisms. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2018. [DOI: 10.1146/annurev-ecolsys-110617-062431] [Citation(s) in RCA: 89] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genome sequence data are now being routinely obtained from many nonmodel organisms. These data contain a wealth of information about the demographic history of the populations from which they originate. Many sophisticated statistical inference procedures have been developed to infer the demographic history of populations from this type of genomic data. In this review, we discuss the different statistical methods available for inference of demography, providing an overview of the underlying theory and logic behind each approach. We also discuss the types of data required and the pros and cons of each method. We then discuss how these methods have been applied to a variety of nonmodel organisms. We conclude by presenting some recommendations for researchers looking to use genomic data to infer demographic history.
Collapse
Affiliation(s)
- Annabel C. Beichman
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095, USA
| | - Emilia Huerta-Sanchez
- Department of Molecular and Cell Biology, University of California, Merced, California 95343, USA
- Current affiliation: Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island 02912, USA
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095, USA
- Interdepartmental Program in Bioinformatics and Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| |
Collapse
|
45
|
Hey J, Chung Y, Sethuraman A, Lachance J, Tishkoff S, Sousa VC, Wang Y. Phylogeny Estimation by Integration over Isolation with Migration Models. Mol Biol Evol 2018; 35:2805-2818. [PMID: 30137463 PMCID: PMC6231491 DOI: 10.1093/molbev/msy162] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Phylogeny estimation is difficult for closely related populations and species, especially if they have been exchanging genes. We present a hierarchical Bayesian, Markov-chain Monte Carlo method with a state space that includes all possible phylogenies in a full Isolation-with-Migration model framework. The method is based on a new type of genealogy augmentation called a "hidden genealogy" that enables efficient updating of the phylogeny. This is the first likelihood-based method to fully incorporate directional gene flow and genetic drift for estimation of a species or population phylogeny. Application to human hunter-gatherer populations from Africa revealed a clear phylogenetic history, with strong support for gene exchange with an unsampled ghost population, and relatively ancient divergence between a ghost population and modern human populations, consistent with human/archaic divergence. In contrast, a study of five chimpanzee populations reveals a clear phylogeny with several pairs of populations having exchanged DNA, but does not support a history with an unsampled ghost population.
Collapse
Affiliation(s)
- Jody Hey
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA
| | - Yujin Chung
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA
- The Department of Applied Statistics, Kyonggi University, Suwon, South Korea
| | - Arun Sethuraman
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA
| | - Joseph Lachance
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
- Georgia Institute of Technology, Atlanta, GA
| | - Sarah Tishkoff
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Vitor C Sousa
- Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ
- University of Lisbon, Lisboa, Portugal
| | - Yong Wang
- Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ
- Ancestry, San Francisco, CA
| |
Collapse
|
46
|
Kim BY, Wei X, Fitz-Gibbon S, Lohmueller KE, Ortego J, Gugger PF, Sork VL. RADseq data reveal ancient, but not pervasive, introgression between Californian tree and scrub oak species (Quercussect.Quercus: Fagaceae). Mol Ecol 2018; 27:4556-4571. [DOI: 10.1111/mec.14869] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2017] [Revised: 07/25/2018] [Accepted: 08/29/2018] [Indexed: 12/24/2022]
Affiliation(s)
- Bernard Y. Kim
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
| | - Xinzeng Wei
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
- Key Laboratory of Aquatic Botany and Watershed Ecology; Wuhan Botanical Garden; Chinese Academy of Sciences; Wuhan Hubei China
| | - Sorel Fitz-Gibbon
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
- Department of Human Genetics; David Geffen School of Medicine; University of California; Los Angeles California
| | - Joaquín Ortego
- Department of Integrative Ecology; Estación Biológica de Doñana, EBD-CSIC; Seville Spain
| | - Paul F. Gugger
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
- Appalachian Laboratory; University of Maryland Center for Environmental Science; Frostburg Maryland
| | - Victoria L. Sork
- Department of Ecology and Evolutionary Biology; University of California at Los Angeles; Los Angeles California
- Institute of the Environment and Sustainability; University of California; Los Angeles California
| |
Collapse
|
47
|
Steinrücken M, Spence JP, Kamm JA, Wieczorek E, Song YS. Model-based detection and analysis of introgressed Neanderthal ancestry in modern humans. Mol Ecol 2018; 27:3873-3888. [PMID: 29603507 PMCID: PMC6165692 DOI: 10.1111/mec.14565] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 02/16/2018] [Accepted: 03/06/2018] [Indexed: 01/03/2023]
Abstract
Genetic evidence has revealed that the ancestors of modern human populations outside Africa and their hominin sister groups, notably Neanderthals, exchanged genetic material in the past. The distribution of these introgressed sequence tracts along modern-day human genomes provides insight into the selective forces acting on them and the role of introgression in the evolutionary history of hominins. Studying introgression patterns on the X-chromosome is of particular interest, as sex chromosomes are thought to play a special role in speciation. Recent studies have developed methods to localize introgressed ancestries, reporting long regions that are depleted of Neanderthal introgression and enriched in genes, suggesting negative selection against the Neanderthal variants. On the other hand, enriched Neanderthal ancestry in hair- and skin-related genes suggests that some introgressed variants facilitated adaptation to new environments. Here, we present a model-based introgression detection method called dical-admix. We demonstrate its efficiency and accuracy through extensive simulations and apply it to detect tracts of Neanderthal introgression in modern human individuals from the 1000 Genomes Project. Our findings are largely concordant with previous studies, consistent with weak selection against Neanderthal ancestry. We find evidence that selection against Neanderthal ancestry was due to higher genetic load in Neanderthals resulting from small effective population size, rather than widespread Dobzhansky-Müller incompatibilities (DMIs) that could contribute to reproductive isolation. Moreover, we confirm the previously reported low level of introgression on the X-chromosome, but find little evidence that DMIs contributed to this pattern.
Collapse
Affiliation(s)
- Matthias Steinrücken
- Department of Ecology and Evolution, University of Chicago
- Department of Biostatistics and Epidemiology, University of
Massachusetts, Amherst
- Department of EECS, University of California, Berkeley
| | - Jeffrey P. Spence
- Computational Biology Graduate Group, University of California,
Berkeley
| | - John A. Kamm
- Department of Statistics, University of California, Berkeley
| | | | - Yun S. Song
- Department of EECS, University of California, Berkeley
- Department of Statistics, University of California, Berkeley
- Chan Zuckerberg Biohub, San Francisco
| |
Collapse
|
48
|
Skov L, Hui R, Shchur V, Hobolth A, Scally A, Schierup MH, Durbin R. Detecting archaic introgression using an unadmixed outgroup. PLoS Genet 2018; 14:e1007641. [PMID: 30226838 PMCID: PMC6161914 DOI: 10.1371/journal.pgen.1007641] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2018] [Revised: 09/28/2018] [Accepted: 08/17/2018] [Indexed: 12/24/2022] Open
Abstract
Human populations outside of Africa have experienced at least two bouts of introgression from archaic humans, from Neanderthals and Denisovans. In Papuans there is prior evidence of both these introgressions. Here we present a new approach to detect segments of individual genomes of archaic origin without using an archaic reference genome. The approach is based on a hidden Markov model that identifies genomic regions with a high density of single nucleotide variants (SNVs) not seen in unadmixed populations. We show using simulations that this provides a powerful approach to identifying segments of archaic introgression with a low rate of false detection, given data from a suitable outgroup population is available, without the archaic introgression but containing a majority of the variation that arose since initial separation from the archaic lineage. Furthermore our approach is able to infer admixture proportions and the times both of admixture and of initial divergence between the human and archaic populations. We apply the model to detect archaic introgression in 89 Papuans and show how the identified segments can be assigned to likely Neanderthal or Denisovan origin. We report more Denisovan admixture than previous studies and find a shift in size distribution of fragments of Neanderthal and Denisovan origin that is compatible with a difference in admixture time. Furthermore, we identify small amounts of Denisova ancestry in South East Asians and South Asians. The genetic history of present-day individuals includes episodes of mating between divergent groups, which have led to 'introgressed' genetic material persisting in modern genome sequences. Perhaps the most notable examples of such events in humans are the introgressions from Neanderthals into non-Africans 50,000 or so years ago, and from a related archaic group known as Denisovans into the ancestors of indigenous people from Papua-New Guinea and Australia. Methods to identify introgressions and the genomic regions that derive from them generally involve the use of reference genome sequences for the source populations. However, there are advantages in having methods independent of reference sequences, both to reduce bias and to detect possible introgression from groups for which we currently lack a reference genome. In this paper we describe such an approach, in a statistical framework which exploits the fact that introgressed regions will contain a high density of genetic variants that are private to the group receiving the divergent material. We apply this method to 89 Papuan genome sequences, estimating times of introgression and initial divergence between archaic and modern humans, and compare it to other related methods.
Collapse
Affiliation(s)
- Laurits Skov
- Bioinformatics Research Centre, Aarhus University, Aarhus C., Denmark
- * E-mail: (LS); (RD)
| | - Ruoyun Hui
- Department of Genetics, University of Cambridge, Cambridge United Kingdom
| | - Vladimir Shchur
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Asger Hobolth
- Bioinformatics Research Centre, Aarhus University, Aarhus C., Denmark
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge United Kingdom
| | | | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge United Kingdom
- Wellcome Sanger Institute, Hinxton, Cambridge, United Kingdom
- * E-mail: (LS); (RD)
| |
Collapse
|
49
|
Schlebusch CM, Jakobsson M. Tales of Human Migration, Admixture, and Selection in Africa. Annu Rev Genomics Hum Genet 2018; 19:405-428. [DOI: 10.1146/annurev-genom-083117-021759] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the last three decades, genetic studies have played an increasingly important role in exploring human history. They have helped to conclusively establish that anatomically modern humans first appeared in Africa roughly 250,000–350,000 years before present and subsequently migrated to other parts of the world. The history of humans in Africa is complex and includes demographic events that influenced patterns of genetic variation across the continent. Through genetic studies, it has become evident that deep African population history is captured by relationships among African hunter–gatherers, as the world's deepest population divergences occur among these groups, and that the deepest population divergence dates to 300,000 years before present. However, the spread of pastoralism and agriculture in the last few thousand years has shaped the geographic distribution of present-day Africans and their genetic diversity. With today's sequencing technologies, we can obtain full genome sequences from diverse sets of extant and prehistoric Africans. The coming years will contribute exciting new insights toward deciphering human evolutionary history in Africa.
Collapse
Affiliation(s)
- Carina M. Schlebusch
- Human Evolution, Department of Organismal Biology, Uppsala University, SE-752 36 Uppsala, Sweden;,
- Centre for Anthropological Research and Department of Anthropology and Development Studies, University of Johannesburg, 2006 Johannesburg, South Africa
- SciLifeLab, SE-751 23 Uppsala, Sweden
| | - Mattias Jakobsson
- Human Evolution, Department of Organismal Biology, Uppsala University, SE-752 36 Uppsala, Sweden;,
- Centre for Anthropological Research and Department of Anthropology and Development Studies, University of Johannesburg, 2006 Johannesburg, South Africa
- SciLifeLab, SE-751 23 Uppsala, Sweden
| |
Collapse
|
50
|
Tucci S, Vohr SH, McCoy RC, Vernot B, Robinson MR, Barbieri C, Nelson BJ, Fu W, Purnomo GA, Sudoyo H, Eichler EE, Barbujani G, Visscher PM, Akey JM, Green RE. Evolutionary history and adaptation of a human pygmy population of Flores Island, Indonesia. Science 2018; 361:511-516. [PMID: 30072539 PMCID: PMC6709593 DOI: 10.1126/science.aar8486] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 06/22/2018] [Indexed: 12/21/2022]
Abstract
Flores Island, Indonesia, was inhabited by the small-bodied hominin species Homo floresiensis, which has an unknown evolutionary relationship to modern humans. This island is also home to an extant human pygmy population. Here we describe genome-scale single-nucleotide polymorphism data and whole-genome sequences from a contemporary human pygmy population living on Flores near the cave where H. floresiensis was found. The genomes of Flores pygmies reveal a complex history of admixture with Denisovans and Neanderthals but no evidence for gene flow with other archaic hominins. Modern individuals bear the signatures of recent positive selection encompassing the FADS (fatty acid desaturase) gene cluster, likely related to diet, and polygenic selection acting on standing variation that contributed to their short-stature phenotype. Thus, multiple independent instances of hominin insular dwarfism occurred on Flores.
Collapse
Affiliation(s)
- Serena Tucci
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute, Princeton University, Princeton, NJ, USA
- Department of Life Sciences and Biotechnologies, University of Ferrara, Ferrara, Italy
| | - Samuel H Vohr
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Rajiv C McCoy
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute, Princeton University, Princeton, NJ, USA
| | - Benjamin Vernot
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Matthew R Robinson
- Department of Computational Biology, Génopode, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Génopode, Quatier Sorge, Lausanne, Switzerland
| | - Chiara Barbieri
- Department of Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Jena, Germany
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Switzerland
| | - Brad J Nelson
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Wenqing Fu
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Gludhug A Purnomo
- Genome Diversity and Diseases Laboratory, Eijkman Institute for Molecular Biology, Jakarta, Indonesia
| | - Herawati Sudoyo
- Genome Diversity and Diseases Laboratory, Eijkman Institute for Molecular Biology, Jakarta, Indonesia
- Department of Medical Biology, Faculty of Medicine, University of Indonesia, Jakarta, Indonesia
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Guido Barbujani
- Department of Life Sciences and Biotechnologies, University of Ferrara, Ferrara, Italy
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Joshua M Akey
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA.
- Lewis-Sigler Institute, Princeton University, Princeton, NJ, USA
| | - Richard E Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA.
| |
Collapse
|