1
|
Akbari A, Barton AR, Gazal S, Li Z, Kariminejad M, Perry A, Zeng Y, Mittnik A, Patterson N, Mah M, Zhou X, Price AL, Lander ES, Pinhasi R, Rohland N, Mallick S, Reich D. Pervasive findings of directional selection realize the promise of ancient DNA to elucidate human adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.14.613021. [PMID: 39314480 PMCID: PMC11419161 DOI: 10.1101/2024.09.14.613021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
We present a method for detecting evidence of natural selection in ancient DNA time-series data that leverages an opportunity not utilized in previous scans: testing for a consistent trend in allele frequency change over time. By applying this to 8433 West Eurasians who lived over the past 14000 years and 6510 contemporary people, we find an order of magnitude more genome-wide significant signals than previous studies: 347 independent loci with >99% probability of selection. Previous work showed that classic hard sweeps driving advantageous mutations to fixation have been rare over the broad span of human evolution, but in the last ten millennia, many hundreds of alleles have been affected by strong directional selection. Discoveries include an increase from ~0% to ~20% in 4000 years for the major risk factor for celiac disease at HLA-DQB1; a rise from ~0% to ~8% in 6000 years of blood type B; and fluctuating selection at the TYK2 tuberculosis risk allele rising from ~2% to ~9% from ~5500 to ~3000 years ago before dropping to ~3%. We identify instances of coordinated selection on alleles affecting the same trait, with the polygenic score today predictive of body fat percentage decreasing by around a standard deviation over ten millennia, consistent with the "Thrifty Gene" hypothesis that a genetic predisposition to store energy during food scarcity became disadvantageous after farming. We also identify selection for combinations of alleles that are today associated with lighter skin color, lower risk for schizophrenia and bipolar disease, slower health decline, and increased measures related to cognitive performance (scores on intelligence tests, household income, and years of schooling). These traits are measured in modern industrialized societies, so what phenotypes were adaptive in the past is unclear. We estimate selection coefficients at 9.9 million variants, enabling study of how Darwinian forces couple to allelic effects and shape the genetic architecture of complex traits.
Collapse
Affiliation(s)
- Ali Akbari
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alison R Barton
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Steven Gazal
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Zheng Li
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | | | - Annabel Perry
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yating Zeng
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Alissa Mittnik
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Matthew Mah
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Xiang Zhou
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Alkes L Price
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Eric S Lander
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Ron Pinhasi
- Department of Biology, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
- Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria
| | - Nadin Rohland
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
2
|
Götsch H, Bürger R. Polygenic dynamics underlying the response of quantitative traits to directional selection. Theor Popul Biol 2024; 158:21-59. [PMID: 38677378 DOI: 10.1016/j.tpb.2024.04.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 04/14/2024] [Accepted: 04/19/2024] [Indexed: 04/29/2024]
Abstract
We study the response of a quantitative trait to exponential directional selection in a finite haploid population, both at the genetic and the phenotypic level. We assume an infinite sites model, in which the number of new mutations per generation in the population follows a Poisson distribution (with mean Θ) and each mutation occurs at a new, previously monomorphic site. Mutation effects are beneficial and drawn from a distribution. Sites are unlinked and contribute additively to the trait. Assuming that selection is stronger than random genetic drift, we model the initial phase of the dynamics by a supercritical Galton-Watson process. This enables us to obtain time-dependent results. We show that the copy-number distribution of the mutant in generation n, conditioned on non-extinction until n, is described accurately by the deterministic increase from an initial distribution with mean 1. This distribution is related to the absolutely continuous part W+ of the random variable, typically denoted W, that characterizes the stochasticity accumulating during the mutant's sweep. A suitable transformation yields the approximate dynamics of the mutant frequency distribution in a Wright-Fisher population of size N. Our expression provides a very accurate approximation except when mutant frequencies are close to 1. On this basis, we derive explicitly the (approximate) time dependence of the expected mean and variance of the trait and of the expected number of segregating sites. Unexpectedly, we obtain highly accurate approximations for all times, even for the quasi-stationary phase when the expected per-generation response and the trait variance have equilibrated. The latter refine classical results. In addition, we find that Θ is the main determinant of the pattern of adaptation at the genetic level, i.e., whether the initial allele-frequency dynamics are best described by sweep-like patterns at few loci or small allele-frequency shifts at many. The number of segregating sites is an appropriate indicator for these patterns. The selection strength determines primarily the rate of adaptation. The accuracy of our results is tested by comprehensive simulations in a Wright-Fisher framework. We argue that our results apply to more complex forms of directional selection.
Collapse
Affiliation(s)
- Hannah Götsch
- Faculty of Mathematics, University of Vienna, 1090 Vienna, Austria; Vienna Graduate School of Population Genetics, Austria.
| | - Reinhard Bürger
- Faculty of Mathematics, University of Vienna, 1090 Vienna, Austria
| |
Collapse
|
3
|
Beichman AC, Zhu L, Harris K. The Evolutionary Interplay of Somatic and Germline Mutation Rates. Annu Rev Biomed Data Sci 2024; 7:83-105. [PMID: 38669515 DOI: 10.1146/annurev-biodatasci-102523-104225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
Novel sequencing technologies are making it increasingly possible to measure the mutation rates of somatic cell lineages. Accurate germline mutation rate measurement technologies have also been available for a decade, making it possible to assess how this fundamental evolutionary parameter varies across the tree of life. Here, we review some classical theories about germline and somatic mutation rate evolution that were formulated using principles of population genetics and the biology of aging and cancer. We find that somatic mutation rate measurements, while still limited in phylogenetic diversity, seem consistent with the theory that selection to preserve the soma is proportional to life span. However, germline and somatic theories make conflicting predictions regarding which species should have the most accurate DNA repair. Resolving this conflict will require carefully measuring how mutation rates scale with time and cell division and achieving a better understanding of mutation rate pleiotropy among cell types.
Collapse
Affiliation(s)
- Annabel C Beichman
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA;
| | - Luke Zhu
- Department of Bioengineering, University of Washington, Seattle, Washington, USA
| | - Kelley Harris
- Computational Biology Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA;
| |
Collapse
|
4
|
Yilmaz F, Karageorgiou C, Kim K, Pajic P, Scheer K, Beck CR, Torregrossa AM, Lee C, Gokcumen O. Paleolithic Gene Duplications Primed Adaptive Evolution of Human Amylase Locus Upon Agriculture. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.27.568916. [PMID: 38077078 PMCID: PMC10705236 DOI: 10.1101/2023.11.27.568916] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Starch digestion is a cornerstone of human nutrition. The amylase genes code for the starch-digesting amylase enzyme. Previous studies suggested that the salivary amylase (AMY1) gene copy number increased in response to agricultural diets. However, the lack of nucleotide resolution of the amylase locus hindered detailed evolutionary analyses. Here, we have resolved this locus at nucleotide resolution in 98 present-day humans and identified 30 distinct haplotypes, revealing that the coding sequences of all amylase gene copies are evolving under negative selection. The phylogenetic reconstruction suggested that haplotypes with three AMY1 gene copies, prevalent across all continents and constituting about 70% of observed haplotypes, originated before the out-of-Africa migrations of ancestral modern humans. Using thousands of unique 25 base pair sequences across the amylase locus, we showed that additional AMY1 gene copies existed in the genomes of four archaic hominin genomes, indicating that the initial duplication of this locus may have occurred as far back 800,000 years ago. We similarly analyzed 73 ancient human genomes dating from 300 - 45,000 years ago and found that the AMY1 copy number variation observed today existed long before the advent of agriculture (~10,000 years ago), predisposing this locus to adaptive increase in the frequency of higher amylase copy number with the spread of agriculture. Mechanistically, the common three-copy haplotypes seeded non-allelic homologous recombination events that appear to be occurring at one of the fastest rates seen for tandem repeats in the human genome. Our study provides a comprehensive population-level understanding of the genomic structure of the amylase locus, identifying the mechanisms and evolutionary history underlying its duplication and copy number variability in relation to the onset of agriculture.
Collapse
|
5
|
Shpak M, Lawrence KN, Pool JE. The Precision and Power of Population Branch Statistics in Identifying the Genomic Signatures of Local Adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.14.594139. [PMID: 38798330 PMCID: PMC11118325 DOI: 10.1101/2024.05.14.594139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Population branch statistics, which estimate the branch lengths of focal populations with respect to two outgroups, have been used as an alternative to FST-based genome-wide scans for identifying loci associated with local selective sweeps. In addition to the original population branch statistic (PBS), there are subsequently proposed branch rescalings: normalized population branch statistic (PBSn1), which adjusts focal branch length with respect to outgroup branch lengths at the same locus, and population branch excess (PBE), which also incorporates median branch lengths at other loci. PBSn1 and PBE have been proposed to be less sensitive to allele frequency divergence generated by background selection or geographically ubiquitous positive selection rather than local selective sweeps. However, the accuracy and statistical power of branch statistics have not been systematically assessed. To do so, we simulate genomes in representative large and small populations with varying proportions of sites evolving under genetic drift or background selection (approximated using variable Ne), local selective sweeps, and geographically parallel selective sweeps. We then assess the probability that local selective sweep loci are correctly identified as outliers by FST and by each of the branch statistics. We find that branch statistics consistently outperform FST at identifying local sweeps. When background selection and/or parallel sweeps are introduced, PBSn1 and especially PBE correctly identify local sweeps among their top outliers at a higher frequency than PBS. These results validate the greater specificity of rescaled branch statistics such as PBE to detect population-specific positive selection, supporting their use in genomic studies focused on local adaptation.
Collapse
Affiliation(s)
- Max Shpak
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| | - Kadee N. Lawrence
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| | - John E. Pool
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| |
Collapse
|
6
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. Genetics 2024; 226:iyae006. [PMID: 38242701 PMCID: PMC10990428 DOI: 10.1093/genetics/iyae006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 10/26/2023] [Accepted: 01/03/2024] [Indexed: 01/21/2024] Open
Abstract
For at least the past 5 decades, population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modeling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well-sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations, we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modeling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F Rodrigues
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Andrew D Kern
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Peter L Ralph
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Mathematics, University of Oregon, Eugene, OR 97403, USA
| |
Collapse
|
7
|
Song H, Chu J, Li W, Li X, Fang L, Han J, Zhao S, Ma Y. A Novel Approach Utilizing Domain Adversarial Neural Networks for the Detection and Classification of Selective Sweeps. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2304842. [PMID: 38308186 PMCID: PMC11005742 DOI: 10.1002/advs.202304842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 01/10/2024] [Indexed: 02/04/2024]
Abstract
The identification and classification of selective sweeps are of great significance for improving the understanding of biological evolution and exploring opportunities for precision medicine and genetic improvement. Here, a domain adaptation sweep detection and classification (DASDC) method is presented to balance the alignment of two domains and the classification performance through a domain-adversarial neural network and its adversarial learning modules. DASDC effectively addresses the issue of mismatch between training data and real genomic data in deep learning models, leading to a significant improvement in its generalization capability, prediction robustness, and accuracy. The DASDC method demonstrates improved identification performance compared to existing methods and excels in classification performance, particularly in scenarios where there is a mismatch between application data and training data. The successful implementation of DASDC in real data of three distinct species highlights its potential as a useful tool for identifying crucial functional genes and investigating adaptive evolutionary mechanisms, particularly with the increasing availability of genomic data.
Collapse
Affiliation(s)
- Hui Song
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
| | - Jinyu Chu
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
| | - Wangjiao Li
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
| | - Xinyun Li
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
- Hubei Hongshan LaboratoryWuhan430070China
| | - Lingzhao Fang
- Center for Quantitative Genetics and GenomicsAarhus UniversityAarhus8000Denmark
| | - Jianlin Han
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
- CAAS‐ILRI Joint Laboratory on Livestock and Forage Genetic ResourcesInstitute of Animal ScienceChinese Academy of Agricultural Sciences (CAAS)Beijing100193China
- Livestock Genetics ProgramInternational Livestock Research Institute (ILRI)Nairobi00100Kenya
| | - Shuhong Zhao
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
- Hubei Hongshan LaboratoryWuhan430070China
- Lingnan Modern Agricultural Science and Technology Guangdong LaboratoryGuangzhou510642China
| | - Yunlong Ma
- Key Laboratory of Agricultural Animal GeneticsBreeding, and Reproduction of the Ministry of Education & Key Laboratory of Swine Genetics and Breeding of the Ministry of AgricultureHuazhong Agricultural UniversityWuhan430070China
- Hubei Hongshan LaboratoryWuhan430070China
- Lingnan Modern Agricultural Science and Technology Guangdong LaboratoryGuangzhou510642China
| |
Collapse
|
8
|
Przeworski M. 2023 ASHG Scientific Achievement Award. Am J Hum Genet 2024; 111:425-427. [PMID: 38458164 PMCID: PMC10995464 DOI: 10.1016/j.ajhg.2023.12.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 12/12/2023] [Indexed: 03/10/2024] Open
Abstract
This article is based on the address given by the author at the 2023 meeting of The American Society of Human Genetics (ASHG) in Washington, D.C. A video of the original address can be found at the ASHG website.
Collapse
Affiliation(s)
- Molly Przeworski
- Departments of Biological Sciences and Systems Biology, Columbia University, New York, NY, USA.
| |
Collapse
|
9
|
Liu S, Luo H, Zhang P, Li Y, Hao D, Zhang S, Song T, Xu T, He S. Adaptive Selection of Cis-regulatory Elements in the Han Chinese. Mol Biol Evol 2024; 41:msae034. [PMID: 38377343 PMCID: PMC10917166 DOI: 10.1093/molbev/msae034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 01/18/2024] [Accepted: 02/05/2024] [Indexed: 02/22/2024] Open
Abstract
Cis-regulatory elements have an important role in human adaptation to the living environment. However, the lag in population genomic cohort studies and epigenomic studies, hinders the research in the adaptive analysis of cis-regulatory elements in human populations. In this study, we collected 4,013 unrelated individuals and performed a comprehensive analysis of adaptive selection of genome-wide cis-regulatory elements in the Han Chinese. In total, 12.34% of genomic regions are under the influence of adaptive selection, where 1.00% of enhancers and 2.06% of promoters are under positive selection, and 0.06% of enhancers and 0.02% of promoters are under balancing selection. Gene ontology enrichment analysis of these cis-regulatory elements under adaptive selection reveals that many positive selections in the Han Chinese occur in pathways involved in cell-cell adhesion processes, and many balancing selections are related to immune processes. Two classes of adaptive cis-regulatory elements related to cell adhesion were in-depth analyzed, one is the adaptive enhancers derived from neanderthal introgression, leads to lower hyaluronidase level in skin, and brings better performance on UV-radiation resistance to the Han Chinese. Another one is the cis-regulatory elements regulating wound healing, and the results suggest the positive selection inhibits coagulation and promotes angiogenesis and wound healing in the Han Chinese. Finally, we found that many pathogenic alleles, such as risky alleles of type 2 diabetes or schizophrenia, remain in the population due to the hitchhiking effect of positive selections. Our findings will help deepen our understanding of the adaptive evolution of genome regulation in the Han Chinese.
Collapse
Affiliation(s)
- Shuai Liu
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Huaxia Luo
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Peng Zhang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanyan Li
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Di Hao
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Sijia Zhang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tingrui Song
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan 250117, Shandong, China
| | - Shunmin He
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
10
|
Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024; 20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024] Open
Abstract
Across the human genome, there are large-scale fluctuations in genetic diversity caused by the indirect effects of selection. This "linked selection signal" reflects the impact of selection according to the physical placement of functional regions and recombination rates along chromosomes. Previous work has shown that purifying selection acting against the steady influx of new deleterious mutations at functional portions of the genome shapes patterns of genomic variation. To date, statistical efforts to estimate purifying selection parameters from linked selection models have relied on classic Background Selection theory, which is only applicable when new mutations are so deleterious that they cannot fix in the population. Here, we develop a statistical method based on a quantitative genetics view of linked selection, that models how polygenic additive fitness variance distributed along the genome increases the rate of stochastic allele frequency change. By jointly predicting the equilibrium fitness variance and substitution rate due to both strong and weakly deleterious mutations, we estimate the distribution of fitness effects (DFE) and mutation rate across three geographically distinct human samples. While our model can accommodate weaker selection, we find evidence of strong selection operating similarly across all human samples. Although our quantitative genetic model of linked selection fits better than previous models, substitution rates of the most constrained sites disagree with observed divergence levels. We find that a model incorporating selective interference better predicts observed divergence in conserved regions, but overall our results suggest uncertainty remains about the processes generating fitness variation in humans.
Collapse
Affiliation(s)
- Vince Buffalo
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| | - Andrew D. Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
11
|
Simon A, Coop G. The contribution of gene flow, selection, and genetic drift to five thousand years of human allele frequency change. Proc Natl Acad Sci U S A 2024; 121:e2312377121. [PMID: 38363870 PMCID: PMC10907250 DOI: 10.1073/pnas.2312377121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 01/09/2024] [Indexed: 02/18/2024] Open
Abstract
Genomic time series from experimental evolution studies and ancient DNA datasets offer us a chance to directly observe the interplay of various evolutionary forces. We show how the genome-wide variance in allele frequency change between two time points can be decomposed into the contributions of gene flow, genetic drift, and linked selection. In closed populations, the contribution of linked selection is identifiable because it creates covariances between time intervals, and genetic drift does not. However, repeated gene flow between populations can also produce directionality in allele frequency change, creating covariances. We show how to accurately separate the fraction of variance in allele frequency change due to admixture and linked selection in a population receiving gene flow. We use two human ancient DNA datasets, spanning around 5,000 y, as time transects to quantify the contributions to the genome-wide variance in allele frequency change. We find that a large fraction of genome-wide change is due to gene flow. In both cases, after correcting for known major gene flow events, we do not observe a signal of genome-wide linked selection. Thus despite the known role of selection in shaping long-term polymorphism levels, and an increasing number of examples of strong selection on single loci and polygenic scores from ancient DNA, it appears to be gene flow and drift, and not selection, that are the main determinants of recent genome-wide allele frequency change. Our approach should be applicable to the growing number of contemporary and ancient temporal population genomics datasets.
Collapse
Affiliation(s)
- Alexis Simon
- Center for Population Biology, University of California, Davis, CA95616
- Department of Evolution and Ecology, University of California, Davis, CA95616
| | - Graham Coop
- Center for Population Biology, University of California, Davis, CA95616
- Department of Evolution and Ecology, University of California, Davis, CA95616
| |
Collapse
|
12
|
Zurita AMI, Kyriazis CC, Lohmueller KE. The impact of non-neutral synonymous mutations when inferring selection on non-synonymous mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.07.579314. [PMID: 38370782 PMCID: PMC10871344 DOI: 10.1101/2024.02.07.579314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
The distribution of fitness effects (DFE) describes the proportions of new mutations that have different effects on reproductive fitness. Accurate measurements of the DFE are important because the DFE is a fundamental parameter in evolutionary genetics and has implications for our understanding of other phenomena like complex disease or inbreeding depression. Current computational methods to infer the DFE for nonsynonymous mutations from natural variation first estimate demographic parameters from synonymous variants to control for the effects of demography and background selection. Then, conditional on these parameters, the DFE is then inferred for nonsynonymous mutations. This approach relies on the assumption that synonymous variants are neutrally evolving. However, some evidence points toward synonymous mutations having measurable effects on fitness. To test whether selection on synonymous mutations affects inference of the DFE of nonsynonymous mutations, we simulated several possible models of selection on synonymous mutations using SLiM and attempted to recover the DFE of nonsynonymous mutations using Fit∂a∂i, a common method for DFE inference. Our results show that the presence of selection on synonymous variants leads to incorrect inferences of recent population growth. Furthermore, under certain parameter combinations, inferences of the DFE can have an inflated proportion of highly deleterious nonsynonymous mutations. However, this bias can be eliminated if the correct demographic parameters are used for DFE inference instead of the biased ones inferred from synonymous variants. Our work demonstrates how unmodeled selection on synonymous mutations may affect downstream inferences of the DFE.
Collapse
Affiliation(s)
- Aina Martinez I Zurita
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, USA
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA
| | - Kirk E Lohmueller
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, USA
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, USA
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA
| |
Collapse
|
13
|
Cousins T, Tabin D, Patterson N, Reich D, Durvasula A. Accurate inference of population history in the presence of background selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.18.576291. [PMID: 38313273 PMCID: PMC10838404 DOI: 10.1101/2024.01.18.576291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
All published methods for learning about demographic history make the simplifying assumption that the genome evolves neutrally, and do not seek to account for the effects of natural selection on patterns of variation. This is a major concern, as ample work has demonstrated the pervasive effects of natural selection and in particular background selection (BGS) on patterns of genetic variation in diverse species. Simulations and theoretical work have shown that methods to infer changes in effective population size over time (Ne(t)) become increasingly inaccurate as the strength of linked selection increases. Here, we introduce an extension to the Pairwise Sequentially Markovian Coalescent (PSMC) algorithm, PSMC+, which explicitly co-models demographic history and natural selection. We benchmark our method using forward-in-time simulations with BGS and find that our approach improves the accuracy of effective population size inference. Leveraging a high resolution map of BGS in humans, we infer considerable changes in the magnitude of inferred effective population size relative to previous reports. Finally, we separately infer Ne(t) on the X chromosome and on the autosomes in diverse great apes without making a correction for selection, and find that the inferred ratio fluctuates substantially through time in a way that differs across species, showing that uncorrected selection may be an important driver of signals of genetic difference on the X chromosome and autosomes.
Collapse
Affiliation(s)
- Trevor Cousins
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Daniel Tabin
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Boston, MA, USA
| | - Arun Durvasula
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
14
|
Simon A, Coop G. The contribution of gene flow, selection, and genetic drift to five thousand years of human allele frequency change. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.11.548607. [PMID: 37503227 PMCID: PMC10370008 DOI: 10.1101/2023.07.11.548607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Genomic time series from experimental evolution studies and ancient DNA datasets offer us a chance to directly observe the interplay of various evolutionary forces. We show how the genome-wide variance in allele frequency change between two time points can be decomposed into the contributions of gene flow, genetic drift, and linked selection. In closed populations, the contribution of linked selection is identifiable because it creates covariances between time intervals, and genetic drift does not. However, repeated gene flow between populations can also produce directionality in allele frequency change, creating covariances. We show how to accurately separate the fraction of variance in allele frequency change due to admixture and linked selection in a population receiving gene flow. We use two human ancient DNA datasets, spanning around 5,000 years, as time transects to quantify the contributions to the genome-wide variance in allele frequency change. We find that a large fraction of genome-wide change is due to gene flow. In both cases, after correcting for known major gene flow events, we do not observe a signal of genome-wide linked selection. Thus despite the known role of selection in shaping long-term polymorphism levels, and an increasing number of examples of strong selection on single loci and polygenic scores from ancient DNA, it appears to be gene flow and drift, and not selection, that are the main determinants of recent genome-wide allele frequency change. Our approach should be applicable to the growing number of contemporary and ancient temporal population genomics datasets.
Collapse
Affiliation(s)
- Alexis Simon
- Center for Population Biology, University of California, Davis, CA 95616
- Department of Evolution and Ecology, University of California, Davis, CA 95616
| | - Graham Coop
- Center for Population Biology, University of California, Davis, CA 95616
- Department of Evolution and Ecology, University of California, Davis, CA 95616
| |
Collapse
|
15
|
Schrider DR. Allelic gene conversion softens selective sweeps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.05.570141. [PMID: 38106127 PMCID: PMC10723294 DOI: 10.1101/2023.12.05.570141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
The prominence of positive selection, in which beneficial mutations are favored by natural selection and rapidly increase in frequency, is a subject of intense debate. Positive selection can result in selective sweeps, in which the haplotype(s) bearing the adaptive allele "sweep" through the population, thereby removing much of the genetic diversity from the region surrounding the target of selection. Two models of selective sweeps have been proposed: classical sweeps, or "hard sweeps", in which a single copy of the adaptive allele sweeps to fixation, and "soft sweeps", in which multiple distinct copies of the adaptive allele leave descendants after the sweep. Soft sweeps can be the outcome of recurrent mutation to the adaptive allele, or the presence of standing genetic variation consisting of multiple copies of the adaptive allele prior to the onset of selection. Importantly, soft sweeps will be common when populations can rapidly adapt to novel selective pressures, either because of a high mutation rate or because adaptive alleles are already present. The prevalence of soft sweeps is especially controversial, and it has been noted that selection on standing variation or recurrent mutations may not always produce soft sweeps. Here, we show that the inverse is true: selection on single-origin de novo mutations may often result in an outcome that is indistinguishable from a soft sweep. This is made possible by allelic gene conversion, which "softens" hard sweeps by copying the adaptive allele onto multiple genetic backgrounds, a process we refer to as a "pseudo-soft" sweep. We carried out a simulation study examining the impact of gene conversion on sweeps from a single de novo variant in models of human, Drosophila, and Arabidopsis populations. The fraction of simulations in which gene conversion had produced multiple haplotypes with the adaptive allele upon fixation was appreciable. Indeed, under realistic demographic histories and gene conversion rates, even if selection always acts on a single-origin mutation, sweeps involving multiple haplotypes are more likely than hard sweeps in large populations, especially when selection is not extremely strong. Thus, even when the mutation rate is low or there is no standing variation, hard sweeps are expected to be the exception rather than the rule in large populations. These results also imply that the presence of signatures of soft sweeps does not necessarily mean that adaptation has been especially rapid or is not mutation limited.
Collapse
Affiliation(s)
- Daniel R Schrider
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599
| |
Collapse
|
16
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.07.527547. [PMID: 36798346 PMCID: PMC9934647 DOI: 10.1101/2023.02.07.527547] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
For at least the past five decades population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modelling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modelling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F. Rodrigues
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Andrew D. Kern
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Peter L. Ralph
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
- Department of Mathematics, University of Oregon
| |
Collapse
|
17
|
Pivirotto AM, Platt A, Patel R, Kumar S, Hey J. Analyses of allele age and fitness impact reveal human beneficial alleles to be older than neutral controls. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561569. [PMID: 37873438 PMCID: PMC10592680 DOI: 10.1101/2023.10.09.561569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
A classic population genetic prediction is that alleles experiencing directional selection should swiftly traverse allele frequency space, leaving detectable reductions in genetic variation in linked regions. However, despite this expectation, identifying clear footprints of beneficial allele passage has proven to be surprisingly challenging. We addressed the basic premise underlying this expectation by estimating the ages of large numbers of beneficial and deleterious alleles in a human population genomic data set. Deleterious alleles were found to be young, on average, given their allele frequency. However, beneficial alleles were older on average than non-coding, non-regulatory alleles of the same frequency. This finding is not consistent with directional selection and instead indicates some type of balancing selection. Among derived beneficial alleles, those fixed in the population show higher local recombination rates than those still segregating, consistent with a model in which new beneficial alleles experience an initial period of balancing selection due to linkage disequilibrium with deleterious recessive alleles. Alleles that ultimately fix following a period of balancing selection will leave a modest 'soft' sweep impact on the local variation, consistent with the overall paucity of species-wide 'hard' sweeps in human genomes.
Collapse
Affiliation(s)
| | - Alexander Platt
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- University of Pennsylvania, Department of Genetics, Philadelphia PA 19104, USA
| | - Ravi Patel
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Sudhir Kumar
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Jody Hey
- Temple University, Department of Biology, Philadelphia PA 19122, USA
| |
Collapse
|
18
|
Salazar-Tortosa DF, Huang YF, Enard D. Assessing the Presence of Recent Adaptation in the Human Genome With Mixture Density Regression. Genome Biol Evol 2023; 15:evad170. [PMID: 37713622 PMCID: PMC10563788 DOI: 10.1093/gbe/evad170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Revised: 08/30/2023] [Accepted: 09/04/2023] [Indexed: 09/17/2023] Open
Abstract
How much genome differences between species reflect neutral or adaptive evolution is a central question in evolutionary genomics. In humans and other mammals, the presence of adaptive versus neutral genomic evolution has proven particularly difficult to quantify. The difficulty notably stems from the highly heterogeneous organization of mammalian genomes at multiple levels (functional sequence density, recombination, etc.) which complicates the interpretation and distinction of adaptive versus neutral evolution signals. In this study, we introduce mixture density regressions (MDRs) for the study of the determinants of recent adaptation in the human genome. MDRs provide a flexible regression model based on multiple Gaussian distributions. We use MDRs to model the association between recent selection signals and multiple genomic factors likely to affect the occurrence/detection of positive selection, if the latter was present in the first place to generate these associations. We find that an MDR model with two Gaussian distributions provides an excellent fit to the genome-wide distribution of a common sweep summary statistic (integrated haplotype score), with one of the two distributions likely enriched in positive selection. We further find several factors associated with signals of recent adaptation, including the recombination rate, the density of regulatory elements in immune cells, GC content, gene expression in immune cells, the density of mammal-wide conserved elements, and the distance to the nearest virus-interacting gene. These results support the presence of strong positive selection in recent human evolution and highlight MDRs as a powerful tool to make sense of signals of recent genomic adaptation.
Collapse
Affiliation(s)
- Diego F Salazar-Tortosa
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, USA
- Department of Ecology, University of Granada, Granada, Spain
| | - Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, State College, Pennsylvania, PA 16801, USA
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, State College, Pennsylvania, PA 16801, USA
| | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, USA
| |
Collapse
|
19
|
Tobler R, Souilmi Y, Huber CD, Bean N, Turney CSM, Grey ST, Cooper A. The role of genetic selection and climatic factors in the dispersal of anatomically modern humans out of Africa. Proc Natl Acad Sci U S A 2023; 120:e2213061120. [PMID: 37220274 PMCID: PMC10235988 DOI: 10.1073/pnas.2213061120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 03/14/2023] [Indexed: 05/25/2023] Open
Abstract
The evolutionarily recent dispersal of anatomically modern humans (AMH) out of Africa (OoA) and across Eurasia provides a unique opportunity to examine the impacts of genetic selection as humans adapted to multiple new environments. Analysis of ancient Eurasian genomic datasets (~1,000 to 45,000 y old) reveals signatures of strong selection, including at least 57 hard sweeps after the initial AMH movement OoA, which have been obscured in modern populations by extensive admixture during the Holocene. The spatiotemporal patterns of these hard sweeps provide a means to reconstruct early AMH population dispersals OoA. We identify a previously unsuspected extended period of genetic adaptation lasting ~30,000 y, potentially in the Arabian Peninsula area, prior to a major Neandertal genetic introgression and subsequent rapid dispersal across Eurasia as far as Australia. Consistent functional targets of selection initiated during this period, which we term the Arabian Standstill, include loci involved in the regulation of fat storage, neural development, skin physiology, and cilia function. Similar adaptive signatures are also evident in introgressed archaic hominin loci and modern Arctic human groups, and we suggest that this signal represents selection for cold adaptation. Surprisingly, many of the candidate selected loci across these groups appear to directly interact and coordinately regulate biological processes, with a number associated with major modern diseases including the ciliopathies, metabolic syndrome, and neurodegenerative disorders. This expands the potential for ancestral human adaptation to directly impact modern diseases, providing a platform for evolutionary medicine.
Collapse
Affiliation(s)
- Raymond Tobler
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, SA5005, Australia
| | - Yassine Souilmi
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, SA5005, Australia
- Environment Institute, The University of Adelaide, Adelaide, SA5005, Australia
| | - Christian D. Huber
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, SA5005, Australia
| | - Nigel Bean
- Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers, The University of Adelaide, Adelaide, SA5005, Australia
- School of Mathematical Sciences, The University of Adelaide, Adelaide, SA5005, Australia
| | - Chris S. M. Turney
- Division of Research, University of Technology Sydney, Ultimo, NSW2007, Australia
| | - Shane T. Grey
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW2052, Australia
- Transplantation Immunology Group, Translation Science Pillar, Garvan Institute of Medical Research, Darlinghurst, NSW2010, Australia
| | - Alan Cooper
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, SA5005, Australia
- Blue Sky Genetics, Ashton, SA5137, Australia
| |
Collapse
|
20
|
Linder RA, Zabanavar B, Majumder A, Hoang HCS, Delgado VG, Tran R, La VT, Leemans SW, Long AD. Adaptation in Outbred Sexual Yeast is Repeatable, Polygenic and Favors Rare Haplotypes. Mol Biol Evol 2022; 39:msac248. [PMID: 36366952 PMCID: PMC9728589 DOI: 10.1093/molbev/msac248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We carried out a 200 generation Evolve and Resequence (E&R) experiment initiated from an outbred diploid recombined 18-way synthetic base population. Replicate populations were evolved at large effective population sizes (>105 individuals), exposed to several different chemical challenges over 12 weeks of evolution, and whole-genome resequenced. Weekly forced outcrossing resulted in an average between adjacent-gene per cell division recombination rate of ∼0.0008. Despite attempts to force weekly sex, roughly half of our populations evolved cheaters and appear to be evolving asexually. Focusing on seven chemical stressors and 55 total evolved populations that remained sexual we observed large fitness gains and highly repeatable patterns of genome-wide haplotype change within chemical challenges, with limited levels of repeatability across chemical treatments. Adaptation appears highly polygenic with almost the entire genome showing significant and consistent patterns of haplotype change with little evidence for long-range linkage disequilibrium in a subset of populations for which we sequenced haploid clones. That is, almost the entire genome is under selection or drafting with selected sites. At any given locus adaptation was almost always dominated by one of the 18 founder's alleles, with that allele varying spatially and between treatments, suggesting that selection acts primarily on rare variants private to a founder or haplotype blocks harboring multiple mutations.
Collapse
Affiliation(s)
- Robert A Linder
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Behzad Zabanavar
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Arundhati Majumder
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Hannah Chiao-Shyan Hoang
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Vanessa Genesaret Delgado
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Ryan Tran
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Vy Thoai La
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| | - Simon William Leemans
- Department of Biomedical Engineering, School of Engineering, University of California, Irvine
| | - Anthony D Long
- Department of Ecology and Evolutionary Biology, School of Biological Sciences, University of California, Irvine
| |
Collapse
|
21
|
Souilmi Y, Tobler R, Johar A, Williams M, Grey ST, Schmidt J, Teixeira JC, Rohrlach A, Tuke J, Johnson O, Gower G, Turney C, Cox M, Cooper A, Huber CD. Admixture has obscured signals of historical hard sweeps in humans. Nat Ecol Evol 2022; 6:2003-2015. [PMID: 36316412 PMCID: PMC9715430 DOI: 10.1038/s41559-022-01914-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 09/16/2022] [Indexed: 11/06/2022]
Abstract
The role of natural selection in shaping biological diversity is an area of intense interest in modern biology. To date, studies of positive selection have primarily relied on genomic datasets from contemporary populations, which are susceptible to confounding factors associated with complex and often unknown aspects of population history. In particular, admixture between diverged populations can distort or hide prior selection events in modern genomes, though this process is not explicitly accounted for in most selection studies despite its apparent ubiquity in humans and other species. Through analyses of ancient and modern human genomes, we show that previously reported Holocene-era admixture has masked more than 50 historic hard sweeps in modern European genomes. Our results imply that this canonical mode of selection has probably been underappreciated in the evolutionary history of humans and suggest that our current understanding of the tempo and mode of selection in natural populations may be inaccurate.
Collapse
Affiliation(s)
- Yassine Souilmi
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia.
| | - Raymond Tobler
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia.
- Evolution of Cultural Diversity Initiative, Australian National University, Canberra, Australian Capital Territory, Australia.
| | - Angad Johar
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia.
- Department of Cardiovascular Diseases, Mayo Clinic, Rochester, MN, USA.
| | - Matthew Williams
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia
| | - Shane T Grey
- Transplantation Immunology Group, Immunology Division, Garvan Institute of Medical Research, Darlinghurst, New South Wales, Australia
- St Vincent's Clinical School, Faculty of Medicine, UNSW, Darlinghurst, New South Wales, Australia
| | - Joshua Schmidt
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia
| | - João C Teixeira
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia
| | - Adam Rohrlach
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Archaeogenetics, Max Planck Institute for the Science of Human History, Jena, Germany
| | - Jonathan Tuke
- ARC Centre of Excellence for Mathematical and Statistical Frontiers, The University of Adelaide, Adelaide, South Australia, Australia
- School of Mathematical Sciences, The University of Adelaide, Adelaide, South Australia, Australia
| | - Olivia Johnson
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia
| | - Graham Gower
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia
| | - Chris Turney
- Chronos 14Carbon-Cycle Facility and Earth and Sustainability Science Research Centre, University of New South Wales, Sydney, New South Wales, Australia
| | - Murray Cox
- Statistics and Bioinformatics Group, School of Fundamental Sciences, Massey University, Palmerston North, New Zealand
| | - Alan Cooper
- South Australian Museum, Adelaide, South Australia, Australia.
- BlueSky Genetics, Ashton, South Australia, Australia.
| | - Christian D Huber
- Australian Centre for Ancient DNA, The University of Adelaide, Adelaide, South Australia, Australia.
- Department of Biology, Penn State University, University Park, PA, USA.
| |
Collapse
|
22
|
Gray OA, Yoo J, Sobreira DR, Jousma J, Witonsky D, Sakabe NJ, Peng YJ, Prabhakar NR, Fang Y, Nobréga MA, Di Rienzo A. A pleiotropic hypoxia-sensitive EPAS1 enhancer is disrupted by adaptive alleles in Tibetans. SCIENCE ADVANCES 2022; 8:eade1942. [PMID: 36417539 PMCID: PMC9683707 DOI: 10.1126/sciadv.ade1942] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 10/25/2022] [Indexed: 06/16/2023]
Abstract
In Tibetans, noncoding alleles in EPAS1-whose protein product hypoxia-inducible factor 2α (HIF-2α) drives the response to hypoxia-carry strong signatures of positive selection; however, their functional mechanism has not been systematically examined. Here, we report that high-altitude alleles disrupt the activity of four EPAS1 enhancers in one or more cell types. We further characterize one enhancer (ENH5) whose activity is both allele specific and hypoxia dependent. Deletion of ENH5 results in down-regulation of EPAS1 and HIF-2α targets in acute hypoxia and in a blunting of the transcriptional response to sustained hypoxia. Deletion of ENH5 in mice results in dysregulation of gene expression across multiple tissues. We propose that pleiotropic adaptive effects of the Tibetan alleles in EPAS1 underlie the strong selective signal at this gene.
Collapse
Affiliation(s)
- Olivia A. Gray
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Jennifer Yoo
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
- Institute for Integrative Physiology and Center for Systems Biology of O2 Sensing, The University of Chicago, Chicago, IL 60637, USA
- Department of Medicine, The University of Chicago, Chicago, IL 60637, USA
| | - Débora R. Sobreira
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Jordan Jousma
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - David Witonsky
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Noboru J. Sakabe
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Ying-Jie Peng
- Institute for Integrative Physiology and Center for Systems Biology of O2 Sensing, The University of Chicago, Chicago, IL 60637, USA
| | - Nanduri R. Prabhakar
- Institute for Integrative Physiology and Center for Systems Biology of O2 Sensing, The University of Chicago, Chicago, IL 60637, USA
| | - Yun Fang
- Department of Medicine, The University of Chicago, Chicago, IL 60637, USA
| | - Marcelo A. Nobréga
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| | - Anna Di Rienzo
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
23
|
Le MK, Smith OS, Akbari A, Harpak A, Reich D, Narasimhan VM. 1,000 ancient genomes uncover 10,000 years of natural selection in Europe. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.08.24.505188. [PMID: 36052370 PMCID: PMC9435429 DOI: 10.1101/2022.08.24.505188] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Ancient DNA has revolutionized our understanding of human population history. However, its potential to examine how rapid cultural evolution to new lifestyles may have driven biological adaptation has not been met, largely due to limited sample sizes. We assembled genome-wide data from 1,291 individuals from Europe over 10,000 years, providing a dataset that is large enough to resolve the timing of selection into the Neolithic, Bronze Age, and Historical periods. We identified 25 genetic loci with rapid changes in frequency during these periods, a majority of which were previously undetected. Signals specific to the Neolithic transition are associated with body weight, diet, and lipid metabolism-related phenotypes. They also include immune phenotypes, most notably a locus that confers immunity to Salmonella infection at a time when ancient Salmonella genomes have been shown to adapt to human hosts, thus providing a possible example of human-pathogen co-evolution. In the Bronze Age, selection signals are enriched near genes involved in pigmentation and immune-related traits, including at a key human protein interactor of SARS-CoV-2. Only in the Historical period do the selection candidates we detect largely mirror previously-reported signals, highlighting how the statistical power of previous studies was limited to the last few millennia. The Historical period also has multiple signals associated with vitamin D binding, providing evidence that lactase persistence may have been part of an oligogenic adaptation for efficient calcium uptake and challenging the theory that its adaptive value lies only in facilitating caloric supplementation during times of scarcity. Finally, we detect selection on complex traits in all three periods, including selection favoring variants that reduce body weight in the Neolithic. In the Historical period, we detect selection favoring variants that increase risk for cardiovascular disease plausibly reflecting selection for a more active inflammatory response that would have been adaptive in the face of increased infectious disease exposure. Our results provide an evolutionary rationale for the high prevalence of these deadly diseases in modern societies today and highlight the unique power of ancient DNA in elucidating biological change that accompanied the profound cultural transformations of recent human history.
Collapse
Affiliation(s)
- Megan K Le
- Department of Computer Science, The University of Texas at Austin
| | - Olivia S Smith
- Department of Integrative Biology, The University of Texas at Austin
| | - Ali Akbari
- Department of Genetics, Harvard Medical School
- Department of Human Evolutionary Biology, Harvard University
- Broad Institute of MIT and Harvard
| | - Arbel Harpak
- Department of Integrative Biology, The University of Texas at Austin
- Department of Population Health, Dell Medical School
| | - David Reich
- Department of Genetics, Harvard Medical School
- Department of Human Evolutionary Biology, Harvard University
- Howard Hughes Medical Institute, Harvard Medical School
- Broad Institute of MIT and Harvard
| | - Vagheesh M Narasimhan
- Department of Integrative Biology, The University of Texas at Austin
- Department of Statistics and Data Science, The University of Texas at Austin
| |
Collapse
|
24
|
Yair S, Coop G. Population differentiation of polygenic score predictions under stabilizing selection. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200416. [PMID: 35430887 PMCID: PMC9014188 DOI: 10.1098/rstb.2020.0416] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 03/08/2022] [Indexed: 12/15/2022] Open
Abstract
Given the many small-effect loci uncovered by genome-wide association studies (GWAS), polygenic scores have become central to genomic medicine, and have found application in diverse settings including evolutionary studies of adaptation. Despite their promise, polygenic scores have been found to suffer from limited portability across human populations. This at first seems in conflict with the observation that most common genetic variation is shared among populations. We investigate one potential cause of this discrepancy: stabilizing selection on complex traits. Counterintuitively, while stabilizing selection constrains phenotypic evolution, it accelerates the loss and fixation of alleles underlying trait variation within populations (GWAS loci). Thus even when populations share an optimum phenotype, stabilizing selection erodes the variance contributed by their shared GWAS loci, such that predictions from GWAS in one population explain less of the phenotypic variation in another. We develop theory to quantify how stabilizing selection is expected to reduce the prediction accuracy of polygenic scores in populations not represented in GWAS samples. In addition, we find that polygenic scores can substantially overstate average genetic differences of phenotypes among populations. We emphasize stabilizing selection around a common optimum as a useful null model to connect patterns of allele frequency and polygenic score differentiation. This article is part of the theme issue 'Celebrating 50 years since Lewontin's apportionment of human diversity'.
Collapse
Affiliation(s)
- Sivan Yair
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Graham Coop
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| |
Collapse
|
25
|
Roca-Umbert A, Caro-Consuegra R, Londono-Correa D, Rodriguez-Lozano GF, Vicente R, Bosch E. Understanding signatures of positive natural selection in human zinc transporter genes. Sci Rep 2022; 12:4320. [PMID: 35279701 PMCID: PMC8918337 DOI: 10.1038/s41598-022-08439-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 02/25/2022] [Indexed: 12/11/2022] Open
Abstract
Zinc is an essential micronutrient with a tightly regulated systemic and cellular homeostasis. In humans, some zinc transporter genes (ZTGs) have been previously reported as candidates for strong geographically restricted selective sweeps. However, since zinc homeostasis is maintained by the joint action of 24 ZTGs, other more subtle modes of selection could have also facilitated human adaptation to zinc availability. Here, we studied whether the complete set of ZTGs are enriched for signals of positive selection in worldwide populations and population groups from South Asia. ZTGs showed higher levels of genetic differentiation between African and non-African populations than would be randomly expected, as well as other signals of polygenic selection outside Africa. Moreover, in several South Asian population groups, ZTGs were significantly enriched for SNPs with unusually extended haplotypes and displayed SNP genotype-environmental correlations when considering zinc deficiency levels in soil in that geographical area. Our study replicated some well-characterized targets for positive selection in East Asia and sub-Saharan Africa, and proposes new candidates for follow-up in South Asia (SLC39A5) and Africa (SLC39A7). Finally, we identified candidate variants for adaptation in ZTGs that could contribute to different disease susceptibilities and zinc-related human health traits.
Collapse
Affiliation(s)
- Ana Roca-Umbert
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain
| | - Rocio Caro-Consuegra
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain
| | - Diego Londono-Correa
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain
| | - Gabriel Felipe Rodriguez-Lozano
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain
| | - Ruben Vicente
- Laboratory of Molecular Physiology, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain
| | - Elena Bosch
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, 08003, Barcelona, Spain. .,Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), 43206, Reus, Spain.
| |
Collapse
|
26
|
Saitou M, Masuda N, Gokcumen O. Similarity-Based Analysis of Allele Frequency Distribution among Multiple Populations Identifies Adaptive Genomic Structural Variants. Mol Biol Evol 2022; 39:msab313. [PMID: 34718708 PMCID: PMC8896759 DOI: 10.1093/molbev/msab313] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Structural variants have a considerable impact on human genomic diversity. However, their evolutionary history remains mostly unexplored. Here, we developed a new method to identify potentially adaptive structural variants based on a similarity-based analysis that incorporates genotype frequency data from 26 populations simultaneously. Using this method, we analyzed 57,629 structural variants and identified 576 structural variants that show unusual population differentiation. Of these putatively adaptive structural variants, we further showed that 24 variants are multiallelic and overlap with coding sequences, and 20 variants are significantly associated with GWAS traits. Closer inspection of the haplotypic variation associated with these putatively adaptive and functional structural variants reveals deviations from neutral expectations due to: 1) population differentiation of rapidly evolving multiallelic variants, 2) incomplete sweeps, and 3) recent population-specific negative selection. Overall, our study provides new methodological insights, documents hundreds of putatively adaptive variants, and introduces evolutionary models that may better explain the complex evolution of structural variants.
Collapse
Affiliation(s)
- Marie Saitou
- Department of Biological Sciences, University at Buffalo, State University of New York, Buffalo, NY, USA
- Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
| | - Naoki Masuda
- Department of Mathematics, University at Buffalo, State University of New York, Buffalo, NY, USA
- Computational and Data-Enabled Science and Engineering Program, University at Buffalo, State University of New York, Buffalo, NY, USA
| | - Omer Gokcumen
- Department of Biological Sciences, University at Buffalo, State University of New York, Buffalo, NY, USA
| |
Collapse
|
27
|
Maiorano AM, Cardoso DF, Carvalheiro R, Júnior GAF, de Albuquerque LG, de Oliveira HN. Signatures of selection in Nelore cattle revealed by whole-genome sequencing data. Genomics 2022; 114:110304. [DOI: 10.1016/j.ygeno.2022.110304] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 01/07/2022] [Accepted: 02/01/2022] [Indexed: 11/04/2022]
|
28
|
Whiteman NK. Evolution in small steps and giant leaps. Evolution 2022; 76:67-77. [PMID: 35040122 PMCID: PMC9387839 DOI: 10.1111/evo.14432] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 12/28/2021] [Accepted: 01/03/2022] [Indexed: 02/03/2023]
Abstract
The first Editor of Evolution was Ernst Mayr. His foreword to the first issue of Evolution published in 1947 framed evolution as a "problem of interaction" that was just beginning to be studied in this broad context. First, I explore progress and prospects on understanding the subsidiary interactions identified by Mayr, including interactions between parts of organisms, between individuals and populations, between species, and between the organism and its abiotic environment. Mayr's overall "problem of interaction" framework is examined in the context of coevolution within and among levels of biological organization. This leads to a comparison in the relative roles of biotic versus abiotic agents of selection and fluctuating versus directional selection, followed by stabilizing selection in shaping the genomic architecture of adaptation. Oligogenic architectures may be typical for traits shaped more by fluctuating selection and biotic selection. Conversely, polygenic architectures may be typical for traits shaped more by directional followed by stabilizing selection and abiotic selection. The distribution of effect sizes and turnover dynamics of adaptive alleles in these scenarios deserves further study. Second, I review two case studies on the evolution of acquired toxicity in animals, one involving cardiac glycosides obtained from plants and one involving bacterial virulence factors horizontally transferred to animals. The approaches used in these studies and the results gained directly flow from Mayr's vision of an evolutionary biology that revolves around the "problem of interaction."
Collapse
Affiliation(s)
- Noah K. Whiteman
- Department of Integrative Biology, University of California, Berkeley, California 94720
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720
| |
Collapse
|
29
|
Fine human genetic map based on UK10K data set. Hum Genet 2022; 141:273-281. [DOI: 10.1007/s00439-021-02415-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 12/03/2021] [Indexed: 11/04/2022]
|
30
|
Klassmann A, Gautier M. Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data. PLoS One 2022; 17:e0262024. [PMID: 35041674 PMCID: PMC8765611 DOI: 10.1371/journal.pone.0262024] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 12/15/2021] [Indexed: 12/19/2022] Open
Abstract
Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here.
Collapse
Affiliation(s)
| | - Mathieu Gautier
- CBGP, Univ Montpellier, CIRAD, INRAE, IRD, Institut Agro, Montpellier, France
| |
Collapse
|
31
|
Liang YY, Shi Y, Yuan S, Zhou BF, Chen XY, An QQ, Ingvarsson PK, Plomion C, Wang B. Linked selection shapes the landscape of genomic variation in three oak species. THE NEW PHYTOLOGIST 2022; 233:555-568. [PMID: 34637540 DOI: 10.1111/nph.17793] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 09/27/2021] [Indexed: 06/13/2023]
Abstract
Natural selection shapes genome-wide patterns of diversity within species and divergence between species. However, quantifying the efficacy of selection and elucidating the relative importance of different types of selection in shaping genomic variation remain challenging. We sequenced whole genomes of 101 individuals of three closely related oak species to track the divergence history, and to dissect the impacts of selective sweeps and background selection on patterns of genomic variation. We estimated that the three species diverged around the late Neogene and experienced a bottleneck during the Pleistocene. We detected genomic regions with elevated relative differentiation ('FST -islands'). Population genetic inferences from the site frequency spectrum and ancestral recombination graph indicated that FST -islands were formed by selective sweeps. We also found extensive positive selection; the fixation of adaptive mutations and reduction neutral diversity around substitutions generated a signature of selective sweeps. Prevalent negative selection and background selection have reduced genetic diversity in both genic and intergenic regions, and contributed substantially to the baseline variation in genetic diversity. Our results demonstrate the importance of linked selection in shaping genomic variation, and illustrate how the extent and strength of different selection models vary across the genome.
Collapse
Affiliation(s)
- Yi-Ye Liang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- University of the Chinese Academy of Sciences, Beijing, 100049, China
| | - Yong Shi
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Shuai Yuan
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Biao-Feng Zhou
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Xue-Yan Chen
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Qing-Qing An
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Pär K Ingvarsson
- Department of Plant Biology, Linnean Center for Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Sciences, Uppsala, SE-75007, Sweden
| | | | - Baosheng Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, 510650, China
| |
Collapse
|
32
|
Hayward LK, Sella G. Polygenic adaptation after a sudden change in environment. eLife 2022; 11:66697. [PMID: 36155653 PMCID: PMC9683794 DOI: 10.7554/elife.66697] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Polygenic adaptation is thought to be ubiquitous, yet remains poorly understood. Here, we model this process analytically, in the plausible setting of a highly polygenic, quantitative trait that experiences a sudden shift in the fitness optimum. We show how the mean phenotype changes over time, depending on the effect sizes of loci that contribute to variance in the trait, and characterize the allele dynamics at these loci. Notably, we describe the two phases of the allele dynamics: The first is a rapid phase, in which directional selection introduces small frequency differences between alleles whose effects are aligned with or opposed to the shift, ultimately leading to small differences in their probability of fixation during a second, longer phase, governed by stabilizing selection. As we discuss, key results should hold in more general settings and have important implications for efforts to identify the genetic basis of adaptation in humans and other species.
Collapse
Affiliation(s)
- Laura Katharine Hayward
- Department of Mathematics, Columbia UniversityNew YorkUnited States,Institute of Science and TechnologyMaria GuggingAustria
| | - Guy Sella
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States,Program for Mathematical Genomics, Columbia UniversityNew YorkUnited States
| |
Collapse
|
33
|
Kun Á. Is there still evolution in the human population? Biol Futur 2022; 73:359-374. [PMID: 36592324 PMCID: PMC9806833 DOI: 10.1007/s42977-022-00146-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 12/08/2022] [Indexed: 01/03/2023]
Abstract
It is often claimed that humanity has stopped evolving because modern medicine erased all selection on survival. Even if that would be true, and it is not, there would be other mechanisms of evolution which could still led to changes in allelic frequencies. Here I show, by applying basic evolutionary genetics knowledge, that we expect humanity to evolve. The results from genome sequencing projects have repeatedly affirmed that there are still recent signs of selection in our genomes. I give some examples of such adaptation. Then I briefly discuss what our evolutionary future has in store for us.
Collapse
Affiliation(s)
- Ádám Kun
- grid.5591.80000 0001 2294 6276Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary ,Parmenides Center for the Conceptual Foundations of Science, Pöcking, Germany ,grid.481817.3Institute of Evolution, Centre for Ecological Research, Budapest, Hungary ,grid.5018.c0000 0001 2149 4407MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Budapest, Hungary ,grid.5018.c0000 0001 2149 4407MTA-ELTE-MTM Ecology Research Group, Budapest, Hungary
| |
Collapse
|
34
|
Levy SB, Leonard WR. The evolutionary significance of human brown adipose tissue: Integrating the timescales of adaptation. Evol Anthropol 2021; 31:75-91. [PMID: 34910348 DOI: 10.1002/evan.21930] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 05/14/2021] [Accepted: 11/19/2021] [Indexed: 12/20/2022]
Abstract
While human adaptability is regarded as a classical topic in anthropology, recent work provides new insight into metabolic adaptations to cold climates and the role of phenotypic plasticity in human evolution. A growing body of literature demonstrates that adults retain brown adipose tissue (BAT) which may play a role in non-shivering thermogenesis. In this narrative review, we apply the timescales of adaptation framework in order to explore the adaptive significance of human BAT. Human variation in BAT is shaped by multiple adaptive modes (i.e., allostasis, acclimatization, developmental adaptation, epigenetic inheritance, and genetic adaptation), and together the adaptive modes act as an integrated system. We hypothesize that plasticity in BAT facilitated the successful expansion of human populations into circumpolar regions, allowing for selection of genetic adaptations to cold climates to take place. Future research rooted in human energetics and biocultural perspectives is essential for understanding BAT's adaptive and health significance.
Collapse
Affiliation(s)
- Stephanie B Levy
- Department of Anthropology, CUNY Hunter College, New York, New York, USA.,New York Consortium in Evolutionary Primatology, New York, New York, USA
| | - William R Leonard
- Department of Anthropology, Northwestern University, Evanston, Illinois, USA
| |
Collapse
|
35
|
Laval G, Patin E, Boutillier P, Quintana-Murci L. Sporadic occurrence of recent selective sweeps from standing variation in humans as revealed by an approximate Bayesian computation approach. Genetics 2021; 219:6377789. [PMID: 34849862 DOI: 10.1093/genetics/iyab161] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 09/01/2021] [Indexed: 12/14/2022] Open
Abstract
During their dispersals over the last 100,000 years, modern humans have been exposed to a large variety of environments, resulting in genetic adaptation. While genome-wide scans for the footprints of positive Darwinian selection have increased knowledge of genes and functions potentially involved in human local adaptation, they have globally produced evidence of a limited contribution of selective sweeps in humans. Conversely, studies based on machine learning algorithms suggest that recent sweeps from standing variation are widespread in humans, an observation that has been recently questioned. Here, we sought to formally quantify the number of recent selective sweeps in humans, by leveraging approximate Bayesian computation and whole-genome sequence data. Our computer simulations revealed suitable ABC estimations, regardless of the frequency of the selected alleles at the onset of selection and the completion of sweeps. Under a model of recent selection from standing variation, we inferred that an average of 68 (from 56 to 79) and 140 (from 94 to 198) sweeps occurred over the last 100,000 years of human history, in African and Eurasian populations, respectively. The former estimation is compatible with human adaptation rates estimated since divergence with chimps, and reveals numbers of sweeps per generation per site in the range of values estimated in Drosophila. Our results confirm the rarity of selective sweeps in humans and show a low contribution of sweeps from standing variation to recent human adaptation.
Collapse
Affiliation(s)
- Guillaume Laval
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France
| | - Etienne Patin
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France
| | - Pierre Boutillier
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Lluis Quintana-Murci
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France.,Human Genomics and Evolution, Collège de France, 75005 Paris, France
| |
Collapse
|
36
|
Johri P, Charlesworth B, Howell EK, Lynch M, Jensen JD. Revisiting the notion of deleterious sweeps. Genetics 2021; 219:iyab094. [PMID: 34125884 PMCID: PMC9101445 DOI: 10.1093/genetics/iyab094] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 06/08/2021] [Indexed: 11/14/2022] Open
Abstract
It has previously been shown that, conditional on its fixation, the time to fixation of a semi-dominant deleterious autosomal mutation in a randomly mating population is the same as that of an advantageous mutation. This result implies that deleterious mutations could generate selective sweep-like effects. Although their fixation probabilities greatly differ, the much larger input of deleterious relative to beneficial mutations suggests that this phenomenon could be important. We here examine how the fixation of mildly deleterious mutations affects levels and patterns of polymorphism at linked sites-both in the presence and absence of interference amongst deleterious mutations-and how this class of sites may contribute to divergence between-populations and species. We find that, while deleterious fixations are unlikely to represent a significant proportion of outliers in polymorphism-based genomic scans within populations, minor shifts in the frequencies of deleterious mutations can influence the proportions of private variants and the value of FST after a recent population split. As sites subject to deleterious mutations are necessarily found in functional genomic regions, interpretations in terms of recurrent positive selection may require reconsideration.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Emma K Howell
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Michael Lynch
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
- Center for Mechanisms of Evolution, The Biodesign Institute, Arizona State University, Tempe, AZ 85287, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
37
|
Villegas-Mirón P, Acosta S, Nye J, Bertranpetit J, Laayouni H. Chromosome X-wide Analysis of Positive Selection in Human Populations: Common and Private Signals of Selection and its Impact on Inactivated Genes and Enhancers. Front Genet 2021; 12:714491. [PMID: 34646300 PMCID: PMC8502928 DOI: 10.3389/fgene.2021.714491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 09/08/2021] [Indexed: 01/22/2023] Open
Abstract
The ability of detecting adaptive (positive) selection in the genome has opened the possibility of understanding the genetic basis of population-specific adaptations genome-wide. Here, we present the analysis of recent selective sweeps, specifically in the X chromosome, in human populations from the third phase of the 1,000 Genomes Project using three different haplotype-based statistics. We describe instances of recent positive selection that fit the criteria of hard or soft sweeps, and detect a higher number of events among sub-Saharan Africans than non-Africans (Europe and East Asia). A global enrichment of neural-related processes is observed and numerous genes related to fertility appear among the top candidates, reflecting the importance of reproduction in human evolution. Commonalities with previously reported genes under positive selection are found, while particularly strong new signals are reported in specific populations or shared across different continental groups. We report an enrichment of signals in genes that escape X chromosome inactivation, which may contribute to the differentiation between sexes. We also provide evidence of a widespread presence of soft-sweep-like signatures across the chromosome and a global enrichment of highly scoring regions that overlap potential regulatory elements. Among these, enhancers-like signatures seem to present putative signals of positive selection which might be in concordance with selection in their target genes. Also, particularly strong signals appear in regulatory regions that show differential activities, which might point to population-specific regulatory adaptations.
Collapse
Affiliation(s)
- Pablo Villegas-Mirón
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona, Spain
| | - Sandra Acosta
- Department Pathology and Experimental Therapeutics, Medical School, University of Barcelona, Barcelona, Spain
| | - Jessica Nye
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona, Spain
| | - Jaume Bertranpetit
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona, Spain
| | - Hafid Laayouni
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Barcelona, Spain.,Bioinformatics Studies, ESCI-UPF, Barcelona, Spain
| |
Collapse
|
38
|
Ojeda-Granados C, Abondio P, Setti A, Sarno S, Gnecchi-Ruscone GA, González-Orozco E, De Fanti S, Jiménez-Kaufmann A, Rangel-Villalobos H, Moreno-Estrada A, Sazzini M. Dietary, Cultural and Pathogens-Related Selective Pressures Shaped Differential Adaptive Evolution Among Native Mexican Populations. Mol Biol Evol 2021; 39:6379730. [PMID: 34597392 PMCID: PMC8763094 DOI: 10.1093/molbev/msab290] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Native American genetic ancestry has been remarkably implicated with increased risk of diverse health issues in several Mexican populations, especially in relation to the dramatic changes in environmental, dietary, and cultural settings they have recently undergone. In particular, the effects of these ecological transitions and Westernization of lifestyles have been investigated so far predominantly on Mestizo individuals. Nevertheless, indigenous groups, rather than admixed Mexicans, have plausibly retained the highest proportions of genetic components shaped by natural selection in response to the ancient milieu experienced by Mexican ancestors during their pre-Columbian evolutionary history. These formerly adaptive variants have the potential to represent the genetic determinants of some biological traits that are peculiar to Mexican people, as well as a reservoir of loci with possible biomedical relevance. To test such a hypothesis, we used genome-wide genotype data to infer the unique adaptive evolution of Native Mexican groups selected as reasonable descendants of the main pre-Columbian Mexican civilizations. A combination of haplotype-based and gene-network analyses enabled us to detect genomic signatures ascribable to polygenic adaptive traits plausibly evolved by the main genetic clusters of Mexican indigenous populations to cope with local environmental and/or cultural conditions. Some of these adaptations were found to play a role in modulating the susceptibility/resistance of these groups to certain pathological conditions, thus providing new evidence that diverse selective pressures have contributed to shape the current biological and disease-risk patterns of present-day Native and Mestizo Mexican populations.
Collapse
Affiliation(s)
- Claudia Ojeda-Granados
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy.,Department of Molecular Biology in Medicine, Civil Hospital of Guadalajara "Fray Antonio Alcalde" & Health Sciences Center, University of Guadalajara, Jalisco, Mexico
| | - Paolo Abondio
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy
| | - Alice Setti
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy.,Laboratory of Molecular Virology, Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Povo-Trento, Italy
| | - Stefania Sarno
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy
| | - Guido Alberto Gnecchi-Ruscone
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy.,Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Eduardo González-Orozco
- National Laboratory of Genomics for Biodiversity (LANGEBIO), UGA, CINVESTAV, Irapuato, Guanajuato, Mexico
| | - Sara De Fanti
- Interdepartmental Centre Alma Mater Research Institute on Global Challenges and Climate Change, University of Bologna, Italy
| | - Andres Jiménez-Kaufmann
- National Laboratory of Genomics for Biodiversity (LANGEBIO), UGA, CINVESTAV, Irapuato, Guanajuato, Mexico
| | - Héctor Rangel-Villalobos
- Instituto de Investigación en Genética Molecular, Centro Universitario de la Ciénega, Universidad de Guadalajara, Ocotlán, Jalisco, Mexico
| | - Andrés Moreno-Estrada
- National Laboratory of Genomics for Biodiversity (LANGEBIO), UGA, CINVESTAV, Irapuato, Guanajuato, Mexico
| | - Marco Sazzini
- Laboratory of Molecular Anthropology & Centre for Genome Biology, Department of Biological, Geological and Environmental Sciences (BiGeA), University of Bologna, Bologna, Italy.,Interdepartmental Centre Alma Mater Research Institute on Global Challenges and Climate Change, University of Bologna, Italy
| |
Collapse
|
39
|
Rabier CE, Berry V, Stoltz M, Santos JD, Wang W, Glaszmann JC, Pardi F, Scornavacca C. On the inference of complex phylogenetic networks by Markov Chain Monte-Carlo. PLoS Comput Biol 2021; 17:e1008380. [PMID: 34478440 PMCID: PMC8445492 DOI: 10.1371/journal.pcbi.1008380] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Revised: 09/16/2021] [Accepted: 07/13/2021] [Indexed: 11/19/2022] Open
Abstract
For various species, high quality sequences and complete genomes are nowadays available for many individuals. This makes data analysis challenging, as methods need not only to be accurate, but also time efficient given the tremendous amount of data to process. In this article, we introduce an efficient method to infer the evolutionary history of individuals under the multispecies coalescent model in networks (MSNC). Phylogenetic networks are an extension of phylogenetic trees that can contain reticulate nodes, which allow to model complex biological events such as horizontal gene transfer, hybridization and introgression. We present a novel way to compute the likelihood of biallelic markers sampled along genomes whose evolution involved such events. This likelihood computation is at the heart of a Bayesian network inference method called SnappNet, as it extends the Snapp method inferring evolutionary trees under the multispecies coalescent model, to networks. SnappNet is available as a package of the well-known beast 2 software. Recently, the MCMC_BiMarkers method, implemented in PhyloNet, also extended Snapp to networks. Both methods take biallelic markers as input, rely on the same model of evolution and sample networks in a Bayesian framework, though using different methods for computing priors. However, SnappNet relies on algorithms that are exponentially more time-efficient on non-trivial networks. Using simulations, we compare performances of SnappNet and MCMC_BiMarkers. We show that both methods enjoy similar abilities to recover simple networks, but SnappNet is more accurate than MCMC_BiMarkers on more complex network scenarios. Also, on complex networks, SnappNet is found to be extremely faster than MCMC_BiMarkers in terms of time required for the likelihood computation. We finally illustrate SnappNet performances on a rice data set. SnappNet infers a scenario that is consistent with previous results and provides additional understanding of rice evolution.
Collapse
Affiliation(s)
- Charles-Elie Rabier
- Institut des Sciences de l’Evolution (ISEM), Université de Montpellier, CNRS, EPHE, IRD, Montpellier, France
- Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM), Université de Montpellier, CNRS, Montpellier, France
- Institut Montpelliérain Alexander Grothendieck (IMAG), Université de Montpellier, CNRS, Montpellier, France
| | - Vincent Berry
- Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM), Université de Montpellier, CNRS, Montpellier, France
| | - Marnus Stoltz
- Institut des Sciences de l’Evolution (ISEM), Université de Montpellier, CNRS, EPHE, IRD, Montpellier, France
| | - João D. Santos
- CIRAD, UMR AGAP, Montpellier, France
- Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP), Université de Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Wensheng Wang
- Institute of Crop Sciences (ICS), Chinese Academy of Agricultural Sciences, Beijing, China
| | - Jean-Christophe Glaszmann
- CIRAD, UMR AGAP, Montpellier, France
- Amélioration Génétique et Adaptation des Plantes méditerranéennes et tropicales (AGAP), Université de Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Fabio Pardi
- Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM), Université de Montpellier, CNRS, Montpellier, France
| | - Celine Scornavacca
- Institut des Sciences de l’Evolution (ISEM), Université de Montpellier, CNRS, EPHE, IRD, Montpellier, France
| |
Collapse
|
40
|
Abstract
The repeated adaptation of oceanic threespine sticklebacks to fresh water has made it a premier organism to study parallel evolution. These small fish have multiple distinct ecotypes that display a wide range of diverse phenotypic traits. Ecotypes are easily crossed in the laboratory, and families are large and develop quickly enough for quantitative trait locus analyses, positioning the threespine stickleback as a versatile model organism to address a wide range of biological questions. Extensive genomic resources, including linkage maps, a high-quality reference genome, and developmental genetics tools have led to insights into the genomic basis of adaptation and the identification of genomic changes controlling traits in vertebrates. Recently, threespine sticklebacks have been used as a model system to identify the genomic basis of highly complex traits, such as behavior and host-microbiome and host-parasite interactions. We review the latest findings and new avenues of research that have led the threespine stickleback to be considered a supermodel of evolutionary genomics.
Collapse
Affiliation(s)
- Kerry Reid
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York 11794, USA;
| | - Michael A Bell
- University of California Museum of Paleontology, Berkeley, California 94720, USA
| | - Krishna R Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York 11794, USA;
| |
Collapse
|
41
|
Madgwick PG, Kanitz R. Evolution of resistance under alternative models of selective interference. J Evol Biol 2021; 34:1608-1623. [PMID: 34449949 PMCID: PMC9293239 DOI: 10.1111/jeb.13919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 07/28/2021] [Accepted: 08/19/2021] [Indexed: 01/19/2023]
Abstract
The use of multiple pesticides or drugs can lead to a simultaneous selection pressure for resistance alleles at different loci. Models of resistance evolution focus on how this can delay the spread of resistance through a population, but often neglect how this can also reduce the probability that a resistance allele spreads. This neglected factor has been studied in a parallel literature as selective interference. Models of interference use alternative constructions of fitness, where selection coefficients from different loci either add or multiply. Although these are equivalent under weak selection, the two constructions make alternative predictions under the strong selection that characterizes resistance evolution. Here, simulations are used to examine the effects of interference on the probability of fixation and time to fixation of a new and strongly beneficial mutation in the presence of another strongly beneficial allele with variable starting frequency. The results from simulations show a complicated pattern of effects. The key result is that, under multiplicativity, the presence of the strongly beneficial allele leads to a small reduction in the probability of fixation for the new beneficial mutation up to ~10%, and a negligible increase in the average time to fixation up to ~2%, whereas under additivity, the effect is more substantial at up to ~50% for the probability of fixation and ~100% for the average time to fixation. Consequently, the effect of interference is only an important feature of resistance evolution under additivity. Current evidence from studies of experimental evolution provides widespread support for the basic features of additivity, which suggests that interference may afford resistance a different pattern of evolution than other adaptations: rather than the gradual and simultaneous selection of many alleles with small effects, the rapid evolution of resistance may involve the sequential selection of alleles with large effects.
Collapse
Affiliation(s)
- Philip G Madgwick
- Syngenta, Jealott's Hill International Research Centre, Bracknell, UK
| | | |
Collapse
|
42
|
Buffalo V. Quantifying the relationship between genetic diversity and population size suggests natural selection cannot explain Lewontin's Paradox. eLife 2021; 10:e67509. [PMID: 34409937 PMCID: PMC8486380 DOI: 10.7554/elife.67509] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 08/16/2021] [Indexed: 12/21/2022] Open
Abstract
Neutral theory predicts that genetic diversity increases with population size, yet observed levels of diversity across metazoans vary only two orders of magnitude while population sizes vary over several. This unexpectedly narrow range of diversity is known as Lewontin's Paradox of Variation (1974). While some have suggested selection constrains diversity, tests of this hypothesis seem to fall short. Here, I revisit Lewontin's Paradox to assess whether current models of linked selection are capable of reducing diversity to this extent. To quantify the discrepancy between pairwise diversity and census population sizes across species, I combine previously-published estimates of pairwise diversity from 172 metazoan taxa with newly derived estimates of census sizes. Using phylogenetic comparative methods, I show this relationship is significant accounting for phylogeny, but with high phylogenetic signal and evidence that some lineages experience shifts in the evolutionary rate of diversity deep in the past. Additionally, I find a negative relationship between recombination map length and census size, suggesting abundant species have less recombination and experience greater reductions in diversity due to linked selection. However, I show that even assuming strong and abundant selection, models of linked selection are unlikely to explain the observed relationship between diversity and census sizes across species.
Collapse
Affiliation(s)
- Vince Buffalo
- Institute for Ecology and Evolution, University of OregonEugeneUnited States
| |
Collapse
|
43
|
Skead K, Ang Houle A, Abelson S, Agbessi M, Bruat V, Lin B, Soave D, Shlush L, Wright S, Dick J, Morris Q, Awadalla P. Interacting evolutionary pressures drive mutation dynamics and health outcomes in aging blood. Nat Commun 2021; 12:4921. [PMID: 34389724 PMCID: PMC8363714 DOI: 10.1038/s41467-021-25172-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 07/27/2021] [Indexed: 01/10/2023] Open
Abstract
Age-related clonal hematopoiesis (ARCH) is characterized by age-associated accumulation of somatic mutations in hematopoietic stem cells (HSCs) or their pluripotent descendants. HSCs harboring driver mutations will be positively selected and cells carrying these mutations will rise in frequency. While ARCH is a known risk factor for blood malignancies, such as Acute Myeloid Leukemia (AML), why some people who harbor ARCH driver mutations do not progress to AML remains unclear. Here, we model the interaction of positive and negative selection in deeply sequenced blood samples from individuals who subsequently progressed to AML, compared to healthy controls, using deep learning and population genetics. Our modeling allows us to discriminate amongst evolutionary classes with high accuracy and captures signatures of purifying selection in most individuals. Purifying selection, acting on benign or mildly damaging passenger mutations, appears to play a critical role in preventing disease-predisposing clones from rising to dominance and is associated with longer disease-free survival. Through exploring a range of evolutionary models, we show how different classes of selection shape clonal dynamics and health outcomes thus enabling us to better identify individuals at a high risk of malignancy.
Collapse
Affiliation(s)
- Kimberly Skead
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Armande Ang Houle
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Sagi Abelson
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | | | - Vanessa Bruat
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Boxi Lin
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - David Soave
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Mathematics, Wilfrid Laurier University, Waterloo, ON, Canada
| | - Liran Shlush
- Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
| | - Stephen Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - John Dick
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Princess Margaret Cancer Centre, Toronto, ON, Canada
| | - Quaid Morris
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada.
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, United States.
| | - Philip Awadalla
- Ontario Institute for Cancer Research, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
44
|
Zeng L, Liu HQ, Tu XL, Ji CM, Gou X, Esmailizadeh A, Wang S, Wang MS, Wang MC, Li XL, Charati H, Adeola AC, Moshood Adedokun RA, Oladipo O, Olaogun SC, Sanke OJ, Godwin F M, Cecily Ommeh S, Agwanda B, Kasiiti Lichoti J, Han JL, Zheng HK, Wang CF, Zhang YP, Frantz LAF, Wu DD. Genomes reveal selective sweeps in kiang and donkey for high-altitude adaptation. Zool Res 2021; 42:450-460. [PMID: 34156172 PMCID: PMC8317180 DOI: 10.24272/j.issn.2095-8137.2021.095] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Over the last several hundred years, donkeys have adapted to high-altitude conditions on the Tibetan Plateau. Interestingly, the kiang, a closely related equid species, also inhabits this region. Previous reports have demonstrated the importance of specific genes and adaptive introgression in divergent lineages for adaptation to hypoxic conditions on the Tibetan Plateau. Here, we assessed whether donkeys and kiangs adapted to the Tibetan Plateau via the same or different biological pathways and whether adaptive introgression has occurred. We assembled a de novo genome from a kiang individual and analyzed the genomes of five kiangs and 93 donkeys (including 24 from the Tibetan Plateau). Our analyses suggested the existence of a strong hard selective sweep at the EPAS1 locus in kiangs. In Tibetan donkeys, however, another gene, i.e., EGLN1, was likely involved in their adaptation to high altitude. In addition, admixture analysis found no evidence for interspecific gene flow between kiangs and Tibetan donkeys. Our findings indicate that despite the short evolutionary time scale since the arrival of donkeys on the Tibetan Plateau, as well as the existence of a closely related species already adapted to hypoxia, Tibetan donkeys did not acquire adaptation via admixture but instead evolved adaptations via a different biological pathway.
Collapse
Affiliation(s)
- Lin Zeng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan 650204, China
| | - He-Qun Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, NY 10461, USA
| | - Xiao-Long Tu
- Annoroad Gene Tech. (Beijing) Co., Ltd., Beijing 100176, China
| | - Chang-Mian Ji
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, Hainan 571101, China.,Biomarker Technologies Corporation, Beijing 101300, China
| | - Xiao Gou
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Ali Esmailizadeh
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, Kerman, PB 76169-133, Iran
| | - Sheng Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Ming-Shan Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | | | - Xiao-Long Li
- Biomarker Technologies Corporation, Beijing 101300, China
| | - Hadi Charati
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan 650204, China
| | - Adeniyi C Adeola
- State Key Laboratory of Genetic Resources and Evolution, Yunnan Laboratory of Molecular Biology of Domestic Animals, Germplasm Bank of Wild Species, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Sino-Africa Joint Research Center, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | | | - Olatunbosun Oladipo
- Federal College of Animal Health and Production Technology, Moor-Plantation, Ibadan, Nigeria
| | | | - Oscar J Sanke
- Taraba State Ministry of Agriculture and Natural Resources, Jalingo 660221, Nigeria
| | | | - Sheila Cecily Ommeh
- Institute For Biotechnology Research Jomo Kenyatta University of Agriculture and Technology, Nairobi 62000-00200, Kenya.,Department of Zoology, National Museums of Kenya, Nairobi 40658-00100, Kenya
| | - Bernard Agwanda
- Department of Zoology, National Museums of Kenya, Nairobi 40658-00100, Kenya
| | - Jacqueline Kasiiti Lichoti
- State Department of Livestock, Ministry of Agriculture, Livestock, Fisheries and Irrigation, Nairobi, Kenya
| | - Jian-Lin Han
- CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China
| | - Hong-Kun Zheng
- Biomarker Technologies Corporation, Beijing 101300, China
| | - Chang-Fa Wang
- Equus Laboratory, Dairy Cattle Research Center, Shandong Academy of Agricultural Sciences, Ji'nan, Shandong 250131, China.,Liaocheng Research Institute of Donkey High-Efficiency Breeding and Ecological Feeding, Liaocheng University, Liaocheng, Shandong 252059, China. E-mail:
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China. E-mail:
| | - Laurent A F Frantz
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK. E-mail:
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan 650223, China.,Institute of Three-River-Source National Park, Chinese Academy of Sciences, Qinghai 810008, China. E-mail:
| |
Collapse
|
45
|
Guiblet WM, DeGiorgio M, Cheng X, Chiaromonte F, Eckert KA, Huang YF, Makova KD. Selection and thermostability suggest G-quadruplexes are novel functional elements of the human genome. Genome Res 2021; 31:1136-1149. [PMID: 34187812 PMCID: PMC8256861 DOI: 10.1101/gr.269589.120] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 05/24/2021] [Indexed: 12/11/2022]
Abstract
Approximately 1% of the human genome has the ability to fold into G-quadruplexes (G4s)-noncanonical strand-specific DNA structures forming at G-rich motifs. G4s regulate several key cellular processes (e.g., transcription) and have been hypothesized to participate in others (e.g., firing of replication origins). Moreover, G4s differ in their thermostability, and this may affect their function. Yet, G4s may also hinder replication, transcription, and translation and may increase genome instability and mutation rates. Therefore, depending on their genomic location, thermostability, and functionality, G4 loci might evolve under different selective pressures, which has never been investigated. Here we conducted the first genome-wide analysis of G4 distribution, thermostability, and selection. We found an overrepresentation, high thermostability, and purifying selection for G4s within genic components in which they are expected to be functional-promoters, CpG islands, and 5' and 3' UTRs. A similar pattern was observed for G4s within replication origins, enhancers, eQTLs, and TAD boundary regions, strongly suggesting their functionality. In contrast, G4s on the nontranscribed strand of exons were underrepresented, were unstable, and evolved neutrally. In general, G4s on the nontranscribed strand of genic components had lower density and were less stable than those on the transcribed strand, suggesting that the former are avoided at the RNA level. Across the genome, purifying selection was stronger at stable G4s. Our results suggest that purifying selection preserves the sequences of functional G4s, whereas nonfunctional G4s are too costly to be tolerated in the genome. Thus, G4s are emerging as fundamental, functional genomic elements.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, University Park, Pennsylvania 16802, USA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida 33431, USA
| | - Xiaoheng Cheng
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
- Sant'Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Kristin A Eckert
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
- Department of Pathology, Penn State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Yi-Fei Huang
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, Pennsylvania 16802, USA
| |
Collapse
|
46
|
Rowan TN, Durbin HJ, Seabury CM, Schnabel RD, Decker JE. Powerful detection of polygenic selection and evidence of environmental adaptation in US beef cattle. PLoS Genet 2021; 17:e1009652. [PMID: 34292938 PMCID: PMC8297814 DOI: 10.1371/journal.pgen.1009652] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 06/09/2021] [Indexed: 12/19/2022] Open
Abstract
Selection on complex traits can rapidly drive evolution, especially in stressful environments. This polygenic selection does not leave intense sweep signatures on the genome, rather many loci experience small allele frequency shifts, resulting in large cumulative phenotypic changes. Directional selection and local adaptation are changing populations; but, identifying loci underlying polygenic or environmental selection has been difficult. We use genomic data on tens of thousands of cattle from three populations, distributed over time and landscapes, in linear mixed models with novel dependent variables to map signatures of selection on complex traits and local adaptation. We identify 207 genomic loci associated with an animal's birth date, representing ongoing selection for monogenic and polygenic traits. Additionally, hundreds of additional loci are associated with continuous and discrete environments, providing evidence for historical local adaptation. These candidate loci highlight the nervous system's possible role in local adaptation. While advanced technologies have increased the rate of directional selection in cattle, it has likely been at the expense of historically generated local adaptation, which is especially problematic in changing climates. When applied to large, diverse cattle datasets, these selection mapping methods provide an insight into how selection on complex traits continually shapes the genome. Further, understanding the genomic loci implicated in adaptation may help us breed more adapted and efficient cattle, and begin to understand the basis for mammalian adaptation, especially in changing climates. These selection mapping approaches help clarify selective forces and loci in evolutionary, model, and agricultural contexts.
Collapse
Affiliation(s)
- Troy N. Rowan
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, United States of America
- Genetics Area Program, University of Missouri, Columbia, Missouri, United States of America
- Department of Animal Science, University of Tennessee, Knoxville, Tennessee, United States of America
- College of Veterinary Medicine, Large Animal Clinical Science, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Harly J. Durbin
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, United States of America
- Genetics Area Program, University of Missouri, Columbia, Missouri, United States of America
| | - Christopher M. Seabury
- Department of Veterinary Pathobiology, Texas A&M University, College Station, Texas, United States of America
| | - Robert D. Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, United States of America
- Genetics Area Program, University of Missouri, Columbia, Missouri, United States of America
- Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, United States of America
| | - Jared E. Decker
- Division of Animal Sciences, University of Missouri, Columbia, Missouri, United States of America
- Genetics Area Program, University of Missouri, Columbia, Missouri, United States of America
- Institute for Data Science and Informatics, University of Missouri, Columbia, Missouri, United States of America
| |
Collapse
|
47
|
Menardo F, Gagneux S, Freund F. Multiple Merger Genealogies in Outbreaks of Mycobacterium tuberculosis. Mol Biol Evol 2021; 38:290-306. [PMID: 32667991 PMCID: PMC8480183 DOI: 10.1093/molbev/msaa179] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The Kingman coalescent and its developments are often considered among the most important advances in population genetics of the last decades. Demographic inference based on coalescent theory has been used to reconstruct the population dynamics and evolutionary history of several species, including Mycobacterium tuberculosis (MTB), an important human pathogen causing tuberculosis. One key assumption of the Kingman coalescent is that the number of descendants of different individuals does not vary strongly, and violating this assumption could lead to severe biases caused by model misspecification. Individual lineages of MTB are expected to vary strongly in reproductive success because 1) MTB is potentially under constant selection due to the pressure of the host immune system and of antibiotic treatment, 2) MTB undergoes repeated population bottlenecks when it transmits from one host to the next, and 3) some hosts show much higher transmission rates compared with the average (superspreaders). Here, we used an approximate Bayesian computation approach to test whether multiple-merger coalescents (MMC), a class of models that allow for large variation in reproductive success among lineages, are more appropriate models to study MTB populations. We considered 11 publicly available whole-genome sequence data sets sampled from local MTB populations and outbreaks and found that MMC had a better fit compared with the Kingman coalescent for 10 of the 11 data sets. These results indicate that the null model for analyzing MTB outbreaks should be reassessed and that past findings based on the Kingman coalescent need to be revisited.
Collapse
Affiliation(s)
- Fabrizio Menardo
- Department of Medical Parasitology and Infection Biology, Swiss Tropical and Public Health Institute, Basel, Switzerland.,University of Basel, Basel, Switzerland
| | - Sébastien Gagneux
- Department of Medical Parasitology and Infection Biology, Swiss Tropical and Public Health Institute, Basel, Switzerland.,University of Basel, Basel, Switzerland
| | - Fabian Freund
- Department of Plant Biodiversity and Breeding Informatics, Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
| |
Collapse
|
48
|
Roberts Kingman GA, Vyas DN, Jones FC, Brady SD, Chen HI, Reid K, Milhaven M, Bertino TS, Aguirre WE, Heins DC, von Hippel FA, Park PJ, Kirch M, Absher DM, Myers RM, Di Palma F, Bell MA, Kingsley DM, Veeramah KR. Predicting future from past: The genomic basis of recurrent and rapid stickleback evolution. SCIENCE ADVANCES 2021; 7:7/25/eabg5285. [PMID: 34144992 PMCID: PMC8213234 DOI: 10.1126/sciadv.abg5285] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 05/05/2021] [Indexed: 05/30/2023]
Abstract
Similar forms often evolve repeatedly in nature, raising long-standing questions about the underlying mechanisms. Here, we use repeated evolution in stickleback to identify a large set of genomic loci that change recurrently during colonization of freshwater habitats by marine fish. The same loci used repeatedly in extant populations also show rapid allele frequency changes when new freshwater populations are experimentally established from marine ancestors. Marked genotypic and phenotypic changes arise within 5 years, facilitated by standing genetic variation and linkage between adaptive regions. Both the speed and location of changes can be predicted using empirical observations of recurrence in natural populations or fundamental genomic features like allelic age, recombination rates, density of divergent loci, and overlap with mapped traits. A composite model trained on these stickleback features can also predict the location of key evolutionary loci in Darwin's finches, suggesting that similar features are important for evolution across diverse taxa.
Collapse
Affiliation(s)
- Garrett A Roberts Kingman
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305-5329, USA
| | - Deven N Vyas
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA
| | - Felicity C Jones
- Friedrich Miescher Laboratory of the Max Planck Society, Max-Planck-Ring, Tübingen, Germany
| | - Shannon D Brady
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305-5329, USA
| | - Heidi I Chen
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305-5329, USA
| | - Kerry Reid
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA
| | - Mark Milhaven
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA
- School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA
| | - Thomas S Bertino
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA
| | - Windsor E Aguirre
- Department of Biological Sciences, DePaul University, Chicago, IL 60614-3207, USA
| | - David C Heins
- Department of Ecology and Evolutionary Biology, Tulane University, New Orleans, LA 70118, USA
| | - Frank A von Hippel
- Department of Community, Environment and Policy, Mel & Enid Zuckerman College of Public Health, University of Arizona, Tucson, AZ 85724, USA
| | - Peter J Park
- Department of Biology, Farmingdale State College, Farmingdale, NY 11735-1021, USA
| | - Melanie Kirch
- Friedrich Miescher Laboratory of the Max Planck Society, Max-Planck-Ring, Tübingen, Germany
| | - Devin M Absher
- HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, AL 35806, USA
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, AL 35806, USA
| | - Federica Di Palma
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA
| | - Michael A Bell
- University of California Museum of Paleontology, University of California, Berkeley, Berkeley, CA 94720, USA.
| | - David M Kingsley
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305-5329, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Krishna R Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA.
| |
Collapse
|
49
|
Harris AM, DeGiorgio M. A Likelihood Approach for Uncovering Selective Sweep Signatures from Haplotype Data. Mol Biol Evol 2021; 37:3023-3046. [PMID: 32392293 PMCID: PMC7530616 DOI: 10.1093/molbev/msaa115] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Selective sweeps are frequent and varied signatures in the genomes of natural populations, and detecting them is consequently important in understanding mechanisms of adaptation by natural selection. Following a selective sweep, haplotypic diversity surrounding the site under selection decreases, and this deviation from the background pattern of variation can be applied to identify sweeps. Multiple methods exist to locate selective sweeps in the genome from haplotype data, but none leverages the power of a model-based approach to make their inference. Here, we propose a likelihood ratio test statistic T to probe whole-genome polymorphism data sets for selective sweep signatures. Our framework uses a simple but powerful model of haplotype frequency spectrum distortion to find sweeps and additionally make an inference on the number of presently sweeping haplotypes in a population. We found that the T statistic is suitable for detecting both hard and soft sweeps across a variety of demographic models, selection strengths, and ages of the beneficial allele. Accordingly, we applied the T statistic to variant calls from European and sub-Saharan African human populations, yielding primarily literature-supported candidates, including LCT, RSPH3, and ZNF211 in CEU, SYT1, RGS18, and NNT in YRI, and HLA genes in both populations. We also searched for sweep signatures in Drosophila melanogaster, finding expected candidates at Ace, Uhg1, and Pimet. Finally, we provide open-source software to compute the T statistic and the inferred number of presently sweeping haplotypes from whole-genome data.
Collapse
Affiliation(s)
- Alexandre M Harris
- Department of Biology, Pennsylvania State University, University Park, PA.,Molecular, Cellular, and Integrative Biosciences, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL
| |
Collapse
|
50
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|