1
|
Daigle A, Johri P. Hill-Robertson interference may bias the inference of fitness effects of new mutations in highly selfing species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.06.579142. [PMID: 38370745 PMCID: PMC10871249 DOI: 10.1101/2024.02.06.579142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
The accurate estimation of the distribution of fitness effects (DFE) of new mutations is critical for population genetic inference but remains a challenging task. While various methods have been developed for DFE inference using the site frequency spectrum of putatively neutral and selected sites, their applicability in species with diverse life history traits and complex demographic scenarios is not well understood. Selfing is common among eukaryotic species and can lead to decreased effective recombination rates, increasing the effects of selection at linked sites, including interference between selected alleles. We employ forward simulations to investigate the limitations of current DFE estimation approaches in the presence of selfing and other model violations, such as linkage, departures from semidominance, population structure, and uneven sampling. We find that distortions of the site frequency spectrum due to Hill-Robertson interference in highly selfing populations lead to mis-inference of the deleterious DFE of new mutations. Specifically, when inferring the distribution of selection coefficients, there is an overestimation of nearly neutral and strongly deleterious mutations and an underestimation of mildly deleterious mutations when interference between selected alleles is pervasive. In addition, the presence of cryptic population structure with low rates of migration and uneven sampling across subpopulations leads to the false inference of a deleterious DFE skewed towards effectively neutral/mildly deleterious mutations. Finally, the proportion of adaptive substitutions estimated at high rates of selfing is substantially overestimated. Our observations apply broadly to species and genomic regions with little/no recombination and where interference might be pervasive.
Collapse
Affiliation(s)
- Austin Daigle
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, NC 27599
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599
- Integrative Program for Biological & Genome Sciences, University of North Carolina, Chapel Hill, NC 27599
| |
Collapse
|
2
|
Kaushik S, Jain K, Johri P. Genetic diversity during selective sweeps in non-recombining populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.12.612756. [PMID: 39345399 PMCID: PMC11429930 DOI: 10.1101/2024.09.12.612756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/01/2024]
Abstract
Selective sweeps, resulting from the spread of beneficial, neutral, or deleterious mutations through a population, shape patterns of genetic variation at linked neutral sites. While many theoretical, computational, and statistical advances have been made in understanding the genomic signatures of selective sweeps in recombining populations, substantially less is understood in populations with little/no recombination. We present a mathematical framework based on diffusion theory for obtaining the site frequency spectrum (SFS) at linked neutral sites immediately post and during the fixation of moderately or strongly beneficial mutations. We find that when a single hard sweep occurs, the SFS decays as 1/ x for low derived allele frequencies ( x ), similar to the neutral SFS at equilibrium, whereas at higher derived allele frequencies, it follows a 1/ x 2 power law. These power laws are universal in the sense that they are independent of the dominance and inbreeding coefficient, and also characterize the SFS during the sweep. Additionally, we find that the derived allele frequency where the SFS shifts from the 1/ x to 1/ x 2 law, is inversely proportional to the selection strength: thus under strong selection, the SFS follows the 1/ x 2 dependence for most allele frequencies, resembling a rapidly expanding neutral population. When clonal interference is pervasive, the SFS immediately post-fixation becomes U-shaped and is better explained by the equilibrium SFS of selected sites. Our results will be important in developing statistical methods to infer the timing and strength of recent selective sweeps in asexual populations, genomic regions that lack recombination, and clonally propagating tumor populations.
Collapse
|
3
|
Sellinger T, Johannes F, Tellier A. Improved inference of population histories by integrating genomic and epigenomic data. eLife 2024; 12:RP89470. [PMID: 39264367 PMCID: PMC11392530 DOI: 10.7554/elife.89470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024] Open
Abstract
With the availability of high-quality full genome polymorphism (SNPs) data, it becomes feasible to study the past demographic and selective history of populations in exquisite detail. However, such inferences still suffer from a lack of statistical resolution for recent, for example bottlenecks, events, and/or for populations with small nucleotide diversity. Additional heritable (epi)genetic markers, such as indels, transposable elements, microsatellites, or cytosine methylation, may provide further, yet untapped, information on the recent past population history. We extend the Sequential Markovian Coalescent (SMC) framework to jointly use SNPs and other hyper-mutable markers. We are able to (1) improve the accuracy of demographic inference in recent times, (2) uncover past demographic events hidden to SNP-based inference methods, and (3) infer the hyper-mutable marker mutation rates under a finite site model. As a proof of principle, we focus on demographic inference in Arabidopsis thaliana using DNA methylation diversity data from 10 European natural accessions. We demonstrate that segregating single methylated polymorphisms (SMPs) satisfy the modeling assumptions of the SMC framework, while differentially methylated regions (DMRs) are not suitable as their length exceeds that of the genomic distance between two recombination events. Combining SNPs and SMPs while accounting for site- and region-level epimutation processes, we provide new estimates of the glacial age bottleneck and post-glacial population expansion of the European A. thaliana population. Our SMC framework readily accounts for a wide range of heritable genomic markers, thus paving the way for next-generation inference of evolutionary history by combining information from several genetic and epigenetic markers.
Collapse
Affiliation(s)
- Thibaut Sellinger
- Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Munich, Germany
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
| | - Frank Johannes
- Professorship for Plant Epigenomics, Department of Molecular Life Sciences, Technical University of Munich, Freising, Germany
| | - Aurélien Tellier
- Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Munich, Germany
| |
Collapse
|
4
|
Roberts M, Josephs EB. Previously unmeasured genetic diversity explains part of Lewontin's paradox in a k -mer-based meta-analysis of 112 plant species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.17.594778. [PMID: 38798362 PMCID: PMC11118579 DOI: 10.1101/2024.05.17.594778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
At the molecular level, most evolution is expected to be neutral. A key prediction of this expectation is that the level of genetic diversity in a population should scale with population size. However, as was noted by Richard Lewontin in 1974 and reaffirmed by later studies, the slope of the population size-diversity relationship in nature is much weaker than expected under neutral theory. We hypothesize that one contributor to this paradox is that current methods relying on single nucleotide polymorphisms (SNPs) called from aligning short reads to a reference genome underestimate levels of genetic diversity in many species. To test this idea, we calculated nucleotide diversity ( π ) and k -mer-based metrics of genetic diversity across 112 plant species, amounting to over 205 terabases of DNA sequencing data from 27,488 individual plants. We then compared how these different metrics correlated with proxies of population size that account for both range size and population density variation across species. We found that our population size proxies scaled anywhere from about 3 to over 20 times faster with k -mer diversity than nucleotide diversity after adjusting for evolutionary history, mating system, life cycle habit, cultivation status, and invasiveness. The relationship between k -mer diversity and population size proxies also remains significant after correcting for genome size, whereas the analogous relationship for nucleotide diversity does not. These results suggest that variation not captured by common SNP-based analyses explains part of Lewontin's paradox in plants.
Collapse
Affiliation(s)
- Miles Roberts
- Genetics and Genome Sciences Program, Michigan State University, East Lansing MI
| | - Emily B. Josephs
- Department of Plant Biology, Michigan State University, East Lansing, MI
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI
- Plant Resilience Institute, Michigan State University, East Lansing, MI
| |
Collapse
|
5
|
Dabi A, Schrider DR. Population size rescaling significantly biases outcomes of forward-in-time population genetic simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.07.588318. [PMID: 38645049 PMCID: PMC11030438 DOI: 10.1101/2024.04.07.588318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Simulations are an essential tool in all areas of population genetic research, used in tasks such as the validation of theoretical analysis and the study of complex evolutionary models. Forward-in-time simulations are especially flexible, allowing for various types of natural selection, complex genetic architectures, and non-Wright-Fisher dynamics. However, their intense computational requirements can be prohibitive to simulating large populations and genomes. A popular method to alleviate this burden is to scale down the population size by some scaling factor while scaling up the mutation rate, selection coefficients, and recombination rate by the same factor. However, this rescaling approach may in some cases bias simulation results. To investigate the manner and degree to which rescaling impacts simulation outcomes, we carried out simulations with different demographic histories and distributions of fitness effects using several values of the rescaling factor, Q , and compared the deviation of key outcomes (fixation times, allele frequencies, linkage disequilibrium, and the fraction of mutations that fix during the simulation) between the scaled and unscaled simulations. Our results indicate that scaling introduces substantial biases to each of these measured outcomes, even at small values of Q . Moreover, the nature of these effects depends on the evolutionary model and scaling factor being examined. While increasing the scaling factor tends to increase the observed biases, this relationship is not always straightforward, thus it may be difficult to know the impact of scaling on simulation outcomes a priori. However, it appears that for most models, only a small number of replicates was needed to accurately quantify the bias produced by rescaling for a given Q . In summary, while rescaling forward-in-time simulations may be necessary in many cases, researchers should be aware of the rescaling procedure's impact on simulation outcomes and consider investigating its magnitude in smaller scale simulations of the desired model(s) before selecting an appropriate value of Q .
Collapse
Affiliation(s)
- Amjad Dabi
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Daniel R. Schrider
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
6
|
Simon NM, Kim Y, Bautista DM, Dutton JR, Brem RB. Stem cell transcriptional profiles from mouse subspecies reveal cis-regulatory evolution at translation genes. Heredity (Edinb) 2024:10.1038/s41437-024-00715-z. [PMID: 39164520 DOI: 10.1038/s41437-024-00715-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 08/06/2024] [Accepted: 08/08/2024] [Indexed: 08/22/2024] Open
Abstract
A key goal of evolutionary genomics is to harness molecular data to draw inferences about selective forces that have acted on genomes. The field progresses in large part through the development of advanced molecular-evolution analysis methods. Here we explored the intersection between classical sequence-based tests for selection and an empirical expression-based approach, using stem cells from Mus musculus subspecies as a model. Using a test of directional, cis-regulatory evolution across genes in pathways, we discovered a unique program of induction of translation genes in stem cells of the Southeast Asian mouse M. m. castaneus relative to its sister taxa. We then mined population-genomic sequences to pursue underlying regulatory mechanisms for this expression divergence, finding robust evidence for alleles unique to M. m. castaneus at the upstream regions of the translation genes. We interpret our data under a model of changes in lineage-specific pressures across Mus musculus in stem cells with high translational capacity. Our findings underscore the rigor of integrating expression and sequence-based methods to generate hypotheses about evolutionary events from long ago.
Collapse
Affiliation(s)
- Noah M Simon
- Biology of Aging Doctoral Program, Leonard Davis School of Gerontology, University of Southern California, Los Angeles, CA, 90089, USA
- Buck Institute for Research on Aging, Novato, CA, 94945, USA
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, 94720, USA
| | - Yujin Kim
- Stem Cell Institute, University of Minnesota, Minneapolis, MN, 55455, USA
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Diana M Bautista
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, 94720, USA
| | - James R Dutton
- Stem Cell Institute, University of Minnesota, Minneapolis, MN, 55455, USA
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Rachel B Brem
- Buck Institute for Research on Aging, Novato, CA, 94945, USA.
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, 94720, USA.
| |
Collapse
|
7
|
Smith ML, Hahn MW. Selection leads to false inferences of introgression using popular methods. Genetics 2024; 227:iyae089. [PMID: 38805070 DOI: 10.1093/genetics/iyae089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2023] [Revised: 10/28/2023] [Accepted: 05/21/2024] [Indexed: 05/29/2024] Open
Abstract
Detecting introgression between closely related populations or species is a fundamental objective in evolutionary biology. Existing methods for detecting migration and inferring migration rates from population genetic data often assume a neutral model of evolution. Growing evidence of the pervasive impact of selection on large portions of the genome across diverse taxa suggests that this assumption is unrealistic in most empirical systems. Further, ignoring selection has previously been shown to negatively impact demographic inferences (e.g. of population size histories). However, the impacts of biologically realistic selection on inferences of migration remain poorly explored. Here, we simulate data under models of background selection, selective sweeps, balancing selection, and adaptive introgression. We show that ignoring selection sometimes leads to false inferences of migration in popularly used methods that rely on the site frequency spectrum. Specifically, balancing selection and some models of background selection result in the rejection of isolation-only models in favor of isolation-with-migration models and lead to elevated estimates of migration rates. BPP, a method that analyzes sequence data directly, showed false positives for all conditions at recent divergence times, but balancing selection also led to false positives at medium-divergence times. Our results suggest that such methods may be unreliable in some empirical systems, such that new methods that are robust to selection need to be developed.
Collapse
Affiliation(s)
- Megan L Smith
- Department of Biological Sciences, Mississippi State University, Starkville, MS 39762, USA
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
8
|
Modica A, Lalagüe H, Muratorio S, Scotti I. Rolling down that mountain: microgeographical adaptive divergence during a fast population expansion along a steep environmental gradient in European beech. Heredity (Edinb) 2024; 133:99-112. [PMID: 38890557 PMCID: PMC11286953 DOI: 10.1038/s41437-024-00696-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 05/23/2024] [Accepted: 05/23/2024] [Indexed: 06/20/2024] Open
Abstract
Forest tree populations harbour high genetic diversity thanks to large effective population sizes and strong gene flow, allowing them to diversify through adaptation to local environmental pressures within dispersal distance. Many tree populations also experienced historical demographic fluctuations, including spatial population contraction or expansions at various temporal scales, which may constrain their ability to adapt to environmental variations. Our aim is to investigate how recent contraction and expansion events interfere with local adaptation, by studying patterns of adaptive divergence between closely related stands undergoing environmentally contrasted conditions, and having or not recently expanded. To investigate genome-wide signatures of local adaptation while accounting for demography, we analysed divergence in a European beech population by testing pairwise differentiation among four tree stands at ~35k Single Nucleotide Polymorphisms from ~9k genomic regions. We applied three divergence outlier search methods resting on different assumptions and targeting either single SNPs or contiguous genomic regions, while accounting for the effect of population size variations on genetic divergence. We found 27 signals of selective signatures in 19 target regions. Putatively adaptive divergence involved all stand pairs. We retrieved signals both when comparing old-growth stands and recently colonised areas and when comparing stands within the old-growth area. Therefore, adaptive divergence processes have taken place both over short time spans, under strong environmental contrasts, and over short ecological gradients, in populations that have been stable in the long term. This suggests that standing genetic variation supports local, microgeographic divergence processes, which can maintain genetic diversity at the landscape level.
Collapse
Affiliation(s)
- Andrea Modica
- INRAE, URFM, 228, Route de l'Aérodrome, 84914, Avignon, France
| | - Hadrien Lalagüe
- INRAE, EcoFoG, Campus agronomique, 97310, Kourou, French Guiana
| | - Sylvie Muratorio
- INRAE, EcoBioP, 173, Route de Saint-Jean-de-Luz RD 918, 64310, Saint-Pée-sur-Nivelle, France
| | - Ivan Scotti
- INRAE, URFM, 228, Route de l'Aérodrome, 84914, Avignon, France.
| |
Collapse
|
9
|
Simon NM, Kim Y, Bautista DM, Dutton JR, Brem RB. Stem cell transcriptional profiles from mouse subspecies reveal cis -regulatory evolution at translation genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.18.549406. [PMID: 37503246 PMCID: PMC10370129 DOI: 10.1101/2023.07.18.549406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
A key goal of evolutionary genomics is to harness molecular data to draw inferences about selective forces that have acted on genomes. The field progresses in large part through the development of advanced molecular-evolution analysis methods. Here we explored the intersection between classical sequence-based tests for selection and an empirical expression-based approach, using stem cells from Mus musculus subspecies as a model. Using a test of directional, cis -regulatory evolution across genes in pathways, we discovered a unique program of induction of translation genes in stem cells of the Southeast Asian mouse M. m. castaneus relative to its sister taxa. We then mined population-genomic sequences to pursue underlying regulatory mechanisms for this expression divergence, finding robust evidence for alleles unique to M. m. castaneus at the upstream regions of the translation genes. We interpret our data under a model of changes in lineage-specific pressures across Mus musculus in stem cells with high translational capacity. Our findings underscore the rigor of integrating expression and sequence-based methods to generate hypotheses about evolutionary events from long ago.
Collapse
|
10
|
Marsh JI, Johri P. Biases in ARG-Based Inference of Historical Population Size in Populations Experiencing Selection. Mol Biol Evol 2024; 41:msae118. [PMID: 38874402 PMCID: PMC11245712 DOI: 10.1093/molbev/msae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/05/2024] [Accepted: 06/11/2024] [Indexed: 06/15/2024] Open
Abstract
Inferring the demographic history of populations provides fundamental insights into species dynamics and is essential for developing a null model to accurately study selective processes. However, background selection and selective sweeps can produce genomic signatures at linked sites that mimic or mask signals associated with historical population size change. While the theoretical biases introduced by the linked effects of selection have been well established, it is unclear whether ancestral recombination graph (ARG)-based approaches to demographic inference in typical empirical analyses are susceptible to misinference due to these effects. To address this, we developed highly realistic forward simulations of human and Drosophila melanogaster populations, including empirically estimated variability of gene density, mutation rates, recombination rates, purifying, and positive selection, across different historical demographic scenarios, to broadly assess the impact of selection on demographic inference using a genealogy-based approach. Our results indicate that the linked effects of selection minimally impact demographic inference for human populations, although it could cause misinference in populations with similar genome architecture and population parameters experiencing more frequent recurrent sweeps. We found that accurate demographic inference of D. melanogaster populations by ARG-based methods is compromised by the presence of pervasive background selection alone, leading to spurious inferences of recent population expansion, which may be further worsened by recurrent sweeps, depending on the proportion and strength of beneficial mutations. Caution and additional testing with species-specific simulations are needed when inferring population history with non-human populations using ARG-based approaches to avoid misinference due to the linked effects of selection.
Collapse
Affiliation(s)
- Jacob I Marsh
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
11
|
Spurgin LG, Bosse M, Adriaensen F, Albayrak T, Barboutis C, Belda E, Bushuev A, Cecere JG, Charmantier A, Cichon M, Dingemanse NJ, Doligez B, Eeva T, Erikstad KE, Fedorov V, Griggio M, Heylen D, Hille S, Hinde CA, Ivankina E, Kempenaers B, Kerimov A, Krist M, Kvist L, Laine VN, Mänd R, Matthysen E, Nager R, Nikolov BP, Norte AC, Orell M, Ouyang J, Petrova-Dinkova G, Richner H, Rubolini D, Slagsvold T, Tilgar V, Török J, Tschirren B, Vágási CI, Yuta T, Groenen MAM, Visser ME, van Oers K, Sheldon BC, Slate J. The great tit HapMap project: A continental-scale analysis of genomic variation in a songbird. Mol Ecol Resour 2024; 24:e13969. [PMID: 38747336 DOI: 10.1111/1755-0998.13969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 04/10/2024] [Accepted: 04/29/2024] [Indexed: 06/04/2024]
Abstract
A major aim of evolutionary biology is to understand why patterns of genomic diversity vary within taxa and space. Large-scale genomic studies of widespread species are useful for studying how environment and demography shape patterns of genomic divergence. Here, we describe one of the most geographically comprehensive surveys of genomic variation in a wild vertebrate to date; the great tit (Parus major) HapMap project. We screened ca 500,000 SNP markers across 647 individuals from 29 populations, spanning ~30 degrees of latitude and 40 degrees of longitude - almost the entire geographical range of the European subspecies. Genome-wide variation was consistent with a recent colonisation across Europe from a South-East European refugium, with bottlenecks and reduced genetic diversity in island populations. Differentiation across the genome was highly heterogeneous, with clear 'islands of differentiation', even among populations with very low levels of genome-wide differentiation. Low local recombination rates were a strong predictor of high local genomic differentiation (FST), especially in island and peripheral mainland populations, suggesting that the interplay between genetic drift and recombination causes highly heterogeneous differentiation landscapes. We also detected genomic outlier regions that were confined to one or more peripheral great tit populations, probably as a result of recent directional selection at the species' range edges. Haplotype-based measures of selection were related to recombination rate, albeit less strongly, and highlighted population-specific sweeps that likely resulted from positive selection. Our study highlights how comprehensive screens of genomic variation in wild organisms can provide unique insights into spatio-temporal evolutionary dynamics.
Collapse
Affiliation(s)
- Lewis G Spurgin
- School of Biological Sciences, Norwich Research Park, University of East Anglia, Norwich, UK
- Department of Biology, Edward Grey Institute, University of Oxford, Oxford, UK
| | - Mirte Bosse
- Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, The Netherlands
- Department of Ecological Science, Animal Ecology Group, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Frank Adriaensen
- Evolutionary Ecology Group, Department of Biology, University of Antwerp, Antwerp, Belgium
| | - Tamer Albayrak
- Department of Biology, Science and art Faculty, Mehmet Akif Ersoy University, Istiklal Yerleskesi, Burdur, Turkey
- Biology Education, Buca Faculty of Education, Mathematics and Science Education, Dokuz Eylül University, İzmir, Turkey
| | | | - Eduardo Belda
- Institut d'Investigació per a la Gestió Integrada de Zones Costaneres, Campus de Gandia, Universitat Politècnica de València, València, Spain
| | - Andrey Bushuev
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Jacopo G Cecere
- Area Avifauna Migratrice, Istituto Superiore per la Protezione e la Ricerca Ambientale (ISPRA), Ozzano Emilia, Italy
| | | | - Mariusz Cichon
- Institute of Environmental Sciences, Jagiellonian University, Kraków, Poland
| | - Niels J Dingemanse
- Behavioural Ecology, Faculty of Biology, LMU München, Planegg-Martinsried, Germany
| | - Blandine Doligez
- UMR CNRS 5558-LBBE, Biométrie et Biologie Évolutive, Villeurbanne, France
- Department of Ecology and Evolution, Animal Ecology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Tapio Eeva
- Department of Biology, University of Turku, Turku, Finland
| | - Kjell Einar Erikstad
- Norwegian Institute for Nature Research, FRAM-High North Research Centre for Climate and the Environment, Tromsø, Norway
| | | | - Matteo Griggio
- Department of Biology, University of Padova, Padova, Italy
| | - Dieter Heylen
- Department of Biology, Edward Grey Institute, University of Oxford, Oxford, UK
- Evolutionary Ecology Group, Department of Biology, University of Antwerp, Antwerp, Belgium
- Interuniversity Institute for Biostatistics and Statistical Bioinformatics, Hasselt University, Diepenbeek, Belgium
| | - Sabine Hille
- Institute of Wildlife Biology and Game Management, University of Natural Resources and Life Science, Vienna, Austria
| | - Camilla A Hinde
- Behavioural Ecology Group, Department of Life Sciences, Anglia Ruskin University, Cambridgeshire, UK
| | - Elena Ivankina
- Faculty of Biology, Zvenigorod Biological Station, Lomonosov Moscow State University, Moscow, Russia
| | - Bart Kempenaers
- Department of Ornithology, Max Planck Institute for Biological Intelligence, Seewiesen, Germany
| | - Anvar Kerimov
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Milos Krist
- Department of Zoology, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Laura Kvist
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Veronika N Laine
- Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
- Zoology Unit, Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
| | - Raivo Mänd
- Department of Zoology, University of Tartu, Tartu, Estonia
| | - Erik Matthysen
- Evolutionary Ecology Group, Department of Biology, University of Antwerp, Antwerp, Belgium
| | - Ruedi Nager
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow, UK
| | - Boris P Nikolov
- Bulgarian Ornithological Centre, Institute of Biodiversity and Ecosystem Research, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Ana Claudia Norte
- MARE - Marine and Environmental Sciences Centre, Department of Life Sciences, Faculty of Sciences and Technology, University of Coimbra, Coimbra, Portugal
| | - Markku Orell
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | | | - Gergana Petrova-Dinkova
- Bulgarian Ornithological Centre, Institute of Biodiversity and Ecosystem Research, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Heinz Richner
- Evolutionary Ecology Lab, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Diego Rubolini
- Dipartimento di Scienze e Politiche Ambientali, Università Degli Studi di Milano, Milan, Italy
| | - Tore Slagsvold
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Vallo Tilgar
- Department of Zoology, University of Tartu, Tartu, Estonia
| | - János Török
- Behavioural Ecology Group, Department of Systematic Zoology and Ecology, ELTE Eötvös Loránd University, Budapest, Hungary
| | - Barbara Tschirren
- Centre for Ecology and Conservation, University of Exeter, Penryn, UK
| | - Csongor I Vágási
- Evolutionary Ecology Group, Hungarian Department of Biology and Ecology, Babeș-Bolyai University, Cluj-Napoca, Romania
| | - Teru Yuta
- Yamashina Institute for Ornithology, Abiko, Japan
| | - Martien A M Groenen
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, The Netherlands
| | - Marcel E Visser
- Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, The Netherlands
- Groningen Institute for Evolutionary Life Sciences (GELIFES), University of Groningen, Groningen, the Netherlands
| | - Kees van Oers
- Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
| | - Ben C Sheldon
- Department of Biology, Edward Grey Institute, University of Oxford, Oxford, UK
| | - Jon Slate
- School of Biosciences, University of Sheffield, Sheffield, UK
| |
Collapse
|
12
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (BETHESDA, MD.) 2024; 14:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum- and linkage disequilibrium-based methods under a variety of evolutionarily realistic null models. We find that whilst site frequency spectrum-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, linkage disequilibrium-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that site frequency spectrum-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst linkage disequilibrium-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g. population structure and admixture) as well as of alternative selective processes (e.g. partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
13
|
Soni V, Terbot JW, Jensen JD. Population genetic considerations regarding the interpretation of within-patient SARS-CoV-2 polymorphism data. Nat Commun 2024; 15:3240. [PMID: 38627371 PMCID: PMC11021480 DOI: 10.1038/s41467-024-46261-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/29/2024] [Indexed: 04/19/2024] Open
Affiliation(s)
- Vivak Soni
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA
| | - John W Terbot
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA
- Division of Biological Sciences, University of Montana, Missoula, MT, USA
| | - Jeffrey D Jensen
- Center for Evolution & Medicine, Arizona State University, School of Life Sciences, Tempe, AZ, USA.
| |
Collapse
|
14
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. G3 (BETHESDA, MD.) 2024; 14:jkae031. [PMID: 38365205 PMCID: PMC11090462 DOI: 10.1093/g3journal/jkae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/10/2023] [Accepted: 01/29/2024] [Indexed: 02/18/2024]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in nonmodel species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to nonmodel genomes. We apply ABC-MK to the human proteome and a set of known virus interacting proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| |
Collapse
|
15
|
Harris M, Kim BY, Garud N. Enrichment of hard sweeps on the X chromosome compared to autosomes in six Drosophila species. Genetics 2024; 226:iyae019. [PMID: 38366786 PMCID: PMC10990427 DOI: 10.1093/genetics/iyae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 01/17/2024] [Accepted: 01/18/2024] [Indexed: 02/18/2024] Open
Abstract
The X chromosome, being hemizygous in males, is exposed one-third of the time increasing the visibility of new mutations to natural selection, potentially leading to different evolutionary dynamics than autosomes. Recently, we found an enrichment of hard selective sweeps over soft selective sweeps on the X chromosome relative to the autosomes in a North American population of Drosophila melanogaster. To understand whether this enrichment is a universal feature of evolution on the X chromosome, we analyze diversity patterns across 6 commonly studied Drosophila species. We find an increased proportion of regions with steep reductions in diversity and elevated homozygosity on the X chromosome compared to autosomes. To assess if these signatures are consistent with positive selection, we simulate a wide variety of evolutionary scenarios spanning variations in demography, mutation rate, recombination rate, background selection, hard sweeps, and soft sweeps and find that the diversity patterns observed on the X are most consistent with hard sweeps. Our findings highlight the importance of sex chromosomes in driving evolutionary processes and suggest that hard sweeps have played a significant role in shaping diversity patterns on the X chromosome across multiple Drosophila species.
Collapse
Affiliation(s)
- Mariana Harris
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Bernard Y Kim
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Nandita Garud
- Department of Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
16
|
Guo B, Borda V, Laboulaye R, Spring MD, Wojnarski M, Vesely BA, Silva JC, Waters NC, O'Connor TD, Takala-Harrison S. Strong positive selection biases identity-by-descent-based inferences of recent demography and population structure in Plasmodium falciparum. Nat Commun 2024; 15:2499. [PMID: 38509066 PMCID: PMC10954658 DOI: 10.1038/s41467-024-46659-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 02/28/2024] [Indexed: 03/22/2024] Open
Abstract
Malaria genomic surveillance often estimates parasite genetic relatedness using metrics such as Identity-By-Decent (IBD), yet strong positive selection stemming from antimalarial drug resistance or other interventions may bias IBD-based estimates. In this study, we use simulations, a true IBD inference algorithm, and empirical data sets from different malaria transmission settings to investigate the extent of this bias and explore potential correction strategies. We analyze whole genome sequence data generated from 640 new and 3089 publicly available Plasmodium falciparum clinical isolates. We demonstrate that positive selection distorts IBD distributions, leading to underestimated effective population size and blurred population structure. Additionally, we discover that the removal of IBD peak regions partially restores the accuracy of IBD-based inferences, with this effect contingent on the population's background genetic relatedness and extent of inbreeding. Consequently, we advocate for selection correction for parasite populations undergoing strong, recent positive selection, particularly in high malaria transmission settings.
Collapse
Affiliation(s)
- Bing Guo
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Victor Borda
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Roland Laboulaye
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Michele D Spring
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Mariusz Wojnarski
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Brian A Vesely
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Joana C Silva
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, USA
- Global Health and Tropical Medicine (GHTM), Instituto de Higiene e Medicina Tropical (IHMT), Universidade NOVA de Lisboa (NOVA), Lisbon, Portugal
| | - Norman C Waters
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Timothy D O'Connor
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA.
| | - Shannon Takala-Harrison
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
17
|
Matheson J, Masel J. Background Selection From Unlinked Sites Causes Nonindependent Evolution of Deleterious Mutations. Genome Biol Evol 2024; 16:evae050. [PMID: 38482769 PMCID: PMC10972689 DOI: 10.1093/gbe/evae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2024] [Indexed: 04/01/2024] Open
Abstract
Background selection describes the reduction in neutral diversity caused by selection against deleterious alleles at other loci. It is typically assumed that the purging of deleterious alleles affects linked neutral variants, and indeed simulations typically only treat a genomic window. However, background selection at unlinked loci also depresses neutral diversity. In agreement with previous analytical approximations, in our simulations of a human-like genome with a realistically high genome-wide deleterious mutation rate, the effects of unlinked background selection exceed those of linked background selection. Background selection reduces neutral genetic diversity by a factor that is independent of census population size. Outside of genic regions, the strength of background selection increases with the mean selection coefficient, contradicting the linked theory but in agreement with the unlinked theory. Neutral diversity within genic regions is fairly independent of the strength of selection. Deleterious genetic load among haploid individuals is underdispersed, indicating nonindependent evolution of deleterious mutations. Empirical evidence for underdispersion was previously interpreted as evidence for global epistasis, but we recover it from a non-epistatic model.
Collapse
Affiliation(s)
- Joseph Matheson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
- Department of Ecology, Behavior, and Evolution, University of California San Diego, San Diego, CA 92093, USA
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
18
|
Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024; 20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024] Open
Abstract
Across the human genome, there are large-scale fluctuations in genetic diversity caused by the indirect effects of selection. This "linked selection signal" reflects the impact of selection according to the physical placement of functional regions and recombination rates along chromosomes. Previous work has shown that purifying selection acting against the steady influx of new deleterious mutations at functional portions of the genome shapes patterns of genomic variation. To date, statistical efforts to estimate purifying selection parameters from linked selection models have relied on classic Background Selection theory, which is only applicable when new mutations are so deleterious that they cannot fix in the population. Here, we develop a statistical method based on a quantitative genetics view of linked selection, that models how polygenic additive fitness variance distributed along the genome increases the rate of stochastic allele frequency change. By jointly predicting the equilibrium fitness variance and substitution rate due to both strong and weakly deleterious mutations, we estimate the distribution of fitness effects (DFE) and mutation rate across three geographically distinct human samples. While our model can accommodate weaker selection, we find evidence of strong selection operating similarly across all human samples. Although our quantitative genetic model of linked selection fits better than previous models, substitution rates of the most constrained sites disagree with observed divergence levels. We find that a model incorporating selective interference better predicts observed divergence in conserved regions, but overall our results suggest uncertainty remains about the processes generating fitness variation in humans.
Collapse
Affiliation(s)
- Vince Buffalo
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| | - Andrew D. Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
19
|
Soni V, Pfeifer SP, Jensen JD. The Effects of Mutation and Recombination Rate Heterogeneity on the Inference of Demography and the Distribution of Fitness Effects. Genome Biol Evol 2024; 16:evae004. [PMID: 38207127 PMCID: PMC10834165 DOI: 10.1093/gbe/evae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/12/2023] [Accepted: 01/07/2024] [Indexed: 01/13/2024] Open
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavor; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modeled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination before utilizing population genomic data to quantify the effects of genetic drift (i.e. as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modeled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
20
|
Schiebelhut LM, Guillaume AS, Kuhn A, Schweizer RM, Armstrong EE, Beaumont MA, Byrne M, Cosart T, Hand BK, Howard L, Mussmann SM, Narum SR, Rasteiro R, Rivera-Colón AG, Saarman N, Sethuraman A, Taylor HR, Thomas GWC, Wellenreuther M, Luikart G. Genomics and conservation: Guidance from training to analyses and applications. Mol Ecol Resour 2024; 24:e13893. [PMID: 37966259 DOI: 10.1111/1755-0998.13893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 10/25/2023] [Accepted: 10/30/2023] [Indexed: 11/16/2023]
Abstract
Environmental change is intensifying the biodiversity crisis and threatening species across the tree of life. Conservation genomics can help inform conservation actions and slow biodiversity loss. However, more training, appropriate use of novel genomic methods and communication with managers are needed. Here, we review practical guidance to improve applied conservation genomics. We share insights aimed at ensuring effectiveness of conservation actions around three themes: (1) improving pedagogy and training in conservation genomics including for online global audiences, (2) conducting rigorous population genomic analyses properly considering theory, marker types and data interpretation and (3) facilitating communication and collaboration between managers and researchers. We aim to update students and professionals and expand their conservation toolkit with genomic principles and recent approaches for conserving and managing biodiversity. The biodiversity crisis is a global problem and, as such, requires international involvement, training, collaboration and frequent reviews of the literature and workshops as we do here.
Collapse
Affiliation(s)
- Lauren M Schiebelhut
- Life and Environmental Sciences, University of California, Merced, California, USA
| | - Annie S Guillaume
- Geospatial Molecular Epidemiology group (GEOME), Laboratory for Biological Geochemistry (LGB), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Arianna Kuhn
- Department of Biological Sciences, University of Lethbridge, Lethbridge, Alberta, Canada
- Virginia Museum of Natural History, Martinsville, Virginia, USA
| | - Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | | | - Mark A Beaumont
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Margaret Byrne
- Department of Biodiversity, Conservation and Attractions, Biodiversity and Conservation Science, Perth, Western Australia, Australia
| | - Ted Cosart
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| | - Brian K Hand
- Flathead Lake Biological Station, University of Montana, Polson, Montana, USA
| | - Leif Howard
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| | - Steven M Mussmann
- Southwestern Native Aquatic Resources and Recovery Center, U.S. Fish & Wildlife Service, Dexter, New Mexico, USA
| | - Shawn R Narum
- Hagerman Genetics Lab, University of Idaho, Hagerman, Idaho, USA
| | - Rita Rasteiro
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | - Angel G Rivera-Colón
- Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Champaign, Illinois, USA
| | - Norah Saarman
- Department of Biology and Ecology Center, Utah State University, Logan, Utah, USA
| | - Arun Sethuraman
- Department of Biology, San Diego State University, San Diego, California, USA
| | - Helen R Taylor
- Royal Zoological Society of Scotland, Edinburgh, Scotland
| | - Gregg W C Thomas
- Informatics Group, Harvard University, Cambridge, Massachusetts, USA
| | - Maren Wellenreuther
- Plant and Food Research, Nelson, New Zealand
- University of Auckland, Auckland, New Zealand
| | - Gordon Luikart
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
- Flathead Lake Biology Station, University of Montana, Missoula, Montana, USA
| |
Collapse
|
21
|
Thom G, Moreira LR, Batista R, Gehara M, Aleixo A, Smith BT. Genomic Architecture Predicts Tree Topology, Population Structuring, and Demographic History in Amazonian Birds. Genome Biol Evol 2024; 16:evae002. [PMID: 38236173 PMCID: PMC10823491 DOI: 10.1093/gbe/evae002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 10/26/2023] [Accepted: 12/12/2023] [Indexed: 01/19/2024] Open
Abstract
Geographic barriers are frequently invoked to explain genetic structuring across the landscape. However, inferences on the spatial and temporal origins of population variation have been largely limited to evolutionary neutral models, ignoring the potential role of natural selection and intrinsic genomic processes known as genomic architecture in producing heterogeneity in differentiation across the genome. To test how variation in genomic characteristics (e.g. recombination rate) impacts our ability to reconstruct general patterns of differentiation between species that cooccur across geographic barriers, we sequenced the whole genomes of multiple bird populations that are distributed across rivers in southeastern Amazonia. We found that phylogenetic relationships within species and demographic parameters varied across the genome in predictable ways. Genetic diversity was positively associated with recombination rate and negatively associated with species tree support. Gene flow was less pervasive in genomic regions of low recombination, making these windows more likely to retain patterns of population structuring that matched the species tree. We further found that approximately a third of the genome showed evidence of selective sweeps and linked selection, skewing genome-wide estimates of effective population sizes and gene flow between populations toward lower values. In sum, we showed that the effects of intrinsic genomic characteristics and selection can be disentangled from neutral processes to elucidate spatial patterns of population differentiation.
Collapse
Affiliation(s)
- Gregory Thom
- Department of Ornithology, American Museum of Natural History, New York, NY, USA
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA
| | - Lucas Rocha Moreira
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
- Department of Vertebrate Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Romina Batista
- Programa de Coleções Biológicas, Instituto Nacional de Pesquisas da Amazônia, Manaus, Brazil
- School of Science, Engineering and Environment, University of Salford, Manchester, UK
| | - Marcelo Gehara
- Department of Earth and Environmental Sciences, Rutgers University, Newark, NJ, USA
| | - Alexandre Aleixo
- Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
- Department of Environmental Genomics, Instituto Tecnológico Vale, Belém, Brazil
| | - Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York, NY, USA
| |
Collapse
|
22
|
Harris M, Kim B, Garud N. Enrichment of hard sweeps on the X chromosome compared to autosomes in six Drosophila species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.21.545888. [PMID: 38106201 PMCID: PMC10723260 DOI: 10.1101/2023.06.21.545888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
The X chromosome, being hemizygous in males, is exposed one third of the time increasing the visibility of new mutations to natural selection, potentially leading to different evolutionary dynamics than autosomes. Recently, we found an enrichment of hard selective sweeps over soft selective sweeps on the X chromosome relative to the autosomes in a North American population of Drosophila melanogaster. To understand whether this enrichment is a universal feature of evolution on the X chromosome, we analyze diversity patterns across six commonly studied Drosophila species. We find an increased proportion of regions with steep reductions in diversity and elevated homozygosity on the X chromosome compared to autosomes. To assess if these signatures are consistent with positive selection, we simulate a wide variety of evolutionary scenarios spanning variations in demography, mutation rate, recombination rate, background selection, hard sweeps, and soft sweeps, and find that the diversity patterns observed on the X are most consistent with hard sweeps. Our findings highlight the importance of sex chromosomes in driving evolutionary processes and suggest that hard sweeps have played a significant role in shaping diversity patterns on the X chromosome across multiple Drosophila species.
Collapse
Affiliation(s)
- Mariana Harris
- Department of Computational Medicine, University of California Los Angeles, Los Angeles California, United States of America
| | - Bernard Kim
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Nandita Garud
- Ecology and Evolutionary Biology, University of California Los Angeles, Los Angeles California, United States of America
- Department of Human Genetics, University of California, Los Angeles, California, United States of America
| |
Collapse
|
23
|
Long H, Johri P, Gout JF, Ni J, Hao Y, Licknack T, Wang Y, Pan J, Jiménez-Marín B, Lynch M. Paramecium Genetics, Genomics, and Evolution. Annu Rev Genet 2023; 57:391-410. [PMID: 38012024 PMCID: PMC11334263 DOI: 10.1146/annurev-genet-071819-104035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
The ciliate genus Paramecium served as one of the first model systems in microbial eukaryotic genetics, contributing much to the early understanding of phenomena as diverse as genome rearrangement, cryptic speciation, cytoplasmic inheritance, and endosymbiosis, as well as more recently to the evolution of mating types, introns, and roles of small RNAs in DNA processing. Substantial progress has recently been made in the area of comparative and population genomics. Paramecium species combine some of the lowest known mutation rates with some of the largest known effective populations, along with likely very high recombination rates, thereby harboring a population-genetic environment that promotes an exceptionally efficient capacity for selection. As a consequence, the genomes are extraordinarily streamlined, with very small intergenic regions combined with small numbers of tiny introns. The subject of the bulk of Paramecium research, the ancient Paramecium aurelia species complex, is descended from two whole-genome duplication events that retain high degrees of synteny, thereby providing an exceptional platform for studying the fates of duplicate genes. Despite having a common ancestor dating to several hundred million years ago, the known descendant species are morphologically indistinguishable, raising significant questions about the common view that gene duplications lead to the origins of evolutionary novelties.
Collapse
Affiliation(s)
- Hongan Long
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, Qingdao, Shandong Province, China;
- Laboratory for Marine Biology and Biotechnology, Laoshan Laboratory, Qingdao, Shandong Province, China
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Jean-Francois Gout
- Department of Biological Sciences, Mississippi State University, Starkville, Mississippi, USA
| | - Jiahao Ni
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, Qingdao, Shandong Province, China;
| | - Yue Hao
- Cancer and Cell Biology Division, Translational Genomics Research Institute, Phoenix, Arizona, USA
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, USA;
| | - Timothy Licknack
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, USA;
| | - Yaohai Wang
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, Qingdao, Shandong Province, China;
| | - Jiao Pan
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, Qingdao, Shandong Province, China;
| | - Berenice Jiménez-Marín
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, USA;
| | - Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, USA;
| |
Collapse
|
24
|
Wolters JF, LaBella AL, Opulente DA, Rokas A, Hittinger CT. Mitochondrial genome diversity across the subphylum Saccharomycotina. Front Microbiol 2023; 14:1268944. [PMID: 38075892 PMCID: PMC10701893 DOI: 10.3389/fmicb.2023.1268944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 10/31/2023] [Indexed: 12/20/2023] Open
Abstract
Introduction Eukaryotic life depends on the functional elements encoded by both the nuclear genome and organellar genomes, such as those contained within the mitochondria. The content, size, and structure of the mitochondrial genome varies across organisms with potentially large implications for phenotypic variance and resulting evolutionary trajectories. Among yeasts in the subphylum Saccharomycotina, extensive differences have been observed in various species relative to the model yeast Saccharomyces cerevisiae, but mitochondrial genome sampling across many groups has been scarce, even as hundreds of nuclear genomes have become available. Methods By extracting mitochondrial assemblies from existing short-read genome sequence datasets, we have greatly expanded both the number of available genomes and the coverage across sparsely sampled clades. Results Comparison of 353 yeast mitochondrial genomes revealed that, while size and GC content were fairly consistent across species, those in the genera Metschnikowia and Saccharomyces trended larger, while several species in the order Saccharomycetales, which includes S. cerevisiae, exhibited lower GC content. Extreme examples for both size and GC content were scattered throughout the subphylum. All mitochondrial genomes shared a core set of protein-coding genes for Complexes III, IV, and V, but they varied in the presence or absence of mitochondrially-encoded canonical Complex I genes. We traced the loss of Complex I genes to a major event in the ancestor of the orders Saccharomycetales and Saccharomycodales, but we also observed several independent losses in the orders Phaffomycetales, Pichiales, and Dipodascales. In contrast to prior hypotheses based on smaller-scale datasets, comparison of evolutionary rates in protein-coding genes showed no bias towards elevated rates among aerobically fermenting (Crabtree/Warburg-positive) yeasts. Mitochondrial introns were widely distributed, but they were highly enriched in some groups. The majority of mitochondrial introns were poorly conserved within groups, but several were shared within groups, between groups, and even across taxonomic orders, which is consistent with horizontal gene transfer, likely involving homing endonucleases acting as selfish elements. Discussion As the number of available fungal nuclear genomes continues to expand, the methods described here to retrieve mitochondrial genome sequences from these datasets will prove invaluable to ensuring that studies of fungal mitochondrial genomes keep pace with their nuclear counterparts.
Collapse
Affiliation(s)
- John F. Wolters
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, WI, United States
| | - Abigail L. LaBella
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, United States
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, United States
| | - Dana A. Opulente
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, WI, United States
- Biology Department, Villanova University, Villanova, PA, United States
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, United States
| | - Chris Todd Hittinger
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, Center for Genomic Science Innovation, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, WI, United States
| |
Collapse
|
25
|
Soni V, Pfeifer SP, Jensen JD. The effects of mutation and recombination rate heterogeneity on the inference of demography and the distribution of fitness effects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.11.566703. [PMID: 38014252 PMCID: PMC10680612 DOI: 10.1101/2023.11.11.566703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Susanne P. Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| |
Collapse
|
26
|
Terbot JW, Cooper BS, Good JM, Jensen JD. A Simulation Framework for Modeling the Within-Patient Evolutionary Dynamics of SARS-CoV-2. Genome Biol Evol 2023; 15:evad204. [PMID: 37950882 PMCID: PMC10664409 DOI: 10.1093/gbe/evad204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/31/2023] [Accepted: 11/07/2023] [Indexed: 11/13/2023] Open
Abstract
The global impact of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has led to considerable interest in detecting novel beneficial mutations and other genomic changes that may signal the development of variants of concern (VOCs). The ability to accurately detect these changes within individual patient samples is important in enabling early detection of VOCs. Such genomic scans for rarely acting positive selection are best performed via comparison of empirical data with simulated data wherein commonly acting evolutionary factors, including mutation and recombination, reproductive and infection dynamics, and purifying and background selection, can be carefully accounted for and parameterized. Although there has been work to quantify these factors in SARS-CoV-2, they have yet to be integrated into a baseline model describing intrahost evolutionary dynamics. To construct such a baseline model, we develop a simulation framework that enables one to establish expectations for underlying levels and patterns of patient-level variation. By varying eight key parameters, we evaluated 12,096 different model-parameter combinations and compared them with existing empirical data. Of these, 592 models (∼5%) were plausible based on the resulting mean expected number of segregating variants. These plausible models shared several commonalities shedding light on intrahost SARS-CoV-2 evolutionary dynamics: severe infection bottlenecks, low levels of reproductive skew, and a distribution of fitness effects skewed toward strongly deleterious mutations. We also describe important areas of model uncertainty and highlight additional sequence data that may help to further refine a baseline model. This study lays the groundwork for the improved analysis of existing and future SARS-CoV-2 within-patient data.
Collapse
Affiliation(s)
- John W Terbot
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, Arizona, USA
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Brandon S Cooper
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Jeffrey M Good
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, Arizona, USA
| |
Collapse
|
27
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. Evolution 2023; 77:2113-2127. [PMID: 37395482 PMCID: PMC10547124 DOI: 10.1093/evolut/qpad120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 06/15/2023] [Accepted: 06/30/2023] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
28
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.29.555322. [PMID: 37693550 PMCID: PMC10491248 DOI: 10.1101/2023.08.29.555322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in non-model species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to non-model genomes. We apply ABC-MK to the human proteome and a set of known Virus Interacting Proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| |
Collapse
|
29
|
Nigenda-Morales SF, Lin M, Nuñez-Valencia PG, Kyriazis CC, Beichman AC, Robinson JA, Ragsdale AP, Urbán R J, Archer FI, Viloria-Gómora L, Pérez-Álvarez MJ, Poulin E, Lohmueller KE, Moreno-Estrada A, Wayne RK. The genomic footprint of whaling and isolation in fin whale populations. Nat Commun 2023; 14:5465. [PMID: 37699896 PMCID: PMC10497599 DOI: 10.1038/s41467-023-40052-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 07/10/2023] [Indexed: 09/14/2023] Open
Abstract
Twentieth century industrial whaling pushed several species to the brink of extinction, with fin whales being the most impacted. However, a small, resident population in the Gulf of California was not targeted by whaling. Here, we analyzed 50 whole-genomes from the Eastern North Pacific (ENP) and Gulf of California (GOC) fin whale populations to investigate their demographic history and the genomic effects of natural and human-induced bottlenecks. We show that the two populations diverged ~16,000 years ago, after which the ENP population expanded and then suffered a 99% reduction in effective size during the whaling period. In contrast, the GOC population remained small and isolated, receiving less than one migrant per generation. However, this low level of migration has been crucial for maintaining its viability. Our study exposes the severity of whaling, emphasizes the importance of migration, and demonstrates the use of genome-based analyses and simulations to inform conservation strategies.
Collapse
Affiliation(s)
- Sergio F Nigenda-Morales
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico.
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA, 92096, USA.
| | - Meixi Lin
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, 94305, USA.
| | - Paulina G Nuñez-Valencia
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Morelos, México
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Annabel C Beichman
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Jacqueline A Robinson
- Institute for Human Genetics, University of California, San Francisco (UCSF), San Francisco, CA, 94143, USA
| | - Aaron P Ragsdale
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico
- Department of Integrative Biology, University of Wisconsin, Madison, WI, 53706, USA
| | - Jorge Urbán R
- Departamento de Ciencias Marinas y Costeras, Universidad Autónoma de Baja California Sur (UABCS), La Paz, Baja California Sur, Mexico
| | - Frederick I Archer
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, La Jolla, CA, 92037, USA
| | - Lorena Viloria-Gómora
- Departamento de Ciencias Marinas y Costeras, Universidad Autónoma de Baja California Sur (UABCS), La Paz, Baja California Sur, Mexico
| | - María José Pérez-Álvarez
- Escuela de Medicina Veterinaria, Facultad de Medicina y Ciencias de la Salud, Universidad Mayor, Santiago, Chile
- Millennium Institute Biodiversity of Antarctic and Subantarctic Ecosystems (BASE), Universidad de Chile, Santiago, Chile
| | - Elie Poulin
- Millennium Institute Biodiversity of Antarctic and Subantarctic Ecosystems (BASE), Universidad de Chile, Santiago, Chile
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA, 90095, USA.
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA.
| | - Andrés Moreno-Estrada
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico.
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| |
Collapse
|
30
|
Charmouh AP, Bocedi G, Hartfield M. Inferring the distributions of fitness effects and proportions of strongly deleterious mutations. G3 (BETHESDA, MD.) 2023; 13:jkad140. [PMID: 37337692 PMCID: PMC10468728 DOI: 10.1093/g3journal/jkad140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 06/05/2023] [Accepted: 06/05/2023] [Indexed: 06/21/2023]
Abstract
The distribution of fitness effects is a key property in evolutionary genetics as it has implications for several evolutionary phenomena including the evolution of sex and mating systems, the rate of adaptive evolution, and the prevalence of deleterious mutations. Despite the distribution of fitness effects being extensively studied, the effects of strongly deleterious mutations are difficult to infer since such mutations are unlikely to be present in a sample of haplotypes, so genetic data may contain very little information about them. Recent work has attempted to correct for this issue by expanding the classic gamma-distributed model to explicitly account for strongly deleterious mutations. Here, we use simulations to investigate one such method, adding a parameter (plth) to capture the proportion of strongly deleterious mutations. We show that plth can improve the model fit when applied to individual species but underestimates the true proportion of strongly deleterious mutations. The parameter can also artificially maximize the likelihood when used to jointly infer a distribution of fitness effects from multiple species. As plth and related parameters are used in current inference algorithms, our results are relevant with respect to avoiding model artifacts and improving future tools for inferring the distribution of fitness effects.
Collapse
Affiliation(s)
- Anders P Charmouh
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
- Bioinformatics Research Centre Aarhus University, University City 81, building 1872, 3rd floor. DK-8000 Aarhus C, Denmark
| | - Greta Bocedi
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
| | - Matthew Hartfield
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
31
|
Charlesworth B. The effects of inversion polymorphisms on patterns of neutral genetic diversity. Genetics 2023; 224:iyad116. [PMID: 37348059 PMCID: PMC10411593 DOI: 10.1093/genetics/iyad116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 02/23/2023] [Accepted: 06/11/2023] [Indexed: 06/24/2023] Open
Abstract
The strong reduction in the frequency of recombination in heterozygotes for an inversion and a standard gene arrangement causes the arrangements to become partially isolated genetically, resulting in sequence divergence between them and changes in the levels of neutral variability at nucleotide sites within each arrangement class. Previous theoretical studies on the effects of inversions on neutral variability have assumed either that the population is panmictic or that it is divided into 2 populations subject to divergent selection. Here, the theory is extended to a model of an arbitrary number of demes connected by migration, using a finite island model with the inversion present at the same frequency in all demes. Recursion relations for mean pairwise coalescent times are used to obtain simple approximate expressions for diversity and divergence statistics for an inversion polymorphism at equilibrium under recombination and drift, and for the approach to equilibrium following the sweep of an inversion to a stable intermediate frequency. The effects of an inversion polymorphism on patterns of linkage disequilibrium are also examined. The reduction in effective recombination rate caused by population subdivision can have significant effects on these statistics. The theoretical results are discussed in relation to population genomic data on inversion polymorphisms, with an emphasis on Drosophila melanogaster. Methods are proposed for testing whether or not inversions are close to recombination-drift equilibrium, and for estimating the rate of recombinational exchange in heterozygotes for inversions; difficulties involved in estimating the ages of inversions are also discussed.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
32
|
Terbot JW, Cooper BS, Good JM, Jensen JD. A simulation framework for modeling the within-patient evolutionary dynamics of SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548462. [PMID: 37503016 PMCID: PMC10370031 DOI: 10.1101/2023.07.13.548462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The global impact of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has led to considerable interest in detecting novel beneficial mutations and other genomic changes that may signal the development of variants of concern (VOCs). The ability to accurately detect these changes within individual patient samples is important in enabling early detection of VOCs. Such genomic scans for positive selection are best performed via comparison of empirical data to simulated data wherein evolutionary factors, including mutation and recombination rates, reproductive and infection dynamics, and purifying and background selection, can be carefully accounted for and parameterized. While there has been work to quantify these factors in SARS-CoV-2, they have yet to be integrated into a baseline model describing intra-host evolutionary dynamics. To construct such a baseline model, we develop a simulation framework that enables one to establish expectations for underlying levels and patterns of patient-level variation. By varying eight key parameters, we evaluated 12,096 different model-parameter combinations and compared them to existing empirical data. Of these, 592 models (~5%) were plausible based on the resulting mean expected number of segregating variants. These plausible models shared several commonalities shedding light on intra-host SARS-CoV-2 evolutionary dynamics: severe infection bottlenecks, low levels of reproductive skew, and a distribution of fitness effects skewed towards strongly deleterious mutations. We also describe important areas of model uncertainty and highlight additional sequence data that may help to further refine a baseline model. This study lays the groundwork for the improved analysis of existing and future SARS-CoV-2 within-patient data.
Collapse
Affiliation(s)
- John W Terbot
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Brandon S. Cooper
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey M. Good
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| |
Collapse
|
33
|
Guo B, Borda V, Laboulaye R, Spring MD, Wojnarski M, Vesely BA, Silva JC, Waters NC, O'Connor TD, Takala-Harrison S. Strong Positive Selection Biases Identity-By-Descent-Based Inferences of Recent Demography and Population Structure in Plasmodium falciparum. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.14.549114. [PMID: 37502843 PMCID: PMC10370022 DOI: 10.1101/2023.07.14.549114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Malaria genomic surveillance often estimates parasite genetic relatedness using metrics such as Identity-By-Decent (IBD). Yet, strong positive selection stemming from antimalarial drug resistance or other interventions may bias IBD-based estimates. In this study, we utilized simulations, a true IBD inference algorithm, and empirical datasets from different malaria transmission settings to investigate the extent of such bias and explore potential correction strategies. We analyzed whole genome sequence data generated from 640 new and 4,026 publicly available Plasmodium falciparum clinical isolates. Our findings demonstrated that positive selection distorts IBD distributions, leading to underestimated effective population size and blurred population structure. Additionally, we discovered that the removal of IBD peak regions partially restored the accuracy of IBD-based inferences, with this effect contingent on the population's background genetic relatedness. Consequently, we advocate for selection correction for parasite populations undergoing strong, recent positive selection, particularly in high malaria transmission settings.
Collapse
Affiliation(s)
- Bing Guo
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD USA
| | - Victor Borda
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Roland Laboulaye
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Michele D Spring
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Mariusz Wojnarski
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Brian A Vesely
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Joana C Silva
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Norman C Waters
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Timothy D O'Connor
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Shannon Takala-Harrison
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD USA
| |
Collapse
|
34
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.15.545166. [PMID: 37398347 PMCID: PMC10312679 DOI: 10.1101/2023.06.15.545166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modelled by a realistic mutation rate and as part of a realistic distribution of fitness effects (DFE), as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modelled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false positive rates are in excess of true positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong. Teaser Text Outlier-based genomic scans have proven a popular approach for identifying loci that have potentially experienced recent positive selection. However, it has previously been shown that an evolutionarily appropriate baseline model that incorporates non-equilibrium population histories, purifying and background selection, and variation in mutation and recombination rates is necessary to reduce often extreme false positive rates when performing genomic scans. Here we evaluate the power to detect recurrent selective sweeps using common SFS-based and haplotype-based methods under these increasingly realistic models. We find that while these appropriate evolutionary baselines are essential to reduce false positive rates, the power to accurately detect recurrent selective sweeps is generally low across much of the biologically relevant parameter space.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Present address: Department of Biology, Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
35
|
Johri P, Pfeifer SP, Jensen JD. Developing an evolutionary baseline model for humans: jointly inferring purifying selection with population history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536488. [PMID: 37090533 PMCID: PMC10120674 DOI: 10.1101/2023.04.11.536488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics - including quantifying the relative contributions of adaptive vs. non-adaptive processes - but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection ( e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome ( i.e. , purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.
Collapse
|
36
|
Terbot JW, Johri P, Liphardt SW, Soni V, Pfeifer SP, Cooper BS, Good JM, Jensen JD. Developing an appropriate evolutionary baseline model for the study of SARS-CoV-2 patient samples. PLoS Pathog 2023; 19:e1011265. [PMID: 37018331 PMCID: PMC10075409 DOI: 10.1371/journal.ppat.1011265] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/06/2023] Open
Abstract
Over the past 3 years, Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has spread through human populations in several waves, resulting in a global health crisis. In response, genomic surveillance efforts have proliferated in the hopes of tracking and anticipating the evolution of this virus, resulting in millions of patient isolates now being available in public databases. Yet, while there is a tremendous focus on identifying newly emerging adaptive viral variants, this quantification is far from trivial. Specifically, multiple co-occurring and interacting evolutionary processes are constantly in operation and must be jointly considered and modeled in order to perform accurate inference. We here outline critical individual components of such an evolutionary baseline model-mutation rates, recombination rates, the distribution of fitness effects, infection dynamics, and compartmentalization-and describe the current state of knowledge pertaining to the related parameters of each in SARS-CoV-2. We close with a series of recommendations for future clinical sampling, model construction, and statistical analysis.
Collapse
Affiliation(s)
- John W Terbot
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| | - Parul Johri
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| | - Schuyler W Liphardt
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| | - Susanne P Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| | - Brandon S Cooper
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey M Good
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey D Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| |
Collapse
|
37
|
Freund F, Kerdoncuff E, Matuszewski S, Lapierre M, Hildebrandt M, Jensen JD, Ferretti L, Lambert A, Sackton TB, Achaz G. Interpreting the pervasive observation of U-shaped Site Frequency Spectra. PLoS Genet 2023; 19:e1010677. [PMID: 36952570 PMCID: PMC10072462 DOI: 10.1371/journal.pgen.1010677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 04/04/2023] [Accepted: 02/22/2023] [Indexed: 03/25/2023] Open
Abstract
The standard neutral model of molecular evolution has traditionally been used as the null model for population genomics. We gathered a collection of 45 genome-wide site frequency spectra from a diverse set of species, most of which display an excess of low and high frequency variants compared to the expectation of the standard neutral model, resulting in U-shaped spectra. We show that multiple merger coalescent models often provide a better fit to these observations than the standard Kingman coalescent. Hence, in many circumstances these under-utilized models may serve as the more appropriate reference for genomic analyses. We further discuss the underlying evolutionary processes that may result in the widespread U-shape of frequency spectra.
Collapse
Affiliation(s)
- Fabian Freund
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Elise Kerdoncuff
- Department of Genetics, University of California, Berkeley, California, United States of America
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | | | - Marguerite Lapierre
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | | | - Jeffrey D Jensen
- Center for Evolution & Medicine, School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Luca Ferretti
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Amaury Lambert
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, Paris, France
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | - Timothy B Sackton
- Éco-anthropologie, Muséum National d'Histoire Naturelle, Université Paris-Cité, Paris, France
| | - Guillaume Achaz
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
- SMILE group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, Paris, France
| |
Collapse
|
38
|
Jensen JD. Population genetic concerns related to the interpretation of empirical outliers and the neglect of common evolutionary processes. Heredity (Edinb) 2023; 130:109-110. [PMID: 36829044 PMCID: PMC9981695 DOI: 10.1038/s41437-022-00575-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 10/27/2022] [Accepted: 10/28/2022] [Indexed: 02/26/2023] Open
Affiliation(s)
- Jeffrey D Jensen
- School of Life Science, Arizona State University, Tempe, AZ, USA.
| |
Collapse
|
39
|
Kimmitt AA, Pegan TM, Jones AW, Wacker KS, Brennan CL, Hudon J, Kirchman JJ, Ruegg K, Benz BW, Herman R, Winger BM. Genetic evidence for widespread population size expansion in North American boreal birds prior to the Last Glacial Maximum. Proc Biol Sci 2023; 290:20221334. [PMID: 36695033 PMCID: PMC9874272 DOI: 10.1098/rspb.2022.1334] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Accepted: 12/19/2022] [Indexed: 01/26/2023] Open
Abstract
Pleistocene climate cycles are well documented to have shaped contemporary species distributions and genetic diversity. Northward range expansions in response to deglaciation following the Last Glacial Maximum (LGM; approximately 21 000 years ago) are surmised to have led to population size expansions in terrestrial taxa and changes in seasonal migratory behaviour. Recent findings, however, suggest that some northern temperate populations may have been more stable than expected through the LGM. We modelled the demographic history of 19 co-distributed boreal-breeding North American bird species from full mitochondrial gene sets and species-specific molecular rates. We used these demographic reconstructions to test how species with different migratory strategies were affected by glacial cycles. Our results suggest that effective population sizes increased in response to Pleistocene deglaciation earlier than the LGM, whereas genetic diversity was maintained throughout the LGM despite shifts in geographical range. We conclude that glacial cycles prior to the LGM have most strongly shaped contemporary genetic diversity in these species. We did not find a relationship between historic population dynamics and migratory strategy, contributing to growing evidence that major switches in migratory strategy during the LGM are unnecessary to explain contemporary migratory patterns.
Collapse
Affiliation(s)
- Abigail A. Kimmitt
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Teresa M. Pegan
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Andrew W. Jones
- Department of Ornithology, Cleveland Museum of Natural History, Cleveland, OH 44106, USA
| | - Kristen S. Wacker
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Courtney L. Brennan
- Department of Ornithology, Cleveland Museum of Natural History, Cleveland, OH 44106, USA
| | - Jocelyn Hudon
- Royal Alberta Museum, Edmonton, Alberta Canada, T5J 0G2
| | | | - Kristen Ruegg
- Biology Department, Colorado State University, Fort Collins, CO 80521, USA
| | - Brett W. Benz
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Rachael Herman
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| | - Benjamin M. Winger
- Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
40
|
Lea AJ, Garcia A, Arevalo J, Ayroles JF, Buetow K, Cole SW, Eid Rodriguez D, Gutierrez M, Highland HM, Hooper PL, Justice A, Kraft T, North KE, Stieglitz J, Kaplan H, Trumble BC, Gurven MD. Natural selection of immune and metabolic genes associated with health in two lowland Bolivian populations. Proc Natl Acad Sci U S A 2023; 120:e2207544120. [PMID: 36574663 PMCID: PMC9910614 DOI: 10.1073/pnas.2207544120] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 09/21/2022] [Indexed: 12/28/2022] Open
Abstract
A growing body of work has addressed human adaptations to diverse environments using genomic data, but few studies have connected putatively selected alleles to phenotypes, much less among underrepresented populations such as Amerindians. Studies of natural selection and genotype-phenotype relationships in underrepresented populations hold potential to uncover previously undescribed loci underlying evolutionarily and biomedically relevant traits. Here, we worked with the Tsimane and the Moseten, two Amerindian populations inhabiting the Bolivian lowlands. We focused most intensively on the Tsimane, because long-term anthropological work with this group has shown that they have a high burden of both macro and microparasites, as well as minimal cardiometabolic disease or dementia. We therefore generated genome-wide genotype data for Tsimane individuals to study natural selection, and paired this with blood mRNA-seq as well as cardiometabolic and immune biomarker data generated from a larger sample that included both populations. In the Tsimane, we identified 21 regions that are candidates for selective sweeps, as well as 5 immune traits that show evidence for polygenic selection (e.g., C-reactive protein levels and the response to coronaviruses). Genes overlapping candidate regions were strongly enriched for known involvement in immune-related traits, such as abundance of lymphocytes and eosinophils. Importantly, we were also able to draw on extensive phenotype information for the Tsimane and Moseten and link five regions (containing PSD4, MUC21 and MUC22, TOX2, ANXA6, and ABCA1) with biomarkers of immune and metabolic function. Together, our work highlights the utility of pairing evolutionary analyses with anthropological and biomedical data to gain insight into the genetic basis of health-related traits.
Collapse
Affiliation(s)
- Amanda J. Lea
- Department of Biological Sciences, Vanderbilt University, Nashville, TN37235
| | - Angela Garcia
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ85287
| | - Jesusa Arevalo
- Department of Medicine, University of California, Los Angeles, CA90095
| | - Julien F. Ayroles
- Department of Ecology and Evolution, Princeton University, Princeton, NJ08544
- Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ08544
| | - Kenneth Buetow
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ85287
- School of Life Sciences, Arizona State University, Tempe, AZ85287
| | - Steve W. Cole
- Department of Psychiatry and Biobehavioral Sciences, University of California, Los Angeles, CA90095
- Department of Medicine, University of California, Los Angeles, CA90095
| | | | | | - Heather M. Highland
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC27516
| | - Paul L. Hooper
- Economic Science Institute, Chapman University, Orange, CA92866
| | | | - Thomas Kraft
- Department of Anthropology, University of Utah, Salt Lake City, UT84112
| | - Kari E. North
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC27516
| | | | - Hillard Kaplan
- Institute for Economics and Society, Chapman University, Orange, CA92866
| | - Benjamin C. Trumble
- Center for Evolution and Medicine, Arizona State University, Tempe, AZ85287
- School of Human Evolution and Social Change, Arizona State University, Tempe, AZ85287
| | - Michael D. Gurven
- Department of Anthropology, University of California, Santa Barbara, CA93106
| |
Collapse
|
41
|
Lynch M, Trickovic B, Kempes CP. Evolutionary scaling of maximum growth rate with organism size. Sci Rep 2022; 12:22586. [PMID: 36585440 PMCID: PMC9803686 DOI: 10.1038/s41598-022-23626-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 11/02/2022] [Indexed: 12/31/2022] Open
Abstract
Data from nearly 1000 species reveal the upper bound to rates of biomass production achievable by natural selection across the Tree of Life. For heterotrophs, maximum growth rates scale positively with organism size in bacteria but negatively in eukaryotes, whereas for phototrophs, the scaling is negligible for cyanobacteria and weakly negative for eukaryotes. These results have significant implications for understanding the bioenergetic consequences of the transition from prokaryotes to eukaryotes, and of the expansion of some groups of the latter into multicellularity. The magnitudes of the scaling coefficients for eukaryotes are significantly lower than expected under any proposed physical-constraint model. Supported by genomic, bioenergetic, and population-genetic data and theory, an alternative hypothesis for the observed negative scaling in eukaryotes postulates that growth-diminishing mutations with small effects passively accumulate with increasing organism size as a consequence of associated increases in the power of random genetic drift. In contrast, conditional on the structural and functional features of ribosomes, natural selection has been able to promote bacteria with the fastest possible growth rates, implying minimal conflicts with both bioenergetic constraints and random genetic drift. If this extension of the drift-barrier hypothesis is correct, the interpretations of comparative studies of biological traits that have traditionally ignored differences in population-genetic environments will require revisiting.
Collapse
Affiliation(s)
- Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ, 85287, USA.
| | - Bogi Trickovic
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ, 85287, USA
| | | |
Collapse
|
42
|
Harris M, Garud NR. Enrichment of Hard Sweeps on the X Chromosome in Drosophila melanogaster. Mol Biol Evol 2022; 40:6955808. [PMID: 36546413 PMCID: PMC9825254 DOI: 10.1093/molbev/msac268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 11/11/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
The characteristic properties of the X chromosome, such as male hemizygosity and its unique inheritance pattern, expose it to natural selection in a way that can be different from the autosomes. Here, we investigate the differences in the tempo and mode of adaptation on the X chromosome and autosomes in a population of Drosophila melanogaster. Specifically, we test the hypothesis that due to hemizygosity and a lower effective population size on the X, the relative proportion of hard sweeps, which are expected when adaptation is gradual, compared with soft sweeps, which are expected when adaptation is rapid, is greater on the X than on the autosomes. We quantify the incidence of hard versus soft sweeps in North American D. melanogaster population genomic data with haplotype homozygosity statistics and find an enrichment of the proportion of hard versus soft sweeps on the X chromosome compared with the autosomes, confirming predictions we make from simulations. Understanding these differences may enable a deeper understanding of how important phenotypes arise as well as the impact of fundamental evolutionary parameters on adaptation, such as dominance, sex-specific selection, and sex-biased demography.
Collapse
Affiliation(s)
- Mariana Harris
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA
| | | |
Collapse
|
43
|
Burny C, Nolte V, Dolezal M, Schlötterer C. Genome-wide selection signatures reveal widespread synergistic effects of two different stressors in Drosophila melanogaster. Proc Biol Sci 2022; 289:20221857. [PMID: 36259211 PMCID: PMC9579754 DOI: 10.1098/rspb.2022.1857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Experimental evolution combined with whole-genome sequencing (evolve and resequence (E&R)) is a powerful approach to study the adaptive architecture of selected traits. Nevertheless, so far the focus has been on the selective response triggered by a single stressor. Building on the highly parallel selection response of founder populations with reduced variation, we evaluated how the presence of a second stressor affects the genomic selection response. After 20 generations of adaptation to laboratory conditions at either 18°C or 29°C, strong genome-wide selection signatures were observed. Only 38% of the selection signatures can be attributed to laboratory adaptation (no difference between temperature regimes). The remaining selection responses are either caused by temperature-specific effects, or reflect the joint effects of temperature and laboratory adaptation (same direction, but the magnitude differs between temperatures). The allele frequency changes resulting from the combined effects of temperature and laboratory adaptation were more extreme in the hot environment for 83% of the affected genomic regions-indicating widespread synergistic effects of the two stressors. We conclude that E&R with reduced genetic variation is a powerful approach to study genome-wide fitness consequences driven by the combined effects of multiple environmental factors.
Collapse
Affiliation(s)
- Claire Burny
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, Vienna 1210, Austria.,Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna 1210, Austria
| | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, Vienna 1210, Austria
| | - Marlies Dolezal
- Plattform Bioinformatik und Biostatistik, Vetmeduni Vienna, Vienna 1210, Austria
| | - Christian Schlötterer
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, Vienna 1210, Austria
| |
Collapse
|
44
|
Sandoval-Padilla I, Zamora-Tavares MDP, Ruiz-Sánchez E, Pérez-Alquicira J, Vargas-Ponce O. Characterization of the plastome of Physaliscordata and comparative analysis of eight species of Physalis sensu stricto. PHYTOKEYS 2022; 210:109-134. [PMID: 36760406 PMCID: PMC9836641 DOI: 10.3897/phytokeys.210.85668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 09/07/2022] [Indexed: 06/18/2023]
Abstract
In this study, we sequenced, assembled, and annotated the plastome of Physaliscordata Mill. and compared it with seven species of the genus Physalis sensu stricto. Sequencing, annotating, and comparing plastomes allow us to understand the evolutionary mechanisms associated with physiological functions, select possible molecular markers, and identify the types of selection that have acted in different regions of the genome. The plastome of P.cordata is 157,000 bp long and presents the typical quadripartite structure with a large single-copy (LSC) region of 87,267 bp and a small single-copy (SSC) region of 18,501 bp, which are separated by two inverted repeat (IRs) regions of 25,616 bp each. These values are similar to those found in the other species, except for P.angulata L. and P.pruinosa L., which presented an expansion of the LSC region and a contraction of the IR regions. The plastome in all Physalis species studied shows variation in the boundary of the regions with three distinct types, the percentage of the sequence identity between coding and non-coding regions, and the number of repetitive regions and microsatellites. Four genes and 10 intergenic regions show promise as molecular markers and eight genes were under positive selection. The maximum likelihood analysis showed that the plastome is a good source of information for phylogenetic inference in the genus, given the high support values and absence of polytomies. In the Physalis plastomes analyzed here, the differences found, the positive selection of genes, and the phylogenetic relationships do not show trends that correspond to the biological or ecological characteristics of the species studied.
Collapse
Affiliation(s)
- Isaac Sandoval-Padilla
- Doctorado en Ciencias en Biosistemática, Ecología y Manejo de Recursos Naturales y Agrícolas, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoUniversidad de GuadalajaraZapopanMexico
| | - María del Pilar Zamora-Tavares
- Doctorado en Ciencias en Biosistemática, Ecología y Manejo de Recursos Naturales y Agrícolas, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoUniversidad de GuadalajaraZapopanMexico
| | - Eduardo Ruiz-Sánchez
- Doctorado en Ciencias en Biosistemática, Ecología y Manejo de Recursos Naturales y Agrícolas, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoUniversidad de GuadalajaraZapopanMexico
| | - Jessica Pérez-Alquicira
- Doctorado en Ciencias en Biosistemática, Ecología y Manejo de Recursos Naturales y Agrícolas, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoUniversidad de GuadalajaraZapopanMexico
- Laboratorio Nacional de Identificación y Caracterización Vegetal A(LaniVeg), Consejo Nacional de Ciencia y Tecnología (CONACyT), Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoCONACYTMexico CityMexico
| | - Ofelia Vargas-Ponce
- Doctorado en Ciencias en Biosistemática, Ecología y Manejo de Recursos Naturales y Agrícolas, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Ramón Padilla Sánchez 2100, 45200 Las Agujas, Zapopan, Jalisco, MexicoUniversidad de GuadalajaraZapopanMexico
| |
Collapse
|
45
|
Provost K, Shue SY, Forcellati M, Smith BT. The Genomic Landscapes of Desert Birds Form over Multiple Time Scales. Mol Biol Evol 2022; 39:6711078. [PMID: 36134537 PMCID: PMC9577548 DOI: 10.1093/molbev/msac200] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Spatial models show that genetic differentiation between populations can be explained by factors ranging from geographic distance to environmental resistance across the landscape. However, genomes exhibit a landscape of differentiation, indicating that multiple processes may mediate divergence in different portions of the genome. We tested this idea by comparing alternative geographic predctors of differentiation in ten bird species that co-occur in Sonoran and Chihuahuan Deserts of North America. Using population-level genomic data, we described the genomic landscapes across species and modeled conditions that represented historical and contemporary mechanisms. The characteristics of genomic landscapes differed across species, influenced by varying levels of population structuring and admixture between deserts, and the best-fit models contrasted between the whole genome and partitions along the genome. Both historical and contemporary mechanisms were important in explaining genetic distance, but particularly past and current environments, suggesting that genomic evolution was modulated by climate and habitat There were also different best-ftit models across genomic partitions of the data, indicating that these regions capture different evolutionary histories. These results show that the genomic landscape of differentiation can be associated with alternative geographic factors operating on different portions of the genome, which reflect how heterogeneous patterns of genetic differentiation can evolve across species and genomes.
Collapse
Affiliation(s)
| | - Stephanie Yun Shue
- Bergen County Academies, Hackensack, NJ, USA,Biological Sciences, University of California Berkeley, Berkeley, CA, USA
| | - Meghan Forcellati
- Bergen County Academies, Hackensack, NJ, USA,Ecology, Evolution, and Environmental Biology, Columbia University, New York, NY, USA
| | - Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York, NY, USA
| |
Collapse
|
46
|
Hager ER, Harringmeyer OS, Wooldridge TB, Theingi S, Gable JT, McFadden S, Neugeboren B, Turner KM, Jensen JD, Hoekstra HE. A chromosomal inversion contributes to divergence in multiple traits between deer mouse ecotypes. Science 2022; 377:399-405. [PMID: 35862520 PMCID: PMC9571565 DOI: 10.1126/science.abg0718] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
How locally adapted ecotypes are established and maintained within a species is a long-standing question in evolutionary biology. Using forest and prairie ecotypes of deer mice (Peromyscus maniculatus), we characterized the genetic basis of variation in two defining traits-tail length and coat color-and discovered a 41-megabase chromosomal inversion linked to both. The inversion frequency is 90% in the dark, long-tailed forest ecotype; decreases across a habitat transition; and is absent from the light, short-tailed prairie ecotype. We implicate divergent selection in maintaining the inversion at frequencies observed in the wild, despite high levels of gene flow, and explore fitness benefits that arise from suppressed recombination within the inversion. We uncover a key role for a large, previously uncharacterized inversion in the evolution and maintenance of classic mammalian ecotypes.
Collapse
Affiliation(s)
- Emily R Hager
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Olivia S Harringmeyer
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - T Brock Wooldridge
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Shunn Theingi
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Jacob T Gable
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Sade McFadden
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Beverly Neugeboren
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Kyle M Turner
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Hopi E Hoekstra
- Department of Molecular and Cellular Biology, Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, and Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
47
|
Abstract
We discuss the genetic, demographic, and selective forces that are likely to be at play in restricting observed levels of DNA sequence variation in natural populations to a much smaller range of values than would be expected from the distribution of census population sizes alone-Lewontin's Paradox. While several processes that have previously been strongly emphasized must be involved, including the effects of direct selection and genetic hitchhiking, it seems unlikely that they are sufficient to explain this observation without contributions from other factors. We highlight a potentially important role for the less-appreciated contribution of population size change; specifically, the likelihood that many species and populations may be quite far from reaching the relatively high equilibrium diversity values that would be expected given their current census sizes.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
48
|
Johri P, Eyre-Walker A, Gutenkunst RN, Lohmueller KE, Jensen JD. On the prospect of achieving accurate joint estimation of selection with population history. Genome Biol Evol 2022; 14:evac088. [PMID: 35675379 PMCID: PMC9254643 DOI: 10.1093/gbe/evac088] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 11/15/2022] Open
Abstract
As both natural selection and population history can affect genome-wide patterns of variation, disentangling the contributions of each has remained as a major challenge in population genetics. We here discuss historical and recent progress towards this goal-highlighting theoretical and computational challenges that remain to be addressed, as well as inherent difficulties in dealing with model complexity and model violations-and offer thoughts on potentially fruitful next steps.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | | | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA
- Department of Human Genetics, University of California, Los Angeles, CA, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
49
|
Booker TR, Payseur BA, Tigano A. Background selection under evolving recombination rates. Proc Biol Sci 2022; 289:20220782. [PMID: 35730151 PMCID: PMC9233929 DOI: 10.1098/rspb.2022.0782] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Background selection (BGS), the effect that purifying selection exerts on sites linked to deleterious alleles, is expected to be ubiquitous across eukaryotic genomes. The effects of BGS reflect the interplay of the rates and fitness effects of deleterious mutations with recombination. A fundamental assumption of BGS models is that recombination rates are invariant over time. However, in some lineages, recombination rates evolve rapidly, violating this central assumption. Here, we investigate how recombination rate evolution affects genetic variation under BGS. We show that recombination rate evolution modifies the effects of BGS in a manner similar to a localized change in the effective population size, potentially leading to underestimation or overestimation of the genome-wide effects of selection. Furthermore, we find evidence that recombination rate evolution in the ancestors of modern house mice may have impacted inferences of the genome-wide effects of selection in that species.
Collapse
Affiliation(s)
- Tom R. Booker
- Department of Zoology, University of British Columbia, Vancouver Campus, Vancouver, BC, Canada
| | - Bret A. Payseur
- Laboratory of Genetics, University of Wisconsin - Madison, Madison, WI, USA
| | - Anna Tigano
- Department of Biology, University of British Columbia, Okanagan Campus, Kelowna, BC, Canada
| |
Collapse
|
50
|
De Jode A, Le Moan A, Johannesson K, Faria R, Stankowski S, Westram AM, Butlin RK, Rafajlović M, Fraïsse C. Ten years of demographic modelling of divergence and speciation in the sea. Evol Appl 2022; 16:542-559. [PMID: 36793688 PMCID: PMC9923478 DOI: 10.1111/eva.13428] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 05/16/2022] [Accepted: 05/25/2022] [Indexed: 11/27/2022] Open
Abstract
Understanding population divergence that eventually leads to speciation is essential for evolutionary biology. High species diversity in the sea was regarded as a paradox when strict allopatry was considered necessary for most speciation events because geographical barriers seemed largely absent in the sea, and many marine species have high dispersal capacities. Combining genome-wide data with demographic modelling to infer the demographic history of divergence has introduced new ways to address this classical issue. These models assume an ancestral population that splits into two subpopulations diverging according to different scenarios that allow tests for periods of gene flow. Models can also test for heterogeneities in population sizes and migration rates along the genome to account, respectively, for background selection and selection against introgressed ancestry. To investigate how barriers to gene flow arise in the sea, we compiled studies modelling the demographic history of divergence in marine organisms and extracted preferred demographic scenarios together with estimates of demographic parameters. These studies show that geographical barriers to gene flow do exist in the sea but that divergence can also occur without strict isolation. Heterogeneity of gene flow was detected in most population pairs suggesting the predominance of semipermeable barriers during divergence. We found a weak positive relationship between the fraction of the genome experiencing reduced gene flow and levels of genome-wide differentiation. Furthermore, we found that the upper bound of the 'grey zone of speciation' for our dataset extended beyond that found before, implying that gene flow between diverging taxa is possible at higher levels of divergence than previously thought. Finally, we list recommendations for further strengthening the use of demographic modelling in speciation research. These include a more balanced representation of taxa, more consistent and comprehensive modelling, clear reporting of results and simulation studies to rule out nonbiological explanations for general results.
Collapse
Affiliation(s)
- Aurélien De Jode
- Department of Marine Sciences‐TjärnöUniversity of GothenburgGothenburgSweden
| | - Alan Le Moan
- Department of Marine Sciences‐TjärnöUniversity of GothenburgGothenburgSweden
| | - Kerstin Johannesson
- Department of Marine Sciences‐TjärnöUniversity of GothenburgGothenburgSweden
| | - Rui Faria
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório AssociadoUniversidade do PortoVairãoPortugal,BIOPOLIS Program in Genomics, Biodiversity and Land PlanningCIBIOVairãoPortugal
| | - Sean Stankowski
- Institute of Science and Technology Austria (IST Austria)KlosterneuburgAustria
| | - Anja Marie Westram
- Institute of Science and Technology Austria (IST Austria)KlosterneuburgAustria,Faculty of Biosciences and AquacultureNord UniversityBodøNorway
| | - Roger K. Butlin
- Department of Marine Sciences‐TjärnöUniversity of GothenburgGothenburgSweden,Ecology and Evolutionary Biology, School of BiosciencesThe University of SheffieldSheffieldUK
| | - Marina Rafajlović
- Department of Marine SciencesUniversity of GothenburgGothenburgSweden
| | | |
Collapse
|