1
|
Matheson J, Masel J. Background Selection From Unlinked Sites Causes Nonindependent Evolution of Deleterious Mutations. Genome Biol Evol 2024; 16:evae050. [PMID: 38482769 PMCID: PMC10972689 DOI: 10.1093/gbe/evae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2024] [Indexed: 04/01/2024] Open
Abstract
Background selection describes the reduction in neutral diversity caused by selection against deleterious alleles at other loci. It is typically assumed that the purging of deleterious alleles affects linked neutral variants, and indeed simulations typically only treat a genomic window. However, background selection at unlinked loci also depresses neutral diversity. In agreement with previous analytical approximations, in our simulations of a human-like genome with a realistically high genome-wide deleterious mutation rate, the effects of unlinked background selection exceed those of linked background selection. Background selection reduces neutral genetic diversity by a factor that is independent of census population size. Outside of genic regions, the strength of background selection increases with the mean selection coefficient, contradicting the linked theory but in agreement with the unlinked theory. Neutral diversity within genic regions is fairly independent of the strength of selection. Deleterious genetic load among haploid individuals is underdispersed, indicating nonindependent evolution of deleterious mutations. Empirical evidence for underdispersion was previously interpreted as evidence for global epistasis, but we recover it from a non-epistatic model.
Collapse
Affiliation(s)
- Joseph Matheson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
- Department of Ecology, Behavior, and Evolution, University of California San Diego, San Diego, CA 92093, USA
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
2
|
Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024; 20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024] Open
Abstract
Across the human genome, there are large-scale fluctuations in genetic diversity caused by the indirect effects of selection. This "linked selection signal" reflects the impact of selection according to the physical placement of functional regions and recombination rates along chromosomes. Previous work has shown that purifying selection acting against the steady influx of new deleterious mutations at functional portions of the genome shapes patterns of genomic variation. To date, statistical efforts to estimate purifying selection parameters from linked selection models have relied on classic Background Selection theory, which is only applicable when new mutations are so deleterious that they cannot fix in the population. Here, we develop a statistical method based on a quantitative genetics view of linked selection, that models how polygenic additive fitness variance distributed along the genome increases the rate of stochastic allele frequency change. By jointly predicting the equilibrium fitness variance and substitution rate due to both strong and weakly deleterious mutations, we estimate the distribution of fitness effects (DFE) and mutation rate across three geographically distinct human samples. While our model can accommodate weaker selection, we find evidence of strong selection operating similarly across all human samples. Although our quantitative genetic model of linked selection fits better than previous models, substitution rates of the most constrained sites disagree with observed divergence levels. We find that a model incorporating selective interference better predicts observed divergence in conserved regions, but overall our results suggest uncertainty remains about the processes generating fitness variation in humans.
Collapse
Affiliation(s)
- Vince Buffalo
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| | - Andrew D. Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
3
|
Novo I, Pérez-Pereira N, Santiago E, Quesada H, Caballero A. An empirical test of the estimation of historical effective population size using Drosophila melanogaster. Mol Ecol Resour 2023; 23:1632-1640. [PMID: 37455584 DOI: 10.1111/1755-0998.13837] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/07/2023] [Accepted: 07/05/2023] [Indexed: 07/18/2023]
Abstract
The availability of a large number of high-density markers (SNPs) allows the estimation of historical effective population size (Ne ) from linkage disequilibrium between loci. A recent refinement of methods to estimate historical Ne from the recent past has been shown to be rather accurate with simulation data. The method has also been applied to real data for numerous species. However, the simulation data cannot encompass all the complexities of real genomes, and the performance of any estimation method with real data is always uncertain, as the true demography of the populations is not known. Here, we carried out an experimental design with Drosophila melanogaster to test the method with real data following a known demographic history. We used a population maintained in the laboratory with a constant census size of about 2800 individuals and subjected the population to a drastic decline to a size of 100 individuals. After a few generations, the population was expanded back to the previous size and after a few further generations again expanded to twice the initial size. Estimates of historical Ne were obtained with the software GONE both for autosomal and X chromosomes from samples of 17 individuals sequenced for the whole genome. Estimates of the historical effective size were able to infer the patterns of changes that occurred in the populations showing generally good performance of the method. We discuss the limitations of the method and the application of the software carried out so far.
Collapse
Affiliation(s)
- Irene Novo
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| | - Noelia Pérez-Pereira
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| | - Enrique Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain
| | - Humberto Quesada
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| | - Armando Caballero
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| |
Collapse
|
4
|
Abstract
Abstract
Few doubt that effective population size (Ne) is one of the most important parameters in evolutionary biology, but how many can say they really understand the concept? Ne is the evolutionary analogue of the number of individuals (or adults) in the population, N. Whereas ecological consequences of population size depend on N, evolutionary consequences (rates of loss of genetic diversity and increase in inbreeding; relative effectiveness of selection) depend on Ne. Formal definitions typically relate effective size to a key population genetic parameter, such as loss of heterozygosity or variance in allele frequency. However, for practical application to real populations, it is more useful to define Ne in terms of three demographic parameters: number of potential parents (adult N), and mean and variance in offspring number. Defined this way, Ne determines the rate of random genetic drift across the entire genome in the offspring generation. Other evolutionary forces (mutation, migration, selection)—together with factors such as variation in recombination rate—can also affect genetic variation, and this leads to heterogeneity across the genome in observed rates of genetic change. For some, it has been convenient to interpret this heterogeneity in terms of heterogeneity in Ne, but unfortunately this has muddled the concepts of genetic drift and effective population size. A commonly-repeated misconception is that Ne is the number of parents that actually contribute genes to the next generation (NP). In reality, NP can be smaller or larger than Ne, and the NP/Ne ratio depends on the sex ratio, the mean and variance in offspring number, and whether inbreeding or variance Ne is of interest.
Collapse
Affiliation(s)
- Robin S Waples
- Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, WA, 98112 USA
| |
Collapse
|
5
|
Novo I, Santiago E, Caballero A. The estimates of effective population size based on linkage disequilibrium are virtually unaffected by natural selection. PLoS Genet 2022; 18:e1009764. [PMID: 35077457 PMCID: PMC8815936 DOI: 10.1371/journal.pgen.1009764] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 02/04/2022] [Accepted: 12/21/2021] [Indexed: 11/19/2022] Open
Abstract
The effective population size (Ne) is a key parameter to quantify the magnitude of genetic drift and inbreeding, with important implications in human evolution. The increasing availability of high-density genetic markers allows the estimation of historical changes in Ne across time using measures of genome diversity or linkage disequilibrium between markers. Directional selection is expected to reduce diversity and Ne, and this reduction is modulated by the heterogeneity of the genome in terms of recombination rate. Here we investigate by computer simulations the consequences of selection (both positive and negative) and recombination rate heterogeneity in the estimation of historical Ne. We also investigate the relationship between diversity parameters and Ne across the different regions of the genome using human marker data. We show that the estimates of historical Ne obtained from linkage disequilibrium between markers (NeLD) are virtually unaffected by selection. In contrast, those estimates obtained by coalescence mutation-recombination-based methods can be strongly affected by it, which could have important consequences for the estimation of human demography. The simulation results are supported by the analysis of human data. The estimates of NeLD obtained for particular genomic regions do not correlate, or they do it very weakly, with recombination rate, nucleotide diversity, proportion of polymorphic sites, background selection statistic, minor allele frequency of SNPs, loss of function and missense variants and gene density. This suggests that NeLD measures mainly reflect demographic changes in population size across generations.
Collapse
Affiliation(s)
- Irene Novo
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| | - Enrique Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain
| | - Armando Caballero
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, Vigo, Spain
| |
Collapse
|
6
|
Novo I, López-Cortegano E, Caballero A. Highly pleiotropic variants of human traits are enriched in genomic regions with strong background selection. Hum Genet 2021; 140:1343-1351. [PMID: 34228221 PMCID: PMC8338839 DOI: 10.1007/s00439-021-02308-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/18/2021] [Indexed: 11/27/2022]
Abstract
Recent studies have shown the ubiquity of pleiotropy for variants affecting human complex traits. These studies also show that rare variants tend to be less pleiotropic than common ones, suggesting that purifying natural selection acts against highly pleiotropic variants of large effect. Here, we investigate the mean frequency, effect size and recombination rate associated with pleiotropic variants, and focus particularly on whether highly pleiotropic variants are enriched in regions with putative strong background selection. We evaluate variants for 41 human traits using data from the NHGRI-EBI GWAS Catalog, as well as data from other three studies. Our results show that variants involving a higher degree of pleiotropy tend to be more common, have larger mean effect sizes, and contribute more to heritability than variants with a lower degree of pleiotropy. This is consistent with the fact that variants of large effect and frequency are more likely detected by GWAS. Using data from four different studies, we also show that more pleiotropic variants are enriched in genome regions with stronger background selection than less pleiotropic variants, suggesting that highly pleiotropic variants are subjected to strong purifying selection. From the above results, we hypothesized that a number of highly pleiotropic variants of low effect/frequency may pass undetected by GWAS.
Collapse
Affiliation(s)
- Irene Novo
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, 36310, Vigo, Spain.
| | - Eugenio López-Cortegano
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, 36310, Vigo, Spain
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Armando Caballero
- Centro de Investigación Mariña, Universidade de Vigo, Facultade de Bioloxía, 36310, Vigo, Spain
| |
Collapse
|
7
|
Santiago E, Caballero A. The value of targeting recombination as a strategy against coronavirus diseases. Heredity (Edinb) 2020; 125:169-172. [PMID: 32606420 PMCID: PMC7325643 DOI: 10.1038/s41437-020-0337-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 06/17/2020] [Accepted: 06/18/2020] [Indexed: 01/10/2023] Open
Affiliation(s)
- Enrique Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain.
| | - Armando Caballero
- Centro de Investigación Mariña, Departamento de Bioquímica, Genética e Inmunología, Edificio CC Experimentais, Campus de Vigo, As Lagoas, Universidade de Vigo, Marcosende, 36310, Vigo, Spain
| |
Collapse
|
8
|
Garaeva AY, Sidorova AE, Levashova NT, Tverdislov VA. A Percolation Lattice of Natural Selection as a Switch of Deterministic and Random Processes in the Mutation Flow. Biophysics (Nagoya-shi) 2020. [DOI: 10.1134/s0006350920030069] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
9
|
The Temporal Dynamics of Background Selection in Nonequilibrium Populations. Genetics 2020; 214:1019-1030. [PMID: 32071195 DOI: 10.1534/genetics.119.302892] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Accepted: 01/30/2020] [Indexed: 01/06/2023] Open
Abstract
Neutral genetic diversity across the genome is determined by the complex interplay of mutation, demographic history, and natural selection. While the direct action of natural selection is limited to functional loci across the genome, its impact can have effects on nearby neutral loci due to genetic linkage. These effects of selection at linked sites, referred to as genetic hitchhiking and background selection (BGS), are pervasive across natural populations. However, only recently has there been a focus on the joint consequences of demography and selection at linked sites, and some empirical studies have come to apparently contradictory conclusions as to their combined effects. To understand the relationship between demography and selection at linked sites, we conducted an extensive forward simulation study of BGS under a range of demographic models. We found that the relative levels of diversity in BGS and neutral regions vary over time and that the initial dynamics after a population size change are often in the opposite direction of the long-term expected trajectory. Our detailed observations of the temporal dynamics of neutral diversity in the context of selection at linked sites in nonequilibrium populations provide new intuition about why patterns of diversity under BGS vary through time in natural populations and help reconcile previously contradictory observations. Most notably, our results highlight that classical models of BGS are poorly suited for predicting diversity in nonequilibrium populations.
Collapse
|
10
|
Comeron JM. Background selection as null hypothesis in population genomics: insights and challenges from Drosophila studies. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0471. [PMID: 29109230 PMCID: PMC5698629 DOI: 10.1098/rstb.2016.0471] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2017] [Indexed: 12/11/2022] Open
Abstract
The consequences of selection at linked sites are multiple and widespread across the genomes of most species. Here, I first review the main concepts behind models of selection and linkage in recombining genomes, present the difficulty in parametrizing these models simply as a reduction in effective population size (Ne) and discuss the predicted impact of recombination rates on levels of diversity across genomes. Arguments are then put forward in favour of using a model of selection and linkage with neutral and deleterious mutations (i.e. the background selection model, BGS) as a sensible null hypothesis for investigating the presence of other forms of selection, such as balancing or positive. I also describe and compare two studies that have generated high-resolution landscapes of the predicted consequences of selection at linked sites in Drosophila melanogaster. Both studies show that BGS can explain a very large fraction of the observed variation in diversity across the whole genome, thus supporting its use as null model. Finally, I identify and discuss a number of caveats and challenges in studies of genetic hitchhiking that have been often overlooked, with several of them sharing a potential bias towards overestimating the evidence supporting recent selective sweeps to the detriment of a BGS explanation. One potential source of bias is the analysis of non-equilibrium populations: it is precisely because models of selection and linkage predict variation in Ne across chromosomes that demographic dynamics are not expected to be equivalent chromosome- or genome-wide. Other challenges include the use of incomplete genome annotations, the assumption of temporally stable recombination landscapes, the presence of genes under balancing selection and the consequences of ignoring non-crossover (gene conversion) recombination events. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, IA 52242, USA .,Interdisciplinary Program in Genetics, University of Iowa, Iowa City, IA 52242, USA
| |
Collapse
|