1
|
Subramanian S. Purifying Selection Influences the Comparison of Heterozygosities between Populations. BIOLOGY 2024; 13:810. [PMID: 39452119 PMCID: PMC11505596 DOI: 10.3390/biology13100810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2024] [Revised: 09/29/2024] [Accepted: 10/09/2024] [Indexed: 10/26/2024]
Abstract
Heterozygosity is a fundamental measure routinely used to compare between populations to infer the level of genetic variation and their relative effective population sizes. However, such comparison is highly influenced by the magnitude of selection pressure on the genomic regions used. Using over 2 million Single Nucleotide Variants (SNVs) from chimpanzee and mouse populations, this study shows that the heterozygosities estimated using neutrally evolving sites of large populations were two times higher than those of small populations. However, this difference was only ~1.6 times for the heterozygosities estimated using nonsynonymous sites. This suggests an excess in the nonsynonymous heterozygosities due to the segregation of deleterious variants in small populations. This excess in the nonsynonymous heterozygosities of the small populations was estimated to be 23-31%. Further analysis revealed that the magnitude of the excess is modulated by effective population size (Ne) and selection intensity (s). Using chimpanzee populations, this investigation found that the excess in nonsynonymous diversity in the small population was little (6%) when the difference between the Ne values of large and small populations was small (2.4 times). Conversely, this was high (23%) when the difference in Ne was large (5.9 times). Analysis using mouse populations showed that the excess in the nonsynonymous diversity of highly constrained genes of the small population was much higher (38%) than that observed for the genes under relaxed selective constraints (21%). Similar results were observed when the expression levels of genes were used as a proxy for selection intensity. These results emphasize the use of neutral regions, less constrained genes, or lowly expressed genes when comparing the heterozygosities between populations.
Collapse
Affiliation(s)
- Sankar Subramanian
- Centre for Bioinnovation, School of Science, Technology, and Engineering, The University of the Sunshine Coast, Moreton Bay, QLD 4502, Australia
| |
Collapse
|
2
|
Marsh JI, Johri P. Biases in ARG-Based Inference of Historical Population Size in Populations Experiencing Selection. Mol Biol Evol 2024; 41:msae118. [PMID: 38874402 PMCID: PMC11245712 DOI: 10.1093/molbev/msae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/05/2024] [Accepted: 06/11/2024] [Indexed: 06/15/2024] Open
Abstract
Inferring the demographic history of populations provides fundamental insights into species dynamics and is essential for developing a null model to accurately study selective processes. However, background selection and selective sweeps can produce genomic signatures at linked sites that mimic or mask signals associated with historical population size change. While the theoretical biases introduced by the linked effects of selection have been well established, it is unclear whether ancestral recombination graph (ARG)-based approaches to demographic inference in typical empirical analyses are susceptible to misinference due to these effects. To address this, we developed highly realistic forward simulations of human and Drosophila melanogaster populations, including empirically estimated variability of gene density, mutation rates, recombination rates, purifying, and positive selection, across different historical demographic scenarios, to broadly assess the impact of selection on demographic inference using a genealogy-based approach. Our results indicate that the linked effects of selection minimally impact demographic inference for human populations, although it could cause misinference in populations with similar genome architecture and population parameters experiencing more frequent recurrent sweeps. We found that accurate demographic inference of D. melanogaster populations by ARG-based methods is compromised by the presence of pervasive background selection alone, leading to spurious inferences of recent population expansion, which may be further worsened by recurrent sweeps, depending on the proportion and strength of beneficial mutations. Caution and additional testing with species-specific simulations are needed when inferring population history with non-human populations using ARG-based approaches to avoid misinference due to the linked effects of selection.
Collapse
Affiliation(s)
- Jacob I Marsh
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
3
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (BETHESDA, MD.) 2024; 14:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum- and linkage disequilibrium-based methods under a variety of evolutionarily realistic null models. We find that whilst site frequency spectrum-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, linkage disequilibrium-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that site frequency spectrum-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst linkage disequilibrium-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g. population structure and admixture) as well as of alternative selective processes (e.g. partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
4
|
Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024; 20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024] Open
Abstract
Across the human genome, there are large-scale fluctuations in genetic diversity caused by the indirect effects of selection. This "linked selection signal" reflects the impact of selection according to the physical placement of functional regions and recombination rates along chromosomes. Previous work has shown that purifying selection acting against the steady influx of new deleterious mutations at functional portions of the genome shapes patterns of genomic variation. To date, statistical efforts to estimate purifying selection parameters from linked selection models have relied on classic Background Selection theory, which is only applicable when new mutations are so deleterious that they cannot fix in the population. Here, we develop a statistical method based on a quantitative genetics view of linked selection, that models how polygenic additive fitness variance distributed along the genome increases the rate of stochastic allele frequency change. By jointly predicting the equilibrium fitness variance and substitution rate due to both strong and weakly deleterious mutations, we estimate the distribution of fitness effects (DFE) and mutation rate across three geographically distinct human samples. While our model can accommodate weaker selection, we find evidence of strong selection operating similarly across all human samples. Although our quantitative genetic model of linked selection fits better than previous models, substitution rates of the most constrained sites disagree with observed divergence levels. We find that a model incorporating selective interference better predicts observed divergence in conserved regions, but overall our results suggest uncertainty remains about the processes generating fitness variation in humans.
Collapse
Affiliation(s)
- Vince Buffalo
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| | - Andrew D. Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
5
|
Soni V, Pfeifer SP, Jensen JD. The effects of mutation and recombination rate heterogeneity on the inference of demography and the distribution of fitness effects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.11.566703. [PMID: 38014252 PMCID: PMC10680612 DOI: 10.1101/2023.11.11.566703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Susanne P. Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| |
Collapse
|
6
|
Devi A, Speyer G, Lynch M. The divergence of mean phenotypes under persistent directional selection. Genetics 2023; 224:iyad091. [PMID: 37200616 PMCID: PMC10552002 DOI: 10.1093/genetics/iyad091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 02/26/2023] [Accepted: 05/04/2023] [Indexed: 05/20/2023] Open
Abstract
Numerous organismal traits, particularly at the cellular level, are likely to be under persistent directional selection across phylogenetic lineages. Unless all mutations affecting such traits have large enough effects to be efficiently selected in all species, gradients in mean phenotypes are expected to arise as a consequence of differences in the power of random genetic drift, which varies by approximately five orders of magnitude across the Tree of Life. Prior theoretical work examining the conditions under which such gradients can arise focused on the simple situation in which all genomic sites affecting the trait have identical and constant mutational effects. Here, we extend this theory to incorporate the more biologically realistic situation in which mutational effects on a trait differ among nucleotide sites. Pursuit of such modifications leads to the development of semi-analytic expressions for the ways in which selective interference arises via linkage effects in single-effects models, which then extend to more complex scenarios. The theory developed clarifies the conditions under which mutations of different selective effects mutually interfere with each others' fixation and shows how variance in effects among sites can substantially modify and extend the expected scaling relationships between mean phenotypes and effective population sizes.
Collapse
Affiliation(s)
- Archana Devi
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Gil Speyer
- Knowledge Enterprise, Arizona State University, Tempe, AZ 85287, USA
| | - Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
7
|
Jiang P, Kreitman M, Reinitz J. The effect of mutational robustness on the evolvability of multicellular organisms and eukaryotic cells. J Evol Biol 2023; 36:906-924. [PMID: 37256290 PMCID: PMC10315174 DOI: 10.1111/jeb.14180] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 03/29/2023] [Accepted: 04/18/2023] [Indexed: 06/01/2023]
Abstract
Canalization involves mutational robustness, the lack of phenotypic change as a result of genetic mutations. Given the large divergence in phenotype across species, understanding the relationship between high robustness and evolvability has been of interest to both theorists and experimentalists. Although canalization was originally proposed in the context of multicellular organisms, the effect of multicellularity and other classes of hierarchical organization on evolvability has not been considered by theoreticians. We address this issue using a Boolean population model with explicit representation of an environment in which individuals with explicit genotype and a hierarchical phenotype representing multicellularity evolve. Robustness is described by a single real number between zero and one which emerges from the genotype-phenotype map. We find that high robustness is favoured in constant environments, and lower robustness is favoured after environmental change. Multicellularity and hierarchical organization severely constrain robustness: peak evolvability occurs at an absolute level of robustness of about 0.99 compared with values of about 0.5 in a classical neutral network model. These constraints result in a sharp peak of evolvability in which the maximum is set by the fact that the fixation of adaptive mutations becomes more improbable as robustness decreases. When robustness is put under genetic control, robustness levels leading to maximum evolvability are selected for, but maximal relative fitness appears to require recombination.
Collapse
Affiliation(s)
- Pengyao Jiang
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Martin Kreitman
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois, USA
- Institute for Genomics & Systems Biology, Chicago, Illinois, USA
| | - John Reinitz
- Department of Ecology & Evolution, University of Chicago, Chicago, Illinois, USA
- Institute for Genomics & Systems Biology, Chicago, Illinois, USA
- Department of Statistics, University of Chicago, Chicago, Illinois, USA
- Department of Molecular Genetics and Cell Biology, University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
8
|
Why does the X chromosome lag behind autosomes in GWAS findings? PLoS Genet 2023; 19:e1010472. [PMID: 36848382 PMCID: PMC9997976 DOI: 10.1371/journal.pgen.1010472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 03/09/2023] [Accepted: 02/15/2023] [Indexed: 03/01/2023] Open
Abstract
The X-chromosome is among the largest human chromosomes. It differs from autosomes by a number of important features including hemizygosity in males, an almost complete inactivation of one copy in females, and unique patterns of recombination. We used data from the Catalog of Published Genome Wide Association Studies to compare densities of the GWAS-detected SNPs on the X-chromosome and autosomes. The density of GWAS-detected SNPs on the X-chromosome is 6-fold lower compared to the density of the GWAS-detected SNPs on autosomes. Differences between the X-chromosome and autosomes cannot be explained by differences in the overall SNP density, lower X-chromosome coverage by genotyping platforms or low call rate of X-chromosomal SNPs. Similar differences in the density of GWAS-detected SNPs were found in female-only GWASs (e.g. ovarian cancer GWASs). We hypothesized that the lower density of GWAS-detected SNPs on the X-chromosome compared to autosomes is not a result of a methodological bias, e.g. differences in coverage or call rates, but has a real underlying biological reason-a lower density of functional SNPs on the X-chromosome versus autosomes. This hypothesis is supported by the observation that (i) the overall SNP density of X-chromosome is lower compared to the SNP density on autosomes and that (ii) the density of genic SNPs on the X-chromosome is lower compared to autosomes while densities of intergenic SNPs are similar.
Collapse
|
9
|
Frankham R. Effects of genomic homozygosity on total fitness in an invertebrate: lethal equivalent estimates for Drosophila melanogaster. CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01493-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
10
|
Parker DJ, Jaron KS, Dumas Z, Robinson‐Rechavi M, Schwander T. X chromosomes show relaxed selection and complete somatic dosage compensation across
Timema
stick insect species. J Evol Biol 2022; 35:1734-1750. [PMID: 35933721 PMCID: PMC10087215 DOI: 10.1111/jeb.14075] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 05/06/2022] [Accepted: 07/14/2022] [Indexed: 11/29/2022]
Abstract
Sex chromosomes have evolved repeatedly across the tree of life. As they are present in different copy numbers in males and females, they are expected to experience different selection pressures than the autosomes, with consequences including a faster rate of evolution, increased accumulation of sexually antagonistic alleles and the evolution of dosage compensation. Whether these consequences are general or linked to idiosyncrasies of specific taxa is not clear as relatively few taxa have been studied thus far. Here, we use whole-genome sequencing to identify and characterize the evolution of the X chromosome in five species of Timema stick insects with XX:X0 sex determination. The X chromosome had a similar size (approximately 12% of the genome) and gene content across all five species, suggesting that the X chromosome originated prior to the diversification of the genus. Genes on the X showed evidence of relaxed selection (elevated dN/dS) and a slower evolutionary rate (dN + dS) than genes on the autosomes, likely due to sex-biased mutation rates. Genes on the X also showed almost complete dosage compensation in somatic tissues (heads and legs), but dosage compensation was absent in the reproductive tracts. Contrary to prediction, sex-biased genes showed little enrichment on the X, suggesting that the advantage X-linkage provides to the accumulation of sexually antagonistic alleles is weak. Overall, we found the consequences of X-linkage on gene sequences and expression to be similar across Timema species, showing the characteristics of the X chromosome are surprisingly consistent over 30 million years of evolution.
Collapse
Affiliation(s)
- Darren J. Parker
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
- Swiss Institute of Bioinformatics Lausanne Switzerland
- School of Natural Sciences Bangor University Bangor UK
| | - Kamil S. Jaron
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
- Swiss Institute of Bioinformatics Lausanne Switzerland
- School of Biological Sciences Institute of Evolutionary Biology University of Edinburgh Edinburgh UK
| | - Zoé Dumas
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
| | - Marc Robinson‐Rechavi
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
- Swiss Institute of Bioinformatics Lausanne Switzerland
| | - Tanja Schwander
- Department of Ecology and Evolution University of Lausanne Lausanne Switzerland
| |
Collapse
|
11
|
Johri P, Aquadro CF, Beaumont M, Charlesworth B, Excoffier L, Eyre-Walker A, Keightley PD, Lynch M, McVean G, Payseur BA, Pfeifer SP, Stephan W, Jensen JD. Recommendations for improving statistical inference in population genomics. PLoS Biol 2022; 20:e3001669. [PMID: 35639797 PMCID: PMC9154105 DOI: 10.1371/journal.pbio.3001669] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The field of population genomics has grown rapidly in response to the recent advent of affordable, large-scale sequencing technologies. As opposed to the situation during the majority of the 20th century, in which the development of theoretical and statistical population genetic insights outpaced the generation of data to which they could be applied, genomic data are now being produced at a far greater rate than they can be meaningfully analyzed and interpreted. With this wealth of data has come a tendency to focus on fitting specific (and often rather idiosyncratic) models to data, at the expense of a careful exploration of the range of possible underlying evolutionary processes. For example, the approach of directly investigating models of adaptive evolution in each newly sequenced population or species often neglects the fact that a thorough characterization of ubiquitous nonadaptive processes is a prerequisite for accurate inference. We here describe the perils of these tendencies, present our consensus views on current best practices in population genomic data analysis, and highlight areas of statistical inference and theory that are in need of further attention. Thereby, we argue for the importance of defining a biologically relevant baseline model tuned to the details of each new analysis, of skepticism and scrutiny in interpreting model fitting results, and of carefully defining addressable hypotheses and underlying uncertainties.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Charles F. Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, Bristol, United Kingdom
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Peter D. Keightley
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Michael Lynch
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Gil McVean
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Bret A. Payseur
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Susanne P. Pfeifer
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | | | - Jeffrey D. Jensen
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| |
Collapse
|
12
|
Moinet A, Schlichta F, Peischl S, Excoffier L. Strong neutral sweeps occurring during a population contraction. Genetics 2022; 220:6529544. [PMID: 35171980 PMCID: PMC8982045 DOI: 10.1093/genetics/iyac021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 01/22/2022] [Indexed: 11/14/2022] Open
Abstract
A strong reduction in diversity around a specific locus is often interpreted as a recent rapid fixation of a positively selected allele, a phenomenon called a selective sweep. Rapid fixation of neutral variants can however lead to a similar reduction in local diversity, especially when the population experiences changes in population size, e.g. bottlenecks or range expansions. The fact that demographic processes can lead to signals of nucleotide diversity very similar to signals of selective sweeps is at the core of an ongoing discussion about the roles of demography and natural selection in shaping patterns of neutral variation. Here, we quantitatively investigate the shape of such neutral valleys of diversity under a simple model of a single population size change, and we compare it to signals of a selective sweep. We analytically describe the expected shape of such "neutral sweeps" and show that selective sweep valleys of diversity are, for the same fixation time, wider than neutral valleys. On the other hand, it is always possible to parametrize our model to find a neutral valley that has the same width as a given selected valley. Our findings provide further insight into how simple demographic models can create valleys of genetic diversity similar to those attributed to positive selection.
Collapse
Affiliation(s)
- Antoine Moinet
- Interfaculty Bioinformatics Unit, University of Bern, Bern 3012, Switzerland,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland,Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
| | - Flávia Schlichta
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland,Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
| | - Stephan Peischl
- Interfaculty Bioinformatics Unit, University of Bern, Bern 3012, Switzerland,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland,Corresponding author.
| | - Laurent Excoffier
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland,Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
| |
Collapse
|
13
|
Morales-Arce AY, Johri P, Jensen JD. Inferring the distribution of fitness effects in patient-sampled and experimental virus populations: two case studies. Heredity (Edinb) 2022; 128:79-87. [PMID: 34987185 PMCID: PMC8728706 DOI: 10.1038/s41437-021-00493-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 12/12/2021] [Accepted: 12/13/2021] [Indexed: 11/19/2022] Open
Abstract
We here propose an analysis pipeline for inferring the distribution of fitness effects (DFE) from either patient-sampled or experimentally-evolved viral populations, that explicitly accounts for non-Wright-Fisher and non-equilibrium population dynamics inherent to pathogens. We examine the performance of this approach via extensive power and performance analyses, and highlight two illustrative applications - one from an experimentally-passaged RNA virus, and the other from a clinically-sampled DNA virus. Finally, we discuss how such DFE inference may shed light on major research questions in virus evolution, ranging from a quantification of the population genetic processes governing genome size, to the role of Hill-Robertson interference in dictating adaptive outcomes, to the potential design of novel therapeutic approaches to eradicate within-patient viral populations via induced mutational meltdown.
Collapse
Affiliation(s)
- Ana Y Morales-Arce
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Parul Johri
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ, USA.
| |
Collapse
|
14
|
Bourgeois Y, Fields P, Bento G, Ebert D. Balancing selection for pathogen resistance reveals an intercontinental signature of Red Queen coevolution. Mol Biol Evol 2021; 38:4918-4933. [PMID: 34289047 PMCID: PMC8557431 DOI: 10.1093/molbev/msab217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The link between long-term host–parasite coevolution and genetic diversity is key to understanding genetic epidemiology and the evolution of resistance. The model of Red Queen host–parasite coevolution posits that high genetic diversity is maintained when rare host resistance variants have a selective advantage, which is believed to be the mechanistic basis for the extraordinarily high levels of diversity at disease-related genes such as the major histocompatibility complex in jawed vertebrates and R-genes in plants. The parasites that drive long-term coevolution are, however, often elusive. Here we present evidence for long-term balancing selection at the phenotypic (variation in resistance) and genomic (resistance locus) level in a particular host–parasite system: the planktonic crustacean Daphnia magna and the bacterium Pasteuria ramosa. The host shows widespread polymorphisms for pathogen resistance regardless of geographic distance, even though there is a clear genome-wide pattern of isolation by distance at other sites. In the genomic region of a previously identified resistance supergene, we observed consistent molecular signals of balancing selection, including higher genetic diversity, older coalescence times, and lower differentiation between populations, which set this region apart from the rest of the genome. We propose that specific long-term coevolution by negative-frequency-dependent selection drives this elevated diversity at the host's resistance loci on an intercontinental scale and provide an example of a direct link between the host’s resistance to a virulent pathogen and the large-scale diversity of its underlying genes.
Collapse
Affiliation(s)
- Yann Bourgeois
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Peter Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Gilberto Bento
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| |
Collapse
|
15
|
Huang X, Fortier AL, Coffman AJ, Struck TJ, Irby MN, James JE, León-Burguete JE, Ragsdale AP, Gutenkunst RN. Inferring genome-wide correlations of mutation fitness effects between populations. Mol Biol Evol 2021; 38:4588-4602. [PMID: 34043790 PMCID: PMC8476148 DOI: 10.1093/molbev/msab162] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The effect of a mutation on fitness may differ between populations depending on environmental and genetic context, but little is known about the factors that underlie such differences. To quantify genome-wide correlations in mutation fitness effects, we developed a novel concept called a joint distribution of fitness effects (DFE) between populations. We then proposed a new statistic w to measure the DFE correlation between populations. Using simulation, we showed that inferring the DFE correlation from the joint allele frequency spectrum is statistically precise and robust. Using population genomic data, we inferred DFE correlations of populations in humans, Drosophila melanogaster, and wild tomatoes. In these species, we found that the overall correlation of the joint DFE was inversely related to genetic differentiation. In humans and D. melanogaster, deleterious mutations had a lower DFE correlation than tolerated mutations, indicating a complex joint DFE. Altogether, the DFE correlation can be reliably inferred, and it offers extensive insight into the genetics of population divergence.
Collapse
|
16
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
17
|
Abstract
The great apes play an important role as model organisms. They are our closest living relatives, allowing us to identify the genetic basis of phenotypic traits that we think of as characteristically human. However, the most significant asset of great apes as model organisms is that they share with humans most of their genetic makeup. This means that we can extend our vast knowledge of the human genome, its genes, and the associated phenotypes to these species. Comparative genomic studies of humans and apes thus reveal how very similar genomes react when exposed to different population genetic regimes. In this way, each species represents a natural experiment, where a genome highly similar to the human one, is differently exposed to the evolutionary forces of demography, population structure, selection, recombination, and admixture/hybridization. The initial sequencing of reference genomes for chimpanzee, orangutan, gorilla, the bonobo, each provided new insights and a second generation of sequencing projects has provided diversity data for all the great apes. In this chapter, we will outline some of the findings that population genomic analysis of great apes has provided, and how comparative studies have helped us understand how the fundamental forces in evolution have contributed to shaping the genomes and the genetic diversity of the great apes.
Collapse
Affiliation(s)
- David Castellano
- Bioinformatics and Genomics, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Kasper Munch
- Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
| |
Collapse
|
18
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The impact of purifying and background selection on the inference of population history: problems and prospects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 33501439 PMCID: PMC7836109 DOI: 10.1101/2020.04.28.066365] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Current procedures for inferring population history generally assume complete neutrality - that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects (DFE) and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the DFE as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne 3012, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
19
|
Jensen JD, Stikeleather RA, Kowalik TF, Lynch M. Imposed mutational meltdown as an antiviral strategy. Evolution 2020; 74:2549-2559. [PMID: 33047822 PMCID: PMC7993354 DOI: 10.1111/evo.14107] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 09/30/2020] [Accepted: 10/10/2020] [Indexed: 12/25/2022]
Abstract
Following widespread infections of the most recent coronavirus known to infect humans, SARS‐CoV‐2, attention has turned to potential therapeutic options. With no drug or vaccine yet approved, one focal point of research is to evaluate the potential value of repurposing existing antiviral treatments, with the logical strategy being to identify at least a short‐term intervention to prevent within‐patient progression, while long‐term vaccine strategies unfold. Here, we offer an evolutionary/population‐genetic perspective on one approach that may overwhelm the capacity for pathogen defense (i.e., adaptation) – induced mutational meltdown – providing an overview of key concepts, review of previous theoretical and experimental work of relevance, and guidance for future research. Applied with appropriate care, including target specificity, induced mutational meltdown may provide a general, rapidly implemented approach for the within‐patient eradication of a wide range of pathogens or other undesirable microorganisms.
Collapse
Affiliation(s)
- Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, Arizona, 85281.,Center for Evolution & Medicine, Arizona State University, Tempe, Arizona, 85281
| | - Ryan A Stikeleather
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, 85281
| | - Timothy F Kowalik
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, Worcester, Massachusetts, 01655
| | - Michael Lynch
- School of Life Sciences, Arizona State University, Tempe, Arizona, 85281.,Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, 85281
| |
Collapse
|
20
|
Santiago E, Caballero A. The value of targeting recombination as a strategy against coronavirus diseases. Heredity (Edinb) 2020; 125:169-172. [PMID: 32606420 PMCID: PMC7325643 DOI: 10.1038/s41437-020-0337-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2020] [Revised: 06/17/2020] [Accepted: 06/18/2020] [Indexed: 01/10/2023] Open
Affiliation(s)
- Enrique Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain.
| | - Armando Caballero
- Centro de Investigación Mariña, Departamento de Bioquímica, Genética e Inmunología, Edificio CC Experimentais, Campus de Vigo, As Lagoas, Universidade de Vigo, Marcosende, 36310, Vigo, Spain
| |
Collapse
|
21
|
Reay WR, Cairns MJ. Pairwise common variant meta-analyses of schizophrenia with other psychiatric disorders reveals shared and distinct gene and gene-set associations. Transl Psychiatry 2020; 10:134. [PMID: 32398653 PMCID: PMC7217970 DOI: 10.1038/s41398-020-0817-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 04/20/2020] [Accepted: 04/23/2020] [Indexed: 01/09/2023] Open
Abstract
The complex aetiology of schizophrenia is postulated to share components with other psychiatric disorders. We investigated pleiotropy amongst the common variant genomics of schizophrenia and seven other psychiatric disorders using a multimarker association test. Transcriptomic imputation was then leveraged to investigate the functional significance of variation mapped to these genes, prioritising several interesting functional candidates. Gene-based analysis of common variation revealed 67 schizophrenia-associated genes shared with other psychiatric phenotypes, including bipolar disorder, major depressive disorder, ADHD and autism-spectrum disorder. In addition, we uncovered 78 genes significantly enriched with common variant associations for schizophrenia that were not linked to any of these seven disorders (P > 0.05). Multivariable gene-set association suggested that common variation enrichment within biologically constrained genes observed for schizophrenia also occurs across several psychiatric phenotypes. Pairwise meta-analysis of schizophrenia and each psychiatric phenotype was implemented and identified 330 significantly associated genes (PMeta < 2.7 × 10-6) that were only nominally associated with each disorder individually (P < 0.05). These analyses consolidate the overlap between the genomic architecture of schizophrenia and other psychiatric disorders, uncovering several candidate pleiotropic genes which warrant further investigation.
Collapse
Affiliation(s)
- William R Reay
- School of Biomedical Sciences and Pharmacy, The University of Newcastle, Callaghan, NSW, Australia
- Centre for Brain and Mental Health Research, Hunter Medical Research Institute, Newcastle, NSW, Australia
| | - Murray J Cairns
- School of Biomedical Sciences and Pharmacy, The University of Newcastle, Callaghan, NSW, Australia.
- Centre for Brain and Mental Health Research, Hunter Medical Research Institute, Newcastle, NSW, Australia.
| |
Collapse
|
22
|
Johri P, Charlesworth B, Jensen JD. Toward an Evolutionarily Appropriate Null Model: Jointly Inferring Demography and Purifying Selection. Genetics 2020; 215:173-192. [PMID: 32152045 PMCID: PMC7198275 DOI: 10.1534/genetics.119.303002] [Citation(s) in RCA: 89] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 03/05/2020] [Indexed: 01/27/2023] Open
Abstract
The question of the relative evolutionary roles of adaptive and nonadaptive processes has been a central debate in population genetics for nearly a century. While advances have been made in the theoretical development of the underlying models, and statistical methods for estimating their parameters from large-scale genomic data, a framework for an appropriate null model remains elusive. A model incorporating evolutionary processes known to be in constant operation, genetic drift (as modulated by the demographic history of the population) and purifying selection, is lacking. Without such a null model, the role of adaptive processes in shaping within- and between-population variation may not be accurately assessed. Here, we investigate how population size changes and the strength of purifying selection affect patterns of variation at "neutral" sites near functional genomic components. We propose a novel statistical framework for jointly inferring the contribution of the relevant selective and demographic parameters. By means of extensive performance analyses, we quantify the utility of the approach, identify the most important statistics for parameter estimation, and compare the results with existing methods. Finally, we reanalyze genome-wide population-level data from a Zambian population of Drosophila melanogaster, and find that it has experienced a much slower rate of population growth than was inferred when the effects of purifying selection were neglected. Our approach represents an appropriate null model, against which the effects of positive selection can be assessed.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| |
Collapse
|
23
|
Booker TR, Keightley PD. Understanding the Factors That Shape Patterns of Nucleotide Diversity in the House Mouse Genome. Mol Biol Evol 2019; 35:2971-2988. [PMID: 30295866 PMCID: PMC6278861 DOI: 10.1093/molbev/msy188] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
A major goal of population genetics has been to determine the extent by which selection at linked sites influences patterns of neutral nucleotide diversity in the genome. Multiple lines of evidence suggest that diversity is influenced by both positive and negative selection. For example, in many species there are troughs in diversity surrounding functional genomic elements, consistent with the action of either background selection (BGS) or selective sweeps. In this study, we investigated the causes of the diversity troughs that are observed in the wild house mouse genome. Using the unfolded site frequency spectrum, we estimated the strength and frequencies of deleterious and advantageous mutations occurring in different functional elements in the genome. We then used these estimates to parameterize forward-in-time simulations of chromosomes, using realistic distributions of functional elements and recombination rate variation in order to determine whether selection at linked sites can explain the observed patterns of nucleotide diversity. The simulations suggest that BGS alone cannot explain the dips in diversity around either exons or conserved noncoding elements. A combination of BGS and selective sweeps produces deeper dips in diversity than BGS alone, but the inferred parameters of selection cannot fully explain the patterns observed in the genome. Our results provide evidence of sweeps shaping patterns of nucleotide diversity across the mouse genome and also suggest that infrequent, strongly advantageous mutations play an important role in this. The limitations of using the unfolded site frequency spectrum for inferring the frequency and effects of advantageous mutations are discussed.
Collapse
Affiliation(s)
- Tom R Booker
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.,Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Peter D Keightley
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
24
|
Fraser BA, Whiting JR. What can be learned by scanning the genome for molecular convergence in wild populations? Ann N Y Acad Sci 2019; 1476:23-42. [PMID: 31241191 PMCID: PMC7586825 DOI: 10.1111/nyas.14177] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/24/2019] [Accepted: 06/04/2019] [Indexed: 12/11/2022]
Abstract
Convergent evolution, where independent lineages evolve similar phenotypes in response to similar challenges, can provide valuable insight into how selection operates and the limitations it encounters. However, it has only recently become possible to explore how convergent evolution is reflected at the genomic level. The overlapping outlier approach (OOA), where genome scans of multiple independent lineages are used to find outliers that overlap and therefore identify convergently evolving loci, is becoming popular. Here, we present a quantitative analysis of 34 studies that used this approach across many sampling designs, taxa, and sampling intensities. We found that OOA studies with increased biological sampling power within replicates have increased likelihood of finding overlapping, "convergent" signals of adaptation between them. When identifying convergent loci as overlapping outliers, it is tempting to assume that any false-positive outliers derived from individual scans will fail to overlap across replicates, but this cannot be guaranteed. We highlight how population demographics and genomic context can contribute toward both true convergence and false positives in OOA studies. We finish with an exploration of emerging methods that couple genome scans with phenotype and environmental measures, leveraging added information from genome data to more directly test hypotheses of the likelihood of convergent evolution.
Collapse
Affiliation(s)
- Bonnie A Fraser
- Department of Biosciences, University of Exeter, Exeter, United Kingdom
| | - James R Whiting
- Department of Biosciences, University of Exeter, Exeter, United Kingdom
| |
Collapse
|
25
|
Ecological and Evolutionary Processes Shaping Viral Genetic Diversity. Viruses 2019; 11:v11030220. [PMID: 30841497 PMCID: PMC6466605 DOI: 10.3390/v11030220] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 02/22/2019] [Accepted: 02/27/2019] [Indexed: 02/07/2023] Open
Abstract
The contemporary genomic diversity of viruses is a result of the continuous and dynamic interaction of past ecological and evolutionary processes. Thus, genome sequences of viruses can be a valuable source of information about these processes. In this review, we first describe the relevant processes shaping viral genomic variation, with a focus on the role of host–virus coevolution and its potential to give rise to eco-evolutionary feedback loops. We further give a brief overview of available methodology designed to extract information about these processes from genomic data. Short generation times and small genomes make viruses ideal model systems to study the joint effect of complex coevolutionary and eco-evolutionary interactions on genetic evolution. This complexity, together with the diverse array of lifetime and reproductive strategies in viruses ask for extensions of existing inference methods, for example by integrating multiple information sources. Such integration can broaden the applicability of genetic inference methods and thus further improve our understanding of the role viruses play in biological communities.
Collapse
|
26
|
Schrago CG, Seuánez HN. Large ancestral effective population size explains the difficult phylogenetic placement of owl monkeys. Am J Primatol 2019; 81:e22955. [PMID: 30779198 DOI: 10.1002/ajp.22955] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Revised: 12/05/2018] [Accepted: 12/15/2018] [Indexed: 11/07/2022]
Abstract
The phylogenetic position of owl monkeys, grouped in the genus Aotus, has been a controversial issue for understanding Neotropical primate evolution. Explanations of the difficult phylogenetic assignment of owl monkeys have been elusive, frequently relying on insufficient data (stochastic error) or scenarios of rapid speciation (adaptive radiation) events. Using a coalescent-based approach, we explored the population-level mechanisms likely explaining these topological discrepancies. We examined the topological variance of 2,192 orthologous genes shared between representatives of the three major Cebidae lineages and the outgroup. By employing a methodological framework that allows for reticulated tree topologies, our analysis explicitly tested for non-dichotomous evolutionary processes impacting the finding of the position of owl monkeys in the cebid phylogeny. Our findings indicated that Aotus is a sister lineage of the callitrichines. Most gene trees (>50%) failed to recover the species tree topology, although the distribution of gene trees mismatching the true species topology followed the standard expectation of the multispecies coalescent without reticulation. We showed that the large effective population size of the common ancestor of Aotus and callitrichines was the most likely factor responsible for generating phylogenetic uncertainty. On the other hand, fast speciation scenarios or introgression played minor roles. We propose that the difficult phylogenetic placement of Aotus is explained by population-level processes associated with the large ancestral effective size. These results shed light on the biogeography of the early cebid diversification in the Miocene, highlighting the relevance of evaluating phylogenetic relationships employing population-aware approaches.
Collapse
Affiliation(s)
- Carlos G Schrago
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Hector N Seuánez
- Department of Genetics, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.,Division of Genetics, National Cancer Institute, Rio de Janeiro, Brazil
| |
Collapse
|
27
|
Local PCA Shows How the Effect of Population Structure Differs Along the Genome. Genetics 2018; 211:289-304. [PMID: 30459280 DOI: 10.1534/genetics.118.301747] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 11/05/2018] [Indexed: 11/18/2022] Open
Abstract
Population structure leads to systematic patterns in measures of mean relatedness between individuals in large genomic data sets, which are often discovered and visualized using dimension reduction techniques such as principal component analysis (PCA). Mean relatedness is an average of the relationships across locus-specific genealogical trees, which can be strongly affected on intermediate genomic scales by linked selection and other factors. We show how to use local PCA to describe this intermediate-scale heterogeneity in patterns of relatedness, and apply the method to genomic data from three species, finding in each that the effect of population structure can vary substantially across only a few megabases. In a global human data set, localized heterogeneity is likely explained by polymorphic chromosomal inversions. In a range-wide data set of Medicago truncatula, factors that produce heterogeneity are shared between chromosomes, correlate with local gene density, and may be caused by linked selection, such as background selection or local adaptation. In a data set of primarily African Drosophila melanogaster, large-scale heterogeneity across each chromosome arm is explained by known chromosomal inversions thought to be under recent selection and, after removing samples carrying inversions, remaining heterogeneity is correlated with recombination rate and gene density, again suggesting a role for linked selection. The visualization method provides a flexible new way to discover biological drivers of genetic variation, and its application to data highlights the strong effects that linked selection and chromosomal inversions can have on observed patterns of genetic variation.
Collapse
|
28
|
Perrier C, Charmantier A. On the importance of time scales when studying adaptive evolution. Evol Lett 2018; 3:240-247. [PMID: 31171979 PMCID: PMC6546376 DOI: 10.1002/evl3.86] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2018] [Revised: 09/25/2018] [Accepted: 10/03/2018] [Indexed: 01/22/2023] Open
Abstract
Long‐term field studies coupled with quantitative genomics offer a powerful means to understand the genetic bases underlying quantitative traits and their evolutionary changes. However, analyzing and interpreting the time scales at which adaptive evolution occurs is challenging. First, while evolution is predictable in the short term, with strikingly rapid phenotypic changes in data series, it remains unpredictable in the long term. Second, while the temporal dynamics of some loci with large effects on phenotypic variation and fitness have been characterized, this task can be complicated in cases of highly polygenic trait architecture implicating numerous small effect size loci, or when statistical tests are sensitive to the heterogeneity of some key characteristics of the genome, like variation in recombination rate along the chromosomes. After introducing these aforementioned challenges, we discuss a recent investigation of the genomic architecture and spatio‐temporal variation in great tit bill length, which was related to the recent use of bird feeders. We discuss how this case study illustrates the importance of considering different temporal scales and evolutionary mechanisms both while analyzing trait temporal trends and when searching for and interpreting the signals of putative genomic footprints of selection. More generally this commentary discusses interesting challenges for unraveling the time scale at which adaptive traits evolve and their genomic bases.
Collapse
Affiliation(s)
- Charles Perrier
- CEFE UMR 5175, CNRS Université de Montpellier, Université Paul-Valery Montpellier Montpellier cedex 05 France
| | - Anne Charmantier
- CEFE UMR 5175, CNRS Université de Montpellier, Université Paul-Valery Montpellier Montpellier cedex 05 France
| |
Collapse
|
29
|
High-throughput inference of pairwise coalescence times identifies signals of selection and enriched disease heritability. Nat Genet 2018; 50:1311-1317. [PMID: 30104759 PMCID: PMC6145075 DOI: 10.1038/s41588-018-0177-x] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Accepted: 06/21/2018] [Indexed: 12/19/2022]
Abstract
Interest in reconstructing demographic histories has motivated the development of methods to estimate locus-specific pairwise coalescence times from whole-genome sequence data. Here we introduce a powerful new method, ASMC, that can estimate coalescence times using only SNP array data, and is orders of magnitude faster than previous approaches. We applied ASMC to detect recent positive selection in 113,851 phased British samples from the UK Biobank, and detected 12 genome-wide significant signals, including 6 novel loci. We also applied ASMC to sequencing data from 498 Dutch individuals to detect background selection at deeper time scales. We detected strong heritability enrichment in regions of high background selection in an analysis of 20 independent diseases and complex traits using stratified LD score regression, conditioned on a broad set of functional annotations (including other background selection annotations). These results underscore the widespread effects of background selection on the genetic architecture of complex traits.
Collapse
|
30
|
Alachiotis N, Pavlidis P. RAiSD detects positive selection based on multiple signatures of a selective sweep and SNP vectors. Commun Biol 2018; 1:79. [PMID: 30271960 PMCID: PMC6123745 DOI: 10.1038/s42003-018-0085-8] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 06/05/2018] [Indexed: 12/16/2022] Open
Abstract
Selective sweeps leave distinct signatures locally in genomes, enabling the detection of loci that have undergone recent positive selection. Multiple signatures of a selective sweep are known, yet each neutrality test only identifies a single signature. We present RAiSD (Raised Accuracy in Sweep Detection), an open-source software that implements a novel, to our knowledge, and parameter-free detection mechanism that relies on multiple signatures of a selective sweep via the enumeration of SNP vectors. RAiSD achieves higher sensitivity and accuracy than the current state of the art, while the computational complexity is greatly reduced, allowing up to 1000 times faster processing than widely used tools, and negligible memory requirements. Nikolaos Alachiotis and Pavlos Pavlidis present RAiSD, a computational method for identifying multiple signatures of selective sweeps using single nucleotide polymorphism vectors. They show that RAiSD has higher sensitivity and accuracy with reduced computational complexity than current methods.
Collapse
Affiliation(s)
- Nikolaos Alachiotis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013, Heraklion, Crete, Greece.
| | - Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Nikolaou Plastira 100, 70013, Heraklion, Crete, Greece.
| |
Collapse
|
31
|
Pardiñas AF, Holmans P, Pocklington AJ, Escott-Price V, Ripke S, Carrera N, Legge SE, Bishop S, Cameron D, Hamshere ML, Han J, Hubbard L, Lynham A, Mantripragada K, Rees E, MacCabe JH, McCarroll SA, Baune BT, Breen G, Byrne EM, Dannlowski U, Eley TC, Hayward C, Martin NG, McIntosh AM, Plomin R, Porteous DJ, Wray NR, Caballero A, Geschwind DH, Huckins LM, Ruderfer DM, Santiago E, Sklar P, Stahl EA, Won H, Agerbo E, Als TD, Andreassen OA, Bækvad-Hansen M, Mortensen PB, Pedersen CB, Børglum AD, Bybjerg-Grauholm J, Djurovic S, Durmishi N, Pedersen MG, Golimbet V, Grove J, Hougaard DM, Mattheisen M, Molden E, Mors O, Nordentoft M, Pejovic-Milovancevic M, Sigurdsson E, Silagadze T, Hansen CS, Stefansson K, Stefansson H, Steinberg S, Tosato S, Werge T, Collier DA, Rujescu D, Kirov G, Owen MJ, O'Donovan MC, Walters JTR. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat Genet 2018; 50:381-389. [PMID: 29483656 PMCID: PMC5918692 DOI: 10.1038/s41588-018-0059-2] [Citation(s) in RCA: 983] [Impact Index Per Article: 163.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 01/07/2018] [Indexed: 12/13/2022]
Abstract
Schizophrenia is a debilitating psychiatric condition often associated with poor quality of life and decreased life expectancy. Lack of progress in improving treatment outcomes has been attributed to limited knowledge of the underlying biology, although large-scale genomic studies have begun to provide insights. We report a new genome-wide association study of schizophrenia (11,260 cases and 24,542 controls), and through meta-analysis with existing data we identify 50 novel associated loci and 145 loci in total. Through integrating genomic fine-mapping with brain expression and chromosome conformation data, we identify candidate causal genes within 33 loci. We also show for the first time that the common variant association signal is highly enriched among genes that are under strong selective pressures. These findings provide new insights into the biology and genetic architecture of schizophrenia, highlight the importance of mutation-intolerant genes and suggest a mechanism by which common risk variants persist in the population.
Collapse
Affiliation(s)
- Antonio F Pardiñas
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Peter Holmans
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Andrew J Pocklington
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Valentina Escott-Price
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Stephan Ripke
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Psychiatry and Psychotherapy, Charité, Campus Mitte, Berlin, Germany
| | - Noa Carrera
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Sophie E Legge
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Sophie Bishop
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Darren Cameron
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Marian L Hamshere
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Jun Han
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Leon Hubbard
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Amy Lynham
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Kiran Mantripragada
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Elliott Rees
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - James H MacCabe
- Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Steven A McCarroll
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bernhard T Baune
- Discipline of Psychiatry, University of Adelaide, Adelaide, South Australia, Australia
| | - Gerome Breen
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- NIHR Biomedical Research Centre for Mental Health, Maudsley Hospital and Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Enda M Byrne
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
| | - Udo Dannlowski
- Department of Psychiatry and Psychotherapy, University of Münster, Münster, Germany
| | - Thalia C Eley
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Caroline Hayward
- Medical Genetics Section, Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Nicholas G Martin
- School of Psychology, University of Queensland, Brisbane, Queensland, Australia
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Andrew M McIntosh
- Division of Psychiatry, University of Edinburgh, Edinburgh, UK
- Centre for Cognitive Ageing and Cognitive Epidemiology, University of Edinburgh, Edinburgh, UK
| | - Robert Plomin
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - David J Porteous
- Medical Genetics Section, Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Naomi R Wray
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
| | - Armando Caballero
- Departamento de Bioquímica, Genética e Inmunología. Facultad de Biología, Universidad de Vigo, Vigo, Spain
| | - Daniel H Geschwind
- Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Laura M Huckins
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Douglas M Ruderfer
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Enrique Santiago
- Departamento de Biología Funcional. Facultad de Biología, Universidad de Oviedo, Oviedo, Spain
| | - Pamela Sklar
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Eli A Stahl
- Division of Psychiatric Genomics, Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Hyejung Won
- Department of Neurology, Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Esben Agerbo
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- National Centre for Register-Based Research, Aarhus University, Aarhus, Denmark
| | - Thomas D Als
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, Denmark
- Department of Biomedicine-Human Genetics, Aarhus University, Aarhus, Denmark
| | - Ole A Andreassen
- Institute of Clinical Medicine, University of Oslo, Oslo, Norway
- NORMENT, KG Jebsen Centre for Psychosis Research, Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
| | - Marie Bækvad-Hansen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, Copenhagen, Denmark
| | - Preben Bo Mortensen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- National Centre for Register-Based Research, Aarhus University, Aarhus, Denmark
- iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, Denmark
| | - Carsten Bøcker Pedersen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- National Centre for Register-Based Research, Aarhus University, Aarhus, Denmark
| | - Anders D Børglum
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, Denmark
- Department of Biomedicine-Human Genetics, Aarhus University, Aarhus, Denmark
| | - Jonas Bybjerg-Grauholm
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, Copenhagen, Denmark
| | - Srdjan Djurovic
- NORMENT, KG Jebsen Centre for Psychosis Research, Department of Clinical Science, University of Bergen, Bergen, Norway
- Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
| | - Naser Durmishi
- Department of Child and Adolescent Psychiatry, University Clinic of Psychiatry, Skopje, Macedonia
| | - Marianne Giørtz Pedersen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- National Centre for Register-Based Research, Aarhus University, Aarhus, Denmark
| | - Vera Golimbet
- Department of Clinical Genetics, Mental Health Research Center, Moscow, Russia
| | - Jakob Grove
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, Denmark
- Department of Biomedicine-Human Genetics, Aarhus University, Aarhus, Denmark
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| | - David M Hougaard
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, Copenhagen, Denmark
| | - Manuel Mattheisen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- iSEQ, Center for Integrative Sequencing, Aarhus University, Aarhus, Denmark
- Department of Biomedicine-Human Genetics, Aarhus University, Aarhus, Denmark
| | - Espen Molden
- Center for Psychopharmacology, Diakonhjemmet Hospital, Oslo, Norway
| | - Ole Mors
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Psychosis Research Unit, Aarhus University Hospital, Risskov, Denmark
| | - Merete Nordentoft
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Mental Health Services in the Capital Region of Denmark, Mental Health Center Copenhagen, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Teimuraz Silagadze
- Department of Psychiatry and Drug Addiction, Tbilisi State Medical University (TSMU), Tbilisi, Georgia
| | - Christine Søholm Hansen
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, Copenhagen, Denmark
| | | | | | | | - Sarah Tosato
- Section of Psychiatry, Department of Public Health and Community Medicine, University of Verona, Verona, Italy
| | - Thomas Werge
- iPSYCH, The Lundbeck Foundation Initiative for Integrative Psychiatric Research, Aarhus, Denmark
- Institute of Biological Psychiatry, MHC Sct. Hans, Mental Health Services Copenhagen, Roskilde, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - David A Collier
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Discovery Neuroscience Research, Eli Lilly and Company, Lilly Research Laboratories, Windlesham, UK
| | - Dan Rujescu
- Department of Psychiatry, University of Halle, Halle, Germany
- Department of Psychiatry, University of Munich, Munich, Germany
| | - George Kirov
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK
| | - Michael J Owen
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK.
| | - Michael C O'Donovan
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK.
| | - James T R Walters
- MRC Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, Cardiff, UK.
| |
Collapse
|
32
|
|
33
|
Cheng X, Xu C, DeGiorgio M. Fast and robust detection of ancestral selective sweeps. Mol Ecol 2017; 26:6871-6891. [DOI: 10.1111/mec.14416] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2017] [Revised: 10/16/2017] [Accepted: 10/23/2017] [Indexed: 01/01/2023]
Affiliation(s)
- Xiaoheng Cheng
- Huck Institutes of Life Sciences; Pennsylvania State University; University Park PA USA
- Department of Biology; Pennsylvania State University; University Park PA USA
| | - Cheng Xu
- Huck Institutes of Life Sciences; Pennsylvania State University; University Park PA USA
| | - Michael DeGiorgio
- Department of Biology; Pennsylvania State University; University Park PA USA
- Department of Statistics; Pennsylvania State University; University Park PA USA
- Institute for CyberScience; Pennsylvania State University; University Park PA USA
| |
Collapse
|
34
|
Burri R. Interpreting differentiation landscapes in the light of long-term linked selection. Evol Lett 2017. [DOI: 10.1002/evl3.14] [Citation(s) in RCA: 116] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Affiliation(s)
- Reto Burri
- Department of Population Ecology; Friedrich Schiller University Jena; Dornburger Strasse 159 D-07743 Jena Germany
| |
Collapse
|
35
|
Van Doren BM, Campagna L, Helm B, Illera JC, Lovette IJ, Liedvogel M. Correlated patterns of genetic diversity and differentiation across an avian family. Mol Ecol 2017; 26:3982-3997. [PMID: 28256062 DOI: 10.1111/mec.14083] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Revised: 02/19/2017] [Accepted: 02/22/2017] [Indexed: 01/01/2023]
Abstract
Comparative studies of closely related taxa can provide insights into the evolutionary forces that shape genome evolution and the prevalence of convergent molecular evolution. We investigated patterns of genetic diversity and differentiation in stonechats (genus Saxicola), a widely distributed avian species complex with phenotypic variation in plumage, morphology and migratory behaviour, to ask whether similar genomic regions have become differentiated in independent, but closely related, taxa. We used whole-genome pooled sequencing of 262 individuals from five taxa and found that levels of genetic diversity and divergence are strongly correlated among different stonechat taxa. We then asked whether these patterns remain correlated at deeper evolutionary scales and found that homologous genomic regions have become differentiated in stonechats and the closely related Ficedula flycatchers. Such correlation across a range of evolutionary divergence and among phylogenetically independent comparisons suggests that similar processes may be driving the differentiation of these independently evolving lineages, which in turn may be the result of intrinsic properties of particular genomic regions (e.g. areas of low recombination). Consequently, studies employing genome scans to search for areas important for reproductive isolation or adaptation should account for corresponding regions of differentiation, as these regions may not necessarily represent speciation islands or evidence of local adaptation.
Collapse
Affiliation(s)
- Benjamin M Van Doren
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA.,Cornell Lab of Ornithology, Cornell University, Ithaca, NY, 14850, USA
| | - Leonardo Campagna
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA.,Cornell Lab of Ornithology, Cornell University, Ithaca, NY, 14850, USA
| | - Barbara Helm
- Animal Health and Comparative Medicine, Institute of Biodiversity, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Juan Carlos Illera
- Research Unit of Biodiversity (UO-CSIC-PA), Oviedo University, Campus of Mieres, Research Building, 5th Floor, c/ Gonzalo Gutiérrez Quirós s/n, 33600, Mieres, Asturias, Spain
| | - Irby J Lovette
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, 14853, USA.,Cornell Lab of Ornithology, Cornell University, Ithaca, NY, 14850, USA
| | - Miriam Liedvogel
- Max Planck Institute for Evolutionary Biology, AG Behavioural Genomics, August-Thienemann-Str. 2, 24306, Plön, Germany
| |
Collapse
|
36
|
Dynamics and Fate of Beneficial Mutations Under Lineage Contamination by Linked Deleterious Mutations. Genetics 2017; 205:1305-1318. [PMID: 28100591 DOI: 10.1534/genetics.116.194597] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 01/04/2017] [Indexed: 11/18/2022] Open
Abstract
Beneficial mutations drive adaptive evolution, yet their selective advantage does not ensure their fixation. Haldane's application of single-type branching process theory showed that genetic drift alone could cause the extinction of newly arising beneficial mutations with high probability. With linkage, deleterious mutations will affect the dynamics of beneficial mutations and might further increase their extinction probability. Here, we model the lineage dynamics of a newly arising beneficial mutation as a multitype branching process. Our approach accounts for the combined effects of drift and the stochastic accumulation of linked deleterious mutations, which we call lineage contamination We first study the lineage-contamination phenomenon in isolation, deriving dynamics and survival probabilities (the complement of extinction probabilities) of beneficial lineages. We find that survival probability is zero when [Formula: see text] where U is deleterious mutation rate and [Formula: see text] is the selective advantage of the beneficial mutation in question, and is otherwise depressed below classical predictions by a factor bounded from below by [Formula: see text] We then put the lineage contamination phenomenon into the context of an evolving population by incorporating the effects of background selection. We find that, under the combined effects of lineage contamination and background selection, ensemble survival probability is never zero but is depressed below classical predictions by a factor bounded from below by [Formula: see text] where [Formula: see text] is mean selective advantage of beneficial mutations, and [Formula: see text] This factor, and other bounds derived from it, are independent of the fitness effects of deleterious mutations. At high enough mutation rates, lineage contamination can depress fixation probabilities to values that approach zero. This fact suggests that high mutation rates can, perhaps paradoxically, (1) alleviate competition among beneficial mutations, or (2) potentially even shut down the adaptive process. We derive critical mutation rates above which these two events become likely.
Collapse
|
37
|
|
38
|
Santiago E, Caballero A. Joint Prediction of the Effective Population Size and the Rate of Fixation of Deleterious Mutations. Genetics 2016; 204:1267-1279. [PMID: 27672094 PMCID: PMC5105856 DOI: 10.1534/genetics.116.188250] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Accepted: 09/20/2016] [Indexed: 11/18/2022] Open
Abstract
Mutation, genetic drift, and selection are considered the main factors shaping genetic variation in nature. There is a lack, however, of general predictions accounting for the mutual interrelation between these factors. In the context of the background selection model, we provide a set of equations for the joint prediction of the effective population size and the rate of fixation of deleterious mutations, which are applicable both to sexual and asexual species. For a population of N haploid individuals and a model of deleterious mutations with effect s appearing with rate U in a genome L Morgans long, the asymptotic effective population size (Ne) and the average number of generations (T) between consecutive fixations can be approximated by [Formula: see text] and [Formula: see text] The solution is applicable to Muller's ratchet, providing satisfactory approximations to the rate of accumulation of mutations for a wide range of parameters. We also obtain predictions of the effective size accounting for the expected nucleotide diversity. Predictions for sexual populations allow for outlining the general conditions where mutational meltdown occurs. The equations can be extended to any distribution of mutational effects and the consideration of hotspots of recombination, showing that Ne is rather insensitive and not proportional to changes in N for many combinations of parameters. This could contribute to explain the observed small differences in levels of polymorphism between species with very different census sizes.
Collapse
Affiliation(s)
- Enrique Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, 33071 Oviedo, Spain
| | - Armando Caballero
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, 36310 Vigo, Spain
| |
Collapse
|
39
|
Wang J, Santiago E, Caballero A. Prediction and estimation of effective population size. Heredity (Edinb) 2016; 117:193-206. [PMID: 27353047 PMCID: PMC5026755 DOI: 10.1038/hdy.2016.43] [Citation(s) in RCA: 169] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Revised: 05/03/2016] [Accepted: 05/16/2016] [Indexed: 12/19/2022] Open
Abstract
Effective population size (Ne) is a key parameter in population genetics. It has important applications in evolutionary biology, conservation genetics and plant and animal breeding, because it measures the rates of genetic drift and inbreeding and affects the efficacy of systematic evolutionary forces, such as mutation, selection and migration. We review the developments in predictive equations and estimation methodologies of effective size. In the prediction part, we focus on the equations for populations with different modes of reproduction, for populations under selection for unlinked or linked loci and for the specific applications to conservation genetics. In the estimation part, we focus on methods developed for estimating the current or recent effective size from molecular marker or sequence data. We discuss some underdeveloped areas in predicting and estimating Ne for future research.
Collapse
Affiliation(s)
- J Wang
- Institute of Zoology, Zoological Society of London, London, UK
| | - E Santiago
- Departamento de Biología Funcional, Facultad de Biología, Universidad de Oviedo, Oviedo, Spain
| | - A Caballero
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, Vigo, Spain
| |
Collapse
|
40
|
Elyashiv E, Sattath S, Hu TT, Strutsovsky A, McVicker G, Andolfatto P, Coop G, Sella G. A Genomic Map of the Effects of Linked Selection in Drosophila. PLoS Genet 2016; 12:e1006130. [PMID: 27536991 PMCID: PMC4990265 DOI: 10.1371/journal.pgen.1006130] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2015] [Accepted: 05/26/2016] [Indexed: 01/23/2023] Open
Abstract
Natural selection at one site shapes patterns of genetic variation at linked sites. Quantifying the effects of "linked selection" on levels of genetic diversity is key to making reliable inference about demography, building a null model in scans for targets of adaptation, and learning about the dynamics of natural selection. Here, we introduce the first method that jointly infers parameters of distinct modes of linked selection, notably background selection and selective sweeps, from genome-wide diversity data, functional annotations and genetic maps. The central idea is to calculate the probability that a neutral site is polymorphic given local annotations, substitution patterns, and recombination rates. Information is then combined across sites and samples using composite likelihood in order to estimate genome-wide parameters of distinct modes of selection. In addition to parameter estimation, this approach yields a map of the expected neutral diversity levels along the genome. To illustrate the utility of our approach, we apply it to genome-wide resequencing data from 125 lines in Drosophila melanogaster and reliably predict diversity levels at the 1Mb scale. Our results corroborate estimates of a high fraction of beneficial substitutions in proteins and untranslated regions (UTR). They allow us to distinguish between the contribution of sweeps and other modes of selection around amino acid substitutions and to uncover evidence for pervasive sweeps in untranslated regions (UTRs). Our inference further suggests a substantial effect of other modes of linked selection and of adaptation in particular. More generally, we demonstrate that linked selection has had a larger effect in reducing diversity levels and increasing their variance in D. melanogaster than previously appreciated.
Collapse
Affiliation(s)
- Eyal Elyashiv
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| | - Shmuel Sattath
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Tina T. Hu
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Alon Strutsovsky
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Graham McVicker
- The Laboratory of Genetics and The Integrative Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California, United States of America
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Graham Coop
- Department of Evolution and Ecology, University of California, Davis, Davis, California, United States of America
| | - Guy Sella
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| |
Collapse
|
41
|
Renzette N, Kowalik TF, Jensen JD. On the relative roles of background selection and genetic hitchhiking in shaping human cytomegalovirus genetic diversity. Mol Ecol 2015. [PMID: 26211679 DOI: 10.1111/mec.13331] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
A central focus of population genetics has been examining the contribution of selective and neutral processes in shaping patterns of intraspecies diversity. In terms of selection specifically, surveys of higher organisms have shown considerable variation in the relative contributions of background selection and genetic hitchhiking in shaping the distribution of polymorphisms, although these analyses have rarely been extended to bacteria and viruses. Here, we study the evolution of a ubiquitous, viral pathogen, human cytomegalovirus (HCMV), by analysing the relationship among intraspecies diversity, interspecies divergence and rates of recombination. We show that there is a strong correlation between diversity and divergence, consistent with expectations of neutral evolution. However, after correcting for divergence, there remains a significant correlation between intraspecies diversity and recombination rates, with additional analyses suggesting that this correlation is largely due to the effects of background selection. In addition, a small number of loci, centred on long noncoding RNAs, also show evidence of selective sweeps. These data suggest that HCMV evolution is dominated by neutral mechanisms as well as background selection, expanding our understanding of linked selection to a novel class of organisms.
Collapse
Affiliation(s)
- Nicholas Renzette
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01655, USA
| | - Timothy F Kowalik
- Department of Microbiology and Physiological Systems, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01655, USA.,Immunology and Microbiology Program, University of Massachusetts Medical School, 368 Plantation Street, Worcester, MA, 01655, USA
| | - Jeffrey D Jensen
- Swiss Institute of Bioinformatics (SIB), Lausanne, CH-1015, Switzerland.,School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, CH-1015, Switzerland
| |
Collapse
|
42
|
Villemereuil P, Gaggiotti OE. A new F
ST
‐based method to uncover local adaptation using environmental variables. Methods Ecol Evol 2015. [DOI: 10.1111/2041-210x.12418] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Pierre Villemereuil
- LECA, UMR 5553 Centre National de la Recherche Scientifique Université Jospeh Fourier 2233 rue de la piscine 38400 Saint Martin d'Hères France
| | - Oscar E. Gaggiotti
- LECA, UMR 5553 Centre National de la Recherche Scientifique Université Jospeh Fourier 2233 rue de la piscine 38400 Saint Martin d'Hères France
- Scottish Oceans Institute University of St Andrews Fife KY16 8LB UK
| |
Collapse
|
43
|
Barrett SCH, Arunkumar R, Wright SI. The demography and population genomics of evolutionary transitions to self-fertilization in plants. Philos Trans R Soc Lond B Biol Sci 2015; 369:rstb.2013.0344. [PMID: 24958918 DOI: 10.1098/rstb.2013.0344] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The evolution of self-fertilization from outcrossing has occurred on numerous occasions in flowering plants. This shift in mating system profoundly influences the morphology, ecology, genetics and evolution of selfing lineages. As a result, there has been sustained interest in understanding the mechanisms driving the evolution of selfing and its environmental context. Recently, patterns of molecular variation have been used to make inferences about the selective mechanisms associated with mating system transitions. However, these inferences can be complicated by the action of linked selection following the transition. Here, using multilocus simulations and comparative molecular data from related selfers and outcrossers, we demonstrate that there is little evidence for strong bottlenecks associated with initial transitions to selfing, and our simulation results cast doubt on whether it is possible to infer the role of bottlenecks associated with reproductive assurance in the evolution of selfing. They indicate that the effects of background selection on the loss of diversity and efficacy of selection occur rapidly following the shift to high selfing. Future comparative studies that integrate explicit ecological and genomic details are necessary for quantifying the independent and joint effects of selection and demography on transitions to selfing and the loss of genetic diversity.
Collapse
Affiliation(s)
- Spencer C H Barrett
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, Ontario, Canada M5S 3B2
| | - Ramesh Arunkumar
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, Ontario, Canada M5S 3B2
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, Ontario, Canada M5S 3B2
| |
Collapse
|
44
|
Bank C, Ewing GB, Ferrer-Admettla A, Foll M, Jensen JD. Thinking too positive? Revisiting current methods of population genetic selection inference. Trends Genet 2014; 30:540-6. [PMID: 25438719 DOI: 10.1016/j.tig.2014.09.010] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Revised: 09/19/2014] [Accepted: 09/23/2014] [Indexed: 02/03/2023]
Abstract
In the age of next-generation sequencing, the availability of increasing amounts and improved quality of data at decreasing cost ought to allow for a better understanding of how natural selection is shaping the genome than ever before. However, alternative forces, such as demography and background selection (BGS), obscure the footprints of positive selection that we would like to identify. In this review, we illustrate recent developments in this area, and outline a roadmap for improved selection inference. We argue (i) that the development and obligatory use of advanced simulation tools is necessary for improved identification of selected loci, (ii) that genomic information from multiple time points will enhance the power of inference, and (iii) that results from experimental evolution should be utilized to better inform population genomic studies.
Collapse
Affiliation(s)
- Claudia Bank
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland.
| | - Gregory B Ewing
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| | - Anna Ferrer-Admettla
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland; Department of Biology and Biochemistry, University of Fribourg, 1700 Fribourg, Switzerland
| | - Matthieu Foll
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| | - Jeffrey D Jensen
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| |
Collapse
|
45
|
Zhou L, Bawa R, Holliday JA. Exome resequencing reveals signatures of demographic and adaptive processes across the genome and range of black cottonwood (Populus trichocarpa). Mol Ecol 2014; 23:2486-99. [PMID: 24750333 DOI: 10.1111/mec.12752] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2013] [Revised: 04/09/2014] [Accepted: 04/11/2014] [Indexed: 12/11/2022]
Abstract
Extant variation in temperate and boreal plant species has been influenced by both demographic histories associated with Pleistocene glacial cycles and adaptation to local climate. We used sequence capture to investigate the role of these neutral and adaptive processes in shaping diversity in black cottonwood (Populus trichocarpa). Nucleotide diversity and Tajima's D were lowest at replacement sites and highest at intergenic sites, while LD showed the opposite pattern. With samples grouped into three populations arrayed latitudinally, effective population size was highest in the north, followed by south and centre, and LD was highest in the south followed by the north and centre, suggesting a possible northern glacial refuge. FST outlier analysis revealed that promoter, 5'-UTR and intronic sites were enriched for outliers compared with coding regions, while no outliers were found among intergenic sites. Codon usage bias was evident, and genes with synonymous outliers had 30% higher average expression compared with genes containing replacement outliers. These results suggest divergent selection related to regulation of gene expression is important to local adaptation in P. trichocarpa. Finally, within-population selective sweeps were much more pronounced in the central population than in putative northern and southern refugia, which may reflect the different demographic histories of the populations and concomitant effects on signatures of genetic hitchhiking from standing variation.
Collapse
Affiliation(s)
- L Zhou
- Department of Forest Resources and Environmental Conservation, Virginia Polytechnic Institute and State University, 304 Cheatham Hall, Blacksburg, VA, 24061, USA
| | | | | |
Collapse
|
46
|
The evolutionary genetics of the genes underlying phenotypic associations for loblolly pine (Pinus taeda, Pinaceae). Genetics 2013; 195:1353-72. [PMID: 24121773 DOI: 10.1534/genetics.113.157198] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
A primary goal of evolutionary genetics is to discover and explain the genetic basis of fitness-related traits and how this genetic basis evolves within natural populations. Unprecedented technological advances have fueled the discovery of genetic variants associated with ecologically relevant phenotypes in many different life forms, as well as the ability to scan genomes for deviations from selectively neutral models of evolution. Theoretically, the degree of overlap between lists of genomic regions identified using each approach is related to the genetic architecture of fitness-related traits and the strength and type of natural selection molding variation at these traits within natural populations. Here we address for the first time in a plant the degree of overlap between these lists, using patterns of nucleotide diversity and divergence for >7000 unique amplicons described from the extensive expressed sequence tag libraries generated for loblolly pine (Pinus taeda L.) in combination with the >1000 published genetic associations. We show that loci associated with phenotypic traits are distinct with regard to neutral expectations. Phenotypes measured at the whole plant level (e.g., disease resistance) exhibit an approximately twofold increase in the proportion of adaptive nonsynonymous substitutions over the genome-wide average. As expected for polygenic traits, these signals were apparent only when loci were considered at the level of functional sets. The ramifications of this result are discussed in light of the continued efforts to dissect the genetic basis of quantitative traits.
Collapse
|
47
|
Abstract
Purifying selection at many linked sites alters patterns of molecular evolution, reducing overall diversity and distorting the shapes of genealogies. Recombination attenuates these effects; however, purifying selection can significantly distort genealogies even for substantial recombination rates. Here, we show that when selection and/or recombination are sufficiently strong, the genealogy at any single site can be described by a time-dependent effective population size, Ne(t), which has a simple analytic form. Our results illustrate how recombination reduces distortions in genealogies and allow us to quantitatively describe the shapes of genealogies in the presence of strong purifying selection and recombination. We also analyze the effects of a distribution of selection coefficients across the genome.
Collapse
|