1
|
Li J, Amado A, Bank C. Rapid adaptation of recombining populations on tunable fitness landscapes. Mol Ecol 2024; 33:e16900. [PMID: 36855836 DOI: 10.1111/mec.16900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/28/2023] [Accepted: 02/01/2023] [Indexed: 03/02/2023]
Abstract
How does standing genetic variation affect polygenic adaptation in recombining populations? Despite a large body of work in quantitative genetics, epistatic and weak additive fitness effects among simultaneously segregating genetic variants are difficult to capture experimentally or to predict theoretically. In this study, we simulated adaptation on fitness landscapes with tunable ruggedness driven by standing genetic variation in recombining populations. We confirmed that recombination hinders the movement of a population through a rugged fitness landscape. When surveying the effect of epistasis on the fixation of alleles, we found that the combined effects of high ruggedness and high recombination probabilities lead to preferential fixation of alleles that had a high initial frequency. This indicates that positive epistatic alleles escape from being broken down by recombination when they start at high frequency. We further extract direct selection coefficients and pairwise epistasis along the adaptive path. When taking the final fixed genotype as the reference genetic background, we observe that, along the adaptive path, beneficial direct selection appears stronger and pairwise epistasis weaker than in the underlying fitness landscape. Quantitatively, the ratio of epistasis and direct selection is smaller along the adaptive path (≈ 1 ) than expected. Thus, adaptation on a rugged fitness landscape may lead to spurious signals of direct selection generated through epistasis. Our study highlights how the interplay of epistasis and recombination constrains the adaptation of a diverse population to a new environment.
Collapse
Affiliation(s)
- Juan Li
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute for Bioinformatics, Lausanne, Switzerland
| | - André Amado
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute for Bioinformatics, Lausanne, Switzerland
| | - Claudia Bank
- Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Swiss Institute for Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
2
|
Dochtermann NA, Klock B, Roff DA, Royauté R. Drift on holey landscapes as a dominant evolutionary process. Proc Natl Acad Sci U S A 2023; 120:e2313282120. [PMID: 38113257 PMCID: PMC10756301 DOI: 10.1073/pnas.2313282120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 11/15/2023] [Indexed: 12/21/2023] Open
Abstract
An organism's phenotype has been shaped by evolution but the specific processes have to be indirectly inferred for most species. For example, correlations among traits imply the historical action of correlated selection and, more generally, the expression and distribution of traits is expected to be reflective of the adaptive landscapes that have shaped a population. However, our expectations about how quantitative traits-like most behaviors, physiological processes, and life-history traits-should be distributed under different evolutionary processes are not clear. Here, we show that genetic variation in quantitative traits is not distributed as would be expected under dominant evolutionary models. Instead, we found that genetic variation in quantitative traits across six phyla and 60 species (including both Plantae and Animalia) is consistent with evolution across high-dimensional "holey landscapes." This suggests that the leading conceptualizations and modeling of the evolution of trait integration fail to capture how phenotypes are shaped and that traits are integrated in a manner contrary to predictions of dominant evolutionary theory. Our results demonstrate that our understanding of how evolution has shaped phenotypes remains incomplete and these results provide a starting point for reassessing the relevance of existing evolutionary models.
Collapse
Affiliation(s)
- Ned A. Dochtermann
- Department of Biological Sciences, North Dakota State University, Fargo, ND58108
| | - Brady Klock
- Department of Biological Sciences, North Dakota State University, Fargo, ND58108
| | - Derek A. Roff
- Department of Biology, University of California, Riverside, CA92521
| | - Raphaël Royauté
- Université Paris-Saclay, French National Research Institute for Agriculture, Food, and Environment, AgroParisTech, UMR EcoSys, Palaiseau91120, France
| |
Collapse
|
3
|
Yousefi M, Andrejka L, Szamecz M, Winslow MM, Petrov DA, Boross G. Fully accessible fitness landscape of oncogene-negative lung adenocarcinoma. Proc Natl Acad Sci U S A 2023; 120:e2303224120. [PMID: 37695905 PMCID: PMC10515140 DOI: 10.1073/pnas.2303224120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 07/12/2023] [Indexed: 09/13/2023] Open
Abstract
Cancer genomes are almost invariably complex with genomic alterations cooperating during each step of carcinogenesis. In cancers that lack a single dominant oncogene mutation, cooperation between the inactivation of multiple tumor suppressor genes can drive tumor initiation and growth. Here, we shed light on how the sequential acquisition of genomic alterations generates oncogene-negative lung tumors. We couple tumor barcoding with combinatorial and multiplexed somatic genome editing to characterize the fitness landscapes of three tumor suppressor genes NF1, RASA1, and PTEN, the inactivation of which jointly drives oncogene-negative lung adenocarcinoma initiation and growth. The fitness landscape was surprisingly accessible, with each additional mutation leading to growth advantage. Furthermore, the fitness landscapes remained fully accessible across backgrounds with the inactivation of additional tumor suppressor genes. These results suggest that while predicting cancer evolution will be challenging, acquiring the multiple alterations that drive the growth of oncogene-negative tumors can be facilitated by the lack of constraints on mutational order.
Collapse
Affiliation(s)
- Maryam Yousefi
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Laura Andrejka
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
| | - Márton Szamecz
- Eötvös Loránd University, Faculty of Informatics, Budapest1053, Hungary
| | - Monte M. Winslow
- Department of Genetics, Stanford University School of Medicine, Stanford, CA94305
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA94305
- Department of Pathology, Stanford University School of Medicine, Stanford, CA94305
| | - Dmitri A. Petrov
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA94305
- Department of Biology, Stanford University, Stanford, CA94305
| | - Gábor Boross
- Department of Biology, Stanford University, Stanford, CA94305
| |
Collapse
|
4
|
Barker-Clarke R, Weaver DT, Scott JG. Graph 'texture' features as novel metrics that can summarize complex biological graphs. Phys Med Biol 2023; 68:174001. [PMID: 37385267 PMCID: PMC10598684 DOI: 10.1088/1361-6560/ace305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 06/08/2023] [Accepted: 06/29/2023] [Indexed: 07/01/2023]
Abstract
Objective.Image texture features, such as those derived by Haralicket al, are a powerful metric for image classification and are used across fields including cancer research. Our aim is to demonstrate how analogous texture features can be derived for graphs and networks. We also aim to illustrate how these new metrics summarize graphs, may aid comparative graph studies, may help classify biological graphs, and might assist in detecting dysregulation in cancer.Approach.We generate the first analogies of image texture for graphs and networks. Co-occurrence matrices for graphs are generated by summing over all pairs of neighboring nodes in the graph. We generate metrics for fitness landscapes, gene co-expression and regulatory networks, and protein interaction networks. To assess metric sensitivity we varied discretization parameters and noise. To examine these metrics in the cancer context we compare metrics for both simulated and publicly available experimental gene expression and build random forest classifiers for cancer cell lineage.Main results.Our novel graph 'texture' features are shown to be informative of graph structure and node label distributions. The metrics are sensitive to discretization parameters and noise in node labels. We demonstrate that graph texture features vary across different biological graph topologies and node labelings. We show how our texture metrics can be used to classify cell line expression by lineage, demonstrating classifiers with 82% and 89% accuracy.Significance.New metrics provide opportunities for better comparative analyzes and new models for classification. Our texture features are novel second-order graph features for networks or graphs with ordered node labels. In the complex cancer informatics setting, evolutionary analyses and drug response prediction are two examples where new network science approaches like this may prove fruitful.
Collapse
Affiliation(s)
- R Barker-Clarke
- Department of Translational Hematology and Oncology Research, Lerner Research Institute, Cleveland, OH 44195, United States of America
| | - D T Weaver
- Department of Translational Hematology and Oncology Research, Lerner Research Institute, Cleveland, OH 44195, United States of America
- School of Medicine, Case Western Reserve University, Cleveland, OH 44195, United States of America
| | - J G Scott
- Department of Translational Hematology and Oncology Research, Lerner Research Institute, Cleveland, OH 44195, United States of America
- School of Medicine, Case Western Reserve University, Cleveland, OH 44195, United States of America
| |
Collapse
|
5
|
Abstract
Owing to stochastic fluctuations arising from finite population size, known as genetic drift, the ability of a population to explore a rugged fitness landscape depends on its size. In the weak mutation regime, while the mean steady-state fitness increases with population size, we find that the height of the first fitness peak encountered when starting from a random genotype displays various behaviours versus population size, even among small and simple rugged landscapes. We show that the accessibility of the different fitness peaks is key to determining whether this height overall increases or decreases with population size. Furthermore, there is often a finite population size that maximizes the height of the first fitness peak encountered when starting from a random genotype. This holds across various classes of model rugged landscapes with sparse peaks, and in some experimental and experimentally inspired ones. Thus, early adaptation in rugged fitness landscapes can be more efficient and predictable for relatively small population sizes than in the large-size limit. This article is part of the theme issue ‘Interdisciplinary approaches to predicting evolutionary biology’.
Collapse
Affiliation(s)
- Richard Servajean
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Anne-Florence Bitbol
- Institute of Bioengineering, School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
6
|
Diaz-Colunga J, Skwara A, Gowda K, Diaz-Uriarte R, Tikhonov M, Bajic D, Sanchez A. Global epistasis on fitness landscapes. Philos Trans R Soc Lond B Biol Sci 2023; 378:20220053. [PMID: 37004717 PMCID: PMC10067270 DOI: 10.1098/rstb.2022.0053] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023] Open
Abstract
Epistatic interactions between mutations add substantial complexity to adaptive landscapes and are often thought of as detrimental to our ability to predict evolution. Yet, patterns of global epistasis, in which the fitness effect of a mutation is well-predicted by the fitness of its genetic background, may actually be of help in our efforts to reconstruct fitness landscapes and infer adaptive trajectories. Microscopic interactions between mutations, or inherent nonlinearities in the fitness landscape, may cause global epistasis patterns to emerge. In this brief review, we provide a succinct overview of recent work about global epistasis, with an emphasis on building intuition about why it is often observed. To this end, we reconcile simple geometric reasoning with recent mathematical analyses, using these to explain why different mutations in an empirical landscape may exhibit different global epistasis patterns—ranging from diminishing to increasing returns. Finally, we highlight open questions and research directions. This article is part of the theme issue ‘Interdisciplinary approaches to predicting evolutionary biology’.
Collapse
Affiliation(s)
- Juan Diaz-Colunga
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Abigail Skwara
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Karna Gowda
- Department of Ecology & Evolution & Center for the Physics of Evolving Systems, The University of Chicago, Chicago, IL 60637, USA
| | - Ramon Diaz-Uriarte
- Department of Biochemistry, School of Medicine, Universidad Autónoma de Madrid, Madrid 28029, Spain
- Instituto de Investigaciones Biomédicas ‘Alberto Sols’ (UAM-CSIC), Madrid 28029, Spain
| | - Mikhail Tikhonov
- Department of Physics, Washington University of St Louis, St Louis, MO 63130, USA
| | - Djordje Bajic
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06511, USA
| | - Alvaro Sanchez
- Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT 06511, USA
- Department of Microbial Biotechnology, Campus de Cantoblanco, CNB-CSIC, Madrid 28049, Spain
| |
Collapse
|
7
|
Sanchez A, Bajic D, Diaz-Colunga J, Skwara A, Vila JCC, Kuehn S. The community-function landscape of microbial consortia. Cell Syst 2023; 14:122-134. [PMID: 36796331 DOI: 10.1016/j.cels.2022.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 10/17/2022] [Accepted: 12/21/2022] [Indexed: 02/17/2023]
Abstract
Quantitatively linking the composition and function of microbial communities is a major aspiration of microbial ecology. Microbial community functions emerge from a complex web of molecular interactions between cells, which give rise to population-level interactions among strains and species. Incorporating this complexity into predictive models is highly challenging. Inspired by a similar problem in genetics of predicting quantitative phenotypes from genotypes, an ecological community-function (or structure-function) landscape could be defined that maps community composition and function. In this piece, we present an overview of our current understanding of these community landscapes, their uses, limitations, and open questions. We argue that exploiting the parallels between both landscapes could bring powerful predictive methodologies from evolution and genetics into ecology, providing a boost to our ability to engineer and optimize microbial consortia.
Collapse
Affiliation(s)
- Alvaro Sanchez
- Department of Ecology & Evolutionary Biology & Microbial Sciences Institute, Yale University, New Haven, CT, USA; Department of Microbial Biotechnology, CNB-CSIC, Campus de Cantoblanco, Madrid, Spain.
| | - Djordje Bajic
- Department of Ecology & Evolutionary Biology & Microbial Sciences Institute, Yale University, New Haven, CT, USA
| | - Juan Diaz-Colunga
- Department of Ecology & Evolutionary Biology & Microbial Sciences Institute, Yale University, New Haven, CT, USA
| | - Abigail Skwara
- Department of Ecology & Evolutionary Biology & Microbial Sciences Institute, Yale University, New Haven, CT, USA
| | - Jean C C Vila
- Department of Ecology & Evolutionary Biology & Microbial Sciences Institute, Yale University, New Haven, CT, USA
| | - Seppe Kuehn
- Center for the Physics of Evolving Systems, The Unviersity of Chicago, Chicago, IL, USA; Department of Ecology and Evolution, The University of Chicago, Chicago, IL, USA
| |
Collapse
|
8
|
González-Forero M. How development affects evolution. Evolution 2023; 77:562-579. [PMID: 36691368 DOI: 10.1093/evolut/qpac003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 09/14/2022] [Accepted: 10/06/2022] [Indexed: 01/25/2023]
Abstract
Natural selection acts on developmentally constructed phenotypes, but how does development affect evolution? This question prompts a simultaneous consideration of development and evolution. However, there has been a lack of general mathematical frameworks mechanistically integrating the two, which may have inhibited progress on the question. Here, we use a new mathematical framework that mechanistically integrates development into evolution to analyse how development affects evolution. We show that, while selection pushes genotypic and phenotypic evolution up the fitness landscape, development determines the admissible evolutionary pathway, such that evolutionary outcomes occur at path peaks rather than landscape peaks. Changes in development can generate path peaks, triggering genotypic or phenotypic diversification, even on constant, single-peak landscapes. Phenotypic plasticity, niche construction, extra-genetic inheritance, and developmental bias alter the evolutionary path and hence the outcome. Thus, extra-genetic inheritance can have permanent evolutionary effects by changing the developmental constraints, even if extra-genetically acquired elements are not transmitted to future generations. Selective development, whereby phenotype construction points in the adaptive direction, may induce adaptive or maladaptive evolution depending on the developmental constraints. Moreover, developmental propagation of phenotypic effects over age enables the evolution of negative senescence. Overall, we find that development plays a major evolutionary role.
Collapse
|
9
|
Topper A, Kolodny O. Crossing the valley of non-intimidating conspicuousness: evolution of warning coloration through the lens of fitness landscapes. Evolution 2023; 77:335-341. [PMID: 36626813 DOI: 10.1093/evolut/qpac044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 11/03/2022] [Accepted: 11/23/2022] [Indexed: 01/12/2023]
Abstract
The initial evolution of conspicuous aposematism is a longstanding evolutionary paradox: while the benefits of conspicuousness in aposematic signals have been demonstrated, they rely on predators being familiar with the conspicuous signals and avoiding them. In a system dominated by naïve predators, the appearance of conspicuousness would be expected to increase detection and attack rate by the predators. Hence, it is unclear how such signals could become established in a naïve community. We suggest that this problem may usefully be framed in the terms of fitness landscapes, an idea used for conceptualizing the mapping between genotype/phenotype and fitness. The evolution of conspicuousness can be thought of as a special case of valley crossing, which concerns the transition of populations between fitness peaks, when such a transition imposes an initial decrease in fitness. Crypsis may be regarded as a local fitness peak, hindering predators' ability to detect prey; for an unpalatable species, conspicuous aposematism may constitute a higher-still fitness peak, preventing predation attempts altogether and allowing access to niches unavailable to species encumbered by the necessity to remain concealed from predators. However, in order to reach this higher peak, the population must first cross the valley of non-intimidating conspicuousness, in which the prey is conspicuous but the predators are not yet deterred. Using terms borrowed from the concept of fitness landscapes, we categorize several solutions suggested previously in the literature as either concerning changes in the fitness landscape or as illuminating possible ridges connecting the two peaks, which emerge from unconsidered dimensions of the fitness landscape. We suggest that considering this question through the lens of fitness landscapes not only facilitates useful categorization of previously suggested solutions but may also prove useful for thinking about novel ones.
Collapse
Affiliation(s)
- Akiva Topper
- Department of Ecology, Evolution and Behavior, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Oren Kolodny
- Department of Ecology, Evolution and Behavior, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
10
|
Yang CH, Scarpino SV. A Family of Fitness Landscapes Modeled through Gene Regulatory Networks. Entropy (Basel) 2022; 24:622. [PMID: 35626507 PMCID: PMC9141513 DOI: 10.3390/e24050622] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 04/11/2022] [Accepted: 04/26/2022] [Indexed: 02/01/2023]
Abstract
Fitness landscapes are a powerful metaphor for understanding the evolution of biological systems. These landscapes describe how genotypes are connected to each other through mutation and related through fitness. Empirical studies of fitness landscapes have increasingly revealed conserved topographical features across diverse taxa, e.g., the accessibility of genotypes and "ruggedness". As a result, theoretical studies are needed to investigate how evolution proceeds on fitness landscapes with such conserved features. Here, we develop and study a model of evolution on fitness landscapes using the lens of Gene Regulatory Networks (GRNs), where the regulatory products are computed from multiple genes and collectively treated as phenotypes. With the assumption that regulation is a binary process, we prove the existence of empirically observed, topographical features such as accessibility and connectivity. We further show that these results hold across arbitrary fitness functions and that a trade-off between accessibility and ruggedness need not exist. Then, using graph theory and a coarse-graining approach, we deduce a mesoscopic structure underlying GRN fitness landscapes where the information necessary to predict a population's evolutionary trajectory is retained with minimal complexity. Using this coarse-graining, we develop a bottom-up algorithm to construct such mesoscopic backbones, which does not require computing the genotype network and is therefore far more efficient than brute-force approaches. Altogether, this work provides mathematical results of high-dimensional fitness landscapes and a path toward connecting theory to empirical studies.
Collapse
Affiliation(s)
- Chia-Hung Yang
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
| | - Samuel V. Scarpino
- Network Science Institute, Northeastern University, Boston, MA 02115, USA
- Physics Department, Northeastern University, Boston, MA 02115, USA
- Roux Institute, Northeastern University, Boston, MA 02115, USA
- Institute for Experiential AI, Northeastern University, Boston, MA 02115, USA
- Santa Fe Institute, Santa Fe, NM 87501, USA
- Vermont Complex Systems Center, University of Vermont, Burlington, VT 05405, USA
| |
Collapse
|
11
|
Guo Y, Amir A. The effect of weak clonal interference on average fitness trajectories in the presence of macroscopic epistasis. Genetics 2022; 220:6529545. [PMID: 35171996 PMCID: PMC8982035 DOI: 10.1093/genetics/iyac028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 01/28/2022] [Indexed: 11/13/2022] Open
Abstract
Adaptation dynamics on fitness landscapes is often studied theoretically in the strong-selection, weak-mutation regime. However, in a large population, multiple beneficial mutants can emerge before any of them fixes in the population. Competition between mutants is known as clonal interference, and while it is known to slow down the rate of adaptation (when compared to the strong-selection, weak-mutation model with the same parameters), how it affects the shape of long-term fitness trajectories in the presence of epistasis is an open question. Here, by considering how changes in fixation probabilities arising from weak clonal interference affect the dynamics of adaptation on fitness-parameterized landscapes, we find that the change in the shape of fitness trajectory arises only through changes in the supply of beneficial mutations (or equivalently, the beneficial mutation rate). Furthermore, a depletion of beneficial mutations as a population climbs up the fitness landscape can speed up the rescaled fitness trajectory (where adaptation speed is measured relative to its value at the start of the experiment), while an enhancement of the beneficial mutation rate does the opposite of slowing it down. Our findings suggest that by carrying out evolution experiments in both regimes (with and without clonal interference), one could potentially distinguish the different sources of macroscopic epistasis (fitness effect of mutations vs change in fraction of beneficial mutations).
Collapse
Affiliation(s)
- Yipei Guo
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA,Program in Biophysics, Harvard University, Boston, MA 02115, USA,Janelia Research Campus, Virginia, VA 20147, USA
| | - Ariel Amir
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA,Corresponding author: John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
12
|
Abstract
During their evolution, proteins explore sequence space via an interplay between random mutations and phenotypic selection. Here, we build upon recent progress in reconstructing data-driven fitness landscapes for families of homologous proteins, to propose stochastic models of experimental protein evolution. These models predict quantitatively important features of experimentally evolved sequence libraries, like fitness distributions and position-specific mutational spectra. They also allow us to efficiently simulate sequence libraries for a vast array of combinations of experimental parameters like sequence divergence, selection strength, and library size. We showcase the potential of the approach in reanalyzing two recent experiments to determine protein structure from signals of epistasis emerging in experimental sequence libraries. To be detectable, these signals require sufficiently large and sufficiently diverged libraries. Our modeling framework offers a quantitative explanation for different outcomes of recently published experiments. Furthermore, we can forecast the outcome of time- and resource-intensive evolution experiments, opening thereby a way to computationally optimize experimental protocols.
Collapse
Affiliation(s)
- M Bisardi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, Paris, F-75005, France.,Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, Paris, F-75005, France
| | - J Rodriguez-Rivas
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, Paris, F-75005, France
| | - F Zamponi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, Paris, F-75005, France
| | - M Weigt
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, Paris, F-75005, France
| |
Collapse
|
13
|
Park J, Wang HH. Systematic dissection of σ 70 sequence diversity and function in bacteria. Cell Rep 2021; 36:109590. [PMID: 34433066 PMCID: PMC8716302 DOI: 10.1016/j.celrep.2021.109590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 04/19/2021] [Accepted: 08/02/2021] [Indexed: 10/29/2022] Open
Abstract
Primary σ70 factors are key conserved bacterial regulatory proteins that interact with regulatory DNA to control gene expression. It is, however, poorly understood whether σ70 sequence diversity in different bacteria reflects functional differences. Here, we employ comparative and functional genomics to explore the sequence and function relationship of primary σ70. Using multiplex automated genome engineering and deep sequencing (MAGE-seq), we generate a saturation mutagenesis library and high-resolution fitness map of E. coli σ70 in domains 2-4. Mapping natural σ70 sequence diversity to the E. coli σ70 fitness landscape reveals significant predicted fitness deficits across σ70 orthologs. Interestingly, these predicted deficits are larger than observed fitness changes for 15 σ70 orthologs introduced into E. coli. Finally, we use a multiplexed transcriptional reporter assay and RNA sequencing (RNA-seq) to explore functional differences of several σ70 orthologs. This work provides an in-depth analysis of σ70 sequence and function to improve efforts to understand the evolution and engineering potential of this global regulator.
Collapse
Affiliation(s)
- Jimin Park
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA; Integrated Program in Cellular, Molecular and Biomedical Studies, Columbia University Irving Medical Center, New York, NY, USA.
| | - Harris H Wang
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA; Department of Pathology and Cell Biology, Columbia University Irving Medical Center, New York, NY, USA.
| |
Collapse
|
14
|
Bakhtin Y, Katsnelson MI, Wolf YI, Koonin EV. Evolution in the weak-mutation limit: Stasis periods punctuated by fast transitions between saddle points on the fitness landscape. Proc Natl Acad Sci U S A 2021; 118:e2015665118. [PMID: 33472973 DOI: 10.1073/pnas.2015665118] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The gradual character of evolution is a key feature of the Darwinian worldview. However, macroevolutionary events are often thought to occur in a nongradualist manner, in a regime known as punctuated equilibrium, whereby extended periods of evolutionary stasis are punctuated by rapid transitions between states. Here we analyze a simple mathematical model of population evolution on fitness landscapes and show that, for a large population in the weak-mutation limit, the process of adaptive evolution consists of extended periods of stasis, which the population spends around saddle points on the landscape, interrupted by rapid transitions to new saddle points when a beneficial mutation is fixed. Thus, phenomenologically, the default regime of biological evolution seems to closely resemble punctuated equilibrium. A mathematical analysis of the evolution of a large population under the weak-mutation limit shows that such a population would spend most of the time in stasis in the vicinity of saddle points on the fitness landscape. The periods of stasis are punctuated by fast transitions, in lnNe/s time (Ne, effective population size; s, selection coefficient of a mutation), when a new beneficial mutation is fixed in the evolving population, which accordingly moves to a different saddle, or on much rarer occasions from a saddle to a local peak. Phenomenologically, this mode of evolution of a large population resembles punctuated equilibrium (PE) whereby phenotypic changes occur in rapid bursts that are separated by much longer intervals of stasis during which mutations accumulate but the phenotype does not change substantially. Theoretically, PE has been linked to self-organized criticality (SOC), a model in which the size of “avalanches” in an evolving system is power-law-distributed, resulting in increasing rarity of major events. Here we show, however, that a PE-like evolutionary regime is the default for a very simple model of an evolving population that does not rely on SOC or any other special conditions.
Collapse
|
15
|
Abstract
Natural selection imposes a complex filter on which variants persist in a population resulting in evolutionary patterns that vary greatly along the genome. Some sites evolve close to neutrally, while others are highly conserved, allow only specific states, or only change in concert with other sites. On one hand, such constraints on sequence evolution can be to infer biological function, one the other hand they need to be accounted for in phylogenetic reconstruction. Phylogenetic models often account for this complexity by partitioning sites into a small number of discrete classes with different rates and/or state preferences. Appropriate model complexity is typically determined by model selection procedures. Here, we present an efficient algorithm to estimate more complex models that allow for different preferences at every site and explore the accuracy at which such models can be estimated from simulated data. Our iterative approximate maximum likelihood scheme uses information in the data efficiently and accurately estimates site-specific preferences from large data sets with moderately diverged sequences and known topology. However, the joint estimation of site-specific rates, and site-specific preferences, and phylogenetic branch length can suffer from identifiability problems, while ignoring variation in preferences across sites results in branch length underestimates. Site-specific preferences estimated from large HIV pol alignments show qualitative concordance with intra-host estimates of fitness costs. Analysis of these substitution models suggests near saturation of divergence after a few hundred years. Such saturation can explain the inability to infer deep divergence times of HIV and SIVs using molecular clock approaches and time-dependent rate estimates.
Collapse
Affiliation(s)
- Vadim Puller
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 61, Basel, Switzerland
| | - Pavel Sagulenko
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Richard A Neher
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 61, Basel, Switzerland
| |
Collapse
|
16
|
Abstract
Connection patterns among Local Optima Networks (LONs) can inform heuristic design for optimisation. LON research has predominantly required complete enumeration of a fitness landscape, thereby restricting analysis to problems diminutive in size compared to real-life situations. LON sampling algorithms are therefore important. In this article, we study LON construction algorithms for the Quadratic Assignment Problem (QAP). Using machine learning, we use estimated LON features to predict search performance for competitive heuristics used in the QAP domain. The results show that by using random forest regression, LON construction algorithms produce fitness landscape features which can explain almost all search variance. We find that LON samples better relate to search than enumerated LONs do. The importance of fitness levels of sampled LONs in search predictions is crystallised. Features from LONs produced by different algorithms are combined in predictions for the first time, with promising results for this "super-sampling": a model to predict tabu search success explained 99% of variance. Arguments are made for the use-case of each LON algorithm and for combining the exploitative process of one with the exploratory optimisation of the other.
Collapse
Affiliation(s)
- Sarah L Thomson
- Department of Computing Science and Mathematics, University of Stirling, Stirling, FK94LA, UK
| | - Gabriela Ochoa
- Department of Computing Science and Mathematics, University of Stirling, Stirling, FK94LA, UK
| | - Sébastien Verel
- Laboratoire LISIC, Université du Littoral Côte d'Opale, France
| | - Nadarajen Veerapen
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| |
Collapse
|
17
|
Somarelli JA, Gardner H, Cannataro VL, Gunady EF, Boddy AM, Johnson NA, Fisk JN, Gaffney SG, Chuang JH, Li S, Ciccarelli FD, Panchenko AR, Megquier K, Kumar S, Dornburg A, DeGregori J, Townsend JP. Molecular Biology and Evolution of Cancer: From Discovery to Action. Mol Biol Evol 2020; 37:320-326. [PMID: 31642480 PMCID: PMC6993850 DOI: 10.1093/molbev/msz242] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Cancer progression is an evolutionary process. During this process, evolving cancer cell populations encounter restrictive ecological niches within the body, such as the primary tumor, circulatory system, and diverse metastatic sites. Efforts to prevent or delay cancer evolution-and progression-require a deep understanding of the underlying molecular evolutionary processes. Herein we discuss a suite of concepts and tools from evolutionary and ecological theory that can inform cancer biology in new and meaningful ways. We also highlight current challenges to applying these concepts, and propose ways in which incorporating these concepts could identify new therapeutic modes and vulnerabilities in cancer.
Collapse
Affiliation(s)
- Jason A Somarelli
- Department of Medicine, Duke University Medical Center, Durham, NC
- Duke Cancer Institute, Duke University Medical Center, Durham, NC
| | - Heather Gardner
- Sackler School of Graduate Biomedical Sciences, Tufts University, Medford, MA
| | | | - Ella F Gunady
- Department of Medicine, Duke University Medical Center, Durham, NC
| | - Amy M Boddy
- Department of Anthropology, University of California, Santa Barbara, CA
| | | | | | - Stephen G Gaffney
- Department of Biostatistics, Yale School of Public Health, New Haven, CT
| | | | - Sheng Li
- The Jackson Laboratory for Genomic Medicine, Farmington, CT
| | - Francesca D Ciccarelli
- Cancer Systems Biology Laboratory, The Francis Crick Institute, London, United Kingdom
- King’s College London, London, United Kingdom
| | - Anna R Panchenko
- Department of Pathology and Molecular Medicine, School of Medicine, Queen’s University, Kingston, ON, Canada
- Ontario Institute of Cancer Research, Toronto, ON, Canada
| | - Kate Megquier
- Broad Institute, Massachusettes Institute of Technology and Harvard University
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, and Department of Biology, Temple University, Philadelphia, PA
| | - Alex Dornburg
- North Carolina Museum of Natural Sciences, Raleigh, NC
| | - James DeGregori
- Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO
| | - Jeffrey P Townsend
- Department of Biostatistics, Yale School of Public Health, New Haven, CT
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT
| |
Collapse
|
18
|
Abstract
Fitness interactions between mutations can influence a population's evolution in many different ways. While epistatic effects are difficult to measure precisely, important information is captured by the mean and variance of log fitnesses for individuals carrying different numbers of mutations. We derive predictions for these quantities from a class of simple fitness landscapes, based on models of optimizing selection on quantitative traits. We also explore extensions to the models, including modular pleiotropy, variable effect sizes, mutational bias and maladaptation of the wild type. We illustrate our approach by reanalysing a large dataset of mutant effects in a yeast snoRNA (small nucleolar RNA). Though characterized by some large epistatic effects, these data give a good overall fit to the non-epistatic null model, suggesting that epistasis might have limited influence on the evolutionary dynamics in this system. We also show how the amount of epistasis depends on both the underlying fitness landscape and the distribution of mutations, and so is expected to vary in consistent ways between new mutations, standing variation and fixed mutations.
Collapse
Affiliation(s)
- Christelle Fraïsse
- 1 Institut des Sciences de l'Evolution, CNRS-UM-IRD , Montpellier , France.,2 Department of Genetics, University of Cambridge , Downing Street, Cambridge CB2 3EH , UK.,3 Institute of Science and Technology Austria , Am Campus 1, Klosterneuburg 3400 , Austria
| | - John J Welch
- 2 Department of Genetics, University of Cambridge , Downing Street, Cambridge CB2 3EH , UK
| |
Collapse
|
19
|
Kemble H, Nghe P, Tenaillon O. Recent insights into the genotype-phenotype relationship from massively parallel genetic assays. Evol Appl 2019; 12:1721-1742. [PMID: 31548853 PMCID: PMC6752143 DOI: 10.1111/eva.12846] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/21/2019] [Accepted: 07/02/2019] [Indexed: 12/20/2022] Open
Abstract
With the molecular revolution in Biology, a mechanistic understanding of the genotype-phenotype relationship became possible. Recently, advances in DNA synthesis and sequencing have enabled the development of deep mutational scanning assays, capable of scoring comprehensive libraries of genotypes for fitness and a variety of phenotypes in massively parallel fashion. The resulting empirical genotype-fitness maps pave the way to predictive models, potentially accelerating our ability to anticipate the behaviour of pathogen and cancerous cell populations from sequencing data. Besides from cellular fitness, phenotypes of direct application in industry (e.g. enzyme activity) and medicine (e.g. antibody binding) can be quantified and even selected directly by these assays. This review discusses the technological basis of and recent developments in massively parallel genetics, along with the trends it is uncovering in the genotype-phenotype relationship (distribution of mutation effects, epistasis), their possible mechanistic bases and future directions for advancing towards the goal of predictive genetics.
Collapse
Affiliation(s)
- Harry Kemble
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Philippe Nghe
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Olivier Tenaillon
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
| |
Collapse
|
20
|
Gonzalez CE, Roberts P, Ostermeier M. Fitness Effects of Single Amino Acid Insertions and Deletions in TEM-1 β-Lactamase. J Mol Biol 2019; 431:2320-2330. [PMID: 31034887 DOI: 10.1016/j.jmb.2019.04.030] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2019] [Revised: 04/17/2019] [Accepted: 04/18/2019] [Indexed: 11/16/2022]
Abstract
Short insertions and deletions (InDels) are a common type of mutation found in nature and a useful source of variation in protein engineering. InDel events have important consequences in protein evolution, often opening new pathways for adaptation. However, much less is known about the effects of InDels compared to point mutations and amino acid substitutions. In particular, deep mutagenesis studies on the distribution of fitness effects of mutations have focused almost exclusively on amino acid substitutions. Here, we present a near-comprehensive analysis of the fitness effects of single amino acid InDels in TEM-1 β-lactamase. While we found InDels to be largely deleterious, partially overlapping deletion-tolerant and insertion-tolerant regions were observed throughout the protein, especially in unstructured regions and at the end of helices. The signal sequence of TEM-1 tolerated InDels more than the mature protein. Most regions of the protein tolerated insertions more than deletions, but a few regions tolerated deletions more than insertions. We examined the relationship between InDel tolerance and a variety of measures to help understand its origin. These measures included evolutionary variation in β-lactamases, secondary structure identity, tolerance to amino acid substitutions, solvent accessibility, and side-chain weighted contact number. We found secondary structure, weighted contact number, and evolutionary variation in class A beta-lactamases to be the somewhat predictive of InDel fitness effects.
Collapse
Affiliation(s)
- Courtney E Gonzalez
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Paul Roberts
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA.
| |
Collapse
|
21
|
Gonzalez CE, Ostermeier M. Pervasive Pairwise Intragenic Epistasis among Sequential Mutations in TEM-1 β-Lactamase. J Mol Biol 2019; 431:1981-1992. [PMID: 30922874 DOI: 10.1016/j.jmb.2019.03.020] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Revised: 02/25/2019] [Accepted: 03/13/2019] [Indexed: 12/25/2022]
Abstract
Interactions between mutations play a central role in shaping the fitness landscape, but a clear picture of intragenic epistasis has yet to emerge. To further reveal the prevalence and patterns of intragenic epistasis, we present a survey of epistatic interactions between sequential mutations in TEM-1 β-lactamase. We measured the fitness effect of ~12,000 pairs of consecutive amino acid substitutions and used our previous study of the fitness effects of single amino acid substitutions to calculate epistasis for over 8000 mutation pairs. Since sequential mutations are prone to physically interact, we postulated that our study would be surveying specific epistasis instead of nonspecific epistasis. We found widespread negative epistasis, especially in beta-strands, and a high frequency of negative sign epistasis among individually beneficial mutations. Negative epistasis (52%) occurred 7.6 times as frequently as positive epistasis (6.8%). Buried residues experienced more negative epistasis that surface-exposed residues. However, TEM-1 exhibited a couple of hotspots for positive epistasis, most notably L221/ R222 at which many combinations of mutations positively interacted. This study is the first to systematically examine pairwise epistasis throughout an entire protein performing its native function in its native host.
Collapse
Affiliation(s)
- Courtney E Gonzalez
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA.
| |
Collapse
|
22
|
Abstract
Experiments show that evolutionary fitness landscapes can have a rich combinatorial structure due to epistasis. For some landscapes, this structure can produce a computational constraint that prevents evolution from finding local fitness optima-thus overturning the traditional assumption that local fitness peaks can always be reached quickly if no other evolutionary forces challenge natural selection. Here, I introduce a distinction between easy landscapes of traditional theory where local fitness peaks can be found in a moderate number of steps, and hard landscapes where finding local optima requires an infeasible amount of time. Hard examples exist even among landscapes with no reciprocal sign epistasis; on these semismooth fitness landscapes, strong selection weak mutation dynamics cannot find the unique peak in polynomial time. More generally, on hard rugged fitness landscapes that include reciprocal sign epistasis, no evolutionary dynamics-even ones that do not follow adaptive paths-can find a local fitness optimum quickly. Moreover, on hard landscapes, the fitness advantage of nearby mutants cannot drop off exponentially fast but has to follow a power-law that long-term evolution experiments have associated with unbounded growth in fitness. Thus, the constraint of computational complexity enables open-ended evolution on finite landscapes. Knowing this constraint allows us to use the tools of theoretical computer science and combinatorial optimization to characterize the fitness landscapes that we expect to see in nature. I present candidates for hard landscapes at scales from single genes, to microbes, to complex organisms with costly learning (Baldwin effect) or maintained cooperation (Hankshaw effect). Just how ubiquitous hard landscapes (and the corresponding ultimate constraint on evolution) are in nature becomes an open empirical question.
Collapse
|
23
|
Abstract
A fitness landscape is a map between the genotype and its reproductive success in a given environment. The topography of fitness landscapes largely governs adaptive dynamics, constraining evolutionary trajectories and the predictability of evolution. Theory suggests that this topography can be deformed by mutations that produce substantial changes to the environment. Despite its importance, the deformability of fitness landscapes has not been systematically studied beyond abstract models, and little is known about its reach and consequences in empirical systems. Here we have systematically characterized the deformability of the genome-wide metabolic fitness landscape of the bacterium Escherichia coli Deformability is quantified by the noncommutativity of epistatic interactions, which we experimentally demonstrate in mutant strains on the path to an evolutionary innovation. Our analysis shows that the deformation of fitness landscapes by metabolic mutations rarely affects evolutionary trajectories in the short range. However, mutations with large environmental effects produce long-range landscape deformations in distant regions of the genotype space that affect the fitness of later descendants. Our results therefore suggest that, even in situations in which mutations have strong environmental effects, fitness landscapes may retain their power to forecast evolution over small mutational distances despite the potential attenuation of that power over longer evolutionary trajectories. Our methods and results provide an avenue for integrating adaptive and eco-evolutionary dynamics with complex genetics and genomics.
Collapse
Affiliation(s)
- Djordje Bajić
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511;
- Microbial Sciences Institute, Yale University West Campus, West Haven, CT 06516
| | - Jean C C Vila
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511
- Microbial Sciences Institute, Yale University West Campus, West Haven, CT 06516
| | - Zachary D Blount
- BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI 48824
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824
- Department of Biology, Kenyon College, Gambier OH 43022
| | - Alvaro Sánchez
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06511;
- Microbial Sciences Institute, Yale University West Campus, West Haven, CT 06516
| |
Collapse
|
24
|
Gorter FA, Aarts MGM, Zwaan BJ, de Visser JAGM. Local Fitness Landscapes Predict Yeast Evolutionary Dynamics in Directionally Changing Environments. Genetics 2018; 208:307-22. [PMID: 29141909 DOI: 10.1534/genetics.117.300519] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 10/21/2017] [Indexed: 11/18/2022] Open
Abstract
The fitness landscape is a concept that is widely used for understanding and predicting evolutionary adaptation. The topography of the fitness landscape depends critically on the environment, with potentially far-reaching consequences for evolution under changing conditions. However, few studies have assessed directly how empirical fitness landscapes change across conditions, or validated the predicted consequences of such change. We previously evolved replicate yeast populations in the presence of either gradually increasing, or constant high, concentrations of the heavy metals cadmium (Cd), nickel (Ni), and zinc (Zn), and analyzed their phenotypic and genomic changes. Here, we reconstructed the local fitness landscapes underlying adaptation to each metal by deleting all repeatedly mutated genes both by themselves and in combination. Fitness assays revealed that the height, and/or shape, of each local fitness landscape changed considerably across metal concentrations, with distinct qualitative differences between unconditionally (Cd) and conditionally toxic metals (Ni and Zn). This change in topography had particularly crucial consequences in the case of Ni, where a substantial part of the individual mutational fitness effects changed in sign across concentrations. Based on the Ni landscape analyses, we made several predictions about which mutations had been selected when during the evolution experiment. Deep sequencing of population samples from different time points generally confirmed these predictions, demonstrating the power of landscape reconstruction analyses for understanding and ultimately predicting evolutionary dynamics, even under complex scenarios of environmental change.
Collapse
|
25
|
Abstract
Many morphological, behavioral, physiological, and life-history traits covary across the biological scales of individuals, populations, and species. However, the processes that cause traits to covary also change over these scales, challenging our ability to use patterns of trait covariance to infer process. Trait relationships are also widely assumed to have generic functional relationships with similar evolutionary potentials, and even though many different trait relationships are now identified, there is little appreciation that these may influence trait covariation and evolution in unique ways. We use a trait-performance-fitness framework to classify and organize trait relationships into three general classes, address which ones more likely generate trait covariation among individuals in a population, and review how selection shapes phenotypic covariation. We generate predictions about how trait covariance changes within and among populations as a result of trait relationships and in response to selection and consider how these can be tested with comparative data. Careful comparisons of covariation patterns can narrow the set of hypothesized processes that cause trait covariation when the form of the trait relationship and how it responds to selection yield clear predictions about patterns of trait covariation. We discuss the opportunities and limitations of comparative approaches to evaluate hypotheses about the evolutionary causes and consequences of trait covariation and highlight the importance of evaluating patterns within populations replicated in the same and in different selective environments. Explicit hypotheses about trait relationships are key to generating effective predictions about phenotype and its evolution using covariance data.
Collapse
|
26
|
Bayer CD, van Loo B, Hollfelder F. Specificity Effects of Amino Acid Substitutions in Promiscuous Hydrolases: Context-Dependence of Catalytic Residue Contributions to Local Fitness Landscapes in Nearby Sequence Space. Chembiochem 2017; 18:1001-1015. [PMID: 28464395 PMCID: PMC5488252 DOI: 10.1002/cbic.201600657] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Indexed: 12/18/2022]
Abstract
Catalytic promiscuity can facilitate evolution of enzyme functions-a multifunctional catalyst may act as a springboard for efficient functional adaptation. We test the effect of single mutations on multiple activities in two groups of promiscuous AP superfamily members to probe this hypothesis. We quantify the effect of site-saturating mutagenesis of an analogous, nucleophile-flanking residue in two superfamily members: an arylsulfatase (AS) and a phosphonate monoester hydrolase (PMH). Statistical analysis suggests that no one physicochemical characteristic alone explains the mutational effects. Instead, these effects appear to be dominated by their structural context. Likewise, the effect of changing the catalytic nucleophile itself is not reaction-type-specific. Mapping of "fitness landscapes" of four activities onto the possible variation of a chosen sequence position revealed tremendous potential for respecialization of AP superfamily members through single-point mutations, highlighting catalytic promiscuity as a powerful predictor of adaptive potential.
Collapse
Affiliation(s)
- Christopher D. Bayer
- Department of BiochemistryUniversity of Cambridge80 Tennis Court RoadCB2 1GACambridgeUK
- Present address: c-LEcta GmbHPerlickstrasse 504103LeipzigGermany
| | - Bert van Loo
- Department of BiochemistryUniversity of Cambridge80 Tennis Court RoadCB2 1GACambridgeUK
- Present address: Institute for Evolution and BiodiversityUniversity of MünsterHüfferstrasse 148149MünsterGermany
| | - Florian Hollfelder
- Department of BiochemistryUniversity of Cambridge80 Tennis Court RoadCB2 1GACambridgeUK
| |
Collapse
|
27
|
Klesmith JR, Bacik JP, Wrenbeck EE, Michalczyk R, Whitehead TA. Trade-offs between enzyme fitness and solubility illuminated by deep mutational scanning. Proc Natl Acad Sci U S A 2017; 114:2265-70. [PMID: 28196882 DOI: 10.1073/pnas.1614437114] [Citation(s) in RCA: 87] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Proteins are marginally stable, and an understanding of the sequence determinants for improved protein solubility is highly desired. For enzymes, it is well known that many mutations that increase protein solubility decrease catalytic activity. These competing effects frustrate efforts to design and engineer stable, active enzymes without laborious high-throughput activity screens. To address the trade-off between enzyme solubility and activity, we performed deep mutational scanning using two different screens/selections that purport to gauge protein solubility for two full-length enzymes. We assayed a TEM-1 beta-lactamase variant and levoglucosan kinase (LGK) using yeast surface display (YSD) screening and a twin-arginine translocation pathway selection. We then compared these scans with published experimental fitness landscapes. Results from the YSD screen could explain 37% of the variance in the fitness landscapes for one enzyme. Five percent to 10% of all single missense mutations improve solubility, matching theoretical predictions of global protein stability. For a given solubility-enhancing mutation, the probability that it would retain wild-type fitness was correlated with evolutionary conservation and distance to active site, and anticorrelated with contact number. Hybrid classification models were developed that could predict solubility-enhancing mutations that maintain wild-type fitness with an accuracy of 90%. The downside of using such classification models is the removal of rare mutations that improve both fitness and solubility. To reveal the biophysical basis of enhanced protein solubility and function, we determined the crystallographic structure of one such LGK mutant. Beyond fundamental insights into trade-offs between stability and activity, these results have potential biotechnological applications.
Collapse
|
28
|
Abstract
Fitness landscapes determine the course of adaptation by constraining and shaping evolutionary trajectories. Knowledge of the structure of a fitness landscape can thus predict evolutionary outcomes. Empirical fitness landscapes, however, have so far only offered limited insight into real-world questions, as the high dimensionality of sequence spaces makes it impossible to exhaustively measure the fitness of all variants of biologically meaningful sequences. We must therefore revert to statistical descriptions of fitness landscapes that are based on a sparse sample of fitness measurements. It remains unclear, however, how much data are required for such statistical descriptions to be useful. Here, we assess the ability of regression models accounting for single and pairwise mutations to correctly approximate a complex quasi-empirical fitness landscape. We compare approximations based on various sampling regimes of an RNA landscape and find that the sampling regime strongly influences the quality of the regression. On the one hand it is generally impossible to generate sufficient samples to achieve a good approximation of the complete fitness landscape, and on the other hand systematic sampling schemes can only provide a good description of the immediate neighborhood of a sequence of interest. Nevertheless, we obtain a remarkably good and unbiased fit to the local landscape when using sequences from a population that has evolved under strong selection. Thus, current statistical methods can provide a good approximation to the landscape of naturally evolving populations.
Collapse
Affiliation(s)
- Louis du Plessis
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland Insitute for Integrative Biology, ETH Zürich, Zürich, Switzerland Swiss Institute of Bioinformatics, Switzerland
| | - Gabriel E Leventhal
- Insitute for Integrative Biology, ETH Zürich, Zürich, Switzerland Department of Civil and Environmental Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA
| | | |
Collapse
|
29
|
Manukyan N, Eppstein MJ, Buzas JS. Tunably Rugged Landscapes With Known Maximum and Minimum. IEEE Trans Evol Comput 2016; 20:263-274. [PMID: 28804241 PMCID: PMC5553988 DOI: 10.1109/tevc.2015.2454857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We propose NM landscapes as a new class of tunably rugged benchmark problems. NM landscapes are well defined on alphabets of any arity, including both discrete and real-valued alphabets, include epistasis in a natural and transparent manner, are proven to have known value and location of the global maximum and, with some additional constraints, are proven to also have a known global minimum. Empirical studies are used to illustrate that, when coefficients are selected from a recommended distribution, the ruggedness of NM landscapes is smoothly tunable and correlates with several measures of search difficulty. We discuss why these properties make NM landscapes preferable to both NK landscapes and Walsh polynomials as benchmark landscape models with tunable epistasis.
Collapse
Affiliation(s)
- Narine Manukyan
- Department of Computer Science, University of Vermont, Burlington, VT 05401 USA
| | - Margaret J Eppstein
- Department of Computer Science, University of Vermont, Burlington, VT 05401 USA
| | - Jeffrey S Buzas
- Department and Mathematics and Statistics, University of Vermont, Burlington, VT 05401 USA
| |
Collapse
|
30
|
Rodrigues JV, Bershtein S, Li A, Lozovsky ER, Hartl DL, Shakhnovich EI. Biophysical principles predict fitness landscapes of drug resistance. Proc Natl Acad Sci U S A 2016; 113:E1470-8. [PMID: 26929328 DOI: 10.1073/pnas.1601441113] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Fitness landscapes of drug resistance constitute powerful tools to elucidate mutational pathways of antibiotic escape. Here, we developed a predictive biophysics-based fitness landscape of trimethoprim (TMP) resistance for Escherichia coli dihydrofolate reductase (DHFR). We investigated the activity, binding, folding stability, and intracellular abundance for a complete set of combinatorial DHFR mutants made out of three key resistance mutations and extended this analysis to DHFR originated from Chlamydia muridarum and Listeria grayi We found that the acquisition of TMP resistance via decreased drug affinity is limited by a trade-off in catalytic efficiency. Protein stability is concurrently affected by the resistant mutants, which precludes a precise description of fitness from a single molecular trait. Application of the kinetic flux theory provided an accurate model to predict resistance phenotypes (IC50) quantitatively from a unique combination of the in vitro protein molecular properties. Further, we found that a controlled modulation of the GroEL/ES chaperonins and Lon protease levels affects the intracellular steady-state concentration of DHFR in a mutation-specific manner, whereas IC50 is changed proportionally, as indeed predicted by the model. This unveils a molecular rationale for the pleiotropic role of the protein quality control machinery on the evolution of antibiotic resistance, which, as we illustrate here, may drastically confound the evolutionary outcome. These results provide a comprehensive quantitative genotype-phenotype map for the essential enzyme that serves as an important target of antibiotic and anticancer therapies.
Collapse
|
31
|
Abstract
Complex combinatorial problems are most often optimised with heuristic solvers, which usually deliver acceptable results without any indication of the quality obtained. Recently, predictive diagnostic optimisation was proposed as a means of characterising the fitness landscape while optimising a combinatorial problem. The scalars produced by predictive diagnostic optimisation appear to describe the difficulty of the problem with relative reliability. In this study, we record more scalars that may be helpful in determining problem difficulty during the optimisation process and analyse these in combination with other well-known landscape descriptors by using exploratory factor analysis on four landscapes that arise from different search operators, applied to a varied set of quadratic assignment problem instances. Factors are designed to capture properties by combining the collinear variances of several variables. The extracted factors can be interpreted as the features of landscapes detected by the variables, but disappoint in their weak correlations with the result quality achieved by the optimiser, which we regard as the most reliable indicator of difficulty available. It appears that only the prediction error of predictive diagnostic optimisation has a strong correlation with the quality of the results produced, followed by a medium correlation of the fitness distance correlation of the local optima.
Collapse
Affiliation(s)
- I Moser
- Department of Computer Science and Software Engineering, Swinburne University of Technology, Melbourne, Victoria, Australia
| | - M Gheorghita
- Department of Computer Science and Software Engineering, Swinburne University of Technology, Melbourne, Victoria, Australia
| | - A Aleti
- Faculty of Information Technology, Monash University, Melbourne, Victoria, Australia
| |
Collapse
|
32
|
Abstract
Many microbial populations rapidly adapt to changing environments with multiple variants competing for survival. To quantify such complex evolutionary dynamics in vivo, time resolved and genome wide data including rare variants are essential. We performed whole-genome deep sequencing of HIV-1 populations in 9 untreated patients, with 6-12 longitudinal samples per patient spanning 5-8 years of infection. The data can be accessed and explored via an interactive web application. We show that patterns of minor diversity are reproducible between patients and mirror global HIV-1 diversity, suggesting a universal landscape of fitness costs that control diversity. Reversions towards the ancestral HIV-1 sequence are observed throughout infection and account for almost one third of all sequence changes. Reversion rates depend strongly on conservation. Frequent recombination limits linkage disequilibrium to about 100 bp in most of the genome, but strong hitch-hiking due to short range linkage limits diversity.
Collapse
Affiliation(s)
- Fabio Zanini
- Evolutionary Dynamics and Biophysics, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Johanna Brodin
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
| | - Lina Thebo
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
| | - Christa Lanz
- Evolutionary Dynamics and Biophysics, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Göran Bratt
- Department of Clinical Science and Education, Stockholm South General Hospital, Stockholm, Sweden
| | - Jan Albert
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm, Sweden
- Department of Clinical Microbiology, Karolinska University Hospital, Stockholm, Sweden
| | - Richard A Neher
- Evolutionary Dynamics and Biophysics, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
33
|
Abstract
A central aspect of cultural evolutionary theory concerns how human groups respond to environmental change. Although we are painting with a broad brush, it is fair to say that prior to the twenty-first century, adaptation often happened gradually over multiple human generations, through a combination of individual and social learning, cumulative cultural evolution and demographic shifts. The result was a generally resilient and sustainable population. In the twenty-first century, however, considerable change happens within small portions of a human generation, on a vastly larger range of geographical and population scales and involving a greater degree of horizontal learning. As a way of gauging the complexity of societal response to environmental change in a globalized future, we discuss several theoretical tools for understanding how human groups adapt to uncertainty. We use our analysis to estimate the limits of predictability of future societal change, in the belief that knowing when to hedge bets is better than relying on a false sense of predictability.
Collapse
Affiliation(s)
- R Alexander Bentley
- Department of Comparative Cultural Studies, University of Houston, Houston, TX 77204, USA
| | - Michael J O'Brien
- Department of Anthropology, University of Missouri, 317 Lowry Hall, Columbia, MO 65211, USA
| |
Collapse
|
34
|
Rabbers I, van Heerden JH, Nordholt N, Bachmann H, Teusink B, Bruggeman FJ. Metabolism at evolutionary optimal States. Metabolites 2015; 5:311-43. [PMID: 26042723 PMCID: PMC4495375 DOI: 10.3390/metabo5020311] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2015] [Revised: 05/20/2015] [Accepted: 05/25/2015] [Indexed: 01/13/2023] Open
Abstract
Metabolism is generally required for cellular maintenance and for the generation of offspring under conditions that support growth. The rates, yields (efficiencies), adaptation time and robustness of metabolism are therefore key determinants of cellular fitness. For biotechnological applications and our understanding of the evolution of metabolism, it is necessary to figure out how the functional system properties of metabolism can be optimized, via adjustments of the kinetics and expression of enzymes, and by rewiring metabolism. The trade-offs that can occur during such optimizations then indicate fundamental limits to evolutionary innovations and bioengineering. In this paper, we review several theoretical and experimental findings about mechanisms for metabolic optimization.
Collapse
Affiliation(s)
- Iraes Rabbers
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| | - Johan H van Heerden
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| | - Niclas Nordholt
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| | - Herwig Bachmann
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
- NIZO Food Research, 6718 ZB Ede, The Netherlands.
- Top Institute Food and Nutrition, 6700 AN Wageningen, The Netherlands.
| | - Bas Teusink
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| | - Frank J Bruggeman
- Department of Systems Bioinformatics, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| |
Collapse
|
35
|
Manhart M, Morozov AV. Protein folding and binding can emerge as evolutionary spandrels through structural coupling. Proc Natl Acad Sci U S A 2015; 112:1797-802. [PMID: 25624494 DOI: 10.1073/pnas.1415895112] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Binding interactions between proteins and other molecules mediate numerous cellular processes, including metabolism, signaling, and gene regulation. These interactions often evolve in response to changes in the protein's chemical or physical environment (such as the addition of an antibiotic). Several recent studies have shown the importance of folding stability in constraining protein evolution. Here we investigate how structural coupling between folding and binding--the fact that most proteins can only bind their targets when folded--gives rise to an evolutionary coupling between the traits of folding stability and binding strength. Using a biophysical and evolutionary model, we show how these protein traits can emerge as evolutionary "spandrels" even if they do not confer an intrinsic fitness advantage. In particular, proteins can evolve strong binding interactions that have no functional role but merely serve to stabilize the protein if its misfolding is deleterious. Furthermore, such proteins may have divergent fates, evolving to bind or not bind their targets depending on random mutational events. These observations may explain the abundance of apparently nonfunctional interactions among proteins observed in high-throughput assays. In contrast, for proteins with both functional binding and deleterious misfolding, evolution may be highly predictable at the level of biophysical traits: adaptive paths are tightly constrained to first gain extra folding stability and then partially lose it as the new binding function is developed. These findings have important consequences for our understanding of how natural and engineered proteins evolve under selective pressure.
Collapse
|
36
|
Seifert D, Di Giallonardo F, Metzner KJ, Günthard HF, Beerenwinkel N. A framework for inferring fitness landscapes of patient-derived viruses using quasispecies theory. Genetics 2015; 199:191-203. [PMID: 25406469 DOI: 10.1534/genetics.114.172312] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Fitness is a central quantity in evolutionary models of viruses. However, it remains difficult to determine viral fitness experimentally, and existing in vitro assays can be poor predictors of in vivo fitness of viral populations within their hosts. Next-generation sequencing can nowadays provide snapshots of evolving virus populations, and these data offer new opportunities for inferring viral fitness. Using the equilibrium distribution of the quasispecies model, an established model of intrahost viral evolution, we linked fitness parameters to the composition of the virus population, which can be estimated by next-generation sequencing. For inference, we developed a Bayesian Markov chain Monte Carlo method to sample from the posterior distribution of fitness values. The sampler can overcome situations where no maximum-likelihood estimator exists, and it can adaptively learn the posterior distribution of highly correlated fitness landscapes without prior knowledge of their shape. We tested our approach on simulated data and applied it to clinical human immunodeficiency virus 1 samples to estimate their fitness landscapes in vivo. The posterior fitness distributions allowed for differentiating viral haplotypes from each other, for determining neutral haplotype networks, in which no haplotype is more or less credibly fit than any other, and for detecting epistasis in fitness landscapes. Our implemented approach, called QuasiFit, is available at http://www.cbg.ethz.ch/software/quasifit.
Collapse
|
37
|
Abstract
Mutations are the source of evolutionary variation. The interactions of multiple mutations can have important effects on fitness and evolutionary trajectories. We have recently described the distribution of fitness effects of all single mutations for a nine-amino-acid region of yeast Hsp90 (Hsp82) implicated in substrate binding. Here, we report and discuss the distribution of intragenic epistatic effects within this region in seven Hsp90 point mutant backgrounds of neutral to slightly deleterious effect, resulting in an analysis of more than 1,000 double mutants. We find negative epistasis between substitutions to be common, and positive epistasis to be rare--resulting in a pattern that indicates a drastic change in the distribution of fitness effects one step away from the wild type. This can be well explained by a concave relationship between phenotype and genotype (i.e., a concave shape of the local fitness landscape), suggesting mutational robustness intrinsic to the local sequence space. Structural analyses indicate that, in this region, epistatic effects are most pronounced when a solvent-inaccessible position is involved in the interaction. In contrast, all 18 observations of positive epistasis involved at least one mutation at a solvent-exposed position. By combining the analysis of evolutionary and biophysical properties of an epistatic landscape, these results contribute to a more detailed understanding of the complexity of protein evolution.
Collapse
Affiliation(s)
- Claudia Bank
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Ryan T Hietpas
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA
| | - Jeffrey D Jensen
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA
| |
Collapse
|
38
|
Abstract
Much of the current theory of adaptation is based on Gillespie's mutational landscape model (MLM), which assumes that the fitness values of genotypes linked by single mutational steps are independent random variables. On the other hand, a growing body of empirical evidence shows that real fitness landscapes, while possessing a considerable amount of ruggedness, are smoother than predicted by the MLM. In the present article we propose and analyze a simple fitness landscape model with tunable ruggedness based on the rough Mount Fuji (RMF) model originally introduced by Aita et al. in the context of protein evolution. We provide a comprehensive collection of results pertaining to the topographical structure of RMF landscapes, including explicit formulas for the expected number of local fitness maxima, the location of the global peak, and the fitness correlation function. The statistics of single and multiple adaptive steps on the RMF landscape are explored mainly through simulations, and the results are compared to the known behavior in the MLM model. Finally, we show that the RMF model can explain the large number of second-step mutations observed on a highly fit first-step background in a recent evolution experiment with a microvirid bacteriophage.
Collapse
|
39
|
Keepers KG, Martin AP. Fitness landscape of sympatric pupfishes a useful tool for visualizing speciation. Mol Ecol 2014; 23:2144-5. [PMID: 24766631 DOI: 10.1111/mec.12727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Revised: 03/01/2014] [Accepted: 03/14/2014] [Indexed: 11/30/2022]
Abstract
Arguably the most useful model of evolution emerged from the mind of Sewall Wright when he invented the fitness landscape (Wright 1932). In a recent issue of Molecular Ecology, Martin & Feinstein (2014) investigate the genetics and demographic history of an adaptive radiation of pupfish on San Salvador Island. Since the founder species colonized the island 10,000 years ago, two descendent species have appeared and in several lakes all three species (a durophage, a scale-eater, and the generalist ancestral form) coexist. The three species are thought to occupy three distinct fitness peaks. The durophage and generalists' peaks are close, whereas the scale-eater's peak is predicted to be distant and separated from the other two by a deep valley. Consistent with this view, gene flow between the two species on close fitness peaks is greater than the gene flow between these two species and the third species on a more distant peak. Correspondingly, the inferred fitness landscape predicts progress towards speciation, with more limited separation of species on close peaks, and that speciation is more complete for the scale-eater. The article provides an illustrative example of the power afforded by analysis of large numbers of SNPs for estimating key parameters underlying evolutionary divergence.
Collapse
Affiliation(s)
- Kyle G Keepers
- Ecology and Evolutionary Biology, Ramaley N122, Campus Box 334, University of Colorado, Boulder, CO 80309-0334, USA
| | | |
Collapse
|
40
|
Fraïsse C, Elderfield JAD, Welch JJ. The genetics of speciation: are complex incompatibilities easier to evolve? J Evol Biol 2014; 27:688-99. [PMID: 24581268 DOI: 10.1111/jeb.12339] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 12/11/2013] [Accepted: 01/04/2014] [Indexed: 01/22/2023]
Abstract
Reproductive isolation can evolve readily when genotypes containing incompatible alleles are connected by chains of fit intermediates. Experimental crosses show that such Dobzhansky-Muller incompatibilities (DMIs) are often complex (involving alleles at three or more loci) and asymmetrical (such that reciprocal introgressions have very different effects on fitness). One possible explanation is that asymmetrical and complex DMIs are 'easier to evolve', because they block fewer of the possible evolutionary paths between the parental genotypes. To assess this argument, we model evolutionary divergence in allopatry and calculate the delays to divergence caused by DMIs of different kinds. We find that the number of paths is sometimes, though not always, a reliable predictor of the time to divergence. In particular, we find limited support for the idea that symmetrical DMIs take longer to evolve, but this applies largely to two-locus symmetrical DMIs (which leave no path of fit intermediates). Symmetrical complex DMIs can also delay divergence, but only in a limited region of parameter space. In most other cases, the presence and form of DMIs have little influence on times to divergence, and so we argue that ease of evolution is unlikely to be important in explaining the experimental data.
Collapse
Affiliation(s)
- C Fraïsse
- Université Montpellier 2, Montpellier Cedex 5, France; Station Méditerranéenne de l'Environnement Littoral, CNRS, Institut des Sciences de l'Evolution (ISEM UMR 5554), Sete, France; Department of Genetics, University of Cambridge, Cambridge, UK
| | | | | |
Collapse
|
41
|
Lalić J, Elena SF. Magnitude and sign epistasis among deleterious mutations in a positive-sense plant RNA virus. Heredity (Edinb) 2012; 109:71-7. [PMID: 22491062 PMCID: PMC3400743 DOI: 10.1038/hdy.2012.15] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2011] [Revised: 02/21/2012] [Accepted: 02/28/2012] [Indexed: 11/09/2022] Open
Abstract
How epistatic interactions between mutations determine the genetic architecture of fitness is of central importance in evolution. The study of epistasis is particularly interesting for RNA viruses because of their genomic compactness, lack of genetic redundancy, and apparent low complexity. Moreover, interactions between mutations in viral genomes determine traits such as resistance to antiviral drugs, virulence and host range. In this study we generated 53 Tobacco etch potyvirus genotypes carrying pairs of single-nucleotide substitutions and measured their separated and combined deleterious fitness effects. We found that up to 38% of pairs had significant epistasis for fitness, including both positive and negative deviations from the null hypothesis of multiplicative effects. Interestingly, the sign of epistasis was correlated with viral protein-protein interactions in a model network, being predominantly positive between linked pairs of proteins and negative between unlinked ones. Furthermore, 55% of significant interactions were cases of reciprocal sign epistasis (RSE), indicating that adaptive landscapes for RNA viruses maybe highly rugged. Finally, we found that the magnitude of epistasis correlated negatively with the average effect of mutations. Overall, our results are in good agreement to those previously reported for other viruses and further consolidate the view that positive epistasis is the norm for small and compact genomes that lack genetic robustness.
Collapse
Affiliation(s)
- J Lalić
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-UPV, València, Spain and
| | - S F Elena
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas-UPV, València, Spain and
- Santa Fe Institute, Santa Fe NM, USA
| |
Collapse
|
42
|
Abstract
Random mutations under neutral or near-neutral conditions are studied by considering plausible evolutionary trajectories on "neutral nets"-i.e., collections of sequences (genotypes) interconnected via single-point mutations encoding for the same ground-state structure (phenotype). We use simple exact lattice models for the mapping between sequence and conformational spaces. Densities of states based on model intrachain interactions are determined by exhaustive conformational enumeration. We compare results from two very different interaction schemes to ascertain robustness of the conclusions. In both models, sequences in a majority of neutral nets center around a single "prototype sequence" of maximum mutational stability, tolerating the largest number of neutral mutations. General analytical considerations show that these topologies by themselves lead to higher steady-state evolutionary populations at prototype sequences. On average, native thermodynamic stability increases toward a maximum at the prototype sequence, resulting in funnel-like arrangements of native stabilities in sequence space. These observations offer a unified perspective on sequence design, native stability, and mutational stability of proteins. These principles are generalizable from native stability to any measure of fitness provided that its variation with respect to mutations is essentially smooth.
Collapse
Affiliation(s)
- E Bornberg-Bauer
- Theoretical Bioinformatics Group, German Cancer Research Centre Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany
| | | |
Collapse
|