1
|
Abstract
The landscape paradigm is revisited in the light of evolution in simple systems. A brief overview of different classes of fitness landscapes is followed by a more detailed discussion of the RNA model, which is currently the only evolutionary model that allows for a comprehensive molecular analysis of a fitness landscape. Neutral networks of genotypes are indispensable for the success of evolution. Important insights into the evolutionary mechanism are gained by considering the topology of sequence and shape spaces. The dynamic concept of molecular quasispecies is viewed in the light of the landscape paradigm. The distribution of fitness values in state space is mirrored by the population structures of mutant distributions. Two classes of thresholds for replication error or mutations are important: (i) the-conventional-genotypic error threshold, which separates ordered replication from random drift on neutral networks, and (ii) a phenotypic error threshold above which the molecular phenotype is lost. Empirical landscapes are reviewed and finally, the implications of the landscape concept for virus evolution are discussed.
Collapse
Affiliation(s)
- Peter Schuster
- Institut für Theoretische Chemie der Universität Wien, Währingerstraße 17, 1090, Wien, Austria.
| | - Peter F Stadler
- Institut für Informatik der Universität Leipzig, Härtelstraße 16-18, 04107, Leipzig, Germany.,The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| |
Collapse
|
2
|
Rezazadegan R, Reidys C. Degeneracy and genetic assimilation in RNA evolution. BMC Bioinformatics 2018; 19:543. [PMID: 30587112 PMCID: PMC6307299 DOI: 10.1186/s12859-018-2497-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Accepted: 11/16/2018] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND The neutral theory of Motoo Kimura stipulates that evolution is mostly driven by neutral mutations. However adaptive pressure eventually leads to changes in phenotype that involve non-neutral mutations. The relation between neutrality and adaptation has been studied in the context of RNA before and here we further study transitional mutations in the context of degenerate (plastic) RNA sequences and genetic assimilation. We propose quasineutral mutations, i.e. mutations which preserve an element of the phenotype set, as minimal mutations and study their properties. We also propose a general probabilistic interpretation of genetic assimilation and specialize it to the Boltzmann ensemble of RNA sequences. RESULTS We show that degenerate sequences i.e. sequences with more than one structure at the MFE level have the highest evolvability among all sequences and are central to evolutionary innovation. Degenerate sequences also tend to cluster together in the sequence space. The selective pressure in an evolutionary simulation causes the population to move towards regions with more degenerate sequences, i.e. regions at the intersection of different neutral networks, and this causes the number of such sequences to increase well beyond the average percentage of degenerate sequences in the sequence space. We also observe that evolution by quasineutral mutations tends to conserve the number of base pairs in structures and thereby maintains structural integrity even in the presence of pressure to the contrary. CONCLUSIONS We conclude that degenerate RNA sequences play a major role in evolutionary adaptation.
Collapse
Affiliation(s)
- Reza Rezazadegan
- University of Virginia Biocomplexity Institute, 995 Research Park Boulevard, Charlottesville, 22911 USA
| | - Christian Reidys
- University of Virginia Biocomplexity Institute, 995 Research Park Boulevard, Charlottesville, 22911 USA
- Department of Mathematics, University of Virginia, 141 Cabell Drive, Charlottesville, 22904 USA
| |
Collapse
|
3
|
Aita T, Yomo T. Evolutionary dynamics of a polymorphic self-replicator population with a finite population size and hyper mutation rate. J Theor Biol 2015. [PMID: 26209021 DOI: 10.1016/j.jtbi.2015.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Self-replicating biomolecules, subject to experimental evolution, exhibit hyper mutation rates where the genotypes of most offspring have at least a one point mutation. Thus, we formulated the evolutionary dynamics of an asexual self-replicator population with a finite population size and hyper mutation rate, based on the probability density of fitnesses (fitness distribution) for the evolving population. As a case study, we used a Kauffman's "NK fitness landscape". We deduced recurrence relations for the first three cumulants of the fitness distribution and compared them with the results of computer simulations. We found that the evolutionary dynamics is classified in terms of two modes of selection: the "radical mode" and the "gentle mode". In the radical mode, only a small number of genotypes with the highest or near highest fitness values can leave offspring. In the gentle mode, genotypes with moderate fitness values can leave offspring. We clarified how the evolutionary equilibrium and climbing rate depend on given parameters such as gradient and ruggedness of the landscape, mutation rate and population size, in terms of the two modes of selection. Roughly, the radical mode conducts the fast climbing but attains to the stationary states with low fitness, while the gentle mode conducts the slow climbing but attains to the stationary states with high fitness.
Collapse
Affiliation(s)
- Takuyo Aita
- Exploratory Research for Advanced Technology, Japan Science and Technology Agency, Yamadaoka 1-5, Suita, Osaka, Japan
| | - Tetsuya Yomo
- Exploratory Research for Advanced Technology, Japan Science and Technology Agency, Yamadaoka 1-5, Suita, Osaka, Japan; Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, Yamadaoka 1-5, Suita, Osaka, Japan; Graduate School of Frontier Biosciences, Osaka University, Yamadaoka 1-5, Suita, Osaka, Japan.
| |
Collapse
|
4
|
|
5
|
Aita T. Hierarchical distribution of ascending slopes, nearly neutral networks, highlands, and local optima at the dth order in an NK fitness landscape. J Theor Biol 2008; 254:252-63. [DOI: 10.1016/j.jtbi.2008.06.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2007] [Revised: 06/06/2008] [Accepted: 06/07/2008] [Indexed: 10/21/2022]
|
6
|
Aita T, Husimi Y. Fitting protein-folding free energy landscape for a certain conformation to an NK fitness landscape. J Theor Biol 2008; 253:151-61. [DOI: 10.1016/j.jtbi.2008.02.034] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2007] [Revised: 01/22/2008] [Accepted: 02/15/2008] [Indexed: 10/22/2022]
|
7
|
Aita T, Hayashi Y, Toyota H, Husimi Y, Urabe I, Yomo T. Extracting characteristic properties of fitness landscape from in vitro molecular evolution: a case study on infectivity of fd phage to E.coli. J Theor Biol 2007; 246:538-50. [PMID: 17316695 DOI: 10.1016/j.jtbi.2006.12.037] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2006] [Accepted: 12/20/2006] [Indexed: 11/20/2022]
Abstract
We have developed a methodology for extracting characteristic properties of a fitness landscape of interest by analyzing fitness data on an in vitro molecular evolution. The in vitro evolution is required to be conducted as the following "adaptive walk": a single parent sequence generates N mutant sequences as its offsprings, and the fittest individual among the N offsprings will become a new parent in the next generation. N is the library size of mutants to be screened in a single generation. Our theory of the adaptive walk on the "NK landscape" suggests the following: the adaptive walker starting from a random sequence climbs the landscape easily in an early stage, and then reaches a stationary phase in which the mutation-selection-random drift balance sets in. The stationary fitness value is nearly proportional to square root of ln N. Our analysis is performed from the following points: (1) stationary fitness values, (2) time series of fitness in the transitional state, (3) mutant's fitness distribution, and (4) the strength of selection pressure. Applying our methodology, we analyzed experimental data on the in vitro evolution of a random polypeptide (139 amino acids) toward acquiring infectivity (= ability to infect) of fd phage. As a result, we estimated that k is about 27 in this system, indicating that an arbitrary residue in a sequence is affected from other 23% residues. In this article, we demonstrated that the experimental data is consistent with our theoretical equations quantitatively, and that our methodology for extracting characteristic properties of a fitness landscape may be effective.
Collapse
Affiliation(s)
- Takuyo Aita
- Rational Evolutionary Design of Advanced Biomolecules(REDS) Group/JST, Saitama Small Enterprise Promotion Corporation SKIP City, Kawaguchi 333-0844, Japan.
| | | | | | | | | | | |
Collapse
|
8
|
Hayashi Y, Aita T, Toyota H, Husimi Y, Urabe I, Yomo T. Experimental rugged fitness landscape in protein sequence space. PLoS One 2006; 1:e96. [PMID: 17183728 PMCID: PMC1762315 DOI: 10.1371/journal.pone.0000096] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2006] [Accepted: 10/01/2006] [Indexed: 11/30/2022] Open
Abstract
The fitness landscape in sequence space determines the process of biomolecular evolution. To plot the fitness landscape of protein function, we carried out in vitro molecular evolution beginning with a defective fd phage carrying a random polypeptide of 139 amino acids in place of the g3p minor coat protein D2 domain, which is essential for phage infection. After 20 cycles of random substitution at sites 12–130 of the initial random polypeptide and selection for infectivity, the selected phage showed a 1.7×104-fold increase in infectivity, defined as the number of infected cells per ml of phage suspension. Fitness was defined as the logarithm of infectivity, and we analyzed (1) the dependence of stationary fitness on library size, which increased gradually, and (2) the time course of changes in fitness in transitional phases, based on an original theory regarding the evolutionary dynamics in Kauffman's n-k fitness landscape model. In the landscape model, single mutations at single sites among n sites affect the contribution of k other sites to fitness. Based on the results of these analyses, k was estimated to be 18–24. According to the estimated parameters, the landscape was plotted as a smooth surface up to a relative fitness of 0.4 of the global peak, whereas the landscape had a highly rugged surface with many local peaks above this relative fitness value. Based on the landscapes of these two different surfaces, it appears possible for adaptive walks with only random substitutions to climb with relative ease up to the middle region of the fitness landscape from any primordial or random sequence, whereas an enormous range of sequence diversity is required to climb further up the rugged surface above the middle region.
Collapse
Affiliation(s)
- Yuuki Hayashi
- Department of Bioinformatic Engineering, Osaka University, Suita, Osaka, Japan
| | - Takuyo Aita
- Rational Evolutionary Design of Advanced Biomolecules (REDS) Group/JST, Saitama Small Enterprise Promotion Corporation SKIP City, Kawaguchi, Saitama, Japan
- Department of Functional Materials Science, Saitama University, Saitama, Japan
| | - Hitoshi Toyota
- Department of Biotechnology, Osaka University, Suita, Osaka, Japan
| | - Yuzuru Husimi
- Department of Functional Materials Science, Saitama University, Saitama, Japan
| | - Itaru Urabe
- Department of Biotechnology, Osaka University, Suita, Osaka, Japan
| | - Tetsuya Yomo
- Department of Bioinformatic Engineering, Osaka University, Suita, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
- Expoloratory Research for Advanced Technology (ERATO), Japan Science and Technology Agency (JST), Suita, Osaka, Japan
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
9
|
|
10
|
Aita T, Hamamatsu N, Nomiya Y, Uchiyama H, Shibanaka Y, Husimi Y. Surveying a local fitness landscape of a protein with epistatic sites for the study of directed evolution. Biopolymers 2002; 64:95-105. [PMID: 11979520 DOI: 10.1002/bip.10126] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
We present a method for analysis of a fitness landscape of a biopolymer with significantly epistatic sites. The analysis is based on a quasi-additive fitness model. The fitness model is constructed with additive terms conducted by "site-fitness" and epistatic terms conducted by "pair-fitness," where the site-fitness is a fitness contribution from an independent residue and the pair-fitness is a fitness contribution from a pair of epistatic residues. As a case study, we analyzed the sequence-fitness data for 45 clones of thermostable prolyl endopeptidase mutants. They were generated by a mutation scrambling method, which can accumulate advantageous mutations. The fitness contributions from 14 single-point mutations including E67Q and Q656R were identified by the analysis. As a result, we found that the fitness model with a significant epistatic term by a pair of the 67th site and 656th site was in good agreement with the experimental data and that the explored landscape in the binary 14-dimensional sequence space is still a mountainous landscape with twin peaks. The validity was supported by the analysis of mutant fitness distributions derived from another mutation scrambling experiment and by (3D) structural data.
Collapse
Affiliation(s)
- Takuyo Aita
- Tsukuba Research Institute, Novartis Pharma K. K., Ohkubo 8, Tsukuba 300-2611, Japan
| | | | | | | | | | | |
Collapse
|
11
|
|
12
|
Aita T, Husimi Y. Theory of evolutionary molecular engineering through simultaneous accumulation of advantageous mutations. J Theor Biol 2000; 207:543-56. [PMID: 11093838 DOI: 10.1006/jtbi.2000.2195] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We examined the effectiveness of an "adaptive leap" strategy using the "mutation scrambling" method as an efficient optimization technique (Uchiyama, 2000;J. Biochem.128, 441-447) for cases where mutational (rough) additivity holds in fitness. The mutation scrambling method is composed of the following three processes: (1) preliminary selection of several advantageous single-point mutations introduced in a wild-type sequence; (2) preparation of various multiple-point mutants incorporating the advantageous mutant residue or wild-type residue at each of the selected sites, by scrambling the mutant residues and wild-type residues (this process is called mutation scrambling); and (3) selection of the fittest through screening of the mutant pool. The fitness distribution in the mutant pool is controlled by the mixing ratio of the mutant residues to the wild-type residues. We focused on the mutant fitness distribution and obtained the optimal mixing ratio which efficiently generates superior multiple-point mutants with high fitnesses. As a result, we found that the optimal ratio lies between 7/3 and 9/1 in realistic cases. Particularly, this strategy works well in cases where the number of component mutations is large and the size of the population to be screened is small. Analysis of the mutant fitness distributions with various mixing ratios is also useful to explore local fitness landscapes.
Collapse
Affiliation(s)
- T Aita
- Tsukuba Research Institute, Novartis Pharma K.K., Ohkubo 8, Tsukuba, 300-2611, Japan
| | | |
Collapse
|
13
|
Aita T, Uchiyama H, Inaoka T, Nakajima M, Kokubo T, Husimi Y. Analysis of a local fitness landscape with a model of the rough Mt. Fuji-type landscape: application to prolyl endopeptidase and thermolysin. Biopolymers 2000; 54:64-79. [PMID: 10799982 DOI: 10.1002/(sici)1097-0282(200007)54:1<64::aid-bip70>3.0.co;2-r] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
A method of analysis of a local fitness landscape for a current biopolymer is presented. Based on the assumption of additivity of mutational effects in the biopolymer, we assigned a site-fitness to each residue at each site. The assigned values of site-fitnesses were obtained by the least-squares method to minimize discrepancies between experimental fitnesses and theoretical ones. As test cases, we analyzed a section of a local landscape for the thermostability of prolyl endopeptidase and that for the enzymatic activity of thermolysin. These sections were proved to be of the rough Mt. Fuji-type with straight theta values of larger than 1.0, where straight theta is defined as the ratio of the "mean slope" to the "degree of roughness" on the fitness surface. Furthermore, we theoretically explained discrepancies between the fitnesses of multiple mutants and those predicted based on strict additivity of the component mutations by using a model of the rough Mt. Fuji-type landscape. According to this model, the discrepancies depend on the local landscape property (such as the straight theta value) and the location of the wild type on the landscape and the mean change in fitness by the component mutations. Our results suggest that this model may provide a good approximation of real sections of local landscapes for current biopolymers phenomenologically.
Collapse
Affiliation(s)
- T Aita
- Department of Functional Materials Science Saitama University Urawa 338, Japan
| | | | | | | | | | | |
Collapse
|
14
|
Aita T, Uchiyama H, Inaoka T, Nakajima M, Kokubo T, Husimi Y. Analysis of a local fitness landscape with a model of the rough Mt. Fuji-type landscape: Application to prolyl endopeptidase and thermolysin. Biopolymers 2000. [DOI: 10.1002/(sici)1097-0282(200007)54:1%3c64::aid-bip70%3e3.0.co;2-r] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
15
|
Abstract
Directed enzyme evolution has emerged in the past few years as a powerful alternative to rational approaches for engineering biocatalysts. Prerequisites for successful directed evolution are functional expression in a suitable microbial host, a rapid screen for the desired feature(s) and a well-thought-out working strategy for navigating protein landscapes. The rapidly growing body of literature on enzyme evolution in vitro includes techniques for creating and searching combinatorial enzyme libraries, as well as several successful examples of different evolutionary strategies being used.
Collapse
Affiliation(s)
- O Kuchner
- Division of Chemistry and Chemical Engineering 210-41, California Institute of Technology, Pasadena 91125, USA
| | | |
Collapse
|
16
|
Schuster P. Genotypes with phenotypes: Adventures in an RNA toy world. Biophys Chem 1997; 66:75-110. [PMID: 17029873 DOI: 10.1016/s0301-4622(97)00058-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/1997] [Accepted: 04/17/1997] [Indexed: 11/28/2022]
Abstract
Evolution has created the complexity of the animate world and deciphering the language of evolution is the key towards understanding nature. The dynamics of evolution is simplified by considering it as a superposition of three less sophisticated processes: population dynamics, population support dynamics, and genotype-phenotype mapping. Evolution of molecules in laboratory assays provides a sufficiently simple system for the quantitative analysis of the three phenomena. Coarse-grained notions of structures like RNA secondary structures are used as model phenotypes. They provide an excellent tool for a comprehensive analysis of the entire complex of molecular evolution. The mapping from RNA genotypes into secondary structures is highly redundant. In order to find at least one sequence for every common structures one need only search a (relatively) small part of sequence space. The existence of selectively neutral phenotypes plays an important role for the the success and the efficiency of evolutionary optimization. Molecular evolution found a highly promising technological application in the design of biomolecules with predefined properties.
Collapse
Affiliation(s)
- P Schuster
- Institut für Theoretische Chemie und Strahlenchemie, Universität Wien, A-1090 Wien, Austria
| |
Collapse
|
17
|
|
18
|
Weinberger ED. Local properties of Kauffman's N-k model: A tunably rugged energy landscape. PHYSICAL REVIEW. A, ATOMIC, MOLECULAR, AND OPTICAL PHYSICS 1991; 44:6399-6413. [PMID: 9905770 DOI: 10.1103/physreva.44.6399] [Citation(s) in RCA: 58] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
|
19
|
Hoffmann S. The Nucleoproteinic System. Chirality 1991. [DOI: 10.1007/978-3-642-76569-8_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
20
|
Fontana W, Schnabl W, Schuster P. Physical aspects of evolutionary optimization and adaptation. PHYSICAL REVIEW. A, GENERAL PHYSICS 1989; 40:3301-3321. [PMID: 9902537 DOI: 10.1103/physreva.40.3301] [Citation(s) in RCA: 97] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
21
|
Schuster P, Swetina J. Stationary mutant distributions and evolutionary optimization. Bull Math Biol 1988; 50:635-60. [PMID: 3219448 DOI: 10.1007/bf02460094] [Citation(s) in RCA: 161] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
22
|
|
23
|
Abstract
Molecular evolution is viewed as a typical combinatorial optimization problem. We analyse a chemical reaction model which considers RNA replication including correct copying and point mutations together with hydrolytic degradation and the dilution flux of a flow reactor. The corresponding stochastic reaction network is implemented on a computer in order to investigate some basic features of evolutionary optimization dynamics. Characteristic features of real molecular systems are mimicked by folding binary sequences into unknotted two-dimensional structures. Selective values are derived from these molecular 'phenotypes' by an evaluation procedure which assigns numerical values to different elements of the secondary structure. The fitness function obtained thereby contains nontrivial long-range interactions which are typical for real systems. The fitness landscape also reveals quite involved and bizarre local topologies which we consider also representative of polynucleotide replication in actually occurring systems. Optimization operates on an ensemble of sequences via mutation and natural selection. The strategy observed in the simulation experiments is fairly general and resembles closely a heuristic widely applied in operations research areas. Despite the relative smallness of the system--we study 2000 molecules of chain length v = 70 in a typical simulation experiment--features typical for the evolution of real populations are observed as there are error thresholds for replication, evolutionary steps and quasistationary sequence distributions. The relative importance of selectively neutral or almost neutral variants is discussed quantitatively. Four characteristic ensemble properties, entropy of the distribution, ensemble correlation, mean Hamming distance and diversity of the population, are computed and checked for their sensitivity in recording major optimization events during the simulation.
Collapse
|