1
|
Zhao R, Lukacsovich T, Gaut R, Emerson JJ. FREQ-Seq2: a method for precise high-throughput combinatorial quantification of allele frequencies. G3 (BETHESDA, MD.) 2023; 13:jkad162. [PMID: 37494033 PMCID: PMC10542570 DOI: 10.1093/g3journal/jkad162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 01/26/2023] [Accepted: 07/14/2023] [Indexed: 07/27/2023]
Abstract
The accurate determination of allele frequencies is crucially important across a wide range of problems in genetics, such as developing population genetic models, making inferences from genome-wide association studies, determining genetic risk for diseases, as well as other scientific and medical applications. Furthermore, understanding how allele frequencies change over time in populations is central to ascertaining their evolutionary dynamics. We present a precise, efficient, and economical method (FREQ-Seq2) for quantifying the relative frequencies of different alleles at loci of interest in mixed population samples. Through the creative use of paired barcode sequences, we exponentially increased the throughput of the original FREQ-Seq method from 48 to 2,304 samples. FREQ-Seq2 can be targeted to specific genomic regions of interest, which are amplified using universal barcoded adapters to generate Illumina sequencing libraries. Our enhanced method, available as a kit along with open-source software for analyzing sequenced libraries, enables the detection and removal of errors that are undetectable in the original FREQ-Seq method as well as other conventional methods for allele frequency quantification. Finally, we validated the performance of our sequencing-based approach with a highly multiplexed set of control samples as well as a competitive evolution experiment in Escherichia coli and compare the latter to estimates derived from manual colony counting. Our analyses demonstrate that FREQ-Seq2 is flexible, inexpensive, and produces large amounts of data with low error, low noise, and desirable statistical properties. In summary, FREQ-Seq2 is a powerful method for quantifying allele frequency that provides a versatile approach for profiling mixed populations.
Collapse
Affiliation(s)
- Roy Zhao
- Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA
| | - Tamas Lukacsovich
- Brain Research Institute, University of Zürich, 8057 Zürich, Switzerland
| | - Rebecca Gaut
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA
| | - J J Emerson
- Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA
| |
Collapse
|
2
|
Abstract
We consider evolution of a large population, where fitness of each organism is defined by many phenotypical traits. These traits result from expression of many genes. Under some assumptions on fitness we prove that such model organisms are capable, to some extent, to recognize the fitness landscape. That fitness landscape learning sharply reduces the number of mutations needed for adaptation. Moreover, this learning increases phenotype robustness with respect to mutations, i.e., canalizes the phenotype. We show that learning and canalization work only when evolution is gradual. Organisms can be adapted to many constraints associated with a hard environment, if that environment becomes harder step by step. Our results explain why evolution can involve genetic changes of a relatively large effect and why the total number of changes are surprisingly small.
Collapse
Affiliation(s)
- John Reinitz
- Departments of Statistics, Ecology and Evolution, Molecular Genetics and Cell Biology, University of Chicago, Chicago, IL, USA
| | - Sergey Vakulenko
- Saint Petersburg National Research University of Information Technologies, Mechanics and Optics, Saint Petersburg, Russian Federation
| | - Dmitri Grigoriev
- CNRS, Mathématiques, Université de Lille, Villeneuve d'Ascq, France
| | - Andreas Weber
- Department of Computer Science, University of Bonn, Bonn, Germany
| |
Collapse
|
3
|
Trubenová B, Krejca MS, Lehre PK, Kötzing T. Surfing on the seascape: Adaptation in a changing environment. Evolution 2019; 73:1356-1374. [PMID: 31206653 PMCID: PMC6771940 DOI: 10.1111/evo.13784] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 04/15/2019] [Indexed: 12/11/2022]
Abstract
The environment changes constantly at various time scales and, in order to survive, species need to keep adapting. Whether these species succeed in avoiding extinction is a major evolutionary question. Using a multilocus evolutionary model of a mutation-limited population adapting under strong selection, we investigate the effects of the frequency of environmental fluctuations on adaptation. Our results rely on an "adaptive-walk" approximation and use mathematical methods from evolutionary computation theory to investigate the interplay between fluctuation frequency, the similarity of environments, and the number of loci contributing to adaptation. First, we assume a linear additive fitness function, but later generalize our results to include several types of epistasis. We show that frequent environmental changes prevent populations from reaching a fitness peak, but they may also prevent the large fitness loss that occurs after a single environmental change. Thus, the population can survive, although not thrive, in a wide range of conditions. Furthermore, we show that in a frequently changing environment, the similarity of threats that a population faces affects the level of adaptation that it is able to achieve. We check and supplement our analytical results with simulations.
Collapse
Affiliation(s)
- Barbora Trubenová
- Institute of Science and Technology AustriaAm Campus 1Klosterneuburg 3400Austria
| | - Martin S. Krejca
- Hasso Plattner InstituteProf.‐Dr.‐Helmert‐Straße 2‐314482 PotsdamGermany
| | | | - Timo Kötzing
- Hasso Plattner InstituteProf.‐Dr.‐Helmert‐Straße 2‐314482 PotsdamGermany
| |
Collapse
|
4
|
Abstract
We consider evolution of a large population, where fitness of each organism is defined by many phenotypical traits. These traits result from expression of many genes. Under some assumptions on fitness we prove that such model organisms are capable, to some extent, to recognize the fitness landscape. That fitness landscape learning sharply reduces the number of mutations needed for adaptation. Moreover, this learning increases phenotype robustness with respect to mutations, i.e., canalizes the phenotype. We show that learning and canalization work only when evolution is gradual. Organisms can be adapted to many constraints associated with a hard environment, if that environment becomes harder step by step. Our results explain why evolution can involve genetic changes of a relatively large effect and why the total number of changes are surprisingly small.
Collapse
Affiliation(s)
- John Reinitz
- Departments of Statistics, Ecology and Evolution, Molecular Genetics and Cell Biology, University of Chicago, Chicago, IL, USA
| | - Sergey Vakulenko
- Saint Petersburg National Research University of Information Technologies, Mechanics and Optics, Saint Petersburg, Russian Federation
| | - Dmitri Grigoriev
- CNRS, Mathématiques, Université de Lille, Villeneuve d'Ascq, France
| | - Andreas Weber
- Department of Computer Science, University of Bonn, Bonn, Germany
| |
Collapse
|
5
|
Computational Complexity as an Ultimate Constraint on Evolution. Genetics 2019; 212:245-265. [PMID: 30833289 DOI: 10.1534/genetics.119.302000] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Accepted: 02/22/2019] [Indexed: 01/28/2023] Open
Abstract
Experiments show that evolutionary fitness landscapes can have a rich combinatorial structure due to epistasis. For some landscapes, this structure can produce a computational constraint that prevents evolution from finding local fitness optima-thus overturning the traditional assumption that local fitness peaks can always be reached quickly if no other evolutionary forces challenge natural selection. Here, I introduce a distinction between easy landscapes of traditional theory where local fitness peaks can be found in a moderate number of steps, and hard landscapes where finding local optima requires an infeasible amount of time. Hard examples exist even among landscapes with no reciprocal sign epistasis; on these semismooth fitness landscapes, strong selection weak mutation dynamics cannot find the unique peak in polynomial time. More generally, on hard rugged fitness landscapes that include reciprocal sign epistasis, no evolutionary dynamics-even ones that do not follow adaptive paths-can find a local fitness optimum quickly. Moreover, on hard landscapes, the fitness advantage of nearby mutants cannot drop off exponentially fast but has to follow a power-law that long-term evolution experiments have associated with unbounded growth in fitness. Thus, the constraint of computational complexity enables open-ended evolution on finite landscapes. Knowing this constraint allows us to use the tools of theoretical computer science and combinatorial optimization to characterize the fitness landscapes that we expect to see in nature. I present candidates for hard landscapes at scales from single genes, to microbes, to complex organisms with costly learning (Baldwin effect) or maintained cooperation (Hankshaw effect). Just how ubiquitous hard landscapes (and the corresponding ultimate constraint on evolution) are in nature becomes an open empirical question.
Collapse
|
6
|
Fragata I, Blanckaert A, Dias Louro MA, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol 2019; 34:69-82. [DOI: 10.1016/j.tree.2018.10.009] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 10/16/2018] [Accepted: 10/17/2018] [Indexed: 01/28/2023]
|
7
|
Heredia JP. Modelling Evolutionary Algorithms with Stochastic Differential Equations. EVOLUTIONARY COMPUTATION 2017; 26:657-686. [PMID: 29155604 DOI: 10.1162/evco_a_00216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
There has been renewed interest in modelling the behaviour of evolutionary algorithms (EAs) by more traditional mathematical objects, such as ordinary differential equations or Markov chains. The advantage is that the analysis becomes greatly facilitated due to the existence of well established methods. However, this typically comes at the cost of disregarding information about the process. Here, we introduce the use of stochastic differential equations (SDEs) for the study of EAs. SDEs can produce simple analytical results for the dynamics of stochastic processes, unlike Markov chains which can produce rigorous but unwieldy expressions about the dynamics. On the other hand, unlike ordinary differential equations (ODEs), they do not discard information about the stochasticity of the process. We show that these are especially suitable for the analysis of fixed budget scenarios and present analogues of the additive and multiplicative drift theorems from runtime analysis. In addition, we derive a new more general multiplicative drift theorem that also covers non-elitist EAs. This theorem simultaneously allows for positive and negative results, providing information on the algorithm's progress even when the problem cannot be optimised efficiently. Finally, we provide results for some well-known heuristics namely Random Walk (RW), Random Local Search (RLS), the (1+1) EA, the Metropolis Algorithm (MA), and the Strong Selection Weak Mutation (SSWM) algorithm.
Collapse
Affiliation(s)
- Jorge Pérez Heredia
- Department of Computer Science, University of Sheffield, Sheffield S1 4DP, United Kingdom
| |
Collapse
|
8
|
Oliveto PS, Paixão T, Pérez Heredia J, Sudholt D, Trubenová B. How to Escape Local Optima in Black Box Optimisation: When Non-elitism Outperforms Elitism. ALGORITHMICA 2017; 80:1604-1633. [PMID: 30996504 PMCID: PMC6438649 DOI: 10.1007/s00453-017-0369-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Accepted: 08/19/2017] [Indexed: 06/09/2023]
Abstract
Escaping local optima is one of the major obstacles to function optimisation. Using the metaphor of a fitness landscape, local optima correspond to hills separated by fitness valleys that have to be overcome. We define a class of fitness valleys of tunable difficulty by considering their length, representing the Hamming path between the two optima and their depth, the drop in fitness. For this function class we present a runtime comparison between stochastic search algorithms using different search strategies. The ( 1 + 1 ) EA is a simple and well-studied evolutionary algorithm that has to jump across the valley to a point of higher fitness because it does not accept worsening moves (elitism). In contrast, the Metropolis algorithm and the Strong Selection Weak Mutation (SSWM) algorithm, a famous process in population genetics, are both able to cross the fitness valley by accepting worsening moves. We show that the runtime of the ( 1 + 1 ) EA depends critically on the length of the valley while the runtimes of the non-elitist algorithms depend crucially on the depth of the valley. Moreover, we show that both SSWM and Metropolis can also efficiently optimise a rugged function consisting of consecutive valleys.
Collapse
Affiliation(s)
| | - Tiago Paixão
- IST Austria, Am Campus 1, 3400 Klosterneuburg, Austria
| | | | | | | |
Collapse
|