1
|
Sappington A, Mohanty V. Probabilistic Genotype-Phenotype Maps Reveal Mutational Robustness of RNA Folding, Spin Glasses, and Quantum Circuits. ARXIV 2024:arXiv:2301.01847v2. [PMID: 36713233 PMCID: PMC9882568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Recent studies of genotype-phenotype (GP) maps have reported universally enhanced phenotypic robustness to genotype mutations, a feature essential to evolution. Virtually all of these studies make a simplifying assumption that each genotype-represented as a sequence-maps deterministically to a single phenotype, such as a discrete structure. Here, we introduce probabilistic genotype-phenotype (PrGP) maps, where each genotype maps to a vector of phenotype probabilities, as a more realistic and universal language for investigating robustness in a variety of physical, biological, and computational systems. We study three model systems to show that PrGP maps offer a generalized framework which can handle uncertainty emerging from various physical sources: (1) thermal fluctuation in RNA folding, (2) external field disorder in spin glass ground state finding, and (3) superposition and entanglement in quantum circuits, which are realized experimentally on IBM quantum computers. In all three cases, we observe a novel biphasic robustness scaling which is enhanced relative to random expectation for more frequent phenotypes and approaches random expectation for less frequent phenotypes. We derive an analytical theory for the behavior of PrGP robustness, and we demonstrate that the theory is highly predictive of empirical robustness.
Collapse
Affiliation(s)
- Anna Sappington
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139
- Harvard-MIT Health Sciences and Technology, Harvard Medical School, Boston, MA 02115 and Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Vaibhav Mohanty
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138
- Harvard-MIT Health Sciences and Technology, Harvard Medical School, Boston, MA 02115 and Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
2
|
Schmutzer M, Dasmeh P, Wagner A. Frustration can Limit the Adaptation of Promiscuous Enzymes Through Gene Duplication and Specialisation. J Mol Evol 2024; 92:104-120. [PMID: 38470504 PMCID: PMC10978624 DOI: 10.1007/s00239-024-10161-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 02/16/2024] [Indexed: 03/14/2024]
Abstract
Virtually all enzymes catalyse more than one reaction, a phenomenon known as enzyme promiscuity. It is unclear whether promiscuous enzymes are more often generalists that catalyse multiple reactions at similar rates or specialists that catalyse one reaction much more efficiently than other reactions. In addition, the factors that shape whether an enzyme evolves to be a generalist or a specialist are poorly understood. To address these questions, we follow a three-pronged approach. First, we examine the distribution of promiscuity in empirical enzymes reported in the BRENDA database. We find that the promiscuity distribution of empirical enzymes is bimodal. In other words, a large fraction of promiscuous enzymes are either generalists or specialists, with few intermediates. Second, we demonstrate that enzyme biophysics is not sufficient to explain this bimodal distribution. Third, we devise a constraint-based model of promiscuous enzymes undergoing duplication and facing selection pressures favouring subfunctionalization. The model posits the existence of constraints between the catalytic efficiencies of an enzyme for different reactions and is inspired by empirical case studies. The promiscuity distribution predicted by our constraint-based model is consistent with the empirical bimodal distribution. Our results suggest that subfunctionalization is possible and beneficial only in certain enzymes. Furthermore, the model predicts that conflicting constraints and selection pressures can cause promiscuous enzymes to enter a 'frustrated' state, in which competing interactions limit the specialisation of enzymes. We find that frustration can be both a driver and an inhibitor of enzyme evolution by duplication and subfunctionalization. In addition, our model predicts that frustration becomes more likely as enzymes catalyse more reactions, implying that natural selection may prefer catalytically simple enzymes. In sum, our results suggest that frustration may play an important role in enzyme evolution.
Collapse
Affiliation(s)
- Michael Schmutzer
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pouria Dasmeh
- Center for Human Genetics, Philipps University of Marburg, Marburg, Germany
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
- Santa Fe Institute, Santa Fe, NM, USA.
| |
Collapse
|
3
|
Mohanty V, Greenbury SF, Sarkany T, Narayanan S, Dingle K, Ahnert SE, Louis AA. Maximum mutational robustness in genotype-phenotype maps follows a self-similar blancmange-like curve. J R Soc Interface 2023; 20:20230169. [PMID: 37491910 PMCID: PMC10369032 DOI: 10.1098/rsif.2023.0169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 06/27/2023] [Indexed: 07/27/2023] Open
Abstract
Phenotype robustness, defined as the average mutational robustness of all the genotypes that map to a given phenotype, plays a key role in facilitating neutral exploration of novel phenotypic variation by an evolving population. By applying results from coding theory, we prove that the maximum phenotype robustness occurs when genotypes are organized as bricklayer's graphs, so-called because they resemble the way in which a bricklayer would fill in a Hamming graph. The value of the maximal robustness is given by a fractal continuous everywhere but differentiable nowhere sums-of-digits function from number theory. Interestingly, genotype-phenotype maps for RNA secondary structure and the hydrophobic-polar (HP) model for protein folding can exhibit phenotype robustness that exactly attains this upper bound. By exploiting properties of the sums-of-digits function, we prove a lower bound on the deviation of the maximum robustness of phenotypes with multiple neutral components from the bricklayer's graph bound, and show that RNA secondary structure phenotypes obey this bound. Finally, we show how robustness changes when phenotypes are coarse-grained and derive a formula and associated bounds for the transition probabilities between such phenotypes.
Collapse
Affiliation(s)
- Vaibhav Mohanty
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK
- Program in Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge, MA, USA
- MD-PhD Program, Harvard Medical School, Boston, MA, USA and Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Sam F. Greenbury
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK
- The Alan Turing Institute, British Library, London, UK
| | - Tasmin Sarkany
- Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
| | - Shyam Narayanan
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kamaludin Dingle
- Department of Mathematics and Natural Sciences, Centre for Applied Mathematics and Bioinformatics (CAMB), Gulf University of Science and Technology, Kuwait
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
| | - Sebastian E. Ahnert
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK
- Department of Chemical Engineering and Biotechnology, Cavendish Laboratory, University of Cambridge, Cambridge, UK
- The Alan Turing Institute, British Library, London, UK
| | - Ard A. Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, UK
| |
Collapse
|
4
|
Santos-Moreno J, Tasiudi E, Kusumawardhani H, Stelling J, Schaerli Y. Robustness and innovation in synthetic genotype networks. Nat Commun 2023; 14:2454. [PMID: 37117168 PMCID: PMC10147661 DOI: 10.1038/s41467-023-38033-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 04/13/2023] [Indexed: 04/30/2023] Open
Abstract
Genotype networks are sets of genotypes connected by small mutational changes that share the same phenotype. They facilitate evolutionary innovation by enabling the exploration of different neighborhoods in genotype space. Genotype networks, first suggested by theoretical models, have been empirically confirmed for proteins and RNAs. Comparative studies also support their existence for gene regulatory networks (GRNs), but direct experimental evidence is lacking. Here, we report the construction of three interconnected genotype networks of synthetic GRNs producing three distinct phenotypes in Escherichia coli. Our synthetic GRNs contain three nodes regulating each other by CRISPR interference and governing the expression of fluorescent reporters. The genotype networks, composed of over twenty different synthetic GRNs, provide robustness in face of mutations while enabling transitions to innovative phenotypes. Through realistic mathematical modeling, we quantify robustness and evolvability for the complete genotype-phenotype map and link these features mechanistically to GRN motifs. Our work thereby exemplifies how GRN evolution along genotype networks might be driving evolutionary innovation.
Collapse
Affiliation(s)
- Javier Santos-Moreno
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
- Department of Medicine and Life Sciences, Pompeu Fabra University, 00803, Barcelona, Spain
| | - Eve Tasiudi
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Hadiastri Kusumawardhani
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland
| | - Joerg Stelling
- Department of Biosystems Science and Engineering, ETH Zurich and SIB Swiss Institute of Bioinformatics, Basel, Switzerland.
| | - Yolanda Schaerli
- Department of Fundamental Microbiology, University of Lausanne, Biophore Building, 1015, Lausanne, Switzerland.
| |
Collapse
|
5
|
Mohanty V, Louis AA. Robustness and stability of spin-glass ground states to perturbed interactions. Phys Rev E 2023; 107:014126. [PMID: 36797942 DOI: 10.1103/physreve.107.014126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 12/16/2022] [Indexed: 06/18/2023]
Abstract
Across many problems in science and engineering, it is important to consider how much the output of a given system changes due to perturbations of the input. Here, we investigate the glassy phase of ±J spin glasses at zero temperature by calculating the robustness of the ground states to flips in the sign of single interactions. For random graphs and the Sherrington-Kirkpatrick model, we find relatively large sets of bond configurations that generate the same ground state. These sets can themselves be analyzed as subgraphs of the interaction domain, and we compute many of their topological properties. In particular, we find that the robustness, equivalent to the average degree, of these subgraphs is much higher than one would expect from a random model. Most notably, it scales in the same logarithmic way with the size of the subgraph as has been found in genotype-phenotype maps for RNA secondary structure folding, protein quaternary structure, gene regulatory networks, as well as for models for genetic programming. The similarity between these disparate systems suggests that this scaling may have a more universal origin.
Collapse
Affiliation(s)
- Vaibhav Mohanty
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, OX1 3NP, United Kingdom
- MD-PhD Program and Program in Health Sciences and Technology, Harvard Medical School, Boston, Massachusetts 02125, USA and Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Ard A Louis
- Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, OX1 3NP, United Kingdom
| |
Collapse
|
6
|
Abstract
Transcriptomes are known to organize themselves into gene co-expression clusters or modules where groups of genes display distinct patterns of coordinated or synchronous expression across independent biological samples. The functional significance of these co-expression clusters is suggested by the fact that highly coexpressed groups of genes tend to be enriched in genes involved in common functions and biological processes. While gene co-expression is widely assumed to reflect close regulatory proximity, the validity of this assumption remains unclear. Here we use a simple synthetic gene regulatory network (GRN) model and contrast the resulting co-expression structure produced by these networks with their known regulatory architecture and with the co-expression structure measured in available human expression data. Using randomization tests, we found that the levels of co-expression observed in simulated expression data were, just as with empirical data, significantly higher than expected by chance. When examining the source of correlated expression, we found that individual regulators, both in simulated and experimental data, fail, on average, to display correlated expression with their immediate targets. However, highly correlated gene pairs tend to share at least one common regulator, while most gene pairs sharing common regulators do not necessarily display correlated expression. Our results demonstrate that widespread co-expression naturally emerges in regulatory networks, and that it is a reliable and direct indicator of active co-regulation in a given cellular context.
Collapse
|
7
|
Hagolani PF, Zimm R, Vroomans R, Salazar-Ciudad I. On the evolution and development of morphological complexity: A view from gene regulatory networks. PLoS Comput Biol 2021; 17:e1008570. [PMID: 33626036 PMCID: PMC7939363 DOI: 10.1371/journal.pcbi.1008570] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 03/08/2021] [Accepted: 11/27/2020] [Indexed: 12/26/2022] Open
Abstract
How does morphological complexity evolve? This study suggests that the likelihood of mutations increasing phenotypic complexity becomes smaller when the phenotype itself is complex. In addition, the complexity of the genotype-phenotype map (GPM) also increases with the phenotypic complexity. We show that complex GPMs and the above mutational asymmetry are inevitable consequences of how genes need to be wired in order to build complex and robust phenotypes during development. We randomly wired genes and cell behaviors into networks in EmbryoMaker. EmbryoMaker is a mathematical model of development that can simulate any gene network, all animal cell behaviors (division, adhesion, apoptosis, etc.), cell signaling, cell and tissues biophysics, and the regulation of those behaviors by gene products. Through EmbryoMaker we simulated how each random network regulates development and the resulting morphology (i.e. a specific distribution of cells and gene expression in 3D). This way we obtained a zoo of possible 3D morphologies. Real gene networks are not random, but a random search allows a relatively unbiased exploration of what is needed to develop complex robust morphologies. Compared to the networks leading to simple morphologies, the networks leading to complex morphologies have the following in common: 1) They are rarer; 2) They need to be finely tuned; 3) Mutations in them tend to decrease morphological complexity; 4) They are less robust to noise; and 5) They have more complex GPMs. These results imply that, when complexity evolves, it does so at a progressively decreasing rate over generations. This is because as morphological complexity increases, the likelihood of mutations increasing complexity decreases, morphologies become less robust to noise, and the GPM becomes more complex. We find some properties in common, but also some important differences, with non-developmental GPM models (e.g. RNA, protein and gene networks in single cells).
Collapse
Affiliation(s)
- Pascal F. Hagolani
- Evo-devo Helsinki community, Centre of Excellence in Experimental and Computational Developmental Biology, Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Roland Zimm
- Evo-devo Helsinki community, Centre of Excellence in Experimental and Computational Developmental Biology, Institute of Biotechnology, University of Helsinki, Helsinki, Finland
- Institute of Functional Genomics, École Normale Superieure, Lyon, France
- Konrad Lorenz Insititute for Evolution and Cognition Research, Vienna, Austria
| | - Renske Vroomans
- Origins Center, Nijenborgh, Groningen, The Netherlands
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands
| | - Isaac Salazar-Ciudad
- Evo-devo Helsinki community, Centre of Excellence in Experimental and Computational Developmental Biology, Institute of Biotechnology, University of Helsinki, Helsinki, Finland
- Genomics, Bioinformatics and Evolution group, Departament de Genètica i Microbiologia, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Centre de Rercerca Matemàtica, Cerdanyola del Vallès, Spain
| |
Collapse
|
8
|
Webster-Wood VA, Gill JP, Thomas PJ, Chiel HJ. Control for multifunctionality: bioinspired control based on feeding in Aplysia californica. BIOLOGICAL CYBERNETICS 2020; 114:557-588. [PMID: 33301053 PMCID: PMC8543386 DOI: 10.1007/s00422-020-00851-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/20/2020] [Indexed: 06/12/2023]
Abstract
Animals exhibit remarkable feats of behavioral flexibility and multifunctional control that remain challenging for robotic systems. The neural and morphological basis of multifunctionality in animals can provide a source of bioinspiration for robotic controllers. However, many existing approaches to modeling biological neural networks rely on computationally expensive models and tend to focus solely on the nervous system, often neglecting the biomechanics of the periphery. As a consequence, while these models are excellent tools for neuroscience, they fail to predict functional behavior in real time, which is a critical capability for robotic control. To meet the need for real-time multifunctional control, we have developed a hybrid Boolean model framework capable of modeling neural bursting activity and simple biomechanics at speeds faster than real time. Using this approach, we present a multifunctional model of Aplysia californica feeding that qualitatively reproduces three key feeding behaviors (biting, swallowing, and rejection), demonstrates behavioral switching in response to external sensory cues, and incorporates both known neural connectivity and a simple bioinspired mechanical model of the feeding apparatus. We demonstrate that the model can be used for formulating testable hypotheses and discuss the implications of this approach for robotic control and neuroscience.
Collapse
Affiliation(s)
- Victoria A Webster-Wood
- Department of Mechanical Engineering, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA, 15213, USA.
- Department of Biomedical Engineering, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA, 15213, USA.
- McGowan Institute for Regenerative Medicine, Carnegie Mellon University, 5000 Forbes Ave., Pittsburgh, PA, 15213, USA.
| | - Jeffrey P Gill
- Department of Biology, Case Western Reserve University, 2080 Adelbert Road, Cleveland, OH, 44106-7080, USA
| | - Peter J Thomas
- Department of Mathematics, Applied Mathematics and Statistics, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH, 44106-4901, USA
- Department of Biology, Department of Cognitive Science, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH, 44106-4901, USA
- Department of Electrical Computer and Systems Engineering, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH, 44106-4901, USA
| | - Hillel J Chiel
- Department of Biology, Case Western Reserve University, 2080 Adelbert Road, Cleveland, OH, 44106-7080, USA
- Department of Neurosciences, Case Western Reserve University, 2080 Adelbert Road, Cleveland, OH, 44106-7080, USA
- Department of Biomedical Engineering, Case Western Reserve University, 2080 Adelbert Road, Cleveland, OH, 44106-7080, USA
| |
Collapse
|
9
|
Kogenaru M, Nghe P, Poelwijk FJ, Tans SJ. Predicting Evolutionary Constraints by Identifying Conflicting Demands in Regulatory Networks. Cell Syst 2020; 10:526-534.e3. [PMID: 32553183 DOI: 10.1016/j.cels.2020.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Revised: 03/14/2020] [Accepted: 05/17/2020] [Indexed: 11/17/2022]
Abstract
Gene regulation networks allow organisms to adapt to diverse environmental niches. However, the constraints underlying the evolution of gene regulation remain ill defined. Here, we show that partial order-a concept that ranks network output levels as a function of different input signals-identifies such constraints. We tested our predictions by experimentally evolving an engineered signal-integrating network in multiple environments. We find that populations: (1) expand in fitness space along the Pareto-optimal front associated with conflicts in regulatory demands, by fine-tuning binding affinities within the network, and (2) expand beyond the Pareto-optimal front through changes in the network structure. Our constraint predictions are based only on partial order and do not require information on the network architecture or underlying genetics. Overall, our findings show that limited knowledge of current regulatory phenotypes can provide predictions on future evolutionary constraints.
Collapse
Affiliation(s)
- Manjunatha Kogenaru
- AMOLF, Science Park 104, Amsterdam 1098 XG, the Netherlands; Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - Philippe Nghe
- AMOLF, Science Park 104, Amsterdam 1098 XG, the Netherlands; Laboratoire de Biochimie, UMR CBI 8231, ESPCI Paris - PSL, PSL Research University, 10 rue Vauquelin, Paris 75005, France.
| | - Frank J Poelwijk
- Department of Data Sciences, Dana-Farber Cancer Institute, 360 Brookline Avenue, Boston, MA 02215, USA
| | - Sander J Tans
- AMOLF, Science Park 104, Amsterdam 1098 XG, the Netherlands; Department of Bionanoscience, Kavli Institute of Nanoscience Delft, Delft University of Technology, Van der Maasweg 9, Delft 2629, the Netherlands.
| |
Collapse
|
10
|
Weiß M, Ahnert SE. Using small samples to estimate neutral component size and robustness in the genotype-phenotype map of RNA secondary structure. J R Soc Interface 2020; 17:20190784. [PMID: 32429824 DOI: 10.1098/rsif.2019.0784] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
In genotype-phenotype (GP) maps, the genotypes that map to the same phenotype are usually not randomly distributed across the space of genotypes, but instead are predominantly connected through one-point mutations, forming network components that are commonly referred to as neutral components (NCs). Because of their impact on evolutionary processes, the characteristics of these NCs, like their size or robustness, have been studied extensively. Here, we introduce a framework that allows the estimation of NC size and robustness in the GP map of RNA secondary structure. The advantage of this framework is that it only requires small samples of genotypes and their local environment, which also allows experimental realizations. We verify our framework by applying it to the exhaustively analysable GP map of RNA sequence length L = 15, and benchmark it against an existing method by applying it to longer, naturally occurring functional non-coding RNA sequences. Although it is specific to the RNA secondary structure GP map in the first place, our framework can probably be transferred and adapted to other sequence-to-structure GP maps.
Collapse
Affiliation(s)
- Marcel Weiß
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK.,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| | - Sebastian E Ahnert
- Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK.,Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK
| |
Collapse
|
11
|
Garte S, Albert A. Genotype Components as Predictors of Phenotype in Model Gene Regulatory Networks. Acta Biotheor 2019; 67:299-320. [PMID: 31286303 DOI: 10.1007/s10441-019-09350-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Accepted: 07/04/2019] [Indexed: 10/26/2022]
Abstract
Models of gene regulatory networks (GRN) have proven useful for understanding many aspects of the highly complex behavior of biological control networks. Randomly generated non-Boolean networks were used in experimental simulations to generate data on dynamic phenotypes as a function of several genotypic parameters. We found that predictive relationships between some phenotypes and quantitative genotypic parameters such as number of network genes, interaction density, and initial condition could be derived depending on the strength of the topological (positional) genotype on specific phenotypes. We quantitated the strength of the topological genotype effect (TGE) on a number of phenotypes in multi-gene networks. For phenotypes with a low influence of topological genotype, derived and empirical relationships using quantitative genotype parameters were accurate in phenotypic outcomes. We found a number of dynamic network properties, including oscillation behaviors, that were largely dependent on genotype topology, and for which no such general quantitative relationships were determinable. It remains to be determined if these results are applicable to biological gene regulatory networks.
Collapse
|
12
|
Aguilar‐Rodríguez J, Peel L, Stella M, Wagner A, Payne JL. The architecture of an empirical genotype-phenotype map. Evolution 2018; 72:1242-1260. [PMID: 29676774 PMCID: PMC6055911 DOI: 10.1111/evo.13487] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 04/03/2018] [Indexed: 12/15/2022]
Abstract
Recent advances in high-throughput technologies are bringing the study of empirical genotype-phenotype (GP) maps to the fore. Here, we use data from protein-binding microarrays to study an empirical GP map of transcription factor (TF) -binding preferences. In this map, each genotype is a DNA sequence. The phenotype of this DNA sequence is its ability to bind one or more TFs. We study this GP map using genotype networks, in which nodes represent genotypes with the same phenotype, and edges connect nodes if their genotypes differ by a single small mutation. We describe the structure and arrangement of genotype networks within the space of all possible binding sites for 525 TFs from three eukaryotic species encompassing three kingdoms of life (animal, plant, and fungi). We thus provide a high-resolution depiction of the architecture of an empirical GP map. Among a number of findings, we show that these genotype networks are "small-world" and assortative, and that they ubiquitously overlap and interface with one another. We also use polymorphism data from Arabidopsis thaliana to show how genotype network structure influences the evolution of TF-binding sites in vivo. We discuss our findings in the context of regulatory evolution.
Collapse
Affiliation(s)
- José Aguilar‐Rodríguez
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
- Current Address: Department of Biology, Stanford University, StanfordCA, USA; Department of Chemical and Systems Biology, Stanford UniversityStanfordCAUSA
| | - Leto Peel
- Institute of Information and Communication Technologies, Electronics and Applied MathematicsUniversité Catholique de LouvainLouvain‐la‐NeuveBelgium
- Namur Center for Complex SystemsUniversity of NamurNamurBelgium
| | - Massimo Stella
- Institute for Complex Systems Simulation, Department of Electronics and Computer ScienceUniversity of SouthamptonSouthamptonUnited Kingdom
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental StudiesUniversity of ZurichZurichSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
- The Santa Fe InstituteSanta FeNew MexicoUSA
| | - Joshua L. Payne
- Swiss Institute of BioinformaticsLausanneSwitzerland
- Institute for Integrative Biology, ETHZurichSwitzerland
| |
Collapse
|
13
|
Ahnert SE. Structural properties of genotype-phenotype maps. J R Soc Interface 2018; 14:rsif.2017.0275. [PMID: 28679667 DOI: 10.1098/rsif.2017.0275] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 06/06/2017] [Indexed: 12/21/2022] Open
Abstract
The map between genotype and phenotype is fundamental to biology. Biological information is stored and passed on in the form of genotypes, and expressed in the form of phenotypes. A growing body of literature has examined a wide range of genotype-phenotype (GP) maps and has established a number of properties that appear to be shared by many GP maps. These properties are 'structural' in the sense that they are properties of the distribution of phenotypes across the point-mutation network of genotypes. They include: a redundancy of genotypes, meaning that many genotypes map to the same phenotypes, a highly non-uniform distribution of the number of genotypes per phenotype, a high robustness of phenotypes and the ability to reach a large number of new phenotypes within a small number of mutational steps. A further important property is that the robustness and evolvability of phenotypes are positively correlated. In this review, I give an overview of the study of GP maps with particular emphasis on these structural properties, and discuss a model that attempts to explain why these properties arise, as well as some of the fundamental ways in which the structure of GP maps can affect evolutionary outcomes.
Collapse
Affiliation(s)
- S E Ahnert
- Sainsbury Laboratory, University of Cambridge, Bateman Street, Cambridge CB2 1LR, UK .,Theory of Condensed Matter Group, Cavendish Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge CB3 0HE, UK
| |
Collapse
|
14
|
Jablonski D. Approaches to Macroevolution: 1. General Concepts and Origin of Variation. Evol Biol 2017; 44:427-450. [PMID: 29142333 PMCID: PMC5661017 DOI: 10.1007/s11692-017-9420-0] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Accepted: 05/26/2017] [Indexed: 12/11/2022]
Abstract
Approaches to macroevolution require integration of its two fundamental components, i.e. the origin and the sorting of variation, in a hierarchical framework. Macroevolution occurs in multiple currencies that are only loosely correlated, notably taxonomic diversity, morphological disparity, and functional variety. The origin of variation within this conceptual framework is increasingly understood in developmental terms, with the semi-hierarchical structure of gene regulatory networks (GRNs, used here in a broad sense incorporating not just the genetic circuitry per se but the factors controlling the timing and location of gene expression and repression), the non-linear relation between magnitude of genetic change and the phenotypic results, the evolutionary potential of co-opting existing GRNs, and developmental responsiveness to nongenetic signals (i.e. epigenetics and plasticity), all requiring modification of standard microevolutionary models, and rendering difficult any simple definition of evolutionary novelty. The developmental factors underlying macroevolution create anisotropic probabilities-i.e., an uneven density distribution-of evolutionary change around any given phenotypic starting point, and the potential for coordinated changes among traits that can accommodate change via epigenetic mechanisms. From this standpoint, "punctuated equilibrium" and "phyletic gradualism" simply represent two cells in a matrix of evolutionary models of phenotypic change, and the origin of trends and evolutionary novelty are not simply functions of ecological opportunity. Over long timescales, contingency becomes especially important, and can be viewed in terms of macroevolutionary lags (the temporal separation between the origin of a trait or clade and subsequent diversification); such lags can arise by several mechanisms: as geological or phylogenetic artifacts, or when diversifications require synergistic interactions among traits, or between traits and external events. The temporal and spatial patterns of the origins of evolutionary novelties are a challenge to macroevolutionary theory; individual events can be described retrospectively, but a general model relating development, genetics, and ecology is needed. An accompanying paper (Jablonski in Evol Biol 2017) reviews diversity dynamics and the sorting of variation, with some general conclusions.
Collapse
Affiliation(s)
- David Jablonski
- Department of Geophysical Sciences, University of Chicago, 5734 South Ellis Avenue, Chicago, IL 60637 USA
| |
Collapse
|
15
|
Jiménez A, Cotterell J, Munteanu A, Sharpe J. A spectrum of modularity in multi-functional gene circuits. Mol Syst Biol 2017; 13:925. [PMID: 28455348 PMCID: PMC5408781 DOI: 10.15252/msb.20167347] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
A major challenge in systems biology is to understand the relationship between a circuit's structure and its function, but how is this relationship affected if the circuit must perform multiple distinct functions within the same organism? In particular, to what extent do multi‐functional circuits contain modules which reflect the different functions? Here, we computationally survey a range of bi‐functional circuits which show no simple structural modularity: They can switch between two qualitatively distinct functions, while both functions depend on all genes of the circuit. Our analysis reveals two distinct classes: hybrid circuits which overlay two simpler mono‐functional sub‐circuits within their circuitry, and emergent circuits, which do not. In this second class, the bi‐functionality emerges from more complex designs which are not fully decomposable into distinct modules and are consequently less intuitive to predict or understand. These non‐intuitive emergent circuits are just as robust as their hybrid counterparts, and we therefore suggest that the common bias toward studying modular systems may hinder our understanding of real biological circuits.
Collapse
Affiliation(s)
- Alba Jiménez
- EMBL-CRG Systems Biology Research Unit, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - James Cotterell
- EMBL-CRG Systems Biology Research Unit, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Andreea Munteanu
- EMBL-CRG Systems Biology Research Unit, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - James Sharpe
- EMBL-CRG Systems Biology Research Unit, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain .,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| |
Collapse
|
16
|
Letsou W, Cai L. Noncommutative Biology: Sequential Regulation of Complex Networks. PLoS Comput Biol 2016; 12:e1005089. [PMID: 27560383 PMCID: PMC4999240 DOI: 10.1371/journal.pcbi.1005089] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Accepted: 07/28/2016] [Indexed: 12/21/2022] Open
Abstract
Single-cell variability in gene expression is important for generating distinct cell types, but it is unclear how cells use the same set of regulatory molecules to specifically control similarly regulated genes. While combinatorial binding of transcription factors at promoters has been proposed as a solution for cell-type specific gene expression, we found that such models resulted in substantial information bottlenecks. We sought to understand the consequences of adopting sequential logic wherein the time-ordering of factors informs the final outcome. We showed that with noncommutative control, it is possible to independently control targets that would otherwise be activated simultaneously using combinatorial logic. Consequently, sequential logic overcomes the information bottleneck inherent in complex networks. We derived scaling laws for two noncommutative models of regulation, motivated by phosphorylation/neural networks and chromosome folding, respectively, and showed that they scale super-exponentially in the number of regulators. We also showed that specificity in control is robust to the loss of a regulator. Lastly, we connected these theoretical results to real biological networks that demonstrate specificity in the context of promiscuity. These results show that achieving a desired outcome often necessitates roundabout steps. DNA is the blueprint of life. Yet the order in which a cell follows these instructions makes it capable of generating thousands of different fates. How this information is extracted from underlying gene regulatory networks is unclear, especially given that biological networks are highly interconnected, and that the number of signaling pathways is relatively small (approximately 5–10). The conventional approach for increasing the information capacity of a limited set of regulators is to use them in combination. Surprisingly, combinatorial logic does not increase the diversity of target configurations or cell fates, but instead causes information bottlenecks. A different approach, called sequential logic, uses noncommutative sequences of a small set of regulators to drive networks to a large number of novel configurations. If certain targets are first protected, then even promiscuous regulators can activate specific subsets of lineage-specific targets. In this paper we show how sequential logic outperforms combinatorial logic, and argue that noncommutative sequences underlie a number of cases of biological regulation, e.g. how a small number of signaling pathways generates a large diversity of cell types in development. In addition to explaining biological networks, sequential logic may be a general experimental design strategy in synthetic and single-cell biology.
Collapse
Affiliation(s)
- William Letsou
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Long Cai
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|
17
|
Khalid F, Aguilar-Rodríguez J, Wagner A, Payne JL. Genonets server-a web server for the construction, analysis and visualization of genotype networks. Nucleic Acids Res 2016; 44:W70-6. [PMID: 27106055 PMCID: PMC4987894 DOI: 10.1093/nar/gkw313] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 04/13/2016] [Indexed: 12/16/2022] Open
Abstract
A genotype network is a graph in which vertices represent genotypes that have the same phenotype. Edges connect vertices if their corresponding genotypes differ in a single small mutation. Genotype networks are used to study the organization of genotype spaces. They have shed light on the relationship between robustness and evolvability in biological systems as different as RNA macromolecules and transcriptional regulatory circuits. Despite the importance of genotype networks, no tool exists for their automatic construction, analysis and visualization. Here we fill this gap by presenting the Genonets Server, a tool that provides the following features: (i) the construction of genotype networks for categorical and univariate phenotypes from DNA, RNA, amino acid or binary sequences; (ii) analyses of genotype network topology and how it relates to robustness and evolvability, as well as analyses of genotype network topography and how it relates to the navigability of a genotype network via mutation and natural selection; (iii) multiple interactive visualizations that facilitate exploratory research and education. The Genonets Server is freely available at http://ieu-genonets.uzh.ch.
Collapse
Affiliation(s)
- Fahad Khalid
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
| | - José Aguilar-Rodríguez
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| | - Joshua L Payne
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Genopode, 1015 Lausanne, Switzerland
| |
Collapse
|
18
|
Boolean Models of Biological Processes Explain Cascade-Like Behavior. Sci Rep 2016; 7:20067. [PMID: 26821940 PMCID: PMC4731822 DOI: 10.1038/srep20067] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 12/08/2015] [Indexed: 11/09/2022] Open
Abstract
Biological networks play a key role in determining biological function and therefore, an understanding of their structure and dynamics is of central interest in systems biology. In Boolean models of such networks, the status of each molecule is either “on” or “off” and along with the molecules interact with each other, their individual status changes from “on” to “off” or vice-versa and the system of molecules in the network collectively go through a sequence of changes in state. This sequence of changes is termed a biological process. In this paper, we examine the common perception that events in biomolecular networks occur sequentially, in a cascade-like manner, and ask whether this is likely to be an inherent property. In further investigations of the budding and fission yeast cell-cycle, we identify two generic dynamical rules. A Boolean system that complies with these rules will automatically have a certain robustness. By considering the biological requirements in robustness and designability, we show that those Boolean dynamical systems, compared to an arbitrary dynamical system, statistically present the characteristics of cascadeness and sequentiality, as observed in the budding and fission yeast cell- cycle. These results suggest that cascade-like behavior might be an intrinsic property of biological processes.
Collapse
|
19
|
Pritykin Y, Ghersi D, Singh M. Genome-Wide Detection and Analysis of Multifunctional Genes. PLoS Comput Biol 2015; 11:e1004467. [PMID: 26436655 PMCID: PMC4593560 DOI: 10.1371/journal.pcbi.1004467] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 07/19/2015] [Indexed: 12/25/2022] Open
Abstract
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. Almost every aspect of cellular function depends on protein activity. In spite of being fine-tuned to carry out highly specific functions, proteins can also multitask. Experimental studies have identified genes and proteins endowed with more than one molecular function, or participating in very different biological processes. These studies suggest that the degree of functional plasticity exhibited by proteins might go well beyond a simple “one protein—one function” relationship. However, systematic studies of the properties of multifunctional genes (and their encoded proteins) have been limited. Here we present a computational framework to identify putative multifunctional genes, and compare their properties with those of other genes. We find that multifunctional genes are significantly different from other genes with respect to their physicochemical properties, expression profiles, and interaction properties. We also observe that multifunctional genes tend to be more conserved, and that a greater fraction of them are associated with human disorders. Taken together, these results represent a step towards a more complete understanding of the role multifunctional genes play in the functional organization of the cell.
Collapse
Affiliation(s)
- Yuri Pritykin
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Dario Ghersi
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- School of Interdisciplinary Informatics, University of Nebraska at Omaha, Omaha, Nebraska, United States of America
- * E-mail: (DG); (MS)
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- * E-mail: (DG); (MS)
| |
Collapse
|
20
|
Function does not follow form in gene regulatory circuits. Sci Rep 2015; 5:13015. [PMID: 26290154 PMCID: PMC4542331 DOI: 10.1038/srep13015] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 07/06/2015] [Indexed: 11/08/2022] Open
Abstract
Gene regulatory circuits are to the cell what arithmetic logic units are to the chip: fundamental components of information processing that map an input onto an output. Gene regulatory circuits come in many different forms, distinct structural configurations that determine who regulates whom. Studies that have focused on the gene expression patterns (functions) of circuits with a given structure (form) have examined just a few structures or gene expression patterns. Here, we use a computational model to exhaustively characterize the gene expression patterns of nearly 17 million three-gene circuits in order to systematically explore the relationship between circuit form and function. Three main conclusions emerge. First, function does not follow form. A circuit of any one structure can have between twelve and nearly thirty thousand distinct gene expression patterns. Second, and conversely, form does not follow function. Most gene expression patterns can be realized by more than one circuit structure. And third, multifunctionality severely constrains circuit form. The number of circuit structures able to drive multiple gene expression patterns decreases rapidly with the number of these patterns. These results indicate that it is generally not possible to infer circuit function from circuit form, or vice versa.
Collapse
|
21
|
Intersecting transcription networks constrain gene regulatory evolution. Nature 2015; 523:361-5. [PMID: 26153861 PMCID: PMC4531262 DOI: 10.1038/nature14613] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 06/04/2015] [Indexed: 12/21/2022]
Abstract
Epistasis—the non-additive interactions between different genetic loci—constrains evolutionary pathways, blocking some and permitting others1–8. For biological networks such as transcription circuits, the nature of these constraints and their consequences are largely unknown. Here we describe the evolutionary pathways of a transcription network that controls the response to mating pheromone in yeasts9. A component of this network, the transcription regulator Ste12, has evolved two different modes of binding to a set of its target genes. In one group of species, Ste12 binds to specific DNA binding sites, while in another lineage it occupies DNA indirectly, relying on a second transcription regulator to recognize DNA. We show, through the construction of various possible evolutionary intermediates, that evolution of the direct mode of DNA binding was not directly accessible to the ancestor. Instead, it was contingent on a lineage-specific change to an overlapping transcription network with a different function, the specification of cell type. These results show that analyzing and predicting the evolution of cis-regulatory regions requires an understanding of their positions in overlapping networks, as this placement constrains the available evolutionary pathways.
Collapse
|
22
|
Albert R, Thakar J. Boolean modeling: a logic-based dynamic approach for understanding signaling and regulatory networks and for making useful predictions. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2015; 6:353-69. [PMID: 25269159 DOI: 10.1002/wsbm.1273] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The biomolecules inside or near cells form a complex interacting system. Cellular phenotypes and behaviors arise from the totality of interactions among the components of this system. A fruitful way of modeling interacting biomolecular systems is by network-based dynamic models that characterize each component by a state variable, and describe the change in the state variables due to the interactions in the system. Dynamic models can capture the stable state patterns of this interacting system and can connect them to different cell fates or behaviors. A Boolean or logic model characterizes each biomolecule by a binary state variable that relates the abundance of that molecule to a threshold abundance necessary for downstream processes. The regulation of this state variable is described in a parameter free manner, making Boolean modeling a practical choice for systems whose kinetic parameters have not been determined. Boolean models integrate the body of knowledge regarding the components and interactions of biomolecular systems, and capture the system's dynamic repertoire, for example the existence of multiple cell fates. These models were used for a variety of systems and led to important insights and predictions. Boolean models serve as an efficient exploratory model, a guide for follow-up experiments, and as a foundation for more quantitative models.
Collapse
|
23
|
Emmrich PMF, Roberts HE, Pancaldi V. A Boolean gene regulatory model of heterosis and speciation. BMC Evol Biol 2015; 15:24. [PMID: 25888139 PMCID: PMC4349475 DOI: 10.1186/s12862-015-0298-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2014] [Accepted: 01/27/2015] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Modelling genetic phenomena affecting biological traits is important for the development of agriculture as it allows breeders to predict the potential of breeding for certain traits. One such phenomenon is heterosis or hybrid vigor: crossing individuals from genetically distinct populations often results in improvements in quantitative traits, such as growth rate, biomass production and stress resistance. Heterosis has become a very useful tool in global agriculture, but its genetic basis remains controversial and its effects hard to predict. We have taken a computational approach to studying heterosis, developing a simulation of evolution, independent reassortment of alleles and hybridization of Gene Regulatory Networks (GRNs) in a Boolean framework. These artificial regulatory networks exhibit topological properties that reflect those observed in biology, and fitness is measured as the ability of a network to respond to external inputs in a pre-defined way. RESULTS Our model reproduced common experimental observations on heterosis using only biologically justified parameters, such as mutation rates. Hybrid vigor was observed and its extent was seen to increase as parental populations diverged, up until a point of sudden collapse of hybrid fitness. Thus, the model also describes a process akin to speciation due to genetic incompatibility of the separated populations. We also reproduce, for the first time in a model, the fact that hybrid vigor cannot easily be fixed by within a breeding line, currently an important limitation of the use of hybrid crops. The simulation allowed us to study the effects of three standard models for the genetic basis of heterosis: dominance, over-dominance, and epistasis. CONCLUSION This study describes the most detailed simulation of heterosis using gene regulatory networks to date and reproduces several phenomena associated with heterosis for the first time in a model. The level of detail in our model allows us to suggest possible warning signs of the impending collapse of hybrid vigor in breeding. In addition, the simulation provides a framework that can be extended to study other aspects of heterosis and alternative evolutionary scenarios.
Collapse
Affiliation(s)
- Peter Martin Ferdinand Emmrich
- Department of Plant Sciences, University of Cambridge, CB2 3EA, Cambridge, UK.
- Current address: John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK.
| | - Hannah Elizabeth Roberts
- Department of Plant Sciences, University of Cambridge, CB2 3EA, Cambridge, UK.
- Current address: The Nuffield Department of Clinical Medicine, Oxford University, Peter Medawar Building for Pathogen Research, Oxford, OX1 3SY, UK.
| | - Vera Pancaldi
- Department of Plant Sciences, University of Cambridge, CB2 3EA, Cambridge, UK.
- Current address: Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernández Almagro, 3, Madrid, E-28029, Spain.
| |
Collapse
|
24
|
Pechenick DA, Payne JL, Moore JH. Phenotypic robustness and the assortativity signature of human transcription factor networks. PLoS Comput Biol 2014; 10:e1003780. [PMID: 25121490 PMCID: PMC4133045 DOI: 10.1371/journal.pcbi.1003780] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Accepted: 06/30/2014] [Indexed: 11/21/2022] Open
Abstract
Many developmental, physiological, and behavioral processes depend on the precise expression of genes in space and time. Such spatiotemporal gene expression phenotypes arise from the binding of sequence-specific transcription factors (TFs) to DNA, and from the regulation of nearby genes that such binding causes. These nearby genes may themselves encode TFs, giving rise to a transcription factor network (TFN), wherein nodes represent TFs and directed edges denote regulatory interactions between TFs. Computational studies have linked several topological properties of TFNs — such as their degree distribution — with the robustness of a TFN's gene expression phenotype to genetic and environmental perturbation. Another important topological property is assortativity, which measures the tendency of nodes with similar numbers of edges to connect. In directed networks, assortativity comprises four distinct components that collectively form an assortativity signature. We know very little about how a TFN's assortativity signature affects the robustness of its gene expression phenotype to perturbation. While recent theoretical results suggest that increasing one specific component of a TFN's assortativity signature leads to increased phenotypic robustness, the biological context of this finding is currently limited because the assortativity signatures of real-world TFNs have not been characterized. It is therefore unclear whether these earlier theoretical findings are biologically relevant. Moreover, it is not known how the other three components of the assortativity signature contribute to the phenotypic robustness of TFNs. Here, we use publicly available DNaseI-seq data to measure the assortativity signatures of genome-wide TFNs in 41 distinct human cell and tissue types. We find that all TFNs share a common assortativity signature and that this signature confers phenotypic robustness to model TFNs. Lastly, we determine the extent to which each of the four components of the assortativity signature contributes to this robustness. The cells of living organisms do not concurrently express their entire complement of genes. Instead, they regulate their gene expression, and one consequence of this is the potential for different cells to adopt different stable gene expression patterns. For example, the development of an embryo necessitates that cells alter their gene expression patterns in order to differentiate. These gene expression phenotypes are largely robust to genetic mutation, and one source of this robustness may reside in the network structure of interacting molecules that underlie genetic regulation. Theoretical studies of regulatory networks have linked network structure to robustness; however, it is also necessary to more extensively characterize real-world regulatory networks in order to understand which structural properties may be biologically meaningful. We recently used theoretical models to show that a particular structural property, degree assortativity, is linked to robustness. Here, we measure the assortativity of human regulatory networks in 41 distinct cell and tissue types. We then develop a theoretical framework to explore how this structural property affects robustness, and we find that the gene expression phenotypes of human regulatory networks are more robust than expected by chance alone.
Collapse
Affiliation(s)
- Dov A. Pechenick
- Computational Genetics Laboratory, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Joshua L. Payne
- Institute of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Jason H. Moore
- Computational Genetics Laboratory, Dartmouth College, Hanover, New Hampshire, United States of America
- * E-mail:
| |
Collapse
|
25
|
Sucena É, Vanderberghe K, Zhurov V, Grbić M. Reversion of developmental mode in insects: evolution from long germband to short germband in the polyembrionic wasp Macrocentrus cingulum Brischke. Evol Dev 2014; 16:233-46. [PMID: 24981069 DOI: 10.1111/ede.12086] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Germband size in insects has played a central role in our understanding of insect patterning mechanisms and their evolution. The polarity of evolutionary change in insect patterning has been viewed so far as the unidirectional shift from the ancestral short germband patterning of basal hemimetabolous insects to the long germband patterning observed in most modern Holometabola. However, some orders of holometabolic insects display both short and long germband development, though the absence of a clear phylogenetic context does not permit definite conclusions on the polarity of change. Derived hymenoptera, that is, bees and wasps, represent a classical textbook example of long germband development. Yet, in some wasps putative short germband development has been described correlating with lifestyle changes, namely with evolution of endoparasitism and polyembryony. To address the potential reversion from long to short germband, we focused on the family Braconidae, which displays ancestral long germband development, and examined the derived polyembryonic braconid Macrocentrus cingulum. Using SEM analysis of M. cingulum embryogenesis coupled with analyses of embryonic patterning markers, we show that this wasp evolved short germband embryogenesis secondarily, in a way that is reminiscent of embryogenesis in the beetle Tribolium castaneum. This work shows that the evolution of germband size in insects is a reversible process that may correlate with other life-history traits and suggests broader implications on the mechanisms and evolvability of insect development.
Collapse
Affiliation(s)
- Élio Sucena
- Instituto Gulbenkian de Ciência, Apartado 14, 2781-901, Oeiras, Portugal; Universidade de Lisboa, Faculdade de Ciências, Departamento de Biologia Animal, edifício C2, Campo Grande, 1749-016, Lisboa, Portugal
| | | | | | | |
Collapse
|
26
|
Payne JL, Wagner A. Latent phenotypes pervade gene regulatory circuits. BMC SYSTEMS BIOLOGY 2014; 8:64. [PMID: 24884746 PMCID: PMC4061115 DOI: 10.1186/1752-0509-8-64] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Accepted: 05/12/2014] [Indexed: 12/22/2022]
Abstract
BACKGROUND Latent phenotypes are non-adaptive byproducts of adaptive phenotypes. They exist in biological systems as different as promiscuous enzymes and genome-scale metabolic reaction networks, and can give rise to evolutionary adaptations and innovations. We know little about their prevalence in the gene expression phenotypes of regulatory circuits, important sources of evolutionary innovations. RESULTS Here, we study a space of more than sixteen million three-gene model regulatory circuits, where each circuit is represented by a genotype, and has one or more functions embodied in one or more gene expression phenotypes. We find that the majority of circuits with single functions have latent expression phenotypes. Moreover, the set of circuits with a given spectrum of functions has a repertoire of latent phenotypes that is much larger than that of any one circuit. Most of this latent repertoire can be easily accessed through a series of small genetic changes that preserve a circuit's main functions. Both circuits and gene expression phenotypes that are robust to genetic change are associated with a greater number of latent phenotypes. CONCLUSIONS Our observations suggest that latent phenotypes are pervasive in regulatory circuits, and may thus be an important source of evolutionary adaptations and innovations involving gene regulation.
Collapse
|
27
|
Gutiérrez J, Maere S. Modeling the evolution of molecular systems from a mechanistic perspective. TRENDS IN PLANT SCIENCE 2014; 19:292-303. [PMID: 24709144 DOI: 10.1016/j.tplants.2014.03.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Revised: 03/09/2014] [Accepted: 03/11/2014] [Indexed: 06/03/2023]
Abstract
Systems biology-inspired genotype-phenotype mapping models are increasingly being used to study the evolutionary properties of molecular biological systems, in particular the general emergent properties of evolving systems, such as modularity, robustness, and evolvability. However, the level of abstraction at which many of these models operate might not be sufficient to capture all relevant intricacies of biological evolution in sufficient detail. Here, we argue that in particular gene and genome duplications, both evolutionary mechanisms of potentially major importance for the evolution of molecular systems and of special relevance to plant evolution, are not adequately accounted for in most GPM modeling frameworks, and that more fine-grained mechanistic models may significantly advance understanding of how gen(om)e duplication impacts molecular systems evolution.
Collapse
Affiliation(s)
- Jayson Gutiérrez
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Steven Maere
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.
| |
Collapse
|
28
|
Barve A, Hosseini SR, Martin OC, Wagner A. Historical contingency and the gradual evolution of metabolic properties in central carbon and genome-scale metabolisms. BMC SYSTEMS BIOLOGY 2014; 8:48. [PMID: 24758311 PMCID: PMC4022055 DOI: 10.1186/1752-0509-8-48] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2014] [Accepted: 04/15/2014] [Indexed: 11/13/2022]
Abstract
BACKGROUND A metabolism can evolve through changes in its biochemical reactions that are caused by processes such as horizontal gene transfer and gene deletion. While such changes need to preserve an organism's viability in its environment, they can modify other important properties, such as a metabolism's maximal biomass synthesis rate and its robustness to genetic and environmental change. Whether such properties can be modulated in evolution depends on whether all or most viable metabolisms - those that can synthesize all essential biomass precursors - are connected in a space of all possible metabolisms. Connectedness means that any two viable metabolisms can be converted into one another through a sequence of single reaction changes that leave viability intact. If the set of viable metabolisms is disconnected and highly fragmented, then historical contingency becomes important and restricts the alteration of metabolic properties, as well as the number of novel metabolic phenotypes accessible in evolution. RESULTS We here computationally explore two vast spaces of possible metabolisms to ask whether viable metabolisms are connected. We find that for all but the simplest metabolisms, most viable metabolisms can be transformed into one another by single viability-preserving reaction changes. Where this is not the case, alternative essential metabolic pathways consisting of multiple reactions are responsible, but such pathways are not common. CONCLUSIONS Metabolism is thus highly evolvable, in the sense that its properties could be fine-tuned by successively altering individual reactions. Historical contingency does not strongly restrict the origin of novel metabolic phenotypes.
Collapse
Affiliation(s)
- Aditya Barve
- Institute of Evolutionary Biology and Environmental Sciences, University of Zurich, Bldg. Y27, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- The Swiss Institute of Bioinformatics, Bioinformatics, Quartier Sorge, Batiment Genopode, 1015 Lausanne, Switzerland
| | - Sayed-Rzgar Hosseini
- Institute of Evolutionary Biology and Environmental Sciences, University of Zurich, Bldg. Y27, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- The Swiss Institute of Bioinformatics, Bioinformatics, Quartier Sorge, Batiment Genopode, 1015 Lausanne, Switzerland
- Computational Biology and Bioinformatics Master’s Program, Department of Computer Science, ETH Zurich, Universitätsstrasse. 6, CH-8092 Zurich, Switzerland
| | - Olivier C Martin
- INRA, UMR 0320/UMR 8120 Génétique Végétale, Univ Paris-Sud, F-91190 Gif-sur-Yvette, France
| | - Andreas Wagner
- Institute of Evolutionary Biology and Environmental Sciences, University of Zurich, Bldg. Y27, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- The Swiss Institute of Bioinformatics, Bioinformatics, Quartier Sorge, Batiment Genopode, 1015 Lausanne, Switzerland
- The Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| |
Collapse
|