1
|
Murrugarra D, Veliz-Cuba A, Dimitrova E, Kadelka C, Wheeler M, Laubenbacher R. Modular Control of Biological Networks. ARXIV 2024:arXiv:2401.12477v2. [PMID: 38344220 PMCID: PMC10854280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/17/2024]
Abstract
The concept of control is central to understanding and applications of biological network models. Some of their key structural features relate to control functions, through gene regulation, signaling, or metabolic mechanisms, and computational models need to encode these. Applications of models often focus on model-based control, such as in biomedicine or metabolic engineering. This paper presents an approach to model-based control that exploits two common features of biological networks, namely their modular structure and canalizing features of their regulatory mechanisms. The paper focuses on intracellular regulatory networks, represented by Boolean network models. A main result of this paper is that control strategies can be identified by focusing on one module at a time. This paper also presents a criterion based on canalizing features of the regulatory rules to identify modules that do not contribute to network control and can be excluded. For even moderately sized networks, finding global control inputs is computationally very challenging. The modular approach presented here leads to a highly efficient approach to solving this problem. This approach is applied to a published Boolean network model of blood cancer large granular lymphocyte (T-LGL) leukemia to identify a minimal control set that achieves a desired control objective.
Collapse
Affiliation(s)
- David Murrugarra
- Department of Mathematics, University of Kentucky, Lexington, KY 40506, USA
| | - Alan Veliz-Cuba
- Department of Mathematics, University of Dayton, Dayton, OH 45469, USA
| | - Elena Dimitrova
- Mathematics Department, California Polytechnic State University, San Luis Obispo, CA 93407, USA
| | - Claus Kadelka
- Department of Mathematics, Iowa State University, Ames, IA 50011, USA
| | - Matthew Wheeler
- Department of Medicine, University of Florida, Gainesville, FL 32610, USA
| | | |
Collapse
|
2
|
Kadelka C, Murrugarra D. Canalization reduces the nonlinearity of regulation in biological networks. NPJ Syst Biol Appl 2024; 10:67. [PMID: 38871768 DOI: 10.1038/s41540-024-00392-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 05/31/2024] [Indexed: 06/15/2024] Open
Abstract
Biological networks, such as gene regulatory networks, possess desirable properties. They are more robust and controllable than random networks. This motivates the search for structural and dynamical features that evolution has incorporated into biological networks. A recent meta-analysis of published, expert-curated Boolean biological network models has revealed several such features, often referred to as design principles. Among others, the biological networks are enriched for certain recurring network motifs, the dynamic update rules are more redundant, more biased, and more canalizing than expected, and the dynamics of biological networks are better approximable by linear and lower-order approximations than those of comparable random networks. Since most of these features are interrelated, it is paramount to disentangle cause and effect, that is, to understand which features evolution actively selects for, and thus truly constitute evolutionary design principles. Here, we compare published Boolean biological network models with different ensembles of null models and show that the abundance of canalization in biological networks can almost completely explain their recently postulated high approximability. Moreover, an analysis of random N-K Kauffman models reveals a strong dependence of approximability on the dynamical robustness of a network.
Collapse
Affiliation(s)
- Claus Kadelka
- Department of Mathematics, Iowa State University, 411 Morrill Rd, Ames, 50011, IA, USA.
| | - David Murrugarra
- Department of Mathematics, University of Kentucky, 719 Patterson Office Tower, Lexington, 40506, KY, USA
| |
Collapse
|
3
|
Zhang D, Gao S, Liu ZP, Gao R. LogicGep: Boolean networks inference using symbolic regression from time-series transcriptomic profiling data. Brief Bioinform 2024; 25:bbae286. [PMID: 38886006 PMCID: PMC11182660 DOI: 10.1093/bib/bbae286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 05/09/2024] [Accepted: 06/06/2024] [Indexed: 06/20/2024] Open
Abstract
Reconstructing the topology of gene regulatory network from gene expression data has been extensively studied. With the abundance functional transcriptomic data available, it is now feasible to systematically decipher regulatory interaction dynamics in a logic form such as a Boolean network (BN) framework, which qualitatively indicates how multiple regulators aggregated to affect a common target gene. However, inferring both the network topology and gene interaction dynamics simultaneously is still a challenging problem since gene expression data are typically noisy and data discretization is prone to information loss. We propose a new method for BN inference from time-series transcriptional profiles, called LogicGep. LogicGep formulates the identification of Boolean functions as a symbolic regression problem that learns the Boolean function expression and solve it efficiently through multi-objective optimization using an improved gene expression programming algorithm. To avoid overly emphasizing dynamic characteristics at the expense of topology structure ones, as traditional methods often do, a set of promising Boolean formulas for each target gene is evolved firstly, and a feed-forward neural network trained with continuous expression data is subsequently employed to pick out the final solution. We validated the efficacy of LogicGep using multiple datasets including both synthetic and real-world experimental data. The results elucidate that LogicGep adeptly infers accurate BN models, outperforming other representative BN inference algorithms in both network topology reconstruction and the identification of Boolean functions. Moreover, the execution of LogicGep is hundreds of times faster than other methods, especially in the case of large network inference.
Collapse
Affiliation(s)
- Dezhen Zhang
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| | - Shuhua Gao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| | - Zhi-Ping Liu
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| | - Rui Gao
- Center of Intelligent Medicine, School of Control Science and Engineering, Shandong University, Jinan, Shandong 250061, China
| |
Collapse
|
4
|
Kadelka C, Wheeler M, Veliz-Cuba A, Murrugarra D, Laubenbacher R. Modularity of biological systems: a link between structure and function. J R Soc Interface 2023; 20:20230505. [PMID: 37876275 PMCID: PMC10598444 DOI: 10.1098/rsif.2023.0505] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 10/05/2023] [Indexed: 10/26/2023] Open
Abstract
This paper addresses two topics in systems biology, the hypothesis that biological systems are modular and the problem of relating structure and function of biological systems. The focus here is on gene regulatory networks, represented by Boolean network models, a commonly used tool. Most of the research on gene regulatory network modularity has focused on network structure, typically represented through either directed or undirected graphs. But since gene regulation is a highly dynamic process as it determines the function of cells over time, it is natural to consider functional modularity as well. One of the main results is that the structural decomposition of a network into modules induces an analogous decomposition of the dynamic structure, exhibiting a strong relationship between network structure and function. An extensive simulation study provides evidence for the hypothesis that modularity might have evolved to increase phenotypic complexity while maintaining maximal dynamic robustness to external perturbations.
Collapse
Affiliation(s)
- Claus Kadelka
- Department of Mathematics, Iowa State University, Ames, IA, USA
| | - Matthew Wheeler
- Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Alan Veliz-Cuba
- Department of Mathematics, University of Dayton, Dayton, OH, USA
| | - David Murrugarra
- Department of Mathematics, University of Kentucky, Lexington, KY, USA
| | | |
Collapse
|
5
|
Plaugher D, Murrugarra D. Phenotype Control techniques for Boolean gene regulatory networks. Bull Math Biol 2023; 85:89. [PMID: 37646851 PMCID: PMC10542862 DOI: 10.1007/s11538-023-01197-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 08/11/2023] [Indexed: 09/01/2023]
Abstract
Modeling cell signal transduction pathways via Boolean networks (BNs) has become an established method for analyzing intracellular communications over the last few decades. What's more, BNs provide a course-grained approach, not only to understanding molecular communications, but also for targeting pathway components that alter the long-term outcomes of the system. This has come to be known as phenotype control theory. In this review we study the interplay of various approaches for controlling gene regulatory networks such as: algebraic methods, control kernel, feedback vertex set, and stable motifs. The study will also include comparative discussion between the methods, using an established cancer model of T-Cell Large Granular Lymphocyte Leukemia. Further, we explore possible options for making the control search more efficient using reduction and modularity. Finally, we will include challenges presented such as the complexity and the availability of software for implementing each of these control techniques.
Collapse
Affiliation(s)
- Daniel Plaugher
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY, USA.
| | - David Murrugarra
- Department of Mathematics, University of Kentucky, Lexington, KY, USA
| |
Collapse
|
6
|
Plaugher D, Murrugarra D. Phenotype control techniques for Boolean gene regulatory networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.17.537158. [PMID: 37131770 PMCID: PMC10153207 DOI: 10.1101/2023.04.17.537158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Modeling cell signal transduction pathways via Boolean networks (BNs) has become an established method for analyzing intracellular communications over the last few decades. What’s more, BNs provide a course-grained approach, not only to understanding molecular communications, but also for targeting pathway components that alter the long-term outcomes of the system. This has come to be known as phenotype control theory . In this review we study the interplay of various approaches for controlling gene regulatory networks such as: algebraic methods, control kernel, feedback vertex set, and stable motifs. The study will also include comparative discussion between the methods, using an established cancer model of T-Cell Large Granular Lymphocyte (T-LGL) Leukemia. Further, we explore possible options for making the control search more efficient using reduction and modularity. Finally, we will include challenges presented such as the complexity and the availability of software for implementing each of these control techniques.
Collapse
Affiliation(s)
- Daniel Plaugher
- Department of Toxicology and Cancer Biology, University of Kentucky
| | | |
Collapse
|
7
|
Manicka S, Johnson K, Levin M, Murrugarra D. The nonlinearity of regulation in biological networks. NPJ Syst Biol Appl 2023; 9:10. [PMID: 37015937 PMCID: PMC10073134 DOI: 10.1038/s41540-023-00273-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 03/23/2023] [Indexed: 04/06/2023] Open
Abstract
The extent to which the components of a biological system are (non)linearly regulated determines how amenable they are to therapy and control. To better understand this property termed "regulatory nonlinearity", we analyzed a suite of 137 published Boolean network models, containing a variety of complex nonlinear regulatory interactions, using a probabilistic generalization of Boolean logic that George Boole himself had proposed. Leveraging the continuous-nature of this formulation, we used Taylor decomposition to approximate the models with various levels of regulatory nonlinearity. A comparison of the resulting series of approximations of the biological models with appropriate random ensembles revealed that biological regulation tends to be less nonlinear than expected, meaning that higher-order interactions among the regulatory inputs tend to be less pronounced. A further categorical analysis of the biological models revealed that the regulatory nonlinearity of cancer and disease networks could not only be sometimes higher than expected but also be relatively more variable. We show that this variation is caused by differences in the apportioning of information among the various orders of regulatory nonlinearity. Our results suggest that there may have been a weak but discernible selection pressure for biological systems to evolve linear regulation on average, but for certain systems such as cancer, on the other hand, to simultaneously evolve more nonlinear rules.
Collapse
Affiliation(s)
- Santosh Manicka
- Department of Biology, Tufts University, Medford, MA, 02155, USA
| | - Kathleen Johnson
- Department of Mathematics, University of Kentucky, Lexington, KY, 40506, USA
| | - Michael Levin
- Department of Biology, Tufts University, Medford, MA, 02155, USA
| | - David Murrugarra
- Department of Mathematics, University of Kentucky, Lexington, KY, 40506, USA.
| |
Collapse
|
8
|
Temporary and permanent control of partially specified Boolean networks. Biosystems 2023; 223:104795. [PMID: 36377120 DOI: 10.1016/j.biosystems.2022.104795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 10/16/2022] [Accepted: 10/19/2022] [Indexed: 01/11/2023]
Abstract
Boolean networks (BNs) are a well-accepted modelling formalism in computational systems biology. Nevertheless, modellers often cannot identify only a single BN that matches the biological reality. The typical reasons for this is insufficient knowledge or a lack of experimental data. Formally, this uncertainty can be expressed using partially specified Boolean networks (PSBNs), which encode the wide range of network candidates into a single structure. In this paper, we target the control of PSBNs. The goal of BN control is to find perturbations which guarantee stabilisation of the system in the desired state. Specifically, we consider variable perturbations (gene knock-out and over-expression) with three types of application time-window: one-step, temporary, and permanent. While the control of fully specified BNs is a thoroughly explored topic, control of PSBNs introduces additional challenges that we address in this paper. In particular, the unspecified components of the model cause a significant amount of additional state space explosion. To address this issue, we propose a fully symbolic methodology that can represent the numerous system variants in a compact form. In fully specified models, the efficiency of a perturbation is characterised by the count of perturbed variables (the perturbation size). However, in the case of a PSBN, a perturbation might work only for a subset of concrete BN models. To that end, we introduce and quantify perturbation robustness. This metric characterises how efficient the given perturbation is with respect to the model uncertainty. Finally, we evaluate the novel control methods using non-trivial real-world PSBN models. We inspect the method's scalability and efficiency with respect to the size of the state space and the number of unspecified components. We also compare the robustness metrics for all three perturbation types. Our experiments support the hypothesis that one-step perturbations are significantly less robust than temporary and permanent ones.
Collapse
|
9
|
Parmer T, Rocha LM, Radicchi F. Influence maximization in Boolean networks. Nat Commun 2022; 13:3457. [PMID: 35710639 PMCID: PMC9203747 DOI: 10.1038/s41467-022-31066-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 06/01/2022] [Indexed: 11/30/2022] Open
Abstract
The optimization problem aiming at the identification of minimal sets of nodes able to drive the dynamics of Boolean networks toward desired long-term behaviors is central for some applications, as for example the detection of key therapeutic targets to control pathways in models of biological signaling and regulatory networks. Here, we develop a method to solve such an optimization problem taking inspiration from the well-studied problem of influence maximization for spreading processes in social networks. We validate the method on small gene regulatory networks whose dynamical landscapes are known by means of brute-force analysis. We then systematically study a large collection of gene regulatory networks. We find that for about 65% of the analyzed networks, the minimal driver sets contain less than 20% of their nodes.
Collapse
Affiliation(s)
- Thomas Parmer
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA
| | - Luis M Rocha
- Consortium for Social and Biomedical Complexity, Systems Science and Industrial Engineering Department, Thomas J. Watson College of Engineering and Applied Science, Binghamton University (State University of New York), Binghamton, NY, 13902, USA
- Instituto Gulbenkian de Ciência, Oeiras, 2780-156, Portugal
| | - Filippo Radicchi
- Center for Complex Networks and Systems Research, Luddy School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, 47408, USA.
| |
Collapse
|
10
|
Mendik P, Kerestély M, Kamp S, Deritei D, Kunšič N, Vassy Z, Csermely P, Veres DV. Translocating proteins compartment-specifically alter the fate of epithelial-mesenchymal transition in a compartmentalized Boolean network model. NPJ Syst Biol Appl 2022; 8:19. [PMID: 35680961 PMCID: PMC9184490 DOI: 10.1038/s41540-022-00228-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 05/20/2022] [Indexed: 11/13/2022] Open
Abstract
Regulation of translocating proteins is crucial in defining cellular behaviour. Epithelial-mesenchymal transition (EMT) is important in cellular processes, such as cancer progression. Several orchestrators of EMT, such as key transcription factors, are known to translocate. We show that translocating proteins become enriched in EMT-signalling. To simulate the compartment-specific functions of translocating proteins we created a compartmentalized Boolean network model. This model successfully reproduced known biological traits of EMT and as a novel feature it also captured organelle-specific functions of proteins. Our results predicted that glycogen synthase kinase-3 beta (GSK3B) compartment-specifically alters the fate of EMT, amongst others the activation of nuclear GSK3B halts transforming growth factor beta-1 (TGFB) induced EMT. Moreover, our results recapitulated that the nuclear activation of glioma associated oncogene transcription factors (GLI) is needed to achieve a complete EMT. Compartmentalized network models will be useful to uncover novel control mechanisms of biological processes. Our algorithmic procedures can be automatically rerun on the https://translocaboole.linkgroup.hu website, which provides a framework for similar future studies.
Collapse
Affiliation(s)
- Péter Mendik
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | - Márk Kerestély
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | | | - Dávid Deritei
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | - Nina Kunšič
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | - Zsolt Vassy
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | - Péter Csermely
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary
| | - Daniel V Veres
- Department of Molecular Biology, Institute of Biochemistry and Molecular Biology, Semmelweis University, Budapest, Hungary. .,Turbine Ltd, Budapest, Hungary.
| |
Collapse
|
11
|
Discovering sparse control strategies in neural activity. PLoS Comput Biol 2022; 18:e1010072. [PMID: 35622828 PMCID: PMC9140285 DOI: 10.1371/journal.pcbi.1010072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 04/01/2022] [Indexed: 11/19/2022] Open
Abstract
Biological circuits such as neural or gene regulation networks use internal states to map sensory input to an adaptive repertoire of behavior. Characterizing this mapping is a major challenge for systems biology. Though experiments that probe internal states are developing rapidly, organismal complexity presents a fundamental obstacle given the many possible ways internal states could map to behavior. Using C. elegans as an example, we propose a protocol for systematic perturbation of neural states that limits experimental complexity and could eventually help characterize collective aspects of the neural-behavioral map. We consider experimentally motivated small perturbations—ones that are most likely to preserve natural dynamics and are closer to internal control mechanisms—to neural states and their impact on collective neural activity. Then, we connect such perturbations to the local information geometry of collective statistics, which can be fully characterized using pairwise perturbations. Applying the protocol to a minimal model of C. elegans neural activity, we find that collective neural statistics are most sensitive to a few principal perturbative modes. Dominant eigenvalues decay initially as a power law, unveiling a hierarchy that arises from variation in individual neural activity and pairwise interactions. Highest-ranking modes tend to be dominated by a few, “pivotal” neurons that account for most of the system’s sensitivity, suggesting a sparse mechanism of collective control.
Collapse
|
12
|
Dynamical Analysis of a Boolean Network Model of the Oncogene Role of lncRNA ANRIL and lncRNA UFC1 in Non-Small Cell Lung Cancer. Biomolecules 2022; 12:biom12030420. [PMID: 35327612 PMCID: PMC8946683 DOI: 10.3390/biom12030420] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Revised: 03/01/2022] [Accepted: 03/01/2022] [Indexed: 12/14/2022] Open
Abstract
Long non-coding RNA (lncRNA) such as ANRIL and UFC1 have been verified as oncogenic genes in non-small cell lung cancer (NSCLC). It is well known that the tumor suppressor microRNA-34a (miR-34a) is downregulated in NSCLC. Furthermore, miR-34a induces senescence and apoptosis in breast, glioma, cervical cancer including NSCLC by targeting Myc. Recent evidence suggests that these two lncRNAs act as a miR-34a sponge in corresponding cancers. However, the biological functions between these two non-coding RNAs (ncRNAs) have not yet been studied in NSCLC. Therefore, we present a Boolean model to analyze the gene regulation between these two ncRNAs in NSCLC. We compared our model to several experimental studies involving gain- or loss-of-function genes in NSCLC cells and achieved an excellent agreement. Additionally, we predict three positive circuits involving miR-34a/E2F1/ANRIL, miR-34a/E2F1/UFC1, and miR-34a/Myc/ANRIL. Our circuit- perturbation analysis shows that these circuits are important for regulating cell-fate decisions such as senescence and apoptosis. Thus, our Boolean network permits an explicit cell-fate mechanism associated with NSCLC. Therefore, our results support that ANRIL and/or UFC1 is an attractive target for drug development in tumor growth and aggressive proliferation of NSCLC, and that a valuable outcome can be achieved through the miRNA-34a/Myc pathway.
Collapse
|