1
|
Sgariglia D, Carneiro FRG, Vidal de Carvalho LA, Pedreira CE, Carels N, da Silva FAB. Optimizing therapeutic targets for breast cancer using boolean network models. Comput Biol Chem 2024; 109:108022. [PMID: 38350182 DOI: 10.1016/j.compbiolchem.2024.108022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 09/18/2023] [Accepted: 01/31/2024] [Indexed: 02/15/2024]
Abstract
Studying gene regulatory networks associated with cancer provides valuable insights for therapeutic purposes, given that cancer is fundamentally a genetic disease. However, as the number of genes in the system increases, the complexity arising from the interconnections between network components grows exponentially. In this study, using Boolean logic to adjust the existing relationships between network components has facilitated simplifying the modeling process, enabling the generation of attractors that represent cell phenotypes based on breast cancer RNA-seq data. A key therapeutic objective is to guide cells, through targeted interventions, to transition from the current cancer attractor to a physiologically distinct attractor unrelated to cancer. To achieve this, we developed a computational method that identifies network nodes whose inhibition can facilitate the desired transition from one tumor attractor to another associated with apoptosis, leveraging transcriptomic data from cell lines. To validate the model, we utilized previously published in vitro experiments where the downregulation of specific proteins resulted in cell growth arrest and death of a breast cancer cell line. The method proposed in this manuscript combines diverse data sources, conducts structural network analysis, and incorporates relevant biological knowledge on apoptosis in cancer cells. This comprehensive approach aims to identify potential targets of significance for personalized medicine.
Collapse
Affiliation(s)
| | - Flavia Raquel Gonçalves Carneiro
- Center of Technological Development in Health (CDTS), FIOCRUZ, Rio de Janeiro, Brazil; Laboratório Interdisciplinar de Pesquisas Médicas Instituto Oswaldo Cruz, FIOCRUZ, Rio de Janeiro, Brazil; Program of Immunology and Tumor Biology, Brazilian National Cancer Institute(INCA), Rio de Janeiro 20231050, Brazil
| | | | | | - Nicolas Carels
- Platform of Biological System Modeling, Center of Technological Development in Health (CDTS), FIOCRUZ, Rio de Janeiro, Brazil
| | | |
Collapse
|
2
|
McElroy M, Green K, Voulgarakis NK. Self-Regulated Symmetry Breaking Model for Stem Cell Differentiation. ENTROPY (BASEL, SWITZERLAND) 2023; 25:815. [PMID: 37238570 PMCID: PMC10217192 DOI: 10.3390/e25050815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 05/03/2023] [Accepted: 05/16/2023] [Indexed: 05/28/2023]
Abstract
In conventional disorder-order phase transitions, a system shifts from a highly symmetric state, where all states are equally accessible (disorder) to a less symmetric state with a limited number of available states (order). This transition may occur by varying a control parameter that represents the intrinsic noise of the system. It has been suggested that stem cell differentiation can be considered as a sequence of such symmetry-breaking events. Pluripotent stem cells, with their capacity to develop into any specialized cell type, are considered highly symmetric systems. In contrast, differentiated cells have lower symmetry, as they can only carry out a limited number of functions. For this hypothesis to be valid, differentiation should emerge collectively in stem cell populations. Additionally, such populations must have the ability to self-regulate intrinsic noise and navigate through a critical point where spontaneous symmetry breaking (differentiation) occurs. This study presents a mean-field model for stem cell populations that considers the interplay of cell-cell cooperativity, cell-to-cell variability, and finite-size effects. By introducing a feedback mechanism to control intrinsic noise, the model can self-tune through different bifurcation points, facilitating spontaneous symmetry breaking. Standard stability analysis showed that the system can potentially differentiate into several cell types mathematically expressed as stable nodes and limit cycles. The existence of a Hopf bifurcation in our model is discussed in light of stem cell differentiation.
Collapse
Affiliation(s)
- Madelynn McElroy
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA; (M.M.); (K.G.)
- Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA 99164, USA
| | - Kaylie Green
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA; (M.M.); (K.G.)
- Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA 99164, USA
| | - Nikolaos K. Voulgarakis
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA; (M.M.); (K.G.)
| |
Collapse
|
3
|
Wang Y, He S. Inference on autoregulation in gene expression with variance-to-mean ratio. J Math Biol 2023; 86:87. [PMID: 37131095 PMCID: PMC10154285 DOI: 10.1007/s00285-023-01924-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/14/2023] [Accepted: 04/18/2023] [Indexed: 05/04/2023]
Abstract
Some genes can promote or repress their own expressions, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation from gene expression data. This method only needs to compare the mean and variance of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, CA, 90095, USA.
- Institut des Hautes Études Scientifiques (IHÉS), Bures-sur-Yvette, 91440, Essonne, France.
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, NY, 11794, USA
| |
Collapse
|
4
|
Ferreira SS, Anderson CE, Antunes MS. A logical way to reprogram plants. Biochem Biophys Res Commun 2023; 654:80-86. [PMID: 36898227 DOI: 10.1016/j.bbrc.2023.02.080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023]
Abstract
Living cells constantly monitor their external and internal environments for changing conditions, stresses or developmental cues. Networks of genetically encoded components sense and process these signals following pre-defined rules in such a way that specific combinations of the presence or absence of certain signals activate suitable responses. Many biological signal integration mechanisms approximate Boolean logic operations, whereby presence or absence of signals are computed as variables with values described as either true or false, respectively. Boolean logic gates are commonly used in algebra and in computer sciences, and have long been recognized as useful information processing devices in electronic circuits. In these circuits, logic gates integrate multiple input values and produce an output signal according to pre-defined Boolean logic operations. Recent implementation of these logic operations using genetic components to process information in living cells has allowed genetic circuits to enable novel traits with decision-making capabilities. Although several literature reports describe the design and use of these logic gates to introduce new functions in bacterial, yeast and mammalian cells, similar approaches in plants remain scarce, likely due to challenges posed by the complexity of plants and the lack of some technological advances, e.g., species-independent genetic transformation. In this mini review, we have surveyed recent reports describing synthetic genetic Boolean logic operators in plants and the different gate architectures used. We also briefly discuss the potential of deploying these genetic devices in plants to bring to fruition a new generation of resilient crops and improved biomanufacturing platforms.
Collapse
Affiliation(s)
- Savio S Ferreira
- Department of Biological Sciences, University of North Texas, Denton, TX, 76203, USA; BioDiscovery Institute, University of North Texas, Denton, TX, 76203, USA.
| | - Charles E Anderson
- Department of Biological Sciences, University of North Texas, Denton, TX, 76203, USA; BioDiscovery Institute, University of North Texas, Denton, TX, 76203, USA.
| | - Mauricio S Antunes
- Department of Biological Sciences, University of North Texas, Denton, TX, 76203, USA; BioDiscovery Institute, University of North Texas, Denton, TX, 76203, USA.
| |
Collapse
|
5
|
Wang Y, He S. Using Fano factors to determine certain types of gene autoregulation. ARXIV 2023:arXiv:2301.06692v2. [PMID: 36713249 PMCID: PMC9882590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The expression of one gene might be regulated by its corresponding protein, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation in certain scenarios from gene expression data. This method only depends on the Fano factor, namely the ratio of variance and mean of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, California, United States of America
- Institut des Hautes Études Scientifiques, Bures-sur-Yvette, Essonne, France
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, New York, United States of America
| |
Collapse
|
6
|
Kang C, McElroy M, Voulgarakis NK. Emergent Criticality in Coupled Boolean Networks. ENTROPY (BASEL, SWITZERLAND) 2023; 25:235. [PMID: 36832602 PMCID: PMC9955248 DOI: 10.3390/e25020235] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 01/23/2023] [Accepted: 01/25/2023] [Indexed: 06/01/2023]
Abstract
Early embryonic development involves forming all specialized cells from a fluid-like mass of identical stem cells. The differentiation process consists of a series of symmetry-breaking events, starting from a high-symmetry state (stem cells) to a low-symmetry state (specialized cells). This scenario closely resembles phase transitions in statistical mechanics. To theoretically study this hypothesis, we model embryonic stem cell (ESC) populations through a coupled Boolean network (BN) model. The interaction is applied using a multilayer Ising model that considers paracrine and autocrine signaling, along with external interventions. It is demonstrated that cell-to-cell variability can be interpreted as a mixture of steady-state probability distributions. Simulations have revealed that such models can undergo a series of first- and second-order phase transitions as a function of the system parameters that describe gene expression noise and interaction strengths. These phase transitions result in spontaneous symmetry-breaking events that generate new types of cells characterized by various steady-state distributions. Coupled BNs have also been shown to self-organize in states that allow spontaneous cell differentiation.
Collapse
Affiliation(s)
- Chris Kang
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA
| | - Madelynn McElroy
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA
- Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA 99164, USA
| | - Nikolaos K. Voulgarakis
- Department of Mathematics and Statistics, Washington State University, Pullman, WA 99164, USA
| |
Collapse
|
7
|
Tan Y, Neto FBL, Neto UB. PALLAS: Penalized mAximum LikeLihood and pArticle Swarms for Inference of Gene Regulatory Networks From Time Series Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1807-1816. [PMID: 33170782 DOI: 10.1109/tcbb.2020.3037090] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
We present PALLAS, a practical method for gene regulatory network (GRN) inference from time series data, which employs penalized maximum likelihood and particle swarms for optimization. PALLAS is based on the Partially-Observable Boolean Dynamical System (POBDS) model and thus does not require ad-hoc binarization of the data. The penalty in the likelihood is a LASSO regularization term, which encourages the resulting network to be sparse. PALLAS is able to scale to networks of realistic size under no prior knowledge, by virtue of a novel continuous-discrete Fish School Search particle swarm algorithm for efficient simultaneous maximization of the penalized likelihood over the discrete space of networks and the continuous space of observational parameters. The performance of PALLAS is demonstrated by a comprehensive set of experiments using synthetic data generated from real and artificial networks, as well as real time series microarray and RNA-seq data, where it is compared to several other well-known methods for gene regulatory network inference. The results show that PALLAS can infer GRNs more accurately than other methods, while being capable of working directly on gene expression data, without need of ad-hoc binarization. PALLAS is a fully-fledged program, written in python, and available on GitHub (https://github.com/yukuntan92/PALLAS).
Collapse
|
8
|
Li F, Xie L. Set Stabilization of Probabilistic Boolean Networks Using Pinning Control. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:2555-2561. [PMID: 30530342 DOI: 10.1109/tnnls.2018.2881279] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Probabilistic Boolean network (PBN) is a kind of stochastic logical system in which update functions are randomly selected from a set of candidate Boolean functions according to a prescribed probability distribution at each time step. In this brief, a pinning controller design algorithm is proposed to set stabilize any PBN with probability one. First, an algorithm is given to change the columns of its transition matrix. Then, according to the newly obtained transition matrix, a fraction of nodes can be selected as pinning nodes to inject control inputs to achieve set stabilization. The problem is challenging since the Boolean functions in a PBN are not deterministic but are randomly chosen among several Boolean functions. Furthermore, the structure matrices of the pinning controllers are given by solving some logical matrices equations based on which a pinning controller design algorithm is provided to set stabilize the PBN with probability one. Finally, the theoretical results are validated using several examples.
Collapse
|
9
|
Barbosa S, Niebel B, Wolf S, Mauch K, Takors R. A guide to gene regulatory network inference for obtaining predictive solutions: Underlying assumptions and fundamental biological and data constraints. Biosystems 2018; 174:37-48. [PMID: 30312740 DOI: 10.1016/j.biosystems.2018.10.008] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Revised: 10/05/2018] [Accepted: 10/08/2018] [Indexed: 02/07/2023]
Abstract
The study of biological systems at a system level has become a reality due to the increasing powerful computational approaches able to handle increasingly larger datasets. Uncovering the dynamic nature of gene regulatory networks in order to attain a system level understanding and improve the predictive power of biological models is an important research field in systems biology. The task itself presents several challenges, since the problem is of combinatorial nature and highly depends on several biological constraints and also the intended application. Given the intrinsic interdisciplinary nature of gene regulatory network inference, we present a review on the currently available approaches, their challenges and limitations. We propose guidelines to select the most appropriate method considering the underlying assumptions and fundamental biological and data constraints.
Collapse
Affiliation(s)
- Sara Barbosa
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany.
| | - Bastian Niebel
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Sebastian Wolf
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Klaus Mauch
- Insilico Biotechnology AG, Meitnerstrasse 9, 70563 Stuttgart, Germany
| | - Ralf Takors
- Institute of Biochemical Engineering, University of Stuttgart, Allmandring 31, 70569 Stuttgart, Germany
| |
Collapse
|
10
|
Mizera A. Reviving the Two-State Markov Chain Approach. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1525-1537. [PMID: 28534781 DOI: 10.1109/tcbb.2017.2704592] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Probabilistic Boolean networks (PBNs) is a well-established computational framework for modelling biological systems. The steady-state dynamics of PBNs is of crucial importance in the study of such systems. However, for large PBNs, which often arise in systems biology, obtaining the steady-state distribution poses a significant challenge. In this paper, we revive the two-state Markov chain approach to solve this problem. This paper contributes in three aspects. First, we identify a problem of generating biased results with the approach and we propose a few heuristics to avoid such a pitfall. Second, we conduct an extensive experimental comparison of the extended two-state Markov chain approach and another approach based on the Skart method. We analyze the results with machine learning techniques and we show that statistically the two-state Markov chain approach has a better performance. Finally, we demonstrate the potential of the extended two-state Markov chain approach on a case study of a large PBN model of apoptosis in hepatocytes.
Collapse
|
11
|
Determining Relative Dynamic Stability of Cell States Using Boolean Network Model. Sci Rep 2018; 8:12077. [PMID: 30104572 PMCID: PMC6089891 DOI: 10.1038/s41598-018-30544-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 08/02/2018] [Indexed: 01/05/2023] Open
Abstract
Cell state transition is at the core of biological processes in metazoan, which includes cell differentiation, epithelial-to-mesenchymal transition (EMT) and cell reprogramming. In these cases, it is important to understand the molecular mechanism of cellular stability and how the transitions happen between different cell states, which is controlled by a gene regulatory network (GRN) hard-wired in the genome. Here we use Boolean modeling of GRN to study the cell state transition of EMT and systematically compare four available methods to calculate the cellular stability of three cell states in EMT in both normal and genetically mutated cases. The results produced from four methods generally agree but do not totally agree with each other. We show that distribution of one-degree neighborhood of cell states, which are the nearest states by Hamming distance, causes the difference among the methods. From that, we propose a new method based on one-degree neighborhood, which is the simplest one and agrees with other methods to estimate the cellular stability in all scenarios of our EMT model. This new method will help the researchers in the field of cell differentiation and cell reprogramming to calculate cellular stability using Boolean model, and then rationally design their experimental protocols to manipulate the cell state transition.
Collapse
|
12
|
Mizera A, Pang J, Su C, Yuan Q. ASSA-PBN: A Toolbox for Probabilistic Boolean Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1203-1216. [PMID: 29990128 DOI: 10.1109/tcbb.2017.2773477] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
As a well-established computational framework, probabilistic Boolean networks (PBNs) are widely used for modelling, simulation, and analysis of biological systems. To analyze the steady-state dynamics of PBNs is of crucial importance to explore the characteristics of biological systems. However, the analysis of large PBNs, which often arise in systems biology, is prone to the infamous state-space explosion problem. Therefore, the employment of statistical methods often remains the only feasible solution. We present ${\mathsf{ASSA-PBN}}$ , a software toolbox for modelling, simulation, and analysis of PBNs. ${\mathsf{ASSA-PBN}}$ provides efficient statistical methods with three parallel techniques to speed up the computation of steady-state probabilities. Moreover, particle swarm optimisation (PSO) and differential evolution (DE) are implemented for the estimation of PBN parameters. Additionally, we implement in-depth analyses of PBNs, including long-run influence analysis, long-run sensitivity analysis, computation of one-parameter profile likelihoods, and the visualization of one-parameter profile likelihoods. A PBN model of apoptosis is used as a case study to illustrate the main functionalities of ${\mathsf{ASSA-PBN}}$ and to demonstrate the capabilities of ${\mathsf{ASSA-PBN}}$ to effectively analyse biological systems modelled as PBNs.
Collapse
|
13
|
Chen H, Liang J, Lu J, Qiu J. Synchronization for the Realization-Dependent Probabilistic Boolean Networks. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:819-831. [PMID: 28129189 DOI: 10.1109/tnnls.2017.2647989] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
This paper investigates the synchronization problem for the realization-dependent probabilistic Boolean networks (PBNs) coupled unidirectionally in the drive-response configuration. The realization of the response PBN is assumed to be uniquely determined by the realization signal generated by the drive PBN at each discrete time instant. First, the drive-response PBNs are expressed in their algebraic forms based on the semitensor product method, and then, a necessary and sufficient condition is presented for the synchronization of the PBNs. Second, by resorting to a newly defined matrix operator, the reachable set from any initial state is expressed by a column vector. Consequently, an easily computable algebraic criterion is derived assuring the synchronization of the drive-response PBNs. Finally, three illustrative examples are employed to demonstrate the applicability and usefulness of the developed theoretical results.
Collapse
|
14
|
Chen S, Nagel S, Schneider B, Dai H, Geffers R, Kaufmann M, Meyer C, Pommerenke C, Thress KS, Li J, Quentmeier H, Drexler HG, MacLeod RAF. A new ETV6-NTRK3 cell line model reveals MALAT1 as a novel therapeutic target - a short report. Cell Oncol (Dordr) 2017; 41:93-101. [PMID: 29119387 DOI: 10.1007/s13402-017-0356-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/06/2017] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Previously, the chromosomal translocation t(12;15)(p13;q25) has been found to recurrently occur in both solid tumors and leukemias. This translocation leads to ETV6-NTRK3 (EN) gene fusions resulting in ectopic expression of the NTRK3 neurotropic tyrosine receptor kinase moiety as well as oligomerization through the donated ETV6-sterile alpha motif domain. As yet, no in vitro cell line model carrying this anomaly is available. Here we genetically characterized the acute promyelocytic leukemia (APL) cell line AP-1060 and, by doing so, revealed the presence of a t(12;15)(p13;q25). Subsequently, we evaluated its suitability as a model for this important clinical entity. METHODS Spectral karyotyping, fluorescence in situ hybridization (FISH), and genomic and transcriptomic microarray-based profiling were used to screen for the presence of EN fusions. qRT-PCR was used for quantitative expression analyses. Responses to AZ-23 (NTRK) and wortmannin (PI3K) inhibitors, as well as to arsenic trioxide (ATO), were assessed using colorimetric assays. An AZ-23 microarray screen was used to define the EN targetome, which was parsed bioinformatically. MAPK1 and MALAT1 activation were assayed using Western blotting and RNA-FISH, respectively, whereas an AML patient cohort was used to assess the clinical occurrence of MALAT1 activation. RESULTS An EN fusion was detected in AP1060 cells which, accordingly, turned out to be hypersensitive to AZ-23. We also found that AZ-23 can potentiate the effect of ATO and inhibit the phosphorylation of its canonical target MAPK1. The AZ-23 microarray screen highlighted a novel EN target, MALAT1, which also proved sensitive to wortmannin. Finally, we found that MALAT1 was massively up-regulated in a subset of AML patients. CONCLUSIONS From our data we conclude that AP-1060 may serve as a first publicly available preclinical model for EN. In addition, we conclude that these EN-positive cells are sensitive to the NTRK inhibitor AZ-23 and that this inhibitor may potentiate the therapeutic efficacy of ATO. Our data also highlight a novel AML EN target, MALAT1, which was so far only conspicuous in solid tumors.
Collapse
Affiliation(s)
- Suning Chen
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany.,Jiangsu Institute of Hematology, Key Laboratory of Thrombosis and Hemostasis of Ministry of Health, The First Affiliated Hospital of Soochow University, Suzhou, People's Republic of China
| | - Stefan Nagel
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | - Bjoern Schneider
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany.,Institute of Pathology and Molecular Pathology, University of Rostock, Rostock, Germany
| | - Haiping Dai
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany.,Jiangsu Institute of Hematology, Key Laboratory of Thrombosis and Hemostasis of Ministry of Health, The First Affiliated Hospital of Soochow University, Suzhou, People's Republic of China
| | - Robert Geffers
- Genome Analytics Research Group, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Maren Kaufmann
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | - Corinna Meyer
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | - Claudia Pommerenke
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | | | - Jiao Li
- Department of Hematology, Yixing People's Hospital of Jiangsu Province, Yixing, People's Republic of China
| | - Hilmar Quentmeier
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | - Hans G Drexler
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany
| | - Roderick A F MacLeod
- Department of Human and Animal Cell Lines, DSMZ - German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7b, 38124, Braunschweig, Germany.
| |
Collapse
|
15
|
Simak M, Yeang CH, Lu HHS. Exploring candidate biological functions by Boolean Function Networks for Saccharomyces cerevisiae. PLoS One 2017; 12:e0185475. [PMID: 28981547 PMCID: PMC5628832 DOI: 10.1371/journal.pone.0185475] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2017] [Accepted: 09/13/2017] [Indexed: 01/26/2023] Open
Abstract
The great amount of gene expression data has brought a big challenge for the discovery of Gene Regulatory Network (GRN). For network reconstruction and the investigation of regulatory relations, it is desirable to ensure directness of links between genes on a map, infer their directionality and explore candidate biological functions from high-throughput transcriptomic data. To address these problems, we introduce a Boolean Function Network (BFN) model based on techniques of hidden Markov model (HMM), likelihood ratio test and Boolean logic functions. BFN consists of two consecutive tests to establish links between pairs of genes and check their directness. We evaluate the performance of BFN through the application to S. cerevisiae time course data. BFN produces regulatory relations which show consistency with succession of cell cycle phases. Furthermore, it also improves sensitivity and specificity when compared with alternative methods of genetic network reverse engineering. Moreover, we demonstrate that BFN can provide proper resolution for GO enrichment of gene sets. Finally, the Boolean functions discovered by BFN can provide useful insights for the identification of control mechanisms of regulatory processes, which is the special advantage of the proposed approach. In combination with low computational complexity, BFN can serve as an efficient screening tool to reconstruct genes relations on the whole genome level. In addition, the BFN approach is also feasible to a wide range of time course datasets.
Collapse
Affiliation(s)
- Maria Simak
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
- Institute of Statistics, National Chiao Tung University, Hsinchu, Taiwan
| | | | - Henry Horng-Shing Lu
- Institute of Statistics, National Chiao Tung University, Hsinchu, Taiwan
- Big Data Research Center, National Chiao Tung University, Hsinchu, Taiwan
| |
Collapse
|
16
|
Mathematical and Computational Modeling in Complex Biological Systems. BIOMED RESEARCH INTERNATIONAL 2017; 2017:5958321. [PMID: 28386558 PMCID: PMC5366773 DOI: 10.1155/2017/5958321] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Revised: 12/20/2016] [Accepted: 01/16/2017] [Indexed: 12/22/2022]
Abstract
The biological process and molecular functions involved in the cancer progression remain difficult to understand for biologists and clinical doctors. Recent developments in high-throughput technologies urge the systems biology to achieve more precise models for complex diseases. Computational and mathematical models are gradually being used to help us understand the omics data produced by high-throughput experimental techniques. The use of computational models in systems biology allows us to explore the pathogenesis of complex diseases, improve our understanding of the latent molecular mechanisms, and promote treatment strategy optimization and new drug discovery. Currently, it is urgent to bridge the gap between the developments of high-throughput technologies and systemic modeling of the biological process in cancer research. In this review, we firstly studied several typical mathematical modeling approaches of biological systems in different scales and deeply analyzed their characteristics, advantages, applications, and limitations. Next, three potential research directions in systems modeling were summarized. To conclude, this review provides an update of important solutions using computational modeling approaches in systems biology.
Collapse
|
17
|
Das H, Layek RK. Estimation of delays in generalized asynchronous Boolean networks. MOLECULAR BIOSYSTEMS 2016; 12:3098-110. [PMID: 27464825 DOI: 10.1039/c6mb00276e] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A new generalized asynchronous Boolean network (GABN) model has been proposed in this paper. This continuous-time discrete-state model captures the biological reality of cellular dynamics without compromising the computational efficiency of the Boolean framework. The GABN synthesis procedure is based on the prior knowledge of the logical structure of the regulatory network, and the experimental transcriptional parameters. The novelty of the proposed methodology lies in considering different delays associated with the activation and deactivation of a particular protein (especially the transcription factors). A few illustrative examples of some well-studied network motifs have been provided to explore the scope of using the GABN model for larger networks. The GABN model of the p53-signaling pathway in response to γ-irradiation has also been simulated in the current paper to provide an indirect validation of the proposed schema.
Collapse
Affiliation(s)
- Haimabati Das
- Department of Electronics and Electrical Communication Engineering, Indian Institute of Technology Kharagpur, 721302, India.
| | | |
Collapse
|
18
|
|
19
|
A Probabilistic Boolean Network Approach for the Analysis of Cancer-Specific Signalling: A Case Study of Deregulated PDGF Signalling in GIST. PLoS One 2016; 11:e0156223. [PMID: 27232499 PMCID: PMC4883749 DOI: 10.1371/journal.pone.0156223] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Accepted: 05/11/2016] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Signal transduction networks are increasingly studied with mathematical modelling approaches while each of them is suited for a particular problem. For the contextualisation and analysis of signalling networks with steady-state protein data, we identified probabilistic Boolean network (PBN) as a promising framework which could capture quantitative changes of molecular changes at steady-state with a minimal parameterisation. RESULTS AND CONCLUSION In our case study, we successfully applied the PBN approach to model and analyse the deregulated Platelet-Derived Growth Factor (PDGF) signalling pathway in Gastrointestinal Stromal Tumour (GIST). We experimentally determined a rich and accurate dataset of steady-state profiles of selected downstream kinases of PDGF-receptor-alpha mutants in combination with inhibitor treatments. Applying the tool optPBN, we fitted a literature-derived candidate network model to the training dataset consisting of single perturbation conditions. Model analysis suggested several important crosstalk interactions. The validity of these predictions was further investigated experimentally pointing to relevant ongoing crosstalk from PI3K to MAPK signalling in tumour cells. The refined model was evaluated with a validation dataset comprising multiple perturbation conditions. The model thereby showed excellent performance allowing to quantitatively predict the combinatorial responses from the individual treatment results in this cancer setting. The established optPBN pipeline is also widely applicable to gain a better understanding of other signalling networks at steady-state in a context-specific fashion.
Collapse
|
20
|
Chatterjee P, Pal NR. Construction of synergy networks from gene expression data related to disease. Gene 2016; 590:250-62. [PMID: 27222483 DOI: 10.1016/j.gene.2016.05.029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2015] [Revised: 03/11/2016] [Accepted: 05/17/2016] [Indexed: 02/07/2023]
Abstract
A few methods have been developed to determine whether genes collaborate with each other in relation to a particular disease using an information theoretic measure of synergy. Here, we propose an alternative definition of synergy and justify that our definition improves upon the existing measures of synergy in the context of gene interactions. We use this definition on a prostate cancer data set consisting of gene expression levels in both cancerous and non-cancerous samples and identify pairs of genes which are unable to discriminate between cancerous and non-cancerous samples individually but can do so jointly when we take their synergistic property into account. We also propose a very simple yet effective technique for computation of conditional entropy at a very low cost. The worst case complexity of our method is O(n) while the best case complexity of a state-of-the-art method is O(n(2)). Furthermore, our method can also be extended to find synergistic relation among triplets or even among a larger number of genes. Finally, we validate our results by demonstrating that these findings cannot be due to pure chance and provide the relevance of the synergistic pairs in cancer biology.
Collapse
Affiliation(s)
- Prantik Chatterjee
- Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, India
| | - Nikhil Ranjan Pal
- Electronics and Communication Sciences Unit, Indian Statistical Institute, Calcutta, India.
| |
Collapse
|
21
|
Zhou JX, Samal A, d'Hérouël AF, Price ND, Huang S. Relative stability of network states in Boolean network models of gene regulation in development. Biosystems 2016; 142-143:15-24. [PMID: 26965665 DOI: 10.1016/j.biosystems.2016.03.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Revised: 02/27/2016] [Accepted: 03/02/2016] [Indexed: 01/06/2023]
Abstract
Progress in cell type reprogramming has revived the interest in Waddington's concept of the epigenetic landscape. Recently researchers developed the quasi-potential theory to represent the Waddington's landscape. The Quasi-potential U(x), derived from interactions in the gene regulatory network (GRN) of a cell, quantifies the relative stability of network states, which determine the effort required for state transitions in a multi-stable dynamical system. However, quasi-potential landscapes, originally developed for continuous systems, are not suitable for discrete-valued networks which are important tools to study complex systems. In this paper, we provide a framework to quantify the landscape for discrete Boolean networks (BNs). We apply our framework to study pancreas cell differentiation where an ensemble of BN models is considered based on the structure of a minimal GRN for pancreas development. We impose biologically motivated structural constraints (corresponding to specific type of Boolean functions) and dynamical constraints (corresponding to stable attractor states) to limit the space of BN models for pancreas development. In addition, we enforce a novel functional constraint corresponding to the relative ordering of attractor states in BN models to restrict the space of BN models to the biological relevant class. We find that BNs with canalyzing/sign-compatible Boolean functions best capture the dynamics of pancreas cell differentiation. This framework can also determine the genes' influence on cell state transitions, and thus can facilitate the rational design of cell reprogramming protocols.
Collapse
Affiliation(s)
- Joseph Xu Zhou
- Institute for Systems Biology, Seattle, WA, USA; Kavli Institute for Theoretical Physics, UC Santa Barbara, CA, USA
| | - Areejit Samal
- Institute for Systems Biology, Seattle, WA, USA; The Institute of Mathematical Sciences, Chennai, India; The Abdus Salam International Centre for Theoretical Physics, Trieste, Italy
| | - Aymeric Fouquier d'Hérouël
- Institute for Systems Biology, Seattle, WA, USA; Luxembourg Centre for Systems Biomedicine, Esch-sur-Alzette, Luxembourg
| | | | - Sui Huang
- Institute for Systems Biology, Seattle, WA, USA.
| |
Collapse
|
22
|
Acharya L, Reynolds R, Zhu D. Network inference through synergistic subnetwork evolution. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2015; 2015:12. [PMID: 26640480 PMCID: PMC4662719 DOI: 10.1186/s13637-015-0027-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 08/21/2015] [Indexed: 12/02/2022]
Abstract
Study of signaling networks is important for a better understanding of cell behaviors e.g., growth, differentiation, metabolism, proptosis, and gaining deeper insights into the molecular mechanisms of complex diseases. While there have been many successes in developing computational approaches for identifying potential genes and proteins involved in cell signaling, new methods are needed for identifying network structures that depict underlying signal cascading mechanisms. In this paper, we propose a new computational approach for inferring signaling network structures from overlapping gene sets related to the networks. In the proposed approach, a signaling network is represented as a directed graph and is viewed as a union of many active paths representing linear and overlapping chains of signal cascading activities in the network. Gene sets represent the sets of genes participating in active paths without prior knowledge of the order in which genes occur within each path. From a compendium of unordered gene sets, the proposed algorithm reconstructs the underlying network structure through evolution of synergistic active paths. In our context, the extent of edge overlapping among active paths is used to define the synergy present in a network. We evaluated the performance of the proposed algorithm in terms of its convergence and recovering true active paths by utilizing four gene set compendiums derived from the KEGG database. Evaluation of results demonstrate the ability of the algorithm in reconstructing the underlying networks with high accuracy and precision.
Collapse
Affiliation(s)
- Lipi Acharya
- Dow AgroSciences, 9330 Zionsville Road, Indianapolis, IN 46268 USA
| | - Robert Reynolds
- Department of Computer Science, Wayne State University, 5057 Woodward Avenue, Detroit, MI 48202 USA
| | - Dongxiao Zhu
- Department of Computer Science, Wayne State University, 5057 Woodward Avenue, Detroit, MI 48202 USA
| |
Collapse
|
23
|
Chatterjee P, Pal NR. Discovery of synergistic genetic network: A minimum spanning tree-based approach. J Bioinform Comput Biol 2015; 14:1650003. [PMID: 26620041 DOI: 10.1142/s0219720016500037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Identification of gene interactions is one of the very well-known and important problems in the field of genetics. However, discovering synergistic gene interactions is a relatively new problem which has been proven to be as significant as the former in genetics. Several approaches have been proposed in this regard and most of them depend upon information theoretic measures. These approaches quantize the gene expression levels, explicitly or implicitly and therefore, may lose information. Here, we have proposed a novel approach for identifying synergistic gene interactions directly from the continuous expression levels, using a minimum spanning tree (MST)-based algorithm. We have used this approach to find pairs of synergistically interacting genes in prostate cancer. The advantages of our method are that it does not need any discretization and it can be extended straightway to find synergistically interacting sets of genes having three or more elements as per the requirement of the situation. We have demonstrated the relevance of the synergistic genes in cancer biology using KEGG pathway analysis and otherwise.
Collapse
|
24
|
Guo NL, Wan YW. Network-based identification of biomarkers coexpressed with multiple pathways. Cancer Inform 2014; 13:37-47. [PMID: 25392692 PMCID: PMC4218687 DOI: 10.4137/cin.s14054] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Revised: 06/25/2014] [Accepted: 06/29/2014] [Indexed: 02/07/2023] Open
Abstract
Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson’s correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson’s correlation networks when evaluated with MSigDB database.
Collapse
Affiliation(s)
- Nancy Lan Guo
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| | - Ying-Wooi Wan
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
25
|
Korsunsky I, McGovern K, LaGatta T, Olde Loohuis L, Grosso-Applewhite T, Griffeth N, Mishra B. Systems biology of cancer: a challenging expedition for clinical and quantitative biologists. Front Bioeng Biotechnol 2014; 2:27. [PMID: 25191654 PMCID: PMC4137540 DOI: 10.3389/fbioe.2014.00027] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2014] [Accepted: 07/18/2014] [Indexed: 11/25/2022] Open
Abstract
A systems-biology approach to complex disease (such as cancer) is now complementing traditional experience-based approaches, which have typically been invasive and expensive. The rapid progress in biomedical knowledge is enabling the targeting of disease with therapies that are precise, proactive, preventive, and personalized. In this paper, we summarize and classify models of systems biology and model checking tools, which have been used to great success in computational biology and related fields. We demonstrate how these models and tools have been used to study some of the twelve biochemical pathways implicated in but not unique to pancreatic cancer, and conclude that the resulting mechanistic models will need to be further enhanced by various abstraction techniques to interpret phenomenological models of cancer progression.
Collapse
Affiliation(s)
- Ilya Korsunsky
- Department of Computer Science, Courant Institute, New York University, New York, NY, USA
| | - Kathleen McGovern
- Department of Mathematics and Statistics, Hunter College, City University of New York, New York, NY, USA
| | - Tom LaGatta
- Department of Mathematics, Courant Institute, New York University, New York, NY, USA
| | - Loes Olde Loohuis
- Department of Computer Science, The Graduate Center, City University of New York, New York, NY, USA
| | - Terri Grosso-Applewhite
- Department of Computer Science, The Graduate Center, City University of New York, New York, NY, USA
| | - Nancy Griffeth
- Department of Mathematics and Computer Science, Lehman College, City University of New York, New York, NY, USA
| | - Bud Mishra
- Department of Computer Science, Courant Institute, New York University, New York, NY, USA
- Department of Mathematics, Courant Institute, New York University, New York, NY, USA
| |
Collapse
|
26
|
Trairatphisan P, Mizera A, Pang J, Tantar AA, Sauter T. optPBN: an optimisation toolbox for probabilistic Boolean networks. PLoS One 2014; 9:e98001. [PMID: 24983623 PMCID: PMC4077690 DOI: 10.1371/journal.pone.0098001] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Accepted: 04/27/2014] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND There exist several computational tools which allow for the optimisation and inference of biological networks using a Boolean formalism. Nevertheless, the results from such tools yield only limited quantitative insights into the complexity of biological systems because of the inherited qualitative nature of Boolean networks. RESULTS We introduce optPBN, a Matlab-based toolbox for the optimisation of probabilistic Boolean networks (PBN) which operates under the framework of the BN/PBN toolbox. optPBN offers an easy generation of probabilistic Boolean networks from rule-based Boolean model specification and it allows for flexible measurement data integration from multiple experiments. Subsequently, optPBN generates integrated optimisation problems which can be solved by various optimisers. In term of functionalities, optPBN allows for the construction of a probabilistic Boolean network from a given set of potential constitutive Boolean networks by optimising the selection probabilities for these networks so that the resulting PBN fits experimental data. Furthermore, the optPBN pipeline can also be operated on large-scale computational platforms to solve complex optimisation problems. Apart from exemplary case studies which we correctly inferred the original network, we also successfully applied optPBN to study a large-scale Boolean model of apoptosis where it allows identifying the inverse correlation between UVB irradiation, NFκB and Caspase 3 activations, and apoptosis in primary hepatocytes quantitatively. Also, the results from optPBN help elucidating the relevancy of crosstalk interactions in the apoptotic network. SUMMARY The optPBN toolbox provides a simple yet comprehensive pipeline for integrated optimisation problem generation in the PBN formalism that can readily be solved by various optimisers on local or grid-based computational platforms. optPBN can be further applied to various biological studies such as the inference of gene regulatory networks or the identification of the interaction's relevancy in signal transduction networks.
Collapse
Affiliation(s)
- Panuwat Trairatphisan
- Systems Biology Group, Life Sciences Research Unit, University of Luxembourg, Luxembourg, Luxembourg
| | - Andrzej Mizera
- Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg, Luxembourg
| | - Jun Pang
- Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg, Luxembourg
- Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg, Luxembourg
| | - Alexandru Adrian Tantar
- Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg, Luxembourg
| | - Thomas Sauter
- Systems Biology Group, Life Sciences Research Unit, University of Luxembourg, Luxembourg, Luxembourg
| |
Collapse
|
27
|
Zhu P, Liang J, Han J. Gene perturbation and intervention in context-sensitive stochastic Boolean networks. BMC SYSTEMS BIOLOGY 2014; 8:60. [PMID: 24886608 PMCID: PMC4062525 DOI: 10.1186/1752-0509-8-60] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2013] [Accepted: 04/22/2014] [Indexed: 01/03/2023]
Abstract
Background In a gene regulatory network (GRN), gene expressions are affected by noise, and stochastic fluctuations exist in the interactions among genes. These stochastic interactions are context dependent, thus it becomes important to consider noise in a context-sensitive manner in a network model. As a logical model, context-sensitive probabilistic Boolean networks (CSPBNs) account for molecular and genetic noise in the temporal context of gene functions. In a CSPBN with n genes and k contexts, however, a computational complexity of O(nk222n) (or O(nk2n)) is required for an accurate (or approximate) computation of the state transition matrix (STM) of the size (2n ∙ k) × (2n ∙ k) (or 2n × 2n). The evaluation of a steady state distribution (SSD) is more challenging. Recently, stochastic Boolean networks (SBNs) have been proposed as an efficient implementation of an instantaneous PBN. Results The notion of stochastic Boolean networks (SBNs) is extended for the general model of PBNs, i.e., CSPBNs. This yields a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a GRN. A CSSBN enables an efficient simulation of a CSPBN with a complexity of O(nLk2n) for computing the state transition matrix, where L is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. A time-frame expanded CSSBN can further efficiently simulate the stationary behavior of a CSPBN and allow for a tunable tradeoff between accuracy and efficiency. The CSSBN approach is more efficient than an analytical method and more accurate than an approximate analysis. Conclusions Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks. A CSSBN analysis provides biologically meaningful insights into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment. It is shown that random gene perturbation has a greater effect on the final distribution of the steady state of a network compared to context switching activities. The CSSBN approach can further predict the steady state distribution of a glioma network under gene intervention. Ultimately, this will help drug discovery and develop effective drug intervention strategies.
Collapse
Affiliation(s)
| | | | - Jie Han
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
28
|
Zhu P, Han J. Stochastic multiple-valued gene networks. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2014; 8:42-53. [PMID: 24681918 DOI: 10.1109/tbcas.2013.2291398] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Among various approaches to modeling gene regulatory networks (GRNs), Boolean networks (BNs) and its probabilistic extension, probabilistic Boolean networks (PBNs), have been studied to gain insights into the dynamics of GRNs. To further exploit the simplicity of logical models, a multiple-valued network employs gene states that are not limited to binary values, thus providing a finer granularity in the modeling of GRNs. In this paper, stochastic multiple-valued networks (SMNs) are proposed for modeling the effects of noise and gene perturbation in a GRN. An SMN enables an accurate and efficient simulation of a probabilistic multiple-valued network (as an extension of a PBN). In a k-level SMN of n genes, it requires a complexity of O(nLk(n)) to compute the state transition matrix, where L is a factor related to the minimum sequence length in the SMN for achieving a desired accuracy. The use of randomly permuted stochastic sequences further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. The analysis of a p53-Mdm2 network and a WNT5A network shows that the proposed SMN approach is efficient in evaluating the network dynamics and steady state distribution of gene networks under random gene perturbation.
Collapse
|
29
|
Trairatphisan P, Mizera A, Pang J, Tantar AA, Schneider J, Sauter T. Recent development and biomedical applications of probabilistic Boolean networks. Cell Commun Signal 2013; 11:46. [PMID: 23815817 PMCID: PMC3726340 DOI: 10.1186/1478-811x-11-46] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2013] [Accepted: 06/22/2013] [Indexed: 12/13/2022] Open
Abstract
Probabilistic Boolean network (PBN) modelling is a semi-quantitative approach widely used for the study of the topology and dynamic aspects of biological systems. The combined use of rule-based representation and probability makes PBN appealing for large-scale modelling of biological networks where degrees of uncertainty need to be considered.A considerable expansion of our knowledge in the field of theoretical research on PBN can be observed over the past few years, with a focus on network inference, network intervention and control. With respect to areas of applications, PBN is mainly used for the study of gene regulatory networks though with an increasing emergence in signal transduction, metabolic, and also physiological networks. At the same time, a number of computational tools, facilitating the modelling and analysis of PBNs, are continuously developed.A concise yet comprehensive review of the state-of-the-art on PBN modelling is offered in this article, including a comparative discussion on PBN versus similar models with respect to concepts and biomedical applications. Due to their many advantages, we consider PBN to stand as a suitable modelling framework for the description and analysis of complex biological systems, ranging from molecular to physiological levels.
Collapse
Affiliation(s)
| | - Andrzej Mizera
- Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg
| | - Jun Pang
- Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg
| | - Alexandru Adrian Tantar
- Computer Science and Communications Research Unit, University of Luxembourg, Luxembourg
- Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, Luxembourg
| | - Jochen Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Luxembourg
- Saarland University Medical Center, Department of Internal Medicine II, Homburg, Saarland, Germany
| | - Thomas Sauter
- Life Sciences Research Unit, University of Luxembourg, Luxembourg
| |
Collapse
|
30
|
Berestovsky N, Nakhleh L. An Evaluation of Methods for Inferring Boolean Networks from Time-Series Data. PLoS One 2013; 8:e66031. [PMID: 23805196 PMCID: PMC3689729 DOI: 10.1371/journal.pone.0066031] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 05/06/2013] [Indexed: 01/12/2023] Open
Abstract
Regulatory networks play a central role in cellular behavior and decision making. Learning these regulatory networks is a major task in biology, and devising computational methods and mathematical models for this task is a major endeavor in bioinformatics. Boolean networks have been used extensively for modeling regulatory networks. In this model, the state of each gene can be either ‘on’ or ‘off’ and that next-state of a gene is updated, synchronously or asynchronously, according to a Boolean rule that is applied to the current-state of the entire system. Inferring a Boolean network from a set of experimental data entails two main steps: first, the experimental time-series data are discretized into Boolean trajectories, and then, a Boolean network is learned from these Boolean trajectories. In this paper, we consider three methods for data discretization, including a new one we propose, and three methods for learning Boolean networks, and study the performance of all possible nine combinations on four regulatory systems of varying dynamics complexities. We find that employing the right combination of methods for data discretization and network learning results in Boolean networks that capture the dynamics well and provide predictive power. Our findings are in contrast to a recent survey that placed Boolean networks on the low end of the “faithfulness to biological reality” and “ability to model dynamics” spectra. Further, contrary to the common argument in favor of Boolean networks, we find that a relatively large number of time points in the time-series data is required to learn good Boolean networks for certain data sets. Last but not least, while methods have been proposed for inferring Boolean networks, as discussed above, missing still are publicly available implementations thereof. Here, we make our implementation of the methods available publicly in open source at http://bioinfo.cs.rice.edu/.
Collapse
Affiliation(s)
- Natalie Berestovsky
- Department of Computer Science, Rice University, Houston, Texas, United States of America
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
31
|
Vijesh N, Chakrabarti SK, Sreekumar J. Modeling of gene regulatory networks: A review. ACTA ACUST UNITED AC 2013. [DOI: 10.4236/jbise.2013.62a027] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
32
|
Abstract
It has been suggested that irreducible sets of states in Probabilistic Boolean Networks correspond to cellular phenotype. In this study, we identify such sets of states for each phase of the budding yeast cell cycle. We find that these “ergodic sets” underly the cyclin activity levels during each phase of the cell cycle. Our results compare to the observations made in several laboratory experiments as well as the results of differential equation models. Dynamical studies of this model: (i) indicate that under stochastic external signals the continuous oscillating waves of cyclin activity and the opposing waves of CKIs emerge from the logic of a Boolean-based regulatory network without the need for specific biochemical/kinetic parameters; (ii) suggest that the yeast cell cycle network is robust to the varying behavior of cell size (e.g., cell division under nitrogen deprived conditions); (iii) suggest the irreversibility of the Start signal is a function of logic of the G1 regulon, and changing the structure of the regulatory network can render start reversible.
Collapse
|
33
|
Chueh TH, Lu HHS. Inference of biological pathway from gene expression profiles by time delay boolean networks. PLoS One 2012; 7:e42095. [PMID: 22952589 PMCID: PMC3432056 DOI: 10.1371/journal.pone.0042095] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 07/02/2012] [Indexed: 11/18/2022] Open
Abstract
One great challenge of genomic research is to efficiently and accurately identify complex gene regulatory networks. The development of high-throughput technologies provides numerous experimental data such as DNA sequences, protein sequence, and RNA expression profiles makes it possible to study interactions and regulations among genes or other substance in an organism. However, it is crucial to make inference of genetic regulatory networks from gene expression profiles and protein interaction data for systems biology. This study will develop a new approach to reconstruct time delay boolean networks as a tool for exploring biological pathways. In the inference strategy, we will compare all pairs of input genes in those basic relationships by their corresponding p-scores for every output gene. Then, we will combine those consistent relationships to reveal the most probable relationship and reconstruct the genetic network. Specifically, we will prove that O(log n) state transition pairs are sufficient and necessary to reconstruct the time delay boolean network of n nodes with high accuracy if the number of input genes to each gene is bounded. We also have implemented this method on simulated and empirical yeast gene expression data sets. The test results show that this proposed method is extensible for realistic networks.
Collapse
Affiliation(s)
- Tung-Hung Chueh
- Green Energy and Environment Research Laboratories, Industrial Technology Research Institute, Chutung, Hsinchu, Taiwan, Republic of China
| | - Henry Horng-Shing Lu
- Institute of Statistics, National Chiao Tung University, Hsinchu, Taiwan, Republic of China
| |
Collapse
|
34
|
Liang J, Han J. Stochastic Boolean networks: an efficient approach to modeling gene regulatory networks. BMC SYSTEMS BIOLOGY 2012; 6:113. [PMID: 22929591 PMCID: PMC3532238 DOI: 10.1186/1752-0509-6-113] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 08/06/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND Various computational models have been of interest due to their use in the modelling of gene regulatory networks (GRNs). As a logical model, probabilistic Boolean networks (PBNs) consider molecular and genetic noise, so the study of PBNs provides significant insights into the understanding of the dynamics of GRNs. This will ultimately lead to advances in developing therapeutic methods that intervene in the process of disease development and progression. The applications of PBNs, however, are hindered by the complexities involved in the computation of the state transition matrix and the steady-state distribution of a PBN. For a PBN with n genes and N Boolean networks, the complexity to compute the state transition matrix is O(nN22n) or O(nN2n) for a sparse matrix. RESULTS This paper presents a novel implementation of PBNs based on the notions of stochastic logic and stochastic computation. This stochastic implementation of a PBN is referred to as a stochastic Boolean network (SBN). An SBN provides an accurate and efficient simulation of a PBN without and with random gene perturbation. The state transition matrix is computed in an SBN with a complexity of O(nL2n), where L is a factor related to the stochastic sequence length. Since the minimum sequence length required for obtaining an evaluation accuracy approximately increases in a polynomial order with the number of genes, n, and the number of Boolean networks, N, usually increases exponentially with n, L is typically smaller than N, especially in a network with a large number of genes. Hence, the computational efficiency of an SBN is primarily limited by the number of genes, but not directly by the total possible number of Boolean networks. Furthermore, a time-frame expanded SBN enables an efficient analysis of the steady-state distribution of a PBN. These findings are supported by the simulation results of a simplified p53 network, several randomly generated networks and a network inferred from a T cell immune response dataset. An SBN can also implement the function of an asynchronous PBN and is potentially useful in a hybrid approach in combination with a continuous or single-molecule level stochastic model. CONCLUSIONS Stochastic Boolean networks (SBNs) are proposed as an efficient approach to modelling gene regulatory networks (GRNs). The SBN approach is able to recover biologically-proven regulatory behaviours, such as the oscillatory dynamics of the p53-Mdm2 network and the dynamic attractors in a T cell immune response network. The proposed approach can further predict the network dynamics when the genes are under perturbation, thus providing biologically meaningful insights for a better understanding of the dynamics of GRNs. The algorithms and methods described in this paper have been implemented in Matlab packages, which are attached as Additional files.
Collapse
Affiliation(s)
- Jinghang Liang
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB T6G 2V4, Canada.
| | | |
Collapse
|
35
|
MacNamara A, Terfve C, Henriques D, Bernabé BP, Saez-Rodriguez J. State-time spectrum of signal transduction logic models. Phys Biol 2012; 9:045003. [PMID: 22871648 DOI: 10.1088/1478-3975/9/4/045003] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Despite the current wealth of high-throughput data, our understanding of signal transduction is still incomplete. Mathematical modeling can be a tool to gain an insight into such processes. Detailed biochemical modeling provides deep understanding, but does not scale well above relatively a few proteins. In contrast, logic modeling can be used where the biochemical knowledge of the system is sparse and, because it is parameter free (or, at most, uses relatively a few parameters), it scales well to large networks that can be derived by manual curation or retrieved from public databases. Here, we present an overview of logic modeling formalisms in the context of training logic models to data, and specifically the different approaches to modeling qualitative to quantitative data (state) and dynamics (time) of signal transduction. We use a toy model of signal transduction to illustrate how different logic formalisms (Boolean, fuzzy logic and differential equations) treat state and time. Different formalisms allow for different features of the data to be captured, at the cost of extra requirements in terms of computational power and data quality and quantity. Through this demonstration, the assumptions behind each formalism are discussed, as well as their advantages and disadvantages and possible future developments.
Collapse
Affiliation(s)
- Aidan MacNamara
- European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | | | | | | | | |
Collapse
|
36
|
Affiliation(s)
- Nancy Lan Guo
- Mary Babb Randolph Cancer Center/Department of Community Medicine, School of Medicine, West Virginia University, Morgantown, WV 26506-9300
| |
Collapse
|
37
|
Acharya L, Judeh T, Duan Z, Rabbat M, Zhu D. GSGS: a computational approach to reconstruct signaling pathway structures from gene sets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 9:438-450. [PMID: 22025758 DOI: 10.1109/tcbb.2011.143] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Reconstruction of signaling pathway structures is essential to decipher complex regulatory relationships in living cells. Existing approaches often rely on unrealistic biological assumptions and do not explicitly consider signal transduction mechanisms. Signal transduction events refer to linear cascades of reactions from cell surface to nucleus and characterize a signaling pathway. We propose a novel approach, Gene Set Gibbs Sampling, to reverse engineer signaling pathway structures from gene sets related to pathways. We hypothesize that signaling pathways are structurally an ensemble of overlapping linear signal transduction events which we encode as Information Flows (IFs). We infer signaling pathway structures from gene sets, referred to as Information Flow Gene Sets (IFGSs), corresponding to these events. Thus, an IFGS only reflects which genes appear in the underlying IF but not their ordering. GSGS offers a Gibbs sampling procedure to reconstruct the underlying signaling pathway structure by sequentially inferring IFs from the overlapping IFGSs related to the pathway. In the proof-of-concept studies, our approach is shown to outperform existing network inference approaches using data generated from benchmark networks in DREAM. We perform a sensitivity analysis to assess the robustness of our approach. Finally, we implement GSGS to reconstruct signaling mechanisms in breast cancer cells.
Collapse
|
38
|
Graudenzi A, Serra R, Villani M, Damiani C, Colacci A, Kauffman SA. Dynamical Properties of a Boolean Model of Gene Regulatory Network with Memory. J Comput Biol 2011; 18:1291-303. [DOI: 10.1089/cmb.2010.0069] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Affiliation(s)
- Alex Graudenzi
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Roberto Serra
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Marco Villani
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Chiara Damiani
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Annamaria Colacci
- Excellence Environmental Carcinogenesis, Environmental Protection and Health Prevention Agency, Emilia-Romagna, Bologna, Italy
| | - Stuart A. Kauffman
- UVM's Complex Systems Center, University of Vermont, Burlington, Vermont
| |
Collapse
|
39
|
Integrating quantitative knowledge into a qualitative gene regulatory network. PLoS Comput Biol 2011; 7:e1002157. [PMID: 21935350 PMCID: PMC3174175 DOI: 10.1371/journal.pcbi.1002157] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Accepted: 06/28/2011] [Indexed: 11/19/2022] Open
Abstract
Despite recent improvements in molecular techniques, biological knowledge remains incomplete. Any theorizing about living systems is therefore necessarily based on the use of heterogeneous and partial information. Much current research has focused successfully on the qualitative behaviors of macromolecular networks. Nonetheless, it is not capable of taking into account available quantitative information such as time-series protein concentration variations. The present work proposes a probabilistic modeling framework that integrates both kinds of information. Average case analysis methods are used in combination with Markov chains to link qualitative information about transcriptional regulations to quantitative information about protein concentrations. The approach is illustrated by modeling the carbon starvation response in Escherichia coli. It accurately predicts the quantitative time-series evolution of several protein concentrations using only knowledge of discrete gene interactions and a small number of quantitative observations on a single protein concentration. From this, the modeling technique also derives a ranking of interactions with respect to their importance during the experiment considered. Such a classification is confirmed by the literature. Therefore, our method is principally novel in that it allows (i) a hybrid model that integrates both qualitative discrete model and quantities to be built, even using a small amount of quantitative information, (ii) new quantitative predictions to be derived, (iii) the robustness and relevance of interactions with respect to phenotypic criteria to be precisely quantified, and (iv) the key features of the model to be extracted that can be used as a guidance to design future experiments. Understanding the response of a biological system to a stress is of great interest in biology. This issue is usually tackled by integrating information arising from different experiments into mathematical models. In particular, continuous models take quantitative information into account after a parameter estimation step whereas much recent research has focused on the qualitative behaviors of macromolecular networks. However, both modeling approaches fail to handle the true nature of biological information, including heterogeneity, incompleteness and multi-scale features, as emphasized by recent advances in molecular techniques. The principle novelty of our method lies in the use of probabilities and average-case analysis to overcome this weakness and to fill the gap between qualitative and quantitative models. Our framework is applied to study the response of Escherichia coli to a carbon starvation stress. We combine a small amount of quantitative information on protein concentrations with a qualitative model of transcriptional regulations. We derive quantitative predictions about proteins, quantify the robustness and relevance of transcriptional interactions, and automatically extract the key features of the model. The main biological novelty is therefore the presentation of new knowledge derived from the combination of quantitative and qualitative multi-scale information in a single approach.
Collapse
|
40
|
Xiao Y. A tutorial on analysis and simulation of boolean gene regulatory network models. Curr Genomics 2011; 10:511-25. [PMID: 20436877 PMCID: PMC2808677 DOI: 10.2174/138920209789208237] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2008] [Revised: 05/11/2009] [Accepted: 05/11/2009] [Indexed: 11/22/2022] Open
Abstract
Driven by the desire to understand genomic functions through the interactions among genes and gene products, the research in gene regulatory networks has become a heated area in genomic signal processing. Among the most studied mathematical models are Boolean networks and probabilistic Boolean networks, which are rule-based dynamic systems. This tutorial provides an introduction to the essential concepts of these two Boolean models, and presents the up-to-date analysis and simulation methods developed for them. In the Analysis section, we will show that Boolean models are Markov chains, based on which we present a Markovian steady-state analysis on attractors, and also reveal the relationship between probabilistic Boolean networks and dynamic Bayesian networks (another popular genetic network model), again via Markov analysis; we dedicate the last subsection to structural analysis, which opens a door to other topics such as network control. The Simulation section will start from the basic tasks of creating state transition diagrams and finding attractors, proceed to the simulation of network dynamics and obtaining the steady-state distributions, and finally come to an algorithm of generating artificial Boolean networks with prescribed attractors. The contents are arranged in a roughly logical order, such that the Markov chain analysis lays the basis for the most part of Analysis section, and also prepares the readers to the topics in Simulation section.
Collapse
Affiliation(s)
- Yufei Xiao
- Dept. of Epidemiology & Biostatistics, University of Texas Health Science Center at San Antonio, San Antonio, TX 78229, USA
| |
Collapse
|
41
|
Dimitrova ES, Mitra I, Jarrah AS. Probabilistic polynomial dynamical systems for reverse engineering of gene regulatory networks. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2011; 2011:1. [PMID: 21910920 PMCID: PMC3171177 DOI: 10.1186/1687-4153-2011-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2010] [Accepted: 06/06/2011] [Indexed: 02/08/2023]
Abstract
Elucidating the structure and/or dynamics of gene regulatory networks from experimental data is a major goal of systems biology. Stochastic models have the potential to absorb noise, account for un-certainty, and help avoid data overfitting. Within the frame work of probabilistic polynomial dynamical systems, we present an algorithm for the reverse engineering of any gene regulatory network as a discrete, probabilistic polynomial dynamical system. The resulting stochastic model is assembled from all minimal models in the model space and the probability assignment is based on partitioning the model space according to the likeliness with which a minimal model explains the observed data. We used this method to identify stochastic models for two published synthetic network models. In both cases, the generated model retains the key features of the original model and compares favorably to the resulting models from other algorithms.
Collapse
Affiliation(s)
- Elena S Dimitrova
- Department of Mathematical Sciences, Clemson University, Clemson, SC 29634-0975, USA
| | - Indranil Mitra
- Sealy Center of Molecular Medicine, University of Texas Medical Branch, Galveston, TX 77550, USA
| | - Abdul Salam Jarrah
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA 24061-0477, USA
- Department of Mathematics and Statistics, American University of Sharjah, Sharjah, UAE
| |
Collapse
|
42
|
Kim H, Gelenbe E. G-Networks Based Two Layer Stochastic Modeling of Gene Regulatory Networks with Post-Translational Processes. ACTA ACUST UNITED AC 2011. [DOI: 10.4051/ibc.2011.3.2.0008] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
43
|
Bouaynaya N, Shterenberg R, Schonfeld D. Inverse perturbation for optimal intervention in gene regulatory networks. ACTA ACUST UNITED AC 2010; 27:103-10. [PMID: 21062762 DOI: 10.1093/bioinformatics/btq605] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Analysis and intervention in the dynamics of gene regulatory networks is at the heart of emerging efforts in the development of modern treatment of numerous ailments including cancer. The ultimate goal is to develop methods to intervene in the function of living organisms in order to drive cells away from a malignant state into a benign form. A serious limitation of much of the previous work in cancer network analysis is the use of external control, which requires intervention at each time step, for an indefinite time interval. This is in sharp contrast to the proposed approach, which relies on the solution of an inverse perturbation problem to introduce a one-time intervention in the structure of regulatory networks. This isolated intervention transforms the steady-state distribution of the dynamic system to the desired steady-state distribution. RESULTS We formulate the optimal intervention problem in gene regulatory networks as a minimal perturbation of the network in order to force it to converge to a desired steady-state distribution of gene regulation. We cast optimal intervention in gene regulation as a convex optimization problem, thus providing a globally optimal solution which can be efficiently computed using standard toolboxes for convex optimization. The criteria adopted for optimality is chosen to minimize potential adverse effects as a consequence of the intervention strategy. We consider a perturbation that minimizes (i) the overall energy of change between the original and controlled networks and (ii) the time needed to reach the desired steady-state distribution of gene regulation. Furthermore, we show that there is an inherent trade-off between minimizing the energy of the perturbation and the convergence rate to the desired distribution. We apply the proposed control to the human melanoma gene regulatory network. AVAILABILITY The MATLAB code for optimal intervention in gene regulatory networks can be found online: http://syen.ualr.edu/nxbouaynaya/Bioinformatics2010.html.
Collapse
Affiliation(s)
- Nidhal Bouaynaya
- Department of Systems Engineering, University of Arkansas at Little Rock, Little Rock, AR 72204, USA.
| | | | | |
Collapse
|
44
|
Tang B, Wu X, Tan G, Chen SS, Jing Q, Shen B. Computational inference and analysis of genetic regulatory networks via a supervised combinatorial-optimization pattern. BMC SYSTEMS BIOLOGY 2010; 4 Suppl 2:S3. [PMID: 20840730 PMCID: PMC2982690 DOI: 10.1186/1752-0509-4-s2-s3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Background Post-genome era brings about diverse categories of omics data. Inference and analysis of genetic regulatory networks act prominently in extracting inherent mechanisms, discovering and interpreting the related biological nature and living principles beneath mazy phenomena, and eventually promoting the well-beings of humankind. Results A supervised combinatorial-optimization pattern based on information and signal-processing theories is introduced into the inference and analysis of genetic regulatory networks. An associativity measure is proposed to define the regulatory strength/connectivity, and a phase-shift metric determines regulatory directions among components of the reconstructed networks. Thus, it solves the undirected regulatory problems arising from most of current linear/nonlinear relevance methods. In case of computational and topological redundancy, we constrain the classified group size of pair candidates within a multiobjective combinatorial optimization (MOCO) pattern. Conclusions We testify the proposed approach on two real-world microarray datasets of different statistical characteristics. Thus, we reveal the inherent design mechanisms for genetic networks by quantitative means, facilitating further theoretic analysis and experimental design with diverse research purposes. Qualitative comparisons with other methods and certain related focuses needing further work are illustrated within the discussion section.
Collapse
Affiliation(s)
- Binhua Tang
- Department of Bioinformatics, Tongji University, Shanghai, China.
| | | | | | | | | | | |
Collapse
|
45
|
Hickman GJ, Hodgman TC. Inference of gene regulatory networks using boolean-network inference methods. J Bioinform Comput Biol 2010; 7:1013-29. [PMID: 20014476 DOI: 10.1142/s0219720009004448] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2009] [Revised: 08/14/2009] [Accepted: 08/15/2009] [Indexed: 02/03/2023]
Abstract
The modeling of genetic networks especially from microarray and related data has become an important aspect of the biosciences. This review takes a fresh look at a specific family of models used for constructing genetic networks, the so-called Boolean networks. The review outlines the various different types of Boolean network developed to date, from the original Random Boolean Network to the current Probabilistic Boolean Network. In addition, some of the different inference methods available to infer these genetic networks are also examined. Where possible, particular attention is paid to input requirements as well as the efficiency, advantages and drawbacks of each method. Though the Boolean network model is one of many models available for network inference today, it is well established and remains a topic of considerable interest in the field of genetic network inference. Hybrids of Boolean networks with other approaches may well be the way forward in inferring the most informative networks.
Collapse
|
46
|
Abstract
State diagrams (stategraphs) are suitable for describing the behavior of dynamic systems. However, when they are used to model large and complex systems, determining the states and transitions among them can be overwhelming, due to their flat, unstratified structure. In this article, we present the use of statecharts as a novel way of modeling complex gene networks. Statecharts extend conventional state diagrams with features such as nested hierarchy, recursion, and concurrency. These features are commonly utilized in engineering for designing complex systems and can enable us to model complex gene networks in an efficient and systematic way. We modeled five key gene network motifs, simple regulation, autoregulation, feed-forward loop, single-input module, and dense overlapping regulon, using statecharts. Specifically, utilizing nested hierarchy and recursion, we were able to model a complex interlocked feed-forward loop network in a highly structured way, demonstrating the potential of our approach for modeling large and complex gene networks.
Collapse
|
47
|
Kim H, Gelenbe E. Anomaly detection in gene expression via stochastic models of gene regulatory networks. BMC Genomics 2009; 10 Suppl 3:S26. [PMID: 19958490 PMCID: PMC2788379 DOI: 10.1186/1471-2164-10-s3-s26] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Background The steady-state behaviour of gene regulatory networks (GRNs) can provide crucial evidence for detecting disease-causing genes. However, monitoring the dynamics of GRNs is particularly difficult because biological data only reflects a snapshot of the dynamical behaviour of the living organism. Also most GRN data and methods are used to provide limited structural inferences. Results In this study, the theory of stochastic GRNs, derived from G-Networks, is applied to GRNs in order to monitor their steady-state behaviours. This approach is applied to a simulation dataset which is generated by using the stochastic gene expression model, and observe that the G-Network properly detects the abnormally expressed genes in the simulation study. In the analysis of real data concerning the cell cycle microarray of budding yeast, our approach finds that the steady-state probability of CLB2 is lower than that of other agents, while most of the genes have similar steady-state probabilities. These results lead to the conclusion that the key regulatory genes of the cell cycle can be expressed in the absence of CLB type cyclines, which was also the conclusion of the original microarray experiment study. Conclusion G-networks provide an efficient way to monitor steady-state of GRNs. Our method produces more reliable results then the conventional t-test in detecting differentially expressed genes. Also G-networks are successfully applied to the yeast GRNs. This study will be the base of further GRN dynamics studies cooperated with conventional GRN inference algorithms.
Collapse
Affiliation(s)
- Haseong Kim
- Intelligent Systems Networks Group, Electrical and Electronic Engineering Department, Imperial College London, UK.
| | | |
Collapse
|
48
|
Kim H, Lee JK, Park T. Inference of large-scale gene regulatory networks using regression-based network approach. J Bioinform Comput Biol 2009; 7:717-35. [PMID: 19634200 DOI: 10.1142/s0219720009004278] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2008] [Revised: 03/16/2009] [Accepted: 03/17/2009] [Indexed: 11/18/2022]
Abstract
The gene regulatory network modeling plays a key role in search for relationships among genes. Many modeling approaches have been introduced to find the causal relationship between genes using time series microarray data. However, they have been suffering from high dimensionality, overfitting, and heavy computation time. Further, the selection of a best model among several possible competing models is not guaranteed that it is the best one. In this study, we propose a simple procedure for constructing large scale gene regulatory networks using a regression-based network approach. We determine the optimal out-degree of network structure by using the sum of squared coefficients which are obtained from all appropriate regression models. Through the simulated data, accuracy of estimation and robustness against noise are computed in order to compare with the vector autoregressive regression model. Our method shows high accuracy and robustness for inferring large-scale gene networks. Also it is applied to Caulobacter crescentus cell cycle data consisting of 1472 genes. It shows that many genes are regulated by two transcription factors, ctrA and gcrA, that are known for global regulators.
Collapse
Affiliation(s)
- Haseong Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, San 56-1, Shilim-dong, Korea.
| | | | | |
Collapse
|
49
|
Mishra B. Intelligently deciphering unintelligible designs: algorithmic algebraic model checking in systems biology. J R Soc Interface 2009; 6:575-97. [PMID: 19364723 PMCID: PMC2696146 DOI: 10.1098/rsif.2008.0546] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Systems biology, as a subject, has captured the imagination of both biologists and systems scientists alike. But what is it? This review provides one researcher's somewhat idiosyncratic view of the subject, but also aims to persuade young scientists to examine the possible evolution of this subject in a rich historical context. In particular, one may wish to read this review to envision a subject built out of a consilience of many interesting concepts from systems sciences, logic and model theory, and algebra, culminating in novel tools, techniques and theories that can reveal deep principles in biology--seen beyond mere observations. A particular focus in this review is on approaches embedded in an embryonic program, dubbed 'algorithmic algebraic model checking', and its powers and limitations.
Collapse
Affiliation(s)
- Bud Mishra
- Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, NY 10012, USA.
| |
Collapse
|
50
|
Abstract
Traditionally molecular biology research has tended to reduce biological pathways to composite units studied as isolated parts of the cellular system. With the advent of high throughput methodologies that can capture thousands of data points, and powerful computational approaches, the reality of studying cellular processes at a systems level is upon us. As these approaches yield massive datasets, systems level analyses have drawn upon other fields such as engineering and mathematics, adapting computational and statistical approaches to decipher relationships between molecules. Guided by high quality datasets and analyses, one can begin the process of predictive modeling. The findings from such approaches are often surprising and beyond normal intuition. We discuss four classes of dynamical systems used to model genetic regulatory networks. The discussion is divided into continuous and discrete models, as well as deterministic and stochastic model classes. For each combination of these categories, a model is presented and discussed in the context of the yeast cell cycle, illustrating how different types of questions can be addressed by different model classes.
Collapse
|