1
|
Wu Z, Sinha S. SPREd: a simulation-supervised neural network tool for gene regulatory network reconstruction. BIOINFORMATICS ADVANCES 2024; 4:vbae011. [PMID: 38444538 PMCID: PMC10913396 DOI: 10.1093/bioadv/vbae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 11/08/2023] [Accepted: 01/18/2024] [Indexed: 03/07/2024]
Abstract
Summary Reconstruction of gene regulatory networks (GRNs) from expression data is a significant open problem. Common approaches train a machine learning (ML) model to predict a gene's expression using transcription factors' (TFs') expression as features and designate important features/TFs as regulators of the gene. Here, we present an entirely different paradigm, where GRN edges are directly predicted by the ML model. The new approach, named "SPREd," is a simulation-supervised neural network for GRN inference. Its inputs comprise expression relationships (e.g. correlation, mutual information) between the target gene and each TF and between pairs of TFs. The output includes binary labels indicating whether each TF regulates the target gene. We train the neural network model using synthetic expression data generated by a biophysics-inspired simulation model that incorporates linear as well as non-linear TF-gene relationships and diverse GRN configurations. We show SPREd to outperform state-of-the-art GRN reconstruction tools GENIE3, ENNET, PORTIA, and TIGRESS on synthetic datasets with high co-expression among TFs, similar to that seen in real data. A key advantage of the new approach is its robustness to relatively small numbers of conditions (columns) in the expression matrix, which is a common problem faced by existing methods. Finally, we evaluate SPREd on real data sets in yeast that represent gold-standard benchmarks of GRN reconstruction and show it to perform significantly better than or comparably to existing methods. In addition to its high accuracy and speed, SPREd marks a first step toward incorporating biophysics principles of gene regulation into ML-based approaches to GRN reconstruction. Availability and implementation Data and code are available from https://github.com/iiiime/SPREd.
Collapse
Affiliation(s)
- Zijun Wu
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, United States
| | - Saurabh Sinha
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA 30332, United States
- H. Milton Steward School of Industrial & Systems Engineering, Georgia Institute of Technology, Atlanta, GA 30332, United States
| |
Collapse
|
2
|
Wu Z, Sinha S. SPREd: A simulation-supervised neural network tool for gene regulatory network reconstruction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.566399. [PMID: 38014297 PMCID: PMC10680606 DOI: 10.1101/2023.11.09.566399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Reconstruction of gene regulatory networks (GRNs) from expression data is a significant open problem. Common approaches train a machine learning (ML) model to predict a gene's expression using transcription factors' (TFs') expression as features and designate important features/TFs as regulators of the gene. Here, we present an entirely different paradigm, where GRN edges are directly predicted by the ML model. The new approach, named "SPREd" is a simulation-supervised neural network for GRN inference. Its inputs comprise expression relationships (e.g., correlation, mutual information) between the target gene and each TF and between pairs of TFs. The output includes binary labels indicating whether each TF regulates the target gene. We train the neural network model using synthetic expression data generated by a biophysics-inspired simulation model that incorporates linear as well as non-linear TF-gene relationships and diverse GRN configurations. We show SPREd to outperform state-of-the-art GRN reconstruction tools GENIE3, ENNET, PORTIA and TIGRESS on synthetic datasets with high co-expression among TFs, similar to that seen in real data. A key advantage of the new approach is its robustness to relatively small numbers of conditions (columns) in the expression matrix, which is a common problem faced by existing methods. Finally, we evaluate SPREd on real data sets in yeast that represent gold standard benchmarks of GRN reconstruction and show it to perform significantly better than or comparably to existing methods. In addition to its high accuracy and speed, SPREd marks a first step towards incorporating biophysics principles of gene regulation into ML-based approaches to GRN reconstruction.
Collapse
Affiliation(s)
- Zijun Wu
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Saurabh Sinha
- Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
- H. Milton Steward School of Industrial & Systems Engineering, Georgia Institute of Technology, Atlanta, GA, 30318, USA
| |
Collapse
|
3
|
Hourican C, Peeters G, Melis R, Gill TM, Rikkert MO, Quax R. Understanding multimorbidity requires sign-disease networks and higher-order interactions, a perspective. FRONTIERS IN SYSTEMS BIOLOGY 2023; 3:1155599. [PMID: 37810371 PMCID: PMC10557993 DOI: 10.3389/fsysb.2023.1155599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Abstract
Background Count scores, disease clustering, and pairwise associations between diseases remain ubiquitous in multimorbidity research despite two major shortcomings: they yield no insight into plausible mechanisms underlying multimorbidity, and they ignore higher-order interactions such as effect modification. Objectives We argue that two components are currently missing but vital to develop novel multimorbidity metrics. Firstly, networks should be constructed which consists simultaneously of signs, symptoms, and diseases, since only then could they yield insight into plausible shared biological mechanisms underlying diseases.Secondly, learning pairwise associations is insufficient to fully characterize the correlations in a system. That is, synergistic (e.g., cooperative or antagonistic) effects are widespread in complex systems, where two or more elements combined give a larger or smaller effect than the sum of their individual effects. It can even occur that pairs of symptoms have no pairwise associations whatsoever, but in combination have a significant association. Therefore, higher-order interactions should be included in networks used to study multimorbidity, resulting in so-called hypergraphs. Methods We illustrate our argument using a synthetic Bayesian Network model of symptoms, signs and diseases, composed of pairwise and higher-order interactions. We simulate network interventions on both individual and population levels and compare the ground-truth outcomes with the predictions from pairwise associations. Conclusion We find that, when judged purely from the pairwise associations, interventions can have unexpected 'side-effects' or the most opportune intervention could be missed. The hypergraph uncovers links missed in pairwise networks, giving a more complete overview of sign and disease associations.
Collapse
Affiliation(s)
- Cillian Hourican
- Computational Science Lab, Institute of Informatics, University of Amsterdam, Amsterdam, The Netherlands
| | - Geeske Peeters
- Department of Geriatric Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboudumc Alzheimer Centre, Radboud university medical centre, Nijmegen, The Netherlands
| | - René Melis
- Department of Geriatric Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Thomas M. Gill
- Department of Internal Medicine, Yale University School of Medicine, New Haven, CT
| | - Marcel Olde Rikkert
- Department of Geriatric Medicine, Radboud University Medical Center, Nijmegen, The Netherlands
- Radboudumc Alzheimer Centre, Radboud university medical centre, Nijmegen, The Netherlands
| | - Rick Quax
- Computational Science Lab, Institute of Informatics, University of Amsterdam, Amsterdam, The Netherlands
- Institute for Advanced Study, 1012 GC Amsterdam, The Netherlands
| |
Collapse
|
4
|
Kim M, Kim E. Mathematical model of the cell signaling pathway based on the extended Boolean network model with a stochastic process. BMC Bioinformatics 2022; 23:515. [PMID: 36451112 PMCID: PMC9710037 DOI: 10.1186/s12859-022-05077-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 11/23/2022] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND In cell signaling pathways, proteins interact with each other to determine cell fate in response to either cell-extrinsic (micro-environmental) or intrinsic cues. One of the well-studied pathways, the mitogen-activated protein kinase (MAPK) signaling pathway, regulates cell processes such as differentiation, proliferation, apoptosis, and survival in response to various micro-environmental stimuli in eukaryotes. Upon micro-environmental stimulus, receptors on the cell membrane become activated. Activated receptors initiate a cascade of protein activation in the MAPK pathway. This activation involves protein binding, creating scaffold proteins, which are known to facilitate effective MAPK signaling transduction. RESULTS This paper presents a novel mathematical model of a cell signaling pathway coordinated by protein scaffolding. The model is based on the extended Boolean network approach with stochastic processes. Protein production or decay in a cell was modeled considering the stochastic process, whereas the protein-protein interactions were modeled based on the extended Boolean network approach. Our model fills a gap in the binary set applied to previous models. The model simultaneously considers the stochastic process directly. Using the model, we simulated a simplified mitogen-activated protein kinase (MAPK) signaling pathway upon stimulation of both a single receptor at the initial time and multiple receptors at several time points. Our simulations showed that the signal is amplified as it travels down to the pathway from the receptor, generating substantially amplified downstream ERK activity. The noise generated by the stochastic process of protein self-activity in the model was also amplified as the signaling propagated through the pathway. CONCLUSIONS The signaling transduction in a simplified MAPK signaling pathway could be explained by a mathematical model based on the extended Boolean network model with a stochastic process. The model simulations demonstrated signaling amplifications when it travels downstream, which was already observed in experimental settings. We also highlight the importance of stochastic activity in regulating protein inactivation.
Collapse
Affiliation(s)
- Minsoo Kim
- grid.35541.360000000121053345Natural Product Informatics Research Center, Korea Institute of Science and Technology, Gangneung, South Korea
| | - Eunjung Kim
- grid.35541.360000000121053345Natural Product Informatics Research Center, Korea Institute of Science and Technology, Gangneung, South Korea
| |
Collapse
|
5
|
Pušnik Ž, Mraz M, Zimic N, Moškon M. Review and assessment of Boolean approaches for inference of gene regulatory networks. Heliyon 2022; 8:e10222. [PMID: 36033302 PMCID: PMC9403406 DOI: 10.1016/j.heliyon.2022.e10222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 04/22/2022] [Accepted: 08/03/2022] [Indexed: 10/25/2022] Open
Abstract
Boolean descriptions of gene regulatory networks can provide an insight into interactions between genes. Boolean networks hold predictive power, are easy to understand, and can be used to simulate the observed networks in different scenarios. We review fundamental and state-of-the-art methods for inference of Boolean networks. We introduce a methodology for a straightforward evaluation of Boolean inference approaches based on the generation of evaluation datasets, application of selected inference methods, and evaluation of performance measures to guide the selection of the best method for a given inference problem. We demonstrate this procedure on inference methods REVEAL (REVerse Engineering ALgorithm), Best-Fit Extension, MIBNI (Mutual Information-based Boolean Network Inference), GABNI (Genetic Algorithm-based Boolean Network Inference) and ATEN (AND/OR Tree ENsemble algorithm), which infers Boolean descriptions of gene regulatory networks from discretised time series data. Boolean inference approaches tend to perform better in terms of dynamic accuracy, and slightly worse in terms of structural correctness. We believe that the proposed methodology and provided guidelines will help researchers to develop Boolean inference approaches with a good predictive capability while maintaining structural correctness and biological relevance.
Collapse
Affiliation(s)
- Žiga Pušnik
- University of Ljubljana, Faculty of Computer and Information Science, Večna pot 113, Ljubljana, SI-1000, Slovenia
| | - Miha Mraz
- University of Ljubljana, Faculty of Computer and Information Science, Večna pot 113, Ljubljana, SI-1000, Slovenia
| | - Nikolaj Zimic
- University of Ljubljana, Faculty of Computer and Information Science, Večna pot 113, Ljubljana, SI-1000, Slovenia
| | - Miha Moškon
- University of Ljubljana, Faculty of Computer and Information Science, Večna pot 113, Ljubljana, SI-1000, Slovenia
| |
Collapse
|
6
|
Passemiers A, Moreau Y, Raimondi D. Fast and accurate inference of gene regulatory networks through robust precision matrix estimation. Bioinformatics 2022; 38:2802-2809. [PMID: 35561176 PMCID: PMC9113237 DOI: 10.1093/bioinformatics/btac178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 03/14/2022] [Accepted: 03/22/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Transcriptional regulation mechanisms allow cells to adapt and respond to external stimuli by altering gene expression. The possible cell transcriptional states are determined by the underlying gene regulatory network (GRN), and reliably inferring such network would be invaluable to understand biological processes and disease progression. RESULTS In this article, we present a novel method for the inference of GRNs, called PORTIA, which is based on robust precision matrix estimation, and we show that it positively compares with state-of-the-art methods while being orders of magnitude faster. We extensively validated PORTIA using the DREAM and MERLIN+P datasets as benchmarks. In addition, we propose a novel scoring metric that builds on graph-theoretical concepts. AVAILABILITY AND IMPLEMENTATION The code and instructions for data acquisition and full reproduction of our results are available at https://github.com/AntoinePassemiers/PORTIA-Manuscript. PORTIA is available on PyPI as a Python package (portia-grn). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
7
|
Shafiekhani S, Poursheykhani A, Rahbar S, Jafari AH. Simulating ATO Mechanism and EGFR Signaling with Fuzzy Logic and Petri Net. J Biomed Phys Eng 2021; 11:325-336. [PMID: 34189121 PMCID: PMC8236109 DOI: 10.31661/jbpe.v0i0.796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 11/05/2017] [Indexed: 12/04/2022]
Abstract
Background: Interactions of many key proteins or genes in signalling pathway have been studied qualitatively in the literature, but only little quantitative information is available. Objective: Although much has been done to clarify the biochemistry of transcriptional dynamics in signalling pathway, it remains difficult to find out and predict quantitative responses. The aim of this study is to construct a computational model of epidermal growth factor receptor (EGFR) signalling pathway as one of hallmarks of cancer so as to predict quantitative responses. Material and Methods: In this analytical study, we presented a computational model to investigate EGFR signalling pathway. Interaction of Arsenic trioxide (ATO) with EGFR signalling pathway factors has been elicited by systematic search in data bases, as ATO is one of the mysterious chemotherapy agents that control EGFR expression in cancer. ATO has dichotomous manner in vivo, dependent on its concentration. According to fuzzy rules based upon qualitative knowledge and Petri Net, we can construct a quantitative model to describe ATO mechanism in EGFR signalling pathway. Results: By Fuzzy Logic models that have the potential to trade with the loss of quantitative information on how different species interact, along with Petri net quantitatively describe the dynamics of EGFR signalling pathway. By this model the dynamic of different factors in EGFR signalling pathway is achieved. Conclusion: The use of Fuzzy Logic and PNs in biological network modelling causes a deeper understanding and comprehensive analysis of the biological networks.
Collapse
Affiliation(s)
- Sajad Shafiekhani
- PhD Candidate, Department of Biophysics & Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- PhD Candidate, Research Center for Science and Technology in Medicine (RCSTIM), Tehran University of Medical Sciences, Tehran, Iran
| | - Arash Poursheykhani
- PhD Candidate, Department of Medical Genetics, School of Medicine, Mashhad University of Medical Science, Mashhad, Iran
| | - Sara Rahbar
- PhD Candidate, Department of Biophysics & Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- PhD Candidate, Research Center for Science and Technology in Medicine (RCSTIM), Tehran University of Medical Sciences, Tehran, Iran
| | - Amir Homayoun Jafari
- PhD, Department of Biophysics & Biomedical Engineering, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- PhD, Research Center for Science and Technology in Medicine (RCSTIM), Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
8
|
Uniform patterns based area-efficient and accurate stochastic computing finite impulse response filter. PLoS One 2021; 16:e0245943. [PMID: 33503067 PMCID: PMC7840030 DOI: 10.1371/journal.pone.0245943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 01/08/2021] [Indexed: 11/30/2022] Open
Abstract
Stochastic computing has recently gained attention due to its low hardware complexity and better fault tolerance against soft errors. However, stochastic computing based circuits suffer from different errors which affect the output accuracy of these circuits. In this paper, an accurate and area-efficient stochastic computing based digital finite impulse response filter is designed. In the proposed work, constant uniform patterns are used as stochastic numbers for the select lines of different MUXes in the filter and the error performance of filter is analysed. Based on the error performance, the combinations of these patterns are proposed for reducing the output error of stochastic computing based filters. The architectures for generating these uniform patterns are also proposed. Results show that the proposed design methodology has better error performance and comparable hardware complexity as compared to the state-of-the-art implementations.
Collapse
|
9
|
Hu X, Hu Y, Wu F, Leung RWT, Qin J. Integration of single-cell multi-omics for gene regulatory network inference. Comput Struct Biotechnol J 2020; 18:1925-1938. [PMID: 32774787 PMCID: PMC7385034 DOI: 10.1016/j.csbj.2020.06.033] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Revised: 06/17/2020] [Accepted: 06/20/2020] [Indexed: 12/20/2022] Open
Abstract
The advancement of single-cell sequencing technology in recent years has provided an opportunity to reconstruct gene regulatory networks (GRNs) with the data from thousands of single cells in one sample. This uncovers regulatory interactions in cells and speeds up the discoveries of regulatory mechanisms in diseases and biological processes. Therefore, more methods have been proposed to reconstruct GRNs using single-cell sequencing data. In this review, we introduce technologies for sequencing single-cell genome, transcriptome, and epigenome. At the same time, we present an overview of current GRN reconstruction strategies utilizing different single-cell sequencing data. Bioinformatics tools were grouped by their input data type and mathematical principles for reader's convenience, and the fundamental mathematics inherent in each group will be discussed. Furthermore, the adaptabilities and limitations of these different methods will also be summarized and compared, with the hope to facilitate researchers recognizing the most suitable tools for them.
Collapse
Affiliation(s)
- Xinlin Hu
- Shenzhen Key Laboratory of Advanced Machine Learning and Applications, College of Mathematics and Statistics, Shenzhen University, Shenzhen 518060, China
| | - Yaohua Hu
- Shenzhen Key Laboratory of Advanced Machine Learning and Applications, College of Mathematics and Statistics, Shenzhen University, Shenzhen 518060, China
| | - Fanjie Wu
- School of Pharmaceutical Sciences (Shenzhen), Sun Yat-sen University, Shenzhen 518107, China
| | - Ricky Wai Tak Leung
- School of Pharmaceutical Sciences (Shenzhen), Sun Yat-sen University, Shenzhen 518107, China
| | - Jing Qin
- School of Pharmaceutical Sciences (Shenzhen), Sun Yat-sen University, Shenzhen 518107, China
| |
Collapse
|
10
|
Chen X, Li M, Zheng R, Wu FX, Wang J. D3GRN: a data driven dynamic network construction method to infer gene regulatory networks. BMC Genomics 2019; 20:929. [PMID: 31881937 PMCID: PMC6933629 DOI: 10.1186/s12864-019-6298-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND To infer gene regulatory networks (GRNs) from gene-expression data is still a fundamental and challenging problem in systems biology. Several existing algorithms formulate GRNs inference as a regression problem and obtain the network with an ensemble strategy. Recent studies on data driven dynamic network construction provide us a new perspective to solve the regression problem. RESULTS In this study, we propose a data driven dynamic network construction method to infer gene regulatory network (D3GRN), which transforms the regulatory relationship of each target gene into functional decomposition problem and solves each sub problem by using the Algorithm for Revealing Network Interactions (ARNI). To remedy the limitation of ARNI in constructing networks solely from the unit level, a bootstrapping and area based scoring method is taken to infer the final network. On DREAM4 and DREAM5 benchmark datasets, D3GRN performs competitively with the state-of-the-art algorithms in terms of AUPR. CONCLUSIONS We have proposed a novel data driven dynamic network construction method by combining ARNI with bootstrapping and area based scoring strategy. The proposed method performs well on the benchmark datasets, contributing as a competitive method to infer gene regulatory networks in a new perspective.
Collapse
Affiliation(s)
- Xiang Chen
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, China.
| | - Ruiqing Zheng
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Fang-Xiang Wu
- Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N 5A9, Canada
| | - Jianxin Wang
- School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
11
|
Alexiou A, Chatzichronis S, Perveen A, Hafeez A, Ashraf GM. Algorithmic and Stochastic Representations of Gene Regulatory Networks and Protein-Protein Interactions. Curr Top Med Chem 2019; 19:413-425. [PMID: 30854971 DOI: 10.2174/1568026619666190311125256] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2018] [Revised: 10/15/2018] [Accepted: 12/26/2018] [Indexed: 02/06/2023]
Abstract
BACKGROUND Latest studies reveal the importance of Protein-Protein interactions on physiologic functions and biological structures. Several stochastic and algorithmic methods have been published until now, for the modeling of the complex nature of the biological systems. OBJECTIVE Biological Networks computational modeling is still a challenging task. The formulation of the complex cellular interactions is a research field of great interest. In this review paper, several computational methods for the modeling of GRN and PPI are presented analytically. METHODS Several well-known GRN and PPI models are presented and discussed in this review study such as: Graphs representation, Boolean Networks, Generalized Logical Networks, Bayesian Networks, Relevance Networks, Graphical Gaussian models, Weight Matrices, Reverse Engineering Approach, Evolutionary Algorithms, Forward Modeling Approach, Deterministic models, Static models, Hybrid models, Stochastic models, Petri Nets, BioAmbients calculus and Differential Equations. RESULTS GRN and PPI methods have been already applied in various clinical processes with potential positive results, establishing promising diagnostic tools. CONCLUSION In literature many stochastic algorithms are focused in the simulation, analysis and visualization of the various biological networks and their dynamics interactions, which are referred and described in depth in this review paper.
Collapse
Affiliation(s)
| | | | - Asma Perveen
- Glocal School of Life Sciences, Glocal University, Mirzapur Pole, Saharanpur, Uttar Pradesh, India
| | - Abdul Hafeez
- Glocal School of Pharmacy, Glocal University, Mirzapur Pole, Saharanpur, Uttar Pradesh, India
| | - Ghulam Md. Ashraf
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
12
|
Welkenhuysen N, Schnitzer B, Österberg L, Cvijovic M. Robustness of Nutrient Signaling Is Maintained by Interconnectivity Between Signal Transduction Pathways. Front Physiol 2019; 9:1964. [PMID: 30719010 PMCID: PMC6348271 DOI: 10.3389/fphys.2018.01964] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 12/31/2018] [Indexed: 12/16/2022] Open
Abstract
Systems biology approaches provide means to study the interplay between biological processes leading to the mechanistic understanding of the properties of complex biological systems. Here, we developed a vector format rule-based Boolean logic model of the yeast S. cerevisiae cAMP-PKA, Snf1, and the Snf3-Rgt2 pathway to better understand the role of crosstalk on network robustness and function. We identified that phosphatases are the common unknown components of the network and that crosstalk from the cAMP-PKA pathway to other pathways plays a critical role in nutrient sensing events. The model was simulated with known crosstalk combinations and subsequent analysis led to the identification of characteristics and impact of pathway interconnections. Our results revealed that the interconnections between the Snf1 and Snf3-Rgt2 pathway led to increased robustness in these signaling pathways. Overall, our approach contributes to the understanding of the function and importance of crosstalk in nutrient signaling.
Collapse
Affiliation(s)
- Niek Welkenhuysen
- Department of Mathematical Sciences, University of Gothenburg, Gothenburg, Sweden.,Department of Mathematical Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Barbara Schnitzer
- Department of Mathematical Sciences, University of Gothenburg, Gothenburg, Sweden.,Department of Mathematical Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Linnea Österberg
- Department of Mathematical Sciences, University of Gothenburg, Gothenburg, Sweden.,Department of Mathematical Sciences, Chalmers University of Technology, Gothenburg, Sweden.,Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Marija Cvijovic
- Department of Mathematical Sciences, University of Gothenburg, Gothenburg, Sweden.,Department of Mathematical Sciences, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
13
|
Groß A, Kracher B, Kraus JM, Kühlwein SD, Pfister AS, Wiese S, Luckert K, Pötz O, Joos T, Van Daele D, De Raedt L, Kühl M, Kestler HA. Representing dynamic biological networks with multi-scale probabilistic models. Commun Biol 2019; 2:21. [PMID: 30675519 PMCID: PMC6336720 DOI: 10.1038/s42003-018-0268-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Accepted: 12/07/2018] [Indexed: 12/26/2022] Open
Abstract
Dynamic models analyzing gene regulation and metabolism face challenges when adapted to modeling signal transduction networks. During signal transduction, molecular reactions and mechanisms occur in different spatial and temporal frames and involve feedbacks. This impedes the straight-forward use of methods based on Boolean networks, Bayesian approaches, and differential equations. We propose a new approach, ProbRules, that combines probabilities and logical rules to represent the dynamics of a system across multiple scales. We demonstrate that ProbRules models can represent various network motifs of biological systems. As an example of a comprehensive model of signal transduction, we provide a Wnt network that shows remarkable robustness under a range of phenotypical and pathological conditions. Its simulation allows the clarification of controversially discussed molecular mechanisms of Wnt signaling by predicting wet-lab measurements. ProbRules provides an avenue in current computational modeling by enabling systems biologists to integrate vast amounts of available data on different scales.
Collapse
Affiliation(s)
- Alexander Groß
- Institute of Medical Systems Biology, Ulm University, 89081 Ulm, Germany
| | - Barbara Kracher
- Institute of Biochemistry and Molecular Biology, Ulm University, 89081 Ulm, Germany
| | - Johann M. Kraus
- Institute of Medical Systems Biology, Ulm University, 89081 Ulm, Germany
| | - Silke D. Kühlwein
- Institute of Medical Systems Biology, Ulm University, 89081 Ulm, Germany
| | - Astrid S. Pfister
- Institute of Biochemistry and Molecular Biology, Ulm University, 89081 Ulm, Germany
| | - Sebastian Wiese
- Core Unit Mass Spectrometry and Proteomics, Ulm University, 89081 Ulm, Germany
| | - Katrin Luckert
- NMI Natural and Medical Sciences Institute at the University of Tübingen, 72770 Reutlingen, Germany
| | - Oliver Pötz
- NMI Natural and Medical Sciences Institute at the University of Tübingen, 72770 Reutlingen, Germany
| | - Thomas Joos
- NMI Natural and Medical Sciences Institute at the University of Tübingen, 72770 Reutlingen, Germany
| | - Dries Van Daele
- Department of Computer Science, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium
| | - Luc De Raedt
- Department of Computer Science, Katholieke Universiteit Leuven, 3001 Heverlee, Belgium
| | - Michael Kühl
- Institute of Biochemistry and Molecular Biology, Ulm University, 89081 Ulm, Germany
| | - Hans A. Kestler
- Institute of Medical Systems Biology, Ulm University, 89081 Ulm, Germany
| |
Collapse
|
14
|
Dehghannasiri R, Shahrokh Esfahani M, Dougherty ER. An experimental design framework for Markovian gene regulatory networks under stationary control policy. BMC SYSTEMS BIOLOGY 2018; 12:137. [PMID: 30577732 PMCID: PMC6302376 DOI: 10.1186/s12918-018-0649-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
BACKGROUND A fundamental problem for translational genomics is to find optimal therapies based on gene regulatory intervention. Dynamic intervention involves a control policy that optimally reduces a cost function based on phenotype by externally altering the state of the network over time. When a gene regulatory network (GRN) model is fully known, the problem is addressed using classical dynamic programming based on the Markov chain associated with the network. When the network is uncertain, a Bayesian framework can be applied, where policy optimality is with respect to both the dynamical objective and the uncertainty, as characterized by a prior distribution. In the presence of uncertainty, it is of great practical interest to develop an experimental design strategy and thereby select experiments that optimally reduce a measure of uncertainty. RESULTS In this paper, we employ mean objective cost of uncertainty (MOCU), which quantifies uncertainty based on the degree to which uncertainty degrades the operational objective, that being the cost owing to undesirable phenotypes. We assume that a number of conditional probabilities characterizing regulatory relationships among genes are unknown in the Markovian GRN. In sum, there is a prior distribution which can be updated to a posterior distribution by observing a regulatory trajectory, and an optimal control policy, known as an "intrinsically Bayesian robust" (IBR) policy. To obtain a better IBR policy, we select an experiment that minimizes the MOCU remaining after applying its output to the network. At this point, we can either stop and find the resulting IBR policy or proceed to determine more unknown conditional probabilities via regulatory observation and find the IBR policy from the resulting posterior distribution. For sequential experimental design this entire process is iterated. Owing to the computational complexity of experimental design, which requires computation of many potential IBR policies, we implement an approximate method utilizing mean first passage times (MFPTs) - but only in experimental design, the final policy being an IBR policy. CONCLUSIONS Comprehensive performance analysis based on extensive simulations on synthetic and real GRNs demonstrate the efficacy of the proposed method, including the accuracy and computational advantage of the approximate MFPT-based design.
Collapse
Affiliation(s)
| | | | - Edward R. Dougherty
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, 77843 TX USA
- Center for Bioinformatics and Genomic Systems Engineering, Texas A&M University, College Station, 77845 TX USA
| |
Collapse
|
15
|
Zheng R, Li M, Chen X, Wu FX, Pan Y, Wang J. BiXGBoost: a scalable, flexible boosting-based method for reconstructing gene regulatory networks. Bioinformatics 2018; 35:1893-1900. [DOI: 10.1093/bioinformatics/bty908] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2018] [Revised: 10/28/2018] [Accepted: 11/04/2018] [Indexed: 12/11/2022] Open
Affiliation(s)
- Ruiqing Zheng
- School of Information Science and Engineering, Central South University, Changsha, China
| | - Min Li
- School of Information Science and Engineering, Central South University, Changsha, China
| | - Xiang Chen
- School of Information Science and Engineering, Central South University, Changsha, China
| | - Fang-Xiang Wu
- School of Information Science and Engineering, Central South University, Changsha, China
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, Canada
| | - Yi Pan
- School of Information Science and Engineering, Central South University, Changsha, China
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Jianxin Wang
- School of Information Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
16
|
Olariu V, Peterson C. Kinetic models of hematopoietic differentiation. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2018; 11:e1424. [PMID: 29660842 PMCID: PMC6191385 DOI: 10.1002/wsbm.1424] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Revised: 02/13/2018] [Accepted: 03/16/2018] [Indexed: 01/02/2023]
Abstract
As cell and molecular biology is becoming increasingly quantitative, there is an upsurge of interest in mechanistic modeling at different levels of resolution. Such models mostly concern kinetics and include gene and protein interactions as well as cell population dynamics. The final goal of these models is to provide experimental predictions, which is now taking on. However, even without matured predictions, kinetic models serve the purpose of compressing a plurality of experimental results into something that can empower the data interpretation, and importantly, suggesting new experiments by turning "knobs" in silico. Once formulated, kinetic models can be executed in terms of molecular rate equations for concentrations or by stochastic simulations when only a limited number of copies are involved. Developmental processes, in particular those of stem and progenitor cell commitments, are not only topical but also particularly suitable for kinetic modeling due to the finite number of key genes involved in cellular decisions. Stem and progenitor cell commitment processes have been subject to intense experimental studies over the last decade with some emphasis on embryonic and hematopoietic stem cells. Gene and protein interactions governing these processes can be modeled by binary Boolean rules or by continuous-valued models with interactions set by binding strengths. Conceptual insights along with tested predictions have emerged from such kinetic models. Here we review kinetic modeling efforts applied to stem cell developmental systems with focus on hematopoiesis. We highlight the future challenges including multi-scale models integrating cell dynamical and transcriptional models. This article is categorized under: Models of Systems Properties and Processes > Mechanistic Models Developmental Biology > Stem Cell Biology and Regeneration.
Collapse
Affiliation(s)
- Victor Olariu
- Department of Computational Biology, Lund University, Lund, Sweden
| | - Carsten Peterson
- Department of Computational Biology, Lund University, Lund, Sweden
| |
Collapse
|
17
|
Yu B, Xu JM, Li S, Chen C, Chen RX, Wang L, Zhang Y, Wang MH. Inference of time-delayed gene regulatory networks based on dynamic Bayesian network hybrid learning method. Oncotarget 2017; 8:80373-80392. [PMID: 29113310 PMCID: PMC5655205 DOI: 10.18632/oncotarget.21268] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 08/27/2017] [Indexed: 01/31/2023] Open
Abstract
Gene regulatory networks (GRNs) research reveals complex life phenomena from the perspective of gene interaction, which is an important research field in systems biology. Traditional Bayesian networks have a high computational complexity, and the network structure scoring model has a single feature. Information-based approaches cannot identify the direction of regulation. In order to make up for the shortcomings of the above methods, this paper presents a novel hybrid learning method (DBNCS) based on dynamic Bayesian network (DBN) to construct the multiple time-delayed GRNs for the first time, combining the comprehensive score (CS) with the DBN model. DBNCS algorithm first uses CMI2NI (conditional mutual inclusive information-based network inference) algorithm for network structure profiles learning, namely the construction of search space. Then the redundant regulations are removed by using the recursive optimization algorithm (RO), thereby reduce the false positive rate. Secondly, the network structure profiles are decomposed into a set of cliques without loss, which can significantly reduce the computational complexity. Finally, DBN model is used to identify the direction of gene regulation within the cliques and search for the optimal network structure. The performance of DBNCS algorithm is evaluated by the benchmark GRN datasets from DREAM challenge as well as the SOS DNA repair network in Escherichia coli, and compared with other state-of-the-art methods. The experimental results show the rationality of the algorithm design and the outstanding performance of the GRNs.
Collapse
Affiliation(s)
- Bin Yu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- CAS Key Laboratory of Geospace Environment, Department of Geophysics and Planetary Science, University of Science and Technology of China, Hefei 230026, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Jia-Meng Xu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Shan Li
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Cheng Chen
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Rui-Xin Chen
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Lei Wang
- Key Laboratory of Eco-chemical Engineering, Ministry of Education, Laboratory of Inorganic Synthesis and Applied Chemistry, College of Chemistry and Molecular Engineering, Qingdao University of Science and Technology, Qingdao 266042, China
| | - Yan Zhang
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
- College of Electromechanical Engineering, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Ming-Hui Wang
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China
- Bioinformatics and Systems Biology Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| |
Collapse
|
18
|
Arshad OA, Datta A. Towards targeted combinatorial therapy design for the treatment of castration-resistant prostate cancer. BMC Bioinformatics 2017; 18:134. [PMID: 28361666 PMCID: PMC5374594 DOI: 10.1186/s12859-017-1522-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Prostate cancer is one of the most prevalent cancers in males in the United States and amongst the leading causes of cancer related deaths. A particularly virulent form of this disease is castration-resistant prostate cancer (CRPC), where patients no longer respond to medical or surgical castration. CRPC is a complex, multifaceted and heterogeneous malady with limited standard treatment options. RESULTS The growth and progression of prostate cancer is a complicated process that involves multiple pathways. The signaling network comprising the integral constituents of the signature pathways involved in the development and progression of prostate cancer is modeled as a combinatorial circuit. The failures in the gene regulatory network that lead to cancer are abstracted as faults in the equivalent circuit and the Boolean circuit model is then used to design therapies tailored to counteract the effect of each molecular abnormality and to propose potentially efficacious combinatorial therapy regimens. Furthermore, stochastic computational modeling is utilized to identify potentially vulnerable components in the network that may serve as viable candidates for drug development. CONCLUSION The results presented herein can aid in the design of scientifically well-grounded targeted therapies that can be employed for the treatment of prostate cancer patients.
Collapse
Affiliation(s)
- Osama Ali Arshad
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA.,Center for Bioinformatics and Genomics Systems Engineering, Texas A&M University, College Station, TX, USA
| | - Aniruddha Datta
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA. .,Center for Bioinformatics and Genomics Systems Engineering, Texas A&M University, College Station, TX, USA.
| |
Collapse
|
19
|
Jereesh AS, Govindan VK. Immuno-hybrid algorithm: a novel hybrid approach for GRN reconstruction. 3 Biotech 2016; 6:222. [PMID: 28330294 PMCID: PMC5065543 DOI: 10.1007/s13205-016-0536-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Accepted: 10/03/2016] [Indexed: 11/28/2022] Open
Abstract
Bio-inspired algorithms are widely used to optimize the model parameters of GRN. In this paper, focus is given to develop improvised versions of bio-inspired algorithm for the specific problem of reconstruction of gene regulatory network. The approach is applied to the data set that was developed by the DNA microarray technology through biological experiments in the lab. This paper introduced a novel hybrid method, which combines the clonal selection algorithm and BFGS Quasi-Newton algorithm. The proposed approach implemented for real world E. coli data set and identified most of the relations. The results are also compared with the existing methods and proven to be efficient.
Collapse
Affiliation(s)
- A. S. Jereesh
- Department of Computer Science, Cochin University of Science and Technology, Cochin, Kerala India
| | - V. K. Govindan
- Department of Computer Science and Engineering, Indian Institute of Information Technology Pala, Kottayam, Kerala India
| |
Collapse
|
20
|
Murrugarra D, Miller J, Mueller AN. Estimating Propensity Parameters Using Google PageRank and Genetic Algorithms. Front Neurosci 2016; 10:513. [PMID: 27891072 PMCID: PMC5104906 DOI: 10.3389/fnins.2016.00513] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Accepted: 10/25/2016] [Indexed: 12/03/2022] Open
Abstract
Stochastic Boolean networks, or more generally, stochastic discrete networks, are an important class of computational models for molecular interaction networks. The stochasticity stems from the updating schedule. Standard updating schedules include the synchronous update, where all the nodes are updated at the same time, and the asynchronous update where a random node is updated at each time step. The former produces a deterministic dynamics while the latter a stochastic dynamics. A more general stochastic setting considers propensity parameters for updating each node. Stochastic Discrete Dynamical Systems (SDDS) are a modeling framework that considers two propensity parameters for updating each node and uses one when the update has a positive impact on the variable, that is, when the update causes the variable to increase its value, and uses the other when the update has a negative impact, that is, when the update causes it to decrease its value. This framework offers additional features for simulations but also adds a complexity in parameter estimation of the propensities. This paper presents a method for estimating the propensity parameters for SDDS. The method is based on adding noise to the system using the Google PageRank approach to make the system ergodic and thus guaranteeing the existence of a stationary distribution. Then with the use of a genetic algorithm, the propensity parameters are estimated. Approximation techniques that make the search algorithms efficient are also presented and Matlab/Octave code to test the algorithms are available at http://www.ms.uky.edu/~dmu228/GeneticAlg/Code.html.
Collapse
Affiliation(s)
- David Murrugarra
- Department of Mathematics, University of Kentucky Lexington, KY, USA
| | - Jacob Miller
- Department of Mathematics, University of Kentucky Lexington, KY, USA
| | - Alex N Mueller
- Department of Mathematics, University of Kentucky Lexington, KY, USA
| |
Collapse
|
21
|
Kim D, Kang M, Biswas A, Liu C, Gao J. Integrative approach for inference of gene regulatory networks using lasso-based random featuring and application to psychiatric disorders. BMC Med Genomics 2016; 9 Suppl 2:50. [PMID: 27510319 PMCID: PMC4980788 DOI: 10.1186/s12920-016-0202-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Background Inferring gene regulatory networks is one of the most interesting research areas in the systems biology. Many inference methods have been developed by using a variety of computational models and approaches. However, there are two issues to solve. First, depending on the structural or computational model of inference method, the results tend to be inconsistent due to innately different advantages and limitations of the methods. Therefore the combination of dissimilar approaches is demanded as an alternative way in order to overcome the limitations of standalone methods through complementary integration. Second, sparse linear regression that is penalized by the regularization parameter (lasso) and bootstrapping-based sparse linear regression methods were suggested in state of the art methods for network inference but they are not effective for a small sample size data and also a true regulator could be missed if the target gene is strongly affected by an indirect regulator with high correlation or another true regulator. Results We present two novel network inference methods based on the integration of three different criteria, (i) z-score to measure the variation of gene expression from knockout data, (ii) mutual information for the dependency between two genes, and (iii) linear regression-based feature selection. Based on these criterion, we propose a lasso-based random feature selection algorithm (LARF) to achieve better performance overcoming the limitations of bootstrapping as mentioned above. Conclusions In this work, there are three main contributions. First, our z score-based method to measure gene expression variations from knockout data is more effective than similar criteria of related works. Second, we confirmed that the true regulator selection can be effectively improved by LARF. Lastly, we verified that an integrative approach can clearly outperform a single method when two different methods are effectively jointed. In the experiments, our methods were validated by outperforming the state of the art methods on DREAM challenge data, and then LARF was applied to inferences of gene regulatory network associated with psychiatric disorders.
Collapse
Affiliation(s)
- Dongchul Kim
- Department of Computer Science, University of Texas Rio Grande Valley, Edinburg, 78541, TX, US
| | - Mingon Kang
- Department of Computer Science, Kennesaw State University, Marietta, 30144, GA, US
| | - Ashis Biswas
- Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, 76019, TX, US
| | - Chunyu Liu
- Department of Psychiatry, University of Illinois at Chicago, Chicago, 60607, IL, US
| | - Jean Gao
- Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, 76019, TX, US.
| |
Collapse
|
22
|
Mayer G, Marcus K, Eisenacher M, Kohl M. Boolean modeling techniques for protein co-expression networks in systems medicine. Expert Rev Proteomics 2016; 13:555-69. [PMID: 27105325 DOI: 10.1080/14789450.2016.1181546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
INTRODUCTION Application of systems biology/systems medicine approaches is promising for proteomics/biomedical research, but requires selection of an adequate modeling type. AREAS COVERED This article reviews the existing Boolean network modeling approaches, which provide in comparison with alternative modeling techniques several advantages for the processing of proteomics data. Application of methods for inference, reduction and validation of protein co-expression networks that are derived from quantitative high-throughput proteomics measurements is presented. It's also shown how Boolean models can be used to derive system-theoretic characteristics that describe both the dynamical behavior of such networks as a whole and the properties of different cell states (e.g. healthy or diseased cell states). Furthermore, application of methods derived from control theory is proposed in order to simulate the effects of therapeutic interventions on such networks, which is a promising approach for the computer-assisted discovery of biomarkers and drug targets. Finally, the clinical application of Boolean modeling analyses is discussed. Expert commentary: Boolean modeling of proteomics data is still in its infancy. Progress in this field strongly depends on provision of a repository with public access to relevant reference models. Also required are community supported standards that facilitate input of both proteomics and patient related data (e.g. age, gender, laboratory results, etc.).
Collapse
Affiliation(s)
- Gerhard Mayer
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Katrin Marcus
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Martin Eisenacher
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| | - Michael Kohl
- a Medizinisches Proteom Center (MPC) , Ruhr-Universität Bochum , Bochum , Germany
| |
Collapse
|
23
|
Identification of Potential Drug Targets in Cancer Signaling Pathways using Stochastic Logical Models. Sci Rep 2016; 6:23078. [PMID: 26988076 PMCID: PMC4796823 DOI: 10.1038/srep23078] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 02/24/2016] [Indexed: 01/16/2023] Open
Abstract
The investigation of vulnerable components in a signaling pathway can contribute to development of drug therapy addressing aberrations in that pathway. Here, an original signaling pathway is derived from the published literature on breast cancer models. New stochastic logical models are then developed to analyze the vulnerability of the components in multiple signalling sub-pathways involved in this signaling cascade. The computational results are consistent with the experimental results, where the selected proteins were silenced using specific siRNAs and the viability of the cells were analyzed 72 hours after silencing. The genes elF4E and NFkB are found to have nearly no effect on the relative cell viability and the genes JAK2, Stat3, S6K, JUN, FOS, Myc, and Mcl1 are effective candidates to influence the relative cell growth. The vulnerabilities of some targets such as Myc and S6K are found to vary significantly depending on the weights of the sub-pathways; this will be indicative of the chosen target to require customization for therapy. When these targets are utilized, the response of breast cancers from different patients will be highly variable because of the known heterogeneities in signaling pathways among the patients. The targets whose vulnerabilities are invariably high might be more universally acceptable targets.
Collapse
|
24
|
Modeling Gene Networks in Saccharomyces cerevisiae Based on Gene Expression Profiles. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015; 2015:621264. [PMID: 26839582 PMCID: PMC4709922 DOI: 10.1155/2015/621264] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Revised: 10/14/2015] [Accepted: 11/16/2015] [Indexed: 11/30/2022]
Abstract
Detailed and innovative analysis of gene regulatory network structures may reveal novel insights to biological mechanisms. Here we study how gene regulatory network in Saccharomyces cerevisiae can differ under aerobic and anaerobic conditions. To achieve this, we discretized the gene expression profiles and calculated the self-entropy of down- and upregulation of gene expression as well as joint entropy. Based on these quantities the uncertainty coefficient was calculated for each gene triplet, following which, separate gene logic networks were constructed for the aerobic and anaerobic conditions. Four structural parameters such as average degree, average clustering coefficient, average shortest path, and average betweenness were used to compare the structure of the corresponding aerobic and anaerobic logic networks. Five genes were identified to be putative key components of the two energy metabolisms. Furthermore, community analysis using the Newman fast algorithm revealed two significant communities for the aerobic but only one for the anaerobic network. David Gene Functional Classification suggests that, under aerobic conditions, one such community reflects the cell cycle and cell replication, while the other one is linked to the mitochondrial respiratory chain function.
Collapse
|
25
|
Vincent J, Martre P, Gouriou B, Ravel C, Dai Z, Petit JM, Pailloux M. RulNet: A Web-Oriented Platform for Regulatory Network Inference, Application to Wheat -Omics Data. PLoS One 2015; 10:e0127127. [PMID: 25993562 PMCID: PMC4437996 DOI: 10.1371/journal.pone.0127127] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2014] [Accepted: 01/20/2015] [Indexed: 12/19/2022] Open
Abstract
With the increasing amount of -omics data available, a particular effort has to be made to provide suitable analysis tools. A major challenge is that of unraveling the molecular regulatory networks from massive and heterogeneous datasets. Here we describe RulNet, a web-oriented platform dedicated to the inference and analysis of regulatory networks from qualitative and quantitative -omics data by means of rule discovery. Queries for rule discovery can be written in an extended form of the RQL query language, which has a syntax similar to SQL. RulNet also offers users interactive features that progressively adjust and refine the inferred networks. In this paper, we present a functional characterization of RulNet and compare inferred networks with correlation-based approaches. The performance of RulNet has been evaluated using the three benchmark datasets used for the transcriptional network inference challenge DREAM5. Overall, RulNet performed as well as the best methods that participated in this challenge and it was shown to behave more consistently when compared across the three datasets. Finally, we assessed the suitability of RulNet to analyze experimental -omics data and to infer regulatory networks involved in the response to nitrogen and sulfur supply in wheat (Triticum aestivum L.) grains. The results highlight putative actors governing the response to nitrogen and sulfur supply in wheat grains. We evaluate the main characteristics and features of RulNet as an all-in-one solution for RN inference, visualization and editing. Using simple yet powerful RulNet queries allowed RNs involved in the adaptation of wheat grain to N and S supply to be discovered. We demonstrate the effectiveness and suitability of RulNet as a platform for the analysis of RNs involving different types of -omics data. The results are promising since they are consistent with what was previously established by the scientific community.
Collapse
Affiliation(s)
- Jonathan Vincent
- Blaise Pascal University, UMR6158 CNRS LIMOS Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes, Aubière, F-63 173, France
- INRA, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Clermont-Ferrand, F-63 039, France
- Blaise Pascal University, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Aubière, F-63 177, France
| | - Pierre Martre
- INRA, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Clermont-Ferrand, F-63 039, France
- Blaise Pascal University, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Aubière, F-63 177, France
| | - Benjamin Gouriou
- Blaise Pascal University, UMR6158 CNRS LIMOS Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes, Aubière, F-63 173, France
| | - Catherine Ravel
- INRA, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Clermont-Ferrand, F-63 039, France
- Blaise Pascal University, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Aubière, F-63 177, France
| | - Zhanwu Dai
- INRA, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Clermont-Ferrand, F-63 039, France
- Blaise Pascal University, UMR1095 Genetics, Diversity and Ecophysiology of Cereals, Aubière, F-63 177, France
| | - Jean-Marc Petit
- INSA Lyon, UMR5205 CNRS LIRIS Laboratoire d’Informatique en Images et Systèmes d’Information, Villeurbanne, F-69 621, France
| | - Marie Pailloux
- Blaise Pascal University, UMR6158 CNRS LIMOS Laboratoire d'Informatique, de Modélisation et d'Optimisation des Systèmes, Aubière, F-63 173, France
| |
Collapse
|
26
|
Brandon M, Howard B, Lawrence C, Laubenbacher R. Iron acquisition and oxidative stress response in aspergillus fumigatus. BMC SYSTEMS BIOLOGY 2015; 9:19. [PMID: 25908096 PMCID: PMC4418068 DOI: 10.1186/s12918-015-0163-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 03/31/2015] [Indexed: 01/08/2023]
Abstract
BACKGROUND Aspergillus fumigatus is a ubiquitous airborne fungal pathogen that presents a life-threatening health risk to individuals with weakened immune systems. A. fumigatus pathogenicity depends on its ability to acquire iron from the host and to resist host-generated oxidative stress. Gaining a deeper understanding of the molecular mechanisms governing A. fumigatus iron acquisition and oxidative stress response may ultimately help to improve the diagnosis and treatment of invasive aspergillus infections. RESULTS This study follows a systems biology approach to investigate how adaptive behaviors emerge from molecular interactions underlying A. fumigatus iron regulation and oxidative stress response. We construct a Boolean network model from known interactions and simulate how changes in environmental iron and superoxide levels affect network dynamics. We propose rules for linking long term model behavior to qualitative estimates of cell growth and cell death. These rules are used to predict phenotypes of gene deletion strains. The model is validated on the basis of its ability to reproduce literature data not used in model generation. CONCLUSIONS The model reproduces gene expression patterns in experimental time course data when A. fumigatus is switched from a low iron to a high iron environment. In addition, the model is able to accurately represent the phenotypes of many knockout strains under varying iron and superoxide conditions. Model simulations support the hypothesis that intracellular iron regulates A. fumigatus transcription factors, SreA and HapX, by a post-translational, rather than transcriptional, mechanism. Finally, the model predicts that blocking siderophore-mediated iron uptake reduces resistance to oxidative stress. This indicates that combined targeting of siderophore-mediated iron uptake and the oxidative stress response network may act synergistically to increase fungal cell killing.
Collapse
Affiliation(s)
- Madison Brandon
- Center for Cell Analysis and Modeling, University of Connecticut Health Center, 400 Farmington Ave, Farmington, 06030, USA. .,Center for Quantitative Medicine, University of Connecticut Health Center, Farmington, 06030, USA.
| | - Brad Howard
- Department of Biological Sciences, Virginia Tech, 1405 Perry Street, Blacksburg, 24061, USA. .,Virginia Bioinformatics Institute, Virginia Tech, 1015 Life Science Circle, Blacksburg, 24061, US.
| | - Christopher Lawrence
- Department of Biological Sciences, Virginia Tech, 1405 Perry Street, Blacksburg, 24061, USA. .,Virginia Bioinformatics Institute, Virginia Tech, 1015 Life Science Circle, Blacksburg, 24061, US.
| | - Reinhard Laubenbacher
- Center for Quantitative Medicine, University of Connecticut Health Center, Farmington, 06030, USA. .,The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, 06030, USA. .,Department of Cell Biology, University of Connecticut Health Center, 263 Farmington Ave, Farmington, 06030, USA.
| |
Collapse
|
27
|
Abstract
Behaviours of complex biomolecular systems are often irreducible to the elementary properties of their individual components. Explanatory and predictive mathematical models are therefore useful for fully understanding and precisely engineering cellular functions. The development and analyses of these models require their adaptation to the problems that need to be solved and the type and amount of available genetic or molecular data. Quantitative and logic modelling are among the main methods currently used to model molecular and gene networks. Each approach comes with inherent advantages and weaknesses. Recent developments show that hybrid approaches will become essential for further progress in synthetic biology and in the development of virtual organisms.
Collapse
Affiliation(s)
- Nicolas Le Novère
- Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, UK
| |
Collapse
|
28
|
Abstract
Logical models have widely been used to gain insights into the biological behavior of gene regulatory networks (GRNs). Most logical models assume a synchronous update of the genes' states in a GRN. However, this may not be appropriate, because each gene may require a different period of time for changing its state. In this article, asynchronous stochastic Boolean networks (ASBNs) are proposed for investigating various asynchronous state-updating strategies in a GRN. As in stochastic computation, ASBNs use randomly permutated stochastic sequences to encode probability. Investigated by several stochasticity models, a GRN is considered to be subject to noise and external perturbation. Hence, both stochasticity and asynchronicity are considered in the state evolution of a GRN. As a case study, ASBNs are utilized to investigate the dynamic behavior of a T helper network. It is shown that ASBNs are efficient in evaluating the steady-state distributions (SSDs) of the network with random gene perturbation. The SSDs found by using ASBNs show the robustness of the attractors of the T helper network, when various stochasticity and asynchronicity models are considered to investigate its dynamic behavior.
Collapse
Affiliation(s)
- Peican Zhu
- Department of Electrical and Computer Engineering, University of Alberta , Edmonton, Alberta, Canada
| | | |
Collapse
|
29
|
Inference of SNP-gene regulatory networks by integrating gene expressions and genetic perturbations. BIOMED RESEARCH INTERNATIONAL 2014; 2014:629697. [PMID: 25136606 PMCID: PMC4127230 DOI: 10.1155/2014/629697] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Accepted: 05/09/2014] [Indexed: 11/18/2022]
Abstract
In order to elucidate the overall relationships between gene expressions and genetic perturbations, we propose a network inference method to infer gene regulatory network where single nucleotide polymorphism (SNP) is involved as a regulator of genes. In the most of the network inferences named as SNP-gene regulatory network (SGRN) inference, pairs of SNP-gene are given by separately performing expression quantitative trait loci (eQTL) mappings. In this paper, we propose a SGRN inference method without predefined eQTL information assuming a gene is regulated by a single SNP at most. To evaluate the performance, the proposed method was applied to random data generated from synthetic networks and parameters. There are three main contributions. First, the proposed method provides both the gene regulatory inference and the eQTL identification. Second, the experimental results demonstrated that integration of multiple methods can produce competitive performances. Lastly, the proposed method was also applied to psychiatric disorder data in order to explore how the method works with real data.
Collapse
|
30
|
Zhu P, Liang J, Han J. Gene perturbation and intervention in context-sensitive stochastic Boolean networks. BMC SYSTEMS BIOLOGY 2014; 8:60. [PMID: 24886608 PMCID: PMC4062525 DOI: 10.1186/1752-0509-8-60] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2013] [Accepted: 04/22/2014] [Indexed: 01/03/2023]
Abstract
Background In a gene regulatory network (GRN), gene expressions are affected by noise, and stochastic fluctuations exist in the interactions among genes. These stochastic interactions are context dependent, thus it becomes important to consider noise in a context-sensitive manner in a network model. As a logical model, context-sensitive probabilistic Boolean networks (CSPBNs) account for molecular and genetic noise in the temporal context of gene functions. In a CSPBN with n genes and k contexts, however, a computational complexity of O(nk222n) (or O(nk2n)) is required for an accurate (or approximate) computation of the state transition matrix (STM) of the size (2n ∙ k) × (2n ∙ k) (or 2n × 2n). The evaluation of a steady state distribution (SSD) is more challenging. Recently, stochastic Boolean networks (SBNs) have been proposed as an efficient implementation of an instantaneous PBN. Results The notion of stochastic Boolean networks (SBNs) is extended for the general model of PBNs, i.e., CSPBNs. This yields a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a GRN. A CSSBN enables an efficient simulation of a CSPBN with a complexity of O(nLk2n) for computing the state transition matrix, where L is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. A time-frame expanded CSSBN can further efficiently simulate the stationary behavior of a CSPBN and allow for a tunable tradeoff between accuracy and efficiency. The CSSBN approach is more efficient than an analytical method and more accurate than an approximate analysis. Conclusions Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks. A CSSBN analysis provides biologically meaningful insights into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment. It is shown that random gene perturbation has a greater effect on the final distribution of the steady state of a network compared to context switching activities. The CSSBN approach can further predict the steady state distribution of a glioma network under gene intervention. Ultimately, this will help drug discovery and develop effective drug intervention strategies.
Collapse
Affiliation(s)
| | | | - Jie Han
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
31
|
Verification and optimal control of context-sensitive probabilistic Boolean networks using model checking and polynomial optimization. ScientificWorldJournal 2014; 2014:968341. [PMID: 24587766 PMCID: PMC3920625 DOI: 10.1155/2014/968341] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 10/28/2013] [Indexed: 11/18/2022] Open
Abstract
One of the significant topics in systems biology is to develop control theory of gene regulatory networks (GRNs). In typical
control of GRNs, expression of some genes is inhibited (activated) by manipulating external stimuli and expression of other genes. It is expected to apply control theory of GRNs to gene therapy technologies in the future. In this paper, a control method using a Boolean network (BN) is studied. A BN is widely used as a model of GRNs, and gene expression is expressed by a binary value (ON or OFF). In particular, a context-sensitive probabilistic Boolean network (CS-PBN), which is one of the extended models of BNs, is used. For CS-PBNs, the verification problem and the optimal control problem are considered. For the verification problem, a solution method using the probabilistic model checker PRISM is proposed. For the optimal control problem, a solution method using polynomial optimization is proposed. Finally, a numerical example on the WNT5A network, which is related to melanoma, is presented. The proposed methods provide us useful tools in control theory of GRNs.
Collapse
|
32
|
Patton KL, John DJ, Norris JL, Lewis DR, Muday GK. Hierarchical Probabilistic Interaction Modeling for Multiple Gene Expression Replicates. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:336-346. [PMID: 26355781 DOI: 10.1109/tcbb.2014.2299804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Microarray technology allows for the collection of multiple replicates of gene expression time course data for hundreds of genes at a handful of time points. Developing hypotheses about a gene transcriptional network, based on time course gene expression data is an important and very challenging problem. In many situations there are similarities which suggest a hierarchical structure between the replicates. This paper develops posterior probabilities for network features based on multiple hierarchical replications. Through Bayesian inference, in conjunction with the Metropolis-Hastings algorithm and model averaging, a hierarchical multiple replicate algorithm is applied to seven sets of simulated data and to a set of Arabidopsis thaliana gene expression data. The models of the simulated data suggest high posterior probabilities for pairs of genes which have at least moderate signal partial correlation. For the Arabidopsis model, many of the highest posterior probability edges agree with the literature.
Collapse
|
33
|
Zhu P, Han J. Stochastic multiple-valued gene networks. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2014; 8:42-53. [PMID: 24681918 DOI: 10.1109/tbcas.2013.2291398] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Among various approaches to modeling gene regulatory networks (GRNs), Boolean networks (BNs) and its probabilistic extension, probabilistic Boolean networks (PBNs), have been studied to gain insights into the dynamics of GRNs. To further exploit the simplicity of logical models, a multiple-valued network employs gene states that are not limited to binary values, thus providing a finer granularity in the modeling of GRNs. In this paper, stochastic multiple-valued networks (SMNs) are proposed for modeling the effects of noise and gene perturbation in a GRN. An SMN enables an accurate and efficient simulation of a probabilistic multiple-valued network (as an extension of a PBN). In a k-level SMN of n genes, it requires a complexity of O(nLk(n)) to compute the state transition matrix, where L is a factor related to the minimum sequence length in the SMN for achieving a desired accuracy. The use of randomly permuted stochastic sequences further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. The analysis of a p53-Mdm2 network and a WNT5A network shows that the proposed SMN approach is efficient in evaluating the network dynamics and steady state distribution of gene networks under random gene perturbation.
Collapse
|
34
|
Optimal control of gene regulatory networks with effectiveness of multiple drugs: a Boolean network approach. BIOMED RESEARCH INTERNATIONAL 2013; 2013:246761. [PMID: 24058904 PMCID: PMC3763581 DOI: 10.1155/2013/246761] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 06/25/2013] [Accepted: 07/12/2013] [Indexed: 11/17/2022]
Abstract
Developing control theory of gene regulatory networks is one of the significant topics in the field of systems biology, and it is expected to apply the obtained results to gene therapy technologies in the future. In this paper, a control method using a Boolean network (BN) is studied. A BN is widely used as a model of gene regulatory networks, and gene expression is expressed by a binary value (0 or 1). In the control problem, we assume that the concentration level of a part of genes is arbitrarily determined as the control input. However, there are cases that no gene satisfying this assumption exists, and it is important to consider structural control via external stimuli. Furthermore, these controls are realized by multiple drugs, and it is also important to consider multiple effects such as duration of effect and side effects. In this paper, we propose a BN model with two types of the control inputs and an optimal control method with duration of drug effectiveness. First, a BN model and duration of drug effectiveness are discussed. Next, the optimal control problem is formulated and is reduced to an integer linear programming problem. Finally, numerical simulations are shown.
Collapse
|
35
|
Berestovsky N, Nakhleh L. An Evaluation of Methods for Inferring Boolean Networks from Time-Series Data. PLoS One 2013; 8:e66031. [PMID: 23805196 PMCID: PMC3689729 DOI: 10.1371/journal.pone.0066031] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 05/06/2013] [Indexed: 01/12/2023] Open
Abstract
Regulatory networks play a central role in cellular behavior and decision making. Learning these regulatory networks is a major task in biology, and devising computational methods and mathematical models for this task is a major endeavor in bioinformatics. Boolean networks have been used extensively for modeling regulatory networks. In this model, the state of each gene can be either ‘on’ or ‘off’ and that next-state of a gene is updated, synchronously or asynchronously, according to a Boolean rule that is applied to the current-state of the entire system. Inferring a Boolean network from a set of experimental data entails two main steps: first, the experimental time-series data are discretized into Boolean trajectories, and then, a Boolean network is learned from these Boolean trajectories. In this paper, we consider three methods for data discretization, including a new one we propose, and three methods for learning Boolean networks, and study the performance of all possible nine combinations on four regulatory systems of varying dynamics complexities. We find that employing the right combination of methods for data discretization and network learning results in Boolean networks that capture the dynamics well and provide predictive power. Our findings are in contrast to a recent survey that placed Boolean networks on the low end of the “faithfulness to biological reality” and “ability to model dynamics” spectra. Further, contrary to the common argument in favor of Boolean networks, we find that a relatively large number of time points in the time-series data is required to learn good Boolean networks for certain data sets. Last but not least, while methods have been proposed for inferring Boolean networks, as discussed above, missing still are publicly available implementations thereof. Here, we make our implementation of the methods available publicly in open source at http://bioinfo.cs.rice.edu/.
Collapse
Affiliation(s)
- Natalie Berestovsky
- Department of Computer Science, Rice University, Houston, Texas, United States of America
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
36
|
Kitchen JL, Allaby RG. Systems Modeling at Multiple Levels of Regulation: Linking Systems and Genetic Networks to Spatially Explicit Plant Populations. PLANTS (BASEL, SWITZERLAND) 2013; 2:16-49. [PMID: 27137364 PMCID: PMC4844292 DOI: 10.3390/plants2010016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 12/21/2012] [Accepted: 01/16/2013] [Indexed: 11/16/2022]
Abstract
Selection and adaptation of individuals to their underlying environments are highly dynamical processes, encompassing interactions between the individual and its seasonally changing environment, synergistic or antagonistic interactions between individuals and interactions amongst the regulatory genes within the individual. Plants are useful organisms to study within systems modeling because their sedentary nature simplifies interactions between individuals and the environment, and many important plant processes such as germination or flowering are dependent on annual cycles which can be disrupted by climate behavior. Sedentism makes plants relevant candidates for spatially explicit modeling that is tied in with dynamical environments. We propose that in order to fully understand the complexities behind plant adaptation, a system that couples aspects from systems biology with population and landscape genetics is required. A suitable system could be represented by spatially explicit individual-based models where the virtual individuals are located within time-variable heterogeneous environments and contain mutable regulatory gene networks. These networks could directly interact with the environment, and should provide a useful approach to studying plant adaptation.
Collapse
Affiliation(s)
- James L Kitchen
- School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK
| | - Robin G Allaby
- School of Life Sciences, University of Warwick, Coventry, CV4 7AL, UK.
| |
Collapse
|
37
|
Balasubramanian D, Schneper L, Kumari H, Mathee K. A dynamic and intricate regulatory network determines Pseudomonas aeruginosa virulence. Nucleic Acids Res 2012; 41:1-20. [PMID: 23143271 PMCID: PMC3592444 DOI: 10.1093/nar/gks1039] [Citation(s) in RCA: 317] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Pseudomonas aeruginosa is a metabolically versatile bacterium that is found in a wide range of biotic and abiotic habitats. It is a major human opportunistic pathogen causing numerous acute and chronic infections. The critical traits contributing to the pathogenic potential of P. aeruginosa are the production of a myriad of virulence factors, formation of biofilms and antibiotic resistance. Expression of these traits is under stringent regulation, and it responds to largely unidentified environmental signals. This review is focused on providing a global picture of virulence gene regulation in P. aeruginosa. In addition to key regulatory pathways that control the transition from acute to chronic infection phenotypes, some regulators have been identified that modulate multiple virulence mechanisms. Despite of a propensity for chaotic behaviour, no chaotic motifs were readily observed in the P. aeruginosa virulence regulatory network. Having a ‘birds-eye’ view of the regulatory cascades provides the forum opportunities to pose questions, formulate hypotheses and evaluate theories in elucidating P. aeruginosa pathogenesis. Understanding the mechanisms involved in making P. aeruginosa a successful pathogen is essential in helping devise control strategies.
Collapse
Affiliation(s)
- Deepak Balasubramanian
- Department of Biological Sciences, College of Arts and Science, Florida International University, Miami, FL 33199, USA
| | | | | | | |
Collapse
|