1
|
Yang B, Bao W, Chen B. PGRNIG: novel parallel gene regulatory network identification algorithm based on GPU. Brief Funct Genomics 2022; 21:441-454. [PMID: 36064791 DOI: 10.1093/bfgp/elac028] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 07/30/2022] [Accepted: 08/03/2022] [Indexed: 12/14/2022] Open
Abstract
Molecular biology has revealed that complex life phenomena can be treated as the result of many gene interactions. Investigating these interactions and understanding the intrinsic mechanisms of biological systems using gene expression data have attracted a lot of attention. As a typical gene regulatory network (GRN) inference method, the S-system has been utilized to deal with small-scale network identification. However, it is extremely difficult to optimize it to infer medium-to-large networks. This paper proposes a novel parallel swarm intelligent algorithm, PGRNIG, to optimize the parameters of the S-system. We employed the clone selection strategy to improve the whale optimization algorithm (CWOA). To enhance the time efficiency of CWOA optimization, we utilized a parallel CWOA (PCWOA) based on the compute unified device architecture (CUDA) platform. Decomposition strategy and L1 regularization were utilized to reduce the search space and complexity of GRN inference. We applied the PGRNIG algorithm on three synthetic datasets and two real time-series expression datasets of the species of Escherichia coli and Saccharomyces cerevisiae. Experimental results show that PGRNIG could infer the gene regulatory network more accurately than other state-of-the-art methods with a convincing computational speed-up. Our findings show that CWOA and PCWOA have faster convergence performances than WOA.
Collapse
Affiliation(s)
- Bin Yang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China
| | - Wenzheng Bao
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou 221018, China
| | - Baitong Chen
- Xuzhou First People's Hospital, Xuzhou 221000, China
| |
Collapse
|
2
|
The partition representation of enzymatic reaction networks and its application for searching bi-stable reaction systems. PLoS One 2022; 17:e0263111. [PMID: 35081159 PMCID: PMC8791506 DOI: 10.1371/journal.pone.0263111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 01/12/2022] [Indexed: 11/30/2022] Open
Abstract
The signal transduction system, which is known as a regulatory mechanism for biochemical reaction systems in the cell, has been the subject of intensive research in recent years, and its design methods have become necessary from the viewpoint of synthetic biology. We proposed the partition representation of enzymatic reaction networks consisting of post-translational modification reactions such as phosphorylation, which is an important basic component of signal transduction systems, and attempted to find enzymatic reaction networks with bistability to demonstrate the effectiveness of the proposed representation method. The partition modifiers can be naturally introduced into the partition representation of enzymatic reaction networks when applied to search. By randomly applying the partition modifiers as appropriate, we searched for bistable and resettable enzymatic reaction networks consisting of four post-translational modification reactions. The proposed search algorithm worked well and we were able to find various bistable enzymatic reaction networks, including a typical bistable enzymatic reaction network with positive auto-feedbacks and mutually negative regulations. Since the search algorithm is divided into an evaluation function specific to the characteristics of the enzymatic reaction network to be searched and an independent algorithm part, it may be applied to search for dynamic properties such as biochemical adaptation, the ability to reset the biochemical state after responding to a stimulus, by replacing the evaluation function with one for other characteristics.
Collapse
|
3
|
Yang B, Zhang W. Traffic Data Prediction Based on Complex-Valued S-System Model. INTELLIGENT COMPUTING THEORIES AND APPLICATION 2020:423-431. [DOI: 10.1007/978-3-030-60802-6_37] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
4
|
Kizhakkethil Youseph AS, Chetty M, Karmakar G. Reverse engineering genetic networks using nonlinear saturation kinetics. Biosystems 2019; 182:30-41. [PMID: 31185246 DOI: 10.1016/j.biosystems.2019.103977] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 04/25/2019] [Accepted: 05/27/2019] [Indexed: 01/01/2023]
Abstract
A gene regulatory network (GRN) represents a set of genes along with their regulatory interactions. Cellular behavior is driven by genetic level interactions. Dynamics of such systems show nonlinear saturation kinetics which can be best modeled by Michaelis-Menten (MM) and Hill equations. Although MM equation is being widely used for modeling biochemical processes, it has been applied rarely for reverse engineering GRNs. In this paper, we develop a complete framework for a novel model for GRN inference using MM kinetics. A set of coupled equations is first proposed for modeling GRNs. In the coupled model, Michaelis-Menten constant associated with regulation by a gene is made invariant irrespective of the gene being regulated. The parameter estimation of the proposed model is carried out using an evolutionary optimization method, namely, trigonometric differential evolution (TDE). Subsequently, the model is further improved and the regulations of different genes by a given gene are made distinct by allowing varying values of Michaelis-Menten constants for each regulation. Apart from making the model more relevant biologically, the improvement results in a decoupled GRN model with fast estimation of model parameters. Further, to enhance exploitation of the search, we propose a local search algorithm based on hill climbing heuristics. A novel mutation operation is also proposed to avoid population stagnation and premature convergence. Real life benchmark data sets generated in vivo are used for validating the proposed model. Further, we also analyze realistic in silico datasets generated using GeneNetweaver. The comparison of the performance of proposed model with other existing methods shows the potential of the proposed model.
Collapse
Affiliation(s)
| | - Madhu Chetty
- School of Science, Engineering and Information Technology, Federation University Australia, Gippsland 3842, Australia
| | - Gour Karmakar
- School of Science, Engineering and Information Technology, Federation University Australia, Gippsland 3842, Australia
| |
Collapse
|
5
|
Inference of Biochemical S-Systems via Mixed-Variable Multiobjective Evolutionary Optimization. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2017; 2017:3020326. [PMID: 28607576 PMCID: PMC5457779 DOI: 10.1155/2017/3020326] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Accepted: 04/27/2017] [Indexed: 12/18/2022]
Abstract
Inference of the biochemical systems (BSs) via experimental data is important for understanding how biochemical components in vivo interact with each other. However, it is not a trivial task because BSs usually function with complex and nonlinear dynamics. As a popular ordinary equation (ODE) model, the S-System describes the dynamical properties of BSs by incorporating the power rule of biochemical reactions but behaves as a challenge because it has a lot of parameters to be confirmed. This work is dedicated to proposing a general method for inference of S-Systems by experimental data, using a biobjective optimization (BOO) model and a specially mixed-variable multiobjective evolutionary algorithm (mv-MOEA). Regarding that BSs are sparse in common sense, we introduce binary variables indicating network connections to eliminate the difficulty of threshold presetting and take data fitting error and the L0-norm as two objectives to be minimized in the BOO model. Then, a selection procedure that automatically runs tradeoff between two objectives is employed to choose final inference results from the obtained nondominated solutions of the mv-MOEA. Inference results of the investigated networks demonstrate that our method can identify their dynamical properties well, although the automatic selection procedure sometimes ignores some weak connections in BSs.
Collapse
|
6
|
Yang B, Liu S, Zhang W. Reverse engineering of gene regulatory network using restricted gene expression programming. J Bioinform Comput Biol 2016; 14:1650021. [PMID: 27338130 DOI: 10.1142/s0219720016500219] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Inference of gene regulatory networks has been becoming a major area of interest in the field of systems biology over the past decade. In this paper, we present a novel representation of S-system model, named restricted gene expression programming (RGEP), to infer gene regulatory network. A new hybrid evolutionary algorithm based on structure-based evolutionary algorithm and cuckoo search (CS) is proposed to optimize the architecture and corresponding parameters of model, respectively. Two synthetic benchmark datasets and one real biological dataset from SOS DNA repair network in E. coli are used to test the validity of our method. Experimental results demonstrate that our proposed method performs better than previously proposed popular methods.
Collapse
Affiliation(s)
- Bin Yang
- 1 School of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China
| | - Sanrong Liu
- 1 School of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China
| | - Wei Zhang
- 1 School of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China
| |
Collapse
|
7
|
Mandal S, Khan A, Saha G, Pal RK. Reverse engineering of gene regulatory networks based on S-systems and Bat algorithm. J Bioinform Comput Biol 2016; 14:1650010. [DOI: 10.1142/s0219720016500104] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The correct inference of gene regulatory networks for the understanding of the intricacies of the complex biological regulations remains an intriguing task for researchers. With the availability of large dimensional microarray data, relationships among thousands of genes can be simultaneously extracted. Among the prevalent models of reverse engineering genetic networks, S-system is considered to be an efficient mathematical tool. In this paper, Bat algorithm, based on the echolocation of bats, has been used to optimize the S-system model parameters. A decoupled S-system has been implemented to reduce the complexity of the algorithm. Initially, the proposed method has been successfully tested on an artificial network with and without the presence of noise. Based on the fact that a real-life genetic network is sparsely connected, a novel Accumulative Cardinality based decoupled S-system has been proposed. The cardinality has been varied from zero up to a maximum value, and this model has been implemented for the reconstruction of the DNA SOS repair network of Escherichia coli. The obtained results have shown significant improvements in the detection of a greater number of true regulations, and in the minimization of false detections compared to other existing methods.
Collapse
Affiliation(s)
- Sudip Mandal
- Electronics and Communication Engineering Department, Global Institute of Management and Technology Krishnanagar, West Bengal 741102, India
| | - Abhinandan Khan
- Computer Science and Engineering Department, University of Calcutta, Acharya Prafulla Chandra Siksha Prangan, JD-2, Sector – III, Salt Lake Kolkata 700098, India
| | - Goutam Saha
- Information Technology Department, North Eastern Hill University, Umshing, Mawkynroh, Shillong, Meghalaya 793022, India
| | - Rajat Kumar Pal
- Computer Science and Engineering Department, University of Calcutta, Acharya Prafulla Chandra Siksha Prangan, JD-2, Sector – III, Salt Lake Kolkata 700098, India
| |
Collapse
|
8
|
Zhan C, Li BYS, Yeung LF. Structural and practical identifiability analysis of S-system. IET Syst Biol 2015; 9:285-293. [PMID: 26577163 PMCID: PMC8687182 DOI: 10.1049/iet-syb.2015.0014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Revised: 08/03/2015] [Accepted: 08/14/2015] [Indexed: 10/08/2023] Open
Abstract
In the field of systems biology, biological reaction networks are usually modelled by ordinary differential equations. A sub-class, the S-systems representation, is a widely used form of modelling. Existing S-systems identification techniques assume that the system itself is always structurally identifiable. However, due to practical limitations, biological reaction networks are often only partially measured. In addition, the captured data only covers a limited trajectory, therefore data can only be considered as a local snapshot of the system responses with respect to the complete set of state trajectories over the entire state space. Hence the estimated model can only reflect partial system dynamics and may not be unique. To improve the identification quality, the structural and practical identifiablility of S-system are studied. The S-system is shown to be identifiable under a set of assumptions. Then, an application on yeast fermentation pathway was conducted. Two case studies were chosen; where the first case is based on a larger state trajectories and the second case is based on a smaller one. By expanding the dataset which span a relatively larger state space, the uncertainty of the estimated system can be reduced. The results indicated that initial concentration is related to the practical identifiablity.
Collapse
Affiliation(s)
- Choujun Zhan
- Department of Electronics Communication and Software Engineering, Nanfang College of Sun Yat-Sen University, Guangdong 510970, People's Republic of China
| | - Benjamin Yee Shing Li
- Department of Electronic Engineering, City University of Hong Kong, Hong Kong, Hong Kong.
| | - Lam Fat Yeung
- Department of Electronic Engineering, City University of Hong Kong, Hong Kong, Hong Kong
| |
Collapse
|
9
|
Castellini A, Paltrinieri D, Manca V. MP-GeneticSynth: inferring biological network regulations from time series. Bioinformatics 2015; 31:785-7. [PMID: 25344496 DOI: 10.1093/bioinformatics/btu694] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MP-GeneticSynth is a Java tool for discovering the logic and regulation mechanisms responsible for observed biological dynamics in terms of finite difference recurrent equations. The software makes use of: (i) metabolic P systems as a modeling framework, (ii) an evolutionary approach to discover flux regulation functions as linear combinations of given primitive functions, (iii) a suitable reformulation of the least squares method to estimate function parameters considering simultaneously all the reactions involved in complex dynamics. The tool is available as a plugin for the virtual laboratory MetaPlab. It has graphical and interactive interfaces for data preparation, a priori knowledge integration, and flux regulator analysis. Availability and implementation: Source code, binaries, documentation (including quick start guide and videos) and case studies are freely available at http://mplab.sci.univr.it/plugins/mpgs/index.html.
Collapse
Affiliation(s)
- Alberto Castellini
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| | - Daniele Paltrinieri
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| | - Vincenzo Manca
- Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy Center for BioMedical Computing, Verona University, 37134 Verona, Italy, Max Planck Institute for Molecular Plant Physiology-Syst Bio & Math Modelling Group, 14476, Germany, University of Potsdam, Institute for Biochemistry and Biology - Bioinformatics Group, 14476, Germany and Department of Computer Science, Verona University, 37134 Verona, Italy
| |
Collapse
|
10
|
Cakır T, Khatibipour MJ. Metabolic network discovery by top-down and bottom-up approaches and paths for reconciliation. Front Bioeng Biotechnol 2014; 2:62. [PMID: 25520953 PMCID: PMC4253960 DOI: 10.3389/fbioe.2014.00062] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 11/14/2014] [Indexed: 11/13/2022] Open
Abstract
The primary focus in the network-centric analysis of cellular metabolism by systems biology approaches is to identify the active metabolic network for the condition of interest. Two major approaches are available for the discovery of the condition-specific metabolic networks. One approach starts from genome-scale metabolic networks, which cover all possible reactions known to occur in the related organism in a condition-independent manner, and applies methods such as the optimization-based Flux-Balance Analysis to elucidate the active network. The other approach starts from the condition-specific metabolome data, and processes the data with statistical or optimization-based methods to extract information content of the data such that the active network is inferred. These approaches, termed bottom-up and top-down, respectively, are currently employed independently. However, considering that both approaches have the same goal, they can both benefit from each other paving the way for the novel integrative analysis methods of metabolome data- and flux-analysis approaches in the post-genomic era. This study reviews the strengths of constraint-based analysis and network inference methods reported in the metabolic systems biology field; then elaborates on the potential paths to reconcile the two approaches to shed better light on how the metabolism functions.
Collapse
Affiliation(s)
- Tunahan Cakır
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey
| | - Mohammad Jafar Khatibipour
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey ; Department of Chemical Engineering, Gebze Technical University (formerly known as Gebze Institute of Technology) , Gebze , Turkey
| |
Collapse
|
11
|
Cazzaniga P, Damiani C, Besozzi D, Colombo R, Nobile MS, Gaglio D, Pescini D, Molinari S, Mauri G, Alberghina L, Vanoni M. Computational strategies for a system-level understanding of metabolism. Metabolites 2014; 4:1034-87. [PMID: 25427076 PMCID: PMC4279158 DOI: 10.3390/metabo4041034] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 11/05/2014] [Accepted: 11/12/2014] [Indexed: 12/20/2022] Open
Abstract
Cell metabolism is the biochemical machinery that provides energy and building blocks to sustain life. Understanding its fine regulation is of pivotal relevance in several fields, from metabolic engineering applications to the treatment of metabolic disorders and cancer. Sophisticated computational approaches are needed to unravel the complexity of metabolism. To this aim, a plethora of methods have been developed, yet it is generally hard to identify which computational strategy is most suited for the investigation of a specific aspect of metabolism. This review provides an up-to-date description of the computational methods available for the analysis of metabolic pathways, discussing their main advantages and drawbacks. In particular, attention is devoted to the identification of the appropriate scale and level of accuracy in the reconstruction of metabolic networks, and to the inference of model structure and parameters, especially when dealing with a shortage of experimental measurements. The choice of the proper computational methods to derive in silico data is then addressed, including topological analyses, constraint-based modeling and simulation of the system dynamics. A description of some computational approaches to gain new biological knowledge or to formulate hypotheses is finally provided.
Collapse
Affiliation(s)
- Paolo Cazzaniga
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Chiara Damiani
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Besozzi
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Riccardo Colombo
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco S Nobile
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Gaglio
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Dario Pescini
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Sara Molinari
- Dipartimento di Biotecnologie e Bioscienze, Università degli Studi di Milano-Bicocca, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Giancarlo Mauri
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Lilia Alberghina
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco Vanoni
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| |
Collapse
|
12
|
Zhan C, Situ W, Yeung LF, Tsang PWM, Yang G. A Parameter Estimation Method for Biological Systems modelled by ODE/DDE Models Using Spline Approximation and Differential Evolution Algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:1066-1076. [PMID: 26357044 DOI: 10.1109/tcbb.2014.2322360] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The inverse problem of identifying unknown parameters of known structure dynamical biological systems, which are modelled by ordinary differential equations or delay differential equations, from experimental data is treated in this paper. A two stage approach is adopted: first, combine spline theory and Nonlinear Programming (NLP), the parameter estimation problem is formulated as an optimization problem with only algebraic constraints; then, a new differential evolution (DE) algorithm is proposed to find a feasible solution. The approach is designed to handle problem of realistic size with noisy observation data. Three cases are studied to evaluate the performance of the proposed algorithm: two are based on benchmark models with priori-determined structure and parameters; the other one is a particular biological system with unknown model structure. In the last case, only a set of observation data available and in this case a nominal model is adopted for the identification. All the test systems were successfully identified by using a reasonable amount of experimental data within an acceptable computation time. Experimental evaluation reveals that the proposed method is capable of fast estimation on the unknown parameters with good precision.
Collapse
|
13
|
Inference of Vohradský's models of genetic networks by solving two-dimensional function optimization problems. PLoS One 2014; 8:e83308. [PMID: 24386175 PMCID: PMC3875442 DOI: 10.1371/journal.pone.0083308] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 11/01/2013] [Indexed: 11/21/2022] Open
Abstract
The inference of a genetic network is a problem in which mutual interactions among genes are inferred from time-series of gene expression levels. While a number of models have been proposed to describe genetic networks, this study focuses on a mathematical model proposed by Vohradský. Because of its advantageous features, several researchers have proposed the inference methods based on Vohradský's model. When trying to analyze large-scale networks consisting of dozens of genes, however, these methods must solve high-dimensional non-linear function optimization problems. In order to resolve the difficulty of estimating the parameters of the Vohradský's model, this study proposes a new method that defines the problem as several two-dimensional function optimization problems. Through numerical experiments on artificial genetic network inference problems, we showed that, although the computation time of the proposed method is not the shortest, the method has the ability to estimate parameters of Vohradský's models more effectively with sufficiently short computation times. This study then applied the proposed method to an actual inference problem of the bacterial SOS DNA repair system, and succeeded in finding several reasonable regulations.
Collapse
|
14
|
Chowdhury AR, Chetty M, Vinh NX. Incorporating time-delays in S-System model for reverse engineering genetic networks. BMC Bioinformatics 2013; 14:196. [PMID: 23777625 PMCID: PMC3839642 DOI: 10.1186/1471-2105-14-196] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2013] [Accepted: 06/07/2013] [Indexed: 11/10/2022] Open
Abstract
Background In any gene regulatory network (GRN), the complex interactions occurring amongst transcription factors and target genes can be either instantaneous or time-delayed. However, many existing modeling approaches currently applied for inferring GRNs are unable to represent both these interactions simultaneously. As a result, all these approaches cannot detect important interactions of the other type. S-System model, a differential equation based approach which has been increasingly applied for modeling GRNs, also suffers from this limitation. In fact, all S-System based existing modeling approaches have been designed to capture only instantaneous interactions, and are unable to infer time-delayed interactions. Results In this paper, we propose a novel Time-Delayed S-System (TDSS) model which uses a set of delay differential equations to represent the system dynamics. The ability to incorporate time-delay parameters in the proposed S-System model enables simultaneous modeling of both instantaneous and time-delayed interactions. Furthermore, the delay parameters are not limited to just positive integer values (corresponding to time stamps in the data), but can also take fractional values. Moreover, we also propose a new criterion for model evaluation exploiting the sparse and scale-free nature of GRNs to effectively narrow down the search space, which not only reduces the computation time significantly but also improves model accuracy. The evaluation criterion systematically adapts the max-min in-degrees and also systematically balances the effect of network accuracy and complexity during optimization. Conclusion The four well-known performance measures applied to the experimental studies on synthetic networks with various time-delayed regulations clearly demonstrate that the proposed method can capture both instantaneous and delayed interactions correctly with high precision. The experiments carried out on two well-known real-life networks, namely IRMA and SOS DNA repair network in Escherichia coli show a significant improvement compared with other state-of-the-art approaches for GRN modeling.
Collapse
Affiliation(s)
- Ahsan Raja Chowdhury
- Gippsland School of Information Technology, Monash University, Churchill, Victoria-3842, Australia.
| | | | | |
Collapse
|
15
|
Using evolutionary computations to understand the design and evolution of gene and cell regulatory networks. Methods 2013; 62:39-55. [PMID: 23726941 DOI: 10.1016/j.ymeth.2013.05.013] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2012] [Revised: 11/30/2012] [Accepted: 05/21/2013] [Indexed: 12/21/2022] Open
Abstract
This paper surveys modeling approaches for studying the evolution of gene regulatory networks (GRNs). Modeling of the design or 'wiring' of GRNs has become increasingly common in developmental and medical biology, as a means of quantifying gene-gene interactions, the response to perturbations, and the overall dynamic motifs of networks. Drawing from developments in GRN 'design' modeling, a number of groups are now using simulations to study how GRNs evolve, both for comparative genomics and to uncover general principles of evolutionary processes. Such work can generally be termed evolution in silico. Complementary to these biologically-focused approaches, a now well-established field of computer science is Evolutionary Computations (ECs), in which highly efficient optimization techniques are inspired from evolutionary principles. In surveying biological simulation approaches, we discuss the considerations that must be taken with respect to: (a) the precision and completeness of the data (e.g. are the simulations for very close matches to anatomical data, or are they for more general exploration of evolutionary principles); (b) the level of detail to model (we proceed from 'coarse-grained' evolution of simple gene-gene interactions to 'fine-grained' evolution at the DNA sequence level); (c) to what degree is it important to include the genome's cellular context; and (d) the efficiency of computation. With respect to the latter, we argue that developments in computer science EC offer the means to perform more complete simulation searches, and will lead to more comprehensive biological predictions.
Collapse
|
16
|
Wu SJ, Wu CT, Chang JY. Adaptive neural-based fuzzy modeling for biological systems. Math Biosci 2013; 242:153-60. [PMID: 23376801 DOI: 10.1016/j.mbs.2013.01.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2012] [Revised: 01/01/2013] [Accepted: 01/11/2013] [Indexed: 11/29/2022]
Abstract
The inverse problem of identifying dynamic biological networks from their time-course response data set is a cornerstone of systems biology. Hill and Michaelis-Menten model, which is a forward approach, provides local kinetic information. However, repeated modifications and a large amount of experimental data are necessary for the parameter identification. S-system model, which is composed of highly nonlinear differential equations, provides the direct identification of an interactive network. However, the identification of skeletal-network structure is challenging. Moreover, biological systems are always subject to uncertainty and noise. Are there suitable candidates with the potential to deal with noise-contaminated data sets? Fuzzy set theory is developed for handing uncertainty, imprecision and complexity in the real world; for example, we say "driving speed is high" wherein speed is a fuzzy variable and high is a fuzzy set, which uses the membership function to indicate the degree of a element belonging to the set (words in Italics to denote fuzzy variables or fuzzy sets). Neural network possesses good robustness and learning capability. In this study we hybrid these two together into a neural-fuzzy modeling technique. A biological system is formulated to a multi-input-multi-output (MIMO) Takagi-Sugeno (T-S) fuzzy system, which is composed of rule-based linear subsystems. Two kinds of smooth membership functions (MFs), Gaussian and Bell-shaped MFs, are used. The performance of the proposed method is tested with three biological systems.
Collapse
Affiliation(s)
- Shinq-Jen Wu
- Department of Electrical Engineering, Da-Yeh University, Chang-Hwa, Taiwan, Republic of China.
| | | | | |
Collapse
|
17
|
Abstract
Biochemical systems theory (BST) is the foundation for a set of analytical andmodeling tools that facilitate the analysis of dynamic biological systems. This paper depicts major developments in BST up to the current state of the art in 2012. It discusses its rationale, describes the typical strategies and methods of designing, diagnosing, analyzing, and utilizing BST models, and reviews areas of application. The paper is intended as a guide for investigators entering the fascinating field of biological systems analysis and as a resource for practitioners and experts.
Collapse
|
18
|
Yang B, Chen Y, Jiang M. Reverse engineering of gene regulatory networks using flexible neural tree models. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.07.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
19
|
|
20
|
Liu LZ, Wu FX, Zhang WJ. Inference of biological S-system using the separable estimation method and the genetic algorithm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:955-965. [PMID: 21968962 DOI: 10.1109/tcbb.2011.126] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Reconstruction of a biological system from its experimental time series data is a challenging task in systems biology. The S-system which consists of a group of nonlinear ordinary differential equations (ODEs) is an effective model to characterize molecular biological systems and analyze the system dynamics. However, inference of S-systems without the knowledge of system structure is not a trivial task due to its nonlinearity and complexity. In this paper, a pruning separable parameter estimation algorithm (PSPEA) is proposed for inferring S-systems. This novel algorithm combines the separable parameter estimation method (SPEM) and a pruning strategy, which includes adding an l₁ regularization term to the objective function and pruning the solution with a threshold value. Then, this algorithm is combined with the continuous genetic algorithm (CGA) to form a hybrid algorithm that owns the properties of these two combined algorithms. The performance of the pruning strategy in the proposed algorithm is evaluated from two aspects: the parameter estimation error and structure identification accuracy. The results show that the proposed algorithm with the pruning strategy has much lower estimation error and much higher identification accuracy than the existing method.
Collapse
Affiliation(s)
- Li-Zhi Liu
- Department of Mechanical Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, Saskatchewan S7N 5A9, Canada.
| | | | | |
Collapse
|
21
|
Yang X, Dent JE, Nardini C. An S-System Parameter Estimation Method (SPEM) for biological networks. J Comput Biol 2012; 19:175-87. [PMID: 22300319 DOI: 10.1089/cmb.2011.0269] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Advances in experimental biology, coupled with advances in computational power, bring new challenges to the interdisciplinary field of computational biology. One such broad challenge lies in the reverse engineering of gene networks, and goes from determining the structure of static networks, to reconstructing the dynamics of interactions from time series data. Here, we focus our attention on the latter area, and in particular, on parameterizing a dynamic network of oriented interactions between genes. By basing the parameterizing approach on a known power-law relationship model between connected genes (S-system), we are able to account for non-linearity in the network, without compromising the ability to analyze network characteristics. In this article, we introduce the S-System Parameter Estimation Method (SPEM). SPEM, a freely available R software package (http://www.picb.ac.cn/ClinicalGenomicNTW/temp3.html), takes gene expression data in time series and returns the network of interactions as a set of differential equations. The methods, which are presented and tested here, are shown to provide accurate results not only on synthetic data, but more importantly on real and therefore noisy by nature, biological data. In summary, SPEM shows high sensitivity and positive predicted values, as well as free availability and expansibility (because based on open source software). We expect these characteristics to make it a useful and broadly applicable software in the challenging reconstruction of dynamic gene networks.
Collapse
Affiliation(s)
- Xinyi Yang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, PR China
| | | | | |
Collapse
|
22
|
Hsiao YT, Lee WP. Inferring robust gene networks from expression data by a sensitivity-based incremental evolution method. BMC Bioinformatics 2012; 13 Suppl 7:S8. [PMID: 22595005 PMCID: PMC3348052 DOI: 10.1186/1471-2105-13-s7-s8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Background Reconstructing gene regulatory networks (GRNs) from expression data is one of the most important challenges in systems biology research. Many computational models and methods have been proposed to automate the process of network reconstruction. Inferring robust networks with desired behaviours remains challenging, however. This problem is related to network dynamics but has yet to be investigated using network modeling. Results We propose an incremental evolution approach for inferring GRNs that takes network robustness into consideration and can deal with a large number of network parameters. Our approach includes a sensitivity analysis procedure to iteratively select the most influential network parameters, and it uses a swarm intelligence procedure to perform parameter optimization. We have conducted a series of experiments to evaluate the external behaviors and internal robustness of the networks inferred by the proposed approach. The results and analyses have verified the effectiveness of our approach. Conclusions Sensitivity analysis is crucial to identifying the most sensitive parameters that govern the network dynamics. It can further be used to derive constraints for network parameters in the network reconstruction process. The experimental results show that the proposed approach can successfully infer robust GRNs with desired system behaviors.
Collapse
Affiliation(s)
- Yu-Ting Hsiao
- Department of Information Management, National Sun Yat-sen University, 70, Lienhai Road, Kaohsiung, Taiwan
| | | |
Collapse
|
23
|
Kimura S, Araki D, Matsumura K, Okada-Hatakeyama M. Inference of S-system models of genetic networks by solving one-dimensional function optimization problems. Math Biosci 2012; 235:161-70. [PMID: 22155075 DOI: 10.1016/j.mbs.2011.11.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2011] [Revised: 10/21/2011] [Accepted: 11/22/2011] [Indexed: 11/17/2022]
Affiliation(s)
- S Kimura
- Graduate School of Engineering, Tottori University, 4-101, Koyama-minami, Tottori 680-8552, Japan.
| | | | | | | |
Collapse
|
24
|
KIMURA SHUHEI, SHIRAISHI YUICHI, OKADA MARIKO. INFERENCE OF GENETIC NETWORKS USING LPMs: ASSESSMENT OF CONFIDENCE VALUES OF REGULATIONS. J Bioinform Comput Biol 2011. [DOI: 10.1142/s0219720010004859] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
When we apply inference methods based on a set of differential equations into actual genetic network inference problems, we often end up with a large number of false-positive regulations. However, as we must check the inferred regulations through biochemical experiments, fewer false-positive regulations are preferable. In order to reduce the number of regulations checked, this study proposes a new method that assigns confidence values to all of the regulations contained in the target network. For this purpose, we combine a residual bootstrap method with the existing method, i.e. the inference method using linear programming machines (LPMs). Through numerical experiments on an artificial genetic network inference problem, we confirmed that most of the regulations with high confidence values are actually present in the target networks. We then used the proposed method to analyze the bacterial SOS DNA repair system, and succeeded in assigning reasonable confidence values to its regulations. Although this study combined the bootstrap method with the inference method using the LPMs, the proposed bootstrap approach could be combined with any method that has an ability to infer a genetic network from time-series of gene expression levels.
Collapse
Affiliation(s)
- SHUHEI KIMURA
- Graduate School of Engineering, Tottori University, 4-101, Koyama-minami, Tottori 680-8552, Japan
| | - YUICHI SHIRAISHI
- Research Center for Allergy and Immunology, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama 230-0045, Japan
| | - MARIKO OKADA
- Research Center for Allergy and Immunology, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama 230-0045, Japan
| |
Collapse
|
25
|
Kentzoglanakis K, Poole M. A swarm intelligence framework for reconstructing gene networks: searching for biologically plausible architectures. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 9:358-371. [PMID: 21576756 DOI: 10.1109/tcbb.2011.87] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
In this paper, we investigate the problem of reverse engineering the topology of gene regulatory networks from temporal gene expression data. We adopt a computational intelligence approach comprising swarm intelligence techniques, namely particle swarm optimization (PSO) and ant colony optimization (ACO). In addition, the recurrent neural network (RNN) formalism is employed for modeling the dynamical behavior of gene regulatory systems. More specifically, ACO is used for searching the discrete space of network architectures and PSO for searching the corresponding continuous space of RNN model parameters. We propose a novel solution construction process in the context of ACO for generating biologically plausible candidate architectures. The objective is to concentrate the search effort into areas of the structure space that contain architectures which are feasible in terms of their topological resemblance to real-world networks. The proposed framework is initially applied to the reconstruction of a small artificial network that has previously been studied in the context of gene network reverse engineering. Subsequently, we consider an artificial data set with added noise for reconstructing a subnetwork of the genetic interaction network of S. cerevisiae (yeast). Finally, the framework is applied to a real-world data set for reverse engineering the SOS response system of the bacterium Escherichia coli. Results demonstrate the relative advantage of utilizing problem-specific knowledge regarding biologically plausible structural properties of gene networks over conducting a problem-agnostic search in the vast space of network architectures.
Collapse
|
26
|
Zhan C, Yeung LF. Parameter estimation in systems biology models using spline approximation. BMC SYSTEMS BIOLOGY 2011; 5:14. [PMID: 21255466 PMCID: PMC3750107 DOI: 10.1186/1752-0509-5-14] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/18/2010] [Accepted: 01/24/2011] [Indexed: 11/24/2022]
Abstract
Background Mathematical models for revealing the dynamics and interactions properties of biological systems play an important role in computational systems biology. The inference of model parameter values from time-course data can be considered as a "reverse engineering" process and is still one of the most challenging tasks. Many parameter estimation methods have been developed but none of these methods is effective for all cases and can overwhelm all other approaches. Instead, various methods have their advantages and disadvantages. It is worth to develop parameter estimation methods which are robust against noise, efficient in computation and flexible enough to meet different constraints. Results Two parameter estimation methods of combining spline theory with Linear Programming (LP) and Nonlinear Programming (NLP) are developed. These methods remove the need for ODE solvers during the identification process. Our analysis shows that the augmented cost function surfaces used in the two proposed methods are smoother; which can ease the optima searching process and hence enhance the robustness and speed of the search algorithm. Moreover, the cores of our algorithms are LP and NLP based, which are flexible and consequently additional constraints can be embedded/removed easily. Eight system biology models are used for testing the proposed approaches. Our results confirm that the proposed methods are both efficient and robust. Conclusions The proposed approaches have general application to identify unknown parameter values of a wide range of systems biology models.
Collapse
Affiliation(s)
- Choujun Zhan
- Department of Electronic Engineering, City University of Hong Kong, PR China.
| | | |
Collapse
|
27
|
Kabir M, Noman N, Iba H. Reverse engineering gene regulatory network from microarray data using linear time-variant model. BMC Bioinformatics 2010; 11 Suppl 1:S56. [PMID: 20122231 PMCID: PMC3009529 DOI: 10.1186/1471-2105-11-s1-s56] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Background Gene regulatory network is an abstract mapping of gene regulations in living cells that can help to predict the system behavior of living organisms. Such prediction capability can potentially lead to the development of improved diagnostic tests and therapeutics. DNA microarrays, which measure the expression level of thousands of genes in parallel, constitute the numeric seed for the inference of gene regulatory networks. In this paper, we have proposed a new approach for inferring gene regulatory networks from time-series gene expression data using linear time-variant model. Here, Self-Adaptive Differential Evolution, a versatile and robust Evolutionary Algorithm, is used as the learning paradigm. Results To assess the potency of the proposed work, a well known nonlinear synthetic network has been used. The reconstruction method has inferred this synthetic network topology and the associated regulatory parameters with high accuracy from both the noise-free and noisy time-series data. For validation purposes, the proposed approach is also applied to the simulated expression dataset of cAMP oscillations in Dictyostelium discoideum and has proved it's strength in finding the correct regulations. The strength of this work has also been verified by analyzing the real expression dataset of SOS DNA repair system in Escherichia coli and it has succeeded in finding more correct and reasonable regulations as compared to various existing works. Conclusion By the proposed approach, the gene interaction networks have been inferred in an efficient manner from both the synthetic, simulated cAMP oscillation expression data and real expression data. The computational time of this approach is also considerably smaller, which makes it to be more suitable for larger network reconstruction. Thus the proposed approach can serve as an initiate for the future researches regarding the associated area.
Collapse
Affiliation(s)
- Mitra Kabir
- Department of Computer Science and Engineering, University of Dhaka, Dhaka, Bangladesh.
| | | | | |
Collapse
|
28
|
Chou IC, Voit EO. Recent developments in parameter estimation and structure identification of biochemical and genomic systems. Math Biosci 2009; 219:57-83. [PMID: 19327372 PMCID: PMC2693292 DOI: 10.1016/j.mbs.2009.03.002] [Citation(s) in RCA: 298] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Revised: 03/06/2009] [Accepted: 03/15/2009] [Indexed: 01/16/2023]
Abstract
The organization, regulation and dynamical responses of biological systems are in many cases too complex to allow intuitive predictions and require the support of mathematical modeling for quantitative assessments and a reliable understanding of system functioning. All steps of constructing mathematical models for biological systems are challenging, but arguably the most difficult task among them is the estimation of model parameters and the identification of the structure and regulation of the underlying biological networks. Recent advancements in modern high-throughput techniques have been allowing the generation of time series data that characterize the dynamics of genomic, proteomic, metabolic, and physiological responses and enable us, at least in principle, to tackle estimation and identification tasks using 'top-down' or 'inverse' approaches. While the rewards of a successful inverse estimation or identification are great, the process of extracting structural and regulatory information is technically difficult. The challenges can generally be categorized into four areas, namely, issues related to the data, the model, the mathematical structure of the system, and the optimization and support algorithms. Many recent articles have addressed inverse problems within the modeling framework of Biochemical Systems Theory (BST). BST was chosen for these tasks because of its unique structural flexibility and the fact that the structure and regulation of a biological system are mapped essentially one-to-one onto the parameters of the describing model. The proposed methods mainly focused on various optimization algorithms, but also on support techniques, including methods for circumventing the time consuming numerical integration of systems of differential equations, smoothing overly noisy data, estimating slopes of time series, reducing the complexity of the inference task, and constraining the parameter search space. Other methods targeted issues of data preprocessing, detection and amelioration of model redundancy, and model-free or model-based structure identification. The total number of proposed methods and their applications has by now exceeded one hundred, which makes it difficult for the newcomer, as well as the expert, to gain a comprehensive overview of available algorithmic options and limitations. To facilitate the entry into the field of inverse modeling within BST and related modeling areas, the article presented here reviews the field and proposes an operational 'work-flow' that guides the user through the estimation process, identifies possibly problematic steps, and suggests corresponding solutions based on the specific characteristics of the various available algorithms. The article concludes with a discussion of the present state of the art and with a description of open questions.
Collapse
Affiliation(s)
- I-Chun Chou
- Integrative BioSystems Institute and The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, 313 Ferst Drive, Atlanta, GA 30332, USA.
| | | |
Collapse
|
29
|
Kimura S, Nakayama S, Hatakeyama M. Genetic network inference as a series of discrimination tasks. ACTA ACUST UNITED AC 2009; 25:918-25. [PMID: 19189976 DOI: 10.1093/bioinformatics/btp072] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Genetic network inference methods based on sets of differential equations generally require a great deal of time, as the equations must be solved many times. To reduce the computational cost, researchers have proposed other methods for inferring genetic networks by solving sets of differential equations only a few times, or even without solving them at all. When we try to obtain reasonable network models using these methods, however, we must estimate the time derivatives of the gene expression levels with great precision. In this study, we propose a new method to overcome the drawbacks of inference methods based on sets of differential equations. RESULTS Our method infers genetic networks by obtaining classifiers capable of predicting the signs of the derivatives of the gene expression levels. For this purpose, we defined a genetic network inference problem as a series of discrimination tasks, then solved the defined series of discrimination tasks with a linear programming machine. Our experimental results demonstrated that the proposed method is capable of correctly inferring genetic networks, and doing so more than 500 times faster than the other inference methods based on sets of differential equations. Next, we applied our method to actual expression data of the bacterial SOS DNA repair system. And finally, we demonstrated that our approach relates to the inference method based on the S-system model. Though our method provides no estimation of the kinetic parameters, it should be useful for researchers interested only in the network structure of a target system. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shuhei Kimura
- Graduate School of Engineering, Tottori University, Koyama-minami, Tottori, Japan.
| | | | | |
Collapse
|
30
|
Gennemark P, Wedelin D. Benchmarks for identification of ordinary differential equations from time series data. ACTA ACUST UNITED AC 2009; 25:780-6. [PMID: 19176548 PMCID: PMC2654804 DOI: 10.1093/bioinformatics/btp050] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Motivation: In recent years, the biological literature has seen a significant increase of reported methods for identifying both structure and parameters of ordinary differential equations (ODEs) from time series data. A natural way to evaluate the performance of such methods is to try them on a sufficient number of realistic test cases. However, weak practices in specifying identification problems and lack of commonly accepted benchmark problems makes it difficult to evaluate and compare different methods. Results: To enable better evaluation and comparisons between different methods, we propose how to specify identification problems as optimization problems with a model space of allowed reactions (e.g. reaction kinetics like Michaelis–Menten or S-systems), ranges for the parameters, time series data and an error function. We also define a file format for such problems. We then present a collection of more than 40 benchmark problems for ODE model identification of cellular systems. The collection includes realistic problems of different levels of difficulty w.r.t. size and quality of data. We consider both problems with simulated data from known systems, and problems with real data. Finally, we present results based on our identification algorithm for all benchmark problems. In comparison with publications on which we have based some of the benchmark problems, our approach allows all problems to be solved without the use of supercomputing. Availability: The benchmark problems are available at www.odeidentification.org Contact:peterg@chalmers.se Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Peter Gennemark
- Department of Mathematical Sciences, University of Göteborg, SE-412 96 Göteborg, Sweden.
| | | |
Collapse
|
31
|
Goel G, Chou IC, Voit EO. System estimation from metabolic time-series data. Bioinformatics 2008; 24:2505-11. [PMID: 18772153 PMCID: PMC2732280 DOI: 10.1093/bioinformatics/btn470] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2008] [Revised: 08/27/2008] [Accepted: 08/29/2008] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION At the center of computational systems biology are mathematical models that capture the dynamics of biological systems and offer novel insights. The bottleneck in the construction of these models is presently the identification of model parameters that make the model consistent with observed data. Dynamic flux estimation (DFE) is a novel methodological framework for estimating parameters for models of metabolic systems from time-series data. DFE consists of two distinct phases, an entirely model-free and assumption-free data analysis and a model-based mathematical characterization of process representations. The model-free phase reveals inconsistencies within the data, and between data and the alleged system topology, while the model-based phase allows quantitative diagnostics of whether--or to what degree--the assumed mathematical formulations are appropriate or in need of improvement. Hallmarks of DFE are the facility to: diagnose data and model consistency; circumvent undue compensation of errors; determine functional representations of fluxes uncontaminated by errors in other fluxes and pinpoint sources of remaining errors. Our results suggest that the proposed approach is more effective and robust than presently available methods for deriving metabolic models from time-series data. Its avoidance of error compensation among process descriptions promises significantly improved extrapolability toward new data or experimental conditions.
Collapse
Affiliation(s)
- Gautam Goel
- Integrative BioSystems Institute and The Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, 313 Ferst Drive, Atlanta, GA 30332, USA
| | | | | |
Collapse
|
32
|
Nakatsui M, Ueda T, Maki Y, Ono I, Okamoto M. Method for inferring and extracting reliable genetic interactions from time-series profile of gene expression. Math Biosci 2008; 215:105-14. [PMID: 18638491 DOI: 10.1016/j.mbs.2008.06.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2007] [Revised: 06/11/2008] [Accepted: 06/13/2008] [Indexed: 11/16/2022]
Affiliation(s)
- Masahiko Nakatsui
- Laboratory for Bioinformatics, Graduate School of Systems Life Sciences, Kyushu University, Fukuoka 812-8581, Japan
| | | | | | | | | |
Collapse
|
33
|
González-Díaz H, González-Díaz Y, Santana L, Ubeira FM, Uriarte E. Proteomics, networks and connectivity indices. Proteomics 2008; 8:750-78. [DOI: 10.1002/pmic.200700638] [Citation(s) in RCA: 170] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
34
|
Floares AG. A reverse engineering algorithm for neural networks, applied to the subthalamopallidal network of basal ganglia. Neural Netw 2008; 21:379-86. [PMID: 18243654 DOI: 10.1016/j.neunet.2007.12.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2007] [Revised: 12/01/2007] [Accepted: 12/03/2007] [Indexed: 10/22/2022]
Abstract
Modeling neural networks with ordinary differential equations systems is a sensible approach, but also very difficult. This paper describes a new algorithm based on linear genetic programming which can be used to reverse engineer neural networks. The RODES algorithm automatically discovers the structure of the network, including neural connections, their signs and strengths, estimates its parameters, and can even be used to identify the biophysical mechanisms involved. The algorithm is tested on simulated time series data, generated using a realistic model of the subthalamopallidal network of basal ganglia. The resulting ODE system is highly accurate, and results are obtained in a matter of minutes. This is because the problem of reverse engineering a system of coupled differential equations is reduced to one of reverse engineering individual algebraic equations. The algorithm allows the incorporation of common domain knowledge to restrict the solution space. To our knowledge, this is the first time a realistic reverse engineering algorithm based on linear genetic programming has been applied to neural networks.
Collapse
Affiliation(s)
- Alexandru George Floares
- Artificial Intelligence Department, Oncological Institute Cluj-Napoca, 400015, Str. Republicii, No 34-36, Cluj-Napoca, Transylvania, Romania.
| |
Collapse
|
35
|
Kimura S, Sonoda K, Yamane S, Maeda H, Matsumura K, Hatakeyama M. Function approximation approach to the inference of reduced NGnet models of genetic networks. BMC Bioinformatics 2008; 9:23. [PMID: 18194576 PMCID: PMC2258286 DOI: 10.1186/1471-2105-9-23] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2007] [Accepted: 01/14/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The inference of a genetic network is a problem in which mutual interactions among genes are deduced using time-series of gene expression patterns. While a number of models have been proposed to describe genetic regulatory networks, this study focuses on a set of differential equations since it has the ability to model dynamic behavior of gene expression. When we use a set of differential equations to describe genetic networks, the inference problem can be defined as a function approximation problem. On the basis of this problem definition, we propose in this study a new method to infer reduced NGnet models of genetic networks. RESULTS Through numerical experiments on artificial genetic network inference problems, we demonstrated that our method has the ability to infer genetic networks correctly and it was faster than the other inference methods. We then applied the proposed method to actual expression data of the bacterial SOS DNA repair system, and succeeded in finding several reasonable regulations. When our method inferred the genetic network from the actual data, it required about 4.7 min on a single-CPU personal computer. CONCLUSION The proposed method has an ability to obtain reasonable networks with a short computational time. As a high performance computer is not always available at every laboratory, the short computational time of our method is a preferable feature. There does not seem to be a perfect model for the inference of genetic networks yet. Therefore, in order to extract reliable information from the observed gene expression data, we should infer genetic networks using multiple inference methods based on different models. Our approach could be used as one of the promising inference methods.
Collapse
Affiliation(s)
- Shuhei Kimura
- Faculty of Engineering, Tottori University, 4-101 Koyama-Minami, Tottori, Japan.
| | | | | | | | | | | |
Collapse
|
36
|
Inferring Gene Regulatory Networks from Expression Data. COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS 2008. [DOI: 10.1007/978-3-540-76803-6_2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
37
|
|