1
|
Wu J, Stewart WCL, Jayaprakash C, Das J. BioNetGMMFit: estimating parameters of a BioNetGen model from time-stamped snapshots of single cells. NPJ Syst Biol Appl 2023; 9:46. [PMID: 37736766 PMCID: PMC10516955 DOI: 10.1038/s41540-023-00299-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 07/31/2023] [Indexed: 09/23/2023] Open
Abstract
Mechanistic models are commonly employed to describe signaling and gene regulatory kinetics in single cells and cell populations. Recent advances in single-cell technologies have produced multidimensional datasets where snapshots of copy numbers (or abundances) of a large number of proteins and mRNA are measured across time in single cells. The availability of such datasets presents an attractive scenario where mechanistic models are validated against experiments, and estimated model parameters enable quantitative predictions of signaling or gene regulatory kinetics. To empower the systems biology community to easily estimate parameters accurately from multidimensional single-cell data, we have merged a widely used rule-based modeling software package BioNetGen, which provides a user-friendly way to code for mechanistic models describing biochemical reactions, and the recently introduced CyGMM, that uses cell-to-cell differences to improve parameter estimation for such networks, into a single software package: BioNetGMMFit. BioNetGMMFit provides parameter estimates of the model, supplied by the user in the BioNetGen markup language (BNGL), which yield the best fit for the observed single-cell, time-stamped data of cellular components. Furthermore, for more precise estimates, our software generates confidence intervals around each model parameter. BioNetGMMFit is capable of fitting datasets of increasing cell population sizes for any mechanistic model specified in the BioNetGen markup language. By streamlining the process of developing mechanistic models for large single-cell datasets, BioNetGMMFit provides an easily-accessible modeling framework designed for scale and the broader biochemical signaling community.
Collapse
Affiliation(s)
- John Wu
- Department of Computer Science, The Ohio State University, 281 W Lane Ave, Columbus, OH, 43210, USA.
- Steve and Cindy Rasmussen Institute for Genomics, The Abigail Wexner Research Institute, Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, 43205, USA.
| | | | - Ciriyam Jayaprakash
- Department of Physics, The Ohio State University, 191 W Woodruff Ave, Columbus, OH, 43210, USA
| | - Jayajit Das
- Steve and Cindy Rasmussen Institute for Genomics, The Abigail Wexner Research Institute, Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, 43205, USA.
- Departments of Pediatrics, Biomedical Informatics, Pelotonia Institute of Immuno-Oncology, College of Medicine, and Biophysics Program, The Ohio State University, 370 W 9th Ave, Columbus, OH, 43210, USA.
| |
Collapse
|
2
|
Gasparyan M, Rao S. Parameter Estimation for Kinetic Models of Chemical Reaction Networks from Partial Experimental Data of Species' Concentrations. Bioengineering (Basel) 2023; 10:1056. [PMID: 37760158 PMCID: PMC10526083 DOI: 10.3390/bioengineering10091056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 08/23/2023] [Accepted: 08/31/2023] [Indexed: 09/29/2023] Open
Abstract
The current manuscript addresses the problem of parameter estimation for kinetic models of chemical reaction networks from observed time series partial experimental data of species concentrations. It is demonstrated how the Kron reduction method of kinetic models, in conjunction with the (weighted) least squares optimization technique, can be used as a tool to solve the above-mentioned ill-posed parameter estimation problem. First, a new trajectory-independent measure is introduced to quantify the dynamical difference between the original mathematical model and the corresponding Kron-reduced model. This measure is then crucially used to estimate the parameters contained in the kinetic model so that the corresponding values of the species' concentrations predicted by the model fit the available experimental data. The new parameter estimation method is tested on two real-life examples of chemical reaction networks: nicotinic acetylcholine receptors and Trypanosoma brucei trypanothione synthetase. Both weighted and unweighted least squares techniques, combined with Kron reduction, are used to find the best-fitting parameter values. The method of leave-one-out cross-validation is utilized to determine the preferred technique. For nicotinic receptors, the training errors due to the application of unweighted and weighted least squares are 3.22 and 3.61 respectively, while for Trypanosoma synthetase, the application of unweighted and weighted least squares result in training errors of 0.82 and 0.70 respectively. Furthermore, the problem of identifiability of dynamical systems, i.e., the possibility of uniquely determining the parameters from certain types of output, has also been addressed.
Collapse
Affiliation(s)
- Manvel Gasparyan
- School of Environmental Engineering, University of Seoul, Seoul 02504, Republic of Korea
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon 21985, Republic of Korea;
- Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Shodhan Rao
- Center for Biosystems and Biotech Data Science, Ghent University Global Campus, Incheon 21985, Republic of Korea;
- Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
3
|
Xu J. SBMLKinetics: a tool for annotation-independent classification of reaction kinetics for SBML models. BMC Bioinformatics 2023; 24:248. [PMID: 37312031 DOI: 10.1186/s12859-023-05380-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 06/07/2023] [Indexed: 06/15/2023] Open
Abstract
BACKGROUND Reaction networks are widely used as mechanistic models in systems biology to reveal principles of biological systems. Reactions are governed by kinetic laws that describe reaction rates. Selecting the appropriate kinetic laws is difficult for many modelers. There exist tools that attempt to find the correct kinetic laws based on annotations. Here, I developed annotation-independent technologies that assist modelers by focusing on finding kinetic laws commonly used for similar reactions. RESULTS Recommending kinetic laws and other analyses of reaction networks can be viewed as a classification problem. Existing approaches to determining similar reactions rely heavily on having good annotations, a condition that is often unsatisfied in model repositories such as BioModels. I developed an annotation-independent approach to find similar reactions via reaction classifications. I proposed a two-dimensional kinetics classification scheme (2DK) that analyzed reactions along the dimensions of kinetics type (K type) and reaction type (R type). I identified approximately ten mutually exclusive K types, including zeroth order, mass action, Michaelis-Menten, Hill kinetics, and others. R types were organized by the number of distinct reactants and the number of distinct products in reactions. I constructed a tool, SBMLKinetics, that inputted a collection of SBML models and then calculated reaction classifications as the probability of each 2DK class. The effectiveness of 2DK was evaluated on BioModels, and the scheme classified over 95% of the reactions. CONCLUSIONS 2DK had many applications. It provided a data-driven annotation-independent approach to recommending kinetic laws by using type common for the kind of models in combination with the R type of the reactions. Alternatively, 2DK could also be used to alert users that a kinetic law was unusual for the K type and R type. Last, 2DK provided a way to analyze groups of models to compare their kinetic laws. I applied 2DK to BioModels to compare the kinetics of signaling networks with the kinetics of metabolic networks and found significant differences in K type distributions.
Collapse
Affiliation(s)
- Jin Xu
- Department of Bioengineering, University of Washington, Seattle, USA.
| |
Collapse
|
4
|
Pasquini M, Stenta M. LinChemIn: SynGraph-a data model and a toolkit to analyze and compare synthetic routes. J Cheminform 2023; 15:41. [PMID: 37005691 PMCID: PMC10067316 DOI: 10.1186/s13321-023-00714-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 03/20/2023] [Indexed: 04/04/2023] Open
Abstract
BACKGROUND The increasing amount of chemical reaction data makes traditional ways to navigate its corpus less effective, while the demand for novel approaches and instruments is rising. Recent data science and machine learning techniques support the development of new ways to extract value from the available reaction data. On the one side, Computer-Aided Synthesis Planning tools can predict synthetic routes in a model-driven approach; on the other side, experimental routes can be extracted from the Network of Organic Chemistry, in which reaction data are linked in a network. In this context, the need to combine, compare and analyze synthetic routes generated by different sources arises naturally. RESULTS Here we present LinChemIn, a python toolkit that allows chemoinformatics operations on synthetic routes and reaction networks. Wrapping some third-party packages for handling graph arithmetic and chemoinformatics and implementing new data models and functionalities, LinChemIn allows the interconversion between data formats and data models and enables route-level analysis and operations, including route comparison and descriptors calculation. Object-Oriented Design principles inspire the software architecture, and the modules are structured to maximize code reusability and support code testing and refactoring. The code structure should facilitate external contributions, thus encouraging open and collaborative software development. CONCLUSIONS The current version of LinChemIn allows users to combine synthetic routes generated from various tools and analyze them, and constitutes an open and extensible framework capable of incorporating contributions from the community and fostering scientific discussion. Our roadmap envisages the development of sophisticated metrics for routes evaluation, a multi-parameter scoring system, and the implementation of an entire "ecosystem" of functionalities operating on synthetic routes. LinChemIn is freely available at https://github.com/syngenta/linchemin.
Collapse
Affiliation(s)
- Marta Pasquini
- Syngenta Crop Protection AG, Schaffhauserstrasse, 4332, Stein, AG, Switzerland.
| | - Marco Stenta
- Syngenta Crop Protection AG, Schaffhauserstrasse, 4332, Stein, AG, Switzerland
| |
Collapse
|
5
|
Swanson JMJ. Multiscale kinetic analysis of proteins. Curr Opin Struct Biol 2022; 72:169-175. [PMID: 34923310 PMCID: PMC9288830 DOI: 10.1016/j.sbi.2021.11.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 10/30/2021] [Accepted: 11/01/2021] [Indexed: 02/03/2023]
Abstract
The stochasticity of molecular motion results in the existence of multiple kinetically relevant pathways in many biomolecular mechanisms. Because it is highly demanding to characterize them for complex systems, mechanisms are often described with a single-pathway perspective. However, kinetic network analysis and sub-ensemble experimental insight are increasingly demonstrating not only the existence of competing pathways but also the importance of kinetic selection in biology. This review focuses on advances in multiscale kinetic analysis of proteins, which connects molecular level information from simulations to macroscopic data to characterize mechanistic reaction networks and the reactive flux through them. We describe a range of methods used and highlight several examples where kinetic modeling has revealed functional importance of pathway heterogeneity.
Collapse
|
6
|
Zhang T, Androulakis IP, Bonate P, Cheng L, Helikar T, Parikh J, Rackauckas C, Subramanian K, Cho CR. Two heads are better than one: current landscape of integrating QSP and machine learning : An ISoP QSP SIG white paper by the working group on the integration of quantitative systems pharmacology and machine learning. J Pharmacokinet Pharmacodyn 2022; 49:5-18. [PMID: 35103884 PMCID: PMC8837505 DOI: 10.1007/s10928-022-09805-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 01/10/2022] [Indexed: 12/02/2022]
Abstract
Quantitative systems pharmacology (QSP) modeling is applied to address essential questions in drug development, such as the mechanism of action of a therapeutic agent and the progression of disease. Meanwhile, machine learning (ML) approaches also contribute to answering these questions via the analysis of multi-layer 'omics' data such as gene expression, proteomics, metabolomics, and high-throughput imaging. Furthermore, ML approaches can also be applied to aspects of QSP modeling. Both approaches are powerful tools and there is considerable interest in integrating QSP modeling and ML. So far, a few successful implementations have been carried out from which we have learned about how each approach can overcome unique limitations of the other. The QSP + ML working group of the International Society of Pharmacometrics QSP Special Interest Group was convened in September, 2019 to identify and begin realizing new opportunities in QSP and ML integration. The working group, which comprises 21 members representing 18 academic and industry organizations, has identified four categories of current research activity which will be described herein together with case studies of applications to drug development decision making. The working group also concluded that the integration of QSP and ML is still in its early stages of moving from evaluating available technical tools to building case studies. This paper reports on this fast-moving field and serves as a foundation for future codification of best practices.
Collapse
Affiliation(s)
- Tongli Zhang
- University of Cincinnati, Cincinnati, OH, 45267, USA.
| | | | | | | | - Tomáš Helikar
- Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, NE, USA
| | | | - Christopher Rackauckas
- Pumas-AI, Baltimore, MD, USA
- Department of Mathematics, Massachusetts Institute of Technology, Boston, MA, USA
| | | | | |
Collapse
|
7
|
Kahana A, Lancet D. Self-reproducing catalytic micelles as nanoscopic protocell precursors. Nat Rev Chem 2021; 5:870-878. [PMID: 37117387 DOI: 10.1038/s41570-021-00329-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2021] [Indexed: 12/31/2022]
Abstract
Protocells at life's origin are often conceived as bilayer-enclosed precursors of life, whose self-reproduction rests on the early advent of replicating catalytic biopolymers. This Perspective describes an alternative scenario, wherein reproducing nanoscopic lipid micelles with catalytic capabilities were forerunners of biopolymer-containing protocells. This postulate gains considerable support from experiments describing micellar catalysis and autocatalytic proliferation, and, more recently, from reports on cross-catalysis in mixed micelles that lead to life-like steady-state dynamics. Such results, along with evidence for micellar prebiotic compatibility, synergize with predictions of our chemically stringent computer-simulated model, illustrating how mutually catalytic lipid networks may enable micellar compositional reproduction that could underlie primal selection and evolution. Finally, we highlight studies on how endogenously catalysed lipid modifications could guide further protocellular complexification, including micelle to vesicle transition and monomer to biopolymer progression. These portrayals substantiate the possibility that protocellular evolution could have been seeded by pre-RNA lipid assemblies.
Collapse
|
8
|
Wulkow N, Telgmann R, Hungenberg K, Schütte C, Wulkow M. Deterministic and Stochastic Parameter Estimation for Polymer Reaction Kinetics I: Theory and Simple Examples. MACROMOL THEOR SIMUL 2021. [DOI: 10.1002/mats.202100017] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Niklas Wulkow
- Freie Universität Berlin Arnimallee 14 Berlin 14195 Germany
| | - Regina Telgmann
- Computing in Technology GmbH Harry‐Wilters‐Ring 27 Rastede 26180 Germany
| | | | | | - Michael Wulkow
- Computing in Technology GmbH Harry‐Wilters‐Ring 27 Rastede 26180 Germany
| |
Collapse
|
9
|
Hsu IS, Moses AM. Stochastic models for single-cell data: Current challenges and the way forward. FEBS J 2021; 289:647-658. [PMID: 33570798 DOI: 10.1111/febs.15760] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 12/22/2020] [Accepted: 02/10/2021] [Indexed: 11/28/2022]
Abstract
Although the quantity and quality of single-cell data have progressed rapidly, making quantitative predictions with single-cell stochastic models remains challenging. The stochastic nature of cellular processes leads to at least three challenges in building models with single-cell data: (a) because variability in single-cell data can be attributed to multiple different sources, it is difficult to rule out conflicting mechanistic models that explain the same data equally well; (b) the distinction between interesting biological variability and experimental variability is sometimes ambiguous; (c) the nonstandard distributions of single-cell data can lead to violations of the assumption of symmetric errors in least-squares fitting. In this review, we first discuss recent studies that overcome some of the challenges or set up a promising direction and then introduce some powerful statistical approaches utilized in these studies. We conclude that applying and developing statistical approaches could lead to further progress in building stochastic models for single-cell data.
Collapse
Affiliation(s)
- Ian S Hsu
- Department of Cell & Systems Biology, University of Toronto, ON, Canada
| | - Alan M Moses
- Department of Cell & Systems Biology, University of Toronto, ON, Canada
| |
Collapse
|
10
|
Vohradsky J, Schwarz M, Ramaniuk O, Ruiz-Larrabeiti O, Vaňková Hausnerová V, Šanderová H, Krásný L. Kinetic Modeling and Meta-Analysis of the Bacillus subtilis SigB Regulon during Spore Germination and Outgrowth. Microorganisms 2021; 9:microorganisms9010112. [PMID: 33466511 PMCID: PMC7824861 DOI: 10.3390/microorganisms9010112] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 12/21/2020] [Accepted: 12/29/2020] [Indexed: 11/16/2022] Open
Abstract
The exponential increase in the number of conducted studies combined with the development of sequencing methods have led to an enormous accumulation of partially processed experimental data in the past two decades. Here, we present an approach using literature-mined data complemented with gene expression kinetic modeling and promoter sequence analysis. This approach allowed us to identify the regulon of Bacillus subtilis sigma factor SigB of RNA polymerase (RNAP) specifically expressed during germination and outgrowth. SigB is critical for the cell's response to general stress but is also expressed during spore germination and outgrowth, and this specific regulon is not known. This approach allowed us to (i) define a subset of the known SigB regulon controlled by SigB specifically during spore germination and outgrowth, (ii) identify the influence of the promoter sequence binding motif organization on the expression of the SigB-regulated genes, and (iii) suggest additional sigma factors co-controlling other SigB-dependent genes. Experiments then validated promoter sequence characteristics necessary for direct RNAP-SigB binding. In summary, this work documents the potential of computational approaches to unravel new information even for a well-studied system; moreover, the study specifically identifies the subset of the SigB regulon, which is activated during germination and outgrowth.
Collapse
Affiliation(s)
- Jiri Vohradsky
- Laboratory of Bioinformatics, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic;
- Correspondence:
| | - Marek Schwarz
- Laboratory of Bioinformatics, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic;
| | - Olga Ramaniuk
- Laboratory of Microbial Genetics and Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic; (O.R.); (O.R.-L.); (V.V.H.); (H.Š.); (L.K.)
| | - Olatz Ruiz-Larrabeiti
- Laboratory of Microbial Genetics and Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic; (O.R.); (O.R.-L.); (V.V.H.); (H.Š.); (L.K.)
- Bacterial Stress Response Research Group, Department of Immunology, Microbiology and Parasitology, University of the Basque Country UPV/EHU, 48940 Leioa, Spain
| | - Viola Vaňková Hausnerová
- Laboratory of Microbial Genetics and Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic; (O.R.); (O.R.-L.); (V.V.H.); (H.Š.); (L.K.)
| | - Hana Šanderová
- Laboratory of Microbial Genetics and Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic; (O.R.); (O.R.-L.); (V.V.H.); (H.Š.); (L.K.)
| | - Libor Krásný
- Laboratory of Microbial Genetics and Gene Expression, Institute of Microbiology of the Czech Academy of Sciences, Vídeňská 1083, 14220 Prague, Czech Republic; (O.R.); (O.R.-L.); (V.V.H.); (H.Š.); (L.K.)
| |
Collapse
|
11
|
Fang Y, Kaszuba T, Imoukhuede PI. Systems Biology Will Direct Vascular-Targeted Therapy for Obesity. Front Physiol 2020; 11:831. [PMID: 32760294 PMCID: PMC7373796 DOI: 10.3389/fphys.2020.00831] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 06/22/2020] [Indexed: 12/12/2022] Open
Abstract
Healthy adipose tissue expansion and metabolism during weight gain require coordinated angiogenesis and lymphangiogenesis. These vascular growth processes rely on the vascular endothelial growth factor (VEGF) family of ligands and receptors (VEGFRs). Several studies have shown that controlling vascular growth by regulating VEGF:VEGFR signaling can be beneficial for treating obesity; however, dysregulated angiogenesis and lymphangiogenesis are associated with several chronic tissue inflammation symptoms, including hypoxia, immune cell accumulation, and fibrosis, leading to obesity-related metabolic disorders. An ideal obesity treatment should minimize adipose tissue expansion and the advent of adverse metabolic consequences, which could be achieved by normalizing VEGF:VEGFR signaling. Toward this goal, a systematic investigation of the interdependency of vascular and metabolic systems in obesity and tools to predict personalized treatment ranges are necessary to improve patient outcomes through vascular-targeted therapies. Systems biology can identify the critical VEGF:VEGFR signaling mechanisms that can be targeted to regress adipose tissue expansion and can predict the metabolic consequences of different vascular-targeted approaches. Establishing a predictive, biologically faithful platform requires appropriate computational models and quantitative tissue-specific data. Here, we discuss the involvement of VEGF:VEGFR signaling in angiogenesis, lymphangiogenesis, adipogenesis, and macrophage specification – key mechanisms that regulate adipose tissue expansion and metabolism. We then provide useful computational approaches for simulating these mechanisms, and detail quantitative techniques for acquiring tissue-specific parameters. Systems biology, through computational models and quantitative data, will enable an accurate representation of obese adipose tissue that can be used to direct the development of vascular-targeted therapies for obesity and associated metabolic disorders.
Collapse
Affiliation(s)
- Yingye Fang
- Imoukhuede Systems Biology Laboratory, Department of Biomedical Engineering, McKelvey School of Engineering, Washington University in St. Louis, St. Louis, MO, United States
| | - Tomasz Kaszuba
- Imoukhuede Systems Biology Laboratory, Department of Biomedical Engineering, McKelvey School of Engineering, Washington University in St. Louis, St. Louis, MO, United States
| | - P I Imoukhuede
- Imoukhuede Systems Biology Laboratory, Department of Biomedical Engineering, McKelvey School of Engineering, Washington University in St. Louis, St. Louis, MO, United States
| |
Collapse
|
12
|
Loskot P, Atitey K, Mihaylova L. Comprehensive Review of Models and Methods for Inferences in Bio-Chemical Reaction Networks. Front Genet 2019; 10:549. [PMID: 31258548 PMCID: PMC6588029 DOI: 10.3389/fgene.2019.00549] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Accepted: 05/24/2019] [Indexed: 01/30/2023] Open
Abstract
The key processes in biological and chemical systems are described by networks of chemical reactions. From molecular biology to biotechnology applications, computational models of reaction networks are used extensively to elucidate their non-linear dynamics. The model dynamics are crucially dependent on the parameter values which are often estimated from observations. Over the past decade, the interest in parameter and state estimation in models of (bio-) chemical reaction networks (BRNs) grew considerably. The related inference problems are also encountered in many other tasks including model calibration, discrimination, identifiability, and checking, and optimum experiment design, sensitivity analysis, and bifurcation analysis. The aim of this review paper is to examine the developments in literature to understand what BRN models are commonly used, and for what inference tasks and inference methods. The initial collection of about 700 documents concerning estimation problems in BRNs excluding books and textbooks in computational biology and chemistry were screened to select over 270 research papers and 20 graduate research theses. The paper selection was facilitated by text mining scripts to automate the search for relevant keywords and terms. The outcomes are presented in tables revealing the levels of interest in different inference tasks and methods for given models in the literature as well as the research trends are uncovered. Our findings indicate that many combinations of models, tasks and methods are still relatively unexplored, and there are many new research opportunities to explore combinations that have not been considered-perhaps for good reasons. The most common models of BRNs in literature involve differential equations, Markov processes, mass action kinetics, and state space representations whereas the most common tasks are the parameter inference and model identification. The most common methods in literature are Bayesian analysis, Monte Carlo sampling strategies, and model fitting to data using evolutionary algorithms. The new research problems which cannot be directly deduced from the text mining data are also discussed.
Collapse
Affiliation(s)
- Pavel Loskot
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Komlan Atitey
- College of Engineering, Swansea University, Swansea, United Kingdom
| | - Lyudmila Mihaylova
- Department of Automatic Control and Systems Engineering, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|