1
|
Mahanty B. Hybrid modeling in bioprocess dynamics: Structural variabilities, implementation strategies, and practical challenges. Biotechnol Bioeng 2023; 120:2072-2091. [PMID: 37458311 DOI: 10.1002/bit.28503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/09/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023]
Abstract
Hybrid modeling, with an appropriate blend of the mechanistic and data-driven framework, is increasingly being adopted in bioprocess modeling, model-based experimental design (digital-twin), identification of critical process parameters, and optimization. However, the development of a hybrid model from experimental data is an inherently complex workflow, involving designed experiments, selection of the data-driven process, identification of model parameters, assessment fitness, and generalization capability. Depending on the complexity of the process system and purpose, each piece of these modules can flexibly be incorporated into the puzzle. However, this extra flexibility can be a cause of concern to trace an "optimal" model structure. In this paper, the development of hybrid models in a common bioprocess system, selection of data-driven components and their mapping to states, choice of parameter identification techniques, and model quality assurance are revisited. The challenges associated with hybrid-model development, and corrective actions have also been reviewed. The review also suggests the lack of data, and code sharing in communal repositories can be a hurdle in the exploration, and expansion of those tools in a bioprocess system.
Collapse
Affiliation(s)
- Biswanath Mahanty
- Department of Biotechnology, Krunya Institute of Technology and Sciences, Coimbatore, Tamil Nadu, India
| |
Collapse
|
2
|
Browning AP, Ansari N, Drovandi C, Johnston APR, Simpson MJ, Jenner AL. Identifying cell-to-cell variability in internalization using flow cytometry. J R Soc Interface 2022; 19:20220019. [PMID: 35611619 PMCID: PMC9131125 DOI: 10.1098/rsif.2022.0019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Biological heterogeneity is a primary contributor to the variation observed in experiments that probe dynamical processes, such as the internalization of material by cells. Given that internalization is a critical process by which many therapeutics and viruses reach their intracellular site of action, quantifying cell-to-cell variability in internalization is of high biological interest. Yet, it is common for studies of internalization to neglect cell-to-cell variability. We develop a simple mathematical model of internalization that captures the dynamical behaviour, cell-to-cell variation, and extrinsic noise introduced by flow cytometry. We calibrate our model through a novel distribution-matching approximate Bayesian computation algorithm to flow cytometry data of internalization of anti-transferrin receptor antibody in a human B-cell lymphoblastoid cell line. This approach provides information relating to the region of the parameter space, and consequentially the nature of cell-to-cell variability, that produces model realizations consistent with the experimental data. Given that our approach is agnostic to sample size and signal-to-noise ratio, our modelling framework is broadly applicable to identify biological variability in single-cell data from internalization assays and similar experiments that probe cellular dynamical processes.
Collapse
Affiliation(s)
- Alexander P Browning
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia.,QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
| | - Niloufar Ansari
- Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, 399 Royal Parade, Parkville, Victoria 3052, Australia
| | - Christopher Drovandi
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia.,ARC Centre of Excellence for Mathematical and Statistical Frontiers, Queensland University of Technology, Brisbane, Australia.,QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
| | - Angus P R Johnston
- Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, 399 Royal Parade, Parkville, Victoria 3052, Australia
| | - Matthew J Simpson
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia.,QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
| | - Adrianne L Jenner
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, Australia.,QUT Centre for Data Science, Queensland University of Technology, Brisbane, Australia
| |
Collapse
|
3
|
Dynamic Modeling of the Impact of Temperature Changes on CO 2 Production during Milk Fermentation in Batch Bioreactors. Foods 2021; 10:foods10081809. [PMID: 34441586 PMCID: PMC8392827 DOI: 10.3390/foods10081809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 08/01/2021] [Accepted: 08/02/2021] [Indexed: 11/17/2022] Open
Abstract
Knowledge of the mathematical models of the fermentation processes is indispensable for their simulation and optimization and for the design and synthesis of the applicable control systems. The paper focuses on determining a dynamic mathematical model of the milk fermentation process taking place in a batch bioreactor. Models in the literature describe milk fermentation in batch bioreactors as an autonomous system. They do not enable the analysis of the effect of temperature changes on the metabolism during fermentation. In the presented extensive multidisciplinary study, we have developed a new mathematical model that considers the impact of temperature changes on the dynamics of the CO2 produced during fermentation in the batch bioreactor. Based on laboratory tests and theoretical analysis, the appropriate structure of the temperature-considered dynamic model was first determined. Next, the model parameters of the fermentation process in the laboratory bioreactor were identified by means of particle swarm optimization. Finally, the experiments with the laboratory batch bioreactor were compared with the simulations to verify the derived mathematical model. The developed model proved to be very suitable for simulations, and, above all, it enables the design and synthesis of a control system for batch bioreactors.
Collapse
|
4
|
Model Parameterization with Quantitative Proteomics: Case Study with Trehalose Metabolism in Saccharomyces cerevisiae. Processes (Basel) 2021. [DOI: 10.3390/pr9010139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
When Saccharomyces cerevisiae undergoes heat stress it stimulates several changes that are necessary for its survival, notably in carbon metabolism. Notable changes include increase in trehalose production and glycolytic flux. The increase in glycolytic flux has been postulated to be due to the regulatory effects in upper glycolysis, but this has not been confirmed. Additionally, trehalose is a useful industrial compound for its protective properties. A model of trehalose metabolism in S. cerevisiae was constructed using Convenient Modeller, a software that uses a combination of convenience kinetics and a genetic algorithm. The model was parameterized with quantitative omics under standard conditions and validated using data collected under heat stress conditions. The completed model was used to show that feedforward activation of pyruvate kinase by fructose 1,6-bisphosphate during heat stress contributes to the increase in metabolic flux. We were also able to demonstrate in silico that overexpression of enzymes involved in production and degradation of trehalose can lead to higher trehalose yield in the cell. By integrating quantitative proteomics with metabolic modelling, we were able to confirm that the flux increase in trehalose metabolic pathways during heat stress is due to regulatory effects and not purely changes in enzyme expression. The overexpression of enzymes involved in trehalose metabolism is a potential approach to be exploited for trehalose production without need for increasing temperature.
Collapse
|
5
|
Yurinskaya MM, Garbuz DG, Afanasiev VN, Evgen’ev MB, Vinokurov MG. Effects of the Hydrogen Sulfide Donor GYY4137 and HSP70 Protein on the Activation of SH-SY5Y Cells by Lipopolysaccharide. Mol Biol 2021. [DOI: 10.1134/s002689332006014x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
6
|
Lee D, Jayaraman A, Kwon JS. Development of a hybrid model for a partially known intracellular signaling pathway through correction term estimation and neural network modeling. PLoS Comput Biol 2020; 16:e1008472. [PMID: 33315899 PMCID: PMC7769624 DOI: 10.1371/journal.pcbi.1008472] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 12/28/2020] [Accepted: 10/26/2020] [Indexed: 12/30/2022] Open
Abstract
Developing an accurate first-principle model is an important step in employing systems biology approaches to analyze an intracellular signaling pathway. However, an accurate first-principle model is difficult to be developed since it requires in-depth mechanistic understandings of the signaling pathway. Since underlying mechanisms such as the reaction network structure are not fully understood, significant discrepancy exists between predicted and actual signaling dynamics. Motivated by these considerations, this work proposes a hybrid modeling approach that combines a first-principle model and an artificial neural network (ANN) model so that predictions of the hybrid model surpass those of the original model. First, the proposed approach determines an optimal subset of model states whose dynamics should be corrected by the ANN by examining the correlation between each state and outputs through relative order. Second, an L2-regularized least-squares problem is solved to infer values of the correction terms that are necessary to minimize the discrepancy between the model predictions and available measurements. Third, an ANN is developed to generalize relationships between the values of the correction terms and the system dynamics. Lastly, the original first-principle model is coupled with the developed ANN to finalize the hybrid model development so that the model will possess generalized prediction capabilities while retaining the model interpretability. We have successfully validated the proposed methodology with two case studies, simplified apoptosis and lipopolysaccharide-induced NFκB signaling pathways, to develop hybrid models with in silico and in vitro measurements, respectively. An intracellular signaling pathway is often represented by a set of nonlinear ordinary differential equations, which translate our current knowledge about the signaling pathway into a testable mathematical model. However, predictions from such models are often subject to high uncertainty since many signaling pathways are only partially known beforehand. In this study, we propose a systematic approach to develop a hybrid model to improve model accuracy by combining machine learning and the first-principle modeling. Specifically, model correction terms are learned from discrepancy between model predictions and measurements, and these terms are added to the first-principle model to enhance the prediction accuracy. Once these correction terms are learned from the data, an artificial neural network (ANN) model is developed to find an empirical relation between the model and the correction terms so that the developed ANN can be used to posses improved predictive capabilities even in new operating conditions (i.e., generalizability). The final hybrid model is then constructed by coupling the first-principle model with the developed ANN.
Collapse
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Department of Biomedical Engineering, Texas A&M University, College Station, Texas, USA
| | - Joseph S. Kwon
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas, USA
- Texas A&M Energy Institute, Texas A&M University, College Station, Texas, USA
- * E-mail:
| |
Collapse
|
7
|
Zhang D, Savage TR, Cho BA. Combining model structure identification and hybrid modelling for photo-production process predictive simulation and optimisation. Biotechnol Bioeng 2020; 117:3356-3367. [PMID: 33616912 DOI: 10.1002/bit.27512] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 07/10/2020] [Accepted: 07/20/2020] [Indexed: 12/12/2022]
Abstract
Integrating physical knowledge and machine learning is a critical aspect of developing industrially focused digital twins for monitoring, optimisation, and design of microalgal and cyanobacterial photo-production processes. However, identifying the correct model structure to quantify the complex biological mechanism poses a severe challenge for the construction of kinetic models, while the lack of data due to the time-consuming experiments greatly impedes applications of most data-driven models. This study proposes the use of an innovative hybrid modelling approach that consists of a simple kinetic model to govern the overall process dynamic trajectory and a data-driven model to estimate mismatch between the kinetic equations and the real process. An advanced automatic model structure identification strategy is adopted to simultaneously identify the most physically probable kinetic model structure and minimum number of data-driven model parameters that can accurately represent multiple data sets over a broad spectrum of process operating conditions. Through this hybrid modelling and automatic structure identification framework, a highly accurate mathematical model was constructed to simulate and optimise an algal lutein production process. Performance of this hybrid model for long-term predictive modelling, optimisation, and online self-calibration is demonstrated and thoroughly discussed, indicating its significant potential for future industrial application.
Collapse
Affiliation(s)
- Dongda Zhang
- Centre for Process Integration, University of Manchester, The Mill, Manchester, UK
| | - Thomas R Savage
- Centre for Process Integration, University of Manchester, The Mill, Manchester, UK
| | - Bovinille A Cho
- Centre for Process Integration, University of Manchester, The Mill, Manchester, UK
| |
Collapse
|
8
|
Lee D, Jayaraman A, Kwon JS. Identification of cell‐to‐cell heterogeneity through systems engineering approaches. AIChE J 2020. [DOI: 10.1002/aic.16925] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| | - Joseph S.‐I. Kwon
- Artie McFerrin Department of Chemical EngineeringTexas A&M University Texas
| |
Collapse
|
9
|
Lee D, Jayaraman A, Sang-Il Kwon J. Identification of a time-varying intracellular signalling model through data clustering and parameter selection: application to NF-[inline-formula removed]B signalling pathway induced by LPS in the presence of BFA. IET Syst Biol 2019; 13:169-179. [PMID: 31318334 PMCID: PMC8687386 DOI: 10.1049/iet-syb.2018.5079] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Revised: 02/07/2019] [Accepted: 02/14/2019] [Indexed: 01/02/2023] Open
Abstract
Developing a model for a signalling pathway requires several iterations of experimentation and model refinement to obtain an accurate model. However, the implementation of such an approach to model a signalling pathway induced by a poorly-known stimulus can become labour intensive because only limited information on the pathway is available beforehand to formulate an initial model. Therefore, a large number of iterations are required since the initial model is likely to be erroneous. In this work, a numerical scheme is proposed to construct a time-varying model for a signalling pathway induced by a poorly-known stimulus when its nominal model is available in the literature. Here, the nominal model refers to one that describes the signalling dynamics under a well-characterised stimulus. First, global sensitivity analysis is implemented on the nominal model to identify the most important parameters, which are assumed to be piecewise constants. Second, measurement data are clustered to determine temporal subdomains where the parameters take different values. Finally, a least-squares problem is solved to estimate the parameter values in each temporal subdomain. The effectiveness of this approach is illustrated by developing a time-varying model for NF-[inline-formula removed]B signalling dynamics induced by lipopolysaccharide in the presence of brefeldin A.
Collapse
Affiliation(s)
- Dongheon Lee
- Texas A&M Energy Institute, Texas A&M University, College Station, TX 77843, USA
| | - Arul Jayaraman
- Department of Biomedical Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Joseph Sang-Il Kwon
- Texas A&M Energy Institute, Texas A&M University, College Station, TX 77843, USA.
| |
Collapse
|
10
|
Ozik J, Collier N, Heiland R, An G, Macklin P. Learning-accelerated discovery of immune-tumour interactions. MOLECULAR SYSTEMS DESIGN & ENGINEERING 2019; 4:747-760. [PMID: 31497314 PMCID: PMC6690424 DOI: 10.1039/c9me00036d] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Accepted: 04/18/2019] [Indexed: 05/04/2023]
Abstract
We present an integrated framework for enabling dynamic exploration of design spaces for cancer immunotherapies with detailed dynamical simulation models on high-performance computing resources. Our framework combines PhysiCell, an open source agent-based simulation platform for cancer and other multicellular systems, and EMEWS, an open source platform for extreme-scale model exploration. We build an agent-based model of immunosurveillance against heterogeneous tumours, which includes spatial dynamics of stochastic tumour-immune contact interactions. We implement active learning and genetic algorithms using high-performance computing workflows to adaptively sample the model parameter space and iteratively discover optimal cancer regression regions within biological and clinical constraints.
Collapse
Affiliation(s)
- Jonathan Ozik
- Decision and Infrastructure Sciences , Argonne National Laboratory , 9700 S. Cass Ave , Lemont , IL 60439 , USA .
- Consortium for Advanced Science and Engineering , University of Chicago , 5801 S. Ellis Ave. , Chicago , IL 60637 , USA
| | - Nicholson Collier
- Decision and Infrastructure Sciences , Argonne National Laboratory , 9700 S. Cass Ave , Lemont , IL 60439 , USA .
- Consortium for Advanced Science and Engineering , University of Chicago , 5801 S. Ellis Ave. , Chicago , IL 60637 , USA
| | - Randy Heiland
- Intelligent Systems Engineering , Indiana University , 700 N. Woodlawn Avenue Bloomington , IN 47408 , USA
| | - Gary An
- The University of Vermont Medical Center , 111 Colchester Avenue , Burlington , VT 05401 , USA
| | - Paul Macklin
- Intelligent Systems Engineering , Indiana University , 700 N. Woodlawn Avenue Bloomington , IN 47408 , USA
| |
Collapse
|
11
|
Amdoun R, Benyoussef EH, Benamghar A, Khelifi L. Prediction of hyoscyamine content in Datura stramonium L. hairy roots using different modeling approaches: Response Surface Methodology (RSM), Artificial Neural Network (ANN) and Kriging. Biochem Eng J 2019. [DOI: 10.1016/j.bej.2019.01.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Special Issue on “Biological Networks”. Processes (Basel) 2018. [DOI: 10.3390/pr6120242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Networks of coordinated interactions among biological entities govern a myriad of biological functions that span a wide range of both length and time scales—from ecosystems to individual cells, and from years (e.g., the life cycle of periodical cicadas) to milliseconds (e.g., allosteric enzymeregulation[...]
Collapse
|
13
|
Kim JH, Lee JM. Successive complementary model-based experimental designs for parameter estimation of fed-batch bioreactors. Bioprocess Biosyst Eng 2018; 41:1767-1777. [PMID: 30099622 DOI: 10.1007/s00449-018-1999-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Accepted: 08/07/2018] [Indexed: 10/28/2022]
Abstract
When a dynamic model is used for the description of (fed-)batch bioreactors, it is typical that the model parameters are highly correlated to each other. In this case, it is important to keep the parameter correlation as small as possible to obtain a reliable set of parameter estimates. In this study, we propose an anticorrelation parameter estimation scheme that can be best utilized when a number of different batch experiments are sequentially processed. The scheme iteratively performs parameter estimation and model-based design of experiment (MBDOE) at the beginning and between the batches. The important difference from the existing approaches is that the MBDOE objective is defined according to the system analysis performed a priori, so that each new batch supplements what is lacking from the previous batches combined, in terms of information. The use of the scheme is illustrated on a fed-batch bioreactor model.
Collapse
Affiliation(s)
- Jung Hun Kim
- School of Chemical and Biological Engineering, Institute of Chemical Processes, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Jong Min Lee
- School of Chemical and Biological Engineering, Institute of Chemical Processes, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea.
| |
Collapse
|
14
|
Lee D, Singla A, Wu HJ, Kwon JSI. An integrated numerical and experimental framework for modeling of CTB and GD1b ganglioside binding kinetics. AIChE J 2018. [DOI: 10.1002/aic.16209] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Dongheon Lee
- Artie McFerrin Dept. of Chemical Engineering; Texas A&M University; College Station TX 77840
| | - Akshi Singla
- Artie McFerrin Dept. of Chemical Engineering; Texas A&M University; College Station TX 77840
| | - Hung-Jen Wu
- Artie McFerrin Dept. of Chemical Engineering; Texas A&M University; College Station TX 77840
| | - Joseph Sang-Il Kwon
- Artie McFerrin Dept. of Chemical Engineering; Texas A&M University; College Station TX 77840
| |
Collapse
|
15
|
Sequential Parameter Estimation for Mammalian Cell Model Based on In Silico Design of Experiments. Processes (Basel) 2018. [DOI: 10.3390/pr6080100] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Due to the complicated metabolism of mammalian cells, the corresponding dynamic mathematical models usually consist of large sets of differential and algebraic equations with a large number of parameters to be estimated. On the other hand, the measured data for estimating the model parameters are limited. Consequently, the parameter estimates may converge to a local minimum far from the optimal ones, especially when the initial guesses of the parameter values are poor. The methodology presented in this paper provides a systematic way for estimating parameters sequentially that generates better initial guesses for parameter estimation and improves the accuracy of the obtained metabolic model. The model parameters are first classified into four subsets of decreasing importance, based on the sensitivity of the model’s predictions on the parameters’ assumed values. The parameters in the most sensitive subset, typically a small fraction of the total, are estimated first. When estimating the remaining parameters with next most sensitive subset, the subsets of parameters with higher sensitivities are estimated again using their previously obtained optimal values as the initial guesses. The power of this sequential estimation approach is illustrated through a case study on the estimation of parameters in a dynamic model of CHO cell metabolism in fed-batch culture. We show that the sequential parameter estimation approach improves model accuracy and that using limited data to estimate low-sensitivity parameters can worsen model performance.
Collapse
|