1
|
Lang PF, Penas DR, Banga JR, Weindl D, Novak B. Reusable rule-based cell cycle model explains compartment-resolved dynamics of 16 observables in RPE-1 cells. PLoS Comput Biol 2024; 20:e1011151. [PMID: 38190398 PMCID: PMC10773963 DOI: 10.1371/journal.pcbi.1011151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 11/24/2023] [Indexed: 01/10/2024] Open
Abstract
The mammalian cell cycle is regulated by a well-studied but complex biochemical reaction system. Computational models provide a particularly systematic and systemic description of the mechanisms governing mammalian cell cycle control. By combining both state-of-the-art multiplexed experimental methods and powerful computational tools, this work aims at improving on these models along four dimensions: model structure, validation data, validation methodology and model reusability. We developed a comprehensive model structure of the full cell cycle that qualitatively explains the behaviour of human retinal pigment epithelial-1 cells. To estimate the model parameters, time courses of eight cell cycle regulators in two compartments were reconstructed from single cell snapshot measurements. After optimisation with a parallel global optimisation metaheuristic we obtained excellent agreements between simulations and measurements. The PEtab specification of the optimisation problem facilitates reuse of model, data and/or optimisation results. Future perturbation experiments will improve parameter identifiability and allow for testing model predictive power. Such a predictive model may aid in drug discovery for cell cycle-related disorders.
Collapse
Affiliation(s)
- Paul F. Lang
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| | - David R. Penas
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Spain
| | - Julio R. Banga
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Spain
| | - Daniel Weindl
- Computational Health Center, Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH), Neuherberg, Germany
| | - Bela Novak
- Department of Biochemistry, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
2
|
Korwek Z, Czerkies M, Jaruszewicz-Błońska J, Prus W, Kosiuk I, Kochańczyk M, Lipniacki T. Nonself RNA rewires IFN-β signaling: A mathematical model of the innate immune response. Sci Signal 2023; 16:eabq1173. [PMID: 38085817 DOI: 10.1126/scisignal.abq1173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 11/22/2023] [Indexed: 12/18/2023]
Abstract
Type I interferons (IFNs) are key coordinators of the innate immune response to viral infection, which, through activation of the transcriptional regulators STAT1 and STAT2 (STAT1/2) in bystander cells, induce the expression of IFN-stimulated genes (ISGs). Here, we showed that in cells transfected with poly(I:C), an analog of viral RNA, the transcriptional activity of STAT1/2 was terminated because of depletion of the interferon-β (IFN-β) receptor, IFNAR. Activation of RNase L and PKR, products of two ISGs, not only hindered the replenishment of IFNAR but also suppressed negative regulators of IRF3 and NF-κB, consequently promoting IFNB transcription. We incorporated these findings into a mathematical model of innate immunity. By coupling signaling through the IRF3-NF-κB and STAT1/2 pathways with the activities of RNase L and PKR, the model explains how poly(I:C) switches the transcriptional program from being STAT1/2 induced to being IRF3 and NF-κB induced, which converts IFN-β-responding cells to IFN-β-secreting cells.
Collapse
Affiliation(s)
- Zbigniew Korwek
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Maciej Czerkies
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Joanna Jaruszewicz-Błońska
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Wiktor Prus
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Ilona Kosiuk
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Marek Kochańczyk
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| | - Tomasz Lipniacki
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw 02-106, Poland
| |
Collapse
|
3
|
Lin L, Zhao Y, Zheng Q, Zhang J, Li H, Wu W. Epigenetic targeting of autophagy for cancer: DNA and RNA methylation. Front Oncol 2023; 13:1290330. [PMID: 38148841 PMCID: PMC10749975 DOI: 10.3389/fonc.2023.1290330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/28/2023] [Indexed: 12/28/2023] Open
Abstract
Autophagy, a crucial cellular mechanism responsible for degradation and recycling of intracellular components, is modulated by an intricate network of molecular signals. Its paradoxical involvement in oncogenesis, acting as both a tumor suppressor and promoter, has been underscored in recent studies. Central to this regulatory network are the epigenetic modifications of DNA and RNA methylation, notably the presence of N6-methyldeoxyadenosine (6mA) in genomic DNA and N6-methyladenosine (m6A) in eukaryotic mRNA. The 6mA modification in genomic DNA adds an extra dimension of epigenetic regulation, potentially impacting the transcriptional dynamics of genes linked to autophagy and, especially, cancer. Conversely, m6A modification, governed by methyltransferases and demethylases, influences mRNA stability, processing, and translation, affecting genes central to autophagic pathways. As we delve deeper into the complexities of autophagy regulation, the importance of these methylation modifications grows more evident. The interplay of 6mA, m6A, and autophagy points to a layered regulatory mechanism, illuminating cellular reactions to a range of conditions. This review delves into the nexus between DNA 6mA and RNA m6A methylation and their influence on autophagy in cancer contexts. By closely examining these epigenetic markers, we underscore their promise as therapeutic avenues, suggesting novel approaches for cancer intervention through autophagy modulation.
Collapse
Affiliation(s)
- Luobin Lin
- Guangdong Province Key Laboratory of Biotechnology Drug Candidates, School of Life Sciences and Biopharmaceuticals, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Yuntao Zhao
- Guangdong Province Key Laboratory of Biotechnology Drug Candidates, School of Life Sciences and Biopharmaceuticals, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Qinzhou Zheng
- Guangdong Province Key Laboratory of Biotechnology Drug Candidates, School of Life Sciences and Biopharmaceuticals, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Jiayang Zhang
- Guangdong Province Key Laboratory of Biotechnology Drug Candidates, School of Life Sciences and Biopharmaceuticals, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| | - Huaqin Li
- School of Health Sciences, Guangzhou Xinhua University, Guangzhou, Guangdong, China
| | - Wenmei Wu
- Guangdong Province Key Laboratory of Biotechnology Drug Candidates, School of Life Sciences and Biopharmaceuticals, Guangdong Pharmaceutical University, Guangzhou, Guangdong, China
| |
Collapse
|
4
|
Wu J, Stewart WCL, Jayaprakash C, Das J. BioNetGMMFit: estimating parameters of a BioNetGen model from time-stamped snapshots of single cells. NPJ Syst Biol Appl 2023; 9:46. [PMID: 37736766 PMCID: PMC10516955 DOI: 10.1038/s41540-023-00299-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 07/31/2023] [Indexed: 09/23/2023] Open
Abstract
Mechanistic models are commonly employed to describe signaling and gene regulatory kinetics in single cells and cell populations. Recent advances in single-cell technologies have produced multidimensional datasets where snapshots of copy numbers (or abundances) of a large number of proteins and mRNA are measured across time in single cells. The availability of such datasets presents an attractive scenario where mechanistic models are validated against experiments, and estimated model parameters enable quantitative predictions of signaling or gene regulatory kinetics. To empower the systems biology community to easily estimate parameters accurately from multidimensional single-cell data, we have merged a widely used rule-based modeling software package BioNetGen, which provides a user-friendly way to code for mechanistic models describing biochemical reactions, and the recently introduced CyGMM, that uses cell-to-cell differences to improve parameter estimation for such networks, into a single software package: BioNetGMMFit. BioNetGMMFit provides parameter estimates of the model, supplied by the user in the BioNetGen markup language (BNGL), which yield the best fit for the observed single-cell, time-stamped data of cellular components. Furthermore, for more precise estimates, our software generates confidence intervals around each model parameter. BioNetGMMFit is capable of fitting datasets of increasing cell population sizes for any mechanistic model specified in the BioNetGen markup language. By streamlining the process of developing mechanistic models for large single-cell datasets, BioNetGMMFit provides an easily-accessible modeling framework designed for scale and the broader biochemical signaling community.
Collapse
Affiliation(s)
- John Wu
- Department of Computer Science, The Ohio State University, 281 W Lane Ave, Columbus, OH, 43210, USA.
- Steve and Cindy Rasmussen Institute for Genomics, The Abigail Wexner Research Institute, Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, 43205, USA.
| | | | - Ciriyam Jayaprakash
- Department of Physics, The Ohio State University, 191 W Woodruff Ave, Columbus, OH, 43210, USA
| | - Jayajit Das
- Steve and Cindy Rasmussen Institute for Genomics, The Abigail Wexner Research Institute, Nationwide Children's Hospital, 700 Children's Drive, Columbus, OH, 43205, USA.
- Departments of Pediatrics, Biomedical Informatics, Pelotonia Institute of Immuno-Oncology, College of Medicine, and Biophysics Program, The Ohio State University, 370 W 9th Ave, Columbus, OH, 43210, USA.
| |
Collapse
|
5
|
Jaruszewicz-Błońska J, Kosiuk I, Prus W, Lipniacki T. A plausible identifiable model of the canonical NF-κB signaling pathway. PLoS One 2023; 18:e0286416. [PMID: 37267242 DOI: 10.1371/journal.pone.0286416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 05/15/2023] [Indexed: 06/04/2023] Open
Abstract
An overwhelming majority of mathematical models of regulatory pathways, including the intensively studied NF-κB pathway, remains non-identifiable, meaning that their parameters may not be determined by existing data. The existing NF-κB models that are capable of reproducing experimental data contain non-identifiable parameters, whereas simplified models with a smaller number of parameters exhibit dynamics that differs from that observed in experiments. Here, we reduced an existing model of the canonical NF-κB pathway by decreasing the number of equations from 15 to 6. The reduced model retains two negative feedback loops mediated by IκBα and A20, and in response to both tonic and pulsatile TNF stimulation exhibits dynamics that closely follow that of the original model. We carried out the sensitivity-based linear analysis and Monte Carlo-based analysis to demonstrate that the resulting model is both structurally and practically identifiable given measurements of 5 model variables from a simple TNF stimulation protocol. The reduced model is capable of reproducing different types of responses that are characteristic to regulatory motifs controlled by negative feedback loops: nearly-perfect adaptation as well as damped and sustained oscillations. It can serve as a building block of more comprehensive models of the immune response and cancer, where NF-κB plays a decisive role. Our approach, although may not be automatically generalized, suggests that models of other regulatory pathways can be transformed to identifiable, while retaining their dynamical features.
Collapse
Affiliation(s)
| | - Ilona Kosiuk
- Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw, Poland
| | - Wiktor Prus
- Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw, Poland
| | - Tomasz Lipniacki
- Institute of Fundamental Technological Research of the Polish Academy of Sciences, Warsaw, Poland
| |
Collapse
|
6
|
Konur S, Gheorghe M, Krasnogor N. Verifiable biology. J R Soc Interface 2023; 20:20230019. [PMID: 37160165 PMCID: PMC10169095 DOI: 10.1098/rsif.2023.0019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 04/17/2023] [Indexed: 05/11/2023] Open
Abstract
The formalization of biological systems using computational modelling approaches as an alternative to mathematical-based methods has recently received much interest because computational models provide a deeper mechanistic understanding of biological systems. In particular, formal verification, complementary approach to standard computational techniques such as simulation, is used to validate the system correctness and obtain critical information about system behaviour. In this study, we survey the most frequently used computational modelling approaches and formal verification techniques for computational biology. We compare a number of verification tools and software suites used to analyse biological systems and biochemical networks, and to verify a wide range of biological properties. For users who have no expertise in formal verification, we present a novel methodology that allows them to easily apply formal verification techniques to analyse their biological or biochemical system of interest.
Collapse
Affiliation(s)
- Savas Konur
- Department of Computer Science, University of Bradford, Richmond Building, Bradford BD7 1DP, UK
| | - Marian Gheorghe
- Department of Computer Science, University of Bradford, Richmond Building, Bradford BD7 1DP, UK
| | - Natalio Krasnogor
- School of Computing Science, Newcastle University, Science Square, Newcastle upon Tyne NE4 5TG, UK
| |
Collapse
|
7
|
Rukhlenko OS, Halasz M, Rauch N, Zhernovkov V, Prince T, Wynne K, Maher S, Kashdan E, MacLeod K, Carragher NO, Kolch W, Kholodenko BN. Control of cell state transitions. Nature 2022; 609:975-985. [PMID: 36104561 PMCID: PMC9644236 DOI: 10.1038/s41586-022-05194-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 08/04/2022] [Indexed: 11/09/2022]
Abstract
Understanding cell state transitions and purposefully controlling them is a longstanding challenge in biology. Here we present cell state transition assessment and regulation (cSTAR), an approach for mapping cell states, modelling transitions between them and predicting targeted interventions to convert cell fate decisions. cSTAR uses omics data as input, classifies cell states, and develops a workflow that transforms the input data into mechanistic models that identify a core signalling network, which controls cell fate transitions by influencing whole-cell networks. By integrating signalling and phenotypic data, cSTAR models how cells manoeuvre in Waddington's landscape1 and make decisions about which cell fate to adopt. Notably, cSTAR devises interventions to control the movement of cells in Waddington's landscape. Testing cSTAR in a cellular model of differentiation and proliferation shows a high correlation between quantitative predictions and experimental data. Applying cSTAR to different types of perturbation and omics datasets, including single-cell data, demonstrates its flexibility and scalability and provides new biological insights. The ability of cSTAR to identify targeted perturbations that interconvert cell fates will enable designer approaches for manipulating cellular development pathways and mechanistically underpinned therapeutic interventions.
Collapse
Affiliation(s)
- Oleksii S Rukhlenko
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Melinda Halasz
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
- Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland
| | - Nora Rauch
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Vadim Zhernovkov
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Thomas Prince
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Kieran Wynne
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Stephanie Maher
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Eugene Kashdan
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
| | - Kenneth MacLeod
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
| | - Neil O Carragher
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
| | - Walter Kolch
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland
- Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland
| | - Boris N Kholodenko
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, Ireland.
- Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Dublin, Ireland.
- Department of Pharmacology, Yale University School of Medicine, New Haven, USA.
| |
Collapse
|
8
|
Eriksson O, Bhalla US, Blackwell KT, Crook SM, Keller D, Kramer A, Linne ML, Saudargienė A, Wade RC, Hellgren Kotaleski J. Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows. eLife 2022; 11:e69013. [PMID: 35792600 PMCID: PMC9259018 DOI: 10.7554/elife.69013] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 05/13/2022] [Indexed: 12/22/2022] Open
Abstract
Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data - such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles - also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
| | - Upinder Singh Bhalla
- National Center for Biological Sciences, Tata Institute of Fundamental ResearchBangaloreIndia
| | - Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason UniversityFairfaxUnited States
| | - Sharon M Crook
- School of Mathematical and Statistical Sciences, Arizona State UniversityTempeUnited States
| | - Daniel Keller
- Blue Brain Project, École Polytechnique Fédérale de LausanneLausanneSwitzerland
| | - Andrei Kramer
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| | - Marja-Leena Linne
- Faculty of Medicine and Health Technology, Tampere UniversityTampereFinland
| | - Ausra Saudargienė
- Neuroscience Institute, Lithuanian University of Health SciencesKaunasLithuania
- Department of Informatics, Vytautas Magnus UniversityKaunasLithuania
| | - Rebecca C Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
- Center for Molecular Biology (ZMBH), ZMBH-DKFZ Alliance, University of HeidelbergHeidelbergGermany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg UniversityHeidelbergGermany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| |
Collapse
|
9
|
Neumann J, Lin YT, Mallela A, Miller EF, Colvin J, Duprat AT, Chen Y, Hlavacek WS, Posner RG. Implementation of a practical Markov chain Monte Carlo sampling algorithm in PyBioNetFit. Bioinformatics 2022; 38:1770-1772. [PMID: 34986226 DOI: 10.1093/bioinformatics/btac004] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 11/30/2021] [Accepted: 01/03/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Bayesian inference in biological modeling commonly relies on Markov chain Monte Carlo (MCMC) sampling of a multidimensional and non-Gaussian posterior distribution that is not analytically tractable. Here, we present the implementation of a practical MCMC method in the open-source software package PyBioNetFit (PyBNF), which is designed to support parameterization of mathematical models for biological systems. The new MCMC method, am, incorporates an adaptive move proposal distribution. For warm starts, sampling can be initiated at a specified location in parameter space and with a multivariate Gaussian proposal distribution defined initially by a specified covariance matrix. Multiple chains can be generated in parallel using a computer cluster. We demonstrate that am can be used to successfully solve real-world Bayesian inference problems, including forecasting of new Coronavirus Disease 2019 case detection with Bayesian quantification of forecast uncertainty. AVAILABILITY AND IMPLEMENTATION PyBNF version 1.1.9, the first stable release with am, is available at PyPI and can be installed using the pip package-management system on platforms that have a working installation of Python 3. PyBNF relies on libRoadRunner and BioNetGen for simulations (e.g. numerical integration of ordinary differential equations defined in SBML or BNGL files) and Dask.Distributed for task scheduling on Linux computer clusters. The Python source code can be freely downloaded/cloned from GitHub and used and modified under terms of the BSD-3 license (https://github.com/lanl/pybnf). Online documentation covering installation/usage is available (https://pybnf.readthedocs.io/en/latest/). A tutorial video is available on YouTube (https://www.youtube.com/watch?v=2aRqpqFOiS4&t=63s). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jacob Neumann
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
| | - Yen Ting Lin
- Information Sciences Group, Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Abhishek Mallela
- Department of Mathematics, University of California, Davis, CA 95616, USA
| | - Ely F Miller
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
| | - Joshua Colvin
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
| | - Abell T Duprat
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
| | - Ye Chen
- Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, AZ 86011, USA
| | - William S Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - Richard G Posner
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA
| |
Collapse
|
10
|
Mallela A, Neumann J, Miller EF, Chen Y, Posner RG, Lin YT, Hlavacek WS. Bayesian Inference of State-Level COVID-19 Basic Reproduction Numbers across the United States. Viruses 2022; 14:157. [PMID: 35062361 PMCID: PMC8780010 DOI: 10.3390/v14010157] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/08/2022] [Accepted: 01/12/2022] [Indexed: 02/05/2023] Open
Abstract
Although many persons in the United States have acquired immunity to COVID-19, either through vaccination or infection with SARS-CoV-2, COVID-19 will pose an ongoing threat to non-immune persons so long as disease transmission continues. We can estimate when sustained disease transmission will end in a population by calculating the population-specific basic reproduction number ℛ0, the expected number of secondary cases generated by an infected person in the absence of any interventions. The value of ℛ0 relates to a herd immunity threshold (HIT), which is given by 1-1/ℛ0. When the immune fraction of a population exceeds this threshold, sustained disease transmission becomes exponentially unlikely (barring mutations allowing SARS-CoV-2 to escape immunity). Here, we report state-level ℛ0 estimates obtained using Bayesian inference. Maximum a posteriori estimates range from 7.1 for New Jersey to 2.3 for Wyoming, indicating that disease transmission varies considerably across states and that reaching herd immunity will be more difficult in some states than others. ℛ0 estimates were obtained from compartmental models via the next-generation matrix approach after each model was parameterized using regional daily confirmed case reports of COVID-19 from 21 January 2020 to 21 June 2020. Our ℛ0 estimates characterize the infectiousness of ancestral strains, but they can be used to determine HITs for a distinct, currently dominant circulating strain, such as SARS-CoV-2 variant Delta (lineage B.1.617.2), if the relative infectiousness of the strain can be ascertained. On the basis of Delta-adjusted HITs, vaccination data, and seroprevalence survey data, we found that no state had achieved herd immunity as of 20 September 2021.
Collapse
Affiliation(s)
- Abhishek Mallela
- Department of Mathematics, University of California, Davis, CA 95616, USA;
| | - Jacob Neumann
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA; (J.N.); (E.F.M.); (R.G.P.)
| | - Ely F. Miller
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA; (J.N.); (E.F.M.); (R.G.P.)
| | - Ye Chen
- Department of Mathematics and Statistics, Northern Arizona University, Flagstaff, AZ 86011, USA;
| | - Richard G. Posner
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ 86011, USA; (J.N.); (E.F.M.); (R.G.P.)
| | - Yen Ting Lin
- Los Alamos National Laboratory, Information Sciences Group, Computer, Computational and Statistical Sciences Division, Los Alamos, NM 87545, USA;
| | - William S. Hlavacek
- Los Alamos National Laboratory, Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos, NM 87545, USA
| |
Collapse
|
11
|
Schmiester L, Weindl D, Hasenauer J. Efficient gradient-based parameter estimation for dynamic models using qualitative data. BIOINFORMATICS (OXFORD, ENGLAND) 2021. [PMID: 34260697 DOI: 10.1101/2021.02.06.430039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
MOTIVATION Unknown parameters of dynamical models are commonly estimated from experimental data. However, while various efficient optimization and uncertainty analysis methods have been proposed for quantitative data, methods for qualitative data are rare and suffer from bad scaling and convergence. RESULTS Here, we propose an efficient and reliable framework for estimating the parameters of ordinary differential equation models from qualitative data. In this framework, we derive a semi-analytical algorithm for gradient calculation of the optimal scaling method developed for qualitative data. This enables the use of efficient gradient-based optimization algorithms. We demonstrate that the use of gradient information improves performance of optimization and uncertainty quantification on several application examples. On average, we achieve a speedup of more than one order of magnitude compared to gradient-free optimization. In addition, in some examples, the gradient-based approach yields substantially improved objective function values and quality of the fits. Accordingly, the proposed framework substantially improves the parameterization of models from qualitative data. AVAILABILITY AND IMPLEMENTATION The proposed approach is implemented in the open-source Python Parameter EStimation TOolbox (pyPESTO). pyPESTO is available at https://github.com/ICB-DCM/pyPESTO. All application examples and code to reproduce this study are available at https://doi.org/10.5281/zenodo.4507613. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg 85764, Germany
- Center for Mathematics, Technische Universität München, Garching 85748, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn 53113, Germany
| |
Collapse
|
12
|
Mallela A, Neumann J, Miller EF, Chen Y, Posner RG, Lin YT, Hlavacek WS. Bayesian Inference of State-Level COVID-19 Basic Reproduction Numbers across the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021. [PMID: 34611664 DOI: 10.1101/2021.09.27.21264188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Although many persons in the United States have acquired immunity to COVID-19, either through vaccination or infection with SARS-CoV-2, COVID-19 will pose an ongoing threat to non-immune persons so long as disease transmission continues. We can estimate when sustained disease transmission will end in a population by calculating the population-specific basic reproduction number ℛ 0 , the expected number of secondary cases generated by an infected person in the absence of any interventions. The value of ℛ 0 relates to a herd immunity threshold (HIT), which is given by 1 - 1/ℛ 0 . When the immune fraction of a population exceeds this threshold, sustained disease transmission becomes exponentially unlikely (barring mutations allowing SARS-CoV-2 to escape immunity). Here, we report state-level ℛ 0 estimates obtained using Bayesian inference. Maximum a posteriori estimates range from 7.1 for New Jersey to 2.3 for Wyoming, indicating that disease transmission varies considerably across states and that reaching herd immunity will be more difficult in some states than others. ℛ 0 estimates were obtained from compartmental models via the next-generation matrix approach after each model was parameterized using regional daily confirmed case reports of COVID-19 from 21-January-2020 to 21-June-2020. Our ℛ 0 estimates characterize infectiousness of ancestral strains, but they can be used to determine HITs for a distinct, currently dominant circulating strain, such as SARS-CoV-2 variant Delta (lineage B.1.617.2), if the relative infectiousness of the strain can be ascertained. On the basis of Delta-adjusted HITs, vaccination data, and seroprevalence survey data, we find that no state has achieved herd immunity as of 20-September-2021. Significance Statement COVID-19 will continue to threaten non-immune persons in the presence of ongoing disease transmission. We can estimate when sustained disease transmission will end by calculating the population-specific basic reproduction number ℛ 0 , which relates to a herd immunity threshold (HIT), given by 1 - 1/ℛ 0 . When the immune fraction of a population exceeds this threshold, sustained disease transmission becomes exponentially unlikely. Here, we report state-level ℛ 0 estimates indicating that disease transmission varies considerably across states. Our ℛ 0 estimates can also be used to determine HITs for the Delta variant of COVID-19. On the basis of Delta-adjusted HITs, vaccination data, and serological survey results, we find that no state has yet achieved herd immunity.
Collapse
|
13
|
Lubbock AL, Lopez CF. Programmatic modeling for biological systems. CURRENT OPINION IN SYSTEMS BIOLOGY 2021; 27:100343. [PMID: 34485764 PMCID: PMC8411905 DOI: 10.1016/j.coisb.2021.05.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
Computational modeling has become an established technique to encode mathematical representations of cellular processes and gain mechanistic insights that drive testable predictions. These models are often constructed using graphical user interfaces or domain-specific languages, with community standards used for interchange. Models undergo steady state or dynamic analysis, which can include simulation and calibration within a single application, or transfer across various tools. Here, we describe a novel programmatic modeling paradigm, whereby modeling is augmented with software engineering best practices. We focus on Python - a popular programming language with a large scientific package ecosystem. Models can be encoded as programs, adding benefits such as modularity, testing, and automated documentation generators, while still being extensible and exportable to standardized formats for use with external tools if desired. Programmatic modeling is a key technology to enable collaborative model development and enhance dissemination, transparency, and reproducibility.
Collapse
Affiliation(s)
- Alexander L.R. Lubbock
- Department of Biochemistry, Vanderbilt University, Nashville, Tennessee 37212, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville Tennessee 37212, United States of America
| | - Carlos F. Lopez
- Department of Biochemistry, Vanderbilt University, Nashville, Tennessee 37212, United States of America
- Vanderbilt-Ingram Cancer Center, Vanderbilt University, Nashville Tennessee 37212, United States of America
- Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee 37212, United States of America
| |
Collapse
|
14
|
Budde K, Smith J, Wilsdorf P, Haack F, Uhrmacher AM. Relating simulation studies by provenance-Developing a family of Wnt signaling models. PLoS Comput Biol 2021; 17:e1009227. [PMID: 34351901 PMCID: PMC8407594 DOI: 10.1371/journal.pcbi.1009227] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 08/31/2021] [Accepted: 06/29/2021] [Indexed: 12/28/2022] Open
Abstract
For many biological systems, a variety of simulation models exist. A new simulation model is rarely developed from scratch, but rather revises and extends an existing one. A key challenge, however, is to decide which model might be an appropriate starting point for a particular problem and why. To answer this question, we need to identify entities and activities that contributed to the development of a simulation model. Therefore, we exploit the provenance data model, PROV-DM, of the World Wide Web Consortium and, building on previous work, continue developing a PROV ontology for simulation studies. Based on a case study of 19 Wnt/β-catenin signaling models, we identify crucial entities and activities as well as useful metadata to both capture the provenance information from individual simulation studies and relate these forming a family of models. The approach is implemented in WebProv, a web application for inserting and querying provenance information. Our specialization of PROV-DM contains the entities Research Question, Assumption, Requirement, Qualitative Model, Simulation Model, Simulation Experiment, Simulation Data, and Wet-lab Data as well as activities referring to building, calibrating, validating, and analyzing a simulation model. We show that most Wnt simulation models are connected to other Wnt models by using (parts of) these models. However, the overlap, especially regarding the Wet-lab Data used for calibration or validation of the models is small. Making these aspects of developing a model explicit and queryable is an important step for assessing and reusing simulation models more effectively. Exposing this information helps to integrate a new simulation model within a family of existing ones and may lead to the development of more robust and valid simulation models. We hope that our approach becomes part of a standardization effort and that modelers adopt the benefits of provenance when considering or creating simulation models. We revise a provenance ontology for simulation studies of cellular biochemical models. Provenance information is useful for understanding the creation of a simulation model because it not only contains information about the entities and activities that have led to a simulation model but also their relations, all of which can be visualized. It provides additional structure by explicitly recording research questions, assumptions, and requirements and relating them along with data, qualitative models, simulation models, and simulation experiments through a small set of predefined but extensible activities. We have applied our concept to a family of 19 Wnt signaling models and implemented a web-based tool (WebProv) to store the provenance information from these studies. The resulting provenance graph visualizes the story line of simulation studies and demonstrates the creation and calibration of simulation models, the successive attempts of validation and extension, and shows, beyond an individual simulation study, how the Wnt models are related. Thereby, the steps and sources that contributed to a simulation model are made explicit. Our approach complements other approaches aimed at facilitating the reuse and assessment of simulation products in systems biology such as model repositories as well as annotation and documentation guidelines.
Collapse
Affiliation(s)
- Kai Budde
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
- * E-mail:
| | - Jacob Smith
- Faculty of Computer Science, University of New Brunswick, Fredericton, Canada
| | - Pia Wilsdorf
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| | - Fiete Haack
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| | - Adelinde M. Uhrmacher
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| |
Collapse
|
15
|
Rukhlenko OS, Kholodenko BN. Modeling the Nonlinear Dynamics of Intracellular Signaling Networks. Bio Protoc 2021; 11:e4089. [PMID: 34395728 PMCID: PMC8329461 DOI: 10.21769/bioprotoc.4089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Revised: 04/09/2021] [Accepted: 05/28/2021] [Indexed: 11/17/2022] Open
Abstract
This protocol illustrates a pipeline for modeling the nonlinear behavior of intracellular signaling pathways. At fixed spatial points, nonlinear signaling dynamics are described by ordinary differential equations (ODEs). At constant parameters, these ODEs may have multiple attractors, such as multiple steady states or limit cycles. Standard optimization procedures fine-tune the parameters for the system trajectories localized within the basin of attraction of only one attractor, usually a stable steady state. The suggested protocol samples the parameter space and captures the overall dynamic behavior by analyzing the number and stability of steady states and the shapes of the assembly of nullclines, which are determined as projections of quasi-steady-state trajectories into different 2D spaces of system variables. Our pipeline allows identifying main qualitative features of the model behavior, perform bifurcation analysis, and determine the borders separating the different dynamical regimes within the assembly of 2D parametric planes. Partial differential equation (PDE) systems describing the nonlinear spatiotemporal behavior are derived by coupling fixed point dynamics with species diffusion.
Collapse
Affiliation(s)
- Oleksii S. Rukhlenko
- Systems Biology Ireland, School of Medicine and Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Boris N. Kholodenko
- Systems Biology Ireland, School of Medicine and Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland
- Department of Pharmacology, Yale University School of Medicine, New Haven, USA
| |
Collapse
|
16
|
Schmiester L, Weindl D, Hasenauer J. Efficient gradient-based parameter estimation for dynamic models using qualitative data. Bioinformatics 2021; 37:4493-4500. [PMID: 34260697 PMCID: PMC8652033 DOI: 10.1093/bioinformatics/btab512] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Revised: 07/02/2021] [Accepted: 07/08/2021] [Indexed: 11/22/2022] Open
Abstract
Motivation Unknown parameters of dynamical models are commonly estimated from experimental data. However, while various efficient optimization and uncertainty analysis methods have been proposed for quantitative data, methods for qualitative data are rare and suffer from bad scaling and convergence. Results Here, we propose an efficient and reliable framework for estimating the parameters of ordinary differential equation models from qualitative data. In this framework, we derive a semi-analytical algorithm for gradient calculation of the optimal scaling method developed for qualitative data. This enables the use of efficient gradient-based optimization algorithms. We demonstrate that the use of gradient information improves performance of optimization and uncertainty quantification on several application examples. On average, we achieve a speedup of more than one order of magnitude compared to gradient-free optimization. In addition, in some examples, the gradient-based approach yields substantially improved objective function values and quality of the fits. Accordingly, the proposed framework substantially improves the parameterization of models from qualitative data. Availability and implementation The proposed approach is implemented in the open-source Python Parameter EStimation TOolbox (pyPESTO). pyPESTO is available at https://github.com/ICB-DCM/pyPESTO. All application examples and code to reproduce this study are available at https://doi.org/10.5281/zenodo.4507613. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany.,Center for Mathematics, Technische Universität München, Garching, 85748, Germany
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, 85764, Germany.,Center for Mathematics, Technische Universität München, Garching, 85748, Germany.,Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, 53113, Germany
| |
Collapse
|
17
|
Porubsky V, Smith L, Sauro HM. Publishing reproducible dynamic kinetic models. Brief Bioinform 2021; 22:bbaa152. [PMID: 32793969 PMCID: PMC8138891 DOI: 10.1093/bib/bbaa152] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Revised: 05/19/2020] [Accepted: 06/17/2020] [Indexed: 11/14/2022] Open
Abstract
Publishing repeatable and reproducible computational models is a crucial aspect of the scientific method in computational biology and one that is often forgotten in the rush to publish. The pressures of academic life and the lack of any reward system at institutions, granting agencies and journals means that publishing reproducible science is often either non-existent or, at best, presented in the form of an incomplete description. In the article, we will focus on repeatability and reproducibility in the systems biology field where a great many published models cannot be reproduced and in many cases even repeated. This review describes the current landscape of software tooling, model repositories, model standards and best practices for publishing repeatable and reproducible kinetic models. The review also discusses possible future remedies including working more closely with journals to help reviewers and editors ensure that published kinetic models are at minimum, repeatable. Contact: hsauro@uw.edu.
Collapse
Affiliation(s)
- Veronica Porubsky
- Department of Bioengineering, University of Washington, Seattle, 98105,USA
| | - Lucian Smith
- Department of Bioengineering, University of Washington, Seattle, 98105,USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, 98105,USA
| |
Collapse
|
18
|
Santibáñez R, Garrido D, Martin AJM. Atlas: automatic modeling of regulation of bacterial gene expression and metabolism using rule-based languages. Bioinformatics 2021; 36:5473-5480. [PMID: 33367504 PMCID: PMC8016457 DOI: 10.1093/bioinformatics/btaa1040] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Revised: 11/19/2020] [Accepted: 12/12/2020] [Indexed: 12/31/2022] Open
Abstract
MOTIVATION Cells are complex systems composed of hundreds of genes whose products interact to produce elaborated behaviors. To control such behaviors, cells rely on transcription factors to regulate gene expression, and gene regulatory networks (GRNs) are employed to describe and understand such behavior. However, GRNs are static models, and dynamic models are difficult to obtain due to their size, complexity, stochastic dynamics and interactions with other cell processes. RESULTS We developed Atlas, a Python software that converts genome graphs and gene regulatory, interaction and metabolic networks into dynamic models. The software employs these biological networks to write rule-based models for the PySB framework. The underlying method is a divide-and-conquer strategy to obtain sub-models and combine them later into an ensemble model. To exemplify the utility of Atlas, we used networks of varying size and complexity of Escherichia coli and evaluated in silico modifications, such as gene knockouts and the insertion of promoters and terminators. Moreover, the methodology could be applied to the dynamic modeling of natural and synthetic networks of any bacteria. AVAILABILITY AND IMPLEMENTATION Code, models and tutorials are available online (https://github.com/networkbiolab/atlas). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rodrigo Santibáñez
- Laboratorio de Biología de Redes, Centro de Genómica y Bioinformática, Universidad Mayor, Santiago 8580745, Chile
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820436, Chile
| | - Daniel Garrido
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820436, Chile
| | - Alberto J M Martin
- Laboratorio de Biología de Redes, Centro de Genómica y Bioinformática, Universidad Mayor, Santiago 8580745, Chile
| |
Collapse
|
19
|
Jiang R, Jacob B, Geiger M, Matthew S, Rumsey B, Singh P, Wrede F, Yi TM, Drawert B, Hellander A, Petzold L. Epidemiological modeling in StochSS Live! Bioinformatics 2021; 37:2787-2788. [PMID: 33512399 PMCID: PMC7929247 DOI: 10.1093/bioinformatics/btab061] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 01/20/2021] [Accepted: 01/26/2021] [Indexed: 11/16/2022] Open
Abstract
Summary We present StochSS Live!, a web-based service for modeling, simulation and analysis of a wide range of mathematical, biological and biochemical systems. Using an epidemiological model of COVID-19, we demonstrate the power of StochSS Live! to enable researchers to quickly develop a deterministic or a discrete stochastic model, infer its parameters and analyze the results. Availability and implementation StochSS Live! is freely available at https://live.stochss.org/ Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Richard Jiang
- Department of Computer Science, University of California-Santa Barbara, Santa Barbara, CA, 93117, USA
| | - Bruno Jacob
- Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| | - Matthew Geiger
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Sean Matthew
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Bryan Rumsey
- National Environmental Modeling and Analysis Center (NEMAC), University of North Carolina at Asheville, Asheville, NC, 28804
| | - Prashant Singh
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Fredrik Wrede
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Tau-Mu Yi
- Department of Molecular, Cellular, and Developmental Biology, University of California-Santa Barbara, Santa Barbara, CA 93106, USA
| | - Brian Drawert
- Department of Computer Science, University of North Carolina at Asheville, Asheville, NC, 28804, USA
| | - Andreas Hellander
- Department of Information Technology, Uppsala University, Box 337, 751 05 Uppsala, Sweden
| | - Linda Petzold
- Department of Computer Science, University of California-Santa Barbara, Santa Barbara, CA, 93117, USA.,Department of Mechanical Engineering, University of California-Santa Barbara, Santa Barbara, California, 93106, USA
| |
Collapse
|
20
|
Schmiester L, Schälte Y, Bergmann FT, Camba T, Dudkin E, Egert J, Fröhlich F, Fuhrmann L, Hauber AL, Kemmer S, Lakrisenko P, Loos C, Merkt S, Müller W, Pathirana D, Raimúndez E, Refisch L, Rosenblatt M, Stapor PL, Städter P, Wang D, Wieland FG, Banga JR, Timmer J, Villaverde AF, Sahle S, Kreutz C, Hasenauer J, Weindl D. PEtab-Interoperable specification of parameter estimation problems in systems biology. PLoS Comput Biol 2021; 17:e1008646. [PMID: 33497393 PMCID: PMC7864467 DOI: 10.1371/journal.pcbi.1008646] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 02/05/2021] [Accepted: 12/18/2020] [Indexed: 01/24/2023] Open
Abstract
Reproducibility and reusability of the results of data-based modeling studies are essential. Yet, there has been-so far-no broadly supported format for the specification of parameter estimation problems in systems biology. Here, we introduce PEtab, a format which facilitates the specification of parameter estimation problems using Systems Biology Markup Language (SBML) models and a set of tab-separated value files describing the observation model and experimental data as well as parameters to be estimated. We already implemented PEtab support into eight well-established model simulation and parameter estimation toolboxes with hundreds of users in total. We provide a Python library for validation and modification of a PEtab problem and currently 20 example parameter estimation problems based on recent studies.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | - Yannik Schälte
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | | | - Tacio Camba
- Department of Applied Mathematics II, University of Vigo, Vigo, Galicia, Spain
- BioProcess Engineering Group, IIM-CSIC, Vigo, Galicia, Spain
| | - Erika Dudkin
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Janine Egert
- Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
| | - Fabian Fröhlich
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Lara Fuhrmann
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Adrian L. Hauber
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Institute of Physics, University of Freiburg, Freiburg, Germany
| | - Svenja Kemmer
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Institute of Physics, University of Freiburg, Freiburg, Germany
| | - Polina Lakrisenko
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | - Carolin Loos
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
- Ragon Institute of MGH, MIT and Harvard, Cambridge, Massachusetts, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Simon Merkt
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Heidelberg, Germany
| | - Dilan Pathirana
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Elba Raimúndez
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Lukas Refisch
- Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
| | - Marcus Rosenblatt
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Institute of Physics, University of Freiburg, Freiburg, Germany
| | - Paul L. Stapor
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | - Philipp Städter
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | - Dantong Wang
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
| | - Franz-Georg Wieland
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Institute of Physics, University of Freiburg, Freiburg, Germany
| | - Julio R. Banga
- BioProcess Engineering Group, IIM-CSIC, Vigo, Galicia, Spain
| | - Jens Timmer
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Institute of Physics, University of Freiburg, Freiburg, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, Germany
| | | | - Sven Sahle
- BioQUANT/COS, Heidelberg University, Heidelberg, Germany
| | - Clemens Kreutz
- Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis and Modeling (FDM), University of Freiburg, Freiburg, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- Center for Mathematics, Technische Universität München, Garching, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
- * E-mail:
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
| |
Collapse
|
21
|
Johnson ME, Chen A, Faeder JR, Henning P, Moraru II, Meier-Schellersheim M, Murphy RF, Prüstel T, Theriot JA, Uhrmacher AM. Quantifying the roles of space and stochasticity in computer simulations for cell biology and cellular biochemistry. Mol Biol Cell 2021; 32:186-210. [PMID: 33237849 PMCID: PMC8120688 DOI: 10.1091/mbc.e20-08-0530] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 10/13/2020] [Accepted: 11/17/2020] [Indexed: 12/29/2022] Open
Abstract
Most of the fascinating phenomena studied in cell biology emerge from interactions among highly organized multimolecular structures embedded into complex and frequently dynamic cellular morphologies. For the exploration of such systems, computer simulation has proved to be an invaluable tool, and many researchers in this field have developed sophisticated computational models for application to specific cell biological questions. However, it is often difficult to reconcile conflicting computational results that use different approaches to describe the same phenomenon. To address this issue systematically, we have defined a series of computational test cases ranging from very simple to moderately complex, varying key features of dimensionality, reaction type, reaction speed, crowding, and cell size. We then quantified how explicit spatial and/or stochastic implementations alter outcomes, even when all methods use the same reaction network, rates, and concentrations. For simple cases, we generally find minor differences in solutions of the same problem. However, we observe increasing discordance as the effects of localization, dimensionality reduction, and irreversible enzymatic reactions are combined. We discuss the strengths and limitations of commonly used computational approaches for exploring cell biological questions and provide a framework for decision making by researchers developing new models. As computational power and speed continue to increase at a remarkable rate, the dream of a fully comprehensive computational model of a living cell may be drawing closer to reality, but our analysis demonstrates that it will be crucial to evaluate the accuracy of such models critically and systematically.
Collapse
Affiliation(s)
- M. E. Johnson
- Thomas C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218
| | - A. Chen
- Thomas C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, 21218
| | - J. R. Faeder
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15260
| | - P. Henning
- Institute for Visual and Analytic Computing, University of Rostock, 18055 Rostock, Germany
| | - I. I. Moraru
- Department of Cell Biology, Center for Cell Analysis and Modeling, University of Connecticut Health Center, Farmington, CT 06030
| | - M. Meier-Schellersheim
- Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892
| | - R. F. Murphy
- Computational Biology Department, Department of Biological Sciences, Department of Biomedical Engineering, Machine Learning Department, Carnegie Mellon University, Pittsburgh, PA 15289
| | - T. Prüstel
- Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892
| | - J. A. Theriot
- Department of Biology and Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195
| | - A. M. Uhrmacher
- Institute for Visual and Analytic Computing, University of Rostock, 18055 Rostock, Germany
| |
Collapse
|
22
|
Haiman ZB, Zielinski DC, Koike Y, Yurkovich JT, Palsson BO. MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics. PLoS Comput Biol 2021; 17:e1008208. [PMID: 33507922 PMCID: PMC7872247 DOI: 10.1371/journal.pcbi.1008208] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 02/09/2021] [Accepted: 12/21/2020] [Indexed: 01/01/2023] Open
Abstract
Mathematical models of metabolic networks utilize simulation to study system-level mechanisms and functions. Various approaches have been used to model the steady state behavior of metabolic networks using genome-scale reconstructions, but formulating dynamic models from such reconstructions continues to be a key challenge. Here, we present the Mass Action Stoichiometric Simulation Python (MASSpy) package, an open-source computational framework for dynamic modeling of metabolism. MASSpy utilizes mass action kinetics and detailed chemical mechanisms to build dynamic models of complex biological processes. MASSpy adds dynamic modeling tools to the COnstraint-Based Reconstruction and Analysis Python (COBRApy) package to provide an unified framework for constraint-based and kinetic modeling of metabolic networks. MASSpy supports high-performance dynamic simulation through its implementation of libRoadRunner: the Systems Biology Markup Language (SBML) simulation engine. Three examples are provided to demonstrate how to use MASSpy: (1) a validation of the MASSpy modeling tool through dynamic simulation of detailed mechanisms of enzyme regulation; (2) a feature demonstration using a workflow for generating ensemble of kinetic models using Monte Carlo sampling to approximate missing numerical values of parameters and to quantify biological uncertainty, and (3) a case study in which MASSpy is utilized to overcome issues that arise when integrating experimental data with the computation of functional states of detailed biological mechanisms. MASSpy represents a powerful tool to address challenges that arise in dynamic modeling of metabolic networks, both at small and large scales.
Collapse
Affiliation(s)
- Zachary B. Haiman
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Daniel C. Zielinski
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Yuko Koike
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - James T. Yurkovich
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| |
Collapse
|
23
|
Kirsch K, Zeke A, Tőke O, Sok P, Sethi A, Sebő A, Kumar GS, Egri P, Póti ÁL, Gooley P, Peti W, Bento I, Alexa A, Reményi A. Co-regulation of the transcription controlling ATF2 phosphoswitch by JNK and p38. Nat Commun 2020; 11:5769. [PMID: 33188182 PMCID: PMC7666158 DOI: 10.1038/s41467-020-19582-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 10/21/2020] [Indexed: 01/26/2023] Open
Abstract
Transcription factor phosphorylation at specific sites often activates gene expression, but how environmental cues quantitatively control transcription is not well-understood. Activating protein 1 transcription factors are phosphorylated by mitogen-activated protein kinases (MAPK) in their transactivation domains (TAD) at so-called phosphoswitches, which are a hallmark in response to growth factors, cytokines or stress. We show that the ATF2 TAD is controlled by functionally distinct signaling pathways (JNK and p38) through structurally different MAPK binding sites. Moreover, JNK mediated phosphorylation at an evolutionarily more recent site diminishes p38 binding and made the phosphoswitch differently sensitive to JNK and p38 in vertebrates. Structures of MAPK-TAD complexes and mechanistic modeling of ATF2 TAD phosphorylation in cells suggest that kinase binding motifs and phosphorylation sites line up to maximize MAPK based co-regulation. This study shows how the activity of an ancient transcription controlling phosphoswitch became dependent on the relative flux of upstream signals.
Collapse
Affiliation(s)
- Klára Kirsch
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - András Zeke
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Orsolya Tőke
- Laboratory for NMR Spectroscopy, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Péter Sok
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Ashish Sethi
- Department of Biochemistry and Molecular Biology and Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Anna Sebő
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | | | - Péter Egri
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Ádám L Póti
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Paul Gooley
- Department of Biochemistry and Molecular Biology and Bio21 Molecular Science and Biotechnology Institute, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Wolfgang Peti
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ, USA
| | - Isabel Bento
- European Molecular Biology Laboratory, Hamburg, Germany
| | - Anita Alexa
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary
| | - Attila Reményi
- Biomolecular Interactions Research Group, Institute of Organic Chemistry, Research Center for Natural Sciences, H-1117, Budapest, Hungary.
| |
Collapse
|
24
|
Porubsky VL, Goldberg AP, Rampadarath AK, Nickerson DP, Karr JR, Sauro HM. Best Practices for Making Reproducible Biochemical Models. Cell Syst 2020; 11:109-120. [PMID: 32853539 DOI: 10.1016/j.cels.2020.06.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 05/15/2020] [Accepted: 06/24/2020] [Indexed: 01/03/2023]
Abstract
Like many scientific disciplines, dynamical biochemical modeling is hindered by irreproducible results. This limits the utility of biochemical models by making them difficult to understand, trust, or reuse. We comprehensively list the best practices that biochemical modelers should follow to build reproducible biochemical model artifacts-all data, model descriptions, and custom software used by the model-that can be understood and reused. The best practices provide advice for all steps of a typical biochemical modeling workflow in which a modeler collects data; constructs, trains, simulates, and validates the model; uses the predictions of a model to advance knowledge; and publicly shares the model artifacts. The best practices emphasize the benefits obtained by using standard tools and formats and provides guidance to modelers who do not or cannot use standards in some stages of their modeling workflow. Adoption of these best practices will enhance the ability of researchers to reproduce, understand, and reuse biochemical models.
Collapse
Affiliation(s)
- Veronica L Porubsky
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA.
| | - Arthur P Goldberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Anand K Rampadarath
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - David P Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Jonathan R Karr
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
25
|
Schmiester L, Weindl D, Hasenauer J. Parameterization of mechanistic models from qualitative data using an efficient optimal scaling approach. J Math Biol 2020; 81:603-623. [PMID: 32696085 PMCID: PMC7427713 DOI: 10.1007/s00285-020-01522-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 05/28/2020] [Indexed: 12/21/2022]
Abstract
Quantitative dynamical models facilitate the understanding of biological processes and the prediction of their dynamics. These models usually comprise unknown parameters, which have to be inferred from experimental data. For quantitative experimental data, there are several methods and software tools available. However, for qualitative data the available approaches are limited and computationally demanding. Here, we consider the optimal scaling method which has been developed in statistics for categorical data and has been applied to dynamical systems. This approach turns qualitative variables into quantitative ones, accounting for constraints on their relation. We derive a reduced formulation for the optimization problem defining the optimal scaling. The reduced formulation possesses the same optimal points as the established formulation but requires less degrees of freedom. Parameter estimation for dynamical models of cellular pathways revealed that the reduced formulation improves the robustness and convergence of optimizers. This resulted in substantially reduced computation times. We implemented the proposed approach in the open-source Python Parameter EStimation TOolbox (pyPESTO) to facilitate reuse and extension. The proposed approach enables efficient parameterization of quantitative dynamical models using qualitative data.
Collapse
Affiliation(s)
- Leonard Schmiester
- Institute of Computational Biology, Helmholtz Zentrum München–German Research Center for Environmental Health, 85764 Neuherberg, Germany
- Center for Mathematics, Technische Universität München, 85748 Garching, Germany
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München–German Research Center for Environmental Health, 85764 Neuherberg, Germany
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München–German Research Center for Environmental Health, 85764 Neuherberg, Germany
- Center for Mathematics, Technische Universität München, 85748 Garching, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, 53113 Bonn, Germany
| |
Collapse
|
26
|
Keating SM, Waltemath D, König M, Zhang F, Dräger A, Chaouiya C, Bergmann FT, Finney A, Gillespie CS, Helikar T, Hoops S, Malik‐Sheriff RS, Moodie SL, Moraru II, Myers CJ, Naldi A, Olivier BG, Sahle S, Schaff JC, Smith LP, Swat MJ, Thieffry D, Watanabe L, Wilkinson DJ, Blinov ML, Begley K, Faeder JR, Gómez HF, Hamm TM, Inagaki Y, Liebermeister W, Lister AL, Lucio D, Mjolsness E, Proctor CJ, Raman K, Rodriguez N, Shaffer CA, Shapiro BE, Stelling J, Swainston N, Tanimura N, Wagner J, Meier‐Schellersheim M, Sauro HM, Palsson B, Bolouri H, Kitano H, Funahashi A, Hermjakob H, Doyle JC, Hucka M. SBML Level 3: an extensible format for the exchange and reuse of biological models. Mol Syst Biol 2020; 16:e9110. [PMID: 32845085 PMCID: PMC8411907 DOI: 10.15252/msb.20199110] [Citation(s) in RCA: 126] [Impact Index Per Article: 31.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 06/24/2020] [Accepted: 07/09/2020] [Indexed: 12/25/2022] Open
Abstract
Systems biology has experienced dramatic growth in the number, size, and complexity of computational models. To reproduce simulation results and reuse models, researchers must exchange unambiguous model descriptions. We review the latest edition of the Systems Biology Markup Language (SBML), a format designed for this purpose. A community of modelers and software authors developed SBML Level 3 over the past decade. Its modular form consists of a core suited to representing reaction-based models and packages that extend the core with features suited to other model types including constraint-based models, reaction-diffusion models, logical network models, and rule-based models. The format leverages two decades of SBML and a rich software ecosystem that transformed how systems biologists build and interact with models. More recently, the rise of multiscale models of whole cells and organs, and new data sources such as single-cell measurements and live imaging, has precipitated new ways of integrating data with models. We provide our perspectives on the challenges presented by these developments and how SBML Level 3 provides the foundation needed to support this evolution.
Collapse
|
27
|
Kochen MA, Lopez CF. A Probabilistic Approach to Explore Signal Execution Mechanisms With Limited Experimental Data. Front Genet 2020; 11:686. [PMID: 32754196 PMCID: PMC7381302 DOI: 10.3389/fgene.2020.00686] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 06/04/2020] [Indexed: 11/30/2022] Open
Abstract
Mathematical models of biochemical reaction networks are central to the study of dynamic cellular processes and hypothesis generation that informs experimentation and validation. Unfortunately, model parameters are often not available and sparse experimental data leads to challenges in model calibration and parameter estimation. This can in turn lead to unreliable mechanistic interpretations of experimental data and the generation of poorly conceived hypotheses for experimental validation. To address this challenge, we evaluate whether a Bayesian-inspired probability-based approach, that relies on expected values for quantities of interest calculated from available information regarding the reaction network topology and parameters can be used to qualitatively explore hypothetical biochemical network execution mechanisms in the context of limited available data. We test our approach on a model of extrinsic apoptosis execution to identify preferred signal execution modes across varying conditions. Apoptosis signal processing can take place either through a mitochondria independent (Type I) mode or a mitochondria dependent (Type II) mode. We first show that in silico knockouts, represented by model subnetworks, successfully identify the most likely execution mode for specific concentrations of key molecular regulators. We then show that changes in molecular regulator concentrations alter the overall reaction flux through the network by shifting the primary route of signal flow between the direct caspase and mitochondrial pathways. Our work thus demonstrates that probabilistic approaches can be used to explore the qualitative dynamic behavior of model biochemical systems even with missing or sparse data.
Collapse
Affiliation(s)
- Michael A Kochen
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, United States
| | - Carlos F Lopez
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, United States.,Department of Biochemistry, Vanderbilt University, Nashville, TN, United States
| |
Collapse
|
28
|
Mitra ED, Hlavacek WS. Bayesian inference using qualitative observations of underlying continuous variables. Bioinformatics 2020; 36:3177-3184. [PMID: 32049328 PMCID: PMC7214020 DOI: 10.1093/bioinformatics/btaa084] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2019] [Revised: 01/08/2020] [Accepted: 02/03/2020] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION Recent work has demonstrated the feasibility of using non-numerical, qualitative data to parameterize mathematical models. However, uncertainty quantification (UQ) of such parameterized models has remained challenging because of a lack of a statistical interpretation of the objective functions used in optimization. RESULTS We formulated likelihood functions suitable for performing Bayesian UQ using qualitative observations of underlying continuous variables or a combination of qualitative and quantitative data. To demonstrate the resulting UQ capabilities, we analyzed a published model for immunoglobulin E (IgE) receptor signaling using synthetic qualitative and quantitative datasets. Remarkably, estimates of parameter values derived from the qualitative data were nearly as consistent with the assumed ground-truth parameter values as estimates derived from the lower throughput quantitative data. These results provide further motivation for leveraging qualitative data in biological modeling. AVAILABILITY AND IMPLEMENTATION The likelihood functions presented here are implemented in a new release of PyBioNetFit, an open-source application for analyzing Systems Biology Markup Language- and BioNetGen Language-formatted models, available online at www.github.com/lanl/PyBNF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Eshan D Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - William S Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
29
|
Salazar-Cavazos E, Nitta CF, Mitra ED, Wilson BS, Lidke KA, Hlavacek WS, Lidke DS. Multisite EGFR phosphorylation is regulated by adaptor protein abundances and dimer lifetimes. Mol Biol Cell 2020; 31:695-708. [PMID: 31913761 PMCID: PMC7202077 DOI: 10.1091/mbc.e19-09-0548] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Differential epidermal growth factor receptor (EGFR) phosphorylation is thought to couple receptor activation to distinct signaling pathways. However, the molecular mechanisms responsible for biased signaling are unresolved due to a lack of insight into the phosphorylation patterns of full-length EGFR. We extended a single-molecule pull-down technique previously used to study protein-protein interactions to allow for robust measurement of receptor phosphorylation. We found that EGFR is predominantly phosphorylated at multiple sites, yet phosphorylation at specific tyrosines is variable and only a subset of receptors share phosphorylation at the same site, even with saturating ligand concentrations. We found distinct populations of receptors as soon as 1 min after ligand stimulation, indicating early diversification of function. To understand this heterogeneity, we developed a mathematical model. The model predicted that variations in phosphorylation are dependent on the abundances of signaling partners, while phosphorylation levels are dependent on dimer lifetimes. The predictions were confirmed in studies of cell lines with different expression levels of signaling partners, and in experiments comparing low- and high-affinity ligands and oncogenic EGFR mutants. These results reveal how ligand-regulated receptor dimerization dynamics and adaptor protein concentrations play critical roles in EGFR signaling.
Collapse
Affiliation(s)
| | | | - Eshan D Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545
| | | | - Keith A Lidke
- Comprehensive Cancer Center, and.,Department of Physics and Astronomy, University of New Mexico, Albuquerque, NM 87131
| | - William S Hlavacek
- Comprehensive Cancer Center, and.,Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545
| | - Diane S Lidke
- Department of Pathology.,Comprehensive Cancer Center, and
| |
Collapse
|
30
|
Mitra ED, Hlavacek WS. Parameter Estimation and Uncertainty Quantification for Systems Biology Models. CURRENT OPINION IN SYSTEMS BIOLOGY 2019; 18:9-18. [PMID: 32719822 PMCID: PMC7384601 DOI: 10.1016/j.coisb.2019.10.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Mathematical models can provide quantitative insights into immunoreceptor signaling, and other biological processes, but require parameterization and uncertainty quantification before reliable predictions become possible. We review currently available methods and software tools to address these problems. We consider gradient-based and gradient-free methods for point estimation of parameter values, and methods of profile likelihood, bootstrapping, and Bayesian inference for uncertainty quantification. We consider recent and potential future applications of these methods to systems-level modeling of immune-related phenomena.
Collapse
Affiliation(s)
- Eshan D. Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | - William S. Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
31
|
Santibáñez R, Garrido D, Martin AJM. Pleione: A tool for statistical and multi-objective calibration of Rule-based models. Sci Rep 2019; 9:15104. [PMID: 31641245 PMCID: PMC6805871 DOI: 10.1038/s41598-019-51546-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Accepted: 09/24/2019] [Indexed: 11/17/2022] Open
Abstract
Mathematical models based on Ordinary Differential Equations (ODEs) are frequently used to describe and simulate biological systems. Nevertheless, such models are often difficult to understand. Unlike ODE models, Rule-Based Models (RBMs) utilise formal language to describe reactions as a cumulative number of statements that are easier to understand and correct. They are also gaining popularity because of their conciseness and simulation flexibility. However, RBMs generally lack tools to perform further analysis that requires simulation. This situation arises because exact and approximate simulations are computationally intensive. Translating RBMs into ODEs is commonly used to reduce simulation time, but this technique may be prohibitive due to combinatorial explosion. Here, we present the software called Pleione to calibrate RBMs. Parameter calibration is essential given the incomplete experimental determination of reaction rates and the goal of using models to reproduce experimental data. The software distributes stochastic simulations and calculations and incorporates equivalence tests to determine the fitness of RBMs compared with data. The primary features of Pleione were thoroughly tested on a model of gene regulation in Escherichia coli. Pleione yielded satisfactory results regarding calculation time and error reduction for multiple simulators, models, parameter search strategies, and computing infrastructures.
Collapse
Affiliation(s)
- Rodrigo Santibáñez
- Network Biology Lab, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, 8580745, Chile
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago, 7820436, Chile
| | - Daniel Garrido
- Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago, 7820436, Chile
| | - Alberto J M Martin
- Network Biology Lab, Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, 8580745, Chile.
| |
Collapse
|
32
|
Mitra ED, Suderman R, Colvin J, Ionkov A, Hu A, Sauro HM, Posner RG, Hlavacek WS. PyBioNetFit and the Biological Property Specification Language. iScience 2019; 19:1012-1036. [PMID: 31522114 PMCID: PMC6744527 DOI: 10.1016/j.isci.2019.08.045] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 06/21/2019] [Accepted: 08/22/2019] [Indexed: 02/07/2023] Open
Abstract
In systems biology modeling, important steps include model parameterization, uncertainty quantification, and evaluation of agreement with experimental observations. To help modelers perform these steps, we developed the software PyBioNetFit, which in addition supports checking models against known system properties and solving design problems. PyBioNetFit introduces Biological Property Specification Language (BPSL) for the formal declaration of system properties. BPSL allows qualitative data to be used alone or in combination with quantitative data. PyBioNetFit performs parameterization with parallelized metaheuristic optimization algorithms that work directly with existing model definition standards: BioNetGen Language (BNGL) and Systems Biology Markup Language (SBML). We demonstrate PyBioNetFit's capabilities by solving various example problems, including the challenging problem of parameterizing a 153-parameter model of cell cycle control in yeast based on both quantitative and qualitative data. We demonstrate the model checking and design applications of PyBioNetFit and BPSL by analyzing a model of targeted drug interventions in autophagy signaling.
Collapse
Affiliation(s)
- Eshan D Mitra
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Ryan Suderman
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Joshua Colvin
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Alexander Ionkov
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Andrew Hu
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Richard G Posner
- Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - William S Hlavacek
- Theoretical Biology and Biophysics Group, Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| |
Collapse
|