1
|
Li LX, Aguilar B, Gennari JH, Qin G. LM-Merger: A workflow for merging logical models with an application to gene regulation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.13.612961. [PMID: 39345612 PMCID: PMC11429764 DOI: 10.1101/2024.09.13.612961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/01/2024]
Abstract
Motivation Gene regulatory network (GRN) models provide mechanistic understanding of genetic interactions that regulate gene expression and, consequently, influence cellular behavior. Dysregulated gene expression plays a critical role in disease progression and treatment response, making GRN models a promising tool for precision medicine. While researchers have built many models to describe specific subsets of gene interactions, more comprehensive models that cover a broader range of genes are challenging to build. This necessitates the development of automated approaches for merging existing models. Results We present LM-Merger, a workflow for semi-automatically merging logical GRN models. The workflow consists of five main steps: (a) model identification, (b) model standardization and annotation, (c) model verification, (d) model merging, and (d) model evaluation. We demonstrate the feasibility and benefit of this workflow with two pairs of published models pertaining to acute myeloid leukemia (AML). The integrated models were able to retain the predictive accuracy of the original models, while expanding coverage of the biological system. Notably, when applied to a new dataset, the integrated models outperformed the individual models in predicting patient response. This study highlights the potential of logical model merging to advance systems biology research and our understanding of complex diseases. Availability and implementation The workflow and accompanying tools, including modules for model standardization, automated logical model merging, and evaluation, are available at https://github.com/IlyaLab/LogicModelMerger/.
Collapse
Affiliation(s)
- Luna Xingyu Li
- Institute for Systems Biology, Seattle, WA 98109, United States of America
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195, United States of America
| | - Boris Aguilar
- Institute for Systems Biology, Seattle, WA 98109, United States of America
| | - John H Gennari
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195, United States of America
| | - Guangrong Qin
- Institute for Systems Biology, Seattle, WA 98109, United States of America
| |
Collapse
|
2
|
Fang J, Huang Y, Li Y, Luo H, Ma L, Duan M, Li X, Zhang R, Xiong Y. Experiment and Simulation Study on the Adsorption Interaction between a Fluorescent Tracer and a Montmorillonite Crystal in Drilling Fluid. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2024; 40:24901-24920. [PMID: 39546812 DOI: 10.1021/acs.langmuir.4c02848] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2024]
Abstract
The adsorption interaction of oil field tracer in drilling fluid plays a significant role in tracer monitoring (TM) technology in the petroleum industry. In this work, the adsorption performances of Rhodamine B (RhB+) and fluorescein sodium (Fln-) tracers with montmorillonite (MMT) crystal in drilling fluid were investigated by both experimental and simulation methods. For the experimental aspect, the macroscopic results indicate thermodynamic monolayer adsorption by the Langmuir model and kinetic chemical adsorption by the pseudo-second-order (PSO) model. As a result, MMT shows a larger adsorption capacity (qm) for RhB+ than for Fln- with q m ( RhB + ) = 0.069 g g - 1 > q m ( Fln - ) = 0.016 g g - 1 but stronger adsorption spontaneity (ΔrGmθ) for Fln- than for RhB+ with Δ r G m θ ( Fln - ) = - 7.92 kJ mol-1 < Δ r G m θ ( RhB + ) = - 6.90 kJ mol-1. Meanwhile, the interaction rate (k2) of Fln- was shown to be faster than that of RhB+ with k 2 ( Fln - ) = 1.07 min - 1 > k 2 ( RhB + ) = 0.95 min - 1 . For simulation insight, MMT shows much higher system stability (E) for Fln- than for RhB+ with E Fln - · · · MMT < E RhB + · · · MMT and Δ E Fln - · · · MMT > Δ E RhB + · · · MMT . Meanwhile, the microscopic simulation results reveal configuration changes and site distinctions for RhB+ and Fln- interactions with the MMT crystal. The different adsorption responses were explained by proposing an interaction mechanism of force dominance and position orientation. Specifically, Fln- was deduced to interact with metal (Al, Ca) and metalloid (Si) elements in the MMT crystal interlayer by "upright-insertion" orientation while RhB+ was deduced to interact with oxygen atoms on the MMT crystal surface by a "flat-lying" orientation. Hydrogen bonds, the electrostatic interaction, and the coordination effect were revealed to dominate for the interaction of tracer adsorption. This work provides both performance and mechanism investigation of fluorescent tracer adsorption interaction with the MMT crystal in drilling fluid, which is of great significance in reservoir exploitation.
Collapse
Affiliation(s)
- Jie Fang
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| | - Ying Huang
- CNOOC Energy Tech-Drilling & Production Co., Tianjin 300452, China
- NOOC Energy Technology & Services Limited Key Laboratory for Exploration & Development of Unconventional Resources, Beijing 100029, China
| | - Yangbing Li
- CNOOC Energy Tech-Drilling & Production Co., Tianjin 300452, China
- NOOC Energy Technology & Services Limited Key Laboratory for Exploration & Development of Unconventional Resources, Beijing 100029, China
| | - Houfu Luo
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| | - Lihua Ma
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| | - Ming Duan
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| | - Xinliang Li
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| | - Run Zhang
- Australian Institute for Bioengineering and Nanotechnology, AIBN, The University of Queensland, St. Lucia QLD 4072, Australia
| | - Yan Xiong
- School of Chemistry and Chemical Engineering, Southwest Petroleum University, Chengdu 610500, China
| |
Collapse
|
3
|
Agmon E. Foundations of a Compositional Systems Biology. ARXIV 2024:arXiv:2408.00942v2. [PMID: 39130201 PMCID: PMC11312625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Composition is a powerful principle for systems biology, focused on the interfaces, interconnections, and orchestration of distributed processes to enable integrative multiscale simulations. Whereas traditional models focus on the structure or dynamics of specific subsystems in controlled conditions, compositional systems biology aims to connect these models, asking critical questions about the space between models: What variables should a submodel expose through its interface? How do coupled models connect and translate across scales? How do domain-specific models connect across biological and physical disciplines to drive the synthesis of new knowledge? This approach requires robust software to integrate diverse datasets and submodels, providing researchers with tools to flexibly recombine, iteratively refine, and collaboratively expand their models. This article offers a comprehensive framework to support this vision, including: a conceptual and graphical framework to define interfaces and composition patterns; standardized schemas that facilitate modular data and model assembly; biological templates that integrate detailed submodels that connect molecular processes to the emergence of the cellular interface; and user-friendly software interfaces that empower research communities to construct and improve multiscale models of cellular systems. By addressing these needs, compositional systems biology will foster a unified and scalable approach to understanding complex cellular systems.
Collapse
|
4
|
James JS, Dai J, Chew WL, Cai Y. The design and engineering of synthetic genomes. Nat Rev Genet 2024:10.1038/s41576-024-00786-y. [PMID: 39506144 DOI: 10.1038/s41576-024-00786-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/23/2024] [Indexed: 11/08/2024]
Abstract
Synthetic genomics seeks to design and construct entire genomes to mechanistically dissect fundamental questions of genome function and to engineer organisms for diverse applications, including bioproduction of high-value chemicals and biologics, advanced cell therapies, and stress-tolerant crops. Recent progress has been fuelled by advancements in DNA synthesis, assembly, delivery and editing. Computational innovations, such as the use of artificial intelligence to provide prediction of function, also provide increasing capabilities to guide synthetic genome design and construction. However, translating synthetic genome-scale projects from idea to implementation remains highly complex. Here, we aim to streamline this implementation process by comprehensively reviewing the strategies for design, construction, delivery, debugging and tailoring of synthetic genomes as well as their potential applications.
Collapse
Affiliation(s)
- Joshua S James
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Junbiao Dai
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Shenzhen Key Laboratory of Agricultural Synthetic Biology, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Shenzhen Key Laboratory of Synthetic Genomics, Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Wei Leong Chew
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Yizhi Cai
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK.
| |
Collapse
|
5
|
Bi X, Cheng Y, Lv X, Liu Y, Li J, Du G, Chen J, Liu L. A Multi-Omics, Machine Learning-Aware, Genome-Wide Metabolic Model of Bacillus Subtilis Refines the Gene Expression and Cell Growth Prediction. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2408705. [PMID: 39287062 PMCID: PMC11558093 DOI: 10.1002/advs.202408705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Indexed: 09/19/2024]
Abstract
Given the extensive heterogeneity and variability, understanding cellular functions and regulatory mechanisms through the analysis of multi-omics datasets becomes extremely challenging. Here, a comprehensive modeling framework of multi-omics machine learning and metabolic network models are proposed that covers various cellular biological processes across multiple scales. This model on an extensive normalized compendium of Bacillus subtilis is validated, which encompasses gene expression data from environmental perturbations, transcriptional regulation, signal transduction, protein translation, and growth measurements. Comparison with high-throughput experimental data shows that EM_iBsu1209-ME, constructed on this basis, can accurately predict the expression of 605 genes and the synthesis of 23 metabolites under different conditions. This study paves the way for the construction of comprehensive biological databases and high-performance multi-omics metabolic models to achieve accurate predictive analysis in exploring complex mechanisms of cell genotypes and phenotypes.
Collapse
Affiliation(s)
- Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Yang Cheng
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Guocheng Du
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Jian Chen
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and BiotechnologyMinistry of EducationJiangnan UniversityWuxi214122China
- Science Center for Future FoodsMinistry of EducationJiangnan UniversityWuxi214122China
| |
Collapse
|
6
|
Lu H, Xiao L, Liao W, Yan X, Nielsen J. Cell factory design with advanced metabolic modelling empowered by artificial intelligence. Metab Eng 2024; 85:61-72. [PMID: 39038602 DOI: 10.1016/j.ymben.2024.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 07/06/2024] [Accepted: 07/06/2024] [Indexed: 07/24/2024]
Abstract
Advances in synthetic biology and artificial intelligence (AI) have provided new opportunities for modern biotechnology. High-performance cell factories, the backbone of industrial biotechnology, are ultimately responsible for determining whether a bio-based product succeeds or fails in the fierce competition with petroleum-based products. To date, one of the greatest challenges in synthetic biology is the creation of high-performance cell factories in a consistent and efficient manner. As so-called white-box models, numerous metabolic network models have been developed and used in computational strain design. Moreover, great progress has been made in AI-powered strain engineering in recent years. Both approaches have advantages and disadvantages. Therefore, the deep integration of AI with metabolic models is crucial for the construction of superior cell factories with higher titres, yields and production rates. The detailed applications of the latest advanced metabolic models and AI in computational strain design are summarized in this review. Additionally, approaches for the deep integration of AI and metabolic models are discussed. It is anticipated that advanced mechanistic metabolic models powered by AI will pave the way for the efficient construction of powerful industrial chassis strains in the coming years.
Collapse
Affiliation(s)
- Hongzhong Lu
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China.
| | - Luchi Xiao
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China
| | - Wenbin Liao
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, PR China; Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai, 200237, PR China
| | - Xuefeng Yan
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai, 200237, PR China
| | - Jens Nielsen
- BioInnovation Institute, Ole Måløes Vej, DK2200, Copenhagen N, Denmark; Department of Biology and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE412 96, Gothenburg, Sweden.
| |
Collapse
|
7
|
Mutsuddy A, Huggins JR, Amrit A, Erdem C, Calhoun JC, Birtwistle MR. Mechanistic modeling of cell viability assays with in silico lineage tracing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.23.609433. [PMID: 39253474 PMCID: PMC11383287 DOI: 10.1101/2024.08.23.609433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Data from cell viability assays, which measure cumulative division and death events in a population and reflect substantial cellular heterogeneity, are widely available. However, interpreting such data with mechanistic computational models is hindered because direct model/data comparison is often muddled. We developed an algorithm that tracks simulated division and death events in mechanistically detailed single-cell lineages to enable such a model/data comparison and suggest causes of cell-cell drug response variability. Using our previously developed model of mammalian single-cell proliferation and death signaling, we simulated drug dose response experiments for four targeted anti-cancer drugs (alpelisib, neratinib, trametinib and palbociclib) and compared them to experimental data. Simulations are consistent with data for strong growth inhibition by trametinib (MEK inhibitor) and overall lack of efficacy for alpelisib (PI-3K inhibitor), but are inconsistent with data for palbociclib (CDK4/6 inhibitor) and neratinib (EGFR inhibitor). Model/data inconsistencies suggest (i) the importance of CDK4/6 for driving the cell cycle may be overestimated, and (ii) that the cellular balance between basal (tonic) and ligand-induced signaling is a critical determinant of receptor inhibitor response. Simulations show subpopulations of rapidly and slowly dividing cells in both control and drug-treated conditions. Variations in mother cells prior to drug treatment all impinging on ERK pathway activity are associated with the rapidly dividing phenotype and trametinib resistance. This work lays a foundation for the application of mechanistic modeling to large-scale cell viability assay datasets and better understanding determinants of cellular heterogeneity in drug response.
Collapse
Affiliation(s)
- Arnab Mutsuddy
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Jonah R. Huggins
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Aurore Amrit
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
- Faculté de Pharmacie, Université Paris Cité, Paris, France
| | - Cemal Erdem
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
- Department of Medical Biosciences, Umeå University, Umeå, Sweden
| | - Jon C. Calhoun
- Holcombe Department of Electrical and Computer Engineering, Clemson University, Clemson, SC, USA
| | - Marc R. Birtwistle
- Department of Chemical and Biomolecular Engineering, Clemson University, Clemson, SC, USA
- Department of Bioengineering, Clemson University, Clemson, SC, USA
| |
Collapse
|
8
|
Kim K, Choe D, Cho S, Palsson B, Cho BK. Reduction-to-synthesis: the dominant approach to genome-scale synthetic biology. Trends Biotechnol 2024; 42:1048-1063. [PMID: 38423803 DOI: 10.1016/j.tibtech.2024.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/09/2024] [Accepted: 02/12/2024] [Indexed: 03/02/2024]
Abstract
Advances in systems and synthetic biology have propelled the construction of reduced bacterial genomes. Genome reduction was initially focused on exploring properties of minimal genomes, but more recently it has been deployed as an engineering strategy to enhance strain performance. This review provides the latest updates on reduced genomes, focusing on dual-track approaches of top-down reduction and bottom-up synthesis for their construction. Using cases from studies that are based on established industrial workhorse strains, we discuss the construction of a series of synthetic phenotypes that are candidates for biotechnological applications. Finally, we address the possible uses of reduced genomes for biotechnological applications and the needed future research directions that may ultimately lead to the total synthesis of rationally designed genomes.
Collapse
Affiliation(s)
- Kangsan Kim
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea; KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Donghui Choe
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Suhyung Cho
- KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Bernhard Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Kongens, Lyngby, Denmark
| | - Byung-Kwan Cho
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea; KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea; Graduate School of Engineering Biology, Korea Advanced Institute of Science and Technology, Daejeon, 34141, Republic of Korea.
| |
Collapse
|
9
|
Metz TO, Chang CH, Gautam V, Anjum A, Tian S, Wang F, Colby SM, Nunez JR, Blumer MR, Edison AS, Fiehn O, Jones DP, Li S, Morgan ET, Patti GJ, Ross DH, Shapiro MR, Williams AJ, Wishart DS. Introducing 'identification probability' for automated and transferable assessment of metabolite identification confidence in metabolomics and related studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.30.605945. [PMID: 39131324 PMCID: PMC11312557 DOI: 10.1101/2024.07.30.605945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Methods for assessing compound identification confidence in metabolomics and related studies have been debated and actively researched for the past two decades. The earliest effort in 2007 focused primarily on mass spectrometry and nuclear magnetic resonance spectroscopy and resulted in four recommended levels of metabolite identification confidence - the Metabolite Standards Initiative (MSI) Levels. In 2014, the original MSI Levels were expanded to five levels (including two sublevels) to facilitate communication of compound identification confidence in high resolution mass spectrometry studies. Further refinement in identification levels have occurred, for example to accommodate use of ion mobility spectrometry in metabolomics workflows, and alternate approaches to communicate compound identification confidence also have been developed based on identification points schema. However, neither qualitative levels of identification confidence nor quantitative scoring systems address the degree of ambiguity in compound identifications in context of the chemical space being considered, are easily automated, or are transferable between analytical platforms. In this perspective, we propose that the metabolomics and related communities consider identification probability as an approach for automated and transferable assessment of compound identification and ambiguity in metabolomics and related studies. Identification probability is defined simply as 1/N, where N is the number of compounds in a reference library or chemical space that match to an experimentally measured molecule within user-defined measurement precision(s), for example mass measurement or retention time accuracy, etc. We demonstrate the utility of identification probability in an in silico analysis of multi-property reference libraries constructed from the Human Metabolome Database and computational property predictions, provide guidance to the community in transparent implementation of the concept, and invite the community to further evaluate this concept in parallel with their current preferred methods for assessing metabolite identification confidence.
Collapse
Affiliation(s)
- Thomas O. Metz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Christine H. Chang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Vasuk Gautam
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Afia Anjum
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Siyang Tian
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| | - Fei Wang
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
- Alberta Machine Intelligence Institute, Edmonton, AB, Canada
| | - Sean M. Colby
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Jamie R. Nunez
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madison R. Blumer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Arthur S. Edison
- Department of Biochemistry & Molecular Biology, Complex Carbohydrate Research Center and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| | - Oliver Fiehn
- West Coast Metabolomics Center, University of California Davis, Davis, CA, USA
| | - Dean P. Jones
- Clinical Biomarkers Laboratory, Department of Medicine, Emory University, Atlanta, Georgia, USA
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Edward T. Morgan
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Gary J. Patti
- Center for Mass Spectrometry and Metabolic Tracing, Department of Chemistry, Department of Medicine, Washington University, Saint Louis, Missouri, USA
| | - Dylan H. Ross
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Madelyn R. Shapiro
- Artificial Intelligence & Data Analytics Division, Pacific Northwest National Laboratory, Richland, WA USA
| | - Antony J. Williams
- U.S. Environmental Protection Agency, Office of Research & Development, Center for Computational Toxicology & Exposure (CCTE), Research Triangle Park, NC USA
| | - David S. Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
10
|
Zelenka NR, Di Cara N, Sharma K, Sarvaharman S, Ghataora JS, Parmeggiani F, Nivala J, Abdallah ZS, Marucci L, Gorochowski TE. Data hazards in synthetic biology. Synth Biol (Oxf) 2024; 9:ysae010. [PMID: 38973982 PMCID: PMC11227101 DOI: 10.1093/synbio/ysae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/17/2024] [Accepted: 06/19/2024] [Indexed: 07/09/2024] Open
Abstract
Data science is playing an increasingly important role in the design and analysis of engineered biology. This has been fueled by the development of high-throughput methods like massively parallel reporter assays, data-rich microscopy techniques, computational protein structure prediction and design, and the development of whole-cell models able to generate huge volumes of data. Although the ability to apply data-centric analyses in these contexts is appealing and increasingly simple to do, it comes with potential risks. For example, how might biases in the underlying data affect the validity of a result and what might the environmental impact of large-scale data analyses be? Here, we present a community-developed framework for assessing data hazards to help address these concerns and demonstrate its application to two synthetic biology case studies. We show the diversity of considerations that arise in common types of bioengineering projects and provide some guidelines and mitigating steps. Understanding potential issues and dangers when working with data and proactively addressing them will be essential for ensuring the appropriate use of emerging data-intensive AI methods and help increase the trustworthiness of their applications in synthetic biology.
Collapse
Affiliation(s)
- Natalie R Zelenka
- Jean Golding Institute, University of Bristol, Bristol, UK
- BrisEngBio, University of Bristol, Bristol, UK
| | - Nina Di Cara
- School of Psychological Science, University of Bristol, Bristol, UK
| | - Kieren Sharma
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | | | - Jasdeep S Ghataora
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Fabio Parmeggiani
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biochemistry, University of Bristol, Bristol, UK
- School of Pharmacy and Pharmaceutical Sciences, Cardiff University, Cardiff, UK
| | - Jeff Nivala
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Zahraa S Abdallah
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | - Lucia Marucci
- BrisEngBio, University of Bristol, Bristol, UK
- School of Engineering Mathematics and Technology, University of Bristol, Bristol, UK
| | - Thomas E Gorochowski
- BrisEngBio, University of Bristol, Bristol, UK
- School of Biological Sciences, University of Bristol, Bristol, UK
| |
Collapse
|
11
|
Rafelski SM, Theriot JA. Establishing a conceptual framework for holistic cell states and state transitions. Cell 2024; 187:2633-2651. [PMID: 38788687 DOI: 10.1016/j.cell.2024.04.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 04/10/2024] [Accepted: 04/24/2024] [Indexed: 05/26/2024]
Abstract
Cell states were traditionally defined by how they looked, where they were located, and what functions they performed. In this post-genomic era, the field is largely focused on a molecular view of cell state. Moving forward, we anticipate that the observables used to define cell states will evolve again as single-cell imaging and analytics are advancing at a breakneck pace via the collection of large-scale, systematic cell image datasets and the application of quantitative image-based data science methods. This is, therefore, a key moment in the arc of cell biological research to develop approaches that integrate the spatiotemporal observables of the physical structure and organization of the cell with molecular observables toward the concept of a holistic cell state. In this perspective, we propose a conceptual framework for holistic cell states and state transitions that is data-driven, practical, and useful to enable integrative analyses and modeling across many data types.
Collapse
Affiliation(s)
- Susanne M Rafelski
- Allen Institute for Cell Science, 615 Westlake Avenue N, Seattle, WA 98125, USA.
| | - Julie A Theriot
- Department of Biology and Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
12
|
Hao T, Song Z, Zhang M, Zhang L, Yang J, Li J, Sun J. Reconstruction of Metabolic-Protein Interaction Integrated Network of Eriocheir sinensis and Analysis of Ecdysone Synthesis. Genes (Basel) 2024; 15:410. [PMID: 38674345 PMCID: PMC11049885 DOI: 10.3390/genes15040410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 03/24/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Integrated networks have become a new interest in genome-scale network research due to their ability to comprehensively reflect and analyze the molecular processes in cells. Currently, none of the integrated networks have been reported for higher organisms. Eriocheir sinensis is a typical aquatic animal that grows through ecdysis. Ecdysone has been identified to be a crucial regulator of ecdysis, but the influence factors and regulatory mechanisms of ecdysone synthesis in E. sinensis are still unclear. In this work, the genome-scale metabolic network and protein-protein interaction network of E. sinensis were integrated to reconstruct a metabolic-protein interaction integrated network (MPIN). The MPIN was used to analyze the influence factors of ecdysone synthesis through flux variation analysis. In total, 236 integrated reactions (IRs) were found to influence the ecdysone synthesis of which 16 IRs had a significant impact. These IRs constitute three ecdysone synthesis routes. It is found that there might be alternative pathways to obtain cholesterol for ecdysone synthesis in E. sinensis instead of absorbing it directly from the feeds. The MPIN reconstructed in this work is the first integrated network for higher organisms. The analysis based on the MPIN supplies important information for the mechanism analysis of ecdysone synthesis in E. sinensis.
Collapse
Affiliation(s)
- Tong Hao
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| | - Zhentao Song
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| | - Mingzhi Zhang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| | - Lingrui Zhang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| | - Jiarui Yang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| | - Jingjing Li
- Tianjin Fisheries Research Institute, Tianjin 300211, China;
| | - Jinsheng Sun
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (Z.S.); (M.Z.); (L.Z.); (J.Y.)
| |
Collapse
|
13
|
Sun G, DeFelice MM, Gillies TE, Ahn-Horst TA, Andrews CJ, Krummenacker M, Karp PD, Morrison JH, Covert MW. Cross-evaluation of E. coli's operon structures via a whole-cell model suggests alternative cellular benefits for low- versus high-expressing operons. Cell Syst 2024; 15:227-245.e7. [PMID: 38417437 PMCID: PMC10957310 DOI: 10.1016/j.cels.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 09/12/2023] [Accepted: 02/08/2024] [Indexed: 03/01/2024]
Abstract
Many bacteria use operons to coregulate genes, but it remains unclear how operons benefit bacteria. We integrated E. coli's 788 polycistronic operons and 1,231 transcription units into an existing whole-cell model and found inconsistencies between the proposed operon structures and the RNA-seq read counts that the model was parameterized from. We resolved these inconsistencies through iterative, model-guided corrections to both datasets, including the correction of RNA-seq counts of short genes that were misreported as zero by existing alignment algorithms. The resulting model suggested two main modes by which operons benefit bacteria. For 86% of low-expression operons, adding operons increased the co-expression probabilities of their constituent proteins, whereas for 92% of high-expression operons, adding operons resulted in more stable expression ratios between the proteins. These simulations underscored the need for further experimental work on how operons reduce noise and synchronize both the expression timing and the quantity of constituent genes. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Gwanggyu Sun
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Mialy M DeFelice
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Taryn E Gillies
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Travis A Ahn-Horst
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Cecelia J Andrews
- Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
| | | | | | - Jerry H Morrison
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - Markus W Covert
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
14
|
Sechkar K, Steel H, Perrino G, Stan GB. A coarse-grained bacterial cell model for resource-aware analysis and design of synthetic gene circuits. Nat Commun 2024; 15:1981. [PMID: 38438391 PMCID: PMC10912777 DOI: 10.1038/s41467-024-46410-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 02/27/2024] [Indexed: 03/06/2024] Open
Abstract
Within a cell, synthetic and native genes compete for expression machinery, influencing cellular process dynamics through resource couplings. Models that simplify competitive resource binding kinetics can guide the design of strategies for countering these couplings. However, in bacteria resource availability and cell growth rate are interlinked, which complicates resource-aware biocircuit design. Capturing this interdependence requires coarse-grained bacterial cell models that balance accurate representation of metabolic regulation against simplicity and interpretability. We propose a coarse-grained E. coli cell model that combines the ease of simplified resource coupling analysis with appreciation of bacterial growth regulation mechanisms and the processes relevant for biocircuit design. Reliably capturing known growth phenomena, it provides a unifying explanation to disparate empirical relations between growth and synthetic gene expression. Considering a biomolecular controller that makes cell-wide ribosome availability robust to perturbations, we showcase our model's usefulness in numerically prototyping biocircuits and deriving analytical relations for design guidance.
Collapse
Affiliation(s)
- Kirill Sechkar
- Department of Engineering Science, University of Oxford, Parks Road, Oxford, OX1 3PJ, UK
| | - Harrison Steel
- Department of Engineering Science, University of Oxford, Parks Road, Oxford, OX1 3PJ, UK
| | - Giansimone Perrino
- Department of Bioengineering, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK.
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK.
| | - Guy-Bart Stan
- Department of Bioengineering, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK.
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK.
| |
Collapse
|
15
|
Baghdassarian HM, Lewis NE. Resource allocation in mammalian systems. Biotechnol Adv 2024; 71:108305. [PMID: 38215956 PMCID: PMC11182366 DOI: 10.1016/j.biotechadv.2023.108305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/17/2023] [Accepted: 12/18/2023] [Indexed: 01/14/2024]
Abstract
Cells execute biological functions to support phenotypes such as growth, migration, and secretion. Complementarily, each function of a cell has resource costs that constrain phenotype. Resource allocation by a cell allows it to manage these costs and optimize their phenotypes. In fact, the management of resource constraints (e.g., nutrient availability, bioenergetic capacity, and macromolecular machinery production) shape activity and ultimately impact phenotype. In mammalian systems, quantification of resource allocation provides important insights into higher-order multicellular functions; it shapes intercellular interactions and relays environmental cues for tissues to coordinate individual cells to overcome resource constraints and achieve population-level behavior. Furthermore, these constraints, objectives, and phenotypes are context-dependent, with cells adapting their behavior according to their microenvironment, resulting in distinct steady-states. This review will highlight the biological insights gained from probing resource allocation in mammalian cells and tissues.
Collapse
Affiliation(s)
- Hratch M Baghdassarian
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
16
|
Schulz-Mirbach H, Dronsella B, He H, Erb TJ. Creating new-to-nature carbon fixation: A guide. Metab Eng 2024; 82:12-28. [PMID: 38160747 DOI: 10.1016/j.ymben.2023.12.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 12/23/2023] [Accepted: 12/27/2023] [Indexed: 01/03/2024]
Abstract
Synthetic biology aims at designing new biological functions from first principles. These new designs allow to expand the natural solution space and overcome the limitations of naturally evolved systems. One example is synthetic CO2-fixation pathways that promise to provide more efficient ways for the capture and conversion of CO2 than natural pathways, such as the Calvin Benson Bassham (CBB) cycle of photosynthesis. In this review, we provide a practical guideline for the design and realization of such new-to-nature CO2-fixation pathways. We introduce the concept of "synthetic CO2-fixation", and give a general overview over the enzymology and topology of synthetic pathways, before we derive general principles for their design from their eight naturally evolved analogs. We provide a comprehensive summary of synthetic carbon-assimilation pathways and derive a step-by-step, practical guide from the theoretical design to their practical implementation, before ending with an outlook on new developments in the field.
Collapse
Affiliation(s)
- Helena Schulz-Mirbach
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043, Marburg, Germany
| | - Beau Dronsella
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043, Marburg, Germany; Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Hai He
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043, Marburg, Germany
| | - Tobias J Erb
- Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch-Str. 10, 35043, Marburg, Germany; Center for Synthetic Microbiology (SYNMIKRO), Karl-von-Frisch-Str. 16, D-35043, Marburg, Germany.
| |
Collapse
|
17
|
Akbari A, Haiman ZB, Palsson BO. A data-driven approach for timescale decomposition of biochemical reaction networks. mSystems 2024; 9:e0100123. [PMID: 38259168 PMCID: PMC10946255 DOI: 10.1128/msystems.01001-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 12/05/2023] [Indexed: 01/24/2024] Open
Abstract
Understanding the dynamics of biological systems in evolving environments is a challenge due to their scale and complexity. Here, we present a computational framework for the timescale decomposition of biochemical reaction networks to distill essential patterns from their intricate dynamics. This approach identifies timescale hierarchies, concentration pools, and coherent structures from time-series data, providing a system-level description of reaction networks at physiologically important timescales. We apply this technique to kinetic models of hypothetical and biological pathways, validating it by reproducing analytically characterized or previously known concentration pools of these pathways. Moreover, by analyzing the timescale hierarchy of the glycolytic pathway, we elucidate the connections between the stoichiometric and dissipative structures of reaction networks and the temporal organization of coherent structures. Specifically, we show that glycolysis is a cofactor-driven pathway, the slowest dynamics of which are described by a balance between high-energy phosphate bond and redox trafficking. Overall, this approach provides more biologically interpretable characterizations of network dynamics than large-scale kinetic models, thus facilitating model reduction and personalized medicine applications. IMPORTANCE Complex interactions within interconnected biochemical reaction networks enable cellular responses to a wide range of unpredictable environmental perturbations. Understanding how biological functions arise from these intricate interactions has been a long-standing problem in biology. Here, we introduce a computational approach to dissect complex biological systems' dynamics in evolving environments. This approach characterizes the timescale hierarchies of complex reaction networks, offering a system-level understanding at physiologically relevant timescales. Analyzing various hypothetical and biological pathways, we show how stoichiometric properties shape the way energy is dissipated throughout reaction networks. Notably, we establish that glycolysis operates as a cofactor-driven pathway, where the slowest dynamics are governed by a balance between high-energy phosphate bonds and redox trafficking. This approach enhances our understanding of network dynamics and facilitates the development of reduced-order kinetic models with biologically interpretable components.
Collapse
Affiliation(s)
- Amir Akbari
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Zachary B. Haiman
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
18
|
Gilbert BR, Luthey-Schulten Z. Replicating Chromosomes in Whole-Cell Models of Bacteria. Methods Mol Biol 2024; 2819:625-653. [PMID: 39028527 DOI: 10.1007/978-1-0716-3930-6_29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication of genetic material. In a recent study, we presented a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics. This approach was used to investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell cycle. To achieve cell-scale chromosome structures that are realistic, we modeled the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. Additionally, the polymer interacts with ribosomes distributed according to cryo-electron tomograms of Syn3A. The polymer model was further augmented by computational models of loop extrusion by structural maintenance of chromosomes (SMC) protein complexes and topoisomerase action, and the modeling and analysis of multi-fork replication states.
Collapse
Affiliation(s)
- Benjamin R Gilbert
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
- NSF Science and Technology Center for Quantitative Cell Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| |
Collapse
|
19
|
Chew YH, Marucci L. Mechanistic Model-Driven Biodesign in Mammalian Synthetic Biology. Methods Mol Biol 2024; 2774:71-84. [PMID: 38441759 DOI: 10.1007/978-1-0716-3718-0_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
Mathematical modeling plays a vital role in mammalian synthetic biology by providing a framework to design and optimize design circuits and engineered bioprocesses, predict their behavior, and guide experimental design. Here, we review recent models used in the literature, considering mathematical frameworks at the molecular, cellular, and system levels. We report key challenges in the field and discuss opportunities for genome-scale models, machine learning, and cybergenetics to expand the capabilities of model-driven mammalian cell biodesign.
Collapse
Affiliation(s)
- Yin Hoon Chew
- School of Mathematics, University of Birmingham, Birmingham, UK
| | - Lucia Marucci
- Department of Engineering Mathematics, University of Bristol, Bristol, UK.
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, UK.
| |
Collapse
|
20
|
Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc Database (2023). EcoSal Plus 2023; 11:eesp00022023. [PMID: 37220074 PMCID: PMC10729931 DOI: 10.1128/ecosalplus.esp-0002-2023] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 04/04/2023] [Indexed: 01/28/2024]
Abstract
EcoCyc is a bioinformatics database available online at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed online. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and of the procedures by which this content is generated.
Collapse
Affiliation(s)
- Peter D. Karp
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Suzanne Paley
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Ron Caspi
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Anamika Kothari
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Markus Krummenacker
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Peter E. Midford
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Lisa R. Moore
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Pallavi Subhraveti
- Bioinformatics Research Group, SRI International, Menlo Park, California, USA
| | - Socorro Gama-Castro
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Victor H. Tierrafria
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Paloma Lara
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Luis Muñiz-Rascado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - César Bonavides-Martinez
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Alberto Santos-Zavaleta
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Amanda Mackie
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales, Australia
| | - Gwanggyu Sun
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Travis A. Ahn-Horst
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Heejo Choi
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Markus W. Covert
- Department of Bioengineering, Stanford University, Stanford, California, USA
| | - Julio Collado-Vides
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - Ian Paulsen
- School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
| |
Collapse
|
21
|
Bernstein DB, Akkas B, Price MN, Arkin AP. Evaluating E. coli genome-scale metabolic model accuracy with high-throughput mutant fitness data. Mol Syst Biol 2023; 19:e11566. [PMID: 37888487 DOI: 10.15252/msb.202311566] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 09/23/2023] [Accepted: 10/05/2023] [Indexed: 10/28/2023] Open
Abstract
The Escherichia coli genome-scale metabolic model (GEM) is an exemplar systems biology model for the simulation of cellular metabolism. Experimental validation of model predictions is essential to pinpoint uncertainty and ensure continued development of accurate models. Here, we quantified the accuracy of four subsequent E. coli GEMs using published mutant fitness data across thousands of genes and 25 different carbon sources. This evaluation demonstrated the utility of the area under a precision-recall curve relative to alternative accuracy metrics. An analysis of errors in the latest (iML1515) model identified several vitamins/cofactors that are likely available to mutants despite being absent from the experimental growth medium and highlighted isoenzyme gene-protein-reaction mapping as a key source of inaccurate predictions. A machine learning approach further identified metabolic fluxes through hydrogen ion exchange and specific central metabolism branch points as important determinants of model accuracy. This work outlines improved practices for the assessment of GEM accuracy with high-throughput mutant fitness data and highlights promising areas for future model refinement in E. coli and beyond.
Collapse
Affiliation(s)
- David B Bernstein
- Department of Bioengineering, University of California, Berkeley, CA, USA
| | - Batu Akkas
- Department of Bioengineering, University of California, Berkeley, CA, USA
| | - Morgan N Price
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Adam P Arkin
- Department of Bioengineering, University of California, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| |
Collapse
|
22
|
Kaizu K, Takahashi K. Technologies for whole-cell modeling: Genome-wide reconstruction of a cell in silico. Dev Growth Differ 2023; 65:554-564. [PMID: 37856476 PMCID: PMC11520977 DOI: 10.1111/dgd.12897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 09/06/2023] [Accepted: 10/14/2023] [Indexed: 10/21/2023]
Abstract
With advances in high-throughput, large-scale in vivo measurement and genome modification techniques at the single-nucleotide level, there is an increasing demand for the development of new technologies for the flexible design and control of cellular systems. Computer-aided design is a powerful tool to design new cells. Whole-cell modeling aims to integrate various cellular subsystems, determine their interactions and cooperative mechanisms, and predict comprehensive cellular behaviors by computational simulations on a genome-wide scale. It has been applied to prokaryotes, yeasts, and higher eukaryotic cells, and utilized in a wide range of applications, including production of valuable substances, drug discovery, and controlled differentiation. Whole-cell modeling, consisting of several thousand elements with diverse scales and properties, requires innovative model construction, simulation, and analysis techniques. Furthermore, whole-cell modeling has been extended to multiple scales, including high-resolution modeling at the single-nucleotide and single-amino acid levels and multicellular modeling of tissues and organs. This review presents an overview of the current state of whole-cell modeling, discusses the novel computational and experimental technologies driving it, and introduces further developments toward multihierarchical modeling on a whole-genome scale.
Collapse
|
23
|
Georgouli K, Yeom JS, Blake RC, Navid A. Multi-scale models of whole cells: progress and challenges. Front Cell Dev Biol 2023; 11:1260507. [PMID: 38020904 PMCID: PMC10661945 DOI: 10.3389/fcell.2023.1260507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/19/2023] [Indexed: 12/01/2023] Open
Abstract
Whole-cell modeling is "the ultimate goal" of computational systems biology and "a grand challenge for 21st century" (Tomita, Trends in Biotechnology, 2001, 19(6), 205-10). These complex, highly detailed models account for the activity of every molecule in a cell and serve as comprehensive knowledgebases for the modeled system. Their scope and utility far surpass those of other systems models. In fact, whole-cell models (WCMs) are an amalgam of several types of "system" models. The models are simulated using a hybrid modeling method where the appropriate mathematical methods for each biological process are used to simulate their behavior. Given the complexity of the models, the process of developing and curating these models is labor-intensive and to date only a handful of these models have been developed. While whole-cell models provide valuable and novel biological insights, and to date have identified some novel biological phenomena, their most important contribution has been to highlight the discrepancy between available data and observations that are used for the parametrization and validation of complex biological models. Another realization has been that current whole-cell modeling simulators are slow and to run models that mimic more complex (e.g., multi-cellular) biosystems, those need to be executed in an accelerated fashion on high-performance computing platforms. In this manuscript, we review the progress of whole-cell modeling to date and discuss some of the ways that they can be improved.
Collapse
Affiliation(s)
- Konstantia Georgouli
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Jae-Seung Yeom
- Center for Applied Scientific Computing, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Robert C. Blake
- Center for Applied Scientific Computing, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Ali Navid
- Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| |
Collapse
|
24
|
Han Y, Li W, Filko A, Li J, Zhang F. Genome-wide promoter responses to CRISPR perturbations of regulators reveal regulatory networks in Escherichia coli. Nat Commun 2023; 14:5757. [PMID: 37717013 PMCID: PMC10505187 DOI: 10.1038/s41467-023-41572-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 09/08/2023] [Indexed: 09/18/2023] Open
Abstract
Elucidating genome-scale regulatory networks requires a comprehensive collection of gene expression profiles, yet measuring gene expression responses for every transcription factor (TF)-gene pair in living prokaryotic cells remains challenging. Here, we develop pooled promoter responses to TF perturbation sequencing (PPTP-seq) via CRISPR interference to address this challenge. Using PPTP-seq, we systematically measure the activity of 1372 Escherichia coli promoters under single knockdown of 183 TF genes, illustrating more than 200,000 possible TF-gene responses in one experiment. We perform PPTP-seq for E. coli growing in three different media. The PPTP-seq data reveal robust steady-state promoter activities under most single TF knockdown conditions. PPTP-seq also enables identifications of, to the best of our knowledge, previously unknown TF autoregulatory responses and complex transcriptional control on one-carbon metabolism. We further find context-dependent promoter regulation by multiple TFs whose relative binding strengths determined promoter activities. Additionally, PPTP-seq reveals different promoter responses in different growth media, suggesting condition-specific gene regulation. Overall, PPTP-seq provides a powerful method to examine genome-wide transcriptional regulatory networks and can be potentially expanded to reveal gene expression responses to other genetic elements.
Collapse
Affiliation(s)
- Yichao Han
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Wanji Li
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Alden Filko
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Jingyao Li
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA
| | - Fuzhong Zhang
- Department of Energy, Environmental and Chemical Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA.
- Division of Biological and Biomedical Sciences, Washington University in St. Louis, Saint Louis, Missouri, USA.
- Institute of Materials Science and Engineering, Washington University in St. Louis, Saint Louis, Missouri, USA.
| |
Collapse
|
25
|
van Lent P, Schmitz J, Abeel T. Simulated Design-Build-Test-Learn Cycles for Consistent Comparison of Machine Learning Methods in Metabolic Engineering. ACS Synth Biol 2023; 12:2588-2599. [PMID: 37616156 PMCID: PMC10510747 DOI: 10.1021/acssynbio.3c00186] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Indexed: 08/25/2023]
Abstract
Combinatorial pathway optimization is an important tool in metabolic flux optimization. Simultaneous optimization of a large number of pathway genes often leads to combinatorial explosions. Strain optimization is therefore often performed using iterative design-build-test-learn (DBTL) cycles. The aim of these cycles is to develop a product strain iteratively, every time incorporating learning from the previous cycle. Machine learning methods provide a potentially powerful tool to learn from data and propose new designs for the next DBTL cycle. However, due to the lack of a framework for consistently testing the performance of machine learning methods over multiple DBTL cycles, evaluating the effectiveness of these methods remains a challenge. In this work, we propose a mechanistic kinetic model-based framework to test and optimize machine learning for iterative combinatorial pathway optimization. Using this framework, we show that gradient boosting and random forest models outperform the other tested methods in the low-data regime. We demonstrate that these methods are robust for training set biases and experimental noise. Finally, we introduce an algorithm for recommending new designs using machine learning model predictions. We show that when the number of strains to be built is limited, starting with a large initial DBTL cycle is favorable over building the same number of strains for every cycle.
Collapse
Affiliation(s)
- Paul van Lent
- Delft
Bioinformatics Lab, Delft University of
Technology Van Mourik, Delft 2628 XE, The Netherlands
| | - Joep Schmitz
- Department
of Science and Research, Joep Schmitz -
dsm-firmenich, Science & Research, P.O. Box 1, 2600
MA Delft, The Netherlands
| | - Thomas Abeel
- Delft
Bioinformatics Lab, Delft University of
Technology Van Mourik, Delft 2628 XE, The Netherlands
- Infectious
Disease and Microbiome Program, Broad Institute
of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| |
Collapse
|
26
|
Akbari A, Haiman ZB, Palsson BO. A data-driven approach for timescale decomposition of biochemical reaction networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.21.554230. [PMID: 37662221 PMCID: PMC10473577 DOI: 10.1101/2023.08.21.554230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Understanding the dynamics of biological systems in evolving environments is a challenge due to their scale and complexity. Here, we present a computational framework for timescale decomposition of biochemical reaction networks to distill essential patterns from their intricate dynamics. This approach identifies timescale hierarchies, concentration pools, and coherent structures from time-series data, providing a system-level description of reaction networks at physiologically important timescales. We apply this technique to kinetic models of hypothetical and biological pathways, validating it by reproducing analytically characterized or previously known concentration pools of these pathways. Moreover, by analyzing the timescale hierarchy of the glycolytic pathway, we elucidate the connections between the stoichiometric and dissipative structures of reaction networks and the temporal organization of coherent structures. Specifically, we show that glycolysis is a cofactor driven pathway, the slowest dynamics of which are described by a balance between high-energy phosphate bond and redox trafficking. Overall, this approach provides more biologically interpretable characterizations of network dynamics than large-scale kinetic models, thus facilitating model reduction and personalized medicine applications.
Collapse
|
27
|
Gilbert BR, Thornburg ZR, Brier TA, Stevens JA, Grünewald F, Stone JE, Marrink SJ, Luthey-Schulten Z. Dynamics of chromosome organization in a minimal bacterial cell. Front Cell Dev Biol 2023; 11:1214962. [PMID: 37621774 PMCID: PMC10445541 DOI: 10.3389/fcell.2023.1214962] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 07/10/2023] [Indexed: 08/26/2023] Open
Abstract
Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication and inheritance of genetic material. By creating a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics, we investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell-cycle. To achieve cell-scale chromosome structures that are realistic, we model the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. In addition, the conformations of the circular DNA must avoid overlapping with ribosomes identitied in cryo-electron tomograms. While Syn3A lacks the complex regulatory systems known to orchestrate chromosome segregation in other bacteria, its minimized genome retains essential loop-extruding structural maintenance of chromosomes (SMC) protein complexes (SMC-scpAB) and topoisomerases. Through implementing the effects of these proteins in our simulations of replicating chromosomes, we find that they alone are sufficient for simultaneous chromosome segregation across all generations within nested theta structures. This supports previous studies suggesting loop-extrusion serves as a near-universal mechanism for chromosome organization within bacterial and eukaryotic cells. Furthermore, we analyze ribosome diffusion under the influence of the chromosome and calculate in silico chromosome contact maps that capture inter-daughter interactions. Finally, we present a methodology to map the polymer model of the chromosome to a Martini coarse-grained representation to prepare molecular dynamics models of entire Syn3A cells, which serves as an ultimate means of validation for cell states predicted by the WCM.
Collapse
Affiliation(s)
- Benjamin R. Gilbert
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
| | - Zane R. Thornburg
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
| | - Troy A. Brier
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
| | - Jan A. Stevens
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Fabian Grünewald
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - John E. Stone
- NVIDIA Corporation, Santa Clara, CA, United States
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, United States
| | - Siewert J. Marrink
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL, United States
- NSF Center for the Physics of Living Cells, Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, United States
| |
Collapse
|
28
|
Maheshwari AJ, Calles J, Waterton SK, Endy D. Engineering tRNA abundances for synthetic cellular systems. Nat Commun 2023; 14:4594. [PMID: 37524714 PMCID: PMC10390467 DOI: 10.1038/s41467-023-40199-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 07/13/2023] [Indexed: 08/02/2023] Open
Abstract
Routinizing the engineering of synthetic cells requires specifying beforehand how many of each molecule are needed. Physics-based tools for estimating desired molecular abundances in whole-cell synthetic biology are missing. Here, we use a colloidal dynamics simulator to make predictions for how tRNA abundances impact protein synthesis rates. We use rational design and direct RNA synthesis to make 21 synthetic tRNA surrogates from scratch. We use evolutionary algorithms within a computer aided design framework to engineer translation systems predicted to work faster or slower depending on tRNA abundance differences. We build and test the so-specified synthetic systems and find qualitative agreement between expected and observed systems. First principles modeling combined with bottom-up experiments can help molecular-to-cellular scale synthetic biology realize design-build-work frameworks that transcend tinker-and-test.
Collapse
Affiliation(s)
| | - Jonathan Calles
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Sean K Waterton
- Department of Biology, Stanford University, Stanford, CA, 94305, USA
| | - Drew Endy
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
29
|
Choi H, Covert MW. Whole-cell modeling of E. coli confirms that in vitro tRNA aminoacylation measurements are insufficient to support cell growth and predicts a positive feedback mechanism regulating arginine biosynthesis. Nucleic Acids Res 2023; 51:5911-5930. [PMID: 37224536 PMCID: PMC10325894 DOI: 10.1093/nar/gkad435] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 05/04/2023] [Accepted: 05/09/2023] [Indexed: 05/26/2023] Open
Abstract
In Escherichia coli, inconsistencies between in vitro tRNA aminoacylation measurements and in vivo protein synthesis demands were postulated almost 40 years ago, but have proven difficult to confirm. Whole-cell modeling can test whether a cell behaves in a physiologically correct manner when parameterized with in vitro measurements by providing a holistic representation of cellular processes in vivo. Here, a mechanistic model of tRNA aminoacylation, codon-based polypeptide elongation, and N-terminal methionine cleavage was incorporated into a developing whole-cell model of E. coli. Subsequent analysis confirmed the insufficiency of aminoacyl-tRNA synthetase kinetic measurements for cellular proteome maintenance, and estimated aminoacyl-tRNA synthetase kcats that were on average 7.6-fold higher. Simulating cell growth with perturbed kcats demonstrated the global impact of these in vitro measurements on cellular phenotypes. For example, an insufficient kcat for HisRS caused protein synthesis to be less robust to the natural variability in aminoacyl-tRNA synthetase expression in single cells. More surprisingly, insufficient ArgRS activity led to catastrophic impacts on arginine biosynthesis due to underexpressed N-acetylglutamate synthase, where translation depends on repeated CGG codons. Overall, the expanded E. coli model deepens understanding of how translation operates in an in vivo context.
Collapse
Affiliation(s)
- Heejo Choi
- Department of Bioengineering, Stanford University, 443 Via Ortega, Stanford, CA 94305, USA
| | - Markus W Covert
- Department of Bioengineering, Stanford University, 443 Via Ortega, Stanford, CA 94305, USA
| |
Collapse
|
30
|
Skalnik CJ, Cheah SY, Yang MY, Wolff MB, Spangler RK, Talman L, Morrison JH, Peirce SM, Agmon E, Covert MW. Whole-cell modeling of E. coli colonies enables quantification of single-cell heterogeneity in antibiotic responses. PLoS Comput Biol 2023; 19:e1011232. [PMID: 37327241 DOI: 10.1371/journal.pcbi.1011232] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 06/01/2023] [Indexed: 06/18/2023] Open
Abstract
Antibiotic resistance poses mounting risks to human health, as current antibiotics are losing efficacy against increasingly resistant pathogenic bacteria. Of particular concern is the emergence of multidrug-resistant strains, which has been rapid among Gram-negative bacteria such as Escherichia coli. A large body of work has established that antibiotic resistance mechanisms depend on phenotypic heterogeneity, which may be mediated by stochastic expression of antibiotic resistance genes. The link between such molecular-level expression and the population levels that result is complex and multi-scale. Therefore, to better understand antibiotic resistance, what is needed are new mechanistic models that reflect single-cell phenotypic dynamics together with population-level heterogeneity, as an integrated whole. In this work, we sought to bridge single-cell and population-scale modeling by building upon our previous experience in "whole-cell" modeling, an approach which integrates mathematical and mechanistic descriptions of biological processes to recapitulate the experimentally observed behaviors of entire cells. To extend whole-cell modeling to the "whole-colony" scale, we embedded multiple instances of a whole-cell E. coli model within a model of a dynamic spatial environment, allowing us to run large, parallelized simulations on the cloud that contained all the molecular detail of the previous whole-cell model and many interactive effects of a colony growing in a shared environment. The resulting simulations were used to explore the response of E. coli to two antibiotics with different mechanisms of action, tetracycline and ampicillin, enabling us to identify sub-generationally-expressed genes, such as the beta-lactamase ampC, which contributed greatly to dramatic cellular differences in steady-state periplasmic ampicillin and was a significant factor in determining cell survival.
Collapse
Affiliation(s)
- Christopher J Skalnik
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Sean Y Cheah
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Mica Y Yang
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Mattheus B Wolff
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Ryan K Spangler
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Lee Talman
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Jerry H Morrison
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Shayn M Peirce
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| | - Eran Agmon
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, Farmington, Connecticut, United States of America
| | - Markus W Covert
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| |
Collapse
|
31
|
Nikolados EM, Oyarzún DA. Deep learning for optimization of protein expression. Curr Opin Biotechnol 2023; 81:102941. [PMID: 37087839 DOI: 10.1016/j.copbio.2023.102941] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 02/02/2023] [Accepted: 03/17/2023] [Indexed: 04/25/2023]
Abstract
Recent progress in high-throughput DNA synthesis and sequencing has enabled the development of massively parallel reporter assays for strain characterization. These datasets map a large number of DNA sequences to protein expression levels, sparking increased interest in data-driven methods for sequence-to-expression modeling. Here, we highlight advances in deep learning models of protein expression and their potential for optimizing strains engineered to produce recombinant proteins. We review recent works that built highly accurate models and discuss challenges that hinder adoption by end users. There is a need to better align this technology with the constraints encountered in strain engineering, particularly the cost of acquiring large amounts of data and the requirement for interpretable models that generalize beyond the training data. Overcoming these barriers will help to incentivize academic and industrial laboratories to tap into a new era of data-centric strain engineering.
Collapse
Affiliation(s)
| | - Diego A Oyarzún
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK; School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK; The Alan Turing Institute, London NW1 2DB, UK.
| |
Collapse
|
32
|
Sanders LM, Scott RT, Yang JH, Qutub AA, Garcia Martin H, Berrios DC, Hastings JJA, Rask J, Mackintosh G, Hoarfrost AL, Chalk S, Kalantari J, Khezeli K, Antonsen EL, Babdor J, Barker R, Baranzini SE, Beheshti A, Delgado-Aparicio GM, Glicksberg BS, Greene CS, Haendel M, Hamid AA, Heller P, Jamieson D, Jarvis KJ, Komarova SV, Komorowski M, Kothiyal P, Mahabal A, Manor U, Mason CE, Matar M, Mias GI, Miller J, Myers JG, Nelson C, Oribello J, Park SM, Parsons-Wingerter P, Prabhu RK, Reynolds RJ, Saravia-Butler A, Saria S, Sawyer A, Singh NK, Snyder M, Soboczenski F, Soman K, Theriot CA, Van Valen D, Venkateswaran K, Warren L, Worthey L, Zitnik M, Costes SV. Biological research and self-driving labs in deep space supported by artificial intelligence. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00618-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
33
|
Kim GB, Choi SY, Cho IJ, Ahn DH, Lee SY. Metabolic engineering for sustainability and health. Trends Biotechnol 2023; 41:425-451. [PMID: 36635195 DOI: 10.1016/j.tibtech.2022.12.014] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 12/17/2022] [Accepted: 12/21/2022] [Indexed: 01/12/2023]
Abstract
Bio-based production of chemicals and materials has attracted much attention due to the urgent need to establish sustainability and enhance human health. Metabolic engineering (ME) allows purposeful modification of cellular metabolic, regulatory, and signaling networks to achieve enhanced production of desired chemicals and degradation of environmentally harmful chemicals. ME has significantly progressed over the past 30 years through further integration of the strategies of synthetic biology, systems biology, evolutionary engineering, and data science aided by artificial intelligence. Here we review the field of ME from its emergence to the current state-of-the-art, highlighting its contribution to sustainable production of chemicals, health, and the environment through representative examples. Future challenges of ME and perspectives are also discussed.
Collapse
Affiliation(s)
- Gi Bae Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - So Young Choi
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - In Jin Cho
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Da-Hee Ahn
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Sang Yup Lee
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea.
| |
Collapse
|
34
|
Colloidal Physics Modeling Reveals How Per-Ribosome Productivity Increases with Growth Rate in Escherichia coli. mBio 2023; 14:e0286522. [PMID: 36537810 PMCID: PMC9973364 DOI: 10.1128/mbio.02865-22] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Faster-growing cells must synthesize proteins more quickly. Increased ribosome abundance only partly accounts for increases in total protein synthesis rates. The productivity of individual ribosomes must increase too, almost doubling by an unknown mechanism. Prior models point to diffusive transport as a limiting factor but raise a paradox: faster-growing cells are more crowded, yet crowding slows diffusion. We suspected that physical crowding, transport, and stoichiometry, considered together, might reveal a more nuanced explanation. To investigate, we built a first-principles physics-based model of Escherichia coli cytoplasm in which Brownian motion and diffusion arise directly from physical interactions between individual molecules of finite size, density, and physiological abundance. Using our microscopically detailed model, we predicted that physical transport of individual ternary complexes accounts for ~80% of translation elongation latency. We also found that volumetric crowding increases during faster growth even as cytoplasmic mass density remains relatively constant. Despite slowed diffusion, we predicted that improved proximity between ternary complexes and ribosomes wins out, illustrating a simple physics-based mechanism for how individual elongating ribosomes become more productive. We speculate that crowding imposes a physical limit on growth rate and undergirds cellular behavior more broadly. Unfitted colloidal-scale modeling offers systems biology a complementary "physics engine" for exploring how cellular-scale behaviors arise from physical transport and reactions among individual molecules. IMPORTANCE Ribosomes are the factories in cells that synthesize proteins. When cells grow faster, there are not enough ribosomes to keep up with the demand for faster protein synthesis without individual ribosomes becoming more productive. Yet, faster-growing cells are more crowded, seemingly making it harder for each ribosome to do its work. Our computational model of the physics of translation elongation reveals the underlying mechanism for how individual ribosomes become more productive: proximity and stoichiometry of translation molecules overcome crowding. Our model also suggests a universal physical limitation of cell growth rates.
Collapse
|
35
|
Bi X, Cheng Y, Xu X, Lv X, Liu Y, Li J, Du G, Chen J, Ledesma-Amaro R, Liu L. etiBsu1209: A comprehensive multiscale metabolic model for Bacillus subtilis. Biotechnol Bioeng 2023; 120:1623-1639. [PMID: 36788025 DOI: 10.1002/bit.28355] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 12/08/2022] [Accepted: 02/13/2023] [Indexed: 02/16/2023]
Abstract
Genome-scale metabolic models (GEMs) have been widely used to guide the computational design of microbial cell factories, and to date, seven GEMs have been reported for Bacillus subtilis, a model gram-positive microorganism widely used in bioproduction of functional nutraceuticals and food ingredients. However, none of them are widely used because they often lead to erroneous predictions due to their low predictive power and lack of information on regulatory mechanisms. In this work, we constructed a new version of GEM for B. subtilis (iBsu1209), which contains 1209 genes, 1595 metabolites, and 1948 reactions. We applied machine learning to fill gaps, which formed a relatively complete metabolic network able to predict with high accuracy (89.3%) the growth of 1209 mutants under 12 different culture conditions. In addition, we developed a visualization and code-free software, Model Tool, for multiconstraints model reconstruction and analysis. We used this software to construct etiBsu1209, a multiscale model that integrates enzymatic constraints, thermodynamic constraints, and transcriptional regulatory networks. Furthermore, we used etiBsu1209 to guide a metabolic engineering strategy (knocking out fabI and yfkN genes) for the overproduction of nutraceutical menaquinone-7, and the titer increased to 153.94 mg/L, 2.2-times that of the parental strain. To the best of our knowledge, etiBsu1209 is the first comprehensive multiscale model for B. subtilis and can serve as a solid basis for rational computational design of B. subtilis cell factories for bioproduction.
Collapse
Affiliation(s)
- Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Yang Cheng
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Xianhao Xu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Guocheng Du
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | - Jian Chen
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| | | | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.,Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi, China
| |
Collapse
|
36
|
Understanding How Cells Probe the World: A Preliminary Step towards Modeling Cell Behavior? Int J Mol Sci 2023; 24:ijms24032266. [PMID: 36768586 PMCID: PMC9916635 DOI: 10.3390/ijms24032266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 01/16/2023] [Accepted: 01/20/2023] [Indexed: 01/26/2023] Open
Abstract
Cell biologists have long aimed at quantitatively modeling cell function. Recently, the outstanding progress of high-throughput measurement methods and data processing tools has made this a realistic goal. The aim of this paper is twofold: First, to suggest that, while much progress has been done in modeling cell states and transitions, current accounts of environmental cues driving these transitions remain insufficient. There is a need to provide an integrated view of the biochemical, topographical and mechanical information processed by cells to take decisions. It might be rewarding in the near future to try to connect cell environmental cues to physiologically relevant outcomes rather than modeling relationships between these cues and internal signaling networks. The second aim of this paper is to review exogenous signals that are sensed by living cells and significantly influence fate decisions. Indeed, in addition to the composition of the surrounding medium, cells are highly sensitive to the properties of neighboring surfaces, including the spatial organization of anchored molecules and substrate mechanical and topographical properties. These properties should thus be included in models of cell behavior. It is also suggested that attempts at cell modeling could strongly benefit from two research lines: (i) trying to decipher the way cells encode the information they retrieve from environment analysis, and (ii) developing more standardized means of assessing the quality of proposed models, as was done in other research domains such as protein structure prediction.
Collapse
|
37
|
Stevens JA, Grünewald F, van Tilburg PAM, König M, Gilbert BR, Brier TA, Thornburg ZR, Luthey-Schulten Z, Marrink SJ. Molecular dynamics simulation of an entire cell. Front Chem 2023; 11:1106495. [PMID: 36742032 PMCID: PMC9889929 DOI: 10.3389/fchem.2023.1106495] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 01/09/2023] [Indexed: 01/19/2023] Open
Abstract
The ultimate microscope, directed at a cell, would reveal the dynamics of all the cell's components with atomic resolution. In contrast to their real-world counterparts, computational microscopes are currently on the brink of meeting this challenge. In this perspective, we show how an integrative approach can be employed to model an entire cell, the minimal cell, JCVI-syn3A, at full complexity. This step opens the way to interrogate the cell's spatio-temporal evolution with molecular dynamics simulations, an approach that can be extended to other cell types in the near future.
Collapse
Affiliation(s)
- Jan A. Stevens
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Fabian Grünewald
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - P. A. Marco van Tilburg
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Melanie König
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Benjamin R. Gilbert
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Champaign, IL, United States
| | - Troy A. Brier
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Champaign, IL, United States
| | - Zane R. Thornburg
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Champaign, IL, United States
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Champaign, IL, United States
| | - Siewert J. Marrink
- Molecular Dynamics Group, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| |
Collapse
|
38
|
Wu K, Mao Z, Mao Y, Niu J, Cai J, Yuan Q, Yun L, Liao X, Wang Z, Ma H. ecBSU1: A Genome-Scale Enzyme-Constrained Model of Bacillus subtilis Based on the ECMpy Workflow. Microorganisms 2023; 11:microorganisms11010178. [PMID: 36677469 PMCID: PMC9864840 DOI: 10.3390/microorganisms11010178] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/24/2022] [Accepted: 01/05/2023] [Indexed: 01/13/2023] Open
Abstract
Genome-scale metabolic models (GEMs) play an important role in the phenotype prediction of microorganisms, and their accuracy can be further improved by integrating other types of biological data such as enzyme concentrations and kinetic coefficients. Enzyme-constrained models (ecModels) have been constructed for several species and were successfully applied to increase the production of commodity chemicals. However, there was still no genome-scale ecModel for the important model organism Bacillus subtilis prior to this study. Here, we integrated enzyme kinetic and proteomic data to construct the first genome-scale ecModel of B. subtilis (ecBSU1) using the ECMpy workflow. We first used ecBSU1 to simulate overflow metabolism and explore the trade-off between biomass yield and enzyme usage efficiency. Next, we simulated the growth rate on eight previously published substrates and found that the simulation results of ecBSU1 were in good agreement with the literature. Finally, we identified target genes that enhance the yield of commodity chemicals using ecBSU1, most of which were consistent with the experimental data, and some of which may be potential novel targets for metabolic engineering. This work demonstrates that the integration of enzymatic constraints is an effective method to improve the performance of GEMs. The ecModel can predict overflow metabolism more precisely and can be used for the identification of target genes to guide the rational design of microbial cell factories.
Collapse
Affiliation(s)
- Ke Wu
- Key Laboratory of Systems Bioengineering (Ministry of Education), Frontier Science Center for Synthetic Biology (Ministry of Education), Department of Biochemical Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Zhitao Mao
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Yufeng Mao
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Jinhui Niu
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Jingyi Cai
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Qianqian Yuan
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Lili Yun
- Tianjin Medical Laboratory, BGI-Tianjin, BGI-Shenzhen, Tianjin 300308, China
| | - Xiaoping Liao
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
| | - Zhiwen Wang
- Key Laboratory of Systems Bioengineering (Ministry of Education), Frontier Science Center for Synthetic Biology (Ministry of Education), Department of Biochemical Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
- Correspondence: (Z.W.); (H.M.)
| | - Hongwu Ma
- Biodesign Center, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- National Technology Innovation Center of Synthetic Biology, Tianjin 300308, China
- Correspondence: (Z.W.); (H.M.)
| |
Collapse
|
39
|
Beer RD, Di Paolo EA. The theoretical foundations of enaction: Precariousness. Biosystems 2023; 223:104823. [PMID: 36574923 DOI: 10.1016/j.biosystems.2022.104823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 11/28/2022] [Accepted: 12/14/2022] [Indexed: 12/25/2022]
Abstract
Enaction is an increasingly influential approach to cognition that grew out of Maturana and Varela's earlier work on autopoiesis and the biology of cognition. As with any relatively new scientific discipline, the enactive approach would benefit greatly from a careful analysis of its theoretical foundations. Here we initiate such an analysis for one of the core concepts of enaction, precariousness. Specifically, we consider three types of fragility: systemic, processual and thermodynamic. Using a glider in the Game of Life as a toy model, we illustrate each of these fragilities and examine the relationships between them. We also argue that each type of fragility is characterized by which aspects of a system are hardwired into its definition from the outset and which aspects are emergent and hence vulnerable to disintegration without ongoing maintenance.
Collapse
Affiliation(s)
- Randall D Beer
- Cognitive Science Program, Luddy School of Informatics, Computing and Engineering, Indiana University, USA.
| | - Ezequiel A Di Paolo
- Ikerbasque, Basque Foundation for Science, Bizkaia, Spain; IAS-Research Center for Life, Mind and Society, University of the Basque Country, Donostia, Spain; Department of Informatics, University of Sussex, Brighton, UK
| |
Collapse
|
40
|
Gopalakrishnan S, Joshi CJ, Valderrama-Gómez MÁ, Icten E, Rolandi P, Johnson W, Kontoravdi C, Lewis NE. Guidelines for extracting biologically relevant context-specific metabolic models using gene expression data. Metab Eng 2023; 75:181-191. [PMID: 36566974 PMCID: PMC10258867 DOI: 10.1016/j.ymben.2022.12.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 12/01/2022] [Accepted: 12/17/2022] [Indexed: 12/24/2022]
Abstract
Genome-scale metabolic models comprehensively describe an organism's metabolism and can be tailored using omics data to model condition-specific physiology. The quality of context-specific models is impacted by (i) choice of algorithm and parameters and (ii) alternate context-specific models that equally explain the -omics data. Here we quantify the influence of alternate optima on microbial and mammalian model extraction using GIMME, iMAT, MBA, and mCADRE. We find that metabolic tasks defining an organism's phenotype must be explicitly and quantitatively protected. The scope of alternate models is strongly influenced by algorithm choice and the topological properties of the parent genome-scale model with fatty acid metabolism and intracellular metabolite transport contributing much to alternate solutions in all models. mCADRE extracted the most reproducible context-specific models and models generated using MBA had the most alternate solutions. There were fewer qualitatively different solutions generated by GIMME in E. coli, but these increased substantially in the mammalian models. Screening ensembles using a receiver operating characteristic plot identified the best-performing models. A comprehensive evaluation of models extracted using combinations of extraction methods and expression thresholds revealed that GIMME generated the best-performing models in E. coli, whereas mCADRE is better suited for complex mammalian models. These findings suggest guidelines for benchmarking -omics integration algorithms and motivate the development of a systematic workflow to enumerate alternate models and extract biologically relevant context-specific models.
Collapse
Affiliation(s)
| | - Chintan J Joshi
- Department of Pediatrics, University of California San Diego, United States
| | | | - Elcin Icten
- Digital Integration and Predictive Technologies, Amgen Inc, United States
| | - Pablo Rolandi
- Digital Integration and Predictive Technologies, Amgen Inc, United States
| | - William Johnson
- Digital Integration and Predictive Technologies, Amgen Inc, United States
| | - Cleo Kontoravdi
- Department of Chemical Engineering, Imperial College London, UK
| | - Nathan E Lewis
- Department of Pediatrics, University of California San Diego, United States; Department of Bioengineering, University of California San Diego, United States.
| |
Collapse
|
41
|
Han B, Dai Z, Li Z. Computer-Based Design of a Cell Factory for High-Yield Cytidine Production. ACS Synth Biol 2022; 11:4123-4133. [PMID: 36442151 DOI: 10.1021/acssynbio.2c00431] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Pyrimidine ribonucleotide de novo biosynthesis pathway (PRdnBP) is an important pathway to produce pyrimidine nucleosides. We attempted to systematically investigate PRdnBP in Escherichia coli with genome-scale metabolic models and utilized the models to guide strain design. The balance of central carbon metabolism and PRdnBP affected the production of cytidine from glucose. Using Bayesian metabolic flux analysis, the effect of modified PRdnBP on the metabolic network was analyzed. The acetate overflow became coupled with PRdnBP flux, while they were originally independent under oxygen-sufficient conditions. The coupling between cytidine production and acetate secretion in the modified strain was weakened by arcA deletion, which resulted in further improving the efficient accumulation of cytidine. In total, 1.28 g/L of cytidine with a yield of 0.26 g/g glucose was produced. The yield of cytidine produced by E. coli is higher than previous reports. Our strategy provides an effective attempt to find metabolic bottlenecks in genetically engineered bacteria by using flux coupling analysis.
Collapse
Affiliation(s)
- Bin Han
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai200237, China
| | - Zeyu Dai
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai200237, China
| | - Zhimin Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai200237, China.,Shanghai Collaborative Innovation Center for Biomanufacturing Technology, 130 Meilong Road, Shanghai200237, China
| |
Collapse
|
42
|
Favate JS, Liang S, Cope AL, Yadavalli SS, Shah P. The landscape of transcriptional and translational changes over 22 years of bacterial adaptation. eLife 2022; 11:e81979. [PMID: 36214449 PMCID: PMC9645810 DOI: 10.7554/elife.81979] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 10/07/2022] [Indexed: 12/31/2022] Open
Abstract
Organisms can adapt to an environment by taking multiple mutational paths. This redundancy at the genetic level, where many mutations have similar phenotypic and fitness effects, can make untangling the molecular mechanisms of complex adaptations difficult. Here, we use the Escherichia coli long-term evolution experiment (LTEE) as a model to address this challenge. To understand how different genomic changes could lead to parallel fitness gains, we characterize the landscape of transcriptional and translational changes across 12 replicate populations evolving in parallel for 50,000 generations. By quantifying absolute changes in mRNA abundances, we show that not only do all evolved lines have more mRNAs but that this increase in mRNA abundance scales with cell size. We also find that despite few shared mutations at the genetic level, clones from replicate populations in the LTEE are remarkably similar in their gene expression patterns at both the transcriptional and translational levels. Furthermore, we show that the majority of the expression changes are due to changes at the transcriptional level with very few translational changes. Finally, we show how mutations in transcriptional regulators lead to consistent and parallel changes in the expression levels of downstream genes. These results deepen our understanding of the molecular mechanisms underlying complex adaptations and provide insights into the repeatability of evolution.
Collapse
Affiliation(s)
- John S Favate
- Department of Genetics, Rutgers UniversityPiscatawayUnited States
| | - Shun Liang
- Department of Genetics, Rutgers UniversityPiscatawayUnited States
| | - Alexander L Cope
- Department of Genetics, Rutgers UniversityPiscatawayUnited States
- Robert Wood Johnson Medical School, Rutgers UniversityNew BrunswickUnited States
| | - Srujana S Yadavalli
- Department of Genetics, Rutgers UniversityPiscatawayUnited States
- Waksman Institute, Rutgers UniversityPiscatawayUnited States
| | - Premal Shah
- Department of Genetics, Rutgers UniversityPiscatawayUnited States
- Human Genetics Institute of New Jersey, Rutgers UniversityPiscatawayUnited States
| |
Collapse
|
43
|
Howard-Varona C, Roux S, Bowen BP, Silva LP, Lau R, Schwenck SM, Schwartz S, Woyke T, Northen T, Sullivan MB, Floge SA. Protist impacts on marine cyanovirocell metabolism. ISME COMMUNICATIONS 2022; 2:94. [PMID: 37938263 PMCID: PMC9723779 DOI: 10.1038/s43705-022-00169-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 08/25/2022] [Accepted: 09/06/2022] [Indexed: 07/26/2023]
Abstract
The fate of oceanic carbon and nutrients depends on interactions between viruses, prokaryotes, and unicellular eukaryotes (protists) in a highly interconnected planktonic food web. To date, few controlled mechanistic studies of these interactions exist, and where they do, they are largely pairwise, focusing either on viral infection (i.e., virocells) or protist predation. Here we studied population-level responses of Synechococcus cyanobacterial virocells (i.e., cyanovirocells) to the protist Oxyrrhis marina using transcriptomics, endo- and exo-metabolomics, photosynthetic efficiency measurements, and microscopy. Protist presence had no measurable impact on Synechococcus transcripts or endometabolites. The cyanovirocells alone had a smaller intracellular transcriptional and metabolic response than cyanovirocells co-cultured with protists, displaying known patterns of virus-mediated metabolic reprogramming while releasing diverse exometabolites during infection. When protists were added, several exometabolites disappeared, suggesting microbial consumption. In addition, the intracellular cyanovirocell impact was largest, with 4.5- and 10-fold more host transcripts and endometabolites, respectively, responding to protists, especially those involved in resource and energy production. Physiologically, photosynthetic efficiency also increased, and together with the transcriptomics and metabolomics findings suggest that cyanovirocell metabolic demand is highest when protists are present. These data illustrate cyanovirocell responses to protist presence that are not yet considered when linking microbial physiology to global-scale biogeochemical processes.
Collapse
Affiliation(s)
| | - Simon Roux
- Department of Microbiology, The Ohio State University, Columbus, OH, USA
- U.S. DOE Joint Genome Institute, Berkeley, CA, USA
| | | | - Leslie P Silva
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Syft Technologies, Ltd, Christchurch, 8024, New Zealand
| | - Rebecca Lau
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Cellular and Molecular Medicine and Biomedical Sciences Graduate Program, University of California San Diego, La Jolla, CA, USA
| | - Sarah M Schwenck
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
- Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, USA
- Microbial and Environmental Genomics, J. Craig Venter Institute, La Jolla, CA, USA
| | - Samuel Schwartz
- Department of Biology, Wake Forest University, Winston Salem, NC, USA
| | - Tanja Woyke
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- U.S. DOE Joint Genome Institute, Berkeley, CA, USA
| | - Trent Northen
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- U.S. DOE Joint Genome Institute, Berkeley, CA, USA
| | - Matthew B Sullivan
- Department of Microbiology, The Ohio State University, Columbus, OH, USA.
- Department of Civil, Environmental and Geodetic Engineering, and Center of Microbiome Science, The Ohio State University, Columbus, OH, USA.
| | - Sheri A Floge
- Department of Biology, Wake Forest University, Winston Salem, NC, USA.
| |
Collapse
|
44
|
Past, Present, and Future of Genome Modification in Escherichia coli. Microorganisms 2022; 10:microorganisms10091835. [PMID: 36144436 PMCID: PMC9504249 DOI: 10.3390/microorganisms10091835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 09/05/2022] [Accepted: 09/05/2022] [Indexed: 12/04/2022] Open
Abstract
Escherichia coli K-12 is one of the most well-studied species of bacteria. This species, however, is much more difficult to modify by homologous recombination (HR) than other model microorganisms. Research on HR in E. coli has led to a better understanding of the molecular mechanisms of HR, resulting in technical improvements and rapid progress in genome research, and allowing whole-genome mutagenesis and large-scale genome modifications. Developments using λ Red (exo, bet, and gam) and CRISPR-Cas have made E. coli as amenable to genome modification as other model microorganisms, such as Saccharomyces cerevisiae and Bacillus subtilis. This review describes the history of recombination research in E. coli, as well as improvements in techniques for genome modification by HR. This review also describes the results of large-scale genome modification of E. coli using these technologies, including DNA synthesis and assembly. In addition, this article reviews recent advances in genome modification, considers future directions, and describes problems associated with the creation of cells by design.
Collapse
|
45
|
Xing J. Reconstructing data-driven governing equations for cell phenotypic transitions: integration of data science and systems biology. Phys Biol 2022; 19:10.1088/1478-3975/ac8c16. [PMID: 35998617 PMCID: PMC9585661 DOI: 10.1088/1478-3975/ac8c16] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 08/23/2022] [Indexed: 11/11/2022]
Abstract
Cells with the same genome can exist in different phenotypes and can change between distinct phenotypes when subject to specific stimuli and microenvironments. Some examples include cell differentiation during development, reprogramming for induced pluripotent stem cells and transdifferentiation, cancer metastasis and fibrosis progression. The regulation and dynamics of cell phenotypic conversion is a fundamental problem in biology, and has a long history of being studied within the formalism of dynamical systems. A main challenge for mechanism-driven modeling studies is acquiring sufficient amount of quantitative information for constraining model parameters. Advances in quantitative experimental approaches, especially high throughput single-cell techniques, have accelerated the emergence of a new direction for reconstructing the governing dynamical equations of a cellular system from quantitative single-cell data, beyond the dominant statistical approaches. Here I review a selected number of recent studies using live- and fixed-cell data and provide my perspective on future development.
Collapse
Affiliation(s)
- Jianhua Xing
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15232, USA
- Department of Physics and Astronomy, University of Pittsburgh, Pittsburgh, PA 15232, USA
- UPMC-Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
46
|
Xing J. Reconstructing data-driven governing equations for cell phenotypic transitions: integration of data science and systems biology. Phys Biol 2022. [PMID: 35998617 DOI: 10.48550/arxiv.2203.14964] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Cells with the same genome can exist in different phenotypes and can change between distinct phenotypes when subject to specific stimuli and microenvironments. Some examples include cell differentiation during development, reprogramming for induced pluripotent stem cells and transdifferentiation, cancer metastasis and fibrosis progression. The regulation and dynamics of cell phenotypic conversion is a fundamental problem in biology, and has a long history of being studied within the formalism of dynamical systems. A main challenge for mechanism-driven modeling studies is acquiring sufficient amount of quantitative information for constraining model parameters. Advances in quantitative experimental approaches, especially high throughput single-cell techniques, have accelerated the emergence of a new direction for reconstructing the governing dynamical equations of a cellular system from quantitative single-cell data, beyond the dominant statistical approaches. Here I review a selected number of recent studies using live- and fixed-cell data and provide my perspective on future development.
Collapse
Affiliation(s)
- Jianhua Xing
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15232, United States of America.,Department of Physics and Astronomy, University of Pittsburgh, Pittsburgh, PA 15232, United States of America.,UPMC-Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, United States of America
| |
Collapse
|
47
|
Gawthrop PJ, Pan M. Network thermodynamics of biological systems: A bond graph approach. Math Biosci 2022; 352:108899. [PMID: 36057321 DOI: 10.1016/j.mbs.2022.108899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 08/26/2022] [Accepted: 08/26/2022] [Indexed: 10/14/2022]
Abstract
Edmund Crampin (1973-2021) was at the forefront of Systems Biology research and his work will influence the field for years to come. This paper brings together and summarises the seminal work of his group in applying energy-based bond graph methods to biological systems. In particular, this paper: (a) motivates the need to consider energy in modelling biology; (b) introduces bond graphs as a methodology for achieving this; (c) describes extensions to modelling electrochemical transduction; (d) outlines how bond graph models can be constructed in a modular manner and (e) describes stoichiometric approaches to deriving fundamental properties of reaction networks. These concepts are illustrated using a new bond graph model of photosynthesis in chloroplasts.
Collapse
Affiliation(s)
- Peter J Gawthrop
- Systems Biology Laboratory, School of Mathematics and Statistics, and Department of Biomedical Engineering, University of Melbourne, Victoria 3010, Australia.
| | - Michael Pan
- Systems Biology Laboratory, School of Mathematics and Statistics, and Department of Biomedical Engineering, University of Melbourne, Victoria 3010, Australia; School of Mathematics and Statistics, University of Melbourne, Victoria 3010, Australia
| |
Collapse
|
48
|
Integrative modeling of the cell. Acta Biochim Biophys Sin (Shanghai) 2022; 54:1213-1221. [PMID: 36017893 PMCID: PMC9909318 DOI: 10.3724/abbs.2022115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
A whole-cell model represents certain aspects of the cell structure and/or function. Due to the high complexity of the cell, an integrative modeling approach is often taken to utilize all available information including experimental data, prior knowledge and prior models. In this review, we summarize an emerging workflow of whole-cell modeling into five steps: (i) gather information; (ii) represent the modeled system into modules; (iii) translate input information into scoring function; (iv) sample the whole-cell model; (v) validate and interpret the model. In particular, we propose the integrative modeling of the cell by combining available (whole-cell) models to maximize the accuracy, precision, and completeness. In addition, we list quantitative predictions of various aspects of cell biology from existing whole-cell models. Moreover, we discuss the remaining challenges and future directions, and highlight the opportunity to establish an integrative spatiotemporal multi-scale whole-cell model based on a community approach.
Collapse
|
49
|
Ahn-Horst TA, Mille LS, Sun G, Morrison JH, Covert MW. An expanded whole-cell model of E. coli links cellular physiology with mechanisms of growth rate control. NPJ Syst Biol Appl 2022; 8:30. [PMID: 35986058 PMCID: PMC9391491 DOI: 10.1038/s41540-022-00242-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 07/28/2022] [Indexed: 11/09/2022] Open
Abstract
Growth and environmental responses are essential for living organisms to survive and adapt to constantly changing environments. In order to simulate new conditions and capture dynamic responses to environmental shifts in a developing whole-cell model of E. coli, we incorporated additional regulation, including dynamics of the global regulator guanosine tetraphosphate (ppGpp), along with dynamics of amino acid biosynthesis and translation. With the model, we show that under perturbed ppGpp conditions, small molecule feedback inhibition pathways, in addition to regulation of expression, play a role in ppGpp regulation of growth. We also found that simulations with dysregulated amino acid synthesis pathways provide average amino acid concentration predictions that are comparable to experimental results but on the single-cell level, concentrations unexpectedly show regular fluctuations. Additionally, during both an upshift and downshift in nutrient availability, the simulated cell responds similarly with a transient increase in the mRNA:rRNA ratio. This additional simulation functionality should support a variety of new applications and expansions of the E. coli Whole-Cell Modeling Project.
Collapse
Affiliation(s)
- Travis A Ahn-Horst
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | | | - Gwanggyu Sun
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Jerry H Morrison
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Markus W Covert
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
50
|
Erdem C, Mutsuddy A, Bensman EM, Dodd WB, Saint-Antoine MM, Bouhaddou M, Blake RC, Gross SM, Heiser LM, Feltus FA, Birtwistle MR. A scalable, open-source implementation of a large-scale mechanistic model for single cell proliferation and death signaling. Nat Commun 2022; 13:3555. [PMID: 35729113 PMCID: PMC9213456 DOI: 10.1038/s41467-022-31138-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 06/07/2022] [Indexed: 02/01/2023] Open
Abstract
Mechanistic models of how single cells respond to different perturbations can help integrate disparate big data sets or predict response to varied drug combinations. However, the construction and simulation of such models have proved challenging. Here, we developed a python-based model creation and simulation pipeline that converts a few structured text files into an SBML standard and is high-performance- and cloud-computing ready. We applied this pipeline to our large-scale, mechanistic pan-cancer signaling model (named SPARCED) and demonstrate it by adding an IFNγ pathway submodel. We then investigated whether a putative crosstalk mechanism could be consistent with experimental observations from the LINCS MCF10A Data Cube that IFNγ acts as an anti-proliferative factor. The analyses suggested this observation can be explained by IFNγ-induced SOCS1 sequestering activated EGF receptors. This work forms a foundational recipe for increased mechanistic model-based data integration on a single-cell level, an important building block for clinically-predictive mechanistic models.
Collapse
Affiliation(s)
- Cemal Erdem
- Department of Chemical & Biomolecular Engineering, Clemson University, Clemson, SC, USA.
| | - Arnab Mutsuddy
- Department of Chemical & Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Ethan M Bensman
- Computer Science, School of Computing, Clemson University, Clemson, SC, USA
| | - William B Dodd
- Department of Chemical & Biomolecular Engineering, Clemson University, Clemson, SC, USA
| | - Michael M Saint-Antoine
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
| | - Mehdi Bouhaddou
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA, USA
| | - Robert C Blake
- Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA, USA
| | - Sean M Gross
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA
| | - Laura M Heiser
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR, USA
| | - F Alex Feltus
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, USA
- Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC, USA
- Center for Human Genetics, Clemson University, Clemson, SC, USA
| | - Marc R Birtwistle
- Department of Chemical & Biomolecular Engineering, Clemson University, Clemson, SC, USA.
- Department of Bioengineering, Clemson University, Clemson, SC, USA.
| |
Collapse
|