1
|
Cardiff RAL, Carothers JM, Zalatan JG, Sauro HM. Systems-Level Modeling for CRISPR-Based Metabolic Engineering. ACS Synth Biol 2024; 13:2643-2652. [PMID: 39119666 DOI: 10.1021/acssynbio.4c00053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2024]
Abstract
The CRISPR-Cas system has enabled the development of sophisticated, multigene metabolic engineering programs through the use of guide RNA-directed activation or repression of target genes. To optimize biosynthetic pathways in microbial systems, we need improved models to inform design and implementation of transcriptional programs. Recent progress has resulted in new modeling approaches for identifying gene targets and predicting the efficacy of guide RNA targeting. Genome-scale and flux balance models have successfully been applied to identify targets for improving biosynthetic production yields using combinatorial CRISPR-interference (CRISPRi) programs. The advent of new approaches for tunable and dynamic CRISPR activation (CRISPRa) promises to further advance these engineering capabilities. Once appropriate targets are identified, guide RNA prediction models can lead to increased efficacy in gene targeting. Developing improved models and incorporating approaches from machine learning may be able to overcome current limitations and greatly expand the capabilities of CRISPR-Cas9 tools for metabolic engineering.
Collapse
Affiliation(s)
- Ryan A L Cardiff
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, Washington 98195, United States
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
- Department of Chemical Engineering, University of Washington, Seattle, Washington 98195, United States
| | - James M Carothers
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, Washington 98195, United States
- Department of Chemical Engineering, University of Washington, Seattle, Washington 98195, United States
| | - Jesse G Zalatan
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, Washington 98195, United States
- Department of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Herbert M Sauro
- Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, Washington 98195, United States
- Department of Bioengineering, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
2
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
3
|
de Atauri P, Foguet C, Cascante M. Control analysis in the identification of key enzymes driving metabolic adaptations: Towards drug target discovery. Biosystems 2023; 231:104984. [PMID: 37506820 DOI: 10.1016/j.biosystems.2023.104984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/18/2023] [Accepted: 07/25/2023] [Indexed: 07/30/2023]
Abstract
Metabolic Control Analysis (MCA) marked a turning point in understanding the design principles of metabolic network control by establishing control coefficients as a means to quantify the degree of control that an enzyme exerts on flux or metabolite concentrations. MCA has demonstrated that control of metabolic pathways is distributed among many enzymes rather than depending on a single rate-limiting step. MCA also proved that this distribution depends not only on the stoichiometric structure of the network but also on other kinetic determinants, such as the degree of saturation of the enzyme active site, the distance to thermodynamic equilibrium, and metabolite feedback regulatory loops. Consequently, predicting the alterations that occur during metabolic adaptation in response to strong changes involving a redistribution in such control distribution can be challenging. Here, using the framework provided by MCA, we illustrate how control distribution in a metabolic pathway/network depends on enzyme kinetic determinants and to what extent the redistribution of control affects our predictions on candidate enzymes suitable as targets for small molecule inhibition in the drug discovery process. Our results uncover that kinetic determinants can lead to unexpected control distribution and outcomes that cannot be predicted solely from stoichiometric determinants. We also unveil that the inference of key enzyme-drivers of an observed metabolic adaptation can be dramatically improved using mean control coefficients and ruling out those enzyme activities that are associated with low control coefficients. As the use of constraint-based stoichiometric genome-scale metabolic models (GSMMs) becomes increasingly prevalent for identifying genes/enzymes that could be potential drug targets, we anticipate that incorporating kinetic determinants and ruling out enzymes with low control coefficients into GSMM workflows will facilitate more accurate predictions and reveal novel therapeutic targets.
Collapse
Affiliation(s)
- Pedro de Atauri
- Department of Biochemistry and Molecular Biomedicine & Institute of Biomedicine of Universitat de Barcelona, Faculty of Biology, Universitat de Barcelona, Barcelona, 08028, Spain; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Madrid, 28020, Spain.
| | - Carles Foguet
- British Heart Foundation Cardiovascular Epidemiology Unit and Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge, CB2 0BD, United Kingdom
| | - Marta Cascante
- Department of Biochemistry and Molecular Biomedicine & Institute of Biomedicine of Universitat de Barcelona, Faculty of Biology, Universitat de Barcelona, Barcelona, 08028, Spain; Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBEREHD), Instituto de Salud Carlos III (ISCIII), Madrid, 28020, Spain.
| |
Collapse
|
4
|
Huß S, Nikoloski Z. Systematic comparison of local approaches for isotopically nonstationary metabolic flux analysis. FRONTIERS IN PLANT SCIENCE 2023; 14:1178239. [PMID: 37346134 PMCID: PMC10280729 DOI: 10.3389/fpls.2023.1178239] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 05/04/2023] [Indexed: 06/23/2023]
Abstract
Quantification of reaction fluxes of metabolic networks can help us understand how the integration of different metabolic pathways determine cellular functions. Yet, intracellular fluxes cannot be measured directly but are estimated with metabolic flux analysis (MFA) that relies on the patterns of isotope labeling of metabolites in the network. For metabolic systems, typical for plants, where all potentially labeled atoms effectively have only one source atom pool, only isotopically nonstationary MFA can provide information about intracellular fluxes. There are several global approaches that implement MFA for an entire metabolic network and estimate, at once, a steady-state flux distribution for all reactions with identifiable fluxes in the network. In contrast, local approaches deal with estimation of fluxes for a subset of reactions, with smaller data demand for flux estimation. Here we present a systematic comparative review and benchmarking of the existing local approaches for isotopically nonstationary MFA. The comparison is conducted with respect to the required data and underlying computational problems solved on a synthetic network example. Furthermore, we benchmark the performance of these approaches in estimating fluxes for a subset of reactions using data obtained from the simulation of nitrogen fluxes in the Arabidopsis thaliana core metabolism. The findings pinpoint practical aspects that need to be considered when applying local approaches for flux estimation in large-scale plant metabolic networks.
Collapse
Affiliation(s)
- Sebastian Huß
- Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany
| |
Collapse
|
5
|
Wendering P, Nikoloski Z. Toward mechanistic modeling and rational engineering of plant respiration. PLANT PHYSIOLOGY 2023; 191:2150-2166. [PMID: 36721968 PMCID: PMC10069892 DOI: 10.1093/plphys/kiad054] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 01/23/2023] [Indexed: 06/18/2023]
Abstract
Plant respiration not only provides energy to support all cellular processes, including biomass production, but also plays a major role in the global carbon cycle. Therefore, modulation of plant respiration can be used to both increase the plant yield and mitigate the effects of global climate change. Mechanistic modeling of plant respiration at sufficient biochemical detail can provide key insights for rational engineering of this process. Yet, despite its importance, plant respiration has attracted considerably less modeling effort in comparison to photosynthesis. In this update review, we highlight the advances made in modeling of plant respiration, emphasizing the gradual but important change from phenomenological to models based on first principles. We also provide a detailed account of the existing resources that can contribute to resolving the challenges in modeling plant respiration. These resources point at tangible improvements in the representation of cellular processes that contribute to CO2 evolution and consideration of kinetic properties of underlying enzymes to facilitate mechanistic modeling. The update review emphasizes the need to couple biochemical models of respiration with models of acclimation and adaptation of respiration for their effective usage in guiding breeding efforts and improving terrestrial biosphere models tailored to future climate scenarios.
Collapse
Affiliation(s)
- Philipp Wendering
- Systems Biology and Mathematical Modeling, Max Planck Institute of Molecular Plant Physiology, 14476 Potsdam, Germany
| | - Zoran Nikoloski
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, 14476 Potsdam, Germany
| |
Collapse
|
6
|
Tamura T. Trimming Gene Deletion Strategies for Growth-Coupled Production in Constraint-Based Metabolic Networks: TrimGdel. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1540-1549. [PMID: 35731759 DOI: 10.1109/tcbb.2022.3185221] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
When simulating genome-scale metabolite production using constraint-based metabolic networks, it is often necessary to find gene deletion strategies which lead to growth-coupled production, which means that target metabolites are produced when cell growth is maximized. Existing methods are effective when the number of gene deletions is relatively small, but when the number of required gene deletions exceeds approximately 1% of whole genes, the time required for the calculation is often unfeasible. Therefore, a complementing algorithm that is effective even when the required number of gene deletions is approximately 1% to 5% of whole genes would be helpful because the number of deletable genes in a strain is increasing with advances in genetic engineering technology. In this study, the author developed an algorithm, TrimGdel, which first computes a strategy with many gene deletions that results in growth-coupled production and then gradually reduces the number of gene deletions while ensuring the original production rate and growth rate. The results of the computer experiments showed that TrimGdel can calculate stoichiometrically feasible gene deletion strategies, especially those whose sizes are 1 to 5% of whole genes, which lead to growth-coupled production of many target metabolites, which include useful vitamins such as biotin and pantothenate, for which existing methods could not.
Collapse
|
7
|
Tamura T, Muto-Fujita A, Tohsato Y, Kosaka T. Gene Deletion Algorithms for Minimum Reaction Network Design by Mixed-Integer Linear Programming for Metabolite Production in Constraint-Based Models: gDel_minRN. J Comput Biol 2023; 30:553-568. [PMID: 36809057 DOI: 10.1089/cmb.2022.0352] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023] Open
Abstract
Genome-scale constraint-based metabolic networks play an important role in the simulation of growth-coupled production, which means that cell growth and target metabolite production are simultaneously achieved. For growth-coupled production, a minimal reaction-network-based design is known to be effective. However, the obtained reaction networks often fail to be realized by gene deletions due to conflicts with gene-protein-reaction (GPR) relations. Here, we developed gDel_minRN that determines gene deletion strategies using mixed-integer linear programming to achieve growth-coupled production by repressing the maximum number of reactions via GPR relations. The results of computational experiments showed that gDel_minRN could determine the core parts, which include only 30% to 55% of whole genes, for stoichiometrically feasible growth-coupled production for many target metabolites, which include useful vitamins such as biotin (vitamin B7), riboflavin (vitamin B2), and pantothenate (vitamin B5). Since gDel_minRN calculates a constraint-based model of the minimum number of gene-associated reactions without conflict with GPR relations, it helps biological analysis of the core parts essential for growth-coupled production for each target metabolite. The source codes, implemented in MATLAB using CPLEX and COBRA Toolbox, are available on https://github.com/MetNetComp/gDel-minRN.
Collapse
Affiliation(s)
- Takeyuki Tamura
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan
| | - Ai Muto-Fujita
- RIKEN Center for Biosystems Dynamics Research, Suita, Osaka, Japan
| | - Yukako Tohsato
- Faculty of Information Science and Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
| | - Tomoyuki Kosaka
- Research Center for Thermotolerant Microbial Resources (RCTMR), Yamaguchi University, Yoshida, Yamaguchi, Japan.,Graduate School of Sciences and Technology for Innovation, Yamaguchi University, Yoshida, Yamaguchi, Japan
| |
Collapse
|
8
|
Huß S, Judd RS, Koper K, Maeda HA, Nikoloski Z. An automated workflow that generates atom mappings for large-scale metabolic models and its application to Arabidopsis thaliana. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 111:1486-1500. [PMID: 35819300 DOI: 10.1111/tpj.15903] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 07/01/2022] [Accepted: 07/06/2022] [Indexed: 06/15/2023]
Abstract
Quantification of reaction fluxes of metabolic networks can help us understand how the integration of different metabolic pathways determines cellular functions. Yet, intracellular fluxes cannot be measured directly but are estimated with metabolic flux analysis (MFA), which relies on the patterns of isotope labeling of metabolites in the network. The application of MFA also requires a stoichiometric model with atom mappings that are currently not available for the majority of large-scale metabolic network models, particularly of plants. While automated approaches such as the Reaction Decoder Toolkit (RDT) can produce atom mappings for individual reactions, tracing the flow of individual atoms of the entire reactions across a metabolic model remains challenging. Here we establish an automated workflow to obtain reliable atom mappings for large-scale metabolic models by refining the outcome of RDT, and apply the workflow to metabolic models of Arabidopsis thaliana. We demonstrate the accuracy of RDT through a comparative analysis with atom mappings from a large database of biochemical reactions, MetaCyc. We further show the utility of our automated workflow by simulating 15 N isotope enrichment and identifying nitrogen (N)-containing metabolites which show enrichment patterns that are informative for flux estimation in future 15 N-MFA studies of A. thaliana. The automated workflow established in this study can be readily expanded to other species for which metabolic models have been established and the resulting atom mappings will facilitate MFA and graph-theoretic structural analyses with large-scale metabolic networks.
Collapse
Affiliation(s)
- Sebastian Huß
- Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Germany
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24- 25, 14476, Potsdam, Germany
| | - Rika Siedah Judd
- Department of Botany, University of Wisconsin-Madison, 430, Lincoln, Dr. Madison, Wisconsin, 53706, USA
| | - Kaan Koper
- Department of Botany, University of Wisconsin-Madison, 430, Lincoln, Dr. Madison, Wisconsin, 53706, USA
| | - Hiroshi A Maeda
- Department of Botany, University of Wisconsin-Madison, 430, Lincoln, Dr. Madison, Wisconsin, 53706, USA
| | - Zoran Nikoloski
- Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam, Germany
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24- 25, 14476, Potsdam, Germany
| |
Collapse
|
9
|
Relative flux trade-offs and optimization of metabolic network functionalities. Comput Struct Biotechnol J 2022; 20:3963-3971. [PMID: 35950188 PMCID: PMC9340536 DOI: 10.1016/j.csbj.2022.07.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 07/22/2022] [Accepted: 07/22/2022] [Indexed: 11/21/2022] Open
Abstract
Trade-offs between traits are present across different levels of biological systems and ultimately reflect constraints imposed by physicochemical laws and the structure of underlying biochemical networks. Yet, mechanistic explanation of how trade-offs between molecular traits arise and how they relate to optimization of fitness-related traits remains elusive. Here, we introduce the concept of relative flux trade-offs and propose a constraint-based approach, termed FluTOr, to identify metabolic reactions whose fluxes are in relative trade-off with respect to an optimized fitness-related cellular task, like growth. We then employed FluTOr to identify relative flux trade-offs in the genome-scale metabolic networks of Escherichia coli, Saccharomyces cerevisiae, and Arabidopsis thaliana. For the metabolic models of E. coli and S. cerevisiae we showed that: (i) the identified relative flux trade-offs depend on the carbon source used and that (ii) reactions that participated in relative trade-offs in both species were implicated in cofactor biosynthesis. In contrast to the two microorganisms, the relative flux trade-offs for the metabolic model of A. thaliana did not depend on the available nitrogen sources, reflecting the differences in the underlying metabolic network as well as the considered environments. Lastly, the established connection between relative flux trade-offs allowed us to identify overexpression targets that can be used to optimize fitness-related traits. Altogether, our computational approach and findings demonstrate how relative flux trade-offs can shape optimization of metabolic tasks, important in biotechnological applications.
Collapse
|
10
|
Characterization of effects of genetic variants via genome-scale metabolic modelling. Cell Mol Life Sci 2021; 78:5123-5138. [PMID: 33950314 PMCID: PMC8254712 DOI: 10.1007/s00018-021-03844-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Revised: 04/15/2021] [Accepted: 04/23/2021] [Indexed: 12/19/2022]
Abstract
Genome-scale metabolic networks for model plants and crops in combination with approaches from the constraint-based modelling framework have been used to predict metabolic traits and design metabolic engineering strategies for their manipulation. With the advances in technologies to generate large-scale genotyping data from natural diversity panels and other populations, genome-wide association and genomic selection have emerged as statistical approaches to determine genetic variants associated with and predictive of traits. Here, we review recent advances in constraint-based approaches that integrate genetic variants in genome-scale metabolic models to characterize their effects on reaction fluxes. Since some of these approaches have been applied in organisms other than plants, we provide a critical assessment of their applicability particularly in crops. In addition, we further dissect the inferred effects of genetic variants with respect to reaction rate constants, abundances of enzymes, and concentrations of metabolites, as main determinants of reaction fluxes and relate them with their combined effects on complex traits, like growth. Through this systematic review, we also provide a roadmap for future research to increase the predictive power of statistical approaches by coupling them with mechanistic models of metabolism.
Collapse
|