1
|
Verma S, Magazzù G, Eftekhari N, Lou T, Gilhespy A, Occhipinti A, Angione C. Cross-attention enables deep learning on limited omics-imaging-clinical data of 130 lung cancer patients. CELL REPORTS METHODS 2024:100817. [PMID: 38981473 DOI: 10.1016/j.crmeth.2024.100817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 04/18/2024] [Accepted: 06/17/2024] [Indexed: 07/11/2024]
Abstract
Deep-learning tools that extract prognostic factors derived from multi-omics data have recently contributed to individualized predictions of survival outcomes. However, the limited size of integrated omics-imaging-clinical datasets poses challenges. Here, we propose two biologically interpretable and robust deep-learning architectures for survival prediction of non-small cell lung cancer (NSCLC) patients, learning simultaneously from computed tomography (CT) scan images, gene expression data, and clinical information. The proposed models integrate patient-specific clinical, transcriptomic, and imaging data and incorporate Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome pathway information, adding biological knowledge within the learning process to extract prognostic gene biomarkers and molecular pathways. While both models accurately stratify patients in high- and low-risk groups when trained on a dataset of only 130 patients, introducing a cross-attention mechanism in a sparse autoencoder significantly improves the performance, highlighting tumor regions and NSCLC-related genes as potential biomarkers and thus offering a significant methodological advancement when learning from small imaging-omics-clinical samples.
Collapse
Affiliation(s)
- Suraj Verma
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
| | | | | | - Thai Lou
- Gateshead Health NHS Foundation Trust, Gateshead, UK
| | - Alex Gilhespy
- South Tyneside and Sunderland NHS Foundation Trust, Sunderland, UK
| | - Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK.
| |
Collapse
|
2
|
Atasoy M, Scott WT, Regueira A, Mauricio-Iglesias M, Schaap PJ, Smidt H. Biobased short chain fatty acid production - Exploring microbial community dynamics and metabolic networks through kinetic and microbial modeling approaches. Biotechnol Adv 2024; 73:108363. [PMID: 38657743 DOI: 10.1016/j.biotechadv.2024.108363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 04/03/2024] [Accepted: 04/12/2024] [Indexed: 04/26/2024]
Abstract
In recent years, there has been growing interest in harnessing anaerobic digestion technology for resource recovery from waste streams. This approach has evolved beyond its traditional role in energy generation to encompass the production of valuable carboxylic acids, especially volatile fatty acids (VFAs) like acetic acid, propionic acid, and butyric acid. VFAs hold great potential for various industries and biobased applications due to their versatile properties. Despite increasing global demand, over 90% of VFAs are currently produced synthetically from petrochemicals. Realizing the potential of large-scale biobased VFA production from waste streams offers significant eco-friendly opportunities but comes with several key challenges. These include low VFA production yields, unstable acid compositions, complex and expensive purification methods, and post-processing needs. Among these, production yield and acid composition stand out as the most critical obstacles impacting economic viability and competitiveness. This paper seeks to offer a comprehensive view of combining complementary modeling approaches, including kinetic and microbial modeling, to understand the workings of microbial communities and metabolic pathways in VFA production, enhance production efficiency, and regulate acid profiles through the integration of omics and bioreactor data.
Collapse
Affiliation(s)
- Merve Atasoy
- UNLOCK, Wageningen University & Research and Delft University of Technology, Wageningen and Delft, the Netherlands; Department of Environmental Technology, Wageningen University & Research, Wageningen, the Netherlands; Laboratory of Microbiology, Wageningen University & Research, Wageningen, the Netherlands.
| | - William T Scott
- UNLOCK, Wageningen University & Research and Delft University of Technology, Wageningen and Delft, the Netherlands; Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, the Netherlands.
| | - Alberte Regueira
- CRETUS, Department of Chemical Engineering, Universidade de Santiago de Compostela, Santiago de Compostela, Spain; Center for Microbial Ecology and Technology (CMET), Ghent University, Ghent, Belgium; Center for Advanced Process Technology for Urban Resource Recovery (CAPTURE), Frieda Saeysstraat 1, Ghent, Belgium.
| | - Miguel Mauricio-Iglesias
- CRETUS, Department of Chemical Engineering, Universidade de Santiago de Compostela, Santiago de Compostela, Spain.
| | - Peter J Schaap
- UNLOCK, Wageningen University & Research and Delft University of Technology, Wageningen and Delft, the Netherlands; Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, the Netherlands.
| | - Hauke Smidt
- UNLOCK, Wageningen University & Research and Delft University of Technology, Wageningen and Delft, the Netherlands; Laboratory of Microbiology, Wageningen University & Research, Wageningen, the Netherlands.
| |
Collapse
|
3
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
4
|
Tarzi C, Zampieri G, Sullivan N, Angione C. Emerging methods for genome-scale metabolic modeling of microbial communities. Trends Endocrinol Metab 2024; 35:533-548. [PMID: 38575441 DOI: 10.1016/j.tem.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/06/2024]
Abstract
Genome-scale metabolic models (GEMs) are consolidating as platforms for studying mixed microbial populations, by combining biological data and knowledge with mathematical rigor. However, deploying these models to answer research questions can be challenging due to the increasing number of available computational tools, the lack of universal standards, and their inherent limitations. Here, we present a comprehensive overview of foundational concepts for building and evaluating genome-scale models of microbial communities. We then compare tools in terms of requirements, capabilities, and applications. Next, we highlight the current pitfalls and open challenges to consider when adopting existing tools and developing new ones. Our compendium can be relevant for the expanding community of modelers, both at the entry and experienced levels.
Collapse
Affiliation(s)
- Chaimaa Tarzi
- School of Computing, Engineering and Digital Technologies, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK
| | - Guido Zampieri
- Department of Biology, University of Padova, Padova, 35122, Veneto, Italy
| | - Neil Sullivan
- Complement Genomics Ltd, Station Rd, Lanchester, Durham, DH7 0EX, County Durham, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK; Centre for Digital Innovation, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK; National Horizons Centre, Teesside University, 38 John Dixon Ln, Darlington, DL1 1HG, North Yorkshire, UK.
| |
Collapse
|
5
|
Goshisht MK. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS OMEGA 2024; 9:9921-9945. [PMID: 38463314 PMCID: PMC10918679 DOI: 10.1021/acsomega.3c05913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/19/2024] [Accepted: 01/30/2024] [Indexed: 03/12/2024]
Abstract
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
Collapse
Affiliation(s)
- Manoj Kumar Goshisht
- Department of Chemistry, Natural and
Applied Sciences, University of Wisconsin—Green
Bay, Green
Bay, Wisconsin 54311-7001, United States
| |
Collapse
|
6
|
Woo H, Kim Y, Kim D, Yoon SH. Machine learning identifies key metabolic reactions in bacterial growth on different carbon sources. Mol Syst Biol 2024; 20:170-186. [PMID: 38291231 PMCID: PMC10912204 DOI: 10.1038/s44320-024-00017-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 01/03/2024] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open
Abstract
Carbon source-dependent control of bacterial growth is fundamental to bacterial physiology and survival. However, pinpointing the metabolic steps important for cell growth is challenging due to the complexity of cellular networks. Here, the elastic net model and multilayer perception model that integrated genome-wide gene-deletion data and simulated flux distributions were constructed to identify metabolic reactions beneficial or detrimental to Escherichia coli grown on 30 different carbon sources. Both models outperformed traditional in silico methods by identifying not just essential reactions but also nonessential ones that promote growth. They successfully predicted metabolic reactions beneficial to cell growth, with high convergence between the models. The models revealed that biosynthetic pathways generally promote growth across various carbon sources, whereas the impact of energy-generating pathways varies with the carbon source. Intriguing predictions were experimentally validated for findings beyond experimental training data and the impact of various carbon sources on the glyoxylate shunt, pyruvate dehydrogenase reaction, and redundant purine biosynthesis reactions. These highlight the practical significance and predictive power of the models for understanding and engineering microbial metabolism.
Collapse
Affiliation(s)
- Hyunjae Woo
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Youngshin Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Dohyeon Kim
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea
| | - Sung Ho Yoon
- Department of Bioscience and Biotechnology, Konkuk University, Seoul, 05029, Republic of Korea.
| |
Collapse
|
7
|
Wu D, Xu F, Xu Y, Huang M, Li Z, Chu J. Towards a hybrid model-driven platform based on flux balance analysis and a machine learning pipeline for biosystem design. Synth Syst Biotechnol 2024; 9:33-42. [PMID: 38234412 PMCID: PMC10793177 DOI: 10.1016/j.synbio.2023.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 12/22/2023] [Accepted: 12/22/2023] [Indexed: 01/19/2024] Open
Abstract
Metabolic modeling and machine learning (ML) are crucial components of the evolving next-generation tools in systems and synthetic biology, aiming to unravel the intricate relationship between genotype, phenotype, and the environment. Nonetheless, the comprehensive exploration of integrating these two frameworks, and fully harnessing the potential of fluxomic data, remains an unexplored territory. In this study, we present, rigorously evaluate, and compare ML-based techniques for data integration. The hybrid model revealed that the overexpression of six target genes and the knockout of seven target genes contribute to enhanced ethanol production. Specifically, we investigated the influence of succinate dehydrogenase (SDH) on ethanol biosynthesis in Saccharomyces cerevisiae through shake flask experiments. The findings indicate a noticeable increase in ethanol yield, ranging from 6 % to 10 %, in SDH subunit gene knockout strains compared to the wild-type strain. Moreover, in pursuit of a high-yielding strain for ethanol production, dual-gene deletion experiments were conducted targeting glycerol-3-phosphate dehydrogenase (GPD) and SDH. The results unequivocally demonstrate significant enhancements in ethanol production for the engineered strains Δsdh4Δgpd1, Δsdh5Δgpd1, Δsdh6Δgpd1, Δsdh4Δgpd2, Δsdh5Δgpd2, and Δsdh6Δgpd2, with improvements of 21.6 %, 27.9 %, and 22.7 %, respectively. Overall, the results highlighted that integrating mechanistic flux features substantially improves the prediction of gene knockout strains not accounted for in metabolic reconstructions. In addition, the finding in this study delivers valuable tools for comprehending and manipulating intricate phenotypes, thereby enhancing prediction accuracy and facilitating deeper insights into mechanistic aspects within the field of synthetic biology.
Collapse
Affiliation(s)
| | | | - Yaying Xu
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, People's Republic of China
| | - Mingzhi Huang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, People's Republic of China
| | - Zhimin Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, People's Republic of China
| | - Ju Chu
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, People's Republic of China
| |
Collapse
|
8
|
Occhipinti A, Verma S, Doan LMT, Angione C. Mechanism-aware and multimodal AI: beyond model-agnostic interpretation. Trends Cell Biol 2024; 34:85-89. [PMID: 38087709 DOI: 10.1016/j.tcb.2023.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 11/03/2023] [Accepted: 11/07/2023] [Indexed: 02/04/2024]
Abstract
Artificial intelligence (AI) is widely used for exploiting multimodal biomedical data, with increasingly accurate predictions and model-agnostic interpretations, which are however also agnostic to biological mechanisms. Combining metabolic modelling, 'omics, and imaging data via multimodal AI can generate predictions that can be interpreted mechanistically and transparently, therefore with significantly higher therapeutic potential.
Collapse
Affiliation(s)
- Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesborough, UK; Centre for Digital Innovation, Teesside University, Middlesborough, UK; National Horizons Centre, Teesside University, Darlington, UK
| | - Suraj Verma
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesborough, UK
| | - Le Minh Thao Doan
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesborough, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesborough, UK; Centre for Digital Innovation, Teesside University, Middlesborough, UK; National Horizons Centre, Teesside University, Darlington, UK.
| |
Collapse
|
9
|
Muller E, Shiryan I, Borenstein E. Multi-omic integration of microbiome data for identifying disease-associated modules. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.03.547607. [PMID: 37461534 PMCID: PMC10349976 DOI: 10.1101/2023.07.03.547607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
The human gut microbiome is a complex ecosystem with profound implications for health and disease. This recognition has led to a surge in multi-omic microbiome studies, employing various molecular assays to elucidate the microbiome's role in diseases across multiple functional layers. However, despite the clear value of these multi-omic datasets, rigorous integrative analysis of such data poses significant challenges, hindering a comprehensive understanding of microbiome-disease interactions. Perhaps most notably, multiple approaches, including univariate and multivariate analyses, as well as machine learning, have been applied to such data to identify disease-associated markers, namely, specific features (e.g., species, pathways, metabolites) that are significantly altered in disease state. These methods, however, often yield extensive lists of features associated with the disease without effectively capturing the multi-layered structure of multi-omic data or offering clear, interpretable hypotheses about underlying microbiome-disease mechanisms. Here, we address this challenge by introducing MintTea - an intermediate integration-based method for analyzing multi-omic microbiome data. MintTea combines a canonical correlation analysis (CCA) extension, consensus analysis, and an evaluation protocol to robustly identify disease-associated multi-omic modules. Each such module consists of a set of features from the various omics that both shift in concord, and collectively associate with the disease. Applying MintTea to diverse case-control cohorts with multi-omic data, we show that this framework is able to capture modules with high predictive power for disease, significant cross-omic correlations, and alignment with known microbiome-disease associations. For example, analyzing samples from a metabolic syndrome (MS) study, we found a MS-associated module comprising of a highly correlated cluster of serum glutamate- and TCA cycle-related metabolites, as well as bacterial species previously implicated in insulin resistance. In another cohort, we identified a module associated with late-stage colorectal cancer, featuring Peptostreptococcus and Gemella species and several fecal amino acids, in agreement with these species' reported role in the metabolism of these amino acids and their coordinated increase in abundance during disease development. Finally, comparing modules identified in different datasets, we detected multiple significant overlaps, suggesting common interactions between microbiome features. Combined, this work serves as a proof of concept for the potential benefits of advanced integration methods in generating integrated multi-omic hypotheses underlying microbiome-disease interactions and a promising avenue for researchers seeking systems-level insights into coherent mechanisms governing microbiome-related diseases.
Collapse
|
10
|
Gaikani HK, Stolar M, Kriti D, Nislow C, Giaever G. From beer to breadboards: yeast as a force for biological innovation. Genome Biol 2024; 25:10. [PMID: 38178179 PMCID: PMC10768129 DOI: 10.1186/s13059-023-03156-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 12/21/2023] [Indexed: 01/06/2024] Open
Abstract
The history of yeast Saccharomyces cerevisiae, aka brewer's or baker's yeast, is intertwined with our own. Initially domesticated 8,000 years ago to provide sustenance to our ancestors, for the past 150 years, yeast has served as a model research subject and a platform for technology. In this review, we highlight many ways in which yeast has served to catalyze the fields of functional genomics, genome editing, gene-environment interaction investigation, proteomics, and bioinformatics-emphasizing how yeast has served as a catalyst for innovation. Several possible futures for this model organism in synthetic biology, drug personalization, and multi-omics research are also presented.
Collapse
Affiliation(s)
- Hamid Kian Gaikani
- Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Monika Stolar
- Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Divya Kriti
- Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Corey Nislow
- Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada.
| | - Guri Giaever
- Faculty of Pharmaceutical Sciences, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
11
|
Shimazaki S, Yamada R, Yamamoto Y, Matsumoto T, Ogino H. Building a machine-learning model to predict optimal mevalonate pathway gene expression levels for efficient production of a carotenoid in yeast. Biotechnol J 2024; 19:e2300285. [PMID: 37953664 DOI: 10.1002/biot.202300285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 11/06/2023] [Accepted: 11/08/2023] [Indexed: 11/14/2023]
Abstract
Simultaneous modification of the expression levels of many metabolic enzyme genes results in diverse expression ratios of these genes; however, the relationship between gene expression levels and chemical productivity remains unclear. However, clarification of this relationship is expected to improve the productivity of useful chemicals. Supervised machine learning is considered to be an effective means to clarify this relationship. In this study, to improve the productivity of carotenoids in yeast Saccharomyces cerevisiae, we aimed to build a machine-learning model that can predict the optimal gene expression level for carotenoid production. First, we obtained data on the expression levels of mevalonate pathway enzyme genes and carotenoid production. Then, based on these data, we built a machine-learning model to predict carotenoid productivity based on gene expression levels. The prediction accuracy of 0.6292 (coefficient of determination) was achieved using the test data. The maximum predicted carotenoid productivity was 4.3 times higher in the engineered strain than in the parental strain, suggesting that the expression levels of the mevalonate pathway enzyme genes tHMG1 and ERG8 have a particularly large impact on carotenoid productivity. This study could be one of the important achievements in addressing the uncertainty of genotype-phenotype correlations, which is one of the challenges facing metabolic engineering strategies.
Collapse
Affiliation(s)
- Shun Shimazaki
- Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan
| | - Ryosuke Yamada
- Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan
| | - Yoshiki Yamamoto
- Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan
| | - Takuya Matsumoto
- Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan
| | - Hiroyasu Ogino
- Department of Chemical Engineering, Osaka Metropolitan University, Sakai, Osaka, Japan
| |
Collapse
|
12
|
Parthiban S, Vijeesh T, Gayathri T, Shanmugaraj B, Sharma A, Sathishkumar R. Artificial intelligence-driven systems engineering for next-generation plant-derived biopharmaceuticals. FRONTIERS IN PLANT SCIENCE 2023; 14:1252166. [PMID: 38034587 PMCID: PMC10684705 DOI: 10.3389/fpls.2023.1252166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 10/17/2023] [Indexed: 12/02/2023]
Abstract
Recombinant biopharmaceuticals including antigens, antibodies, hormones, cytokines, single-chain variable fragments, and peptides have been used as vaccines, diagnostics and therapeutics. Plant molecular pharming is a robust platform that uses plants as an expression system to produce simple and complex recombinant biopharmaceuticals on a large scale. Plant system has several advantages over other host systems such as humanized expression, glycosylation, scalability, reduced risk of human or animal pathogenic contaminants, rapid and cost-effective production. Despite many advantages, the expression of recombinant proteins in plant system is hindered by some factors such as non-human post-translational modifications, protein misfolding, conformation changes and instability. Artificial intelligence (AI) plays a vital role in various fields of biotechnology and in the aspect of plant molecular pharming, a significant increase in yield and stability can be achieved with the intervention of AI-based multi-approach to overcome the hindrance factors. Current limitations of plant-based recombinant biopharmaceutical production can be circumvented with the aid of synthetic biology tools and AI algorithms in plant-based glycan engineering for protein folding, stability, viability, catalytic activity and organelle targeting. The AI models, including but not limited to, neural network, support vector machines, linear regression, Gaussian process and regressor ensemble, work by predicting the training and experimental data sets to design and validate the protein structures thereby optimizing properties such as thermostability, catalytic activity, antibody affinity, and protein folding. This review focuses on, integrating systems engineering approaches and AI-based machine learning and deep learning algorithms in protein engineering and host engineering to augment protein production in plant systems to meet the ever-expanding therapeutics market.
Collapse
Affiliation(s)
- Subramanian Parthiban
- Plant Genetic Engineering Laboratory, Department of Biotechnology, Bharathiar University, Coimbatore, India
| | - Thandarvalli Vijeesh
- Plant Genetic Engineering Laboratory, Department of Biotechnology, Bharathiar University, Coimbatore, India
| | - Thashanamoorthi Gayathri
- Plant Genetic Engineering Laboratory, Department of Biotechnology, Bharathiar University, Coimbatore, India
| | - Balamurugan Shanmugaraj
- Plant Genetic Engineering Laboratory, Department of Biotechnology, Bharathiar University, Coimbatore, India
| | - Ashutosh Sharma
- Tecnologico de Monterrey, School of Engineering and Sciences, Centre of Bioengineering, Queretaro, Mexico
| | - Ramalingam Sathishkumar
- Plant Genetic Engineering Laboratory, Department of Biotechnology, Bharathiar University, Coimbatore, India
| |
Collapse
|
13
|
Nambiar A, Dubinkina V, Liu S, Maslov S. FUN-PROSE: A deep learning approach to predict condition-specific gene expression in fungi. PLoS Comput Biol 2023; 19:e1011563. [PMID: 37971967 PMCID: PMC10653424 DOI: 10.1371/journal.pcbi.1011563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Accepted: 09/30/2023] [Indexed: 11/19/2023] Open
Abstract
mRNA levels of all genes in a genome is a critical piece of information defining the overall state of the cell in a given environmental condition. Being able to reconstruct such condition-specific expression in fungal genomes is particularly important to metabolically engineer these organisms to produce desired chemicals in industrially scalable conditions. Most previous deep learning approaches focused on predicting the average expression levels of a gene based on its promoter sequence, ignoring its variation across different conditions. Here we present FUN-PROSE-a deep learning model trained to predict differential expression of individual genes across various conditions using their promoter sequences and expression levels of all transcription factors. We train and test our model on three fungal species and get the correlation between predicted and observed condition-specific gene expression as high as 0.85. We then interpret our model to extract promoter sequence motifs responsible for variable expression of individual genes. We also carried out input feature importance analysis to connect individual transcription factors to their gene targets. A sizeable fraction of both sequence motifs and TF-gene interactions learned by our model agree with previously known biological information, while the rest corresponds to either novel biological facts or indirect correlations.
Collapse
Affiliation(s)
- Ananthan Nambiar
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
- Carl R. Woese Institute for Genomic Biology, Urbana, Illinois, United States of America
| | - Veronika Dubinkina
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
- Carl R. Woese Institute for Genomic Biology, Urbana, Illinois, United States of America
- The Gladstone Institute of Data Science and Biotechnology, San Francisco, California, United States of America
| | - Simon Liu
- Carl R. Woese Institute for Genomic Biology, Urbana, Illinois, United States of America
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
| | - Sergei Maslov
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
- Carl R. Woese Institute for Genomic Biology, Urbana, Illinois, United States of America
- Department of Physics, University of Illinois Urbana-Champaign, Urbana, Illinois, United States of America
- Computing, Environment and Life Sciences, Argonne National Laboratory, Lemont, Illinois, United States of America
| |
Collapse
|
14
|
Gonçalves DM, Henriques R, Costa RS. Predicting metabolic fluxes from omics data via machine learning: Moving from knowledge-driven towards data-driven approaches. Comput Struct Biotechnol J 2023; 21:4960-4973. [PMID: 37876626 PMCID: PMC10590844 DOI: 10.1016/j.csbj.2023.10.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 10/01/2023] [Accepted: 10/01/2023] [Indexed: 10/26/2023] Open
Abstract
The accurate prediction of phenotypes in microorganisms is a main challenge for systems biology. Genome-scale models (GEMs) are a widely used mathematical formalism for predicting metabolic fluxes using constraint-based modeling methods such as flux balance analysis (FBA). However, they require prior knowledge of the metabolic network of an organism and appropriate objective functions, often hampering the prediction of metabolic fluxes under different conditions. Moreover, the integration of omics data to improve the accuracy of phenotype predictions in different physiological states is still in its infancy. Here, we present a novel approach for predicting fluxes under various conditions. We explore the use of supervised machine learning (ML) models using transcriptomics and/or proteomics data and compare their performance against the standard parsimonious FBA (pFBA) approach using case studies of Escherichia coli organism as an example. Our results show that the proposed omics-based ML approach is promising to predict both internal and external metabolic fluxes with smaller prediction errors in comparison to the pFBA approach. The code, data, and detailed results are available at the project's repository[1].
Collapse
Affiliation(s)
- Daniel M. Gonçalves
- INESC-ID, Rua Alves Redol, 9, Lisbon, 1000-029, Portugal
- Instituto Superior Técnico, Av. Rovisco Pais, 1, Lisbon, 1049-001, Portugal
- LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade NOVA de Lisboa, Caparica, 2829-516, Portugal
| | - Rui Henriques
- INESC-ID, Rua Alves Redol, 9, Lisbon, 1000-029, Portugal
- Instituto Superior Técnico, Av. Rovisco Pais, 1, Lisbon, 1049-001, Portugal
| | - Rafael S. Costa
- LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade NOVA de Lisboa, Caparica, 2829-516, Portugal
| |
Collapse
|
15
|
Liyanaarachchi VC, Nishshanka GKSH, Nimarshana PHV, Chang JS, Ariyadasa TU, Nagarajan D. Modeling of astaxanthin biosynthesis via machine learning, mathematical and metabolic network modeling. Crit Rev Biotechnol 2023:1-22. [PMID: 37587012 DOI: 10.1080/07388551.2023.2237183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/04/2023] [Accepted: 06/17/2023] [Indexed: 08/18/2023]
Abstract
Natural astaxanthin is synthesized by diverse organisms including: bacteria, fungi, microalgae, and plants involving complex cellular processes, which depend on numerous interrelated parameters. Nonetheless, existing knowledge regarding astaxanthin biosynthesis and the conditions influencing astaxanthin accumulation is fairly limited. Thus, manipulation of the growth conditions to achieve desired biomass and astaxanthin yields can be a complicated process requiring cost-intensive and time-consuming experiment-based research. As a potential solution, modeling and simulation of biological systems have recently emerged, allowing researchers to predict/estimate astaxanthin production dynamics in selected organisms. Moreover, mathematical modeling techniques would enable further optimization of astaxanthin synthesis in a shorter period of time, ultimately contributing to a notable reduction in production costs. Thus, the present review comprehensively discusses existing mathematical modeling techniques which simulate the bioaccumulation of astaxanthin in diverse organisms. Associated challenges, solutions, and future perspectives are critically analyzed and presented.
Collapse
Affiliation(s)
| | | | - P H Viraj Nimarshana
- Department of Mechanical Engineering, Faculty of Engineering, University of Moratuwa, Moratuwa, Sri Lanka
| | - Jo-Shu Chang
- Department of Chemical Engineering, National Cheng Kung University, Tainan, Taiwan
- Department of Chemical and Materials Engineering, Tunghai University, Taichung, Taiwan
- Research Center for Smart Sustainable Circular Economy, Tunghai University, Taichung, Taiwan
- Department of Chemical Engineering and Materials Science, Yuan Ze University, Chung-Li, Taiwan
| | - Thilini U Ariyadasa
- Department of Chemical and Process Engineering, Faculty of Engineering, University of Moratuwa, Moratuwa, Sri Lanka
| | - Dillirani Nagarajan
- Department of Chemical Engineering, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
16
|
Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023; 47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open
Abstract
When selecting microbial strains for the production of fermented foods, various microbial phenotypes need to be taken into account to achieve target product characteristics, such as biosafety, flavor, texture, and health-promoting effects. Through continuous advances in sequencing technologies, microbial whole-genome sequences of increasing quality can now be obtained both cheaper and faster, which increases the relevance of genome-based characterization of microbial phenotypes. Prediction of microbial phenotypes from genome sequences makes it possible to quickly screen large strain collections in silico to identify candidates with desirable traits. Several microbial phenotypes relevant to the production of fermented foods can be predicted using knowledge-based approaches, leveraging our existing understanding of the genetic and molecular mechanisms underlying those phenotypes. In the absence of this knowledge, data-driven approaches can be applied to estimate genotype-phenotype relationships based on large experimental datasets. Here, we review computational methods that implement knowledge- and data-driven approaches for phenotype prediction, as well as methods that combine elements from both approaches. Furthermore, we provide examples of how these methods have been applied in industrial biotechnology, with special focus on the fermented food industry.
Collapse
Affiliation(s)
- Signe T Karlsen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Martin H Rau
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Benjamín J Sánchez
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Kristian Jensen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Ahmad A Zeidan
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| |
Collapse
|
17
|
Chicco D, Cumbo F, Angione C. Ten quick tips for avoiding pitfalls in multi-omics data integration analyses. PLoS Comput Biol 2023; 19:e1011224. [PMID: 37410704 DOI: 10.1371/journal.pcbi.1011224] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/08/2023] Open
Abstract
Data are the most important elements of bioinformatics: Computational analysis of bioinformatics data, in fact, can help researchers infer new knowledge about biology, chemistry, biophysics, and sometimes even medicine, influencing treatments and therapies for patients. Bioinformatics and high-throughput biological data coming from different sources can even be more helpful, because each of these different data chunks can provide alternative, complementary information about a specific biological phenomenon, similar to multiple photos of the same subject taken from different angles. In this context, the integration of bioinformatics and high-throughput biological data gets a pivotal role in running a successful bioinformatics study. In the last decades, data originating from proteomics, metabolomics, metagenomics, phenomics, transcriptomics, and epigenomics have been labelled -omics data, as a unique name to refer to them, and the integration of these omics data has gained importance in all biological areas. Even if this omics data integration is useful and relevant, due to its heterogeneity, it is not uncommon to make mistakes during the integration phases. We therefore decided to present these ten quick tips to perform an omics data integration correctly, avoiding common mistakes we experienced or noticed in published studies in the past. Even if we designed our ten guidelines for beginners, by using a simple language that (we hope) can be understood by anyone, we believe our ten recommendations should be taken into account by all the bioinformaticians performing omics data integration, including experts.
Collapse
Affiliation(s)
- Davide Chicco
- Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - Fabio Cumbo
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Claudio Angione
- School of Computing Engineering and Digital Technologies, Teesside University, Middlesbrough, United Kingdom
| |
Collapse
|
18
|
Wang X, Mohsin A, Sun Y, Li C, Zhuang Y, Wang G. From Spatial-Temporal Multiscale Modeling to Application: Bridging the Valley of Death in Industrial Biotechnology. Bioengineering (Basel) 2023; 10:744. [PMID: 37370675 DOI: 10.3390/bioengineering10060744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 06/13/2023] [Accepted: 06/19/2023] [Indexed: 06/29/2023] Open
Abstract
The Valley of Death confronts industrial biotechnology with a significant challenge to the commercialization of products. Fortunately, with the integration of computation, automation and artificial intelligence (AI) technology, the industrial biotechnology accelerates to cross the Valley of Death. The Fourth Industrial Revolution (Industry 4.0) has spurred advanced development of intelligent biomanufacturing, which has evolved the industrial structures in line with the worldwide trend. To achieve this, intelligent biomanufacturing can be structured into three main parts that comprise digitalization, modeling and intellectualization, with modeling forming a crucial link between the other two components. This paper provides an overview of mechanistic models, data-driven models and their applications in bioprocess development. We provide a detailed elaboration of the hybrid model and its applications in bioprocess engineering, including strain design, process control and optimization, as well as bioreactor scale-up. Finally, the challenges and opportunities of biomanufacturing towards Industry 4.0 are also discussed.
Collapse
Affiliation(s)
- Xueting Wang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology (ECUST), Shanghai 200237, China
| | - Ali Mohsin
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology (ECUST), Shanghai 200237, China
| | - Yifei Sun
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology (ECUST), Shanghai 200237, China
| | - Chao Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology (ECUST), Shanghai 200237, China
| | - Yingping Zhuang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology (ECUST), Shanghai 200237, China
| | - Guan Wang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology (ECUST), Shanghai 200237, China
| |
Collapse
|
19
|
Park H, Faulkner M, Toogood HS, Chen GQ, Scrutton N. Online Omics Platform Expedites Industrial Application of Halomonas bluephagenesis TD1.0. Bioinform Biol Insights 2023; 17:11779322231171779. [PMID: 37200674 PMCID: PMC10185862 DOI: 10.1177/11779322231171779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 04/07/2023] [Indexed: 05/20/2023] Open
Abstract
Multi-omic data mining has the potential to revolutionize synthetic biology especially in non-model organisms that have not been extensively studied. However, tangible engineering direction from computational analysis remains elusive due to the interpretability of large datasets and the difficulty in analysis for non-experts. New omics data are generated faster than our ability to use and analyse results effectively, resulting in strain development that proceeds through classic methods of trial-and-error without insight into complex cell dynamics. Here we introduce a user-friendly, interactive website hosting multi-omics data. Importantly, this new platform allows non-experts to explore questions in an industrially important chassis whose cellular dynamics are still largely unknown. The web platform contains a complete KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis derived from principal components analysis, an interactive bio-cluster heatmap analysis of genes, and the Halomonas TD1.0 genome-scale metabolic (GEM) model. As a case study of the effectiveness of this platform, we applied unsupervised machine learning to determine key differences between Halomonas bluephagenesis TD1.0 cultivated under varied conditions. Specifically, cell motility and flagella apparatus are identified to drive energy expenditure usage at different osmolarities, and predictions were verified experimentally using microscopy and fluorescence labelled flagella staining. As more omics projects are completed, this landing page will facilitate exploration and targeted engineering efforts of the robust, industrial chassis H bluephagenesis for researchers without extensive bioinformatics background.
Collapse
Affiliation(s)
- Helen Park
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
- Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, China
| | - Matthew Faulkner
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| | - Helen S Toogood
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| | - Guo-Qiang Chen
- Center for Synthetic and Systems Biology, School of Life Sciences, Tsinghua-Peking Center for Life Sciences, Tsinghua University, Beijing, China
| | - Nigel Scrutton
- EPSRC/BBSRC Future Biomanufacturing Research Hub and BBSRC Synthetic Biology Research Centre SYNBIOCHEM, Manchester Institute of Biotechnology and Department of Chemistry, The University of Manchester, Manchester, UK
| |
Collapse
|
20
|
Angione C, Wang H, Burtt N. Editorial: Artificial intelligence for data discovery and reuse in endocrinology and metabolism. Front Endocrinol (Lausanne) 2023; 14:1180254. [PMID: 37214239 PMCID: PMC10196622 DOI: 10.3389/fendo.2023.1180254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Accepted: 04/18/2023] [Indexed: 05/24/2023] Open
Affiliation(s)
- Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, North Yorkshire, United Kingdom
- Centre for Digital Innovation, Teesside University, North Yorkshire, United Kingdom
- National Horizons Centre, Teesside University, North Yorkshire, United Kingdom
| | - Huajin Wang
- Open Science & Data Collaborations, Carnegie Mellon University, Pittsburgh, PA, United States
| | - Noël Burtt
- Medical and Population Genetics Program and Metabolism Program, The Broad Institute of MIT and Harvard, Cambridge, MA, United States
| |
Collapse
|
21
|
Sopic M, Robinson EL, Emanueli C, Srivastava P, Angione C, Gaetano C, Condorelli G, Martelli F, Pedrazzini T, Devaux Y. Integration of epigenetic regulatory mechanisms in heart failure. Basic Res Cardiol 2023; 118:16. [PMID: 37140699 PMCID: PMC10158703 DOI: 10.1007/s00395-023-00986-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/27/2023] [Accepted: 04/10/2023] [Indexed: 05/05/2023]
Abstract
The number of "omics" approaches is continuously growing. Among others, epigenetics has appeared as an attractive area of investigation by the cardiovascular research community, notably considering its association with disease development. Complex diseases such as cardiovascular diseases have to be tackled using methods integrating different omics levels, so called "multi-omics" approaches. These approaches combine and co-analyze different levels of disease regulation. In this review, we present and discuss the role of epigenetic mechanisms in regulating gene expression and provide an integrated view of how these mechanisms are interlinked and regulate the development of cardiac disease, with a particular attention to heart failure. We focus on DNA, histone, and RNA modifications, and discuss the current methods and tools used for data integration and analysis. Enhancing the knowledge of these regulatory mechanisms may lead to novel therapeutic approaches and biomarkers for precision healthcare and improved clinical outcomes.
Collapse
Affiliation(s)
- Miron Sopic
- Department of Medical Biochemistry, Faculty of Pharmacy, University of Belgrade, Belgrade, Serbia
| | - Emma L Robinson
- Division of Cardiology, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Costanza Emanueli
- National Heart & Lung Institute, Imperial College London, London, UK
| | | | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Tees Valley, Middlesbrough, TS1 3BA, UK
- Centre for Digital Innovation, Teesside University, Campus Heart, Tees Valley, Middlesbrough, TS1 3BX, UK
- National Horizons Centre, Darlington, DL1 1HG, UK
| | - Carlo Gaetano
- Laboratorio di Epigenetica, Istituti Clinici Scientifici Maugeri IRCCS, Via Maugeri 10, 27100, Pavia, Italy
| | - Gianluigi Condorelli
- IRCCS-Humanitas Research Hospital, Via Manzoni 56, 20089, Rozzano, MI, Italy
- Institute of Genetic and Biomedical Research, National Research Council of Italy, Arnold-Heller-Str.3, 24105, Milan, Italy
| | - Fabio Martelli
- Molecular Cardiology Laboratory, IRCCS-Policlinico San Donato, Via Morandi 30, San Donato Milanese, 20097, Milan, Italy
| | - Thierry Pedrazzini
- Experimental Cardiology Unit, Division of Cardiology, Department of Cardiovascular Medicine, University of Lausanne Medical School, 1011, Lausanne, Switzerland
| | - Yvan Devaux
- Cardiovascular Research Unit, Department of Population Health, Luxembourg Institute of Health, L-1445, Strassen, Luxembourg.
| |
Collapse
|
22
|
Multi-dimensional experimental and computational exploration of metabolism pinpoints complex probiotic interactions. Metab Eng 2023; 76:120-132. [PMID: 36720400 DOI: 10.1016/j.ymben.2023.01.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 12/13/2022] [Accepted: 01/21/2023] [Indexed: 01/29/2023]
Abstract
Multi-strain probiotics are widely regarded as effective products for improving gut microbiota stability and host health, providing advantages over single-strain probiotics. However, in general, it is unclear to what extent different strains would cooperate or compete for resources, and how the establishment of a common biofilm microenvironment could influence their interactions. In this work, we develop an integrative experimental and computational approach to comprehensively assess the metabolic functionality and interactions of probiotics across growth conditions. Our approach combines co-culture assays with genome-scale modelling of metabolism and multivariate data analysis, thus exploiting complementary data- and knowledge-driven systems biology techniques. To show the advantages of the proposed approach, we apply it to the study of the interactions between two widely used probiotic strains of Lactobacillus reuteri and Saccharomyces boulardii, characterising their production potential for compounds that can be beneficial to human health. Our results show that these strains can establish a mixed cooperative-antagonistic interaction best explained by competition for shared resources, with an increased individual exchange but an often decreased net production of amino acids and short-chain fatty acids. Overall, our work provides a strategy that can be used to explore microbial metabolic fingerprints of biotechnological interest, capable of capturing multifaceted equilibria even in simple microbial consortia.
Collapse
|
23
|
Ting TY, Li Y, Bunawan H, Ramzi AB, Goh HH. Current advancements in systems and synthetic biology studies of Saccharomyces cerevisiae. J Biosci Bioeng 2023; 135:259-265. [PMID: 36803862 DOI: 10.1016/j.jbiosc.2023.01.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 01/03/2023] [Accepted: 01/26/2023] [Indexed: 02/18/2023]
Abstract
Saccharomyces cerevisiae has a long-standing history of biotechnological applications even before the dawn of modern biotechnology. The field is undergoing accelerated advancement with the recent systems and synthetic biology approaches. In this review, we highlight the recent findings in the field with a focus on omics studies of S. cerevisiae to investigate its stress tolerance in different industries. The latest advancements in S. cerevisiae systems and synthetic biology approaches for the development of genome-scale metabolic models (GEMs) and molecular tools such as multiplex Cas9, Cas12a, Cpf1, and Csy4 genome editing tools, modular expression cassette with optimal transcription factors, promoters, and terminator libraries as well as metabolic engineering. Omics data analysis is key to the identification of exploitable native genes/proteins/pathways in S. cerevisiae with the optimization of heterologous pathway implementation and fermentation conditions. Through systems and synthetic biology, various heterologous compound productions that require non-native biosynthetic pathways in a cell factory have been established via different strategies of metabolic engineering integrated with machine learning.
Collapse
Affiliation(s)
- Tiew-Yik Ting
- Institute of Systems Biology, University Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - YaDong Li
- Institute of Systems Biology, University Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hamidun Bunawan
- Institute of Systems Biology, University Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Ahmad Bazli Ramzi
- Institute of Systems Biology, University Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hoe-Han Goh
- Institute of Systems Biology, University Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia.
| |
Collapse
|
24
|
Maglietta R, Saccotelli L, Fanizza C, Telesca V, Dimauro G, Causio S, Lecci R, Federico I, Coppini G, Cipriano G, Carlucci R. Environmental variables and machine learning models to predict cetacean abundance in the Central-eastern Mediterranean Sea. Sci Rep 2023; 13:2600. [PMID: 36788321 PMCID: PMC9929343 DOI: 10.1038/s41598-023-29681-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 02/08/2023] [Indexed: 02/16/2023] Open
Abstract
Although the Mediterranean Sea is a crucial hotspot in marine biodiversity, it has been threatened by numerous anthropogenic pressures. As flagship species, Cetaceans are exposed to those anthropogenic impacts and global changes. Assessing their conservation status becomes strategic to set effective management plans. The aim of this paper is to understand the habitat requirements of cetaceans, exploiting the advantages of a machine-learning framework. To this end, 28 physical and biogeochemical variables were identified as environmental predictors related to the abundance of three odontocete species in the Northern Ionian Sea (Central-eastern Mediterranean Sea). In fact, habitat models were built using sighting data collected for striped dolphins Stenella coeruleoalba, common bottlenose dolphins Tursiops truncatus, and Risso's dolphins Grampus griseus between July 2009 and October 2021. Random Forest was a suitable machine learning algorithm for the cetacean abundance estimation. Nitrate, phytoplankton carbon biomass, temperature, and salinity were the most common influential predictors, followed by latitude, 3D-chlorophyll and density. The habitat models proposed here were validated using sighting data acquired during 2022 in the study area, confirming the good performance of the strategy. This study provides valuable information to support management decisions and conservation measures in the EU marine spatial planning context.
Collapse
Affiliation(s)
- Rosalia Maglietta
- Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, National Research Council, via Amendola 122/D-I, 70126, Bari, Italy.
| | - Leonardo Saccotelli
- Ocean Predictions and Applications Division, Centro Euro-Mediterraneo sui Cambiamenti Climatici, Lecce, Italy
| | - Carmelo Fanizza
- Jonian Dolphin Conservation, viale Virgilio 102, 74121, Taranto, Italy
| | - Vito Telesca
- School of Engineering, University of Basilicata, viale Ateneo Lucano 10, 85100, Potenza, Italy
| | - Giovanni Dimauro
- Department of Computer Science, University of Bari, via Orabona 4, 70125, Bari, Italy
| | - Salvatore Causio
- Ocean Predictions and Applications Division, Centro Euro-Mediterraneo sui Cambiamenti Climatici, Lecce, Italy
| | - Rita Lecci
- Ocean Predictions and Applications Division, Centro Euro-Mediterraneo sui Cambiamenti Climatici, Lecce, Italy
| | - Ivan Federico
- Ocean Predictions and Applications Division, Centro Euro-Mediterraneo sui Cambiamenti Climatici, Lecce, Italy
| | - Giovanni Coppini
- Ocean Predictions and Applications Division, Centro Euro-Mediterraneo sui Cambiamenti Climatici, Lecce, Italy
| | - Giulia Cipriano
- Department of Biology, University of Bari, via Orabona 4, 70125, Bari, Italy
| | - Roberto Carlucci
- Department of Biology, University of Bari, via Orabona 4, 70125, Bari, Italy
| |
Collapse
|
25
|
Contributions of Adaptive Laboratory Evolution towards the Enhancement of the Biotechnological Potential of Non-Conventional Yeast Species. J Fungi (Basel) 2023; 9:jof9020186. [PMID: 36836301 PMCID: PMC9964053 DOI: 10.3390/jof9020186] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 01/19/2023] [Accepted: 01/29/2023] [Indexed: 02/04/2023] Open
Abstract
Changes in biological properties over several generations, induced by controlling short-term evolutionary processes in the laboratory through selective pressure, and whole-genome re-sequencing, help determine the genetic basis of microorganism's adaptive laboratory evolution (ALE). Due to the versatility of this technique and the imminent urgency for alternatives to petroleum-based strategies, ALE has been actively conducted for several yeasts, primarily using the conventional species Saccharomyces cerevisiae, but also non-conventional yeasts. As a hot topic at the moment since genetically modified organisms are a debatable subject and a global consensus on their employment has not yet been attained, a panoply of new studies employing ALE approaches have emerged and many different applications have been exploited in this context. In the present review, we gathered, for the first time, relevant studies showing the ALE of non-conventional yeast species towards their biotechnological improvement, cataloging them according to the aim of the study, and comparing them considering the species used, the outcome of the experiment, and the employed methodology. This review sheds light on the applicability of ALE as a powerful tool to enhance species features and improve their performance in biotechnology, with emphasis on the non-conventional yeast species, as an alternative or in combination with genome editing approaches.
Collapse
|
26
|
Yamamoto Y, Yamada R, Matsumoto T, Ogino H. Construction of a machine-learning model to predict the optimal gene expression level for efficient production of D-lactic acid in yeast. World J Microbiol Biotechnol 2023; 39:69. [PMID: 36607503 DOI: 10.1007/s11274-022-03515-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/30/2022] [Indexed: 01/07/2023]
Abstract
The modification of gene expression is being researched in the production of useful chemicals by metabolic engineering of the yeast Saccharomyces cerevisiae. When the expression levels of many metabolic enzyme genes are modified simultaneously, the expression ratio of these genes becomes diverse; the relationship between the gene expression ratio and chemical productivity remains unclear. In other words, it is challenging to predict phenotypes from genotypes. However, the productivity of useful chemicals can be improved if this relationship is clarified. In this study, we aimed to construct a machine-learning model that can be used to clarify the relationship between gene expression levels and D-lactic acid productivity and predict the optimal gene expression level for efficient D-lactic acid production in yeast. A machine-learning model was constructed using data on D-lactate dehydrogenase and glycolytic genes expression (13 dimensions) and D-lactic acid productivity. The coefficient of determination of the completed machine-learning model was 0.6932 when using the training data and 0.6628 when using the test data. Using the constructed machine-learning model, we predicted the optimal gene expression level for high D-lactic acid production. We successfully constructed a machine-learning model to predict both D-lactic acid productivity and the suitable gene expression ratio for the production of D-lactic acid. The technique established in this study could be key for predicting phenotypes from genotypes, a problem faced by recent metabolic engineering strategies.
Collapse
Affiliation(s)
- Yoshiki Yamamoto
- Department of Chemical Engineering, Osaka Metropolitan University, 1-1 Gakuen-cho, Naka-ku, Sakai, Osaka, 599-8531, Japan
| | - Ryosuke Yamada
- Department of Chemical Engineering, Osaka Metropolitan University, 1-1 Gakuen-cho, Naka-ku, Sakai, Osaka, 599-8531, Japan.
| | - Takuya Matsumoto
- Department of Chemical Engineering, Osaka Metropolitan University, 1-1 Gakuen-cho, Naka-ku, Sakai, Osaka, 599-8531, Japan
| | - Hiroyasu Ogino
- Department of Chemical Engineering, Osaka Metropolitan University, 1-1 Gakuen-cho, Naka-ku, Sakai, Osaka, 599-8531, Japan
| |
Collapse
|
27
|
Cuperlovic-Culf M, Nguyen-Tran T, Bennett SAL. Machine Learning and Hybrid Methods for Metabolic Pathway Modeling. Methods Mol Biol 2023; 2553:417-439. [PMID: 36227553 DOI: 10.1007/978-1-0716-2617-7_18] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Computational cell metabolism models seek to provide metabolic explanations of cell behavior under different conditions or following genetic alterations, help in the optimization of in vitro cell growth environments, or predict cellular behavior in vivo and in vitro. In the extremes, mechanistic models can include highly detailed descriptions of a small number of metabolic reactions or an approximate representation of an entire metabolic network. To date, all mechanistic models have required details of individual metabolic reactions, either kinetic parameters or metabolic flux, as well as information about extracellular and intracellular metabolite concentrations. Despite the extensive efforts and the increasing availability of high-quality data, required in vivo data are not available for the majority of known metabolic reactions; thus, mechanistic models are based primarily on ex vivo kinetic measurements and limited flux information. Machine learning approaches provide an alternative for derivation of functional dependencies from existing data. The increasing availability of metabolomic and lipidomic data, with growing feature coverage as well as sample set size, is expected to provide new data options needed for derivation of machine learning models of cell metabolic processes. Moreover, machine learning analysis of longitudinal data can lead to predictive models of cell behaviors over time. Conversely, machine learning models trained on steady-state data can provide descriptive models for the comparison of metabolic states in different environments or disease conditions. Additionally, inclusion of metabolic network knowledge in these analyses can further help in the development of models with limited data.This chapter will explore the application of machine learning to the modeling of cell metabolism. We first provide a theoretical explanation of several machine learning and hybrid mechanistic machine learning methods currently being explored to model metabolism. Next, we introduce several avenues for improving these models with machine learning. Finally, we provide protocols for specific examples of the utilization of machine learning in the development of predictive cell metabolism models using metabolomic data. We describe data preprocessing, approaches for training of machine learning models for both descriptive and predictive models, and the utilization of these models in synthetic and systems biology. Detailed protocols provide a list of software tools and libraries used for these applications, step-by-step modeling protocols, troubleshooting, as well as an overview of existing limitations to these approaches.
Collapse
Affiliation(s)
- Miroslava Cuperlovic-Culf
- Digital Technologies Research Centre, National Research Council of Canada, Ottawa, ON, Canada.
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada.
| | - Thao Nguyen-Tran
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| | - Steffany A L Bennett
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
28
|
Doan LMT, Angione C, Occhipinti A. Machine Learning Methods for Survival Analysis with Clinical and Transcriptomics Data of Breast Cancer. Methods Mol Biol 2023; 2553:325-393. [PMID: 36227551 DOI: 10.1007/978-1-0716-2617-7_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Breast cancer is one of the most common cancers in women worldwide, which causes an enormous number of deaths annually. However, early diagnosis of breast cancer can improve survival outcomes enabling simpler and more cost-effective treatments. The recent increase in data availability provides unprecedented opportunities to apply data-driven and machine learning methods to identify early-detection prognostic factors capable of predicting the expected survival and potential sensitivity to treatment of patients, with the final aim of enhancing clinical outcomes. This tutorial presents a protocol for applying machine learning models in survival analysis for both clinical and transcriptomic data. We show that integrating clinical and mRNA expression data is essential to explain the multiple biological processes driving cancer progression. Our results reveal that machine-learning-based models such as random survival forests, gradient boosted survival model, and survival support vector machine can outperform the traditional statistical methods, i.e., Cox proportional hazard model. The highest C-index among the machine learning models was recorded when using survival support vector machine, with a value 0.688, whereas the C-index recorded using the Cox model was 0.677. Shapley Additive Explanation (SHAP) values were also applied to identify the feature importance of the models and their impact on the prediction outcomes.
Collapse
Affiliation(s)
- Le Minh Thao Doan
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough, UK
- National Horizons Centre, Teesside University, Darlington, UK
| | - Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK.
- National Horizons Centre, Teesside University, Darlington, UK.
| |
Collapse
|
29
|
Sieow BFL, De Sotto R, Seet ZRD, Hwang IY, Chang MW. Synthetic Biology Meets Machine Learning. Methods Mol Biol 2023; 2553:21-39. [PMID: 36227537 DOI: 10.1007/978-1-0716-2617-7_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
This chapter outlines the myriad applications of machine learning (ML) in synthetic biology, specifically in engineering cell and protein activity, and metabolic pathways. Though by no means comprehensive, the chapter highlights several prominent computational tools applied in the field and their potential use cases. The examples detailed reinforce how ML algorithms can enhance synthetic biology research by providing data-driven insights into the behavior of living systems, even without detailed knowledge of their underlying mechanisms. By doing so, ML promises to increase the efficiency of research projects by modeling hypotheses in silico that can then be tested through experiments. While challenges related to training dataset generation and computational costs remain, ongoing improvements in ML tools are paving the way for smarter and more streamlined synthetic biology workflows that can be readily employed to address grand challenges across manufacturing, medicine, engineering, agriculture, and beyond.
Collapse
Affiliation(s)
- Brendan Fu-Long Sieow
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- NUS Graduate School for Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, Singapore
| | - Ryan De Sotto
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Zhi Ren Darren Seet
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - In Young Hwang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Matthew Wook Chang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore.
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
30
|
Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023; 62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]
Abstract
Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology when its potential is manifesting through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago has initiated the generation of large omics-datasets which has helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering that has occurred based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trail-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuits designing, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high-accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering have guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering which will guide the researchers toward novel techniques to overcome the limitations.
Collapse
Affiliation(s)
- Pradipta Patra
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Disha B R
- B.M.S College of Engineering, Basavanagudi, Bengaluru, Karnataka 560019, India
| | - Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Manali Das
- School of Bioscience, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
31
|
Magazzù G, Zampieri G, Angione C. Clinical stratification improves the diagnostic accuracy of small omics datasets within machine learning and genome-scale metabolic modelling methods. Comput Biol Med 2022; 151:106244. [PMID: 36343407 DOI: 10.1016/j.compbiomed.2022.106244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 10/07/2022] [Accepted: 10/22/2022] [Indexed: 12/27/2022]
Abstract
BACKGROUND Recently, multi-omic machine learning architectures have been proposed for the early detection of cancer. However, for rare cancers and their associated small datasets, it is still unclear how to use the available multi-omics data to achieve a mechanistic prediction of cancer onset and progression, due to the limited data available. Hepatoblastoma is the most frequent liver cancer in infancy and childhood, and whose incidence has been lately increasing in several developed countries. Even though some studies have been conducted to understand the causes of its onset and discover potential biomarkers, the role of metabolic rewiring has not been investigated in depth so far. METHODS Here, we propose and implement an interpretable multi-omics pipeline that combines mechanistic knowledge from genome-scale metabolic models with machine learning algorithms, and we use it to characterise the underlying mechanisms controlling hepatoblastoma. RESULTS AND CONCLUSIONS While the obtained machine learning models generally present a high diagnostic classification accuracy, our results show that the type of omics combinations used as input to the machine learning models strongly affects the detection of important genes, reactions and metabolic pathways linked to hepatoblastoma. Our method also suggests that, in the context of computer-aided diagnosis of cancer, optimal diagnostic accuracy can be achieved by adopting a combination of omics that depends on the patient's clinical characteristics.
Collapse
Affiliation(s)
- Giuseppe Magazzù
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom
| | - Guido Zampieri
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom; Department of Biology, University of Padova, Padova, Italy
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom; Centre for Digital Innovation, Teesside University, Middlesbrough, England, United Kingdom; National Horizons Centre, Teesside University, Darlington, England, United Kingdom.
| |
Collapse
|
32
|
Debnath T, Hattori R, Okamoto S, Shibata T, Santra TS, Nagai M. Automated detection of patterned single-cells within hydrogel using deep learning. Sci Rep 2022; 12:18343. [PMID: 36316380 PMCID: PMC9622733 DOI: 10.1038/s41598-022-22774-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 10/19/2022] [Indexed: 11/20/2022] Open
Abstract
Single-cell analysis has been widely used in various biomedical engineering applications, ranging from cancer diagnostics, and immune response monitoring to drug screening. Single-cell isolation is fundamental for observing single-cell activities and an automatic finding method of accurate and reliable cell detection with few possible human errors is also essential. This paper reports trapping single cells into photo patternable hydrogel microwell arrays and isolating them. Additionally, we present an object detection-based DL algorithm that detects single cells in microwell arrays and predicts the presence of cells in resource-limited environments at the highest possible mAP (mean average precision) of 0.989 with an average inference time of 0.06 s. This algorithm leads to the enhancement of the high-throughput single-cell analysis, establishing high detection precision and reduced experimentation time.
Collapse
Affiliation(s)
- Tanmay Debnath
- grid.412804.b0000 0001 0945 2394Department of Mechanical Engineering, Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan
| | - Ren Hattori
- grid.412804.b0000 0001 0945 2394Department of Mechanical Engineering, Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan
| | - Shunya Okamoto
- grid.412804.b0000 0001 0945 2394Department of Mechanical Engineering, Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan
| | - Takayuki Shibata
- grid.412804.b0000 0001 0945 2394Department of Mechanical Engineering, Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan
| | - Tuhin Subhra Santra
- grid.417969.40000 0001 2315 1926Indian Institute of Technology Madras, Chennai, Tamil Nadu 600036 India
| | - Moeto Nagai
- grid.412804.b0000 0001 0945 2394Department of Mechanical Engineering, Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan ,grid.412804.b0000 0001 0945 2394Electronic Inspired Interdisciplinary Research Institute (EIIRIS), Toyohashi University of Technology, Toyohashi, Aichi 441-8580 Japan
| |
Collapse
|
33
|
Enuh BM, Nural Yaman B, Tarzi C, Aytar Çelik P, Mutlu MB, Angione C. Whole-genome sequencing and genome-scale metabolic modeling of Chromohalobacter canadensis 85B to explore its salt tolerance and biotechnological use. Microbiologyopen 2022; 11:e1328. [PMID: 36314754 PMCID: PMC9597258 DOI: 10.1002/mbo3.1328] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 10/01/2022] [Indexed: 11/06/2022] Open
Abstract
Salt tolerant organisms are increasingly being used for the industrial production of high-value biomolecules due to their better adaptability compared to mesophiles. Chromohalobacter canadensis is one of the early halophiles to show promising biotechnology potential, which has not been explored to date. Advanced high throughput technologies such as whole-genome sequencing allow in-depth insight into the potential of organisms while at the frontiers of systems biology. At the same time, genome-scale metabolic models (GEMs) enable phenotype predictions through a mechanistic representation of metabolism. Here, we sequence and analyze the genome of C. canadensis 85B, and we use it to reconstruct a GEM. We then analyze the GEM using flux balance analysis and validate it against literature data on C. canadensis. We show that C. canadensis 85B is a metabolically versatile organism with many features for stress and osmotic adaptation. Pathways to produce ectoine and polyhydroxybutyrates were also predicted. The GEM reveals the ability to grow on several carbon sources in a minimal medium and reproduce osmoadaptation phenotypes. Overall, this study reveals insights from the genome of C. canadensis 85B, providing genomic data and a draft GEM that will serve as the first steps towards a better understanding of its metabolism, for novel applications in industrial biotechnology.
Collapse
Affiliation(s)
- Blaise Manga Enuh
- Biotechnology and Biosafety Department, Graduate and Natural Applied ScienceEskişehir Osmangazi UniversityEskişehirTurkey
| | - Belma Nural Yaman
- Biotechnology and Biosafety Department, Graduate and Natural Applied ScienceEskişehir Osmangazi UniversityEskişehirTurkey,Department of Biomedical Engineering, Faculty of Engineering and ArchitectureEskişehir Osmangazi UniversityEskişehirTurkey
| | - Chaimaa Tarzi
- School of Computing, Engineering & Digital TechnologiesTeesside UniversityMiddlesbroughUK
| | - Pınar Aytar Çelik
- Biotechnology and Biosafety Department, Graduate and Natural Applied ScienceEskişehir Osmangazi UniversityEskişehirTurkey,Environmental Protection and Control ProgramEskişehir Osmangazi UniversityEskişehirTurkey
| | - Mehmet Burçin Mutlu
- Department of Biology, Faculty of ScienceEskisehir Technical UniversityEskisehirTurkey
| | - Claudio Angione
- School of Computing, Engineering & Digital TechnologiesTeesside UniversityMiddlesbroughUK,Centre for Digital InnovationTeesside UniversityMiddlesbroughUK,National Horizons CentreTeesside UniversityDarlingtonUK
| |
Collapse
|
34
|
Sidak D, Schwarzerová J, Weckwerth W, Waldherr S. Interpretable machine learning methods for predictions in systems biology from omics data. Front Mol Biosci 2022; 9:926623. [PMID: 36387282 PMCID: PMC9650551 DOI: 10.3389/fmolb.2022.926623] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 08/15/2022] [Indexed: 12/02/2022] Open
Abstract
Machine learning has become a powerful tool for systems biologists, from diagnosing cancer to optimizing kinetic models and predicting the state, growth dynamics, or type of a cell. Potential predictions from complex biological data sets obtained by “omics” experiments seem endless, but are often not the main objective of biological research. Often we want to understand the molecular mechanisms of a disease to develop new therapies, or we need to justify a crucial decision that is derived from a prediction. In order to gain such knowledge from data, machine learning models need to be extended. A recent trend to achieve this is to design “interpretable” models. However, the notions around interpretability are sometimes ambiguous, and a universal recipe for building well-interpretable models is missing. With this work, we want to familiarize systems biologists with the concept of model interpretability in machine learning. We consider data sets, data preparation, machine learning methods, and software tools relevant to omics research in systems biology. Finally, we try to answer the question: “What is interpretability?” We introduce views from the interpretable machine learning community and propose a scheme for categorizing studies on omics data. We then apply these tools to review and categorize recent studies where predictive machine learning models have been constructed from non-sequential omics data.
Collapse
Affiliation(s)
- David Sidak
- Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Molecular Systems Biology (MOSYS), University of Vienna, Vienna, Austria
| | - Jana Schwarzerová
- Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Molecular Systems Biology (MOSYS), University of Vienna, Vienna, Austria
- Department of Biomedical Engineering, Faculty of Electrical Engineering and Communication, Brno University of Technology, Brno, Czech Republic
| | - Wolfram Weckwerth
- Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Molecular Systems Biology (MOSYS), University of Vienna, Vienna, Austria
- Vienna Metabolomics Center (VIME), Faculty of Life Sciences, University of Vienna, Vienna, Austria
| | - Steffen Waldherr
- Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Molecular Systems Biology (MOSYS), University of Vienna, Vienna, Austria
- *Correspondence: Steffen Waldherr,
| |
Collapse
|
35
|
Development of a Prediction Method of Cell Density in Autotrophic/Heterotrophic Microorganism Mixtures by Machine Learning Using Absorbance Spectrum Data. BIOTECH (BASEL (SWITZERLAND)) 2022; 11:biotech11040046. [PMID: 36278558 PMCID: PMC9624369 DOI: 10.3390/biotech11040046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Revised: 10/09/2022] [Accepted: 10/11/2022] [Indexed: 11/06/2022]
Abstract
Microflora is actively used to produce value-added materials in industry, and each cell density should be controlled for stable microflora use. In this study, a simple system evaluating the cell density was constructed with artificial intelligence (AI) using the absorbance spectra data of microflora. To set up the system, the prediction system for cell density based on machine learning was constructed using the spectra data as the feature from the mixture of Saccharomyces cerevisiae and Chlamydomonas reinhardtii. As the results of predicting cell density by extremely randomized trees, when the cell densities of S. cerevisiae and C. reinhardtii were shifted and fixed, the coefficient of determination (R2) was 0.8495; on the other hand, when the cell densities of S. cerevisiae and C. reinhardtii were fixed and shifted, the R2 was 0.9232. To explain the prediction system, the randomized trees regressor of the decision tree-based ensemble learning method as the machine learning algorithm and Shapley additive explanations (SHAPs) as the explainable AI (XAI) to interpret the features contributing to the prediction results were used. As a result of the SHAP analyses, not only the optical density, but also the absorbance of the Soret and Q bands derived from the chloroplasts of C. reinhardtii could contribute to the prediction as the features. The simple cell density evaluating system could have an industrial impact.
Collapse
|
36
|
Aminian-Dehkordi J, Valiei A, Mofrad MRK. Emerging computational paradigms to address the complex role of gut microbial metabolism in cardiovascular diseases. Front Cardiovasc Med 2022; 9:987104. [PMID: 36299869 PMCID: PMC9589059 DOI: 10.3389/fcvm.2022.987104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 09/20/2022] [Indexed: 11/13/2022] Open
Abstract
The human gut microbiota and its associated perturbations are implicated in a variety of cardiovascular diseases (CVDs). There is evidence that the structure and metabolic composition of the gut microbiome and some of its metabolites have mechanistic associations with several CVDs. Nevertheless, there is a need to unravel metabolic behavior and underlying mechanisms of microbiome-host interactions. This need is even more highlighted when considering that microbiome-secreted metabolites contributing to CVDs are the subject of intensive research to develop new prevention and therapeutic techniques. In addition to the application of high-throughput data used in microbiome-related studies, advanced computational tools enable us to integrate omics into different mathematical models, including constraint-based models, dynamic models, agent-based models, and machine learning tools, to build a holistic picture of metabolic pathological mechanisms. In this article, we aim to review and introduce state-of-the-art mathematical models and computational approaches addressing the link between the microbiome and CVDs.
Collapse
Affiliation(s)
| | | | - Mohammad R. K. Mofrad
- Department of Bioengineering and Mechanical Engineering, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
37
|
Yeo HC, Selvarajoo K. Machine learning alternative to systems biology should not solely depend on data. Brief Bioinform 2022; 23:6731718. [PMID: 36184188 PMCID: PMC9677488 DOI: 10.1093/bib/bbac436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 08/24/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open
Abstract
In recent years, artificial intelligence (AI)/machine learning has emerged as a plausible alternative to systems biology for the elucidation of biological phenomena and in attaining specified design objective in synthetic biology. Although considered highly disruptive with numerous notable successes so far, we seek to bring attention to both the fundamental and practical pitfalls of their usage, especially in illuminating emergent behaviors from chaotic or stochastic systems in biology. Without deliberating on their suitability and the required data qualities and pre-processing approaches beforehand, the research and development community could experience similar 'AI winters' that had plagued other fields. Instead, we anticipate the integration or combination of the two approaches, where appropriate, moving forward.
Collapse
Affiliation(s)
- Hock Chuan Yeo
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore
| | | |
Collapse
|
38
|
Du YH, Wang MY, Yang LH, Tong LL, Guo DS, Ji XJ. Optimization and Scale-Up of Fermentation Processes Driven by Models. Bioengineering (Basel) 2022; 9:bioengineering9090473. [PMID: 36135019 PMCID: PMC9495923 DOI: 10.3390/bioengineering9090473] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 09/05/2022] [Accepted: 09/09/2022] [Indexed: 11/16/2022] Open
Abstract
In the era of sustainable development, the use of cell factories to produce various compounds by fermentation has attracted extensive attention; however, industrial fermentation requires not only efficient production strains, but also suitable extracellular conditions and medium components, as well as scaling-up. In this regard, the use of biological models has received much attention, and this review will provide guidance for the rapid selection of biological models. This paper first introduces two mechanistic modeling methods, kinetic modeling and constraint-based modeling (CBM), and generalizes their applications in practice. Next, we review data-driven modeling based on machine learning (ML), and highlight the application scope of different learning algorithms. The combined use of ML and CBM for constructing hybrid models is further discussed. At the end, we also discuss the recent strategies for predicting bioreactor scale-up and culture behavior through a combination of biological models and computational fluid dynamics (CFD) models.
Collapse
Affiliation(s)
- Yuan-Hang Du
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Min-Yu Wang
- State Key Laboratory of Materials-Oriented Chemical Engineering, College of Biotechnology and Pharmaceutical Engineering, Nanjing Tech University, Nanjing 211816, China
| | - Lin-Hui Yang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Ling-Ling Tong
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
| | - Dong-Sheng Guo
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210023, China
- Correspondence: (D.-S.G.); (X.-J.J.)
| | - Xiao-Jun Ji
- State Key Laboratory of Materials-Oriented Chemical Engineering, College of Biotechnology and Pharmaceutical Engineering, Nanjing Tech University, Nanjing 211816, China
- Correspondence: (D.-S.G.); (X.-J.J.)
| |
Collapse
|
39
|
Beardall WA, Stan GB, Dunlop MJ. Deep Learning Concepts and Applications for Synthetic Biology. GEN BIOTECHNOLOGY 2022; 1:360-371. [PMID: 36061221 PMCID: PMC9428732 DOI: 10.1089/genbio.2022.0017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 07/14/2022] [Indexed: 12/24/2022]
Abstract
Synthetic biology has a natural synergy with deep learning. It can be used to generate large data sets to train models, for example by using DNA synthesis, and deep learning models can be used to inform design, such as by generating novel parts or suggesting optimal experiments to conduct. Recently, research at the interface of engineering biology and deep learning has highlighted this potential through successes including the design of novel biological parts, protein structure prediction, automated analysis of microscopy data, optimal experimental design, and biomolecular implementations of artificial neural networks. In this review, we present an overview of synthetic biology-relevant classes of data and deep learning architectures. We also highlight emerging studies in synthetic biology that capitalize on deep learning to enable novel understanding and design, and discuss challenges and future opportunities in this space.
Collapse
Affiliation(s)
- William A.V. Beardall
- Department of Bioengineering, Imperial College London, London, United Kingdom
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, London, United Kingdom
| | - Guy-Bart Stan
- Department of Bioengineering, Imperial College London, London, United Kingdom
- Imperial College Centre of Excellence in Synthetic Biology, Imperial College London, London, United Kingdom
| | - Mary J. Dunlop
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, USA
- Biological Design Center, Boston University, Boston, Massachusetts, USA
| |
Collapse
|
40
|
Liao X, Ma H, Tang YJ. Artificial intelligence: a solution to involution of design–build–test–learn cycle. Curr Opin Biotechnol 2022; 75:102712. [DOI: 10.1016/j.copbio.2022.102712] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 02/05/2022] [Accepted: 03/01/2022] [Indexed: 01/08/2023]
|
41
|
Bi X, Liu Y, Li J, Du G, Lv X, Liu L. Construction of Multiscale Genome-Scale Metabolic Models: Frameworks and Challenges. Biomolecules 2022; 12:biom12050721. [PMID: 35625648 PMCID: PMC9139095 DOI: 10.3390/biom12050721] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 05/15/2022] [Accepted: 05/16/2022] [Indexed: 12/04/2022] Open
Abstract
Genome-scale metabolic models (GEMs) are effective tools for metabolic engineering and have been widely used to guide cell metabolic regulation. However, the single gene–protein-reaction data type in GEMs limits the understanding of biological complexity. As a result, multiscale models that add constraints or integrate omics data based on GEMs have been developed to more accurately predict phenotype from genotype. This review summarized the recent advances in the development of multiscale GEMs, including multiconstraint, multiomic, and whole-cell models, and outlined machine learning applications in GEM construction. This review focused on the frameworks, toolkits, and algorithms for constructing multiscale GEMs. The challenges and perspectives of multiscale GEM development are also discussed.
Collapse
Affiliation(s)
- Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jianghua Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Guocheng Du
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; (X.B.); (Y.L.); (J.L.); (G.D.); (X.L.)
- Science Center for Future Foods, Ministry of Education, Jiangnan University, Wuxi 214122, China
- Correspondence: ; Tel.: +86-0510-8591-8312; Fax: +86-0510-8591-8309
| |
Collapse
|
42
|
Lv X, Hueso-Gil A, Bi X, Wu Y, Liu Y, Liu L, Ledesma-Amaro R. New synthetic biology tools for metabolic control. Curr Opin Biotechnol 2022; 76:102724. [PMID: 35489308 DOI: 10.1016/j.copbio.2022.102724] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 02/28/2022] [Accepted: 03/20/2022] [Indexed: 11/29/2022]
Abstract
In industrial bioprocesses, microbial metabolism dictates the product yields, and therefore, our capacity to control it has an enormous potential to help us move towards a bio-based economy. The rapid development of multiomics data has accelerated our systematic understanding of complex metabolic regulatory mechanisms, which allow us to develop tools to manipulate them. In the last few years, machine learning-based metabolic modeling, Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) derived synthetic biology tools, and synthetic genetic circuits have been widely used to control the metabolism of microorganisms, manipulate gene expression, and build synthetic pathways for bioproduction. This review describes the latest developments for metabolic control, and focuses on the trends and challenges of metabolic engineering strategies.
Collapse
Affiliation(s)
- Xueqin Lv
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Angeles Hueso-Gil
- Department of Bioengineering and Imperial College Centre for Synthetic Biology, Imperial College London, London SW72AZ, UK
| | - Xinyu Bi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Yaokang Wu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Yanfeng Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Long Liu
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China; Science Center for Future Foods, Jiangnan University, Wuxi 214122, China.
| | - Rodrigo Ledesma-Amaro
- Department of Bioengineering and Imperial College Centre for Synthetic Biology, Imperial College London, London SW72AZ, UK.
| |
Collapse
|
43
|
Sampaio M, Rocha M, Dias O. Exploring synergies between plant metabolic modelling and machine learning. Comput Struct Biotechnol J 2022; 20:1885-1900. [PMID: 35521559 PMCID: PMC9052043 DOI: 10.1016/j.csbj.2022.04.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 11/03/2022] Open
|
44
|
Regulatory network-based model to simulate the biochemical regulation of chondrocytes in healthy and osteoarthritic environments. Sci Rep 2022; 12:3856. [PMID: 35264634 PMCID: PMC8907219 DOI: 10.1038/s41598-022-07776-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 02/18/2022] [Indexed: 11/21/2022] Open
Abstract
In osteoarthritis (OA), chondrocyte metabolism dysregulation increases relative catabolic activity, which leads to cartilage degradation. To enable the semiquantitative interpretation of the intricate mechanisms of OA progression, we propose a network-based model at the chondrocyte level that incorporates the complex ways in which inflammatory factors affect structural protein and protease expression and nociceptive signals. Understanding such interactions will leverage the identification of new potential therapeutic targets that could improve current pharmacological treatments. Our computational model arises from a combination of knowledge-based and data-driven approaches that includes in-depth analyses of evidence reported in the specialized literature and targeted network enrichment. We achieved a mechanistic network of molecular interactions that represent both biosynthetic, inflammatory and degradative chondrocyte activity. The network is calibrated against experimental data through a genetic algorithm, and 81% of the responses tested have a normalized root squared error lower than 0.15. The model captures chondrocyte-reported behaviors with 95% accuracy, and it correctly predicts the main outcomes of OA treatment based on blood-derived biologics. The proposed methodology allows us to model an optimal regulatory network that controls chondrocyte metabolism based on measurable soluble molecules. Further research should target the incorporation of mechanical signals.
Collapse
|
45
|
Angione C, Silverman E, Yaneske E. Using machine learning as a surrogate model for agent-based simulations. PLoS One 2022; 17:e0263150. [PMID: 35143521 PMCID: PMC8830643 DOI: 10.1371/journal.pone.0263150] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Accepted: 01/12/2022] [Indexed: 02/02/2023] Open
Abstract
In this proof-of-concept work, we evaluate the performance of multiple machine-learning methods as surrogate models for use in the analysis of agent-based models (ABMs). Analysing agent-based modelling outputs can be challenging, as the relationships between input parameters can be non-linear or even chaotic even in relatively simple models, and each model run can require significant CPU time. Surrogate modelling, in which a statistical model of the ABM is constructed to facilitate detailed model analyses, has been proposed as an alternative to computationally costly Monte Carlo methods. Here we compare multiple machine-learning methods for ABM surrogate modelling in order to determine the approaches best suited as a surrogate for modelling the complex behaviour of ABMs. Our results suggest that, in most scenarios, artificial neural networks (ANNs) and gradient-boosted trees outperform Gaussian process surrogates, currently the most commonly used method for the surrogate modelling of complex computational models. ANNs produced the most accurate model replications in scenarios with high numbers of model runs, although training times were longer than the other methods. We propose that agent-based modelling would benefit from using machine-learning methods for surrogate modelling, as this can facilitate more robust sensitivity analyses for the models while also reducing CPU time consumption when calibrating and analysing the simulation.
Collapse
Affiliation(s)
- Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, United Kingdom
- Healthcare Innovation Centre, Teesside University, Middlesbrough, United Kingdom
- National Horizons Centre, Teesside University, Darlington, United Kingdom
- Centre for Digital Innovation, Teesside University, Middlesbrough, United Kingdom
| | - Eric Silverman
- Institute for Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom
| | - Elisabeth Yaneske
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, United Kingdom
| |
Collapse
|
46
|
Fordjour E, Mensah EO, Hao Y, Yang Y, Liu X, Li Y, Liu CL, Bai Z. Toward improved terpenoids biosynthesis: strategies to enhance the capabilities of cell factories. BIORESOUR BIOPROCESS 2022; 9:6. [PMID: 38647812 PMCID: PMC10992668 DOI: 10.1186/s40643-022-00493-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 01/04/2022] [Indexed: 02/22/2023] Open
Abstract
Terpenoids form the most diversified class of natural products, which have gained application in the pharmaceutical, food, transportation, and fine and bulk chemical industries. Extraction from naturally occurring sources does not meet industrial demands, whereas chemical synthesis is often associated with poor enantio-selectivity, harsh working conditions, and environmental pollutions. Microbial cell factories come as a suitable replacement. However, designing efficient microbial platforms for isoprenoid synthesis is often a challenging task. This has to do with the cytotoxic effects of pathway intermediates and some end products, instability of expressed pathways, as well as high enzyme promiscuity. Also, the low enzymatic activity of some terpene synthases and prenyltransferases, and the lack of an efficient throughput system to screen improved high-performing strains are bottlenecks in strain development. Metabolic engineering and synthetic biology seek to overcome these issues through the provision of effective synthetic tools. This review sought to provide an in-depth description of novel strategies for improving cell factory performance. We focused on improving transcriptional and translational efficiencies through static and dynamic regulatory elements, enzyme engineering and high-throughput screening strategies, cellular function enhancement through chromosomal integration, metabolite tolerance, and modularization of pathways.
Collapse
Affiliation(s)
- Eric Fordjour
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Emmanuel Osei Mensah
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Yunpeng Hao
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Yankun Yang
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Xiuxia Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Ye Li
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China
| | - Chun-Li Liu
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China.
| | - Zhonghu Bai
- National Engineering Laboratory for Cereal Fermentation Technology, Jiangnan University, 1800 Lihu Road, Wuxi, 214122, Jiangsu, China.
- Jiangsu Provincial Research Centre for Bioactive Product Processing Technology, Jiangnan University, Wuxi, China.
| |
Collapse
|
47
|
Pio G, Mignone P, Magazzù G, Zampieri G, Ceci M, Angione C. Integrating genome-scale metabolic modelling and transfer learning for human gene regulatory network reconstruction. Bioinformatics 2022; 38:487-493. [PMID: 34499112 DOI: 10.1093/bioinformatics/btab647] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 07/23/2021] [Accepted: 09/06/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Gene regulation is responsible for controlling numerous physiological functions and dynamically responding to environmental fluctuations. Reconstructing the human network of gene regulatory interactions is thus paramount to understanding the cell functional organization across cell types, as well as to elucidating pathogenic processes and identifying molecular drug targets. Although significant effort has been devoted towards this direction, existing computational methods mainly rely on gene expression levels, possibly ignoring the information conveyed by mechanistic biochemical knowledge. Moreover, except for a few recent attempts, most of the existing approaches only consider the information of the organism under analysis, without exploiting the information of related model organisms. RESULTS We propose a novel method for the reconstruction of the human gene regulatory network, based on a transfer learning strategy that synergically exploits information from human and mouse, conveyed by gene-related metabolic features generated in silico from gene expression data. Specifically, we learn a predictive model from metabolic activity inferred via tissue-specific metabolic modelling of artificial gene knockouts. Our experiments show that the combination of our transfer learning approach with the constructed metabolic features provides a significant advantage in terms of reconstruction accuracy, as well as additional clues on the contribution of each constructed metabolic feature. AVAILABILITY AND IMPLEMENTATION The method, the datasets and all the results obtained in this study are available at: https://doi.org/10.6084/m9.figshare.c.5237687. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Gianvito Pio
- Department of Computer Science, University of Bari Aldo Moro, Bari 70125, Italy.,Big Data Lab, National Interuniversity Consortium for Informatics (CINI), Rome 00185, Italy
| | - Paolo Mignone
- Department of Computer Science, University of Bari Aldo Moro, Bari 70125, Italy.,Big Data Lab, National Interuniversity Consortium for Informatics (CINI), Rome 00185, Italy
| | - Giuseppe Magazzù
- School of Computing, Engineering & Digital Technologies, Teesside University, Tees Valley TS1 3BA, UK
| | - Guido Zampieri
- School of Computing, Engineering & Digital Technologies, Teesside University, Tees Valley TS1 3BA, UK.,Department of Biology, University of Padova, Padova 35121, Italy
| | - Michelangelo Ceci
- Department of Computer Science, University of Bari Aldo Moro, Bari 70125, Italy.,Big Data Lab, National Interuniversity Consortium for Informatics (CINI), Rome 00185, Italy.,Department of Knowledge Technologies, Jozef Stefan Institute, Ljubljana 1000, Slovenia
| | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Tees Valley TS1 3BA, UK.,Centre for Digital Innovation, Teesside University, Campus Heart, Tees Valley TS1 3BX, UK.,Healthcare Innovation Centre, Teesside University, Campus Heart, Tees Valley TS1 3BX, UK
| |
Collapse
|
48
|
Vijayakumar S, Magazzù G, Moon P, Occhipinti A, Angione C. A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling. Methods Mol Biol 2022; 2399:87-122. [PMID: 35604554 DOI: 10.1007/978-1-0716-1831-8_5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Complex, distributed, and dynamic sets of clinical biomedical data are collectively referred to as multimodal clinical data. In order to accommodate the volume and heterogeneity of such diverse data types and aid in their interpretation when they are combined with a multi-scale predictive model, machine learning is a useful tool that can be wielded to deconstruct biological complexity and extract relevant outputs. Additionally, genome-scale metabolic models (GSMMs) are one of the main frameworks striving to bridge the gap between genotype and phenotype by incorporating prior biological knowledge into mechanistic models. Consequently, the utilization of GSMMs as a foundation for the integration of multi-omic data originating from different domains is a valuable pursuit towards refining predictions. In this chapter, we show how cancer multi-omic data can be analyzed via multimodal machine learning and metabolic modeling. Firstly, we focus on the merits of adopting an integrative systems biology led approach to biomedical data mining. Following this, we propose how constraint-based metabolic models can provide a stable yet adaptable foundation for the integration of multimodal data with machine learning. Finally, we provide a step-by-step tutorial for the combination of machine learning and GSMMs, which includes: (i) tissue-specific constraint-based modeling; (ii) survival analysis using time-to-event prediction for cancer; and (iii) classification and regression approaches for multimodal machine learning. The code associated with the tutorial can be found at https://github.com/Angione-Lab/Tutorials_Combining_ML_and_GSMM .
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Giuseppe Magazzù
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Pradip Moon
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Annalisa Occhipinti
- Computational Systems Biology and Data Analytics Research Group, Middlebrough, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK
| | - Claudio Angione
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK.
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK.
- Healthcare Innovation Centre, Teesside University, Middlesbrough, UK.
| |
Collapse
|
49
|
Passi A, Tibocha-Bonilla JD, Kumar M, Tec-Campos D, Zengler K, Zuniga C. Genome-Scale Metabolic Modeling Enables In-Depth Understanding of Big Data. Metabolites 2021; 12:14. [PMID: 35050136 PMCID: PMC8778254 DOI: 10.3390/metabo12010014] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/18/2021] [Accepted: 12/20/2021] [Indexed: 11/16/2022] Open
Abstract
Genome-scale metabolic models (GEMs) enable the mathematical simulation of the metabolism of archaea, bacteria, and eukaryotic organisms. GEMs quantitatively define a relationship between genotype and phenotype by contextualizing different types of Big Data (e.g., genomics, metabolomics, and transcriptomics). In this review, we analyze the available Big Data useful for metabolic modeling and compile the available GEM reconstruction tools that integrate Big Data. We also discuss recent applications in industry and research that include predicting phenotypes, elucidating metabolic pathways, producing industry-relevant chemicals, identifying drug targets, and generating knowledge to better understand host-associated diseases. In addition to the up-to-date review of GEMs currently available, we assessed a plethora of tools for developing new GEMs that include macromolecular expression and dynamic resolution. Finally, we provide a perspective in emerging areas, such as annotation, data managing, and machine learning, in which GEMs will play a key role in the further utilization of Big Data.
Collapse
Affiliation(s)
- Anurag Passi
- Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA; (A.P.); (M.K.); (D.T.-C.); (K.Z.)
| | - Juan D. Tibocha-Bonilla
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA;
| | - Manish Kumar
- Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA; (A.P.); (M.K.); (D.T.-C.); (K.Z.)
| | - Diego Tec-Campos
- Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA; (A.P.); (M.K.); (D.T.-C.); (K.Z.)
- Facultad de Ingeniería Química, Campus de Ciencias Exactas e Ingenierías, Universidad Autónoma de Yucatán, Merida 97203, Yucatan, Mexico
| | - Karsten Zengler
- Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA; (A.P.); (M.K.); (D.T.-C.); (K.Z.)
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093-0412, USA
- Center for Microbiome Innovation, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0403, USA
| | - Cristal Zuniga
- Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0760, USA; (A.P.); (M.K.); (D.T.-C.); (K.Z.)
| |
Collapse
|
50
|
Vijayakumar S, Angione C. Protocol for hybrid flux balance, statistical, and machine learning analysis of multi-omic data from the cyanobacterium Synechococcus sp. PCC 7002. STAR Protoc 2021; 2:100837. [PMID: 34632416 PMCID: PMC8488602 DOI: 10.1016/j.xpro.2021.100837] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Combining a computational framework for flux balance analysis with machine learning improves the accuracy of predicting metabolic activity across conditions, while enabling mechanistic interpretation. This protocol presents a guide to condition-specific metabolic modeling that integrates regularized flux balance analysis with machine learning approaches to extract key features from transcriptomic and fluxomic data. We demonstrate the protocol as applied to Synechococcus sp. PCC 7002; we also outline how it can be adapted to any species or community with available multi-omic data. For complete details on the use and execution of this protocol, please refer to Vijayakumar et al. (2020).
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
| | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough TS1 3BX, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough TS1 3BX, UK
| |
Collapse
|