1
|
Helmy M, Elhalis H, Rashid MM, Selvarajoo K. Can digital twin efforts shape microorganism-based alternative food? Curr Opin Biotechnol 2024; 87:103115. [PMID: 38547588 DOI: 10.1016/j.copbio.2024.103115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 06/09/2024]
Abstract
With the continuous increment in global population growth, compounded by post-pandemic food security challenges due to labor shortages, effects of climate change, political conflicts, limited land for agriculture, and carbon emissions control, addressing food production in a sustainable manner for future generations is critical. Microorganisms are potential alternative food sources that can help close the gap in food production. For the development of more efficient and yield-enhancing products, it is necessary to have a better understanding on the underlying regulatory molecular pathways of microbial growth. Nevertheless, as microbes are regulated at multiomics scales, current research focusing on single omics (genomics, proteomics, or metabolomics) independently is inadequate for optimizing growth and product output. Here, we discuss digital twin (DT) approaches that integrate systems biology and artificial intelligence in analyzing multiomics datasets to yield a microbial replica model for in silico testing before production. DT models can thus provide a holistic understanding of microbial growth, metabolite biosynthesis mechanisms, as well as identifying crucial production bottlenecks. Our argument, therefore, is to support the development of novel DT models that can potentially revolutionize microorganism-based alternative food production efficiency.
Collapse
Affiliation(s)
- Mohamed Helmy
- Vaccine and Infectious Disease Organization (VIDO), University of Saskatchewan, SK, Canada; Department of Computer Science, Lakehead University, ON, Canada; Department of Computer Science, College of Science and Engineering, Idaho State University, ID, USA; Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore
| | - Hosam Elhalis
- Research School of Biology, Australian National University, Canberra, Australia
| | - Md Mamunur Rashid
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore
| | - Kumar Selvarajoo
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), Singapore 138671, Singapore; Synthetic Biology Translational Research Program and SynCTI, Yong Loo Lin School of Medicine, National University of Singapore (NUS), Singapore 117456, Singapore; School of Biological Sciences, Nanyang Technological University (NTU), Singapore 637551, Singapore.
| |
Collapse
|
2
|
Siddharth T, Lewis NE. Predicting pathways for old and new metabolites through clustering. J Theor Biol 2024; 578:111684. [PMID: 38048983 PMCID: PMC11139542 DOI: 10.1016/j.jtbi.2023.111684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 11/17/2023] [Accepted: 11/29/2023] [Indexed: 12/06/2023]
Abstract
The diverse metabolic pathways are fundamental to all living organisms, as they harvest energy, synthesize biomass components, produce molecules to interact with the microenvironment, and neutralize toxins. While the discovery of new metabolites and pathways continues, the prediction of pathways for new metabolites can be challenging. It can take vast amounts of time to elucidate pathways for new metabolites; thus, according to HMDB (Human Metabolome Database), only 60% of metabolites get assigned to pathways. Here, we present an approach to identify pathways based on metabolite structure. We extracted 201 features from SMILES annotations and identified new metabolites from PubMed abstracts and HMDB. After applying clustering algorithms to both groups of features, we quantified correlations between metabolites, and found the clusters accurately linked 92% of known metabolites to their respective pathways. Thus, this approach could be valuable for predicting metabolic pathways for new metabolites.
Collapse
Affiliation(s)
- Thiru Siddharth
- Department of Computer Science and Engineering, Indian Institute of Information Technology, Bhopal, MP 462003, India
| | - Nathan E Lewis
- Department of Pediatrics and Bioengineering, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
3
|
de Vries R, Farzan N, Fabius T, De Jongh FHC, Jak PMC, Haarman EG, Snoey E, In 't Veen JCCM, Dagelet YWF, Maitland-Van Der Zee AH, Lucas A, Van Den Heuvel MM, Wolf-Lansdorf M, Muller M, Baas P, Sterk PJ. Prospective Detection of Early Lung Cancer in Patients With COPD in Regular Care by Electronic Nose Analysis of Exhaled Breath. Chest 2023; 164:1315-1324. [PMID: 37209772 PMCID: PMC10635840 DOI: 10.1016/j.chest.2023.04.050] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 04/18/2023] [Accepted: 04/25/2023] [Indexed: 05/22/2023] Open
Abstract
BACKGROUND Patients with COPD are at high risk of lung cancer developing, but no validated predictive biomarkers have been reported to identify these patients. Molecular profiling of exhaled breath by electronic nose (eNose) technology may qualify for early detection of lung cancer in patients with COPD. RESEARCH QUESTION Can eNose technology be used for prospective detection of early lung cancer in patients with COPD? STUDY DESIGN AND METHODS BreathCloud is a real-world multicenter prospective follow-up study using diagnostic and monitoring visits in day-to-day clinical care of patients with a standardized diagnosis of asthma, COPD, or lung cancer. Breath profiles were collected at inclusion in duplicate by a metal-oxide semiconductor eNose positioned at the rear end of a pneumotachograph (SpiroNose; Breathomix). All patients with COPD were managed according to standard clinical care, and the incidence of clinically diagnosed lung cancer was prospectively monitored for 2 years. Data analysis involved advanced signal processing, ambient air correction, and statistics based on principal component (PC) analysis, linear discriminant analysis, and receiver operating characteristic analysis. RESULTS Exhaled breath data from 682 patients with COPD and 211 patients with lung cancer were available. Thirty-seven patients with COPD (5.4%) demonstrated clinically manifest lung cancer within 2 years after inclusion. Principal components 1, 2, and 3 were significantly different between patients with COPD and those with lung cancer in both training and validation sets with areas under the receiver operating characteristic curve of 0.89 (95% CI, 0.83-0.95) and 0.86 (95% CI, 0.81-0.89). The same three PCs showed significant differences (P < .01) at baseline between patients with COPD who did and did not subsequently demonstrate lung cancer within 2 years, with a cross-validation value of 87% and an area under the receiver operating characteristic curve of 0.90 (95% CI, 0.84-0.95). INTERPRETATION Exhaled breath analysis by eNose identified patients with COPD in whom lung cancer became clinically manifest within 2 years after inclusion. These results show that eNose assessment may detect early stages of lung cancer in patients with COPD.
Collapse
Affiliation(s)
- Rianne de Vries
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands; Breathomix B.V, Leiden, The Netherlands.
| | | | - Timon Fabius
- Medisch Spectrum Twente, Enschede, The Netherlands
| | | | - Patrick M C Jak
- Emma Children's Hospital, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Eric G Haarman
- Emma Children's Hospital, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | - Erik Snoey
- Franciscus Gasthuis & Vlietland, Rotterdam, The Netherlands
| | | | | | - Anke-Hilse Maitland-Van Der Zee
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | | | | | | | - Mirte Muller
- The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Paul Baas
- The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Peter J Sterk
- Amsterdam University Medical Centers, University of Amsterdam, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
4
|
Ateia M, Sigmund G, Bentel MJ, Washington JW, Lai A, Merrill NH, Wang Z. Integrated data-driven cross-disciplinary framework to prevent chemical water pollution. ONE EARTH (CAMBRIDGE, MASS.) 2023; 6:10.1016/j.oneear.2023.07.001. [PMID: 38264630 PMCID: PMC10802893 DOI: 10.1016/j.oneear.2023.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Abstract
Access to a clean and healthy environment is a human right and a prerequisite for maintaining a sustainable ecosystem. Experts across domains along the chemical life cycle have traditionally operated in isolation, leading to limited connectivity between upstream chemical innovation to downstream development of water-treatment technologies. This fragmented and historically reactive approach to managing emerging contaminants has resulted in significant externalized societal costs. Herein, we propose an integrated data-driven framework to foster proactive action across domains to effectively address chemical water pollution. By implementing this integrated framework, it will not only enhance the capabilities of experts in their respective fields but also create opportunities for novel approaches that yield co-benefits across multiple domains. To successfully operationalize the integrated framework, several concerted efforts are warranted, including adopting open and FAIR (findable, accessible, interoperable, and reusable) data practices, developing common knowledge bases/platforms, and staying vigilant against new substance "properties" of concern.
Collapse
Affiliation(s)
- Mohamed Ateia
- United States Environmental Protection Agency, Center for Environmental Solutions & Emergency Response, Cincinnati, OH 45220, USA
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, TX, USA
| | - Gabriel Sigmund
- Environmental Geosciences, Centre for Microbiology and Environmental Systems Science, University of Vienna, Josef-Holaubeck-Platz 2, 1090 Vienna, Austria
- Environmental Technology, Wageningen University & Research, P.O. Box 17, 6700 AA Wageningen, the Netherlands
| | - Michael J. Bentel
- Department of Environmental Engineering and Earth Sciences, Clemson University, Clemson, SC 29634, USA
| | - John W. Washington
- United States Environmental Protection Agency, Center for Environmental Measurement and Modeling, Athens, GA 30605, USA
| | - Adelene Lai
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367 Belvaux, Luxembourg
- Institute for Inorganic and Analytical Chemistry, Friedrich-Schiller-University, 07743 Jena, Germany
| | - Nathaniel H. Merrill
- United States Environmental Protection Agency, Center for Environmental Measurement and Modeling, Narragansett, RI, USA
| | - Zhanyun Wang
- Empa Swiss – Federal Laboratories for Materials Science and Technology, Technology and Society Laboratory, 9014 St. Gallen, Switzerland
| |
Collapse
|
5
|
Shah HA, Liu J, Yang Z, Yang F, Zhang Q, Feng J. DeepRT: Predicting compounds presence in pathway modules and classifying into module classes using deep neural networks based on molecular properties. J Bioinform Comput Biol 2023; 21:2350017. [PMID: 37632195 DOI: 10.1142/s0219720023500178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/27/2023]
Abstract
Metabolic pathways play a crucial role in understanding the biochemistry of organisms. In metabolic pathways, modules refer to clusters of interconnected reactions or sub-networks representing specific functional units or biological processes within the overall pathway. In pathway modules, compounds are major elements and refer to the various molecules that participate in the biochemical reactions within the pathway modules. These molecules can include substrates, intermediates and final products. Determining the presence relation of compounds and pathway modules is essential for synthesizing new molecules and predicting hidden reactions. To date, several computational methods have been proposed to address this problem. However, all methods only predict the metabolic pathways and their types, not the pathway modules. To address this issue, we proposed a novel deep learning model, DeepRT that integrates message passing neural networks (MPNNs) and transformer encoder. This combination allows DeepRT to effectively extract global and local structure information from the molecular graph. The model is designed to perform two tasks: first, determining the present relation of the compound with the pathway module, and second, predicting the relation of query compound and module classes. The proposed DeepRT model evaluated on a dataset comprising compounds and pathway modules, and it outperforms existing approaches.
Collapse
Affiliation(s)
- Hayat Ali Shah
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Juan Liu
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Zhihui Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Feng Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Qiang Zhang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| | - Jing Feng
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, P. R. China
| |
Collapse
|
6
|
Sen P, Orešič M. Integrating Omics Data in Genome-Scale Metabolic Modeling: A Methodological Perspective for Precision Medicine. Metabolites 2023; 13:855. [PMID: 37512562 PMCID: PMC10383060 DOI: 10.3390/metabo13070855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/11/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023] Open
Abstract
Recent advancements in omics technologies have generated a wealth of biological data. Integrating these data within mathematical models is essential to fully leverage their potential. Genome-scale metabolic models (GEMs) provide a robust framework for studying complex biological systems. GEMs have significantly contributed to our understanding of human metabolism, including the intrinsic relationship between the gut microbiome and the host metabolism. In this review, we highlight the contributions of GEMs and discuss the critical challenges that must be overcome to ensure their reproducibility and enhance their prediction accuracy, particularly in the context of precision medicine. We also explore the role of machine learning in addressing these challenges within GEMs. The integration of omics data with GEMs has the potential to lead to new insights, and to advance our understanding of molecular mechanisms in human health and disease.
Collapse
Affiliation(s)
- Partho Sen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| | - Matej Orešič
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| |
Collapse
|
7
|
Hassan S, Ganai BA. Deciphering the recent trends in pesticide bioremediation using genome editing and multi-omics approaches: a review. World J Microbiol Biotechnol 2023; 39:151. [PMID: 37029313 DOI: 10.1007/s11274-023-03603-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 04/03/2023] [Indexed: 04/09/2023]
Abstract
Pesticide pollution in recent times has emerged as a grave environmental problem contaminating both aquatic and terrestrial ecosystems owing to their widespread use. Bioremediation using gene editing and system biology could be developed as an eco-friendly and proficient tool to remediate pesticide-contaminated sites due to its advantages and greater public acceptance over the physical and chemical methods. However, it is indispensable to understand the different aspects associated with microbial metabolism and their physiology for efficient pesticide remediation. Therefore, this review paper analyses the different gene editing tools and multi-omics methods in microbes to produce relevant evidence regarding genes, proteins and metabolites associated with pesticide remediation and the approaches to contend against pesticide-induced stress. We systematically discussed and analyzed the recent reports (2015-2022) on multi-omics methods for pesticide degradation to elucidate the mechanisms and the recent advances associated with the behaviour of microbes under diverse environmental conditions. This study envisages that CRISPR-Cas, ZFN and TALEN as gene editing tools utilizing Pseudomonas, Escherichia coli and Achromobacter sp. can be employed for remediation of chlorpyrifos, parathion-methyl, carbaryl, triphenyltin and triazophos by creating gRNA for expressing specific genes for the bioremediation. Similarly, systems biology accompanying multi-omics tactics revealed that microbial strains from Paenibacillus, Pseudomonas putida, Burkholderia cenocepacia, Rhodococcus sp. and Pencillium oxalicum are capable of degrading deltamethrin, p-nitrophenol, chlorimuron-ethyl and nicosulfuron. This review lends notable insights into the research gaps and provides potential solutions for pesticide remediation by using different microbe-assisted technologies. The inferences drawn from the current study will help researchers, ecologists, and decision-makers gain comprehensive knowledge of value and application of systems biology and gene editing in bioremediation assessments.
Collapse
Affiliation(s)
- Shahnawaz Hassan
- Department of Environmental Science, University of Kashmir, Srinagar, 190006, India.
| | - Bashir Ahmad Ganai
- Centre of Research for Development, University of Kashmir, Srinagar, 190006, India.
| |
Collapse
|
8
|
Miller WM, Ziegler KM, Yilmaz A, Saiyed N, Ustun I, Akyol S, Idler J, Sims MD, Maddens ME, Graham SF. Association of Metabolomic Biomarkers with Sleeve Gastrectomy Weight Loss Outcomes. Metabolites 2023; 13:metabo13040506. [PMID: 37110164 PMCID: PMC10145663 DOI: 10.3390/metabo13040506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 03/29/2023] [Accepted: 03/29/2023] [Indexed: 04/03/2023] Open
Abstract
This prospective observational study aimed to evaluate the association of metabolomic alterations with weight loss outcomes following sleeve gastrectomy (SG). We evaluated the metabolomic profile of serum and feces prior to SG and three months post-SG, along with weight loss outcomes in 45 adults with obesity. The percent total weight loss for the highest versus the lowest weight loss tertiles (T3 vs. T1) was 17.0 ± 1.3% and 11.1 ± 0.8%, p < 0.001. Serum metabolite alterations specific to T3 at three months included a decrease in methionine sulfoxide concentration as well as alterations to tryptophan and methionine metabolism (p < 0.03). Fecal metabolite changes specific to T3 included a decrease in taurine concentration and perturbations to arachidonic acid metabolism, and taurine and hypotaurine metabolism (p < 0.002). Preoperative metabolites were found to be highly predictive of weight loss outcomes in machine learning algorithms, with an average area under the curve of 94.6% for serum and 93.4% for feces. This comprehensive metabolomics analysis of weight loss outcome differences post-SG highlights specific metabolic alterations as well as machine learning algorithms predictive of weight loss. These findings could contribute to the development of novel therapeutic targets to enhance weight loss outcomes after SG.
Collapse
Affiliation(s)
- Wendy M. Miller
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Kathryn M. Ziegler
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Ali Yilmaz
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Nazia Saiyed
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Ilyas Ustun
- DePaul University Jarvis College of Computing and Digital Media, 243 S Wabash Ave, Chicago, IL 60604, USA
| | - Sumeyya Akyol
- NX Prenatal Inc. Laboratory, 4800 Fournace Place, Suite BW28, Bellaire, TX 77401, USA
| | - Jay Idler
- Allegheny Health Network, West Penn Hospital, 4815 Liberty Ave, Suite GR50, Pittsburgh, PA 15224, USA
- Drexel University College of Medicine, 2900 W Queen Ln, Philadelphia, PA 19129, USA
| | - Matthew D. Sims
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| | - Michael E. Maddens
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Oakland University William Beaumont School of Medicine, 586 Pioneer Dr, Rochester, MI 48309, USA
| | - Stewart F. Graham
- Department of Nutrition and Preventive Medicine, Corewell Health William Beaumont University Hospital, 3601 W. 13 Mile Road, Royal Oak, MI 48073, USA
- Beaumont Research Institute, 3811 W 13 Mile Rd, Royal Oak, MI 48073, USA
| |
Collapse
|
9
|
Zhang X, Su Y, Lane AN, Stromberg AJ, Fan TWM, Wang C. Bayesian kinetic modeling for tracer-based metabolomic data. BMC Bioinformatics 2023; 24:108. [PMID: 36949395 PMCID: PMC10035190 DOI: 10.1186/s12859-023-05211-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 02/24/2023] [Indexed: 03/24/2023] Open
Abstract
BACKGROUND Stable Isotope Resolved Metabolomics (SIRM) is a new biological approach that uses stable isotope tracers such as uniformly [Formula: see text]-enriched glucose ([Formula: see text]-Glc) to trace metabolic pathways or networks at the atomic level in complex biological systems. Non-steady-state kinetic modeling based on SIRM data uses sets of simultaneous ordinary differential equations (ODEs) to quantitatively characterize the dynamic behavior of metabolic networks. It has been increasingly used to understand the regulation of normal metabolism and dysregulation in the development of diseases. However, fitting a kinetic model is challenging because there are usually multiple sets of parameter values that fit the data equally well, especially for large-scale kinetic models. In addition, there is a lack of statistically rigorous methods to compare kinetic model parameters between different experimental groups. RESULTS We propose a new Bayesian statistical framework to enhance parameter estimation and hypothesis testing for non-steady-state kinetic modeling of SIRM data. For estimating kinetic model parameters, we leverage the prior distribution not only to allow incorporation of experts' knowledge but also to provide robust parameter estimation. We also introduce a shrinkage approach for borrowing information across the ensemble of metabolites to stably estimate the variance of an individual isotopomer. In addition, we use a component-wise adaptive Metropolis algorithm with delayed rejection to perform efficient Monte Carlo sampling of the posterior distribution over high-dimensional parameter space. For comparing kinetic model parameters between experimental groups, we propose a new reparameterization method that converts the complex hypothesis testing problem into a more tractable parameter estimation problem. We also propose an inference procedure based on credible interval and credible value. Our method is freely available for academic use at https://github.com/xuzhang0131/MCMCFlux . CONCLUSIONS Our new Bayesian framework provides robust estimation of kinetic model parameters and enables rigorous comparison of model parameters between experimental groups. Simulation studies and application to a lung cancer study demonstrate that our framework performs well for non-steady-state kinetic modeling of SIRM data.
Collapse
Affiliation(s)
- Xu Zhang
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA.
| | - Ya Su
- Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, Richmond, 23220, USA
| | - Andrew N Lane
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, 40536, USA
| | - Arnold J Stromberg
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA
| | - Teresa W M Fan
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, 40536, USA
| | - Chi Wang
- Dr. Bing Zhang Department of Statistics, University of Kentucky, Lexington, 40536, USA.
- Markey Cancer Center, University of Kentucky, Lexington, 40536, USA.
- Division of Cancer Biostatistics, Department of Internal Medicine, University of Kentucky, Lexington, 40536, USA.
| |
Collapse
|
10
|
Liu W, Zhang L, Bao L, Shen G, Feng J. Accurate Classification and Prediction of Acute Myocardial Infarction through an ARMD Procedure. J Proteome Res 2023; 22:758-767. [PMID: 36710647 DOI: 10.1021/acs.jproteome.2c00488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The risk stratification of acute myocardial infarction (AMI) patients is of prime importance for clinical management and prognosis assessment. Thus, we propose an ensemble machine learning analysis procedure named ADASYN-RFECV-MDA-DNN (ARMD) to address sample-unbalanced problems and enable stratification and prediction of AMI outcomes. The ARMD analysis procedure was applied to the NMR data of sera from 534 AMI-related subjects in four categories with an extremely imbalanced sample proportion. Firstly, the adaptive synthetic sampling (ADASYN) algorithm was used to address the issue of the original sample imbalance. Secondly, the recursive feature elimination with cross-validation (RFECV) processing and random forest mean decrease accuracy (RF-MDA) algorithm was performed to identify the differential metabolites corresponding to each AMI outcome. Finally, the deep neural network (DNN) was employed to classify and predict AMI events, and its performance was evaluated by comparing the four traditional machine learning methods. Compared with the other four machine learning models, DNN presented consistent superiority in almost all of the model parameters including precision, f1-score, sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and classification accuracy, highlighting the potential of deep learning in classification and stratification of clinical diseases. The ARMD analysis procedure was a practical analysis tool for supervised classification and regression modeling of clinical diseases.
Collapse
Affiliation(s)
- Wuping Liu
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Lirong Zhang
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Lijun Bao
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Guiping Shen
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| | - Jianghua Feng
- Department of Electronic Science, Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, 422 Siming South Road, Siming District, Xiamen, Fujian 361005, China
| |
Collapse
|
11
|
Jala A, Dutta R, Josyula JVN, Mutheneni SR, Borkar RM. Environmental phenol exposure associates with urine metabolome alteration in young Northeast Indian females. CHEMOSPHERE 2023; 317:137830. [PMID: 36640981 DOI: 10.1016/j.chemosphere.2023.137830] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 12/12/2022] [Accepted: 01/10/2023] [Indexed: 06/17/2023]
Abstract
Urinary biomonitoring delivers the most accurate environmental phenols exposure assessment. However, environmental phenol exposure-related biomarkers are required to improve risk assessment to understand the internal processes perturbed, which may link exposure to specific health outcomes. This study aimed to investigate the association between environmental phenols exposure and the metabolome of young adult females from India. Urinary metabolomics was performed using liquid chromatography-mass spectrometry. Environmental phenols-related metabolic biomarkers were investigated by comparing the low and high exposure of environmental phenols. Seven potential biomarkers, namely histidine, cysteine-s-sulfate, 12-KETE, malonic acid, p-hydroxybenzoic acid, PE (36:2), and PS (36:0), were identified, revealing that environmental phenol exposure altered the metabolic pathways such as histidine metabolism, beta-Alanine metabolism, glycerophospholipid metabolism, and other pathways. This study also conceived an innovative strategy for the early prediction of diseases by combining urinary metabolomics with machine learning (ML) algorithms. The differential metabolites predictive accuracy by ML models was >80%. This is the first mass spectrometry-based metabolomics study on young adult females from India with environmental phenols exposure. The study is valuable in demonstrating multiple urine metabolic changes linked to environmental phenol exposure and a better understanding of the mechanisms behind environmental phenol-induced effects in young female adults.
Collapse
Affiliation(s)
- Aishwarya Jala
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, 781101, Assam, India
| | - Ratul Dutta
- Down Town Hospital, Guwahati, Assam, 781106, India
| | | | - Srinivasa Rao Mutheneni
- Applied Biology Division, CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, 500007, Telangana, India
| | - Roshan M Borkar
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research, Guwahati, 781101, Assam, India.
| |
Collapse
|
12
|
Wijbenga N, Hoek RAS, Mathot BJ, Seghers L, Moor CC, Aerts JGJV, Bos D, Manintveld OC, Hellemons ME. Diagnostic performance of electronic nose technology in chronic lung allograft dysfunction. J Heart Lung Transplant 2023; 42:236-245. [PMID: 36283951 DOI: 10.1016/j.healun.2022.09.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 08/22/2022] [Accepted: 09/12/2022] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND There is a need for reliable biomarkers for the diagnosis of chronic lung allograft dysfunction (CLAD). In this light, we investigated the diagnostic value of exhaled breath analysis using an electronic nose (eNose) for CLAD, CLAD phenotype, and CLAD stage in lung transplant recipients (LTR). METHODS We performed eNose measurements in LTR with and without CLAD, visiting the outpatient clinic. Through supervised machine learning, the diagnostic value of eNose for CLAD was assessed in a random training and validation set. Next, we investigated the diagnostic value of the eNose measurements combined with known risk factors for CLAD. Model performance was evaluated using ROC-analysis. RESULTS We included 152 LTR (median age 60 years, 49% females), of whom 38 with CLAD. eNose-based classification of patients with and without CLAD provided an AUC of 0.86 in the training set, and 0.82 in the validation set. After adding established risk factors for CLAD (age, gender, type of transplantation, time after transplantation and prior occurrence of acute cellular rejection) to a model with the eNose data, the discriminative ability of the model improved to an AUC of 0.94 (p = 0.02) in the training set and 0.94 (p = 0.04) in the validation set. Discrimination between BOS and RAS was good (AUC 0.95). Discriminative ability for other phenotypes (AUCs ranging 0.50-0.92) or CLAD stages (AUC 0.56) was limited. CONCLUSION Exhaled breath analysis using eNose is a promising novel biomarker for enabling diagnosis and phenotyping CLAD. eNose technology could be a valuable addition to the diagnostic armamentarium for suspected graft failure in LTR.
Collapse
Affiliation(s)
- Nynke Wijbenga
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Rogier A S Hoek
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Bas J Mathot
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Leonard Seghers
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Catharina C Moor
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Joachim G J V Aerts
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Daniel Bos
- Department of Radiology & Nuclear Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Department of Epidemiology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Olivier C Manintveld
- Department of Cardiology, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Merel E Hellemons
- Department of Respiratory Medicine, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands; Erasmus MC Transplant Institute, Erasmus University Medical Center Rotterdam, Rotterdam, The Netherlands.
| |
Collapse
|
13
|
Aida H, Uchida K, Nagai M, Hashizume T, Masuo S, Takaya N, Ying BW. Machine learning-assisted medium optimization revealed the discriminated strategies for improved production of the foreign and native metabolites. Comput Struct Biotechnol J 2023; 21:2654-2663. [PMID: 37138901 PMCID: PMC10149329 DOI: 10.1016/j.csbj.2023.04.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 04/18/2023] [Accepted: 04/19/2023] [Indexed: 05/05/2023] Open
Abstract
The composition of medium components is crucial for achieving the best performance of synthetic construction in genetically engineered cells. Which and how medium components determine the performance, e.g., productivity, remain poorly investigated. To address the questions, a comparative survey with two genetically engineered Escherichia coli strains was performed. As a case study, the strains carried the synthetic pathways for producing the aromatic compounds of 4-aminophenylalanine (4APhe) or tyrosine (Tyr), common in the upstream but differentiated in the downstream metabolism. Bacterial growth and compound production were examined in hundreds of medium combinations that comprised 48 pure chemicals. The resultant data sets linking the medium composition to bacterial growth and production were subjected to machine learning for improved production. Intriguingly, the primary medium components determining the production of 4PheA and Tyr were differentiated, which were the initial resource (glucose) of the synthetic pathway and the inducer (IPTG) of the synthetic construction, respectively. Fine-tuning of the primary component significantly increased the yields of 4APhe and Tyr, indicating that a single component could be crucial for the performance of synthetic construction. Transcriptome analysis observed the local and global changes in gene expression for improved production of 4APhe and Tyr, respectively, revealing divergent metabolic strategies for producing the foreign and native metabolites. The study demonstrated that ML-assisted medium optimization could provide a novel point of view on how to make the synthetic construction meet the designed working principle and achieve the expected biological function.
Collapse
Affiliation(s)
- Honoka Aida
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Keisuke Uchida
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Motoki Nagai
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Takamasa Hashizume
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Shunsuke Masuo
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Microbiology Research Center for Sustainability, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
| | - Naoki Takaya
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Microbiology Research Center for Sustainability, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Corresponding author at: School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan.
| | - Bei-Wen Ying
- School of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, 305-8572 Ibaraki, Japan
- Corresponding author.
| |
Collapse
|
14
|
Cuperlovic-Culf M, Nguyen-Tran T, Bennett SAL. Machine Learning and Hybrid Methods for Metabolic Pathway Modeling. Methods Mol Biol 2023; 2553:417-439. [PMID: 36227553 DOI: 10.1007/978-1-0716-2617-7_18] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Computational cell metabolism models seek to provide metabolic explanations of cell behavior under different conditions or following genetic alterations, help in the optimization of in vitro cell growth environments, or predict cellular behavior in vivo and in vitro. In the extremes, mechanistic models can include highly detailed descriptions of a small number of metabolic reactions or an approximate representation of an entire metabolic network. To date, all mechanistic models have required details of individual metabolic reactions, either kinetic parameters or metabolic flux, as well as information about extracellular and intracellular metabolite concentrations. Despite the extensive efforts and the increasing availability of high-quality data, required in vivo data are not available for the majority of known metabolic reactions; thus, mechanistic models are based primarily on ex vivo kinetic measurements and limited flux information. Machine learning approaches provide an alternative for derivation of functional dependencies from existing data. The increasing availability of metabolomic and lipidomic data, with growing feature coverage as well as sample set size, is expected to provide new data options needed for derivation of machine learning models of cell metabolic processes. Moreover, machine learning analysis of longitudinal data can lead to predictive models of cell behaviors over time. Conversely, machine learning models trained on steady-state data can provide descriptive models for the comparison of metabolic states in different environments or disease conditions. Additionally, inclusion of metabolic network knowledge in these analyses can further help in the development of models with limited data.This chapter will explore the application of machine learning to the modeling of cell metabolism. We first provide a theoretical explanation of several machine learning and hybrid mechanistic machine learning methods currently being explored to model metabolism. Next, we introduce several avenues for improving these models with machine learning. Finally, we provide protocols for specific examples of the utilization of machine learning in the development of predictive cell metabolism models using metabolomic data. We describe data preprocessing, approaches for training of machine learning models for both descriptive and predictive models, and the utilization of these models in synthetic and systems biology. Detailed protocols provide a list of software tools and libraries used for these applications, step-by-step modeling protocols, troubleshooting, as well as an overview of existing limitations to these approaches.
Collapse
Affiliation(s)
- Miroslava Cuperlovic-Culf
- Digital Technologies Research Centre, National Research Council of Canada, Ottawa, ON, Canada.
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada.
| | - Thao Nguyen-Tran
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| | - Steffany A L Bennett
- Department of Biochemistry, Microbiology, and Immunology, University of Ottawa, Ottawa, ON, Canada
- Neural Regeneration Laboratory, Ottawa Institute of Systems Biology, Brain and Mind Research Institute, University of Ottawa, Ottawa, ON, Canada
- Department of Chemistry and Biomolecular Sciences, Centre for Catalysis Research and Innovation, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
15
|
Peralbo-Molina Á, Solà-Santos P, Perera-Lluna A, Chicano-Gálvez E. Data Processing and Analysis in Mass Spectrometry-Based Metabolomics. Methods Mol Biol 2023; 2571:207-239. [PMID: 36152164 DOI: 10.1007/978-1-0716-2699-3_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Metabolomics is the latest of the omics sciences. It attempts to measure and characterize metabolites-small chemical compounds <1500 Da-on cells, tissue, or biofluids, which are usually products of biological reactions. As metabolic reactions are closer to the phenotype, metabolomics has emerged as an attractive science for various areas of research, including personalized medicine. However, due to the complexity of data obtained and the absence of curated databases for metabolite identification, data processing is the major bottleneck in this area since most technicians lack the required bioinformatics expertise to process datasets in a reliable and fast manner. The aim of this chapter is to describe the available tools for data processing that makes an inexperienced researcher capable of obtaining reliable results without having to undergo through huge parametrization steps.
Collapse
Affiliation(s)
- Ángela Peralbo-Molina
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain.
| | - Pol Solà-Santos
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Alexandre Perera-Lluna
- B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, Barcelona, Spain
- Networking Biomedical Research Centre in the Subject Area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
- Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Eduardo Chicano-Gálvez
- IMIBIC Mass Spectrometry and Molecular Imaging Unit, Maimonides, Biomedical Research Institute of Cordoba (IMIBIC), Reina Sofia University Hospital, University of Cordoba (UCO), Córdoba, Spain
| |
Collapse
|
16
|
Metabolomics and modelling approaches for systems metabolic engineering. Metab Eng Commun 2022; 15:e00209. [PMID: 36281261 PMCID: PMC9587336 DOI: 10.1016/j.mec.2022.e00209] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/21/2022] Open
Abstract
Metabolic engineering involves the manipulation of microbes to produce desirable compounds through genetic engineering or synthetic biology approaches. Metabolomics involves the quantitation of intracellular and extracellular metabolites, where mass spectrometry and nuclear magnetic resonance based analytical instrumentation are often used. Here, the experimental designs, sample preparations, metabolite quenching and extraction are essential to the quantitative metabolomics workflow. The resultant metabolomics data can then be used with computational modelling approaches, such as kinetic and constraint-based modelling, to better understand underlying mechanisms and bottlenecks in the synthesis of desired compounds, thereby accelerating research through systems metabolic engineering. Constraint-based models, such as genome scale models, have been used successfully to enhance the yield of desired compounds from engineered microbes, however, unlike kinetic or dynamic models, constraint-based models do not incorporate regulatory effects. Nevertheless, the lack of time-series metabolomic data generation has hindered the usefulness of dynamic models till today. In this review, we show that improvements in automation, dynamic real-time analysis and high throughput workflows can drive the generation of more quality data for dynamic models through time-series metabolomics data generation. Spatial metabolomics also has the potential to be used as a complementary approach to conventional metabolomics, as it provides information on the localization of metabolites. However, more effort must be undertaken to identify metabolites from spatial metabolomics data derived through imaging mass spectrometry, where machine learning approaches could prove useful. On the other hand, single-cell metabolomics has also seen rapid growth, where understanding cell-cell heterogeneity can provide more insights into efficient metabolic engineering of microbes. Moving forward, with potential improvements in automation, dynamic real-time analysis, high throughput workflows, and spatial metabolomics, more data can be produced and studied using machine learning algorithms, in conjunction with dynamic models, to generate qualitative and quantitative predictions to advance metabolic engineering efforts.
Collapse
|
17
|
Antony S, Antony S, Rebello S, George S, Biju DT, R R, Madhavan A, Binod P, Pandey A, Sindhu R, Awasthi MK. Bioremediation of Endocrine Disrupting Chemicals- Advancements and Challenges. ENVIRONMENTAL RESEARCH 2022; 213:113509. [PMID: 35660566 DOI: 10.1016/j.envres.2022.113509] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Revised: 05/08/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]
Abstract
Endocrine Disrupting Chemicals (EDCs), major group of recalcitrant compounds, poses a serious threat to the health and future of millions of human beings, and other flora and fauna for years to come. A close analysis of various xenobiotics undermines the fact that EDC is structurally diverse chemical compounds generated as a part of anthropogenic advancements as well as part of their degradation. Regardless of such structural diversity, EDC is common in their ultimate drastic effect of impeding the proper functioning of the endocrinal system, basic physiologic systems, resulting in deregulated growth, malformations, and cancerous outcomes in animals as well as humans. The current review outlines an overview of various EDCs, their toxic effects on the ecosystem and its inhabitants. Conventional remediation methods such as physico-chemical methods and enzymatic approaches have been put into action as some form of mitigation measures. However, the last decade has seen the hunt for newer technologies and methodologies at an accelerated pace. Genetically engineered microbial degradation, gene editing strategies, metabolic and protein engineering, and in-silico predictive approaches - modern day's additions to our armamentarium in combating the EDCs are addressed. These additions have greater acceptance socially with lesser dissonance owing to reduced toxic by-products, lower health trepidations, better degradation, and ultimately the prevention of bioaccumulation. The positive impact of such new approaches on controlling the menace of EDCs has been outlaid. This review will shed light on sources of EDCs, their impact, significance, and the different remediation and bioremediation approaches, with a special emphasis on the recent trends and perspectives in using sustainable approaches for bioremediation of EDCs. Strict regulations to prevent the release of estrogenic chemicals to the ecosystem, adoption of combinatorial methods to remove EDC and prevalent use of bioremediation techniques should be followed in all future endeavors to combat EDC pollution. Moreover, the proper development, growth and functioning of future living forms relies on their non-exposure to EDCs, thus remediation of such chemicals present even in nano-concentrations should be addressed gravely.
Collapse
Affiliation(s)
- Sherly Antony
- Department of Microbiology, Pushpagiri Institute of Medical Sciences and Research Centre, Thiruvalla, 689 101, Kerala, India
| | - Sham Antony
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Sharrel Rebello
- School of Food Science & Technology, Mahatma Gandhi University, Kottayam, India
| | - Sandhra George
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Devika T Biju
- Pushpagiri Research Centre, Pushpagiri Institute of Medical Sciences and Research Centre, Thriuvalla, 689 101, Kerala, India
| | - Reshmy R
- Department of Science and Humanities, Providence College of Engineering, Chengannur, 689 122, Kerala, India
| | - Aravind Madhavan
- Rajiv Gandhi Centre for Biotechnology, Jagathy, Trivandrum, 695 014, India
| | - Parameswaran Binod
- Microbial Processes and Technology Division, CSIR-National Institute for Interdisciplinary Science and Technology (CSIR-NIIST), Thiruvananthapuram, 695 019, Kerala, India
| | - Ashok Pandey
- Center for Innovation and Translational Research, CSIR-Indian Institute of Toxicology Research, Lucknow, 226 001, India; Centre for Energy and Environmental Sustainability, Lucknow, 226 029, Uttar Pradesh, India
| | - Raveendran Sindhu
- Department of Food Technology, T K M Institute of Technology, Kollam, 691 505, Kerala, India.
| | - Mukesh Kumar Awasthi
- College of Natural Resources and Environment, Northwest A&F University, Yangling, Shaanxi Province, 712100, China.
| |
Collapse
|
18
|
Lao Z, Matsui Y, Ijichi S, Ying BW. Global coordination of the mutation and growth rates across the genetic and nutritional variety in Escherichia coli. Front Microbiol 2022; 13:990969. [PMID: 36204613 PMCID: PMC9530902 DOI: 10.3389/fmicb.2022.990969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/30/2022] [Indexed: 11/24/2022] Open
Abstract
Fitness and mutability are the primary traits of living organisms for adaptation and evolution. However, their quantitative linkage remained largely deficient. Whether there is any general relationship between the two features and how genetic and environmental variables influence them remained unclear and were addressed here. The mutation and growth rates of an assortment of Escherichia coli strain collections, including the wild-type strains and the genetically disturbed strains of either reduced genomes or deletion of the genes involved in the DNA replication fidelity, were evaluated in various media. The contribution of media to the mutation and growth rates was differentiated depending on the types of genetic disturbance. Nevertheless, the negative correlation between the mutation and growth rates was observed across the genotypes and was common in all media. It indicated the comprehensive association of the correlated mutation and growth rates with the genetic and medium variation. Multiple linear regression and support vector machine successfully predicted the mutation and growth rates and the categories of genotypes and media, respectively. Taken together, the study provided a quantitative dataset linking the mutation and growth rates, genotype, and medium and presented a simple and successful example of predicting bacterial growth and mutability by data-driven approaches.
Collapse
|
19
|
Chen H, Gao Y. Behavior Data Analysis of English Learners Based on Discrete Dynamic System Modeling Method. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5409571. [PMID: 36093503 PMCID: PMC9462973 DOI: 10.1155/2022/5409571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/11/2022] [Accepted: 08/17/2022] [Indexed: 11/30/2022]
Abstract
With the rapid development of computer science, there are more and more kinds of discrete dynamic systems. Computer integrated system CIMS, network communication database administrator system, and human behavior analysis system are all discrete dynamic systems. At present, many researchers have studied by adding human behavior data to discrete dynamic systems. This paper aims to study the behavior data of English learners by using the discrete dynamic modeling technology of complex systems and the discrete dynamic system modeling method of Petri nets. By adding the behavior data of learners to the discrete dynamic system of fuzzy Petri nets, the system is diagnosed and optimized. The experimental results show that the complex discrete dynamic system in this paper has achieved good experimental results according to the performance indicators selected in theory. Based on the combination of the above technologies and systems, the fuzzy Petri net discrete dynamic system studied in this paper improves the processing speed of English learners' behavior data.
Collapse
Affiliation(s)
- Hongli Chen
- School of Foreign Languages, Handan University, Handan, Hebei 056006, China
| | - Yuzheng Gao
- School of Foreign Languages, Handan University, Handan, Hebei 056006, China
| |
Collapse
|
20
|
Lo-Thong-Viramoutou O, Charton P, Cadet XF, Grondin-Perez B, Saavedra E, Damour C, Cadet F. Non-linearity of Metabolic Pathways Critically Influences the Choice of Machine Learning Model. Front Artif Intell 2022; 5:744755. [PMID: 35757298 PMCID: PMC9226554 DOI: 10.3389/frai.2022.744755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 04/29/2022] [Indexed: 11/13/2022] Open
Abstract
The use of machine learning (ML) in life sciences has gained wide interest over the past years, as it speeds up the development of high performing models. Important modeling tools in biology have proven their worth for pathway design, such as mechanistic models and metabolic networks, as they allow better understanding of mechanisms involved in the functioning of organisms. However, little has been done on the use of ML to model metabolic pathways, and the degree of non-linearity associated with them is not clear. Here, we report the construction of different metabolic pathways with several linear and non-linear ML models. Different types of data are used; they lead to the prediction of important biological data, such as pathway flux and final product concentration. A comparison reveals that the data features impact model performance and highlight the effectiveness of non-linear models (e.g., QRF: RMSE = 0.021 nmol·min-1 and R2 = 1 vs. Bayesian GLM: RMSE = 1.379 nmol·min-1 R2 = 0.823). It turns out that the greater the degree of non-linearity of the pathway, the better suited a non-linear model will be. Therefore, a decision-making support for pathway modeling is established. These findings generally support the hypothesis that non-linear aspects predominate within the metabolic pathways. This must be taken into account when devising possible applications of these pathways for the identification of biomarkers of diseases (e.g., infections, cancer, neurodegenerative diseases) or the optimization of industrial production processes.
Collapse
Affiliation(s)
- Ophélie Lo-Thong-Viramoutou
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Philippe Charton
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | | | - Brigitte Grondin-Perez
- EnergyLab, EA 4079, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Emma Saavedra
- Departamento de Bioquímica, Instituto Nacional de Cardiología Ignacio Chávez, Mexico City, Mexico
| | - Cédric Damour
- EnergyLab, EA 4079, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| | - Frédéric Cadet
- University of Paris, BIGR—Biologie Intégrée du Globule Rouge, Inserm, UMR_S1134, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Laboratory DSIMB, UMR_S1134, BIGR, Inserm, Faculty of Sciences and Technology, University of La Reunion, Saint-Denis, France
| |
Collapse
|
21
|
Vanhove K, Derveaux E, Mesotten L, Thomeer M, Criel M, Mariën H, Adriaensens P. Unraveling the Rewired Metabolism in Lung Cancer Using Quantitative NMR Metabolomics. Int J Mol Sci 2022; 23:ijms23105602. [PMID: 35628415 PMCID: PMC9146819 DOI: 10.3390/ijms23105602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/14/2022] [Accepted: 05/15/2022] [Indexed: 11/16/2022] Open
Abstract
Lung cancer cells are well documented to rewire their metabolism and energy production networks to enable proliferation and survival in a nutrient-poor and hypoxic environment. Although metabolite profiling of blood plasma and tissue is still emerging in omics approaches, several techniques have shown potential in cancer diagnosis. In this paper, the authors describe the alterations in the metabolic phenotype of lung cancer patients. In addition, we focus on the metabolic cooperation between tumor cells and healthy tissue. Furthermore, the authors discuss how metabolomics could improve the management of lung cancer patients.
Collapse
Affiliation(s)
- Karolien Vanhove
- Applied and Analytical Chemistry, Institute for Materials Research, Hasselt University, Agoralaan 1-Building D, B-3590 Diepenbeek, Belgium;
- Department of Respiratory Medicine, AZ Vesalius, Hazelereik 51, B-3700 Tongeren, Belgium
- Correspondence:
| | - Elien Derveaux
- Faculty of Medicine and Life Sciences, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; (E.D.); (H.M.)
| | - Liesbet Mesotten
- Department of Nuclear Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium;
| | - Michiel Thomeer
- Department of Respiratory Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium; (M.T.); (M.C.)
| | - Maarten Criel
- Department of Respiratory Medicine, Ziekenhuis Oost-Limburg, Schiepse Bos 6, B-3600 Genk, Belgium; (M.T.); (M.C.)
| | - Hanne Mariën
- Faculty of Medicine and Life Sciences, Hasselt University, Martelarenlaan 42, B-3500 Hasselt, Belgium; (E.D.); (H.M.)
| | - Peter Adriaensens
- Applied and Analytical Chemistry, Institute for Materials Research, Hasselt University, Agoralaan 1-Building D, B-3590 Diepenbeek, Belgium;
| |
Collapse
|
22
|
Parvandeh S, Donehower LA, Katsonis P, Hsu TK, Asmussen J, Lee K, Lichtarge O. EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants. Nucleic Acids Res 2022; 50:e70. [PMID: 35412634 PMCID: PMC9262594 DOI: 10.1093/nar/gkac215] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 03/17/2022] [Accepted: 03/21/2022] [Indexed: 02/01/2023] Open
Abstract
Discovering rare cancer driver genes is difficult because their mutational frequency is too low for statistical detection by computational methods. EPIMUTESTR is an integrative nearest-neighbor machine learning algorithm that identifies such marginal genes by modeling the fitness of their mutations with the phylogenetic Evolutionary Action (EA) score. Over cohorts of sequenced patients from The Cancer Genome Atlas representing 33 tumor types, EPIMUTESTR detected 214 previously inferred cancer driver genes and 137 new candidates never identified computationally before of which seven genes are supported in the COSMIC Cancer Gene Census. EPIMUTESTR achieved better robustness and specificity than existing methods in a number of benchmark methods and datasets.
Collapse
Affiliation(s)
- Saeid Parvandeh
- To whom correspondence should be addressed. Tel: +1 713 798 7677;
| | - Lawrence A Donehower
- Department of Molecular Virology and Microbiology, Houston, TX 77030, USA,Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Panagiotis Katsonis
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Teng-Kuei Hsu
- Department of Biochemistry & Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | - Jennifer K Asmussen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Kwanghyuk Lee
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Olivier Lichtarge
- Correspondence may also be addressed to Olivier Lichtarge. Tel: +1 713 798 5646;
| |
Collapse
|
23
|
Sampaio M, Rocha M, Dias O. Exploring synergies between plant metabolic modelling and machine learning. Comput Struct Biotechnol J 2022; 20:1885-1900. [PMID: 35521559 PMCID: PMC9052043 DOI: 10.1016/j.csbj.2022.04.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 11/03/2022] Open
|
24
|
Lavrentev FV, Rumyantsev IS, Ivanov AS, Shilovskikh VV, Orlova OY, Nikolaev KG, Andreeva DV, Skorb EV. Soft Hydrogel Actuator for Fast Machine-Learning-Assisted Bacteria Detection. ACS APPLIED MATERIALS & INTERFACES 2022; 14:7321-7328. [PMID: 35080838 DOI: 10.1021/acsami.1c22470] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
We demonstrate that our bio-electrochemical platform facilitates the reduction of detection time from the 3-day period of the existing tests to 15 min. Machine learning and robotized bioanalytical platforms require the principles such as hydrogel-based actuators for fast and easy analysis of bioactive analytes. Bacteria are fragile and environmentally sensitive microorganisms that require a special environment to support their lifecycles during analytical tests. Here, we develop a bio-electrochemical platform based on the soft hydrogel/eutectic gallium-indium alloy interface for the detection of Streptococcus thermophilus and Bacillus coagulans bacteria in various mediums. The soft hydrogel-based device is capable to support bacteria' viability during detection time. Current-voltage data are used for multilayer perceptron algorithm training. The multilayer perceptron model is capable of detecting bacterial concentrations in the 104 to 108 cfu/mL range of the culture medium or in the dairy products with high accuracy (94%). Such a fast and easy biodetection is extremely important for food and agriculture industries and biomedical and environmental science.
Collapse
Affiliation(s)
- Filipp V Lavrentev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Igor S Rumyantsev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Artemii S Ivanov
- Institute for Functional Intelligent Materials, National University of Singapore, Singapore 117544, Singapore
- Department of Materials Science and Engineering, National University of Singapore, Singapore 117575, Singapore
| | - Vladimir V Shilovskikh
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Olga Yu Orlova
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Konstantin G Nikolaev
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| | - Daria V Andreeva
- Institute for Functional Intelligent Materials, National University of Singapore, Singapore 117544, Singapore
- Department of Materials Science and Engineering, National University of Singapore, Singapore 117575, Singapore
| | - Ekaterina V Skorb
- Infochemistry Scientific Center of ITMO University, Lomonosova Street 9, St. Petersburg 191002, Russia
| |
Collapse
|
25
|
Verissimo T, Faivre A, Sgardello S, Naesens M, de Seigneux S, Criton G, Legouis D. Estimated Renal Metabolomics at Reperfusion Predicts One-Year Kidney Graft Function. Metabolites 2022; 12:57. [PMID: 35050179 PMCID: PMC8778290 DOI: 10.3390/metabo12010057] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 12/26/2021] [Accepted: 01/04/2022] [Indexed: 02/04/2023] Open
Abstract
Renal transplantation is the gold-standard procedure for end-stage renal disease patients, improving quality of life and life expectancy. Despite continuous advancement in the management of post-transplant complications, progress is still needed to increase the graft lifespan. Early identification of patients at risk of rapid graft failure is critical to optimize their management and slow the progression of the disease. In 42 kidney grafts undergoing protocol biopsies at reperfusion, we estimated the renal metabolome from RNAseq data. The estimated metabolites' abundance was further used to predict the renal function within the first year of transplantation through a random forest machine learning algorithm. Using repeated K-fold cross-validation we first built and then tuned our model on a training dataset. The optimal model accurately predicted the one-year eGFR, with an out-of-bag root mean square root error (RMSE) that was 11.8 ± 7.2 mL/min/1.73 m2. The performance was similar in the test dataset, with a RMSE of 12.2 ± 3.2 mL/min/1.73 m2. This model outperformed classic statistical models. Reperfusion renal metabolome may be used to predict renal function one year after allograft kidney recipients.
Collapse
Affiliation(s)
- Thomas Verissimo
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
| | - Anna Faivre
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
| | - Sebastian Sgardello
- Department of Surgery, University Hospital of Geneva, 1205 Geneva, Switzerland;
| | - Maarten Naesens
- Service of Nephrology, University Hospitals of Leuven, 3000 Leuven, Belgium;
| | - Sophie de Seigneux
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
- Service of Nephrology, Department of Internal Medicine Specialties, University Hospital of Geneva, 1205 Geneva, Switzerland
| | - Gilles Criton
- Geneva School of Economics and Management, University of Geneva, 1205 Geneva, Switzerland;
| | - David Legouis
- Laboratory of Nephrology, Department of Medicine, University Hospitals of Geneva, 1205 Geneva, Switzerland; (T.V.); (A.F.); (S.d.S.)
- Division of Intensive Care, Department of Acute Medicine, University hospital of Geneva, 1205 Geneva, Switzerland
| |
Collapse
|
26
|
Vijayakumar S, Magazzù G, Moon P, Occhipinti A, Angione C. A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling. Methods Mol Biol 2022; 2399:87-122. [PMID: 35604554 DOI: 10.1007/978-1-0716-1831-8_5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Complex, distributed, and dynamic sets of clinical biomedical data are collectively referred to as multimodal clinical data. In order to accommodate the volume and heterogeneity of such diverse data types and aid in their interpretation when they are combined with a multi-scale predictive model, machine learning is a useful tool that can be wielded to deconstruct biological complexity and extract relevant outputs. Additionally, genome-scale metabolic models (GSMMs) are one of the main frameworks striving to bridge the gap between genotype and phenotype by incorporating prior biological knowledge into mechanistic models. Consequently, the utilization of GSMMs as a foundation for the integration of multi-omic data originating from different domains is a valuable pursuit towards refining predictions. In this chapter, we show how cancer multi-omic data can be analyzed via multimodal machine learning and metabolic modeling. Firstly, we focus on the merits of adopting an integrative systems biology led approach to biomedical data mining. Following this, we propose how constraint-based metabolic models can provide a stable yet adaptable foundation for the integration of multimodal data with machine learning. Finally, we provide a step-by-step tutorial for the combination of machine learning and GSMMs, which includes: (i) tissue-specific constraint-based modeling; (ii) survival analysis using time-to-event prediction for cancer; and (iii) classification and regression approaches for multimodal machine learning. The code associated with the tutorial can be found at https://github.com/Angione-Lab/Tutorials_Combining_ML_and_GSMM .
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Giuseppe Magazzù
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Pradip Moon
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK
| | - Annalisa Occhipinti
- Computational Systems Biology and Data Analytics Research Group, Middlebrough, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK
| | - Claudio Angione
- Computational Systems Biology and Data Analytics Research Group, Teesside University, Middlebrough, UK.
- Centre for Digital Innovation, Teesside University, Middlesbrough, UK.
- Healthcare Innovation Centre, Teesside University, Middlesbrough, UK.
| |
Collapse
|
27
|
Vijayakumar S, Angione C. Protocol for hybrid flux balance, statistical, and machine learning analysis of multi-omic data from the cyanobacterium Synechococcus sp. PCC 7002. STAR Protoc 2021; 2:100837. [PMID: 34632416 PMCID: PMC8488602 DOI: 10.1016/j.xpro.2021.100837] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Combining a computational framework for flux balance analysis with machine learning improves the accuracy of predicting metabolic activity across conditions, while enabling mechanistic interpretation. This protocol presents a guide to condition-specific metabolic modeling that integrates regularized flux balance analysis with machine learning approaches to extract key features from transcriptomic and fluxomic data. We demonstrate the protocol as applied to Synechococcus sp. PCC 7002; we also outline how it can be adapted to any species or community with available multi-omic data. For complete details on the use and execution of this protocol, please refer to Vijayakumar et al. (2020).
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
| | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough TS1 3BX, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough TS1 3BX, UK
| |
Collapse
|
28
|
Urine-Based Metabolomics and Machine Learning Reveals Metabolites Associated with Renal Cell Carcinoma Stage. Cancers (Basel) 2021; 13:cancers13246253. [PMID: 34944874 PMCID: PMC8699523 DOI: 10.3390/cancers13246253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 12/08/2021] [Accepted: 12/09/2021] [Indexed: 11/17/2022] Open
Abstract
Urine metabolomics profiling has potential for non-invasive RCC staging, in addition to providing metabolic insights into disease progression. In this study, we utilized liquid chromatography-mass spectrometry (LC-MS), nuclear magnetic resonance (NMR), and machine learning (ML) for the discovery of urine metabolites associated with RCC progression. Two machine learning questions were posed in the study: Binary classification into early RCC (stage I and II) and advanced RCC stages (stage III and IV), and RCC tumor size estimation through regression analysis. A total of 82 RCC patients with known tumor size and metabolomic measurements were used for the regression task, and 70 RCC patients with complete tumor-nodes-metastasis (TNM) staging information were used for the classification tasks under ten-fold cross-validation conditions. A voting ensemble regression model consisting of elastic net, ridge, and support vector regressor predicted RCC tumor size with a R2 value of 0.58. A voting classifier model consisting of random forest, support vector machines, logistic regression, and adaptive boosting yielded an AUC of 0.96 and an accuracy of 87%. Some identified metabolites associated with renal cell carcinoma progression included 4-guanidinobutanoic acid, 7-aminomethyl-7-carbaguanine, 3-hydroxyanthranilic acid, lysyl-glycine, glycine, citrate, and pyruvate. Overall, we identified a urine metabolic phenotype associated with renal cell carcinoma stage, exploring the promise of a urine-based metabolomic assay for staging this disease.
Collapse
|
29
|
Li R, Li L, Xu Y, Yang J. Machine learning meets omics: applications and perspectives. Brief Bioinform 2021; 23:6425809. [PMID: 34791021 DOI: 10.1093/bib/bbab460] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 02/07/2023] Open
Abstract
The innovation of biotechnologies has allowed the accumulation of omics data at an alarming rate, thus introducing the era of 'big data'. Extracting inherent valuable knowledge from various omics data remains a daunting problem in bioinformatics. Better solutions often need some kind of more innovative methods for efficient handlings and effective results. Recent advancements in integrated analysis and computational modeling of multi-omics data helped address such needs in an increasingly harmonious manner. The development and application of machine learning have largely advanced our insights into biology and biomedicine and greatly promoted the development of therapeutic strategies, especially for precision medicine. Here, we propose a comprehensive survey and discussion on what happened, is happening and will happen when machine learning meets omics. Specifically, we describe how artificial intelligence can be applied to omics studies and review recent advancements at the interface between machine learning and the ever-widest range of omics including genomics, transcriptomics, proteomics, metabolomics, radiomics, as well as those at the single-cell resolution. We also discuss and provide a synthesis of ideas, new insights, current challenges and perspectives of machine learning in omics.
Collapse
Affiliation(s)
- Rufeng Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Lixin Li
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China
| | - Yungang Xu
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an, 710129, China
| | - Juan Yang
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Xi'an Jiaotong University Health Science Center, Xi'an 710061, P. R. China.,Key Laboratory of Environment and Genes Related to Diseases (Xi'an Jiaotong University), Ministry of Education of China, Xi'an 710061, P. R. China
| |
Collapse
|
30
|
Mey F, Clauwaert J, Van Huffel K, Waegeman W, De Mey M. Improving the performance of machine learning models for biotechnology: The quest for deus ex machina. Biotechnol Adv 2021; 53:107858. [PMID: 34695560 DOI: 10.1016/j.biotechadv.2021.107858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/24/2022]
Abstract
Machine learning is becoming an integral part of the Design-Build-Test-Learn cycle in biotechnology. Machine learning models learn from collected datasets such as omics data and predict a defined outcome, which has led to both production improvements and predictive tools in the field. Robust prediction of the behavior of microbial cell factories and production processes not only greatly increases our understanding of the function of such systems, but also provides significant savings of development time. However, many pitfalls when modeling biological data - bad fit, noisy data, model instability, low data quantity and imbalances in the data - cause models to suffer in their performance. Here we provide an accessible, in-depth analysis on the problems created by these pitfalls, as well as means of their detection and mediation, with a focus on supervised learning. Assessing the state of the art, we show that, currently, in-depth analyses of model performance are often absent and must be improved. This review provides a toolbox for the analysis of model robustness and performance, and simultaneously proposes a standard for the community to facilitate future work. It is further accompanied by an interactive online tutorial on the discussed issues.
Collapse
Affiliation(s)
- Friederike Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Jim Clauwaert
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Kirsten Van Huffel
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Marjan De Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium.
| |
Collapse
|
31
|
Seif Y, Palsson BØ. Path to improving the life cycle and quality of genome-scale models of metabolism. Cell Syst 2021; 12:842-859. [PMID: 34555324 PMCID: PMC8480436 DOI: 10.1016/j.cels.2021.06.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 02/17/2021] [Accepted: 06/23/2021] [Indexed: 11/28/2022]
Abstract
Genome-scale models of metabolism (GEMs) are key computational tools for the systems-level study of metabolic networks. Here, we describe the "GEM life cycle," which we subdivide into four stages: inception, maturation, specialization, and amalgamation. We show how different types of GEM reconstruction workflows fit in each stage and proceed to highlight two fundamental bottlenecks for GEM quality improvement: GEM maturation and content removal. We identify common characteristics contributing to increasing quality of maturing GEMs drawing from past independent GEM maturation efforts. We then shed some much-needed light on the latent and unrecognized but pervasive issue of content removal, demonstrating the substantial effects of model pruning on its solution space. Finally, we propose a novel framework for content removal and associated confidence-level assignment which will help guide future GEM development efforts, reduce duplication of effort across groups, potentially aid automated reconstruction platforms, and boost the reproducibility of model development.
Collapse
Affiliation(s)
- Yara Seif
- Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92093, USA
| | - Bernhard Ørn Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92093, USA.
| |
Collapse
|
32
|
McGranaghan P, Kirwan JA, Garcia-Rivera MA, Pieske B, Edelmann F, Blaschke F, Appunni S, Saxena A, Rubens M, Veledar E, Trippel TD. Lipid Metabolite Biomarkers in Cardiovascular Disease: Discovery and Biomechanism Translation from Human Studies. Metabolites 2021; 11:metabo11090621. [PMID: 34564437 PMCID: PMC8470800 DOI: 10.3390/metabo11090621] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 08/30/2021] [Accepted: 09/06/2021] [Indexed: 12/12/2022] Open
Abstract
Lipids represent a valuable target for metabolomic studies since altered lipid metabolism is known to drive the pathological changes in cardiovascular disease (CVD). Metabolomic technologies give us the ability to measure thousands of metabolites providing us with a metabolic fingerprint of individual patients. Metabolomic studies in humans have supported previous findings into the pathomechanisms of CVD, namely atherosclerosis, apoptosis, inflammation, oxidative stress, and insulin resistance. The most widely studied classes of lipid metabolite biomarkers in CVD are phospholipids, sphingolipids/ceramides, glycolipids, cholesterol esters, fatty acids, and acylcarnitines. Technological advancements have enabled novel strategies to discover individual biomarkers or panels that may aid in the diagnosis and prognosis of CVD, with sphingolipids/ceramides as the most promising class of biomarkers thus far. In this review, application of metabolomic profiling for biomarker discovery to aid in the diagnosis and prognosis of CVD as well as metabolic abnormalities in CVD will be discussed with particular emphasis on lipid metabolites.
Collapse
Affiliation(s)
- Peter McGranaghan
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Jennifer A. Kirwan
- Metabolomics Platform, Berlin Institute of Health at Charité Universitätsmedizin Berlin, 13353 Berlin, Germany; (J.A.K.); (M.A.G.-R.)
- Max Delbrück Center for Molecular Research, 13125 Berlin, Germany
- School of Veterinary Medicine and Science, University of Nottingham, Leicestershire LE12 5RD, UK
| | - Mariel A. Garcia-Rivera
- Metabolomics Platform, Berlin Institute of Health at Charité Universitätsmedizin Berlin, 13353 Berlin, Germany; (J.A.K.); (M.A.G.-R.)
- Max Delbrück Center for Molecular Research, 13125 Berlin, Germany
| | - Burkert Pieske
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
- Berlin Institute of Health, 13353 Berlin, Germany
- German Heart Center Berlin, Department of Cardiology, 13353 Berlin, Germany
| | - Frank Edelmann
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
- German Heart Center Berlin, Department of Cardiology, 13353 Berlin, Germany
| | - Florian Blaschke
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
| | - Sandeep Appunni
- Department of Biochemistry, Government Medical College, Kozhikode, Kerala 673008, India;
| | - Anshul Saxena
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Muni Rubens
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
| | - Emir Veledar
- Baptist Health South Florida, Miami, FL 33143, USA; (A.S.); (M.R.); (E.V.)
- Department of Biostatistics, Florida International University, Miami, FL 33199, USA
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Tobias Daniel Trippel
- Department of Internal Medicine and Cardiology, Charité Campus Virchow-Klinikum, 13353 Berlin, Germany; (P.M.); (B.P.); (F.E.); (F.B.)
- DZHK (German Centre for Cardiovascular Research), 13353 Berlin, Germany
- Correspondence: ; Tel.: +49-30-450-553765
| |
Collapse
|
33
|
Chen Z, Huang X, Gao Y, Zeng S, Mao W. Plasma-metabolite-based machine learning is a promising diagnostic approach for esophageal squamous cell carcinoma investigation. J Pharm Anal 2021; 11:505-514. [PMID: 34513127 PMCID: PMC8424362 DOI: 10.1016/j.jpha.2020.11.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 12/12/2022] Open
Abstract
The aim of this study was to develop a diagnostic strategy for esophageal squamous cell carcinoma (ESCC) that combines plasma metabolomics with machine learning algorithms. Plasma-based untargeted metabolomics analysis was performed with samples derived from 88 ESCC patients and 52 healthy controls. The dataset was split into a training set and a test set. After identification of differential metabolites in training set, single-metabolite-based receiver operating characteristic (ROC) curves and multiple-metabolite-based machine learning models were used to distinguish between ESCC patients and healthy controls. Kaplan-Meier survival analysis and Cox proportional hazards regression analysis were performed to investigate the prognostic significance of the plasma metabolites. Finally, twelve differential plasma metabolites (six up-regulated and six down-regulated) were annotated. The predictive performance of the six most prevalent diagnostic metabolites through the diagnostic models in the test set were as follows: arachidonic acid (accuracy: 0.887), sebacic acid (accuracy: 0.867), indoxyl sulfate (accuracy: 0.850), phosphatidylcholine (PC) (14:0/0:0) (accuracy: 0.825), deoxycholic acid (accuracy: 0.773), and trimethylamine N-oxide (accuracy: 0.653). The prediction accuracies of the machine learning models in the test set were partial least-square (accuracy: 0.947), random forest (accuracy: 0.947), gradient boosting machine (accuracy: 0.960), and support vector machine (accuracy: 0.980). Additionally, survival analysis demonstrated that acetoacetic acid was an unfavorable prognostic factor (hazard ratio (HR): 1.752), while PC (14:0/0:0) (HR: 0.577) was a favorable prognostic factor for ESCC. This study devised an innovative strategy for ESCC diagnosis by combining plasma metabolomics with machine learning algorithms and revealed its potential to become a novel screening test for ESCC. Six most prevalent diagnostic plasma metabolites were identified in ESCC. Plasma-metabolite-based machine learning models (PLS, RF, GBM, and SVM) for ESCC diagnosis. Acetoacetic acid was an unfavorable prognostic factor, while PC (14:0/0:0) was a favorable prognostic factor for ESCC.
Collapse
Affiliation(s)
- Zhongjian Chen
- Laboratory of Pharmaceutical Analysis and Drug Metabolism, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.,The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Xiancong Huang
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Yun Gao
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| | - Su Zeng
- Laboratory of Pharmaceutical Analysis and Drug Metabolism, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Weimin Mao
- The Cancer Research Institute, The Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital), Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Hangzhou, 310022, China
| |
Collapse
|
34
|
Khaleghi MK, Savizi ISP, Lewis NE, Shojaosadati SA. Synergisms of machine learning and constraint-based modeling of metabolism for analysis and optimization of fermentation parameters. Biotechnol J 2021; 16:e2100212. [PMID: 34390201 DOI: 10.1002/biot.202100212] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/10/2021] [Accepted: 08/11/2021] [Indexed: 11/06/2022]
Abstract
Recent noteworthy advances in the development of high-performing microbial and mammalian strains have enabled the sustainable production of bio-economically valuable substances such as bio-compounds, biofuels, and biopharmaceuticals. However, to obtain an industrially viable mass-production scheme, much time and effort are required. The robust and rational design of fermentation processes requires analysis and optimization of different extracellular conditions and medium components, which have a massive effect on growth and productivity. In this regard, knowledge- and data-driven modeling methods have received much attention. Constraint-based modeling (CBM) is a knowledge-driven mathematical approach that has been widely used in fermentation analysis and optimization due to its capabilities of predicting the cellular phenotype from genotype through high-throughput means. On the other hand, machine learning (ML) is a data-driven statistical method that identifies the data patterns within sophisticated biological systems and processes, where there is inadequate knowledge to represent underlying mechanisms. Furthermore, ML models are becoming a viable complement to constraint-based models in a reciprocal manner when one is used as a pre-step of another. As a result, more predictable model is produced. This review highlights the applications of CBM and ML independently and the combination of these two approaches for analyzing and optimizing fermentation parameters. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Mohammad Karim Khaleghi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Iman Shahidi Pour Savizi
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| | - Nathan E Lewis
- Department of Bioengineering, University of California, San Diego, USA.,Department of Pediatrics, University of California, San Diego, USA
| | - Seyed Abbas Shojaosadati
- Biotechnology Department, Faculty of Chemical Engineering, Tarbiat Modares University, Tehran, Iran
| |
Collapse
|
35
|
Tinte MM, Chele KH, van der Hooft JJJ, Tugizimana F. Metabolomics-Guided Elucidation of Plant Abiotic Stress Responses in the 4IR Era: An Overview. Metabolites 2021; 11:445. [PMID: 34357339 PMCID: PMC8305945 DOI: 10.3390/metabo11070445] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 06/30/2021] [Accepted: 07/03/2021] [Indexed: 12/27/2022] Open
Abstract
Plants are constantly challenged by changing environmental conditions that include abiotic stresses. These are limiting their development and productivity and are subsequently threatening our food security, especially when considering the pressure of the increasing global population. Thus, there is an urgent need for the next generation of crops with high productivity and resilience to climate change. The dawn of a new era characterized by the emergence of fourth industrial revolution (4IR) technologies has redefined the ideological boundaries of research and applications in plant sciences. Recent technological advances and machine learning (ML)-based computational tools and omics data analysis approaches are allowing scientists to derive comprehensive metabolic descriptions and models for the target plant species under specific conditions. Such accurate metabolic descriptions are imperatively essential for devising a roadmap for the next generation of crops that are resilient to environmental deterioration. By synthesizing the recent literature and collating data on metabolomics studies on plant responses to abiotic stresses, in the context of the 4IR era, we point out the opportunities and challenges offered by omics science, analytical intelligence, computational tools and big data analytics. Specifically, we highlight technological advancements in (plant) metabolomics workflows and the use of machine learning and computational tools to decipher the dynamics in the chemical space that define plant responses to abiotic stress conditions.
Collapse
Affiliation(s)
- Morena M. Tinte
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
| | - Kekeletso H. Chele
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
| | | | - Fidele Tugizimana
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa; (M.M.T.); (K.H.C.)
- International Research and Development Division, Omnia Group, Ltd., Johannesburg 2021, South Africa
| |
Collapse
|
36
|
Shah HA, Liu J, Yang Z, Feng J. Review of Machine Learning Methods for the Prediction and Reconstruction of Metabolic Pathways. Front Mol Biosci 2021; 8:634141. [PMID: 34222327 PMCID: PMC8247443 DOI: 10.3389/fmolb.2021.634141] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Accepted: 06/01/2021] [Indexed: 11/13/2022] Open
Abstract
Prediction and reconstruction of metabolic pathways play significant roles in many fields such as genetic engineering, metabolic engineering, drug discovery, and are becoming the most active research topics in synthetic biology. With the increase of related data and with the development of machine learning techniques, there have many machine leaning based methods been proposed for prediction or reconstruction of metabolic pathways. Machine learning techniques are showing state-of-the-art performance to handle the rapidly increasing volume of data in synthetic biology. To support researchers in this field, we briefly review the research progress of metabolic pathway reconstruction and prediction based on machine learning. Some challenging issues in the reconstruction of metabolic pathways are also discussed in this paper.
Collapse
Affiliation(s)
- Hayat Ali Shah
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Juan Liu
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Zhihui Yang
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| | - Jing Feng
- Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan, China
| |
Collapse
|
37
|
Licen S, Franzon M, Rodani T, Barbieri P. SOMEnv: An R package for mining environmental monitoring datasets by Self-Organizing Map and k-means algorithms with a graphical user interface. Microchem J 2021. [DOI: 10.1016/j.microc.2021.106181] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
38
|
Sha C, Cuperlovic-Culf M, Hu T. SMILE: systems metabolomics using interpretable learning and evolution. BMC Bioinformatics 2021; 22:284. [PMID: 34049495 PMCID: PMC8161935 DOI: 10.1186/s12859-021-04209-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 05/18/2021] [Indexed: 11/23/2022] Open
Abstract
Background Direct link between metabolism and cell and organism phenotype in health and disease makes metabolomics, a high throughput study of small molecular metabolites, an essential methodology for understanding and diagnosing disease development and progression. Machine learning methods have seen increasing adoptions in metabolomics thanks to their powerful prediction abilities. However, the “black-box” nature of many machine learning models remains a major challenge for wide acceptance and utility as it makes the interpretation of decision process difficult. This challenge is particularly predominant in biomedical research where understanding of the underlying decision making mechanism is essential for insuring safety and gaining new knowledge. Results In this article, we proposed a novel computational framework, Systems Metabolomics using Interpretable Learning and Evolution (SMILE), for supervised metabolomics data analysis. Our methodology uses an evolutionary algorithm to learn interpretable predictive models and to identify the most influential metabolites and their interactions in association with disease. Moreover, we have developed a web application with a graphical user interface that can be used for easy analysis, interpretation and visualization of the results. Performance of the method and utilization of the web interface is shown using metabolomics data for Alzheimer’s disease. Conclusions SMILE was able to identify several influential metabolites on AD and to provide interpretable predictive models that can be further used for a better understanding of the metabolic background of AD. SMILE addresses the emerging issue of interpretability and explainability in machine learning, and contributes to more transparent and powerful applications of machine learning in bioinformatics. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04209-1.
Collapse
Affiliation(s)
- Chengyuan Sha
- School of Computing, Queen's University, Kingston, ON, Canada
| | | | - Ting Hu
- School of Computing, Queen's University, Kingston, ON, Canada.
| |
Collapse
|
39
|
Cakmak A, Celik MH. Personalized Metabolic Analysis of Diseases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1014-1025. [PMID: 32750887 DOI: 10.1109/tcbb.2020.3008196] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The metabolic wiring of patient cells is altered drastically in many diseases, including cancer. Understanding the nature of such changes may pave the way for new therapeutic opportunities as well as the development of personalized treatment strategies for patients. In this paper, we propose an algorithm called Metabolitics, which allows systems-level analysis of changes in the biochemical network of cells in disease states. It enables the study of a disease at both reaction- and pathway-level granularities for a detailed and summarized view of disease etiology. Metabolitics employs flux variability analysis with a dynamically built objective function based on biofluid metabolomics measurements in a personalized manner. Moreover, Metabolitics builds supervised classification models to discriminate between patients and healthy subjects based on the computed metabolic network changes. The use of Metabolitics is demonstrated for three distinct diseases, namely, breast cancer, Crohn's disease, and colorectal cancer. Our results show that the constructed supervised learning models successfully differentiate patients from healthy individuals by an average f1-score of 88 percent. Besides, in addition to the confirmation of previously reported breast cancer-associated pathways, we discovered that Biotin Metabolism along with Arginine and Proline Metabolism is subject to a significant increase in flux capacity, which have not been reported before.
Collapse
|
40
|
Peng GCY, Alber M, Tepole AB, Cannon WR, De S, Dura-Bernal S, Garikipati K, Karniadakis G, Lytton WW, Perdikaris P, Petzold L, Kuhl E. Multiscale modeling meets machine learning: What can we learn? ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2021; 28:1017-1037. [PMID: 34093005 PMCID: PMC8172124 DOI: 10.1007/s11831-020-09405-5] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Accepted: 02/09/2020] [Indexed: 05/10/2023]
Abstract
Machine learning is increasingly recognized as a promising technology in the biological, biomedical, and behavioral sciences. There can be no argument that this technique is incredibly successful in image recognition with immediate applications in diagnostics including electrophysiology, radiology, or pathology, where we have access to massive amounts of annotated data. However, machine learning often performs poorly in prognosis, especially when dealing with sparse data. This is a field where classical physics-based simulation seems to remain irreplaceable. In this review, we identify areas in the biomedical sciences where machine learning and multiscale modeling can mutually benefit from one another: Machine learning can integrate physics-based knowledge in the form of governing equations, boundary conditions, or constraints to manage ill-posted problems and robustly handle sparse and noisy data; multiscale modeling can integrate machine learning to create surrogate models, identify system dynamics and parameters, analyze sensitivities, and quantify uncertainty to bridge the scales and understand the emergence of function. With a view towards applications in the life sciences, we discuss the state of the art of combining machine learning and multiscale modeling, identify applications and opportunities, raise open questions, and address potential challenges and limitations. We anticipate that it will stimulate discussion within the community of computational mechanics and reach out to other disciplines including mathematics, statistics, computer science, artificial intelligence, biomedicine, systems biology, and precision medicine to join forces towards creating robust and efficient models for biological systems.
Collapse
Affiliation(s)
| | - Mark Alber
- University of California, Riverside, USA
| | | | - William R Cannon
- Pacific Northwest National Laboratory, Richland, Washington, USA
| | - Suvranu De
- Rensselaer Polytechnic Institute, Troy, New York, USA
| | | | | | | | | | | | - Linda Petzold
- University of California, Santa Barbara, California, USA
| | - Ellen Kuhl
- Stanford University, Stanford, California, USA
| |
Collapse
|
41
|
Muzio G, O’Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 PMCID: PMC7986589 DOI: 10.1093/bib/bbaa257] [Citation(s) in RCA: 46] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O’Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
42
|
Muzio G, O'Bray L, Borgwardt K. Biological network analysis with deep learning. Brief Bioinform 2021; 22:1515-1530. [PMID: 33169146 DOI: 10.1145/3447548.3467442] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 08/26/2020] [Accepted: 09/11/2020] [Indexed: 05/28/2023] Open
Abstract
Recent advancements in experimental high-throughput technologies have expanded the availability and quantity of molecular data in biology. Given the importance of interactions in biological processes, such as the interactions between proteins or the bonds within a chemical compound, this data is often represented in the form of a biological network. The rise of this data has created a need for new computational tools to analyze networks. One major trend in the field is to use deep learning for this goal and, more specifically, to use methods that work with networks, the so-called graph neural networks (GNNs). In this article, we describe biological networks and review the principles and underlying algorithms of GNNs. We then discuss domains in bioinformatics in which graph neural networks are frequently being applied at the moment, such as protein function prediction, protein-protein interaction prediction and in silico drug discovery and development. Finally, we highlight application areas such as gene regulatory networks and disease diagnosis where deep learning is emerging as a new tool to answer classic questions like gene interaction prediction and automatic disease prediction from data.
Collapse
Affiliation(s)
- Giulia Muzio
- Machine Learning and Computational Biology Lab at ETH Zürich
| | - Leslie O'Bray
- Machine Learning and Computational Biology Lab at ETH Zürich
| | | |
Collapse
|
43
|
Ngueilbaye A, Wang H, Khan M, Mahamat DA. Adoption of human metabolic processes as Data Quality Based Models. THE JOURNAL OF SUPERCOMPUTING 2021; 77:1779-1817. [DOI: 10.1007/s11227-020-03300-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]
|
44
|
Messa GM, Napolitano F, Elsea SH, di Bernardo D, Gao X. A Siamese neural network model for the prioritization of metabolic disorders by integrating real and simulated data. Bioinformatics 2020; 36:i787-i794. [PMID: 33381827 DOI: 10.1093/bioinformatics/btaa841] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Untargeted metabolomic approaches hold a great promise as a diagnostic tool for inborn errors of metabolisms (IEMs) in the near future. However, the complexity of the involved data makes its application difficult and time consuming. Computational approaches, such as metabolic network simulations and machine learning, could significantly help to exploit metabolomic data to aid the diagnostic process. While the former suffers from limited predictive accuracy, the latter is normally able to generalize only to IEMs for which sufficient data are available. Here, we propose a hybrid approach that exploits the best of both worlds by building a mapping between simulated and real metabolic data through a novel method based on Siamese neural networks (SNN). RESULTS The proposed SNN model is able to perform disease prioritization for the metabolic profiles of IEM patients even for diseases that it was not trained to identify. To the best of our knowledge, this has not been attempted before. The developed model is able to significantly outperform a baseline model that relies on metabolic simulations only. The prioritization performances demonstrate the feasibility of the method, suggesting that the integration of metabolic models and data could significantly aid the IEM diagnosis process in the near future. AVAILABILITY AND IMPLEMENTATION Metabolic datasets used in this study are publicly available from the cited sources. The original data produced in this study, including the trained models and the simulated metabolic profiles, are also publicly available (Messa et al., 2020).
Collapse
Affiliation(s)
- Gian Marco Messa
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Francesco Napolitano
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Sarah H Elsea
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Diego di Bernardo
- Telethon Institute of Genetics and Medicine (TIGEM), Pozzuoli 80078, Italy.,Department of Chemical, Materials and Industrial Production Engineering, University of Naples Federico II, 80125 Naples, Italy
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
45
|
Helmy M, Smith D, Selvarajoo K. Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering. Metab Eng Commun 2020; 11:e00149. [PMID: 33072513 PMCID: PMC7546651 DOI: 10.1016/j.mec.2020.e00149] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/01/2020] [Accepted: 10/07/2020] [Indexed: 12/05/2022] Open
Abstract
Metabolic engineering aims to maximize the production of bio-economically important substances (compounds, enzymes, or other proteins) through the optimization of the genetics, cellular processes and growth conditions of microorganisms. This requires detailed understanding of underlying metabolic pathways involved in the production of the targeted substances, and how the cellular processes or growth conditions are regulated by the engineering. To achieve this goal, a large system of experimental techniques, compound libraries, computational methods and data resources, including multi-omics data, are used. The recent advent of multi-omics systems biology approaches significantly impacted the field by opening new avenues to perform dynamic and large-scale analyses that deepen our knowledge on the manipulations. However, with the enormous transcriptomics, proteomics and metabolomics available, it is a daunting task to integrate the data for a more holistic understanding. Novel data mining and analytics approaches, including Artificial Intelligence (AI), can provide breakthroughs where traditional low-throughput experiment-alone methods cannot easily achieve. Here, we review the latest attempts of combining systems biology and AI in metabolic engineering research, and highlight how this alliance can help overcome the current challenges facing industrial biotechnology, especially for food-related substances and compounds using microorganisms.
Collapse
Affiliation(s)
- Mohamed Helmy
- Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A∗STAR), Singapore, Singapore
| | - Derek Smith
- Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A∗STAR), Singapore, Singapore
| | - Kumar Selvarajoo
- Singapore Institute of Food and Biotechnology Innovation (SIFBI), Agency for Science, Technology and Research (A∗STAR), Singapore, Singapore
- Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore (NUS), Singapore, Singapore
| |
Collapse
|
46
|
Sailwal M, Das AJ, Gazara RK, Dasgupta D, Bhaskar T, Hazra S, Ghosh D. Connecting the dots: Advances in modern metabolomics and its application in yeast system. Biotechnol Adv 2020; 44:107616. [DOI: 10.1016/j.biotechadv.2020.107616] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 08/15/2020] [Accepted: 08/17/2020] [Indexed: 12/15/2022]
|
47
|
Hesse J, Malhan D, Yalҫin M, Aboumanify O, Basti A, Relógio A. An Optimal Time for Treatment-Predicting Circadian Time by Machine Learning and Mathematical Modelling. Cancers (Basel) 2020; 12:cancers12113103. [PMID: 33114254 PMCID: PMC7690897 DOI: 10.3390/cancers12113103] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/15/2020] [Accepted: 10/20/2020] [Indexed: 02/07/2023] Open
Abstract
Tailoring medical interventions to a particular patient and pathology has been termed personalized medicine. The outcome of cancer treatments is improved when the intervention is timed in accordance with the patient's internal time. Yet, one challenge of personalized medicine is how to consider the biological time of the patient. Prerequisite for this so-called chronotherapy is an accurate characterization of the internal circadian time of the patient. As an alternative to time-consuming measurements in a sleep-laboratory, recent studies in chronobiology predict circadian time by applying machine learning approaches and mathematical modelling to easier accessible observables such as gene expression. Embedding these results into the mathematical dynamics between clock and cancer in mammals, we review the precision of predictions and the potential usage with respect to cancer treatment and discuss whether the patient's internal time and circadian observables, may provide an additional indication for individualized treatment timing. Besides the health improvement, timing treatment may imply financial advantages, by ameliorating side effects of treatments, thus reducing costs. Summarizing the advances of recent years, this review brings together the current clinical standard for measuring biological time, the general assessment of circadian rhythmicity, the usage of rhythmic variables to predict biological time and models of circadian rhythmicity.
Collapse
Affiliation(s)
- Janina Hesse
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
| | - Deeksha Malhan
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
| | - Müge Yalҫin
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
| | - Ouda Aboumanify
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
| | - Alireza Basti
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
| | - Angela Relógio
- Institute for Theoretical Biology (ITB), Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany; (J.H.); (D.M.); (M.Y.); (O.A.); (A.B.)
- Molecular Cancer Research Center (MKFZ), Medical Department of Hematology, Oncology and Tumor Immunology, Charité—Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin Humboldt—Universität zu Berlin and Berlin Institute of Health, 10117 Berlin, Germany
- Department of Human Medicine, Institute for Systems Medicine and Bioinformatics, MSH Medical School Hamburg—University of Applied Sciences and Medical University, 20457 Hamburg, Germany
- Correspondence: or
| |
Collapse
|
48
|
Role of Glutathione in Cancer: From Mechanisms to Therapies. Biomolecules 2020; 10:biom10101429. [PMID: 33050144 PMCID: PMC7600400 DOI: 10.3390/biom10101429] [Citation(s) in RCA: 310] [Impact Index Per Article: 77.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 09/30/2020] [Accepted: 10/04/2020] [Indexed: 12/17/2022] Open
Abstract
Glutathione (GSH) is the most abundant non-protein thiol present at millimolar concentrations in mammalian tissues. As an important intracellular antioxidant, it acts as a regulator of cellular redox state protecting cells from damage caused by lipid peroxides, reactive oxygen and nitrogen species, and xenobiotics. Recent studies have highlighted the importance of GSH in key signal transduction reactions as a controller of cell differentiation, proliferation, apoptosis, ferroptosis and immune function. Molecular changes in the GSH antioxidant system and disturbances in GSH homeostasis have been implicated in tumor initiation, progression, and treatment response. Hence, GSH has both protective and pathogenic roles. Although in healthy cells it is crucial for the removal and detoxification of carcinogens, elevated GSH levels in tumor cells are associated with tumor progression and increased resistance to chemotherapeutic drugs. Recently, several novel therapies have been developed to target the GSH antioxidant system in tumors as a means for increased response and decreased drug resistance. In this comprehensive review we explore mechanisms of GSH functionalities and different therapeutic approaches that either target GSH directly, indirectly or use GSH-based prodrugs. Consideration is also given to the computational methods used to describe GSH related processes for in silico testing of treatment effects.
Collapse
|
49
|
Balcerczyk A, Damblon C, Elena-Herrmann B, Panthu B, Rautureau GJP. Metabolomic Approaches to Study Chemical Exposure-Related Metabolism Alterations in Mammalian Cell Cultures. Int J Mol Sci 2020; 21:E6843. [PMID: 32961865 PMCID: PMC7554780 DOI: 10.3390/ijms21186843] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/11/2020] [Accepted: 09/14/2020] [Indexed: 12/12/2022] Open
Abstract
Biological organisms are constantly exposed to an immense repertoire of molecules that cover environmental or food-derived molecules and drugs, triggering a continuous flow of stimuli-dependent adaptations. The diversity of these chemicals as well as their concentrations contribute to the multiplicity of induced effects, including activation, stimulation, or inhibition of physiological processes and toxicity. Metabolism, as the foremost phenotype and manifestation of life, has proven to be immensely sensitive and highly adaptive to chemical stimuli. Therefore, studying the effect of endo- or xenobiotics over cellular metabolism delivers valuable knowledge to apprehend potential cellular activity of individual molecules and evaluate their acute or chronic benefits and toxicity. The development of modern metabolomics technologies such as mass spectrometry or nuclear magnetic resonance spectroscopy now offers unprecedented solutions for the rapid and efficient determination of metabolic profiles of cells and more complex biological systems. Combined with the availability of well-established cell culture techniques, these analytical methods appear perfectly suited to determine the biological activity and estimate the positive and negative effects of chemicals in a variety of cell types and models, even at hardly detectable concentrations. Metabolic phenotypes can be estimated from studying intracellular metabolites at homeostasis in vivo, while in vitro cell cultures provide additional access to metabolites exchanged with growth media. This article discusses analytical solutions available for metabolic phenotyping of cell culture metabolism as well as the general metabolomics workflow suitable for testing the biological activity of molecular compounds. We emphasize how metabolic profiling of cell supernatants and intracellular extracts can deliver valuable and complementary insights for evaluating the effects of xenobiotics on cellular metabolism. We note that the concepts and methods discussed primarily for xenobiotics exposure are widely applicable to drug testing in general, including endobiotics that cover active metabolites, nutrients, peptides and proteins, cytokines, hormones, vitamins, etc.
Collapse
Affiliation(s)
- Aneta Balcerczyk
- Department of Molecular Biophysics, Faculty of Biology and Environmental Protection, University of Lodz, Pomorska 141/143, 90-236 Lodz, Poland;
| | - Christian Damblon
- Unité de Recherche MolSys, Faculté des sciences, Université de Liège, 4000 Liège, Belgium;
| | | | - Baptiste Panthu
- CarMeN Laboratory, INSERM, INRA, INSA Lyon, Univ Lyon, Université Claude Bernard Lyon 1, 69921 Oullins CEDEX, France;
- Hospices Civils de Lyon, Faculté de Médecine, Hôpital Lyon Sud, 69921 Oullins CEDEX, France
| | - Gilles J. P. Rautureau
- Centre de Résonance Magnétique Nucléaire à Très Hauts Champs (CRMN FRE 2034 CNRS, UCBL, ENS Lyon), Université Claude Bernard Lyon 1, 69100 Villeurbanne, France
| |
Collapse
|
50
|
Sen P, Lamichhane S, Mathema VB, McGlinchey A, Dickens AM, Khoomrung S, Orešič M. Deep learning meets metabolomics: a methodological perspective. Brief Bioinform 2020; 22:1531-1542. [PMID: 32940335 DOI: 10.1093/bib/bbaa204] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 08/08/2020] [Accepted: 08/10/2020] [Indexed: 12/15/2022] Open
Abstract
Deep learning (DL), an emerging area of investigation in the fields of machine learning and artificial intelligence, has markedly advanced over the past years. DL techniques are being applied to assist medical professionals and researchers in improving clinical diagnosis, disease prediction and drug discovery. It is expected that DL will help to provide actionable knowledge from a variety of 'big data', including metabolomics data. In this review, we discuss the applicability of DL to metabolomics, while presenting and discussing several examples from recent research. We emphasize the use of DL in tackling bottlenecks in metabolomics data acquisition, processing, metabolite identification, as well as in metabolic phenotyping and biomarker discovery. Finally, we discuss how DL is used in genome-scale metabolic modelling and in interpretation of metabolomics data. The DL-based approaches discussed here may assist computational biologists with the integration, prediction and drawing of statistical inference about biological outcomes, based on metabolomics data.
Collapse
Affiliation(s)
- Partho Sen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland.,School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| | - Santosh Lamichhane
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Vivek B Mathema
- Metabolomics and Systems Biology, Department of Biochemistry, and Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
| | - Aidan McGlinchey
- School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| | - Alex M Dickens
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
| | - Sakda Khoomrung
- Metabolomics and Systems Biology, Department of Biochemistry, and Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand.,Center for Innovation in Chemistry (PERCH), Faculty of Science, Mahidol University, Rama 6 Road, Bangkok 10400, Thailand
| | - Matej Orešič
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland.,School of Medical Sciences, Örebro University, 702 81 Örebro, Sweden
| |
Collapse
|