1
|
Qi F, Gao N, Li J, Zhou C, Jiang J, Zhou B, Guo L, Feng X, Ji J, Cai Q, Yang L, Zhu R, Que X, Wu J, Xi W, Qin W, Zhang J. A multidimensional recommendation framework for identifying biological targets to aid the diagnosis and treatment of liver metastasis in patients with colorectal cancer. Mol Cancer 2024; 23:239. [PMID: 39449040 PMCID: PMC11515508 DOI: 10.1186/s12943-024-02155-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Accepted: 10/11/2024] [Indexed: 10/26/2024] Open
Abstract
The quest to understand the molecular mechanisms of tumour metastasis and identify pivotal biomarkers for cancer therapy is increasing in importance. Single-omics analyses, constrained by their focus on a single biological layer, cannot fully elucidate the complexities of tumour molecular profiles and can thus overlook crucial molecular targets. In response to this limitation, we developed a multiobjective recommendation system (RJH-Metastasis 1.0) anchored in a multiomics knowledge graph to integrate genome, transcriptome, and proteome data and corroborative literature evidence and then conducted comprehensive analyses of colorectal cancer with liver metastasis (CRCLM). A total of 25 key genes significantly associated with CRCLM were recommended by our system, and GNB1, GATAD2A, GBP2, MACROD1, and EIF5B were further highlighted. Specifically, GNB1 presented fewer mutations but elevated RNA transcription and protein expression in CRCLM patients. The role of GNB1 in promoting the malignant behaviours of colon cancer cells was demonstrated via in vitro and in vivo studies. Aberrant expression of GNB1 could be regulated by METTL1-driven m7G modification. METTL1 knockdown decreased m7G modification in the 3' UTR of GNB1, increasing its mRNA transcription and translation during liver metastasis. Furthermore, GNB1 induced the formation of an immunosuppressive microenvironment by promoting the CLEC2C-KLRB1 interaction between memory B cells and KLRB1+PD-1+CD8+ cells. GNB1 expression and the efficacy of PD-1 antibody-based treatment in CRCLM patients were significantly correlated. In summary, our recommendation system can be used for effective exploration of key molecules in colorectal cancer, among which GNB1 was identified as a critical CRCLM promoter and immunotherapy biomarker in colorectal cancer patients.
Collapse
Affiliation(s)
- Feng Qi
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China.
| | - Na Gao
- Department of Laboratory Medicine, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan, 430071, P. R. China
| | - Jia Li
- Department of Thoracic Surgery, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 20025, P. R. China
| | - Chenfei Zhou
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Jinling Jiang
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Bin Zhou
- Department of Hepatic Surgery IV, Eastern Hepatobiliary Surgery Hospital, Naval Medical University, Shanghai, 200438, P. R. China
| | - Liting Guo
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Xiaohui Feng
- Department of Oncology, Loujiang New City Hospital of Taicang (Taicang Branch of Ruijin Hospital Affiliated with Shanghai Jiao Tong University School of Medicine), Suzhou, 215400, P. R. China
| | - Jun Ji
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Qu Cai
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Liu Yang
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Rongjia Zhu
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Xinyi Que
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Junwei Wu
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China
| | - Wenqi Xi
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China.
| | - Wenxing Qin
- Department of Medical Oncology, Fudan University Shanghai Cancer Center, Shanghai, 200032, P. R. China.
- Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, 200032, P. R. China.
| | - Jun Zhang
- Department of Oncology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, P. R. China.
| |
Collapse
|
2
|
Gelbach PE, Finley SD. Genome-scale modeling predicts metabolic differences between macrophage subtypes in colorectal cancer. iScience 2023; 26:107569. [PMID: 37664588 PMCID: PMC10474475 DOI: 10.1016/j.isci.2023.107569] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 05/24/2023] [Accepted: 08/07/2023] [Indexed: 09/05/2023] Open
Abstract
Colorectal cancer (CRC) shows high incidence and mortality, partly due to the tumor microenvironment (TME), which is viewed as an active promoter of disease progression. Macrophages are among the most abundant cells in the TME. These immune cells are generally categorized as M1, with inflammatory and anti-cancer properties, or M2, which promote tumor proliferation and survival. Although the M1/M2 subclassification scheme is strongly influenced by metabolism, the metabolic divergence between the subtypes remains poorly understood. Therefore, we generated a suite of computational models that characterize the M1- and M2-specific metabolic states. Our models show key differences between the M1 and M2 metabolic networks and capabilities. We leverage the models to identify metabolic perturbations that cause the metabolic state of M2 macrophages to more closely resemble M1 cells. Overall, this work increases understanding of macrophage metabolism in CRC and elucidates strategies to promote the metabolic state of anti-tumor macrophages.
Collapse
Affiliation(s)
- Patrick E. Gelbach
- Alfred E. Mann Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Stacey D. Finley
- Alfred E. Mann Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
3
|
Gelbach PE, Finley SD. Ensemble-based genome-scale modeling predicts metabolic differences between macrophage subtypes in colorectal cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.09.532000. [PMID: 36993493 PMCID: PMC10052244 DOI: 10.1101/2023.03.09.532000] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
1Colorectal cancer (CRC) shows high incidence and mortality, partly due to the tumor microenvironment, which is viewed as an active promoter of disease progression. Macrophages are among the most abundant cells in the tumor microenvironment. These immune cells are generally categorized as M1, with inflammatory and anti-cancer properties, or M2, which promote tumor proliferation and survival. Although the M1/M2 subclassification scheme is strongly influenced by metabolism, the metabolic divergence between the subtypes remains poorly understood. Therefore, we generated a suite of computational models that characterize the M1- and M2-specific metabolic states. Our models show key differences between the M1 and M2 metabolic networks and capabilities. We leverage the models to identify metabolic perturbations that cause the metabolic state of M2 macrophages to more closely resemble M1 cells. Overall, this work increases understanding of macrophage metabolism in CRC and elucidates strategies to promote the metabolic state of anti-tumor macrophages.
Collapse
Affiliation(s)
- Patrick E. Gelbach
- Alfred E. Mann Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
| | - Stacey D. Finley
- Alfred E. Mann Department of Biomedical Engineering, University of Southern California, Los Angeles, CA 90089, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
4
|
Rose TD, Köhler N, Falk L, Klischat L, Lazareva OE, Pauling JK. Lipid network and moiety analysis for revealing enzymatic dysregulation and mechanistic alterations from lipidomics data. Brief Bioinform 2023; 24:bbac572. [PMID: 36592059 PMCID: PMC9851308 DOI: 10.1093/bib/bbac572] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 11/10/2022] [Accepted: 11/24/2022] [Indexed: 01/03/2023] Open
Abstract
Lipidomics is of growing importance for clinical and biomedical research due to many associations between lipid metabolism and diseases. The discovery of these associations is facilitated by improved lipid identification and quantification. Sophisticated computational methods are advantageous for interpreting such large-scale data for understanding metabolic processes and their underlying (patho)mechanisms. To generate hypothesis about these mechanisms, the combination of metabolic networks and graph algorithms is a powerful option to pinpoint molecular disease drivers and their interactions. Here we present lipid network explorer (LINEX$^2$), a lipid network analysis framework that fuels biological interpretation of alterations in lipid compositions. By integrating lipid-metabolic reactions from public databases, we generate dataset-specific lipid interaction networks. To aid interpretation of these networks, we present an enrichment graph algorithm that infers changes in enzymatic activity in the context of their multispecificity from lipidomics data. Our inference method successfully recovered the MBOAT7 enzyme from knock-out data. Furthermore, we mechanistically interpret lipidomic alterations of adipocytes in obesity by leveraging network enrichment and lipid moieties. We address the general lack of lipidomics data mining options to elucidate potential disease mechanisms and make lipidomics more clinically relevant.
Collapse
Affiliation(s)
- Tim D Rose
- LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Nikolai Köhler
- LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Lisa Falk
- LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Lucie Klischat
- LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| | - Olga E Lazareva
- Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
- Junior Clinical Cooperation Unit Multiparametric methods for early detection of prostate cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
- European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
| | - Josch K Pauling
- LipiTUM, Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
| |
Collapse
|
5
|
Delmas M, Filangi O, Duperier C, Paulhe N, Vinson F, Rodriguez-Mier P, Giacomoni F, Jourdan F, Frainay C. Suggesting disease associations for overlooked metabolites using literature from metabolic neighbors. Gigascience 2022; 12:giad065. [PMID: 37712592 PMCID: PMC10502579 DOI: 10.1093/gigascience/giad065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 06/13/2023] [Accepted: 07/28/2023] [Indexed: 09/16/2023] Open
Abstract
In human health research, metabolic signatures extracted from metabolomics data have a strong added value for stratifying patients and identifying biomarkers. Nevertheless, one of the main challenges is to interpret and relate these lists of discriminant metabolites to pathological mechanisms. This task requires experts to combine their knowledge with information extracted from databases and the scientific literature. However, we show that most compounds (>99%) in the PubChem database lack annotated literature. This dearth of available information can have a direct impact on the interpretation of metabolic signatures, which is often restricted to a subset of significant metabolites. To suggest potential pathological phenotypes related to overlooked metabolites that lack annotated literature, we extend the "guilt-by-association" principle to literature information by using a Bayesian framework. The underlying assumption is that the literature associated with the metabolic neighbors of a compound can provide valuable insights, or an a priori, into its biomedical context. The metabolic neighborhood of a compound can be defined from a metabolic network and correspond to metabolites to which it is connected through biochemical reactions. With the proposed approach, we suggest more than 35,000 associations between 1,047 overlooked metabolites and 3,288 diseases (or disease families). All these newly inferred associations are freely available on the FORUM ftp server (see information at https://github.com/eMetaboHUB/Forum-LiteraturePropagation).
Collapse
Affiliation(s)
- Maxime Delmas
- Toxalim (Research Center in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
| | - Olivier Filangi
- IGEPP, INRAE, Institut Agro, Université de Rennes, Domaine de la Motte, 35653 Le Rheu, France
| | - Christophe Duperier
- Université Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du Métabolisme, MetaboHUB Clermont, F-63000 Clermont-Ferrand, France
| | - Nils Paulhe
- Université Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du Métabolisme, MetaboHUB Clermont, F-63000 Clermont-Ferrand, France
| | - Florence Vinson
- Toxalim (Research Center in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
- MetaboHUB-Metatoul, National Infrastructure of Metabolomics and Fluxomics, Toulouse, 31300, France
| | - Pablo Rodriguez-Mier
- Toxalim (Research Center in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
| | - Franck Giacomoni
- Université Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du Métabolisme, MetaboHUB Clermont, F-63000 Clermont-Ferrand, France
| | - Fabien Jourdan
- Toxalim (Research Center in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
- MetaboHUB-Metatoul, National Infrastructure of Metabolomics and Fluxomics, Toulouse, 31300, France
| | - Clément Frainay
- Toxalim (Research Center in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
| |
Collapse
|
6
|
Gogleva A, Polychronopoulos D, Pfeifer M, Poroshin V, Ughetto M, Martin MJ, Thorpe H, Bornot A, Smith PD, Sidders B, Dry JR, Ahdesmäki M, McDermott U, Papa E, Bulusu KC. Knowledge graph-based recommendation framework identifies drivers of resistance in EGFR mutant non-small cell lung cancer. Nat Commun 2022; 13:1667. [PMID: 35351890 PMCID: PMC8964738 DOI: 10.1038/s41467-022-29292-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 03/09/2022] [Indexed: 12/25/2022] Open
Abstract
Resistance to EGFR inhibitors (EGFRi) presents a major obstacle in treating non-small cell lung cancer (NSCLC). One of the most exciting new ways to find potential resistance markers involves running functional genetic screens, such as CRISPR, followed by manual triage of significantly enriched genes. This triage process to identify 'high value' hits resulting from the CRISPR screen involves manual curation that requires specialized knowledge and can take even experts several months to comprehensively complete. To find key drivers of resistance faster we build a recommendation system on top of a heterogeneous biomedical knowledge graph integrating pre-clinical, clinical, and literature evidence. The recommender system ranks genes based on trade-offs between diverse types of evidence linking them to potential mechanisms of EGFRi resistance. This unbiased approach identifies 57 resistance markers from >3,000 genes, reducing hit identification time from months to minutes. In addition to reproducing known resistance markers, our method identifies previously unexplored resistance mechanisms that we prospectively validate.
Collapse
Affiliation(s)
- Anna Gogleva
- Biological Insight Knowledge Graph (BIKG), AI Engineering, R&D IT, AstraZeneca, Cambridge, UK
| | - Dimitris Polychronopoulos
- Early Computational Oncology, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Matthias Pfeifer
- Bioscience, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | | | - Michaël Ughetto
- Biological Insight Knowledge Graph (BIKG), AI Engineering, R&D IT, AstraZeneca, Gothenburg, Sweden
| | - Matthew J Martin
- Bioscience, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Hannah Thorpe
- Bioscience, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Aurelie Bornot
- Data Sciences & Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge, UK
| | - Paul D Smith
- Bioscience, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Ben Sidders
- Early Computational Oncology, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Jonathan R Dry
- Early Computational Oncology, Research and Early Development, Oncology R&D, AstraZeneca, Waltham, MA, USA
| | - Miika Ahdesmäki
- Early Computational Oncology, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Ultan McDermott
- Bioscience, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK
| | - Eliseo Papa
- Biological Insight Knowledge Graph (BIKG), AI Engineering, R&D IT, AstraZeneca, Cambridge, UK.
| | - Krishna C Bulusu
- Early Computational Oncology, Research and Early Development, Oncology R&D, AstraZeneca, Cambridge, UK.
| |
Collapse
|
7
|
Galvão Ferrarini M, Ziska I, Andrade R, Julien-Laferrière A, Duchemin L, César RM, Mary A, Vinga S, Sagot MF. Totoro: Identifying Active Reactions During the Transient State for Metabolic Perturbations. Front Genet 2022; 13:815476. [PMID: 35281848 PMCID: PMC8905348 DOI: 10.3389/fgene.2022.815476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 01/24/2022] [Indexed: 11/16/2022] Open
Abstract
Motivation: The increasing availability of metabolomic data and their analysis are improving the understanding of cellular mechanisms and how biological systems respond to different perturbations. Currently, there is a need for novel computational methods that facilitate the analysis and integration of increasing volume of available data. Results: In this paper, we present Totoro a new constraint-based approach that integrates quantitative non-targeted metabolomic data of two different metabolic states into genome-wide metabolic models and predicts reactions that were most likely active during the transient state. We applied Totoro to real data of three different growth experiments (pulses of glucose, pyruvate, succinate) from Escherichia coli and we were able to predict known active pathways and gather new insights on the different metabolisms related to each substrate. We used both the E. coli core and the iJO1366 models to demonstrate that our approach is applicable to both smaller and larger networks. Availability:Totoro is an open source method (available at https://gitlab.inria.fr/erable/totoro) suitable for any organism with an available metabolic model. It is implemented in C++ and depends on IBM CPLEX which is freely available for academic purposes.
Collapse
Affiliation(s)
- Mariana Galvão Ferrarini
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France.,Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, Villeurbanne, France
| | - Irene Ziska
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France.,INRIA Grenoble Rhône-Alpes, Villeurbanne, France
| | - Ricardo Andrade
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France.,Institute of Mathematics and Statistics (IME), University of São Paulo, São Paulo, Brazil
| | | | - Louis Duchemin
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France
| | | | - Arnaud Mary
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France.,INRIA Grenoble Rhône-Alpes, Villeurbanne, France
| | - Susana Vinga
- INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
| | - Marie-France Sagot
- Laboratoire de Biométrie et Biologie Évolutive, UMR 5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France.,INRIA Grenoble Rhône-Alpes, Villeurbanne, France
| |
Collapse
|
8
|
Amara A, Frainay C, Jourdan F, Naake T, Neumann S, Novoa-del-Toro EM, Salek RM, Salzer L, Scharfenberg S, Witting M. Networks and Graphs Discovery in Metabolomics Data Analysis and Interpretation. Front Mol Biosci 2022; 9:841373. [PMID: 35350714 PMCID: PMC8957799 DOI: 10.3389/fmolb.2022.841373] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 02/18/2022] [Indexed: 01/19/2023] Open
Abstract
Both targeted and untargeted mass spectrometry-based metabolomics approaches are used to understand the metabolic processes taking place in various organisms, from prokaryotes, plants, fungi to animals and humans. Untargeted approaches allow to detect as many metabolites as possible at once, identify unexpected metabolic changes, and characterize novel metabolites in biological samples. However, the identification of metabolites and the biological interpretation of such large and complex datasets remain challenging. One approach to address these challenges is considering that metabolites are connected through informative relationships. Such relationships can be formalized as networks, where the nodes correspond to the metabolites or features (when there is no or only partial identification), and edges connect nodes if the corresponding metabolites are related. Several networks can be built from a single dataset (or a list of metabolites), where each network represents different relationships, such as statistical (correlated metabolites), biochemical (known or putative substrates and products of reactions), or chemical (structural similarities, ontological relations). Once these networks are built, they can subsequently be mined using algorithms from network (or graph) theory to gain insights into metabolism. For instance, we can connect metabolites based on prior knowledge on enzymatic reactions, then provide suggestions for potential metabolite identifications, or detect clusters of co-regulated metabolites. In this review, we first aim at settling a nomenclature and formalism to avoid confusion when referring to different networks used in the field of metabolomics. Then, we present the state of the art of network-based methods for mass spectrometry-based metabolomics data analysis, as well as future developments expected in this area. We cover the use of networks applications using biochemical reactions, mass spectrometry features, chemical structural similarities, and correlations between metabolites. We also describe the application of knowledge networks such as metabolic reaction networks. Finally, we discuss the possibility of combining different networks to analyze and interpret them simultaneously.
Collapse
Affiliation(s)
- Adam Amara
- Section of Nutrition and Metabolism, International Agency for Research on Cancer (IARC-WHO), Lyon, France
| | - Clément Frainay
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France
| | - Fabien Jourdan
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France
- MetaboHUB-Metatoul, National Infrastructure of Metabolomics and Fluxomics, Toulouse, France
| | - Thomas Naake
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Steffen Neumann
- Bioinformatics and Scientific Data, Leibniz Institute of Plant Biochemistry, Halle (Saale), Germany
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| | - Elva María Novoa-del-Toro
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France
| | | | - Liesa Salzer
- Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, Neuherberg, Germany
| | - Sarah Scharfenberg
- Bioinformatics and Scientific Data, Leibniz Institute of Plant Biochemistry, Halle (Saale), Germany
| | - Michael Witting
- Metabolomics and Proteomics Core, Helmholtz Zentrum München, Neuherberg, Germany
- Chair of Analytical Food Chemistry, TUM School of Life Sciences, Freising, Germany
| |
Collapse
|
9
|
High-confidence structural annotation of metabolites absent from spectral libraries. Nat Biotechnol 2021; 40:411-421. [PMID: 34650271 PMCID: PMC8926923 DOI: 10.1038/s41587-021-01045-9] [Citation(s) in RCA: 97] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 08/04/2021] [Indexed: 12/14/2022]
Abstract
Untargeted metabolomics experiments rely on spectral libraries for structure annotation, but, typically, only a small fraction of spectra can be matched. Previous in silico methods search in structure databases but cannot distinguish between correct and incorrect annotations. Here we introduce the COSMIC workflow that combines in silico structure database generation and annotation with a confidence score consisting of kernel density P value estimation and a support vector machine with enforced directionality of features. On diverse datasets, COSMIC annotates a substantial number of hits at low false discovery rates and outperforms spectral library search. To demonstrate that COSMIC can annotate structures never reported before, we annotated 12 natural bile acids. The annotation of nine structures was confirmed by manual evaluation and two structures using synthetic standards. In human samples, we annotated and manually validated 315 molecular structures currently absent from the Human Metabolome Database. Application of COSMIC to data from 17,400 metabolomics experiments led to 1,715 high-confidence structural annotations that were absent from spectral libraries.
Collapse
|
10
|
Tomi-Andrino C, Norman R, Millat T, Soucaille P, Winzer K, Barrett DA, King J, Kim DH. Physicochemical and metabolic constraints for thermodynamics-based stoichiometric modelling under mesophilic growth conditions. PLoS Comput Biol 2021; 17:e1007694. [PMID: 33493151 PMCID: PMC7861524 DOI: 10.1371/journal.pcbi.1007694] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Revised: 02/04/2021] [Accepted: 12/28/2020] [Indexed: 12/11/2022] Open
Abstract
Metabolic engineering in the post-genomic era is characterised by the development of new methods for metabolomics and fluxomics, supported by the integration of genetic engineering tools and mathematical modelling. Particularly, constraint-based stoichiometric models have been widely studied: (i) flux balance analysis (FBA) (in silico), and (ii) metabolic flux analysis (MFA) (in vivo). Recent studies have enabled the incorporation of thermodynamics and metabolomics data to improve the predictive capabilities of these approaches. However, an in-depth comparison and evaluation of these methods is lacking. This study presents a thorough analysis of two different in silico methods tested against experimental data (metabolomics and 13C-MFA) for the mesophile Escherichia coli. In particular, a modified version of the recently published matTFA toolbox was created, providing a broader range of physicochemical parameters. Validating against experimental data allowed the determination of the best physicochemical parameters to perform the TFA (Thermodynamics-based Flux Analysis). An analysis of flux pattern changes in the central carbon metabolism between 13C-MFA and TFA highlighted the limited capabilities of both approaches for elucidating the anaplerotic fluxes. In addition, a method based on centrality measures was suggested to identify important metabolites that (if quantified) would allow to further constrain the TFA. Finally, this study emphasised the need for standardisation in the fluxomics community: novel approaches are frequently released but a thorough comparison with currently accepted methods is not always performed. Biotechnology has benefitted from the development of high throughput methods characterising living systems at different levels (e.g. concerning genes or proteins), allowing the industrial production of chemical commodities. Recently, focus has been placed on determining reaction rates (or metabolic fluxes) in the metabolic network of certain microorganisms, in order to identify bottlenecks hindering their exploitation. Two main approaches are commonly used, termed metabolic flux analysis (MFA) and flux balance analysis (FBA), based on measuring and estimating fluxes, respectively. While the influence of thermodynamics in living systems was accepted several decades ago, its application to study biochemical networks has only recently been enabled. In this sense, a multitude of different approaches constraining well-established modelling methods with thermodynamics has been suggested. However, physicochemical parameters are generally not properly adjusted to the experimental conditions, which might affect their predictive capabilities. In this study, we have explored the reliability of currently available tools by investigating the impact of varying said parameters in the simulation of metabolic fluxes and metabolite concentration values. Additionally, our in-depth analysis allowed us to highlight limitations and potential solutions that should be considered in future studies.
Collapse
Affiliation(s)
- Claudio Tomi-Andrino
- Centre for Analytical Bioscience, Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, BioDiscovery Institute, University of Nottingham, Nottingham, United Kingdom
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Mathematical Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Rupert Norman
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, BioDiscovery Institute, University of Nottingham, Nottingham, United Kingdom
| | - Thomas Millat
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, BioDiscovery Institute, University of Nottingham, Nottingham, United Kingdom
| | - Philippe Soucaille
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, BioDiscovery Institute, University of Nottingham, Nottingham, United Kingdom
- INSA, UPS, INP, Toulouse Biotechnology Institute, (TBI), Université de Toulouse, Toulouse, France
- INRA, UMR792, Toulouse, France
- CNRS, UMR5504, Toulouse, France
| | - Klaus Winzer
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Life Sciences, BioDiscovery Institute, University of Nottingham, Nottingham, United Kingdom
| | - David A. Barrett
- Centre for Analytical Bioscience, Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
| | - John King
- Nottingham BBSRC/EPSRC Synthetic Biology Research Centre (SBRC), School of Mathematical Sciences, University of Nottingham, Nottingham, United Kingdom
| | - Dong-Hyun Kim
- Centre for Analytical Bioscience, Advanced Materials and Healthcare Technologies Division, School of Pharmacy, University of Nottingham, Nottingham, United Kingdom
- * E-mail:
| |
Collapse
|
11
|
Perez De Souza L, Alseekh S, Brotman Y, Fernie AR. Network-based strategies in metabolomics data analysis and interpretation: from molecular networking to biological interpretation. Expert Rev Proteomics 2020; 17:243-255. [PMID: 32380880 DOI: 10.1080/14789450.2020.1766975] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
INTRODUCTION Metabolomics has become a crucial part of systems biology; however, data analysis is still often undertaken in a reductionist way focusing on changes in individual metabolites. Whilst such approaches indeed provide relevant insights into the metabolic phenotype of an organism, the intricate nature of metabolic relationships may be better explored when considering the whole system. AREAS COVERED This review highlights multiple network strategies that can be applied for metabolomics data analysis from different perspectives including: association networks based on quantitative information, mass spectra similarity networks to assist metabolite annotation and biochemical networks for systematic data interpretation. We also highlight some relevant insights into metabolic organization obtained through the exploration of such approaches. EXPERT OPINION Network based analysis is an established method that allows the identification of non-intuitive metabolic relationships as well as the identification of unknown compounds in mass spectrometry. Additionally, the representation of data from metabolomics within the context of metabolic networks is intuitive and allows for the use of statistical analysis that can better summarize relevant metabolic changes from a systematic perspective.
Collapse
Affiliation(s)
- Leonardo Perez De Souza
- Department of molecular physiology, Max-Planck-Institute of Molecular Plant Physiology , Potsdam-Golm, Germany
| | - Saleh Alseekh
- Department of molecular physiology, Max-Planck-Institute of Molecular Plant Physiology , Potsdam-Golm, Germany.,Department of plant metabolomics, Centre of Plant Systems Biology and Biotechnology , Plovdiv, Bulgaria
| | - Yariv Brotman
- Department of Life Sciences, Ben-Gurion University of the Negev , Beersheba, Israel
| | - Alisdair R Fernie
- Department of molecular physiology, Max-Planck-Institute of Molecular Plant Physiology , Potsdam-Golm, Germany.,Department of plant metabolomics, Centre of Plant Systems Biology and Biotechnology , Plovdiv, Bulgaria
| |
Collapse
|
12
|
Vitale L, Piovesan A, Antonaros F, Strippoli P, Pelleri MC, Caracausi M. Dataset of differential gene expression between total normal human thyroid and histologically normal thyroid adjacent to papillary thyroid carcinoma. Data Brief 2019; 24:103835. [PMID: 31049370 PMCID: PMC6479735 DOI: 10.1016/j.dib.2019.103835] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Revised: 02/28/2019] [Accepted: 03/07/2019] [Indexed: 12/16/2022] Open
Abstract
This article contains further data and information from our published manuscript [1]. We aim to identify significant transcriptome alterations of total normal human thyroid vs. histologically normal thyroid adjacent to papillary thyroid carcinoma. We performed a systematic meta-analysis of all the available gene expression profiles for the whole organ also collecting gene expression data for the normal thyroid adjacent to papillary thyroid carcinoma. A differential quantitative transcriptome reference map was generated by using TRAM (Transcriptome Mapper) software able to combine, normalize and integrate a total of 35 datasets from total normal thyroid and 40 datasets from histologically normal thyroid adjacent to papillary thyroid carcinoma from different sources. This analysis identified genes and genome segments that significantly discriminated the two groups of samples. Differentially expressed genes were grouped and enrichment function analyses were performed identifying the main features of the differentially expressed genes between total normal thyroid and histologically normal thyroid adjacent to papillary thyroid carcinoma. The search for housekeeping genes retrieved 414 loci.
Collapse
|