1
|
Mohanty I, Allaband C, Mannochio-Russo H, El Abiead Y, Hagey LR, Knight R, Dorrestein PC. The changing metabolic landscape of bile acids - keys to metabolism and immune regulation. Nat Rev Gastroenterol Hepatol 2024; 21:493-516. [PMID: 38575682 DOI: 10.1038/s41575-024-00914-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/14/2024] [Indexed: 04/06/2024]
Abstract
Bile acids regulate nutrient absorption and mitochondrial function, they establish and maintain gut microbial community composition and mediate inflammation, and they serve as signalling molecules that regulate appetite and energy homeostasis. The observation that there are hundreds of bile acids, especially many amidated bile acids, necessitates a revision of many of the classical descriptions of bile acids and bile acid enzyme functions. For example, bile salt hydrolases also have transferase activity. There are now hundreds of known modifications to bile acids and thousands of bile acid-associated genes, especially when including the microbiome, distributed throughout the human body (for example, there are >2,400 bile salt hydrolases alone). The fact that so much of our genetic and small-molecule repertoire, in both amount and diversity, is dedicated to bile acid function highlights the centrality of bile acids as key regulators of metabolism and immune homeostasis, which is, in large part, communicated via the gut microbiome.
Collapse
Affiliation(s)
- Ipsita Mohanty
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Celeste Allaband
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Helena Mannochio-Russo
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Lee R Hagey
- Department of Medicine, University of California San Diego, San Diego, CA, USA
| | - Rob Knight
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Pieter C Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA.
- Department of Pharmacology, University of California San Diego, La Jolla, CA, USA.
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
2
|
Shahneh MRZ, Strobel M, Vitale GA, Geibel C, Abiead YE, Garg N, Wagner B, Forchhammer K, Aron A, Phelan VV, Petras D, Wang M. ModiFinder: Tandem Mass Spectral Alignment Enables Structural Modification Site Localization. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024. [PMID: 38830143 DOI: 10.1021/jasms.4c00061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Untargeted tandem mass spectrometry (MS/MS) has become a high-throughput method to measure small molecules in complex samples. One key goal is the transformation of these MS/MS spectra into chemical structures. Computational techniques such as MS/MS library search have enabled the reidentification of known compounds. Analog library search and molecular networking extend this identification to unknown compounds. While there have been advancements in metrics for the similarity of MS/MS spectra of structurally similar compounds, there is still a lack of automated methods to provide site specific information about structural modifications. Here we introduce ModiFinder which leverages the alignment of peaks in MS/MS spectra between structurally related known and unknown small molecules. Specifically, ModiFinder focuses on shifted MS/MS fragment peaks in the MS/MS alignment. These shifted peaks putatively represent substructures of the known molecule that contain the site of the modification. ModiFinder synthesizes this information together and scores the likelihood for each atom in the known molecule to be the modification site. We demonstrate in this manuscript how ModiFinder can effectively localize modifications which extends the capabilities of MS/MS analog searching and molecular networking to accelerate the discovery of novel compounds.
Collapse
Affiliation(s)
- Mohammad Reza Zare Shahneh
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Michael Strobel
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Giovanni Andrea Vitale
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 24, Tuebingen 72076, Germany
| | - Christian Geibel
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 24, Tuebingen 72076, Germany
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Dr., San Diego, California 92093, United States
| | - Neha Garg
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta,, 950 Atlantic Drive, Atlanta, Georgia 30332, United States
| | - Berenike Wagner
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 28, Tuebingen 72076, Germany
| | - Karl Forchhammer
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 28, Tuebingen 72076, Germany
| | - Allegra Aron
- Department of Chemistry and Biochemistry, University of Denver, 2101 East Wesley Ave, Denver, Colorado 80210, United States
| | - Vanessa V Phelan
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado, Anschutz Medical Campus, 12850 E Montview Blvd, Aurora, Colorado 80045, United States
| | - Daniel Petras
- Department of Biochemistry, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| |
Collapse
|
3
|
Bui-Thi D, Liu Y, Lippens JL, Laukens K, De Vijlder T. TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry. J Cheminform 2024; 16:61. [PMID: 38807166 PMCID: PMC11134763 DOI: 10.1186/s13321-024-00858-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 05/12/2024] [Indexed: 05/30/2024] Open
Abstract
Small molecule identification is a crucial task in analytical chemistry and life sciences. One of the most commonly used technologies to elucidate small molecule structures is mass spectrometry. Spectral library search of product ion spectra (MS/MS) is a popular strategy to identify or find structural analogues. This approach relies on the assumption that spectral similarity and structural similarity are correlated. However, popular spectral similarity measures, usually calculated based on identical fragment matches between the MS/MS spectra, do not always accurately reflect the structural similarity. In this study, we propose TransExION, a Transformer based Explainable similarity metric for IONS. TransExION detects related fragments between MS/MS spectra through their mass difference and uses these to estimate spectral similarity. These related fragments can be nearly identical, but can also share a substructure. TransExION also provides a post-hoc explanation of its estimation, which can be used to support scientists in evaluating the spectral library search results and thus in structure elucidation of unknown molecules. Our model has a Transformer based architecture and it is trained on the data derived from GNPS MS/MS libraries. The experimental results show that it improves existing spectral similarity measures in searching and interpreting structural analogues as well as in molecular networking. SCIENTIFIC CONTRIBUTION: We propose a transformer-based spectral similarity metrics that improves the comparison of small molecule tandem mass spectra. We provide a post hoc explanation that can serve as a good starting point for unknown spectra annotation based on database spectra.
Collapse
Affiliation(s)
- Danh Bui-Thi
- Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium
| | - Youzhong Liu
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Jennifer L Lippens
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Kris Laukens
- Computer Science Department, University of Antwerp, Middelheimlaan 1, 2020, Antwerp, Belgium
| | - Thomas De Vijlder
- Therapeutic Development and Supply, Janssen Pharmaceutica N.V., Turnhoutseweg 30, 2340, Beerse, Belgium.
| |
Collapse
|
4
|
Dos Reis JBA, Lorenzi AS, Pinho DB, Cortelo PC, do Vale HMM. The hidden treasures in endophytic fungi: a comprehensive review on the diversity of fungal bioactive metabolites, usual analytical methodologies, and applications. Arch Microbiol 2024; 206:185. [PMID: 38506928 DOI: 10.1007/s00203-024-03911-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 02/20/2024] [Accepted: 02/26/2024] [Indexed: 03/22/2024]
Abstract
This review provides a comprehensive overview of the key aspects of the natural metabolite production by endophytic fungi, which has attracted significant attention due to its diverse biological activities and wide range of applications. Synthesized by various fungal species, these metabolites encompass compounds with therapeutic, agricultural, and commercial significance. We delved into strategies and advancements aimed at optimizing fungal metabolite production. Fungal cultivation, especially by Aspergillus, Penicillium, and Fusarium, plays a pivotal role in metabolite biosynthesis, and researchers have explored both submerged and solid-state cultivation processes to harness the full potential of fungal species. Nutrient optimization, pH, and temperature control are critical factors in ensuring high yields of the targeted bioactive metabolites especially for scaling up processes. Analytical methods that includes High-Performance Liquid Chromatography (HPLC), Liquid Chromatography-Mass Spectrometry (LC-MS), Gas Chromatography-Mass Spectrometry (GC-MS), Nuclear Magnetic Resonance (NMR), and Mass Spectrometry (MS), are indispensable for the identification and quantification of the compounds. Moreover, genetic engineering and metabolic pathway manipulation have emerged as powerful tools to enhance metabolite production and develop novel fungal strains with increased yields. Regulation and control mechanisms at the genetic, epigenetic, and metabolic levels are explored to fine-tune the biosynthesis of fungal metabolites. Ongoing research aims to overcome the complexity of the steps involved to ensure the efficient production and utilization of fungal metabolites.
Collapse
Affiliation(s)
| | - Adriana Sturion Lorenzi
- Department of Cellular Biology, Institute of Biological Sciences, University of Brasília (UnB), Brasília, DF, Brazil
| | - Danilo Batista Pinho
- Department of Phytopathology, Institute of Biological Sciences, University of Brasília (UnB), Brasília, DF, Brazil
| | | | - Helson Mario Martins do Vale
- Department of Phytopathology, Institute of Biological Sciences, University of Brasília (UnB), Brasília, DF, Brazil
| |
Collapse
|
5
|
Bittremieux W, Avalon NE, Thomas SP, Kakhkhorov SA, Aksenov AA, Gomes PWP, Aceves CM, Caraballo-Rodríguez AM, Gauglitz JM, Gerwick WH, Huan T, Jarmusch AK, Kaddurah-Daouk RF, Kang KB, Kim HW, Kondić T, Mannochio-Russo H, Meehan MJ, Melnik AV, Nothias LF, O'Donovan C, Panitchpakdi M, Petras D, Schmid R, Schymanski EL, van der Hooft JJJ, Weldon KC, Yang H, Xing S, Zemlin J, Wang M, Dorrestein PC. Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics. Nat Commun 2023; 14:8488. [PMID: 38123557 PMCID: PMC10733301 DOI: 10.1038/s41467-023-44035-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 11/28/2023] [Indexed: 12/23/2023] Open
Abstract
Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of MS/MS spectra originating from published untargeted metabolomics experiments. Entries in this library, or "suspects," were derived from unannotated spectra that could be linked in a molecular network to an annotated spectrum. Annotations were propagated to unknowns based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer's brain phenotype. The nearest neighbor suspect spectral library is openly available for download or for data analysis through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020, Antwerpen, Belgium.
| | - Nicole E Avalon
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, 92093, USA
| | - Sydney P Thomas
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Sarvar A Kakhkhorov
- Laboratory of Physical and Chemical Methods of Research, Center for Advanced Technologies, Tashkent, 100174, Uzbekistan
- Department of Food Science, Faculty of Science, University of Copenhagen, Rolighedsvej 26, 1958, Frederiksberg C, Denmark
| | - Alexander A Aksenov
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Chemistry, University of Connecticut, Storrs, CT, 06269, USA
- Arome Science inc., Farmington, CT, 06032, USA
| | - Paulo Wender P Gomes
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Christine M Aceves
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, 92037, USA
| | - Andrés Mauricio Caraballo-Rodríguez
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Julia M Gauglitz
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - William H Gerwick
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
| | - Tao Huan
- Department of Chemistry, University of British Columbia, Vancouver, BC, V6T 1Z1, Canada
| | - Alan K Jarmusch
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Immunity, Inflammation, and Disease Laboratory, Division of Intramural Research, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, Durham, NC, 27709, USA
| | - Rima F Kaddurah-Daouk
- Department of Psychiatry and Behavioral Sciences, Duke University School of Medicine, Durham, NC, 27701, USA
- Department of Medicine, Duke University, Durham, NC, 27710, USA
- Duke Institute of Brain Sciences, Duke University, Durham, NC, 27710, USA
| | - Kyo Bin Kang
- College of Pharmacy and Research Institute of Pharmaceutical Sciences, Sookmyung Women's University, Seoul, 04310, Korea
| | - Hyun Woo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University, Goyang, 10326, Korea
| | - Todor Kondić
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367, Belvaux, Luxembourg
| | - Helena Mannochio-Russo
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Biochemistry and Organic Chemistry, Institute of Chemistry, São Paulo State University, Araraquara, 14800-901, Brazil
| | - Michael J Meehan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Alexey V Melnik
- Department of Chemistry, University of Connecticut, Storrs, CT, 06269, USA
- Arome Science inc., Farmington, CT, 06032, USA
| | - Louis-Felix Nothias
- Université Côte d'Azur, CNRS, ICN, Nice, France
- Interdisciplinary Institute for Artificial Intelligence (3iA) Côte d'Azur, Nice, France
| | - Claire O'Donovan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Morgan Panitchpakdi
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Daniel Petras
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, 72076, Tuebingen, Germany
- Department of Biochemistry, University of California Riverside, Riverside, CA, 92507, USA
| | - Robin Schmid
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Emma L Schymanski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367, Belvaux, Luxembourg
| | - Justin J J van der Hooft
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, The Netherlands
| | - Kelly C Weldon
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Heejung Yang
- Laboratory of Natural Products Chemistry, College of Pharmacy, Kangwon National University, Chuncheon, 24341, Korea
| | - Shipei Xing
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Department of Chemistry, University of British Columbia, Vancouver, BC, V6T 1Z1, Canada
| | - Jasmine Zemlin
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, Riverside, CA, 92507, USA
| | - Pieter C Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA.
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
6
|
Harwood TV, Treen DGC, Wang M, de Jong W, Northen TR, Bowen BP. BLINK enables ultrafast tandem mass spectrometry cosine similarity scoring. Sci Rep 2023; 13:13462. [PMID: 37596301 PMCID: PMC10439109 DOI: 10.1038/s41598-023-40496-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 08/11/2023] [Indexed: 08/20/2023] Open
Abstract
Metabolomics has a long history of using cosine similarity to match experimental tandem mass spectra to databases for compound identification. Here we introduce the Blur-and-Link (BLINK) approach for scoring cosine similarity. By bypassing fragment alignment and simultaneously scoring all pairs of spectra using sparse matrix operations, BLINK is over 3000 times faster than MatchMS, a widely used loop-based alignment and scoring implementation. Using a similarity cutoff of 0.7, BLINK and MatchMS had practically equivalent identification agreement, and greater than 99% of their scores and matching ion counts were identical. This performance improvement can enable calculations to be performed that would typically be limited by time and available computational resources.
Collapse
Affiliation(s)
- Thomas V Harwood
- Environmental Genomics and Systems Biology Division, The DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, 94720, USA
| | - Daniel G C Treen
- Environmental Genomics and Systems Biology Division, The DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, 94720, USA
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, 900 University Avenue, Riverside, CA, 92521, USA
| | - Wibe de Jong
- Computational Chemistry, Materials and Climate Group, Computational Research Division, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, 94720, USA
| | - Trent R Northen
- Environmental Genomics and Systems Biology Division, The DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, 94720, USA
| | - Benjamin P Bowen
- Environmental Genomics and Systems Biology Division, The DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA, 94720, USA.
| |
Collapse
|
7
|
Wang X, Li C, Li Z, Qi Y, Zhang X, Zhao X, Zhao C, Lin X, Lu X, Xu G. A Structure-Guided Molecular Network Strategy for Global Untargeted Metabolomics Data Annotation. Anal Chem 2023; 95:11603-11612. [PMID: 37493263 DOI: 10.1021/acs.analchem.3c00849] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Large-scale metabolite annotation is a bottleneck in untargeted metabolomics. Here, we present a structure-guided molecular network strategy (SGMNS) for deep annotation of untargeted ultra-performance liquid chromatography-high resolution mass spectrometry (MS) metabolomics data. Different from the current network-based metabolite annotation method, SGMNS is based on a global connectivity molecular network (GCMN), which was constructed by molecular fingerprint similarity of chemical structures in metabolome databases. Neighbor metabolites with similar structures in GCMN are expected to produce similar spectra. Network annotation propagation of SGMNS is performed using known metabolites as seeds. The experimental MS/MS spectra of seeds are assigned to corresponding neighbor metabolites in GCMN as their "pseudo" spectra; the propagation is done by searching predicted retention times, MS1, and "pseudo" spectra against metabolite features in untargeted metabolomics data. Then, the annotated metabolite features were used as new seeds for annotation propagation again. Performance evaluation of SGMNS showed its unique advantages for metabolome annotation. The developed method was applied to annotate six typical biological samples; a total of 701, 1557, 1147, 1095, 1237, and 2041 metabolites were annotated from the cell, feces, plasma (NIST SRM 1950), tissue, urine, and their pooled sample, respectively, and the annotation accuracy was >83% with RSD <2%. The results show that SGMNS fully exploits the chemical space of the existing metabolomes for metabolite deep annotation and overcomes the shortcoming of insufficient reference MS/MS spectra.
Collapse
Affiliation(s)
- Xinxin Wang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Chao Li
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- School of Computer Science & Technology, Dalian University of Technology, Dalian 116024, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Zaifang Li
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Yanpeng Qi
- School of Computer Science & Technology, Dalian University of Technology, Dalian 116024, P.R. China
| | - Xiuqiong Zhang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Xinjie Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Chunxia Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Xiaohui Lin
- School of Computer Science & Technology, Dalian University of Technology, Dalian 116024, P.R. China
| | - Xin Lu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
- Liaoning Province Key Laboratory of Metabolomics, Dalian 116023, P.R. China
| |
Collapse
|
8
|
Gaudêncio SP, Bayram E, Lukić Bilela L, Cueto M, Díaz-Marrero AR, Haznedaroglu BZ, Jimenez C, Mandalakis M, Pereira F, Reyes F, Tasdemir D. Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation. Mar Drugs 2023; 21:md21050308. [PMID: 37233502 DOI: 10.3390/md21050308] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/11/2023] [Accepted: 05/12/2023] [Indexed: 05/27/2023] Open
Abstract
Natural Products (NP) are essential for the discovery of novel drugs and products for numerous biotechnological applications. The NP discovery process is expensive and time-consuming, having as major hurdles dereplication (early identification of known compounds) and structure elucidation, particularly the determination of the absolute configuration of metabolites with stereogenic centers. This review comprehensively focuses on recent technological and instrumental advances, highlighting the development of methods that alleviate these obstacles, paving the way for accelerating NP discovery towards biotechnological applications. Herein, we emphasize the most innovative high-throughput tools and methods for advancing bioactivity screening, NP chemical analysis, dereplication, metabolite profiling, metabolomics, genome sequencing and/or genomics approaches, databases, bioinformatics, chemoinformatics, and three-dimensional NP structure elucidation.
Collapse
Affiliation(s)
- Susana P Gaudêncio
- Associate Laboratory i4HB-Institute for Health and Bioeconomy, NOVA School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal
- UCIBIO-Applied Molecular Biosciences Unit, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
| | - Engin Bayram
- Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
| | - Lada Lukić Bilela
- Department of Biology, Faculty of Science, University of Sarajevo, 71000 Sarajevo, Bosnia and Herzegovina
| | - Mercedes Cueto
- Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain
| | - Ana R Díaz-Marrero
- Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain
- Instituto Universitario de Bio-Orgánica (IUBO), Universidad de La Laguna, 38206 La Laguna, Spain
| | - Berat Z Haznedaroglu
- Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
| | - Carlos Jimenez
- CICA- Centro Interdisciplinar de Química e Bioloxía, Departamento de Química, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
| | - Manolis Mandalakis
- Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research, HCMR Thalassocosmos, 71500 Gournes, Crete, Greece
| | - Florbela Pereira
- LAQV, REQUIMTE, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
| | - Fernando Reyes
- Fundación MEDINA, Avda. del Conocimiento 34, 18016 Armilla, Spain
| | - Deniz Tasdemir
- GEOMAR Centre for Marine Biotechnology (GEOMAR-Biotech), Research Unit Marine Natural Products Chemistry, GEOMAR Helmholtz Centre for Ocean Research Kiel, Am Kiel-Kanal 44, 24106 Kiel, Germany
- Faculty of Mathematics and Natural Science, Kiel University, Christian-Albrechts-Platz 4, 24118 Kiel, Germany
| |
Collapse
|
9
|
Xing S, Shen S, Xu B, Li X, Huan T. BUDDY: molecular formula discovery via bottom-up MS/MS interrogation. Nat Methods 2023:10.1038/s41592-023-01850-x. [PMID: 37055660 DOI: 10.1038/s41592-023-01850-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 03/15/2023] [Indexed: 04/15/2023]
Abstract
A substantial fraction of metabolic features remains undetermined in mass spectrometry (MS)-based metabolomics, and molecular formula annotation is the starting point for unraveling their chemical identities. Here we present bottom-up tandem MS (MS/MS) interrogation, a method for de novo formula annotation. Our approach prioritizes MS/MS-explainable formula candidates, implements machine-learned ranking and offers false discovery rate estimation. Compared with the mathematically exhaustive formula enumeration, our approach shrinks the formula candidate space by 42.8% on average. Method benchmarking on annotation accuracy was systematically carried out on reference MS/MS libraries and real metabolomics datasets. Applied on 155,321 recurrent unidentified spectra, our approach confidently annotated >5,000 novel molecular formulae absent from chemical databases. Beyond the level of individual metabolic features, we combined bottom-up MS/MS interrogation with global optimization to refine formula annotations while revealing peak interrelationships. This approach allowed the systematic annotation of 37 fatty acid amide molecules in human fecal data. All bioinformatics pipelines are available in a standalone software, BUDDY ( https://github.com/HuanLab/BUDDY ).
Collapse
Affiliation(s)
- Shipei Xing
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Sam Shen
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Banghua Xu
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Xiaoxiao Li
- Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Tao Huan
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada.
| |
Collapse
|
10
|
Quantitative challenges and their bioinformatic solutions in mass spectrometry-based metabolomics. Trends Analyt Chem 2023. [DOI: 10.1016/j.trac.2023.117009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/04/2023]
|
11
|
Sahayasheela VJ, Lankadasari MB, Dan VM, Dastager SG, Pandian GN, Sugiyama H. Artificial intelligence in microbial natural product drug discovery: current and emerging role. Nat Prod Rep 2022; 39:2215-2230. [PMID: 36017693 PMCID: PMC9931531 DOI: 10.1039/d2np00035k] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Covering: up to the end of 2022Microorganisms are exceptional sources of a wide array of unique natural products and play a significant role in drug discovery. During the golden era, several life-saving antibiotics and anticancer agents were isolated from microbes; moreover, they are still widely used. However, difficulties in the isolation methods and repeated discoveries of the same molecules have caused a setback in the past. Artificial intelligence (AI) has had a profound impact on various research fields, and its application allows the effective performance of data analyses and predictions. With the advances in omics, it is possible to obtain a wealth of information for the identification, isolation, and target prediction of secondary metabolites. In this review, we discuss drug discovery based on natural products from microorganisms with the help of AI and machine learning.
Collapse
Affiliation(s)
- Vinodh J Sahayasheela
- Department of Chemistry, Graduate School of Science, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-Ku, Kyoto 606-8502, Japan.
| | - Manendra B Lankadasari
- Thoracic Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Vipin Mohan Dan
- Microbiology Division, Jawaharlal Nehru Tropical Botanic Garden and Research Institute, Thiruvananthapuram, Kerala, India
| | - Syed G Dastager
- NCIM Resource Centre, Division of Biochemical Sciences, CSIR - National Chemical Laboratory, Pune, Maharashtra, India
| | - Ganesh N Pandian
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Yoshida-Ushinomaecho, Sakyo-Ku, Kyoto 606-8501, Japan
| | - Hiroshi Sugiyama
- Department of Chemistry, Graduate School of Science, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-Ku, Kyoto 606-8502, Japan.
- Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Yoshida-Ushinomaecho, Sakyo-Ku, Kyoto 606-8501, Japan
| |
Collapse
|
12
|
Cai Y, Zhou Z, Zhu ZJ. Advanced analytical and informatic strategies for metabolite annotation in untargeted metabolomics. Trends Analyt Chem 2022. [DOI: 10.1016/j.trac.2022.116903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
13
|
Bittremieux W, Wang M, Dorrestein PC. The critical role that spectral libraries play in capturing the metabolomics community knowledge. Metabolomics 2022; 18:94. [PMID: 36409434 DOI: 10.1007/s11306-022-01947-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 10/19/2022] [Indexed: 11/22/2022]
Abstract
BACKGROUND Spectral library searching is currently the most common approach for compound annotation in untargeted metabolomics. Spectral libraries applicable to liquid chromatography mass spectrometry have grown in size over the past decade to include hundreds of thousands to millions of mass spectra and tens of thousands of compounds, forming an essential knowledge base for the interpretation of metabolomics experiments. AIM OF REVIEW We describe existing spectral library resources, highlight different strategies for compiling spectral libraries, and discuss quality considerations that should be taken into account when interpreting spectral library searching results. Finally, we describe how spectral libraries are empowering the next generation of machine learning tools in computational metabolomics, and discuss several opportunities for using increasingly accessible large spectral libraries. KEY SCIENTIFIC CONCEPTS OF REVIEW This review focuses on the current state of spectral libraries for untargeted LC-MS/MS based metabolomics. We show how the number of entries in publicly accessible spectral libraries has increased more than 60-fold in the past eight years to aid molecular interpretation and we discuss how the role of spectral libraries in untargeted metabolomics will evolve in the near future.
Collapse
Affiliation(s)
- Wout Bittremieux
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
| | - Mingxun Wang
- Department of Computer Science, University of California Riverside, Riverside, CA, 92507, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, 92093, USA.
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
14
|
Bittremieux W, Schmid R, Huber F, van der Hooft JJJ, Wang M, Dorrestein PC. Comparison of Cosine, Modified Cosine, and Neutral Loss Based Spectrum Alignment For Discovery of Structurally Related Molecules. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2022; 33:1733-1744. [PMID: 35960544 DOI: 10.1021/jasms.2c00153] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Spectrum alignment of tandem mass spectrometry (MS/MS) data using the modified cosine similarity and subsequent visualization as molecular networks have been demonstrated to be a useful strategy to discover analogs of molecules from untargeted MS/MS-based metabolomics experiments. Recently, a neutral loss matching approach has been introduced as an alternative to MS/MS-based molecular networking with an implied performance advantage in finding analogs that cannot be discovered using existing MS/MS spectrum alignment strategies. To comprehensively evaluate the scoring properties of neutral loss matching, the cosine similarity, and the modified cosine similarity, similarity measures of 955 228 peptide MS/MS spectrum pairs and 10 million small molecule MS/MS spectrum pairs were compared. This comparative analysis revealed that the modified cosine similarity outperformed neutral loss matching and the cosine similarity in all cases. The data further indicated that the performance of MS/MS spectrum alignment depends on the location and type of the modification, as well as the chemical compound class of fragmented molecules.
Collapse
Affiliation(s)
- Wout Bittremieux
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Robin Schmid
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Florian Huber
- Centre for Digitalization and Digitality, University of Applied Sciences, 40476 Düsseldorf, Germany
| | - Justin J J van der Hooft
- Bioinformatics Group, Wageningen University, 6708PB Wageningen, The Netherlands
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa
| | - Mingxun Wang
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, California 92093, United States
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
| |
Collapse
|