Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Schymanski EL, Ruttkies C, Krauss M, Brouard C, Kind T, Dührkop K, Allen F, Vaniya A, Verdegem D, Böcker S, Rousu J, Shen H, Tsugawa H, Sajed T, Fiehn O, Ghesquière B, Neumann S. Critical Assessment of Small Molecule Identification 2016: automated methods. J Cheminform 2017;9:22. [PMID: 29086042 PMCID: PMC5368104 DOI: 10.1186/s13321-017-0207-1] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 03/13/2017] [Indexed: 12/30/2022] Open

For:	Schymanski EL, Ruttkies C, Krauss M, Brouard C, Kind T, Dührkop K, Allen F, Vaniya A, Verdegem D, Böcker S, Rousu J, Shen H, Tsugawa H, Sajed T, Fiehn O, Ghesquière B, Neumann S. Critical Assessment of Small Molecule Identification 2016: automated methods. J Cheminform 2017;9:22. [PMID: 29086042 PMCID: PMC5368104 DOI: 10.1186/s13321-017-0207-1] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 03/13/2017] [Indexed: 12/30/2022] Open

Number

Cited by Other Article(s)

Yu T, Chen JM, Liu W, Zhao JQ, Li P, Liu FJ, Jiang Y, Li HJ. In-depth characterization of cycloartane triterpenoids and discovery of species-specific markers from three Cimicifuga species guided by a strategy that integrates in-source fragment elimination, diagnostic ion recognition, and feature-based molecular networking. J Chromatogr A 2024;1728:465015. [PMID: 38821032 DOI: 10.1016/j.chroma.2024.465015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 05/13/2024] [Accepted: 05/21/2024] [Indexed: 06/02/2024]

Samanipour S, Barron LP, van Herwerden D, Praetorius A, Thomas KV, O’Brien JW. Exploring the Chemical Space of the Exposome: How Far Have We Gone? JACS AU 2024;4:2412-2425. [PMID: 39055136 PMCID: PMC11267556 DOI: 10.1021/jacsau.4c00220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 05/29/2024] [Accepted: 05/31/2024] [Indexed: 07/27/2024]

Bui-Thi D, Liu Y, Lippens JL, Laukens K, De Vijlder T. TransExION: a transformer based explainable similarity metric for comparing IONS in tandem mass spectrometry. J Cheminform 2024;16:61. [PMID: 38807166 PMCID: PMC11134763 DOI: 10.1186/s13321-024-00858-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 05/12/2024] [Indexed: 05/30/2024] Open

Kalinski JCJ, Noundou XS, Petras D, Matcher GF, Polyzois A, Aron AT, Gentry EC, Bornman TG, Adams JB, Dorrington RA. Urban and agricultural influences on the coastal dissolved organic matter pool in the Algoa Bay estuaries. CHEMOSPHERE 2024;355:141782. [PMID: 38548083 DOI: 10.1016/j.chemosphere.2024.141782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 02/28/2024] [Accepted: 03/22/2024] [Indexed: 04/08/2024]

Abstract

While anthropogenic pollution is a major threat to aquatic ecosystem health, our knowledge of the presence of xenobiotics in coastal Dissolved Organic Matter (DOM) is still relatively poor. This is especially true for water bodies in the Global South with limited information gained mostly from targeted studies that rely on comparison with authentic standards. In recent years, non-targeted tandem mass spectrometry has emerged as a powerful tool to collectively detect and identify pollutants and biogenic DOM components in the environment, but this approach has yet to be widely utilized for monitoring ecologically important aquatic systems. In this study we compared the DOM composition of Algoa Bay, Eastern Cape, South Africa, and its two estuaries. The Swartkops Estuary is highly urbanized and severely impacted by anthropogenic pollution, while the Sundays Estuary is impacted by commercial agriculture in its catchment. We employed solid-phase extraction followed by liquid chromatography tandem mass spectrometry to annotate more than 200 pharmaceuticals, pesticides, urban xenobiotics, and natural products based on spectral matching. The identification with authentic standards confirmed the presence of methamphetamine, carbamazepine, sulfamethoxazole, N-acetylsulfamethoxazole, imazapyr, caffeine and hexa(methoxymethyl)melamine, and allowed semi-quantitative estimations for annotated xenobiotics. The Swartkops Estuary DOM composition was strongly impacted by features annotated as urban pollutants including pharmaceuticals such as melamines and antiretrovirals. By contrast, the Sundays Estuary exhibited significant enrichment of molecules annotated as agrochemicals widely used in the citrus farming industry, with predicted concentrations for some of them exceeding predicted no-effect concentrations. This study provides new insight into anthropogenic impact on the Algoa Bay system and demonstrates the utility of non-targeted tandem mass spectrometry as a sensitive tool for assessing the health of ecologically important coastal ecosystems and will serve as a valuable foundation for strategizing long-term monitoring efforts.

Collapse

Affiliation(s)

Jarmo-Charles J Kalinski Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa
Xavier Siwe Noundou Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa; Department of Pharmaceutical Sciences, Sefako Makgatho Health Sciences University, Pretoria, South Africa
Daniel Petras Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, USA; Department of Biochemistry, University of California Riverside, Riverside, USA; CMFI Cluster of Excellence, Interfaculty Institute of Microbiology and Medicine, University of Tuebingen, Tuebingen, Germany
Gwynneth F Matcher Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa; South African Institute for Aquatic Biodiversity, 6139, Makhanda, South Africa
Alexandros Polyzois Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa; Boyce Thompson Institute and Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, 14853, United States
Allegra T Aron Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, USA; Department of Chemistry and Biochemistry, University of Denver, Denver, CO, 80210, United States
Emily C Gentry Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, USA; Department of Chemistry, Virginia Tech, Blacksburg, VA, 24061, United States
Thomas G Bornman Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa; South African Environmental Observation Network SAEON, Elwandle Coastal Node, Gqeberha, South Africa; Institute for Coastal and Marine Research, Nelson Mandela University, Gqeberha, South Africa
Janine B Adams DSI/NRF Research Chair, Shallow Water Ecosystems, Department of Botany and Institute for Coastal and Marine Research, Nelson Mandela University, Gqeberha, South Africa; Department of Botany, Institute for Coastal and Marine Research CMR, Nelson Mandela University, Gqeberha, South Africa
Rosemary A Dorrington Department of Biochemistry and Microbiology, Rhodes University, Makhanda, South Africa; South African Institute for Aquatic Biodiversity, 6139, Makhanda, South Africa.

Collapse

van Tetering L, Spies S, Wildeman QDK, Houthuijs KJ, van Outersterp RE, Martens J, Wevers RA, Wishart DS, Berden G, Oomens J. A spectroscopic test suggests that fragment ion structure annotations in MS/MS libraries are frequently incorrect. Commun Chem 2024;7:30. [PMID: 38355930 PMCID: PMC10867025 DOI: 10.1038/s42004-024-01112-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Accepted: 01/22/2024] [Indexed: 02/16/2024] Open

Adalia R, Patel S, Paiva A, Kaufman T, Zamora I, Cai X, Sanjuan G, Shou WZ. Development of a Predictive Multiple Reaction Monitoring (MRM) Model for High-Throughput ADME Analyses Using Learning-to-Rank (LTR) Techniques. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024;35:131-139. [PMID: 38014625 DOI: 10.1021/jasms.3c00363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]

Li S, Bohman B, Flematti GR, Jayatilaka D. Determining the parent and associated fragment formulae in mass spectrometry via the parent subformula graph. J Cheminform 2023;15:104. [PMID: 37936244 PMCID: PMC10631010 DOI: 10.1186/s13321-023-00776-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 10/25/2023] [Indexed: 11/09/2023] Open

Abstract

BACKGROUND

Identifying the molecular formula and fragmentation reactions of an unknown compound from its mass spectrum is crucial in areas such as natural product chemistry and metabolomics. We propose a method for identifying the correct candidate formula of an unidentified natural product from its mass spectrum. The method involves scoring the plausibility of parent candidate formulae based on a parent subformula graph (PSG), and two possible metrics relating to the number of edges in the PSG. This method is applicable to both electron-impact mass spectrometry (EI-MS) and tandem mass spectrometry (MS/MS) data. Additionally, this work introduces the two-dimensional fragmentation plot (2DFP) for visualizing PSGs.

RESULTS

Our results suggest that incorporating information regarding the edges of the PSG results in enhanced performance in correctly identifying parent formulae, in comparison to the more well-accepted "MS/MS score", on the 2016 Computational Assessment of Small Molecule Identification (CASMI 2016) data set (76.3 vs 58.9% correct formula identification) and the Research Centre for Toxic Compounds in the Environment (RECETOX) data set (66.2% vs 59.4% correct formula identification). In the extension of our method to identify the correct candidate formula from complex EI-MS data of semiochemicals, our method again performed better (correct formula appearing in the top 4 candidates in 20/23 vs 7/23 cases) than the MS/MS score, and enables the rapid identification of both the correct parent ion mass and the correct parent formula with minimal expert intervention.

CONCLUSION

Our method reliably identifies the correct parent formula even when the mass information is ambiguous. Furthermore, should parent formula identification be successful, the majority of associated fragment formulae can also be correctly identified. Our method can also identify the parent ion and its associated fragments in EI-MS spectra where the identity of the parent ion is unclear due to low quantities and overlapping compounds. Finally, our method does not inherently require empirical fitting of parameters or statistical learning, meaning it is easy to implement and extend upon.

SCIENTIFIC CONTRIBUTION

Developed, implemented and tested new metrics for assessing plausibility of candidate molecular formulae obtained from HR-MS data.

Collapse

Abram KJ, McCloskey D. In Search of Disentanglement in Tandem Mass Spectrometry Datasets. Biomolecules 2023;13:1343. [PMID: 37759743 PMCID: PMC10526774 DOI: 10.3390/biom13091343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 08/16/2023] [Accepted: 08/25/2023] [Indexed: 09/29/2023] Open

Houthuijs KJ, Berden G, Engelke UFH, Gautam V, Wishart DS, Wevers RA, Martens J, Oomens J. An In Silico Infrared Spectral Library of Molecular Ions for Metabolite Identification. Anal Chem 2023. [PMID: 37262385 DOI: 10.1021/acs.analchem.3c01078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]

Gaudêncio SP, Bayram E, Lukić Bilela L, Cueto M, Díaz-Marrero AR, Haznedaroglu BZ, Jimenez C, Mandalakis M, Pereira F, Reyes F, Tasdemir D. Advanced Methods for Natural Products Discovery: Bioactivity Screening, Dereplication, Metabolomics Profiling, Genomic Sequencing, Databases and Informatic Tools, and Structure Elucidation. Mar Drugs 2023;21:md21050308. [PMID: 37233502 DOI: 10.3390/md21050308] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/11/2023] [Accepted: 05/12/2023] [Indexed: 05/27/2023] Open

Affiliation(s)

Susana P Gaudêncio Associate Laboratory i4HB-Institute for Health and Bioeconomy, NOVA School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal UCIBIO-Applied Molecular Biosciences Unit, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
Engin Bayram Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
Lada Lukić Bilela Department of Biology, Faculty of Science, University of Sarajevo, 71000 Sarajevo, Bosnia and Herzegovina
Mercedes Cueto Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain
Ana R Díaz-Marrero Instituto de Productos Naturales y Agrobiología-CSIC, 38206 La Laguna, Spain Instituto Universitario de Bio-Orgánica (IUBO), Universidad de La Laguna, 38206 La Laguna, Spain
Berat Z Haznedaroglu Institute of Environmental Sciences, Room HKC-202, Hisar Campus, Bogazici University, Bebek, Istanbul 34342, Turkey
Carlos Jimenez CICA- Centro Interdisciplinar de Química e Bioloxía, Departamento de Química, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
Manolis Mandalakis Institute of Marine Biology, Biotechnology and Aquaculture, Hellenic Centre for Marine Research, HCMR Thalassocosmos, 71500 Gournes, Crete, Greece
Florbela Pereira LAQV, REQUIMTE, Chemistry Department, NOVA School of Science and Technology, NOVA University of Lisbon, 2819-516 Caparica, Portugal
Fernando Reyes Fundación MEDINA, Avda. del Conocimiento 34, 18016 Armilla, Spain
Deniz Tasdemir GEOMAR Centre for Marine Biotechnology (GEOMAR-Biotech), Research Unit Marine Natural Products Chemistry, GEOMAR Helmholtz Centre for Ocean Research Kiel, Am Kiel-Kanal 44, 24106 Kiel, Germany Faculty of Mathematics and Natural Science, Kiel University, Christian-Albrechts-Platz 4, 24118 Kiel, Germany

Collapse

Mahood EH, Bennett AA, Komatsu K, Kruse LH, Lau V, Rahmati Ishka M, Jiang Y, Bravo A, Louie K, Bowen BP, Harrison MJ, Provart NJ, Vatamaniuk OK, Moghe GD. Information theory and machine learning illuminate large-scale metabolomic responses of Brachypodium distachyon to environmental change. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023;114:463-481. [PMID: 36880270 DOI: 10.1111/tpj.16160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 02/06/2023] [Accepted: 02/19/2023] [Indexed: 05/10/2023]

Abstract

Plant responses to environmental change are mediated via changes in cellular metabolomes. However, <5% of signals obtained from liquid chromatography tandem mass spectrometry (LC-MS/MS) can be identified, limiting our understanding of how metabolomes change under biotic/abiotic stress. To address this challenge, we performed untargeted LC-MS/MS of leaves, roots, and other organs of Brachypodium distachyon (Poaceae) under 17 organ-condition combinations, including copper deficiency, heat stress, low phosphate, and arbuscular mycorrhizal symbiosis. We found that both leaf and root metabolomes were significantly affected by the growth medium. Leaf metabolomes were more diverse than root metabolomes, but the latter were more specialized and more responsive to environmental change. We found that 1 week of copper deficiency shielded the root, but not the leaf metabolome, from perturbation due to heat stress. Machine learning (ML)-based analysis annotated approximately 81% of the fragmented peaks versus approximately 6% using spectral matches alone. We performed one of the most extensive validations of ML-based peak annotations in plants using thousands of authentic standards, and analyzed approximately 37% of the annotated peaks based on these assessments. Analyzing responsiveness of each predicted metabolite class to environmental change revealed significant perturbations of glycerophospholipids, sphingolipids, and flavonoids. Co-accumulation analysis further identified condition-specific biomarkers. To make these results accessible, we developed a visualization platform on the Bio-Analytic Resource for Plant Biology website (https://bar.utoronto.ca/efp_brachypodium_metabolites/cgi-bin/efpWeb.cgi), where perturbed metabolite classes can be readily visualized. Overall, our study illustrates how emerging chemoinformatic methods can be applied to reveal novel insights into the dynamic plant metabolome and stress adaptation.

Collapse

Borelli TC, Arini GS, Feitosa LGP, Dorrestein PC, Lopes NP, da Silva RR. Improving annotation propagation on molecular networks through random walks: introducing ChemWalker. Bioinformatics 2023;39:7067745. [PMID: 36864626 PMCID: PMC9991053 DOI: 10.1093/bioinformatics/btad078] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Revised: 01/13/2023] [Indexed: 03/04/2023] Open

Abstract

MOTIVATION

Annotation of the mass signals is still the biggest bottleneck for the untargeted mass spectrometry analysis of complex mixtures. Molecular networks are being increasingly adopted by the mass spectrometry community as a tool to annotate large-scale experiments. We have previously shown that the process of propagating annotations from spectral library matches on molecular networks can be automated using Network Annotation Propagation (NAP). One of the limitations of NAP is that the information for the spectral matches is only propagated locally, to the first neighbor of a spectral match. Here, we show that annotation propagation can be expanded to nodes not directly connected to spectral matches using random walks on graphs, introducing the ChemWalker python library.

RESULTS

Similarly to NAP, ChemWalker relies on combinatorial in silico fragmentation results, performed by MetFrag, searching biologically relevant databases. Departing from the combination of a spectral network and the structural similarity among candidate structures, we have used MetFusion Scoring function to create a weight function, producing a weighted graph. This graph was subsequently used by the random walk to calculate the probability of 'walking' through a set of candidates, departing from seed nodes (represented by spectral library matches). This approach allowed the information propagation to nodes not directly connected to the spectral library match. Compared with NAP, ChemWalker has a series of improvements, on running time, scalability and maintainability and is available as a standalone python package.

AVAILABILITY AND IMPLEMENTATION

ChemWalker is freely available at https://github.com/computational-chemical-biology/ChemWalker.

CONTACT

ridasilva@usp.br.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Boelrijk J, van Herwerden D, Ensing B, Forré P, Samanipour S. Predicting RP-LC retention indices of structurally unknown chemicals from mass spectrometry data. J Cheminform 2023;15:28. [PMID: 36829215 PMCID: PMC9960388 DOI: 10.1186/s13321-023-00699-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 02/13/2023] [Indexed: 02/26/2023] Open

MAD HATTER Correctly Annotates 98% of Small Molecule Tandem Mass Spectra Searching in PubChem. Metabolites 2023;13:metabo13030314. [PMID: 36984753 PMCID: PMC10053663 DOI: 10.3390/metabo13030314] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 02/14/2023] [Accepted: 02/15/2023] [Indexed: 02/23/2023] Open

Morehouse NJ, Clark TN, McMann EJ, van Santen JA, Haeckl FPJ, Gray CA, Linington RG. Annotation of natural product compound families using molecular networking topology and structural similarity fingerprinting. Nat Commun 2023;14:308. [PMID: 36658161 PMCID: PMC9852437 DOI: 10.1038/s41467-022-35734-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/20/2022] [Indexed: 01/20/2023] Open

Gomes PWP, de Tralia Medeiros TC, Maimone NM, Leão TF, de Moraes LAB, Bauermeister A. Microbial Metabolites Annotation by Mass Spectrometry-Based Metabolomics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023;1439:225-248. [PMID: 37843811 DOI: 10.1007/978-3-031-41741-2_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]

Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00577-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Menger F, Celma A, Schymanski EL, Lai FY, Bijlsma L, Wiberg K, Hernández F, Sancho JV, Ahrens L. Enhancing spectral quality in complex environmental matrices: Supporting suspect and non-target screening in zebra mussels with ion mobility. ENVIRONMENT INTERNATIONAL 2022;170:107585. [PMID: 36265356 DOI: 10.1016/j.envint.2022.107585] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 10/11/2022] [Accepted: 10/13/2022] [Indexed: 06/16/2023]

Cai Y, Zhou Z, Zhu ZJ. Advanced analytical and informatic strategies for metabolite annotation in untargeted metabolomics. Trends Analyt Chem 2022. [DOI: 10.1016/j.trac.2022.116903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Sakurai N, Yamazaki S, Suda K, Hosoki A, Akimoto N, Takahashi H, Shibata D, Aoki Y. The Thing Metabolome Repository family (XMRs): comparable untargeted metabolome databases for analyzing sample-specific unknown metabolites. Nucleic Acids Res 2022;51:D660-D677. [PMID: 36417935 PMCID: PMC9825447 DOI: 10.1093/nar/gkac1058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 10/21/2022] [Accepted: 10/25/2022] [Indexed: 11/25/2022] Open

LC-DAD-ESI-MS/MS and NMR Analysis of Conifer Wood Specialized Metabolites. Cells 2022;11:cells11203332. [PMID: 36291197 PMCID: PMC9600761 DOI: 10.3390/cells11203332] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 10/13/2022] [Accepted: 10/20/2022] [Indexed: 11/16/2022] Open

Ljoncheva M, Stepišnik T, Kosjek T, Džeroski S. Machine learning for identification of silylated derivatives from mass spectra. J Cheminform 2022;14:62. [PMID: 36109826 PMCID: PMC9476372 DOI: 10.1186/s13321-022-00636-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 07/31/2022] [Indexed: 11/10/2022] Open

Abstract Abstract Motivation Compound structure identification is using increasingly more sophisticated computational tools, among which machine learning tools are a recent addition that quickly gains in importance. These tools, of which the method titled Compound Structure Identification:Input Output Kernel Regression (CSI:IOKR) is an excellent example, have been used to elucidate compound structure from mass spectral (MS) data with significant accuracy, confidence and speed. They have, however, largely focused on data coming from liquid chromatography coupled to tandem mass spectrometry (LC–MS). Gas chromatography coupled to mass spectrometry (GC–MS) is an alternative which offers several advantages as compared to LC–MS, including higher data reproducibility. Of special importance is the substantial compound coverage offered by GC–MS, further expanded by derivatization procedures, such as silylation, which can improve the volatility, thermal stability and chromatographic peak shape of semi-volatile analytes. Despite these advantages and the increasing size of compound databases and MS libraries, GC–MS data have not yet been used by machine learning approaches to compound structure identification. Results This study presents a successful application of the CSI:IOKR machine learning method for the identification of environmental contaminants from GC–MS spectra. We use CSI:IOKR as an alternative to exhaustive search of MS libraries, independent of instrumental platform and data processing software. We use a comprehensive dataset of GC–MS spectra of trimethylsilyl derivatives and their molecular structures, derived from a large commercially available MS library, to train a model that maps between spectra and molecular structures. We test the learned model on a different dataset of GC–MS spectra of trimethylsilyl derivatives of environmental contaminants, generated in-house and made publicly available. The results show that 37% (resp. 50%) of the tested compounds are correctly ranked among the top 10 (resp. 20) candidate compounds suggested by the model. Even though spectral comparisons with reference standards or de novo structural elucidations are neccessary to validate the predictions, machine learning provides efficient candidate prioritization and reduction of the time spent for compound annotation. Collapse

Bremer PL, Vaniya A, Kind T, Wang S, Fiehn O. How Well Can We Predict Mass Spectra from Structures? Benchmarking Competitive Fragmentation Modeling for Metabolite Identification on Untrained Tandem Mass Spectra. J Chem Inf Model 2022;62:4049-4056. [PMID: 36043939 DOI: 10.1021/acs.jcim.2c00936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

MSNovelist: de novo structure generation from mass spectra. Nat Methods 2022;19:865-870. [PMID: 35637304 PMCID: PMC9262714 DOI: 10.1038/s41592-022-01486-3] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 04/07/2022] [Indexed: 12/29/2022]

Sussman EM, Oktem B, Isayeva IS, Liu J, Wickramasekara S, Chandrasekar V, Nahan K, Shin HY, Zheng J. Chemical Characterization and Non-targeted Analysis of Medical Device Extracts: A Review of Current Approaches, Gaps, and Emerging Practices. ACS Biomater Sci Eng 2022;8:939-963. [PMID: 35171560 DOI: 10.1021/acsbiomaterials.1c01119] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Abstract

The developers of medical devices evaluate the biocompatibility of their device prior to FDA's review and subsequent introduction to the market. Chemical characterization, described in ISO 10993-18:2020, can generate information for toxicological risk assessment and is an alternative approach for addressing some biocompatibility end points (e.g., systemic toxicity, genotoxicity, carcinogenicity, reproductive/developmental toxicity) that can reduce the time and cost of testing and the need for animal testing. Additionally, chemical characterization can be used to determine whether modifications to the materials and manufacturing processes alter the chemistry of a patient-contacting device to an extent that could impact device safety. Extractables testing is one approach to chemical characterization that employs combinations of non-targeted analysis, non-targeted screening, and/or targeted analysis to establish the identities and quantities of the various chemical constituents that can be released from a device. Due to the difficulty in obtaining a priori information on all the constituents in finished devices, information generation strategies in the form of analytical chemistry testing are often used. Identified and quantified extractables are then assessed using toxicological risk assessment approaches to determine if reported quantities are sufficiently low to overcome the need for further chemical analysis, biological evaluation of select end points, or risk control. For extractables studies to be useful as a screening tool, comprehensive and reliable non-targeted methods are needed. Although non-targeted methods have been adopted by many laboratories, they are laboratory-specific and require expensive analytical instruments and advanced technical expertise to perform. In this Perspective, we describe the elements of extractables studies and provide an overview of the current practices, identified gaps, and emerging practices that may be adopted on a wider scale in the future. This Perspective is outlined according to the steps of an extractables study: information gathering, extraction, extract sample processing, system selection, qualification, quantification, and identification.

Collapse

Wasito H, Causon T, Hann S. Alternating in-source fragmentation with single-stage high-resolution mass spectrometry with high annotation confidence in non-targeted metabolomics. Talanta 2022;236:122828. [PMID: 34635218 DOI: 10.1016/j.talanta.2021.122828] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 02/07/2023]

Dührkop K. OUP accepted manuscript. Bioinformatics 2022;38:i342-i349. [PMID: 35758813 PMCID: PMC9235503 DOI: 10.1093/bioinformatics/btac260] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open

Place BJ, Ulrich EM, Challis JK, Chao A, Du B, Favela K, Feng YL, Fisher CM, Gardinali P, Hood A, Knolhoff AM, McEachran AD, Nason SL, Newton SR, Ng B, Nuñez J, Peter KT, Phillips AL, Quinete N, Renslow R, Sobus JR, Sussman EM, Warth B, Wickramasekara S, Williams AJ. An Introduction to the Benchmarking and Publications for Non-Targeted Analysis Working Group. Anal Chem 2021;93:16289-16296. [PMID: 34842413 PMCID: PMC8848292 DOI: 10.1021/acs.analchem.1c02660] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Affiliation(s)

Benjamin J. Place National Institute of Standards and Technology, Gaithersburg, MD, USA 20899,*Corresponding author,
Elin M. Ulrich U.S. Environmental Protection Agency, Office of Research and Development, Center for Computational Toxicology and Exposure, Research Triangle Park, NC, USA 27711
Jonathan K. Challis Toxicology Centre, University of Saskatchewan, Saskatoon, Canada S7N 5B3
Alex Chao U.S. Environmental Protection Agency, Office of Research and Development, Center for Computational Toxicology and Exposure, Research Triangle Park, NC, USA 27711
Bowen Du Southern California Coastal Water Research Project Authority, Costa Mesa, CA, USA 92626
Kristin Favela Southwest Research Institute, San Antonio, TX, USA 78238
Yong-Lai Feng Exposure and Biomonitoring Division, Environmental Health Science and Research Bureau, Health Canada, Ottawa, Ontario, Canada, K1A 0K9
Christine M. Fisher U.S. Food and Drug Administration, Center for Food Safety and Applied Nutrition, College Park, MD, USA 20740
Piero Gardinali Institute of Environment & Department of Chemistry and Biochemistry, Florida International University, North Miami, FL 33181
Alan Hood U.S. Food and Drug Administration, Center for Devices and Radiological Health, Silver Spring, MD, USA 20993
Ann M. Knolhoff U.S. Food and Drug Administration, Center for Food Safety and Applied Nutrition, College Park, MD, USA 20740
Andrew D. McEachran Agilent Technologies, Inc. Santa Clara, CA, USA 95051
Sara L. Nason Connecticut Agricultural Experiment Station, New Haven, CT, USA 06511
Seth R. Newton U.S. Environmental Protection Agency, Office of Research and Development, Center for Computational Toxicology and Exposure, Research Triangle Park, NC, USA 27711
Brian Ng Institute of Environment & Department of Chemistry and Biochemistry, Florida International University, North Miami, FL 33181
Jamie Nuñez Pacific Northwest National Laboratory, Richland, WA, USA 99352
Katherine T. Peter National Institute of Standards and Technology, Charleston, SC, USA 29412
Allison L. Phillips U.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, Research Triangle Park, NC, USA 27711
Natalia Quinete Institute of Environment & Department of Chemistry and Biochemistry, Florida International University, North Miami, FL 33181
Ryan Renslow Pacific Northwest National Laboratory, Richland, WA, USA 99352
Jon R. Sobus U.S. Environmental Protection Agency, Office of Research and Development, Center for Computational Toxicology and Exposure, Research Triangle Park, NC, USA 27711
Eric M. Sussman U.S. Food and Drug Administration, Center for Devices and Radiological Health, Silver Spring, MD, USA 20993
Benedikt Warth Department of Food Chemistry and Toxicology, Faculty of Chemistry, University of Vienna, 1090 Vienna, Austria
Samanthi Wickramasekara U.S. Food and Drug Administration, Center for Devices and Radiological Health, Silver Spring, MD, USA 20993
Antony J. Williams U.S. Environmental Protection Agency, Office of Research and Development, Center for Computational Toxicology and Exposure, Research Triangle Park, NC, USA 27711

Collapse

Beniddir MA, Kang KB, Genta-Jouve G, Huber F, Rogers S, van der Hooft JJJ. Advances in decomposing complex metabolite mixtures using substructure- and network-based computational metabolomics approaches. Nat Prod Rep 2021;38:1967-1993. [PMID: 34821250 PMCID: PMC8597898 DOI: 10.1039/d1np00023c] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Indexed: 12/13/2022]

Abstract

Covering: up to the end of 2020Recently introduced computational metabolome mining tools have started to positively impact the chemical and biological interpretation of untargeted metabolomics analyses. We believe that these current advances make it possible to start decomposing complex metabolite mixtures into substructure and chemical class information, thereby supporting pivotal tasks in metabolomics analysis including metabolite annotation, the comparison of metabolic profiles, and network analyses. In this review, we highlight and explain key tools and emerging strategies covering 2015 up to the end of 2020. The majority of these tools aim at processing and analyzing liquid chromatography coupled to mass spectrometry fragmentation data. We start with defining what substructures are, how they relate to molecular fingerprints, and how recognizing them helps to decompose complex mixtures. We continue with chemical classes that are based on the presence or absence of particular molecular scaffolds and/or functional groups and are thus intrinsically related to substructures. We discuss novel tools to mine substructures, annotate chemical compound classes, and create mass spectral networks from metabolomics data and demonstrate them using two case studies. We also review and speculate about the opportunities that NMR spectroscopy-based metabolome mining of complex metabolite mixtures offers to discover substructures and chemical classes. Finally, we will describe the main benefits and limitations of the current tools and strategies that rely on them, and our vision on how this exciting field can develop toward repository-scale-sized metabolomics analyses. Complementary sources of structural information from genomics analyses and well-curated taxonomic records are also discussed. Many research fields such as natural products discovery, pharmacokinetic and drug metabolism studies, and environmental metabolomics increasingly rely on untargeted metabolomics to gain biochemical and biological insights. The here described technical advances will benefit all those metabolomics disciplines by transforming spectral data into knowledge that can answer biological questions.

Collapse

Tsugawa H, Rai A, Saito K, Nakabayashi R. Metabolomics and complementary techniques to investigate the plant phytochemical cosmos. Nat Prod Rep 2021;38:1729-1759. [PMID: 34668509 DOI: 10.1039/d1np00014d] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

High-confidence structural annotation of metabolites absent from spectral libraries. Nat Biotechnol 2021;40:411-421. [PMID: 34650271 PMCID: PMC8926923 DOI: 10.1038/s41587-021-01045-9] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 08/04/2021] [Indexed: 12/14/2022]

Wang F, Liigand J, Tian S, Arndt D, Greiner R, Wishart DS. CFM-ID 4.0: More Accurate ESI-MS/MS Spectral Prediction and Compound Identification. Anal Chem 2021;93:11692-11700. [PMID: 34403256 DOI: 10.1021/acs.analchem.1c01465] [Citation(s) in RCA: 122] [Impact Index Per Article: 40.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Bach E, Rogers S, Williamson J, Rousu J. Probabilistic framework for integration of mass spectrum and retention time information in small molecule identification. Bioinformatics 2021;37:1724-1731. [PMID: 33244585 PMCID: PMC8289373 DOI: 10.1093/bioinformatics/btaa998] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 10/27/2020] [Accepted: 11/17/2020] [Indexed: 11/14/2022] Open

Li D, Gaquerel E. Next-Generation Mass Spectrometry Metabolomics Revives the Functional Analysis of Plant Metabolic Diversity. ANNUAL REVIEW OF PLANT BIOLOGY 2021;72:867-891. [PMID: 33781077 DOI: 10.1146/annurev-arplant-071720-114836] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

González-Gaya B, Lopez-Herguedas N, Bilbao D, Mijangos L, Iker AM, Etxebarria N, Irazola M, Prieto A, Olivares M, Zuloaga O. Suspect and non-target screening: the last frontier in environmental analysis. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2021;13:1876-1904. [PMID: 33913946 DOI: 10.1039/d1ay00111f] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Dührkop K, Nothias LF, Fleischauer M, Reher R, Ludwig M, Hoffmann MA, Petras D, Gerwick WH, Rousu J, Dorrestein PC, Böcker S. Systematic classification of unknown metabolites using high-resolution fragmentation mass spectra. Nat Biotechnol 2021;39:462-471. [PMID: 33230292 DOI: 10.1038/s41587-020-0740-8] [Citation(s) in RCA: 252] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 10/16/2020] [Indexed: 12/12/2022]

Krettler CA, Thallinger GG. A map of mass spectrometry-based in silico fragmentation prediction and compound identification in metabolomics. Brief Bioinform 2021;22:6184408. [PMID: 33758925 DOI: 10.1093/bib/bbab073] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 01/29/2021] [Accepted: 02/12/2021] [Indexed: 12/27/2022] Open

Peters K, Balcke G, Kleinenkuhnen N, Treutler H, Neumann S. Untargeted In Silico Compound Classification-A Novel Metabolomics Method to Assess the Chemodiversity in Bryophytes. Int J Mol Sci 2021;22:ijms22063251. [PMID: 33806786 PMCID: PMC8005083 DOI: 10.3390/ijms22063251] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 03/16/2021] [Accepted: 03/18/2021] [Indexed: 12/29/2022] Open

Abstract

In plant ecology, biochemical analyses of bryophytes and vascular plants are often conducted on dried herbarium specimen as species typically grow in distant and inaccessible locations. Here, we present an automated in silico compound classification framework to annotate metabolites using an untargeted data independent acquisition (DIA)–LC/MS–QToF-sequential windowed acquisition of all theoretical fragment ion mass spectra (SWATH) ecometabolomics analytical method. We perform a comparative investigation of the chemical diversity at the global level and the composition of metabolite families in ten different species of bryophytes using fresh samples collected on-site and dried specimen stored in a herbarium for half a year. Shannon and Pielou’s diversity indices, hierarchical clustering analysis (HCA), sparse partial least squares discriminant analysis (sPLS-DA), distance-based redundancy analysis (dbRDA), ANOVA with post-hoc Tukey honestly significant difference (HSD) test, and the Fisher’s exact test were used to determine differences in the richness and composition of metabolite families, with regard to herbarium conditions, ecological characteristics, and species. We functionally annotated metabolite families to biochemical processes related to the structural integrity of membranes and cell walls (proto-lignin, glycerophospholipids, carbohydrates), chemical defense (polyphenols, steroids), reactive oxygen species (ROS) protection (alkaloids, amino acids, flavonoids), nutrition (nitrogen- and phosphate-containing glycerophospholipids), and photosynthesis. Changes in the composition of metabolite families also explained variance related to ecological functioning like physiological adaptations of bryophytes to dry environments (proteins, peptides, flavonoids, terpenes), light availability (flavonoids, terpenes, carbohydrates), temperature (flavonoids), and biotic interactions (steroids, terpenes). The results from this study allow to construct chemical traits that can be attributed to biogeochemistry, habitat conditions, environmental changes and biotic interactions. Our classification framework accelerates the complex annotation process in metabolomics and can be used to simplify biochemical patterns. We show that compound classification is a powerful tool that allows to explore relationships in both molecular biology by “zooming in” and in ecology by “zooming out”. The insights revealed by our framework allow to construct new research hypotheses and to enable detailed follow-up studies.

Collapse

Schymanski EL, Kondić T, Neumann S, Thiessen PA, Zhang J, Bolton EE. Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag. J Cheminform 2021;13:19. [PMID: 33685519 PMCID: PMC7938590 DOI: 10.1186/s13321-021-00489-0] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 01/13/2021] [Accepted: 01/22/2021] [Indexed: 12/31/2022] Open

Abstract

Compound (or chemical) databases are an invaluable resource for many scientific disciplines. Exposomics researchers need to find and identify relevant chemicals that cover the entirety of potential (chemical and other) exposures over entire lifetimes. This daunting task, with over 100 million chemicals in the largest chemical databases, coupled with broadly acknowledged knowledge gaps in these resources, leaves researchers faced with too much-yet not enough-information at the same time to perform comprehensive exposomics research. Furthermore, the improvements in analytical technologies and computational mass spectrometry workflows coupled with the rapid growth in databases and increasing demand for high throughput "big data" services from the research community present significant challenges for both data hosts and workflow developers. This article explores how to reduce candidate search spaces in non-target small molecule identification workflows, while increasing content usability in the context of environmental and exposomics analyses, so as to profit from the increasing size and information content of large compound databases, while increasing efficiency at the same time. In this article, these methods are explored using PubChem, the NORMAN Network Suspect List Exchange and the in silico fragmentation approach MetFrag. A subset of the PubChem database relevant for exposomics, PubChemLite, is presented as a database resource that can be (and has been) integrated into current workflows for high resolution mass spectrometry. Benchmarking datasets from earlier publications are used to show how experimental knowledge and existing datasets can be used to detect and fill gaps in compound databases to progressively improve large resources such as PubChem, and topic-specific subsets such as PubChemLite. PubChemLite is a living collection, updating as annotation content in PubChem is updated, and exported to allow direct integration into existing workflows such as MetFrag. The source code and files necessary to recreate or adjust this are jointly hosted between the research parties (see data availability statement). This effort shows that enhancing the FAIRness (Findability, Accessibility, Interoperability and Reusability) of open resources can mutually enhance several resources for whole community benefit. The authors explicitly welcome additional community input on ideas for future developments.

Collapse

Data processing strategies for non-targeted analysis of foods using liquid chromatography/high-resolution mass spectrometry. Trends Analyt Chem 2021. [DOI: 10.1016/j.trac.2021.116188] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Chemically informed analyses of metabolomics mass spectrometry data with Qemistree. Nat Chem Biol 2021;17:146-151. [PMID: 33199911 PMCID: PMC8189545 DOI: 10.1038/s41589-020-00677-3] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 09/18/2020] [Indexed: 01/28/2023]

Dueñas ME, Lee YJ. Single-Cell Metabolomics by Mass Spectrometry Imaging. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2021;1280:69-82. [PMID: 33791975 DOI: 10.1007/978-3-030-51652-9_5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Rodrigues JF, Florea L, de Oliveira MCF, Diamond D, Oliveira ON. Big data and machine learning for materials science. DISCOVER MATERIALS 2021;1:12. [PMID: 33899049 PMCID: PMC8054236 DOI: 10.1007/s43939-021-00012-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/01/2021] [Indexed: 05/11/2023]

Xing S, Hu Y, Yin Z, Liu M, Tang X, Fang M, Huan T. Retrieving and Utilizing Hypothetical Neutral Losses from Tandem Mass Spectra for Spectral Similarity Analysis and Unknown Metabolite Annotation. Anal Chem 2020;92:14476-14483. [DOI: 10.1021/acs.analchem.0c02521] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Ludwig M, Nothias LF, Dührkop K, Koester I, Fleischauer M, Hoffmann MA, Petras D, Vargas F, Morsy M, Aluwihare L, Dorrestein PC, Böcker S. Database-independent molecular formula annotation using Gibbs sampling through ZODIAC. NAT MACH INTELL 2020. [DOI: 10.1038/s42256-020-00234-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Fan Z, Alley A, Ghaffari K, Ressom HW. MetFID: artificial neural network-based compound fingerprint prediction for metabolite annotation. Metabolomics 2020;16:104. [PMID: 32997169 PMCID: PMC9547616 DOI: 10.1007/s11306-020-01726-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Accepted: 09/19/2020] [Indexed: 12/11/2022]

Abstract

INTRODUCTION

Metabolite annotation is a critical and challenging step in mass spectrometry-based metabolomic profiling. In a typical untargeted MS/MS-based metabolomic study, experimental MS/MS spectra are matched against those in spectral libraries for metabolite annotation. Yet, existing spectral libraries comprise merely a marginal percentage of known compounds.

OBJECTIVE

The objective is to develop a method that helps rank putative metabolite IDs for analytes whose reference MS/MS spectra are not present in spectral libraries.

METHODS

We introduce MetFID, which uses an artificial neural network (ANN) trained for predicting molecular fingerprints based on experimental MS/MS data. To narrow the search space, MetFID retrieves candidates from metabolite databases using molecular formula or m/z value of the precursor ions of the analytes. The candidate whose fingerprint is most analogous to the predicted fingerprint is used for metabolite annotation. A comprehensive evaluation was performed by training MetFID using MS/MS spectra from the MoNA repository and NIST library and by testing with structure-disjoint MS/MS spectra from the NIST library, the CASMI 2016 dataset, and in-house MS/MS data from a cancer biomarker discovery study.

RESULTS

We observed that training separate models for distinct ranges of collision energies enhanced model performance compared to a single model that covers a wide range of collision energies. Using MetaboQuest to retrieve candidates, MetFID prioritized the correct putative ID in the first place rank for about 50% of the testing cases. Through the independent testing dataset, we demonstrated that MetFID has the potential to improve the accuracy of ranking putative metabolite IDs by more than 5% compared to other tools such as ChemDistiller, CSI:FingerID, and MetFrag.

CONCLUSION

MetFID offers a promising opportunity to enhance the accuracy of metabolite annotation by using ANN for molecular fingerprint prediction.

Collapse

Li Y, Kuhn M, Gavin AC, Bork P. Identification of metabolites from tandem mass spectra with a machine learning approach utilizing structural features. Bioinformatics 2020;36:1213-1218. [PMID: 31605112 PMCID: PMC7703789 DOI: 10.1093/bioinformatics/btz736] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 07/30/2019] [Accepted: 09/25/2019] [Indexed: 01/11/2023] Open

Application of High Resolution Mass Spectrometric methods coupled with chemometric techniques in olive oil authenticity studies - A review. Anal Chim Acta 2020;1134:150-173. [PMID: 33059861 DOI: 10.1016/j.aca.2020.07.029] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 07/13/2020] [Accepted: 07/14/2020] [Indexed: 12/21/2022]

Abstract

Extra Virgin Olive Oil (EVOO), the emblematic food of the Mediterranean diet, is recognized for its nutritional value and beneficial health effects. The main authenticity issues associated with EVOO's quality involve the organoleptic properties (EVOO or defective), mislabeling of production type (organic or conventional), variety and geographical origin, and adulteration. Currently, there is an emerging need to characterize EVOOs and evaluate their genuineness. This can be achieved through the development of analytical methodologies applying advanced "omics" technologies and the investigation of EVOOs chemical fingerprints. The objective of this review is to demonstrate the analytical performance of High Resolution Mass Spectrometry (HRMS) in the field of food authenticity assessment, allowing the determination of a wide range of food constituents with exceptional identification capabilities. HRMS-based workflows used for the investigation of critical olive oil authenticity issues are presented and discussed, combined with advanced data processing, comprehensive data mining and chemometric tools. The use of unsupervised classification tools, such as Principal Component Analysis (PCA) and Hierarchical Clustering Analysis (HCA), as well as supervised classification techniques, including Linear Discriminant Analysis (LDA), Support Vector Machine (SVM), Partial Least Square Discriminant Analysis (PLS-DA), Orthogonal Projection to Latent Structure-Discriminant Analysis (OPLS-DA), Counter Propagation Artificial Neural Networks (CP-ANNs), Self-Organizing Maps (SOMs) and Random Forest (RF) is summarized. The combination of HRMS methodologies with chemometrics improves the quality and reliability of the conclusions from experimental data (profile or fingerprints), provides valuable information suggesting potential authenticity markers and is widely applied in food authenticity studies.

Collapse

Senan O, Aguilar-Mogas A, Navarro M, Capellades J, Noon L, Burks D, Yanes O, Guimerà R, Sales-Pardo M. CliqueMS: a computational tool for annotating in-source metabolite ions from LC-MS untargeted metabolomics data based on a coelution similarity network. Bioinformatics 2020;35:4089-4097. [PMID: 30903689 PMCID: PMC6792096 DOI: 10.1093/bioinformatics/btz207] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 01/30/2019] [Accepted: 03/21/2019] [Indexed: 11/26/2022] Open

McEachran AD, Chao A, Al-Ghoul H, Lowe C, Grulke C, Sobus JR, Williams AJ. Revisiting Five Years of CASMI Contests with EPA Identification Tools. Metabolites 2020;10:E260. [PMID: 32585902 PMCID: PMC7345619 DOI: 10.3390/metabo10060260] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 06/03/2020] [Accepted: 06/17/2020] [Indexed: 01/02/2023] Open