1
|
Liu Y, Yoshizawa AC, Ling Y, Okuda S. Insights into predicting small molecule retention times in liquid chromatography using deep learning. J Cheminform 2024; 16:113. [PMID: 39375739 PMCID: PMC11460055 DOI: 10.1186/s13321-024-00905-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 09/13/2024] [Indexed: 10/09/2024] Open
Abstract
In untargeted metabolomics, structures of small molecules are annotated using liquid chromatography-mass spectrometry by leveraging information from the molecular retention time (RT) in the chromatogram and m/z (formerly called ''mass-to-charge ratio'') in the mass spectrum. However, correct identification of metabolites is challenging due to the vast array of small molecules. Therefore, various in silico tools for mass spectrometry peak alignment and compound prediction have been developed; however, the list of candidate compounds remains extensive. Accurate RT prediction is important to exclude false candidates and facilitate metabolite annotation. Recent advancements in artificial intelligence (AI) have led to significant breakthroughs in the use of deep learning models in various fields. Release of a large RT dataset has mitigated the bottlenecks limiting the application of deep learning models, thereby improving their application in RT prediction tasks. This review lists the databases that can be used to expand training datasets and concerns the issue about molecular representation inconsistencies in datasets. It also discusses the application of AI technology for RT prediction, particularly in the 5 years following the release of the METLIN small molecule RT dataset. This review provides a comprehensive overview of the AI applications used for RT prediction, highlighting the progress and remaining challenges. SCIENTIFIC CONTRIBUTION: This article focuses on the advancements in small molecule retention time prediction in computational metabolomics over the past five years, with a particular emphasis on the application of AI technologies in this field. It reviews the publicly available datasets for small molecule retention time, the molecular representation methods, the AI algorithms applied in recent studies. Furthermore, it discusses the effectiveness of these models in assisting with the annotation of small molecule structures and the challenges that must be addressed to achieve practical applications.
Collapse
Affiliation(s)
- Yuting Liu
- Medical AI Center, Niigata University School of Medicine, Niigata City, Niigata, 951-8514, Japan
| | - Akiyasu C Yoshizawa
- Medical AI Center, Niigata University School of Medicine, Niigata City, Niigata, 951-8514, Japan
| | - Yiwei Ling
- Medical AI Center, Niigata University School of Medicine, Niigata City, Niigata, 951-8514, Japan
| | - Shujiro Okuda
- Medical AI Center, Niigata University School of Medicine, Niigata City, Niigata, 951-8514, Japan.
| |
Collapse
|
2
|
Xue J, Wang B, Ji H, Li W. RT-Transformer: retention time prediction for metabolite annotation to assist in metabolite identification. Bioinformatics 2024; 40:btae084. [PMID: 38402516 PMCID: PMC10914443 DOI: 10.1093/bioinformatics/btae084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 01/14/2024] [Accepted: 02/22/2024] [Indexed: 02/26/2024] Open
Abstract
MOTIVATION Liquid chromatography retention times prediction can assist in metabolite identification, which is a critical task and challenge in nontargeted metabolomics. However, different chromatographic conditions may result in different retention times for the same metabolite. Current retention time prediction methods lack sufficient scalability to transfer from one specific chromatographic method to another. RESULTS Therefore, we present RT-Transformer, a novel deep neural network model coupled with graph attention network and 1D-Transformer, which can predict retention times under any chromatographic methods. First, we obtain a pre-trained model by training RT-Transformer on the large small molecule retention time dataset containing 80 038 molecules, and then transfer the resulting model to different chromatographic methods based on transfer learning. When tested on the small molecule retention time dataset, as other authors did, the average absolute error reached 27.30 after removing not retained molecules. Still, it reached 33.41 when no samples were removed. The pre-trained RT-Transformer was further transferred to 5 datasets corresponding to different chromatographic conditions and fine-tuned. According to the experimental results, RT-Transformer achieves competitive performance compared to state-of-the-art methods. In addition, RT-Transformer was applied to 41 external molecular retention time datasets. Extensive evaluations indicate that RT-Transformer has excellent scalability in predicting retention times for liquid chromatography and improves the accuracy of metabolite identification. AVAILABILITY AND IMPLEMENTATION The source code for the model is available at https://github.com/01dadada/RT-Transformer. The web server is available at https://huggingface.co/spaces/Xue-Jun/RT-Transformer.
Collapse
Affiliation(s)
- Jun Xue
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan 650500, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - Bingyi Wang
- Yunnan Police College, Kunming, Yunnan 650223, China
- Key Laboratory of Smart Drugs Control (Yunnan Police College), Ministry of Education, Kunming, Yunnan 650223, China
| | - Hongchao Ji
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518120, China
| | - WeiHua Li
- School of Information Science and Engineering, Yunnan University, Kunming, Yunnan 650500, China
| |
Collapse
|
3
|
Singh YR, Shah DB, Kulkarni M, Patel SR, Maheshwari DG, Shah JS, Shah S. Current trends in chromatographic prediction using artificial intelligence and machine learning. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2023; 15:2785-2797. [PMID: 37264667 DOI: 10.1039/d3ay00362k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) gained tremendous growth and are rapidly becoming popular in various fields of prediction due to their potential abilities, accuracy, and speed. Machine learning algorithms employ historical data to analyze or predict information using patterns or trends. AI and ML were most employed in chromatographic predictions and particularly attractive options for liquid chromatography method development, as they can help achieve desired results faster, more accurately, and more efficiently. This review aims at exploring various AI and ML models employed in the determination of chromatographic characteristics. This review also aims to provide deep insight into reported artificial neural network (ANN) associated techniques which maintained better accuracy and significant possibilities for chromatographic characteristics prediction in liquid chromatography over classical linear models and also emphasizes the integration of a fuzzy system with an ANN, as this integrated study provides more efficient and accurate methods in chromatographic prediction than other linear models. This study also focuses on the retention prediction of a target molecule employing QSRR methodology combined with an ANN, highlighting a more effective technique than the QSRR alone. This approach showed the benefits of combining AI or ML algorithms with the QSRR to obtain more accurate retention predictions, emphasizing the potential of artificial intelligence and machine learning for overcoming adversities in analytical chemistry.
Collapse
Affiliation(s)
- Yash Raj Singh
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Darshil B Shah
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Mangesh Kulkarni
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Shreyanshu R Patel
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Dilip G Maheshwari
- Department of Pharmaceutical Quality Assurance, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Jignesh S Shah
- Department of Pharmaceutical Regulatory Affairs, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| | - Shreeraj Shah
- Department of Pharmaceutical Technology, LJ Institute of Pharmacy, LJ University, Ahmedabad, Gujarat, India
| |
Collapse
|
4
|
Qin S, Yan F, E S, Xiong P, Tang S, Yu K, Zhang M, Cheng Y, Cai W. Comprehensive characterization of multiple components of Ziziphus jujuba Mill using UHPLC-Q-Exactive Orbitrap Mass Spectrometers. Food Sci Nutr 2022; 10:4270-4295. [PMID: 36514751 PMCID: PMC9731542 DOI: 10.1002/fsn3.3020] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 07/19/2022] [Accepted: 07/24/2022] [Indexed: 12/16/2022] Open
Abstract
Ziziphus jujuba Mill is the dried ripe fruit of the Rhamnaceae family; it is widely distributed in Shandong, Henan, Liaoning, and other places in China. In folk medicine, it was used to restore vital energy, as a blood tonic, and for the treatment of spleen deficiency. To date, a complete investigation of the compounds of Z. jujuba has rarely been performed. Therefore, a reliable strategy based on UHPLC-Q-Exactive Orbitrap MS, combined with trace data acquisition mode (parallel reaction monitoring scanning, PRM) and multiple data processing methods, is necessary for the characterization of compounds in the Z. jujuba. Ultimately, 295 compounds, including 69 flavonoids, 60 alkaloids, 82 phenylpropanoids, 52 organic acids, and 32 other components, were identified in the Z. jujuba; of these, 270 have been reported in Z. jujuba for the first time. This study provides deep insights into the chemistry of Z. jujuba and could be useful for further studies aimed at identifying the factors contributing to the health benefits attributed to this fruit.
Collapse
Affiliation(s)
- Shi‐han Qin
- School of PharmacyWeifang Medical UniversityWeifangChina
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Fang Yan
- School of PharmacyWeifang Medical UniversityWeifangChina
| | - Shuai E
- School of PharmacyWeifang Medical UniversityWeifangChina
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Pei Xiong
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Su‐nv Tang
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Kai‐quan Yu
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Min Zhang
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| | - Yung‐chi Cheng
- Department of PharmacologyYale University School of MedicineNew HavenConnecticutUSA
| | - Wei Cai
- School of Pharmaceutical SciencesHunan University of MedicineHuaihuaChina
| |
Collapse
|
5
|
Ding J, Feng YQ. Mass spectrometry-based metabolomics for clinical study: Recent progresses and applications. Trends Analyt Chem 2022. [DOI: 10.1016/j.trac.2022.116896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
6
|
Retention Time Prediction with Message-Passing Neural Networks. SEPARATIONS 2022. [DOI: 10.3390/separations9100291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023] Open
Abstract
Retention time prediction, facilitated by advances in machine learning, has become a useful tool in untargeted LC-MS applications. State-of-the-art approaches include graph neural networks and 1D-convolutional neural networks that are trained on the METLIN small molecule retention time dataset (SMRT). These approaches demonstrate accurate predictions comparable with the experimental error for the training set. The weak point of retention time prediction approaches is the transfer of predictions to various systems. The accuracy of this step depends both on the method of mapping and on the accuracy of the general model trained on SMRT. Therefore, improvements to both parts of prediction workflows may lead to improved compound annotations. Here, we evaluate capabilities of message-passing neural networks (MPNN) that have demonstrated outstanding performance on many chemical tasks to accurately predict retention times. The model was initially trained on SMRT, providing mean and median absolute cross-validation errors of 32 and 16 s, respectively. The pretrained MPNN was further fine-tuned on five publicly available small reversed-phase retention sets in a transfer learning mode and demonstrated up to 30% improvement of prediction accuracy for these sets compared with the state-of-the-art methods. We demonstrated that filtering isomeric candidates by predicted retention with the thresholds obtained from ROC curves eliminates up to 50% of false identities.
Collapse
|
7
|
Lu Y, Wang X, Wu Y, Wang Z, Zhou N, Li J, Shang X, Lin P. Chemical characterization of the antioxidant and α-glucosidase inhibitory active fraction of Malus transitoria leaves. Food Chem 2022; 386:132863. [PMID: 35367798 DOI: 10.1016/j.foodchem.2022.132863] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 02/21/2022] [Accepted: 03/28/2022] [Indexed: 11/27/2022]
Abstract
Chinese Tibetan tea made from the tender leaves of Malus transitoria is a widely consumed health drink, but there are few reports on its chemical composition and biological activity. In this study, we found that a 50% ethanol extract of M. transitoria had good antioxidant and α-glucosidase inhibitory activities in vitro. Guided by in vitro bioassays, chromatographic separation and purification were conducted, and the most active fraction in M. transitoria was determined. UPLC-Orbitrap-MS/MS was used to further quickly and comprehensively characterize the chemical composition. Library searches, MS/MS fragmentation patterns of two isolated reference compounds, and bibliography were used to annotate 81 compounds, of which 2 were new compounds, and 79 were identified from M. transitoria for the first time. This study provides a scientific basis for the development of antioxidant and anti-diabetic functional foods from M. transitoria.
Collapse
Affiliation(s)
- Yongchang Lu
- Qinghai Provincial Key Laboratory of Phytochemistry for Tibetan Plateau, Qinghai University for Nationalities, Xining 810000, China.
| | - Xin Wang
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Yong Wu
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Zeyu Wang
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Na Zhou
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Jinjie Li
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Xiaoya Shang
- Beijing Key Laboratory of Bioactive Substances and Functional Foods, Beijing Union University, Beijing 100191, China.
| | - Pengcheng Lin
- Qinghai Provincial Key Laboratory of Phytochemistry for Tibetan Plateau, Qinghai University for Nationalities, Xining 810000, China.
| |
Collapse
|
8
|
Mueller P, Bonner R, Hopfgartner G. Controlled Formation of Protonated and Radical Cation Precursor Ions by Atmospheric Pressure Photoionization with μLC-MS Enables Electron Ionization and MS/MS Library Search. Anal Chem 2022; 94:12103-12110. [PMID: 36001638 DOI: 10.1021/acs.analchem.2c02105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Atmospheric pressure photoionization (APPI) was developed as an alternative to electrospray ionization (ESI) for the generation of protonated molecules using liquid chromatography and optimized using dopants such as toluene, which predominantly forms protonated molecules, and chlorobenzene, which favors the formation of radical cations, although the latter has not been fully exploited. Based on 40 diverse low-molecular-weight compounds and micro liquid chromatography (μLC) coupled with APPI tandem mass spectrometry (APPI-MS/MS), the potential of radical cations was investigated. Chromatographic and ionization conditions were decoupled by post-column addition of methanol, allowing separate study and optimization. Due to the mass flow sensitive behavior of APPI, sensitivity is not affected by post-column dilution, and for 8 of 35 analytes, the radical cation response with μLC-APPI is better than for protonated molecules using μLC-ESI. Collision-induced fragmentation (CID) of radical cations produced within a collision energy range from 10-115 eV have, in the median, 65% of the fragments found in electron ionization (EI) spectra. This similarity allowed identification of 86% of the analytes using data-dependent acquisition (DDA) of radical cations and NIST EI library searches. We propose a workflow that uses multimodal DDA of protonated precursor molecules using ESI or APPI with toluene as a dopant, and radical cations produced by chlorobenzene-assisted μLC-APPI with post-column addition of methanol. This increases the confidence of molecular identification by allowing orthogonal library searches using MS/MS libraries for protonated precursor CID spectra and EI libraries for radical cation CID spectra.
Collapse
Affiliation(s)
- Patrick Mueller
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24 Quai Ernest Ansermet, CH-1211 Geneva 4, Switzerland
| | - Ron Bonner
- Ron Bonner Consulting, Newmarket, ON L3Y 3C7, Canada
| | - Gérard Hopfgartner
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24 Quai Ernest Ansermet, CH-1211 Geneva 4, Switzerland
| |
Collapse
|
9
|
Hu Q, Sun Y, Yuan P, Lei H, Zhong H, Wang Y, Tang H. Quantitative structure-retention relationship for reliable metabolite identification and quantification in metabolomics using ion-pair reversed-phase chromatography coupled with tandem mass spectrometry. Talanta 2022; 238:123059. [PMID: 34808567 DOI: 10.1016/j.talanta.2021.123059] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 11/09/2021] [Accepted: 11/10/2021] [Indexed: 10/19/2022]
Abstract
Hydrophilic metabolites are essential for all biological systems with multiple functions and their quantitative analysis forms an important part of metabolomics. However, poor retention of these metabolites on reversed-phase (RP) chromatographic column hinders their effective analysis with RPLC-MS methods. Herein, we developed a method for detecting hydrophilic metabolites using the ion-pair reversed-phase liquid-chromatography coupled with mass spectrometry (IPRP-LC-MS/MS) in scheduled multiple-reaction-monitoring (sMRM) mode. We first developed a hexylamine-based IPRP-UHPLC-QTOFMS method and experimentally measured retention time (tR) for 183 hydrophilic metabolites. We found that tRs of these metabolites were dominated by their electrostatic potential depending upon the numbers and types of their ionizable groups. We then systematically investigated the quantitative structure-retention relationship (QSRR) and constructed QSRR models using the measured tR. Subsequently, we developed a retention time predictive model using the random-forest regression algorithm (r2 = 0.93, q2 = 0.70, MAE = 1.28 min) for predicting metabolite retention time, which was applied in IPRP-UHPLC-MS/MS method in sMRM mode for quantitative metabolomic analysis. Our method can simultaneously quantify more than 260 metabolites. Moreover, we found that this method was applicable for multiple major biological matrices including biofluids and tissues. This approach offers an efficient method for large-scale quantitative hydrophilic metabolomic profiling even when metabolite standards are unavailable.
Collapse
Affiliation(s)
- Qingyu Hu
- CAS Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Centre for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430071, China; State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Metabonomics and Systems Biology Laboratory at Shanghai International Centre for Molecular Phenomics, Zhongshan Hospital, Fudan University, Shanghai, 200032, China; University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yuting Sun
- CAS Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Centre for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430071, China; State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Metabonomics and Systems Biology Laboratory at Shanghai International Centre for Molecular Phenomics, Zhongshan Hospital, Fudan University, Shanghai, 200032, China; University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Peihong Yuan
- CAS Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Centre for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430071, China; Department of Laboratory Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China
| | - Hehua Lei
- CAS Key Laboratory of Magnetic Resonance in Biological Systems, State Key Laboratory of Magnetic Resonance and Atomic and Molecular Physics, National Centre for Magnetic Resonance in Wuhan, Wuhan Institute of Physics and Mathematics, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, 430071, China
| | - Huiqin Zhong
- Waters Technologies (Shanghai) Limited, 1000 Jinhai Road, Shanghai, 201206, China
| | - Yulan Wang
- Singapore Phenome Centre, Lee Kong Chian School of Medicine, Nanyang Technological University, 639798, Singapore.
| | - Huiru Tang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Metabonomics and Systems Biology Laboratory at Shanghai International Centre for Molecular Phenomics, Zhongshan Hospital, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
10
|
Liapikos T, Zisi C, Kodra D, Kademoglou K, Diamantidou D, Begou O, Pappa-Louisi A, Theodoridis G. Quantitative Structure Retention Relationship (QSRR) Modelling for Analytes’ Retention Prediction in LC-HRMS by Applying Different Machine Learning Algorithms and Evaluating Their Performance. J Chromatogr B Analyt Technol Biomed Life Sci 2022; 1191:123132. [DOI: 10.1016/j.jchromb.2022.123132] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/12/2022] [Accepted: 01/16/2022] [Indexed: 12/26/2022]
|
11
|
Bride E, Heinisch S, Bonnefille B, Guillemain C, Margoum C. Suspect screening of environmental contaminants by UHPLC-HRMS and transposable Quantitative Structure-Retention Relationship modelling. JOURNAL OF HAZARDOUS MATERIALS 2021; 409:124652. [PMID: 33277075 DOI: 10.1016/j.jhazmat.2020.124652] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2020] [Revised: 10/02/2020] [Accepted: 11/20/2020] [Indexed: 06/12/2023]
Abstract
A Quantitative Structure-Retention Relationship (QSRR) model is proposed and aims at increasing the confidence level associated to the identification of organic contaminants by Ultra-High Performance Liquid Chromatography hyphenated to High Resolution Mass Spectrometry (UHPLC-HRMS) in environmental samples under a suspect screening approach. The model was built from a selection of 8 easily accessible physicochemical descriptors, and was validated from a set of 274 organic compounds commonly found in environmental samples. The proposed predictive figure approach is based on the mobile phase composition at solute elution (expressed as % acetonitrile), that has the major advantage of making the model reusable by other laboratories, since the elution composition is independent of both the column geometry and the UHPLC-system. The model quality was assessed and was altered neither by the columns from different lots, nor by the complex matrices of environmental water samples. Then, the solute retention of any organic compound present in water samples is expected to be predicted within ± 14.3% acetonitrile by our model. Solute retention can therefore be used as a supplementary tool for the identification of environmental contaminants by UHPLC-HRMS, in addition to mass spectrometry data already used in the suspect screening approach.
Collapse
Affiliation(s)
- Eloi Bride
- INRAE, UR RiverLy, F-69625 Villeurbanne, France
| | - Sabine Heinisch
- Université de Lyon, Institut des Sciences Analytiques, UMR 5280, CNRS, F-69100 Villeurbanne, France
| | | | | | | |
Collapse
|
12
|
Feng C, Xu Q, Qiu X, Jin Y, Ji J, Lin Y, Le S, She J, Lu D, Wang G. Evaluation and application of machine learning-based retention time prediction for suspect screening of pesticides and pesticide transformation products in LC-HRMS. CHEMOSPHERE 2021; 271:129447. [PMID: 33476874 DOI: 10.1016/j.chemosphere.2020.129447] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 12/17/2020] [Accepted: 12/23/2020] [Indexed: 06/12/2023]
Abstract
Computational QSAR models have gradually been preferred for retention time prediction in data mining of emerging environmental contaminants using liquid chromatography coupled with mass spectrometry. Generally, the model performance relies on the components such as machine learning algorithms, chemical features, and example data. In this study, we evaluated the performances of four algorithms on three feature sets, using 321 and 77 pesticides as the training and validation sets, respectively. The results were varied with different combinations of algorithms on distinct feature sets. Two strategies including enhancing the complexity of chemical features and enlarging the size of the training set were proved to improve the results. XGBoost, Random Forest, and lightGBM algorithms exhibited the best results when built on a large-scale chemical descriptors, while the Keras algorithm preferred fingerprints. These four models have comparable prediction accuracies that at least 90% of pesticides in validation set can be successfully predicted with ΔRT <1.0 min. Meanwhile, a blended prediction strategy using average results from four models presented a better result than any single model. This strategy was used for assisting identification of pesticides and pesticide transformation products in 120 strawberry samples from a national survey of food contamination. Twenty pesticides and twelve pesticide transformation products were tentatively identified, where all pesticides and two pesticide transformation products (bifenazate diazene and spirotetramat-enol) were confirmed by standard materials. The outcome of this study suggested that retention time prediction is a valuable approach in compound identification when integrated with in silico MS2 spectra and other MS identification strategies.
Collapse
Affiliation(s)
- Chao Feng
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Qian Xu
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Xinlei Qiu
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Yu'e Jin
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Jieyun Ji
- Shanghai Changning Center for Disease Control and Prevention, Shanghai, 200051, China
| | - Yuanjie Lin
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Sunyang Le
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China
| | - Jianwen She
- California Department of Public Health, Richmond, CA, 94804, USA
| | - Dasheng Lu
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China.
| | - Guoquan Wang
- Shanghai Municipal Center for Disease Control and Prevention, Shanghai, 200336, China; State Environmental Protection Key Laboratory of Environmental Health Impact Assessment of Emerging Contaminants, Shanghai, 200336, China.
| |
Collapse
|
13
|
Zhu FY, Song YC, Zhang KL, Chen X, Chen MX. Quantifying Plant Dynamic Proteomes by SWATH-based Mass Spectrometry. TRENDS IN PLANT SCIENCE 2020; 25:1171-1172. [PMID: 32891562 DOI: 10.1016/j.tplants.2020.07.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Accepted: 07/30/2020] [Indexed: 05/20/2023]
Affiliation(s)
- Fu-Yuan Zhu
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, Jiangsu Province, China.
| | - Yu-Chen Song
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, Jiangsu Province, China
| | - Kai-Lu Zhang
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, Jiangsu Province, China
| | - Xi Chen
- SpecAlly Life Technology Co., Ltd, Wuhan, China; Wuhan Institute of Biotechnology, Wuhan, China
| | - Mo-Xian Chen
- Co-Innovation Center for Sustainable Forestry in Southern China, College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, Jiangsu Province, China; Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| |
Collapse
|
14
|
Haddad PR, Taraji M, Szücs R. Prediction of Analyte Retention Time in Liquid Chromatography. Anal Chem 2020; 93:228-256. [DOI: 10.1021/acs.analchem.0c04190] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Paul R. Haddad
- Australian Centre for Research on Separation Science, School of Natural Sciences, University of Tasmania, Private Bag 75, Hobart, Tasmania, Australia 7001
| | - Maryam Taraji
- Australian Centre for Research on Separation Science, School of Natural Sciences, University of Tasmania, Private Bag 75, Hobart, Tasmania, Australia 7001
- The Australian Wine Research Institute, P.O. Box 197, Adelaide, South Australia 5064, Australia
- Metabolomics Australia, P.O. Box 197, Adelaide, South Australia 5064, Australia
| | - Roman Szücs
- Pfizer R&D UK Limited, Ramsgate Road, Sandwich CT13 9NJ, U.K
- Department of Analytical Chemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Mlynská Dolina CH2, Ilkovičova 6, SK-84215 Bratislava, Slovakia
| |
Collapse
|
15
|
Machine learning to predict retention time of small molecules in nano-HPLC. Anal Bioanal Chem 2020; 412:7767-7776. [PMID: 32860519 DOI: 10.1007/s00216-020-02905-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 07/29/2020] [Accepted: 08/20/2020] [Indexed: 01/22/2023]
Abstract
Retention time is an important parameter for identification in untargeted LC-MS screening. Precise retention time prediction facilitates the annotation process and is well known for proteomics. However, the lack of available experimental information for a long time has limited the prediction accuracy for small molecules. Recently introduced large databases for small-molecule retention times make possible reliable machine learning-based predictions for the whole diversity of compounds. Applying simple projections may expand these predictions on various LC systems and conditions. In our work, we describe a complex approach to predict retention times for nano-HPLC that includes the consequent deployment of binary and regression gradient boosting models trained on the METLIN small-molecule dataset and simple projection of the results with a small number of easily available compounds onto nano-HPLC separations. The proposed model outperforms previous attempts to use machine learning for predictions with a 46-s mean absolute error. The overall performance after transfer to nano-LC conditions is less than 155 s (10.8%) in terms of the median absolute (relative) error. To illustrate the applicability of the described approach, we successfully managed to eliminate averagely 25 to 42% of false-positives with a filter threshold derived from ROC curves. Thus, the proposed approach should be used in addition to other well-established in silico methods and their integration may broaden the range of correctly identified molecules.
Collapse
|
16
|
Raetz M, Bonner R, Hopfgartner G. SWATH-MS for metabolomics and lipidomics: critical aspects of qualitative and quantitative analysis. Metabolomics 2020; 16:71. [PMID: 32504120 DOI: 10.1007/s11306-020-01692-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 05/29/2020] [Indexed: 12/19/2022]
Abstract
INTRODUCTION While liquid chromatography coupled to mass spectrometric detection in the selected reaction monitoring detection mode offers the best quantification sensitivity for omics, the number of target analytes is limited, must be predefined and specific methods developed. Data independent acquisition (DIA), including SWATH using quadrupole time of flight or orbitrap mass spectrometers and generic acquisition methods, has emerged as a powerful alternative technique for quantitative and qualitative analyses since it can cover a wide range of analytes without predefinition. OBJECTIVES Here we review the current state of DIA, SWATH-MS and highlight novel acquisition strategies for metabolomics and lipidomics and opportunities for data analysis tools. METHOD Different databases were searched for papers that report developments and applications of DIA and in particular SWATH-MS in metabolomics and lipidomics. RESULTS DIA methods generate digital sample records that can be mined retrospectively as further knowledge is gained and, with standardized acquisition schemes, used in multiple studies. The different chemical spaces of metabolites and lipids require different specificities, hence different acquisition and data processing approaches must be considered for their analysis. CONCLUSIONS Although the hardware and acquisition modes are well defined for SWATH-MS, a major challenge for routine use remains the lack of appropriate software tools capable of handling large datasets and large numbers of analytes.
Collapse
Affiliation(s)
- Michel Raetz
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24 Quai Ernest Ansermet, CH-1211, Geneva, Switzerland
| | - Ron Bonner
- Ron Bonner Consulting, Newmarket, ON, L3Y 3C7, Canada
| | - Gérard Hopfgartner
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24 Quai Ernest Ansermet, CH-1211, Geneva, Switzerland.
| |
Collapse
|
17
|
Zhou Z, Chen Y, Gao Y, Bi N, Yue X, He J, Zhang R, Wang L, Abliz Z. Development of a high-coverage metabolome relative quantitative method for large-scale sample analysis. Anal Chim Acta 2020; 1109:44-52. [DOI: 10.1016/j.aca.2020.02.049] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Revised: 02/18/2020] [Accepted: 02/25/2020] [Indexed: 12/23/2022]
|
18
|
Witting M, Böcker S. Current status of retention time prediction in metabolite identification. J Sep Sci 2020; 43:1746-1754. [DOI: 10.1002/jssc.202000060] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 03/04/2020] [Accepted: 03/04/2020] [Indexed: 12/23/2022]
Affiliation(s)
- Michael Witting
- Research Unit Analytical BioGeoChemistryHelmholtz Zentrum München Neuherberg Germany
- Chair of Analytical Food ChemistryTUM School of Life Sciences, Technische Universität München Freising Germany
| | - Sebastian Böcker
- Chair of BioinformaticsFriedrich‐Schiller‐Universität Jena Jena Germany
| |
Collapse
|
19
|
Akbal L, Hopfgartner G. Supercritical fluid chromatography–mass spectrometry using data independent acquisition for the analysis of polar metabolites in human urine. J Chromatogr A 2020; 1609:460449. [DOI: 10.1016/j.chroma.2019.460449] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Revised: 08/08/2019] [Accepted: 08/12/2019] [Indexed: 12/12/2022]
|
20
|
The METLIN small molecule dataset for machine learning-based retention time prediction. Nat Commun 2019; 10:5811. [PMID: 31862874 PMCID: PMC6925099 DOI: 10.1038/s41467-019-13680-7] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 11/13/2019] [Indexed: 01/18/2023] Open
Abstract
Machine learning has been extensively applied in small molecule analysis to predict a wide range of molecular properties and processes including mass spectrometry fragmentation or chromatographic retention time. However, current approaches for retention time prediction lack sufficient accuracy due to limited available experimental data. Here we introduce the METLIN small molecule retention time (SMRT) dataset, an experimentally acquired reverse-phase chromatography retention time dataset covering up to 80,038 small molecules. To demonstrate the utility of this dataset, we deployed a deep learning model for retention time prediction applied to small molecule annotation. Results showed that in 70\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\%$$\end{document}% of the cases, the correct molecular identity was ranked among the top 3 candidates based on their predicted retention time. We anticipate that this dataset will enable the community to apply machine learning or first principles strategies to generate better models for retention time prediction. The use of machine learning for identifying small molecules through their retention time’s predictions has been challenging so far. Here the authors combine a large database of liquid chromatography retention time with a deep learning approach to enable accurate metabolites’s identification.
Collapse
|
21
|
|
22
|
Monge ME, Dodds JN, Baker ES, Edison AS, Fernández FM. Challenges in Identifying the Dark Molecules of Life. ANNUAL REVIEW OF ANALYTICAL CHEMISTRY (PALO ALTO, CALIF.) 2019; 12:177-199. [PMID: 30883183 PMCID: PMC6716371 DOI: 10.1146/annurev-anchem-061318-114959] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Metabolomics is the study of the metabolome, the collection of small molecules in living organisms, cells, tissues, and biofluids. Technological advances in mass spectrometry, liquid- and gas-phase separations, nuclear magnetic resonance spectroscopy, and big data analytics have now made it possible to study metabolism at an omics or systems level. The significance of this burgeoning scientific field cannot be overstated: It impacts disciplines ranging from biomedicine to plant science. Despite these advances, the central bottleneck in metabolomics remains the identification of key metabolites that play a class-discriminant role. Because metabolites do not follow a molecular alphabet as proteins and nucleic acids do, their identification is much more time consuming, with a high failure rate. In this review, we critically discuss the state-of-the-art in metabolite identification with specific applications in metabolomics and how technologies such as mass spectrometry, ion mobility, chromatography, and nuclear magnetic resonance currently contribute to this challenging task.
Collapse
Affiliation(s)
- María Eugenia Monge
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1425FQD, Ciudad de Buenos Aires, Argentina
| | - James N Dodds
- Department of Chemistry, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Erin S Baker
- Department of Chemistry, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Arthur S Edison
- Department of Genetics, Department of Biochemistry and Molecular Biology, and Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, USA
| | - Facundo M Fernández
- School of Chemistry and Biochemistry, Georgia Institute of Technology and Petit Institute for Biochemistry and Bioscience, Atlanta, Georgia 30332, USA;
| |
Collapse
|
23
|
Wang R, Yin Y, Zhu ZJ. Advancing untargeted metabolomics using data-independent acquisition mass spectrometry technology. Anal Bioanal Chem 2019; 411:4349-4357. [PMID: 30847570 DOI: 10.1007/s00216-019-01709-1] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Revised: 02/14/2019] [Accepted: 02/20/2019] [Indexed: 12/28/2022]
Abstract
Metabolomics quantitatively measures metabolites in a given biological system and facilitates the understanding of physiological and pathological activities. With the recent advancement of mass spectrometry (MS) technology, liquid chromatography-mass spectrometry (LC-MS) with data-independent acquisition (DIA) has been emerged as a powerful technology for untargeted metabolomics due to its capability to acquire all MS2 spectra and high quantitative accuracy. In this trend article, we first introduced the basic principles of several common DIA techniques including MSE, all ion fragmentation (AIF), SWATH, and MSX. Then, we summarized and compared the data analysis strategies to process DIA-based untargeted metabolomics data, including metabolite identification and quantification. We think the advantages of the DIA technique will enable its broad application in untargeted metabolomics.
Collapse
Affiliation(s)
- Ruohong Wang
- Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 200032, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yandong Yin
- Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Zheng-Jiang Zhu
- Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 200032, China.
| |
Collapse
|
24
|
Couvillion SP, Zhu Y, Nagy G, Adkins JN, Ansong C, Renslow RS, Piehowski PD, Ibrahim YM, Kelly RT, Metz TO. New mass spectrometry technologies contributing towards comprehensive and high throughput omics analyses of single cells. Analyst 2019; 144:794-807. [PMID: 30507980 PMCID: PMC6349538 DOI: 10.1039/c8an01574k] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Mass-spectrometry based omics technologies - namely proteomics, metabolomics and lipidomics - have enabled the molecular level systems biology investigation of organisms in unprecedented detail. There has been increasing interest for gaining a thorough, functional understanding of the biological consequences associated with cellular heterogeneity in a wide variety of research areas such as developmental biology, precision medicine, cancer research and microbiome science. Recent advances in mass spectrometry (MS) instrumentation and sample handling strategies are quickly making comprehensive omics analyses of single cells feasible, but key breakthroughs are still required to push through remaining bottlenecks. In this review, we discuss the challenges faced by single cell MS-based omics analyses and highlight recent technological advances that collectively can contribute to comprehensive and high throughput omics analyses in single cells. We provide a vision of the potential of integrating pioneering technologies such as Structures for Lossless Ion Manipulations (SLIM) for improved sensitivity and resolution, novel peptide identification tactics and standards free metabolomics approaches for future applications in single cell analysis.
Collapse
Affiliation(s)
- Sneha P Couvillion
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Schlotterbeck J, Cebo M, Kolb A, Lämmerhofer M. Quantitative analysis of chemoresistance-inducing fatty acid in food supplements using UHPLC-ESI-MS/MS. Anal Bioanal Chem 2018; 411:479-491. [PMID: 30460390 DOI: 10.1007/s00216-018-1468-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 10/10/2018] [Accepted: 11/02/2018] [Indexed: 12/11/2022]
Abstract
Polyunsaturated fatty acids are important signaling molecules. A recent study reported hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid, 12-oxo-5Z,8E,10E-heptadecatrienoic acid, and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic acid as chemotherapy resistance-inducing factors when tumor cells were treated with cisplatin. Marine-based food supplements like fish oil or algae extracts are rich in polyunsaturated fatty acids and can contain large amounts of hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid. Thus, it was concluded that oral uptake of hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid might induce chemoresistance as shown in a mouse model. Cancer patients tend to consume food supplements containing polyunsaturated fatty acids on a regular basis. The uptake of hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic acid should be controlled, because even low concentrations of 0.5 ng mL-1 showed chemoresistance-inducing effects in animal experiments. For accurate analysis of hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic acid a validated method was developed by using ultrahigh-performance liquid chromatography hyphenated to quadrupole time of flight mass spectrometry via electrospray ionization and sample preparation by solid-phase extraction (SPE) with 3-aminopropyl silica. A combined targeted/untargeted approach was utilized using MS/MS by data-independent acquisition with SWATH and applied to commercial food supplements (refined fish oil, fish oil capsules, algae oil capsules, and flaxseed capsules). Accurate quantification of hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic acid on the MS/MS level with simultaneous untargeted fatty acid screening revealed additional information. The LODs for hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic acid were 0.036 ng mL-1 and 0.054 ng mL-1, respectively. Since hexadeca-4Z,7Z,10Z,13Z-tetraenoic acid was present in the samples in large amounts and (12S)-hydroxy-5Z,8E,10E-heptadecatrienoic was not expected to be present in high concentrations, two calibration ranges, namely, 0.5-20 ng mL-1 and 5-200 ng mL-1, were validated. An untargeted screening identified 18-39 free fatty acids being present in the lipid extracts of the food supplement samples. Graphical abstract ᅟ.
Collapse
Affiliation(s)
- Jörg Schlotterbeck
- Institute of Pharmaceutical Sciences, Pharmaceutical (Bio-)Analysis, University of Tübingen, Auf der Morgenstelle 8, 72076, Tübingen, Germany
| | - Malgorzata Cebo
- Institute of Pharmaceutical Sciences, Pharmaceutical (Bio-)Analysis, University of Tübingen, Auf der Morgenstelle 8, 72076, Tübingen, Germany
| | - Agnes Kolb
- Institute of Pharmaceutical Sciences, Pharmaceutical (Bio-)Analysis, University of Tübingen, Auf der Morgenstelle 8, 72076, Tübingen, Germany
| | - Michael Lämmerhofer
- Institute of Pharmaceutical Sciences, Pharmaceutical (Bio-)Analysis, University of Tübingen, Auf der Morgenstelle 8, 72076, Tübingen, Germany.
| |
Collapse
|
26
|
Amos RI, Haddad PR, Szucs R, Dolan JW, Pohl CA. Molecular modeling and prediction accuracy in Quantitative Structure-Retention Relationship calculations for chromatography. Trends Analyt Chem 2018. [DOI: 10.1016/j.trac.2018.05.019] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
27
|
Zhao X, Zeng Z, Chen A, Lu X, Zhao C, Hu C, Zhou L, Liu X, Wang X, Hou X, Ye Y, Xu G. Comprehensive Strategy to Construct In-House Database for Accurate and Batch Identification of Small Molecular Metabolites. Anal Chem 2018; 90:7635-7643. [DOI: 10.1021/acs.analchem.8b01482] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Xinjie Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Zhongda Zeng
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
- Dalian ChemDataSolution Information Technology Co. Ltd, Dalian 116023, China
| | - Aiming Chen
- Dalian ChemDataSolution Information Technology Co. Ltd, Dalian 116023, China
| | - Xin Lu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Chunxia Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Chunxiu Hu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Lina Zhou
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Xinyu Liu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Xiaolin Wang
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Xiaoli Hou
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Yaorui Ye
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Science, Dalian 116023, China
| |
Collapse
|
28
|
Blaženović I, Kind T, Ji J, Fiehn O. Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics. Metabolites 2018; 8:E31. [PMID: 29748461 PMCID: PMC6027441 DOI: 10.3390/metabo8020031] [Citation(s) in RCA: 416] [Impact Index Per Article: 69.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Revised: 04/26/2018] [Accepted: 05/06/2018] [Indexed: 01/17/2023] Open
Abstract
The annotation of small molecules remains a major challenge in untargeted mass spectrometry-based metabolomics. We here critically discuss structured elucidation approaches and software that are designed to help during the annotation of unknown compounds. Only by elucidating unknown metabolites first is it possible to biologically interpret complex systems, to map compounds to pathways and to create reliable predictive metabolic models for translational and clinical research. These strategies include the construction and quality of tandem mass spectral databases such as the coalition of MassBank repositories and investigations of MS/MS matching confidence. We present in silico fragmentation tools such as MS-FINDER, CFM-ID, MetFrag, ChemDistiller and CSI:FingerID that can annotate compounds from existing structure databases and that have been used in the CASMI (critical assessment of small molecule identification) contests. Furthermore, the use of retention time models from liquid chromatography and the utility of collision cross-section modelling from ion mobility experiments are covered. Workflows and published examples of successfully annotated unknown compounds are included.
Collapse
Affiliation(s)
- Ivana Blaženović
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Tobias Kind
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Jian Ji
- State Key Laboratory of Food Science and Technology, School of Food Science of Jiangnan University, School of Food Science Synergetic Innovation Center of Food Safety and Nutrition, Wuxi 214122, China.
| | - Oliver Fiehn
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
- Department of Biochemistry, Faculty of Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
29
|
Bruderer T, Varesio E, Hidasi AO, Duchoslav E, Burton L, Bonner R, Hopfgartner G. Metabolomic spectral libraries for data-independent SWATH liquid chromatography mass spectrometry acquisition. Anal Bioanal Chem 2018; 410:1873-1884. [PMID: 29411086 DOI: 10.1007/s00216-018-0860-x] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Revised: 12/12/2017] [Accepted: 01/08/2018] [Indexed: 11/27/2022]
Abstract
High-quality mass spectral libraries have become crucial in mass spectrometry-based metabolomics. Here, we investigate a workflow to generate accurate mass discrete and composite spectral libraries for metabolite identification and for SWATH mass spectrometry data processing. Discrete collision energy (5-100 eV) accurate mass spectra were collected for 532 metabolites from the human metabolome database (HMDB) by flow injection analysis and compiled into composite spectra over a large collision energy range (e.g., 10-70 eV). Full scan response factors were also calculated. Software tools based on accurate mass and predictive fragmentation were specially developed and found to be essential for construction and quality control of the spectral library. First, elemental compositions constrained by the elemental composition of the precursor ion were calculated for all fragments. Secondly, all possible fragments were generated from the compound structure and were filtered based on their elemental compositions. From the discrete spectra, it was possible to analyze the specific fragment form at each collision energy and it was found that a relatively large collision energy range (10-70 eV) gives informative MS/MS spectra for library searches. From the composite spectra, it was possible to characterize specific neutral losses as radical losses using in silico fragmentation. Radical losses (generating radical cations) were found to be more prominent than expected. From 532 metabolites, 489 provided a signal in positive mode [M+H]+ and 483 in negative mode [M-H]-. MS/MS spectra were obtained for 399 compounds in positive mode and for 462 in negative mode; 329 metabolites generated suitable spectra in both modes. Using the spectral library, LC retention time, response factors to analyze data-independent LC-SWATH-MS data allowed the identification of 39 (positive mode) and 72 (negative mode) metabolites in a plasma pool sample (total 92 metabolites) where 81 previously were reported in HMDB to be found in plasma. Graphical abstract Library generation workflow for LC-SWATH MS, using collision energy spread, accurate mass, and fragment annotation.
Collapse
Affiliation(s)
- Tobias Bruderer
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24, Quai Ernest Ansermet, 1211, Geneva 4, Switzerland
| | - Emmanuel Varesio
- School of Pharmaceutical Sciences, University of Geneva, University of Lausanne, Rue Michel-Servet 1, 1211, Geneva 4, Switzerland
| | - Anita O Hidasi
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24, Quai Ernest Ansermet, 1211, Geneva 4, Switzerland
| | - Eva Duchoslav
- Sciex, 71 Four Valley Drive, Concord, ON, L4K 4V8, Canada
| | - Lyle Burton
- Sciex, 71 Four Valley Drive, Concord, ON, L4K 4V8, Canada
| | - Ron Bonner
- Ron Bonner Consulting, Newmarket, ON, L3Y 3C7, Canada
| | - Gérard Hopfgartner
- Life Sciences Mass Spectrometry, Department of Inorganic and Analytical Chemistry, University of Geneva, 24, Quai Ernest Ansermet, 1211, Geneva 4, Switzerland.
| |
Collapse
|
30
|
Domingo-Almenara X, Montenegro-Burke JR, Benton HP, Siuzdak G. Annotation: A Computational Solution for Streamlining Metabolomics Analysis. Anal Chem 2018; 90:480-489. [PMID: 29039932 PMCID: PMC5750104 DOI: 10.1021/acs.analchem.7b03929] [Citation(s) in RCA: 105] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Metabolite identification is still considered an imposing bottleneck in liquid chromatography mass spectrometry (LC/MS) untargeted metabolomics. The identification workflow usually begins with detecting relevant LC/MS peaks via peak-picking algorithms and retrieving putative identities based on accurate mass searching. However, accurate mass search alone provides poor evidence for metabolite identification. For this reason, computational annotation is used to reveal the underlying metabolites monoisotopic masses, improving putative identification in addition to confirmation with tandem mass spectrometry. This review examines LC/MS data from a computational and analytical perspective, focusing on the occurrence of neutral losses and in-source fragments, to understand the challenges in computational annotation methodologies. Herein, we examine the state-of-the-art strategies for computational annotation including: (i) peak grouping or full scan (MS1) pseudo-spectra extraction, i.e., clustering all mass spectral signals stemming from each metabolite; (ii) annotation using ion adduction and mass distance among ion peaks; (iii) incorporation of biological knowledge such as biotransformations or pathways; (iv) tandem MS data; and (v) metabolite retention time calibration, usually achieved by prediction from molecular descriptors. Advantages and pitfalls of each of these strategies are discussed, as well as expected future trends in computational annotation.
Collapse
Affiliation(s)
- Xavier Domingo-Almenara
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - J Rafael Montenegro-Burke
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - H Paul Benton
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Gary Siuzdak
- Scripps Center for Metabolomics, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| |
Collapse
|
31
|
Godzien J, Gil de la Fuente A, Otero A, Barbas C. Metabolite Annotation and Identification. COMPREHENSIVE ANALYTICAL CHEMISTRY 2018. [DOI: 10.1016/bs.coac.2018.07.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
32
|
Identification of molecules from non-targeted analysis. J Chromatogr B Analyt Technol Biomed Life Sci 2017; 1071:1-2. [PMID: 29223278 DOI: 10.1016/j.jchromb.2017.11.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
33
|
Fenaille F, Barbier Saint-Hilaire P, Rousseau K, Junot C. Data acquisition workflows in liquid chromatography coupled to high resolution mass spectrometry-based metabolomics: Where do we stand? J Chromatogr A 2017; 1526:1-12. [PMID: 29074071 DOI: 10.1016/j.chroma.2017.10.043] [Citation(s) in RCA: 85] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 10/15/2017] [Accepted: 10/16/2017] [Indexed: 01/08/2023]
Abstract
Typical mass spectrometry (MS) based untargeted metabolomics protocols are tedious as well as time- and sample-consuming. In particular, they often rely on "full-scan-only" analyses using liquid chromatography (LC) coupled to high resolution mass spectrometry (HRMS) from which metabolites of interest are first highlighted, and then tentatively identified by using targeted MS/MS experiments. However, this situation is evolving with the emergence of integrated HRMS based-data acquisition protocols able to perform multi-event acquisitions. Most of these protocols, referring to as data dependent and data independent acquisition (DDA and DIA, respectively), have been initially developed for proteomic applications and have recently demonstrated their applicability to biomedical studies. In this context, the aim of this article is to take stock of the progress made in the field of DDA- and DIA-based protocols, and evaluate their ability to change conventional metabolomic and lipidomic data acquisition workflows, through a review of HRMS instrumentation, DDA and DIA workflows, and also associated informatics tools.
Collapse
Affiliation(s)
- François Fenaille
- Service de Pharmacologie et Immuno-Analyse (SPI), Laboratoire d'Etude du Métabolisme des Médicaments, CEA, INRA, Université Paris Saclay, MetaboHUB, F-91191 Gif-sur-Yvette, France
| | - Pierre Barbier Saint-Hilaire
- Service de Pharmacologie et Immuno-Analyse (SPI), Laboratoire d'Etude du Métabolisme des Médicaments, CEA, INRA, Université Paris Saclay, MetaboHUB, F-91191 Gif-sur-Yvette, France
| | - Kathleen Rousseau
- Service de Pharmacologie et Immuno-Analyse (SPI), Laboratoire d'Etude du Métabolisme des Médicaments, CEA, INRA, Université Paris Saclay, MetaboHUB, F-91191 Gif-sur-Yvette, France
| | - Christophe Junot
- Service de Pharmacologie et Immuno-Analyse (SPI), CEA, INRA, Université Paris Saclay, MetaboHUB, F-91191 Gif-sur-Yvette, France.
| |
Collapse
|