1
|
Shahneh MRZ, Strobel M, Vitale GA, Geibel C, Abiead YE, Garg N, Wagner B, Forchhammer K, Aron A, Phelan VV, Petras D, Wang M. ModiFinder: Tandem Mass Spectral Alignment Enables Structural Modification Site Localization. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024. [PMID: 38830143 DOI: 10.1021/jasms.4c00061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Untargeted tandem mass spectrometry (MS/MS) has become a high-throughput method to measure small molecules in complex samples. One key goal is the transformation of these MS/MS spectra into chemical structures. Computational techniques such as MS/MS library search have enabled the reidentification of known compounds. Analog library search and molecular networking extend this identification to unknown compounds. While there have been advancements in metrics for the similarity of MS/MS spectra of structurally similar compounds, there is still a lack of automated methods to provide site specific information about structural modifications. Here we introduce ModiFinder which leverages the alignment of peaks in MS/MS spectra between structurally related known and unknown small molecules. Specifically, ModiFinder focuses on shifted MS/MS fragment peaks in the MS/MS alignment. These shifted peaks putatively represent substructures of the known molecule that contain the site of the modification. ModiFinder synthesizes this information together and scores the likelihood for each atom in the known molecule to be the modification site. We demonstrate in this manuscript how ModiFinder can effectively localize modifications which extends the capabilities of MS/MS analog searching and molecular networking to accelerate the discovery of novel compounds.
Collapse
Affiliation(s)
- Mohammad Reza Zare Shahneh
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Michael Strobel
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Giovanni Andrea Vitale
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 24, Tuebingen 72076, Germany
| | - Christian Geibel
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 24, Tuebingen 72076, Germany
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Dr., San Diego, California 92093, United States
| | - Neha Garg
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta,, 950 Atlantic Drive, Atlanta, Georgia 30332, United States
| | - Berenike Wagner
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 28, Tuebingen 72076, Germany
| | - Karl Forchhammer
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tuebingen, Auf der Morgenstelle 28, Tuebingen 72076, Germany
| | - Allegra Aron
- Department of Chemistry and Biochemistry, University of Denver, 2101 East Wesley Ave, Denver, Colorado 80210, United States
| | - Vanessa V Phelan
- Department of Pharmaceutical Sciences, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado, Anschutz Medical Campus, 12850 E Montview Blvd, Aurora, Colorado 80045, United States
| | - Daniel Petras
- Department of Biochemistry, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| | - Mingxun Wang
- Department of Computer Science and Engineering, University of California Riverside, 900 University Ave., Riverside, California 92521, United States
| |
Collapse
|
2
|
Hoang C, Uritboonthai W, Hoang L, Billings EM, Aisporna A, Nia FA, Derks RJE, Williamson JR, Giera M, Siuzdak G. Tandem Mass Spectrometry across Platforms. Anal Chem 2024; 96:5478-5488. [PMID: 38529642 PMCID: PMC11007677 DOI: 10.1021/acs.analchem.3c05576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/12/2024] [Accepted: 03/18/2024] [Indexed: 03/27/2024]
Abstract
PubChem serves as a comprehensive repository, housing over 100 million unique chemical structures representing the breadth of our chemical knowledge across numerous fields including metabolism, pharmaceuticals, toxicology, cosmetics, agriculture, and many more. Rapid identification of these small molecules increasingly relies on electrospray ionization (ESI) paired with tandem mass spectrometry (MS/MS), particularly by comparison to genuine standard MS/MS data sets. Despite its widespread application, achieving consistency in MS/MS data across various analytical platforms remains an unaddressed concern. This study evaluated MS/MS data derived from one hundred molecular standards utilizing instruments from five manufacturers, inclusive of quadrupole time-of-flight (QTOF) and quadrupole orbitrap "exactive" (QE) mass spectrometers by Agilent (QTOF), Bruker (QTOF), SCIEX (QTOF), Waters (QTOF), and Thermo QE. We assessed fragment ion variations at multiple collisional energies (0, 10, 20, and 40 eV) using the cosine scoring algorithm for comparisons and the number of fragments observed. A parallel visual analysis of the MS/MS spectra across instruments was conducted, consistent with a standard procedure that is used to circumvent the still prevalent issue of mischaracterizations as shown for dimethyl sphingosine and C20 sphingosine. Our analysis revealed a notable consistency in MS/MS data and identifications, with fragment ions' m/z values exhibiting the highest concordance between instrument platforms at 20 eV, the other collisional energies (0, 10, and 40 eV) were significantly lower. While moving toward a standardized ESI MS/MS protocol is required for dependable molecular characterization, our results also underscore the continued importance of corroborating MS/MS data against standards to ensure accurate identifications. Our findings suggest that ESI MS/MS manufacturers, akin to the established norms for gas chromatography mass spectrometry instruments, should standardize the collision energy at 20 eV across different instrument platforms.
Collapse
Affiliation(s)
- Corey Hoang
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Winnie Uritboonthai
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Linh Hoang
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Elizabeth M. Billings
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Aries Aisporna
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| | - Farshad A. Nia
- Department
of Integrative Structural and Computational Biology, Department of
Chemistry, The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, United States
| | - Rico J. E. Derks
- Center
for Proteomics and Metabolomics, Leiden
University Medical Center, Albinusdreef 2, Leiden 2333ZA, Netherlands
| | - James R. Williamson
- Department
of Integrative Structural and Computational Biology, Department of
Chemistry, The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037, United States
| | - Martin Giera
- Center
for Proteomics and Metabolomics, Leiden
University Medical Center, Albinusdreef 2, Leiden 2333ZA, Netherlands
- The
Novo Nordisk Foundation Center for Stem Cell Medicine (reNEW), Leiden University Medical Center, Leiden 2333ZA, The Netherlands
| | - Gary Siuzdak
- Scripps
Center for Metabolomics and Mass Spectrometry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
- Departments
of Chemistry, Molecular, and Computational Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, United States
| |
Collapse
|
3
|
Aloor LJ, Skariyachan S, Raghavamenon AC, Kumar KM, Narayanappa R, Uttarkar A, Niranjan V, Cherian T. BRCA1/TP53 tumor proteins inhibited by novel analogues of curcumin - Insight from computational modelling, dynamic simulation and experimental validation. Int J Biol Macromol 2023; 253:126989. [PMID: 37739292 DOI: 10.1016/j.ijbiomac.2023.126989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/06/2023] [Accepted: 09/16/2023] [Indexed: 09/24/2023]
Abstract
The current study aimed to design novel curcumin analogue inhibitors with antiproliferative and antitumor activity towards BRCA1 and TP53 tumor proteins and to study their therapeutic potential by computer-aided molecular designing and experimental investigations. Four curcumin analogues were computationally designed and their drug-likeness and pharmacokinetic properties were predicted. The binding of these analogues against six protein targets belonging to BRCA1 and TP53 tumor proteins were modelled by molecular docking and their binding energies were compared with that of curcumin and the standard drug cyclophosphamide and its validated target. The stabilities of selected docked complexes were confirmed by molecular dynamic simulation (MDS) and MMGBSA calculations. The best-docked analogue was chemically synthesized, characterized, and used for in vitro cytotoxic screening using DLA, EAC, and C127I cell lines. In vivo antitumor studies were carried out in Swiss Albino Mice. The study revealed that the designed analogues satisfied drug-likeness and pharmacokinetic properties and demonstrated better binding affinity to the selected targets than curcumin. Among the analogues, NLH demonstrated significant interaction with the BRCA1-BRCT-c domain (TG3; binding energy -8.3 kcal/mol) when compared to the interaction of curcumin (binding energy -6.19 kcal) and cyclophosphamide (binding energy -3.8 kcal/mol) and its usual substrate (TG7). The MDS and MM/GBSA studies revealed that the binding free energy of the NLH-TG3 complex (-61.24 kcal/mol) was better when compared to that of the cyclophosphamide-TG7 complex (-21.67 kcal/mol). In vitro, cytotoxic studies showed that NLH demonstrated significant antiproliferative activities against tumor cell lines. The in vivo study depicted NLH possesses the potential for tumor inhibition. Thus, the newly synthesized curcumin analogue is probably used to develop novel therapeutic agents against breast cancer.
Collapse
Affiliation(s)
- Lovely Jacob Aloor
- Department of Chemistry, Little Flower College, Guruvayoor, Kerala, India; Post Graduate & Research Department of Chemistry, Christ College (Autonomous), Irinjalakuda, Kerala, India
| | - Sinosh Skariyachan
- Department of Microbiology, St. Pius X College, Rajapuram, Kerala, India.
| | | | - Kalavathi Murugan Kumar
- Department of Bioinformatics, Pondicherry University, Chinna Kalapet, Kalapet, Puducherry, Tamil Nadu, India
| | - Rajeswari Narayanappa
- Department of Biotechnology, Dayananda Sagar College of Engineering, Bengaluru, Karnataka, India
| | - Akshay Uttarkar
- Department of Biotechnology, RV College of Engineering, Bengaluru, Karnataka, India
| | - Vidya Niranjan
- Department of Biotechnology, RV College of Engineering, Bengaluru, Karnataka, India
| | - Tom Cherian
- Post Graduate & Research Department of Chemistry, Christ College (Autonomous), Irinjalakuda, Kerala, India.
| |
Collapse
|
4
|
Wang Q, Chen T, La M, Song Z, Gao M, Yang T, Li Y, He L, Zou D. Activity labelled molecular networking fuels the antioxidation active molecules profile of Ginger. Food Chem 2023; 424:136343. [PMID: 37229896 DOI: 10.1016/j.foodchem.2023.136343] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 03/29/2023] [Accepted: 05/08/2023] [Indexed: 05/27/2023]
Abstract
Ginger has been used as consumed food spice and folk medicine in daily life for thousands of years in various regions of the world. Considerable antioxidation is one of the major activities for Ginger to exhibit health-promoting effects. In this study, a bioinformatic workflow was developed to generate activity labelled molecular networking (ALMN) to fuel the antioxidation active molecules profile of Ginger. In ALMN, antioxidation activity data, which was defined as correlation (r and p value) between the relative abundance of a molecule in fractions and the activity level of each fraction, was labelled to feature-based molecular network to profile out antioxidation active molecules visually. Fragmentation tree was further computed as a complementary way to conduct high confidence structure annotations of antioxidation active molecules. Consequently, 48 molecules were prioritized as antioxidation active molecules from 11,720 metabolite molecules of Ginger in a systematical way.
Collapse
Affiliation(s)
- Qiqi Wang
- College of Pharmacy, Jinan University, Guangzhou 510632, PR China
| | - Tao Chen
- Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, PR China
| | - Mencuo La
- School of Life Science, Qinghai Normal University, Xining 810000, PR China
| | - Zhibo Song
- Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, PR China
| | - Mengze Gao
- School of Life Science, Qinghai Normal University, Xining 810000, PR China
| | - Tingqin Yang
- School of Life Science, Qinghai Normal University, Xining 810000, PR China
| | - Yulin Li
- Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, PR China.
| | - Liangliang He
- College of Pharmacy, Jinan University, Guangzhou 510632, PR China.
| | - Denglang Zou
- College of Pharmacy, Jinan University, Guangzhou 510632, PR China; Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810008, PR China; School of Life Science, Qinghai Normal University, Xining 810000, PR China.
| |
Collapse
|
5
|
Xue X, Sun H, Yang M, Liu X, Hu HY, Deng Y, Wang X. Advances in the Application of Artificial Intelligence-Based Spectral Data Interpretation: A Perspective. Anal Chem 2023; 95:13733-13745. [PMID: 37688541 DOI: 10.1021/acs.analchem.3c02540] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2023]
Abstract
The interpretation of spectral data, including mass, nuclear magnetic resonance, infrared, and ultraviolet-visible spectra, is critical for obtaining molecular structural information. The development of advanced sensing technology has multiplied the amount of available spectral data. Chemical experts must use basic principles corresponding to the spectral information generated by molecular fragments and functional groups. This is a time-consuming process that requires a solid professional knowledge base. In recent years, the rapid development of computer science and its applications in cheminformatics and the emergence of computer-aided expert systems have greatly reduced the difficulty in analyzing large quantities of data. For expert systems, however, the problem-solving strategy must be known in advance or extracted by human experts and translated into algorithms. Gratifyingly, the development of artificial intelligence (AI) methods has shown great promise for solving such problems. Traditional algorithms, including the latest neural network algorithms, have shown great potential for both extracting useful information and processing massive quantities of data. This Perspective highlights recent innovations covering all of the emerging AI-based spectral interpretation techniques. In addition, the main limitations and current obstacles are presented, and the corresponding directions for further research are proposed. Moreover, this Perspective gives the authors' personal outlook on the development and future applications of spectral interpretation.
Collapse
Affiliation(s)
- Xi Xue
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
- Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
| | - Hanyu Sun
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
- Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
| | - Minjian Yang
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
- Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
| | - Xue Liu
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| | - Hai-Yu Hu
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co., Ltd. Beijing 100080, China
- Department of Automation, Tsinghua University, Beijing 100084, China
| | - Xiaojian Wang
- State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
- CarbonSilicon AI Technology Co., Ltd. Beijing 100080, China
| |
Collapse
|
6
|
Flores JE, Bramer LM, Degnan DJ, Paurus VL, Corilo YE, Clendinen CS. Gaussian Mixture Modeling Extensions for Improved False Discovery Rate Estimation in GC-MS Metabolomics. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2023. [PMID: 37084380 DOI: 10.1021/jasms.3c00039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The ability to reliably identify small molecules (e.g., metabolites) is key toward driving scientific advancement in metabolomics. Gas chromatography-mass spectrometry (GC-MS) is an analytic method that may be applied to facilitate this process. The typical GC-MS identification workflow involves quantifying the similarity of an observed sample spectrum and other features (e.g., retention index) to that of several references, noting the compound of the best-matching reference spectrum as the identified metabolite. While a deluge of similarity metrics exist, none quantify the error rate of generated identifications, thereby presenting an unknown risk of false identification or discovery. To quantify this unknown risk, we propose a model-based framework for estimating the false discovery rate (FDR) among a set of identifications. Extending a traditional mixture modeling framework, our method incorporates both similarity score and experimental information in estimating the FDR. We apply these models to identification lists derived from across 548 samples of varying complexity and sample type (e.g., fungal species, standard mixtures, etc.), comparing their performance to that of the traditional Gaussian mixture model (GMM). Through simulation, we additionally assess the impact of reference library size on the accuracy of FDR estimates. In comparing the best performing model extensions to the GMM, our results indicate relative decreases in median absolute estimation error (MAE) ranging from 12% to 70%, based on comparisons of the median MAEs across all hit-lists. Results indicate that these relative performance improvements generally hold despite library size; however FDR estimation error typically worsens as the set of reference compounds diminishes.
Collapse
Affiliation(s)
- Javier E Flores
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Lisa M Bramer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - David J Degnan
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Vanessa L Paurus
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Yuri E Corilo
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Chaevien S Clendinen
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|
7
|
Xing S, Shen S, Xu B, Li X, Huan T. BUDDY: molecular formula discovery via bottom-up MS/MS interrogation. Nat Methods 2023:10.1038/s41592-023-01850-x. [PMID: 37055660 DOI: 10.1038/s41592-023-01850-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 03/15/2023] [Indexed: 04/15/2023]
Abstract
A substantial fraction of metabolic features remains undetermined in mass spectrometry (MS)-based metabolomics, and molecular formula annotation is the starting point for unraveling their chemical identities. Here we present bottom-up tandem MS (MS/MS) interrogation, a method for de novo formula annotation. Our approach prioritizes MS/MS-explainable formula candidates, implements machine-learned ranking and offers false discovery rate estimation. Compared with the mathematically exhaustive formula enumeration, our approach shrinks the formula candidate space by 42.8% on average. Method benchmarking on annotation accuracy was systematically carried out on reference MS/MS libraries and real metabolomics datasets. Applied on 155,321 recurrent unidentified spectra, our approach confidently annotated >5,000 novel molecular formulae absent from chemical databases. Beyond the level of individual metabolic features, we combined bottom-up MS/MS interrogation with global optimization to refine formula annotations while revealing peak interrelationships. This approach allowed the systematic annotation of 37 fatty acid amide molecules in human fecal data. All bioinformatics pipelines are available in a standalone software, BUDDY ( https://github.com/HuanLab/BUDDY ).
Collapse
Affiliation(s)
- Shipei Xing
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Sam Shen
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Banghua Xu
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Xiaoxiao Li
- Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, British Columbia, Canada
| | - Tao Huan
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver, British Columbia, Canada.
| |
Collapse
|
8
|
Xie Z, Morris JD, Mattingly SJ, Sutaria SR, Huang J, Nantz MH, Fu XA. Analysis of a Broad Range of Carbonyl Metabolites in Exhaled Breath by UHPLC-MS. Anal Chem 2023; 95:4344-4352. [PMID: 36815760 PMCID: PMC10521381 DOI: 10.1021/acs.analchem.2c04604] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
Analysis of volatile organic compounds (VOCs) in exhaled breath (EB) has shown great potential for disease detection including lung cancer, infectious respiratory diseases, and chronic obstructive pulmonary disease. Although many breath sample collection and analytical methods have been developed for breath analysis, analysis of metabolic VOCs in exhaled breath is still a challenge for clinical application. Many carbonyl compounds in exhaled breath are related to the metabolic processes of diseases. This work reports a method of ultrahigh-performance liquid chromatography coupled with high-resolution mass spectrometry (UHPLC-MS) for the analysis of a broad range of carbonyl metabolites in exhaled breath. Carbonyl compounds in the exhaled breath were captured by a fabricated silicon microreactor with a micropillar array coated with 2-(aminooxy)ethyl-N,N,N-trimethylammonium (ATM) triflate. A total of six subgroups consisting of saturated aldehydes and ketones, hydroxy-aldehydes, and hydroxy-ketones, unsaturated 2-alkenals, and 4-hydroxy-2-alkenals were identified in the exhaled breath. The combination of a silicon microreactor for the selective capture of carbonyl compounds with UHPLC-MS analysis may provide a quantitative method for the analysis of carbonyls to identify disease markers in exhaled breath.
Collapse
Affiliation(s)
- Zhenzhen Xie
- Department of Chemical Engineering, University of Louisville, Louisville, KY 40292, United States
| | - James D. Morris
- Department of Chemical Engineering, University of Louisville, Louisville, KY 40292, United States
| | | | - Saurin R. Sutaria
- Department of Chemistry, University of Louisville, Louisville, KY 40292, United States
| | - Jiapeng Huang
- Department of Anesthesiology and Perioperative Medicine, University of Louisville, Louisville, KY 40292, United States
| | - Michael H. Nantz
- Department of Chemistry, University of Louisville, Louisville, KY 40292, United States
| | - Xiao-An Fu
- Department of Chemical Engineering, University of Louisville, Louisville, KY 40292, United States
| |
Collapse
|
9
|
de Medeiros LS, de Araújo Júnior MB, Peres EG, da Silva JCI, Bassicheto MC, Di Gioia G, Veiga TAM, Koolen HHF. Discovering New Natural Products Using Metabolomics-Based Approaches. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1439:185-224. [PMID: 37843810 DOI: 10.1007/978-3-031-41741-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
The incessant search for new natural molecules with biological activities has forced researchers in the field of chemistry of natural products to seek different approaches for their prospection studies. In particular, researchers around the world are turning to approaches in metabolomics to avoid high rates of re-isolation of certain compounds, something recurrent in this branch of science. Thanks to the development of new technologies in the analytical instrumentation of spectroscopic and spectrometric techniques, as well as the advance in the computational processing modes of the results, metabolomics has been gaining more and more space in studies that involve the prospection of natural products. Thus, this chapter summarizes the precepts and good practices in the metabolomics of microbial natural products using mass spectrometry and nuclear magnetic resonance spectroscopy, and also summarizes several examples where this approach has been applied in the discovery of bioactive molecules.
Collapse
Affiliation(s)
- Lívia Soman de Medeiros
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil.
| | - Moysés B de Araújo Júnior
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | - Eldrinei G Peres
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | | | - Milena Costa Bassicheto
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Giordanno Di Gioia
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Thiago André Moura Veiga
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | | |
Collapse
|
10
|
Yang J, Cai Y, Zhao K, Xie H, Chen X. Concepts and applications of chemical fingerprint for hit and lead screening. Drug Discov Today 2022; 27:103356. [PMID: 36113834 DOI: 10.1016/j.drudis.2022.103356] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 07/28/2022] [Accepted: 09/08/2022] [Indexed: 11/22/2022]
Abstract
Molecular fingerprints are used to represent chemical (structural, physicochemical, etc.) properties of large-scale chemical sets in a low computational cost way. They have a prominent role in transforming chemical data sets into consistent input formats (bit strings or numeric values) suitable for in silico approaches. In this review, we summarize and classify common and state-of-the-art fingerprints into eight different types (dictionary based, circular, topological, pharmacophore, protein-ligand interaction, shape based, reinforced, and multi). We also highlight applications of fingerprints in early drug research and development (R&D). Thus, this review provides a guide for the selection of appropriate fingerprints of compounds (or ligand-protein complexes) for use in drug R&D.
Collapse
Affiliation(s)
- Jingbo Yang
- Department of Pharmagenomics, College of Bioinformatics Science and Technology, Harbin Medical University, 150081 Harbin, Heilongjiang, China
| | - Yiyang Cai
- Department of Pharmagenomics, College of Bioinformatics Science and Technology, Harbin Medical University, 150081 Harbin, Heilongjiang, China
| | - Kairui Zhao
- Department of Pharmagenomics, College of Bioinformatics Science and Technology, Harbin Medical University, 150081 Harbin, Heilongjiang, China
| | - Hongbo Xie
- Department of Pharmagenomics, College of Bioinformatics Science and Technology, Harbin Medical University, 150081 Harbin, Heilongjiang, China.
| | - Xiujie Chen
- Department of Pharmagenomics, College of Bioinformatics Science and Technology, Harbin Medical University, 150081 Harbin, Heilongjiang, China.
| |
Collapse
|
11
|
Computed Mass-Fragmentation Energy Profiles of Some Acetalized Monosaccharides for Identification in Mass Spectrometry. Symmetry (Basel) 2022. [DOI: 10.3390/sym14051074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Our study found that quantum calculations can differentiate fragmentation energies into isomeric structures with asymmetric carbon atoms, such as those of acetalized monosaccharides. It was justified by the good results that have been published in recent years on the discrimination of structural isomers and diastereomers by correlating the calculated mass energy fragmentation profiles with their mass spectra. Based on the quantitative structure–fragmentation relationship (QSFR), this technique compares the intensities of primary ions from the experimental spectrum using the mass energy profiles calculated for the candidate structures. Maximum fit is obtained for the true structure. For a preliminary assessment of the accuracy of the identification of some di-O-isopropylidene monosaccharide diastereomers, we used fragmentation enthalpies (ΔfH) and Gibbs energies (ΔfG) as the energetic descriptors of fragmentation. Four quantum chemical methods were used: RM1, PM7, DFT ΔfH and DFT ΔfG. The mass energy database shows that the differences between the profiles of the isomeric candidate structures could be large enough to be distinguished from each other. This database allows the optimization of energy descriptors and quantum computing methods that can ensure the correct identification of these isomers.
Collapse
|
12
|
Kostyukevich Y, Sosnin S, Osipenko S, Kovaleva O, Rumiantseva L, Kireev A, Zherebker A, Fedorov M, Nikolaev EN. PyFragMS-A Web Tool for the Investigation of the Collision-Induced Fragmentation Pathways. ACS OMEGA 2022; 7:9710-9719. [PMID: 35350354 PMCID: PMC8945079 DOI: 10.1021/acsomega.1c07272] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 02/28/2022] [Indexed: 05/13/2023]
Abstract
Dissociation induced by the accumulation of internal energy via collisions of ions with neutral molecules is one of the most important fragmentation techniques in mass spectrometry (MS), and the identification of small singly charged molecules is based mainly on the consideration of the fragmentation spectrum. Many research studies have been dedicated to the creation of databases of experimentally measured tandem mass spectrometry (MS/MS) spectra (such as MzCloud, Metlin, etc.) and developing software for predicting MS/MS fragments in silico from the molecular structure (such as MetFrag, CFM-ID, CSI:FingerID, etc.). However, the fragmentation mechanisms and pathways are still not fully understood. One of the limiting obstacles is that protomers (positive ions protonated at different sites) produce different fragmentation spectra, and these spectra overlap in the case of the presence of different protomers. Here, we are proposing to use a combination of two powerful approaches: computing fragmentation trees that carry information of all consecutive fragmentations and consideration of the MS/MS data of isotopically labeled compounds. We have created PyFragMS-a web tool consisting of a database of annotated MS/MS spectra of isotopically labeled molecules (after H/D and/or 16O/18O exchange) and a collection of instruments for computing fragmentation trees for an arbitrary molecule. Using PyFragMS, we investigated how the site of protonation influences the fragmentation pathway for small molecules. Also, PyFragMS offers capabilities for performing database search when MS/MS data of the isotopically labeled compounds are taken into account.
Collapse
|
13
|
Radical fragment ions in collision-induced dissociation-based tandem mass spectrometry. Anal Chim Acta 2022; 1200:339613. [DOI: 10.1016/j.aca.2022.339613] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 01/11/2022] [Accepted: 02/14/2022] [Indexed: 11/21/2022]
|
14
|
Dhiman V, Patil K, Velip L, Talluri MVNK, Gananadhamu S. Comprehensive degradation profiling and influence of different oxidizing reagents on tinoridine hydrochloride: Structural characterization of its degradation products using HPLC and HRMS. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2022; 36:e9210. [PMID: 34619000 DOI: 10.1002/rcm.9210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Revised: 09/17/2021] [Accepted: 10/03/2021] [Indexed: 06/13/2023]
Abstract
RATIONALE Stress testing on tinoridine hydrochloride was carried out using a multidimensional approach. This included different conditions: hydrolytic (acidic, alkaline, and neutral conditions), different oxidative reagents, thermal, photolytic conditions, HPLC method development, and structural elucidation using high-resolution mass spectrometry (HRMS). It provides the basis for quality control of tinoridine hydrochloride and its derivatives during storage conditions. METHODS The tinoridine hydrochloride was subjected to a variety of stress conditions. A gradient reversed-phase HPLC method was developed on a X-Bridge C18 column (250 × 4.6 mm, 5 μm) to separate all the degradation products (DPs). HRMS studies have been performed to elucidate the structure of DPs. RESULTS HPLC-PDA study revealed that significant degradation products were formed in hydrolytic, AIBN (radical initiator at 40°C), thermal, and solid-state photolight stress conditions, but the drug was stable under oxidative conditions (H2 O2, Fenton's reagent at room temperature and ferric chloride at 40°C). The structure of degradation products was elucidated, and mechanism of their formation was explained. CONCLUSION Stress study was successfully carried out as per ICH Q1A (R2) guideline on tinoridine hydrochloride. A total of six new degradation products were characterized, DP 2 and DP 6 formed by the effect of co-solvent. This study provides the scientifically sound basis for quality monitoring and storage conditions of tinoridine hydrochloride.
Collapse
Affiliation(s)
- Vivek Dhiman
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER), Hyderabad, Balanagar, India
| | - Kanchan Patil
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER), Hyderabad, Balanagar, India
| | - Laximan Velip
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER), Hyderabad, Balanagar, India
| | - M V N Kumar Talluri
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER), Hyderabad, Balanagar, India
| | - Samanthula Gananadhamu
- Department of Pharmaceutical Analysis, National Institute of Pharmaceutical Education and Research (NIPER), Hyderabad, Balanagar, India
| |
Collapse
|
15
|
Tkalec Ž, Negreira N, López de Alda M, Barceló D, Kosjek T. A novel workflow utilizing open-source software tools in the environmental fate studies: The example of imatinib biotransformation. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 797:149063. [PMID: 34311367 DOI: 10.1016/j.scitotenv.2021.149063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 07/12/2021] [Accepted: 07/12/2021] [Indexed: 06/13/2023]
Abstract
The aim of this study is to utilize novel and powerful workflows with publicly available tools to efficiently process data and facilitate rapid acquisition of knowledge on environmental fate studies. Taking imatinib (IMA) as an example, we developed an efficient workflow to describe IMA biodegradation with activated sludge (AS) from wastewater treatment plants (WWTP). IMA is a cytostatic pharmaceutical; a selective tyrosine kinase inhibitor used to treat chronic myeloid leukemia. Its reported ecotoxic, endocrine and genotoxic effects imply high risk for aquatic wildlife and human health, however its fate in the environment is not yet well known. The study was conducted in a batch biotransformation setup, at two AS concentration levels and in presence and absence of carbon source. Degradation profiles and formation of IMA transformation products (TPs) were investigated using UHPLC-QqOrbitrap-MS/MS which showed that IMA is readily biodegradable. TPs were determined using multivariate statistical analysis. Eight TPs were determined and tentatively identified, six of them for first time. Hydrolysis of amide bond, oxidation, demethylation, deamination, acetylation and succinylation are proposed as major biodegradation pathways. TP235, the product of amide bond hydrolysis, was detected and quantified in actual wastewaters, at levels around 1 ng/L. This calls for more studies on the environmental fate of IMA in order to properly asses the environmental risk and hazard associated to IMA and its TPs.
Collapse
Affiliation(s)
- Žiga Tkalec
- Jožef Stefan Institute, Department of Environmental Sciences, Jamova 39, Ljubljana, Slovenia; Jožef Stefan International Postgraduate School, Jamova 39, Ljubljana, Slovenia
| | - Noelia Negreira
- Water, Environmental and Food Chemistry Unit (ENFOCHEM), Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Spanish National Research Council (CSIC), Barcelona, Spain
| | - Miren López de Alda
- Water, Environmental and Food Chemistry Unit (ENFOCHEM), Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Spanish National Research Council (CSIC), Barcelona, Spain.
| | - Damià Barceló
- Water, Environmental and Food Chemistry Unit (ENFOCHEM), Department of Environmental Chemistry, Institute of Environmental Assessment and Water Research (IDAEA), Spanish National Research Council (CSIC), Barcelona, Spain
| | - Tina Kosjek
- Jožef Stefan Institute, Department of Environmental Sciences, Jamova 39, Ljubljana, Slovenia; Jožef Stefan International Postgraduate School, Jamova 39, Ljubljana, Slovenia.
| |
Collapse
|
16
|
Li D, Gaquerel E. Next-Generation Mass Spectrometry Metabolomics Revives the Functional Analysis of Plant Metabolic Diversity. ANNUAL REVIEW OF PLANT BIOLOGY 2021; 72:867-891. [PMID: 33781077 DOI: 10.1146/annurev-arplant-071720-114836] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The remarkable diversity of specialized metabolites produced by plants has inspired several decades of research and nucleated a long list of theories to guide empirical ecological studies. However, analytical constraints and the lack of untargeted processing workflows have long precluded comprehensive metabolite profiling and, consequently, the collection of the critical currencies to test theory predictions for the ecological functions of plant metabolic diversity. Developments in mass spectrometry (MS) metabolomics have revolutionized the large-scale inventory and annotation of chemicals from biospecimens. Hence, the next generation of MS metabolomics propelled by new bioinformatics developments provides a long-awaited framework to revisit metabolism-centered ecological questions, much like the advances in next-generation sequencing of the last two decades impacted all research horizons in genomics. Here, we review advances in plant (computational) metabolomics to foster hypothesis formulation from complex metabolome data. Additionally, we reflect on how next-generation metabolomics could reinvigorate the testing of long-standing theories on plant metabolic diversity.
Collapse
Affiliation(s)
- Dapeng Li
- Department of Molecular Ecology, Max Planck Institute for Chemical Ecology, 07745 Jena, Germany;
| | - Emmanuel Gaquerel
- Institut de Biologie Moléculaire des Plantes du CNRS, Université de Strasbourg, 67084 Strasbourg, France;
| |
Collapse
|
17
|
Krettler CA, Thallinger GG. A map of mass spectrometry-based in silico fragmentation prediction and compound identification in metabolomics. Brief Bioinform 2021; 22:6184408. [PMID: 33758925 DOI: 10.1093/bib/bbab073] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 01/29/2021] [Accepted: 02/12/2021] [Indexed: 12/27/2022] Open
Abstract
Metabolomics, the comprehensive study of the metabolome, and lipidomics-the large-scale study of pathways and networks of cellular lipids-are major driving forces in enabling personalized medicine. Complicated and error-prone data analysis still remains a bottleneck, however, especially for identifying novel metabolites. Comparing experimental mass spectra to curated databases containing reference spectra has been the gold standard for identification of compounds, but constructing such databases is a costly and time-demanding task. Many software applications try to circumvent this process by utilizing cutting-edge advances in computational methods-including quantum chemistry and machine learning-and simulate mass spectra by performing theoretical, so called in silico fragmentations of compounds. Other solutions concentrate directly on experimental spectra and try to identify structural properties by investigating reoccurring patterns and the relationships between them. The considerable progress made in the field allows recent approaches to provide valuable clues to expedite annotation of experimental mass spectra. This review sheds light on individual strengths and weaknesses of these tools, and attempts to evaluate them-especially in view of lipidomics, when considering complex mixtures found in biological samples as well as mass spectrometer inter-instrument variability.
Collapse
Affiliation(s)
- Christoph A Krettler
- Institute of Biomedical Informatics, Graz University of Technology, Stremayrgasse 16/I, 8010, Graz, Austria.,Omics Center Graz, BioTechMed-Graz, Stiftingtalstrasse 24, 8010, Graz, Austria
| | - Gerhard G Thallinger
- Institute of Biomedical Informatics, Graz University of Technology, Stremayrgasse 16/I, 8010, Graz, Austria.,Omics Center Graz, BioTechMed-Graz, Stiftingtalstrasse 24, 8010, Graz, Austria
| |
Collapse
|
18
|
Desmet S, Saeys Y, Verstaen K, Dauwe R, Kim H, Niculaes C, Fukushima A, Goeminne G, Vanholme R, Ralph J, Boerjan W, Morreel K. Maize specialized metabolome networks reveal organ-preferential mixed glycosides. Comput Struct Biotechnol J 2021; 19:1127-1144. [PMID: 33680356 PMCID: PMC7890092 DOI: 10.1016/j.csbj.2021.01.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 01/06/2021] [Accepted: 01/07/2021] [Indexed: 12/13/2022] Open
Abstract
Despite the scientific and economic importance of maize, little is known about its specialized metabolism. Here, five maize organs were profiled using different reversed-phase liquid chromatography-mass spectrometry methods. The resulting spectral metadata, combined with candidate substrate-product pair (CSPP) networks, allowed the structural characterization of 427 of the 5,420 profiled compounds, including phenylpropanoids, flavonoids, benzoxazinoids, and auxin-related compounds, among others. Only 75 of the 427 compounds were already described in maize. Analysis of the CSPP networks showed that phenylpropanoids are present in all organs, whereas other metabolic classes are rather organ-enriched. Frequently occurring CSPP mass differences often corresponded with glycosyl- and acyltransferase reactions. The interplay of glycosylations and acylations yields a wide variety of mixed glycosides, bearing substructures corresponding to the different biochemical classes. For example, in the tassel, many phenylpropanoid and flavonoid-bearing glycosides also contain auxin-derived moieties. The characterized compounds and mass differences are an important step forward in metabolic pathway discovery and systems biology research. The spectral metadata of the 5,420 compounds is publicly available (DynLib spectral database, https://bioit3.irc.ugent.be/dynlib/).
Collapse
Affiliation(s)
- Sandrien Desmet
- Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Gent, Belgium.,Center for Plant Systems Biology, VIB, B-9052 Gent, Belgium
| | - Yvan Saeys
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, B-9052 Gent, Belgium.,Data Mining and Modelling for Biomedicine, Center for Inflammation Research, VIB, B-9052 Gent, Belgium
| | - Kevin Verstaen
- Data Mining and Modelling for Biomedicine, Center for Inflammation Research, VIB, B-9052 Gent, Belgium
| | - Rebecca Dauwe
- Unité de Recherche BIOPI EA3900, Université de Picardie Jules Verne, 80000 Amiens, France
| | - Hoon Kim
- Department of Biochemistry and the U.S. Department of Energy Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin, Madison, WI 53726, United States
| | - Claudiu Niculaes
- Plant Breeding, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Atsushi Fukushima
- RIKEN Center for Sustainable Resource Science, Yokohama, Kanagawa 230-0045, Japan
| | - Geert Goeminne
- Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Gent, Belgium.,VIB Metabolomics Core Ghent, VIB, B-9052 Gent, Belgium
| | - Ruben Vanholme
- Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Gent, Belgium.,Center for Plant Systems Biology, VIB, B-9052 Gent, Belgium
| | - John Ralph
- Department of Biochemistry and the U.S. Department of Energy Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin, Madison, WI 53726, United States
| | - Wout Boerjan
- Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Gent, Belgium.,Center for Plant Systems Biology, VIB, B-9052 Gent, Belgium
| | - Kris Morreel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Gent, Belgium.,Center for Plant Systems Biology, VIB, B-9052 Gent, Belgium
| |
Collapse
|
19
|
Xing S, Hu Y, Yin Z, Liu M, Tang X, Fang M, Huan T. Retrieving and Utilizing Hypothetical Neutral Losses from Tandem Mass Spectra for Spectral Similarity Analysis and Unknown Metabolite Annotation. Anal Chem 2020; 92:14476-14483. [DOI: 10.1021/acs.analchem.0c02521] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Shipei Xing
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, V6T 1Z1 BC, Canada
| | - Yan Hu
- Department of Computer Sciences, University of British Columbia, 2366 Main Mall, Vancouver, V6T 1Z1 BC, Canada
| | - Zixuan Yin
- Fortinet, 4190 Still Creek Dr, Burnaby, V5C 6C6 BC, Canada
| | - Min Liu
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798, Singapore
| | - Xiaoyu Tang
- Institute of Chemical Biology, Shenzhen Bay Laboratory, Shenzhen 518132, China
| | - Mingliang Fang
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798, Singapore
| | - Tao Huan
- Department of Chemistry, Faculty of Science, University of British Columbia, Vancouver Campus, 2036 Main Mall, Vancouver, V6T 1Z1 BC, Canada
| |
Collapse
|
20
|
Ji H, Deng H, Lu H, Zhang Z. Predicting a Molecular Fingerprint from an Electron Ionization Mass Spectrum with Deep Neural Networks. Anal Chem 2020; 92:8649-8653. [DOI: 10.1021/acs.analchem.0c01450] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Affiliation(s)
- Hongchao Ji
- College of Chemistry and Chemical Engineering, Central South University, Changsha, Hunan 410083, PR China
| | - Hanzi Deng
- College of Chemistry and Chemical Engineering, Central South University, Changsha, Hunan 410083, PR China
| | - Hongmei Lu
- College of Chemistry and Chemical Engineering, Central South University, Changsha, Hunan 410083, PR China
| | - Zhimin Zhang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, Hunan 410083, PR China
| |
Collapse
|
21
|
Nguyen DH, Nguyen CH, Mamitsuka H. ADAPTIVE: leArning DAta-dePendenT, concIse molecular VEctors for fast, accurate metabolite identification from tandem mass spectra. Bioinformatics 2020; 35:i164-i172. [PMID: 31510641 PMCID: PMC6612897 DOI: 10.1093/bioinformatics/btz319] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Motivation Metabolite identification is an important task in metabolomics to enhance the knowledge of biological systems. There have been a number of machine learning-based methods proposed for this task, which predict a chemical structure of a given spectrum through an intermediate (chemical structure) representation called molecular fingerprints. They usually have two steps: (i) predicting fingerprints from spectra; (ii) searching chemical compounds (in database) corresponding to the predicted fingerprints. Fingerprints are feature vectors, which are usually very large to cover all possible substructures and chemical properties, and therefore heavily redundant, in the sense of having many molecular (sub)structures irrelevant to the task, causing limited predictive performance and slow prediction. Results We propose ADAPTIVE, which has two parts: learning two mappings (i) from structures to molecular vectors and (ii) from spectra to molecular vectors. The first part learns molecular vectors for metabolites from given data, to be consistent with both spectra and chemical structures of metabolites. In more detail, molecular vectors are generated by a model, being parameterized by a message passing neural network, and parameters are estimated by maximizing the correlation between molecular vectors and the corresponding spectra in terms of Hilbert-Schmidt Independence Criterion. Molecular vectors generated by this model are compact and importantly adaptive (specific) to both given data and task of metabolite identification. The second part uses input output kernel regression (IOKR), the current cutting-edge method of metabolite identification. We empirically confirmed the effectiveness of ADAPTIVE by using a benchmark data, where ADAPTIVE outperformed the original IOKR in both predictive performance and computational efficiency. Availability and implementation The code will be accessed through http://www.bic.kyoto-u.ac.jp/pathway/tools/ADAPTIVE after the acceptance of this article.
Collapse
Affiliation(s)
- Dai Hai Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
- To whom correspondence should be addressed.
| | - Canh Hao Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
| | - Hiroshi Mamitsuka
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
- Department of Computer Science, Aalto University, Espoo, Finland
| |
Collapse
|
22
|
Hassanpour N, Alden N, Menon R, Jayaraman A, Lee K, Hassoun S. Biological Filtering and Substrate Promiscuity Prediction for Annotating Untargeted Metabolomics. Metabolites 2020; 10:E160. [PMID: 32326153 PMCID: PMC7241244 DOI: 10.3390/metabo10040160] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 04/10/2020] [Accepted: 04/15/2020] [Indexed: 02/07/2023] Open
Abstract
Mass spectrometry coupled with chromatography separation techniques provides a powerful platform for untargeted metabolomics. Determining the chemical identities of detected compounds however remains a major challenge. Here, we present a novel computational workflow, termed extended metabolic model filtering (EMMF), that aims to engineer a candidate set, a listing of putative chemical identities to be used during annotation, through an extended metabolic model (EMM). An EMM includes not only canonical substrates and products of enzymes already cataloged in a database through a reference metabolic model, but also metabolites that can form due to substrate promiscuity. EMMF aims to strike a balance between discovering previously uncharacterized metabolites and the computational burden of annotation. EMMF was applied to untargeted LC-MS data collected from cultures of Chinese hamster ovary (CHO) cells and murine cecal microbiota. EMM metabolites matched, on average, to 23.92% of measured masses, providing a > 7-fold increase in the candidate set size when compared to a reference metabolic model. Many metabolites suggested by EMMF are not catalogued in PubChem. For the CHO cell, we experimentally confirmed the presence of 4-hydroxyphenyllactate, a metabolite predicted by EMMF that has not been previously documented as part of the CHO cell metabolic model.
Collapse
Affiliation(s)
- Neda Hassanpour
- Department of Computer Science, Tufts University, Medford, MA 02421, USA;
| | - Nicholas Alden
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02421, USA; (N.A.); (K.L.)
| | - Rani Menon
- Department of Chemical Engineering, Texas A&M, College Station, TX 77843, USA; (R.M.); (A.J.)
| | - Arul Jayaraman
- Department of Chemical Engineering, Texas A&M, College Station, TX 77843, USA; (R.M.); (A.J.)
| | - Kyongbum Lee
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02421, USA; (N.A.); (K.L.)
| | - Soha Hassoun
- Department of Computer Science, Tufts University, Medford, MA 02421, USA;
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02421, USA; (N.A.); (K.L.)
| |
Collapse
|
23
|
Lee JU, Kim Y, Kim WY, Oh HB. Graph theory-based reaction pathway searches and DFT calculations for the mechanism studies of free radical-initiated peptide sequencing mass spectrometry (FRIPS MS): a model gas-phase reaction of GGR tri-peptide. Phys Chem Chem Phys 2020; 22:5057-5069. [PMID: 32073000 DOI: 10.1039/c9cp05433b] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Graph theory-based reaction pathway searches (ACE-Reaction program) and density functional theory calculations were performed to shed light on the mechanisms for the production of [an + H]+, xn+, yn+, zn+, and [yn + 2H]+ fragments formed in free radical-initiated peptide sequencing (FRIPS) mass spectrometry measurements of a small model system of glycine-glycine-arginine (GGR). In particular, the graph theory-based searches, which are rarely applied to gas-phase reaction studies, allowed us to investigate reaction mechanisms in an exhaustive manner without resorting to chemical intuition. As expected, radical-driven reaction pathways were favorable over charge-driven reaction pathways in terms of kinetics and thermodynamics. Charge- and radical-driven pathways for the formation of [yn + 2H]+ fragments were carefully compared, and it was revealed that the [yn + 2H]+ fragments observed in our FRIPS MS spectra originated from the radical-driven pathway, which is in contrast to the general expectation. The acquired understanding of the FRIPS fragmentation mechanism is expected to aid in the interpretation of FRIPS MS spectra. It should be emphasized that graph theory-based searches are powerful and effective methods for studying reaction mechanisms, including gas-phase reactions in mass spectrometry.
Collapse
Affiliation(s)
- Jae-Ung Lee
- Department of Chemistry, Sogang University, Seoul 04107, Republic of Korea.
| | | | | | | |
Collapse
|
24
|
Abstract
SIRIUS 4 is the best-in-class computational tool for metabolite identification from high-resolution tandem mass spectrometry data. It offers de novo molecular formula annotation with outstanding accuracy. When searching fragmentation spectra in a structure database, it reaches over 70% correct identifications. A predicted fingerprint, which indicates the presence or absence of thousands of molecular properties, helps to deduce information about the compound of interest even if it is not contained in any structure database. Here, we present best practices and describe how to leverage the full potential of SIRIUS 4, how to incorporate it into your own workflow, and how it adds value to the analysis of mass spectrometry data beyond spectral library search.
Collapse
|
25
|
Wang M, Jarmusch AK, Vargas F, Aksenov AA, Gauglitz JM, Weldon K, Petras D, da Silva R, Quinn R, Melnik AV, van der Hooft JJJ, Caraballo-Rodríguez AM, Nothias LF, Aceves CM, Panitchpakdi M, Brown E, Di Ottavio F, Sikora N, Elijah EO, Labarta-Bajo L, Gentry EC, Shalapour S, Kyle KE, Puckett SP, Watrous JD, Carpenter CS, Bouslimani A, Ernst M, Swafford AD, Zúñiga EI, Balunas MJ, Klassen JL, Loomba R, Knight R, Bandeira N, Dorrestein PC. Mass spectrometry searches using MASST. Nat Biotechnol 2020; 38:23-26. [PMID: 31894142 PMCID: PMC7236533 DOI: 10.1038/s41587-019-0375-9] [Citation(s) in RCA: 135] [Impact Index Per Article: 33.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Mingxun Wang
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Ometa Labs LLC, San Diego, CA, USA
| | - Alan K Jarmusch
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Fernando Vargas
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Division of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Alexander A Aksenov
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Julia M Gauglitz
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Kelly Weldon
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
| | - Daniel Petras
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Ricardo da Silva
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Robert Quinn
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Alexey V Melnik
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Justin J J van der Hooft
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Bioinformatics Group, Wageningen University, Wageningen, The Netherlands
| | - Andrés Mauricio Caraballo-Rodríguez
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Louis Felix Nothias
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Christine M Aceves
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Morgan Panitchpakdi
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Elizabeth Brown
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Francesca Di Ottavio
- Faculty of Bioscience and Technology for Food, Agriculture, and Environment, University of Teramo, Teramo, TE, Italy
| | - Nicole Sikora
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Emmanuel O Elijah
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Lara Labarta-Bajo
- Division of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Emily C Gentry
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Shabnam Shalapour
- Department of Pharmacology, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Kathleen E Kyle
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Sara P Puckett
- Division of Medicinal Chemistry, Department of Pharmaceutical Sciences, University of Connecticut, Storrs, CT, USA
| | - Jeramie D Watrous
- Department of Medicine, University of California San Diego, San Diego, California, USA
| | - Carolina S Carpenter
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
| | - Amina Bouslimani
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Madeleine Ernst
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Austin D Swafford
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
| | - Elina I Zúñiga
- Division of Biological Sciences, University of California San Diego, La Jolla, CA, USA
| | - Marcy J Balunas
- Division of Medicinal Chemistry, Department of Pharmaceutical Sciences, University of Connecticut, Storrs, CT, USA
| | - Jonathan L Klassen
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Rohit Loomba
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
- Division of Gastroenterology, University of California San Diego, La Jolla, CA, USA
| | - Rob Knight
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
| | - Nuno Bandeira
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA.
- Department of Pharmacology, School of Medicine, University of California San Diego, La Jolla, CA, USA.
- Department of Pediatrics, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
26
|
Liu Y, Romijn EP, Verniest G, Laukens K, De Vijlder T. Mass spectrometry-based structure elucidation of small molecule impurities and degradation products in pharmaceutical development. Trends Analyt Chem 2019. [DOI: 10.1016/j.trac.2019.115686] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
27
|
Nguyen DH, Nguyen CH, Mamitsuka H. Recent advances and prospects of computational methods for metabolite identification: a review with emphasis on machine learning approaches. Brief Bioinform 2019; 20:2028-2043. [PMID: 30099485 PMCID: PMC6954430 DOI: 10.1093/bib/bby066] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 06/14/2018] [Accepted: 07/03/2018] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Metabolomics involves studies of a great number of metabolites, which are small molecules present in biological systems. They play a lot of important functions such as energy transport, signaling, building block of cells and inhibition/catalysis. Understanding biochemical characteristics of the metabolites is an essential and significant part of metabolomics to enlarge the knowledge of biological systems. It is also the key to the development of many applications and areas such as biotechnology, biomedicine or pharmaceuticals. However, the identification of the metabolites remains a challenging task in metabolomics with a huge number of potentially interesting but unknown metabolites. The standard method for identifying metabolites is based on the mass spectrometry (MS) preceded by a separation technique. Over many decades, many techniques with different approaches have been proposed for MS-based metabolite identification task, which can be divided into the following four groups: mass spectra database, in silico fragmentation, fragmentation tree and machine learning. In this review paper, we thoroughly survey currently available tools for metabolite identification with the focus on in silico fragmentation, and machine learning-based approaches. We also give an intensive discussion on advanced machine learning methods, which can lead to further improvement on this task.
Collapse
Affiliation(s)
- Dai Hai Nguyen
- Department of machine learning and bioinformatics, Bioinformatics Center, Kyoto University, Uji, Japan
| | - Canh Hao Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
| | - Hiroshi Mamitsuka
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
- Department of Computer Science, Aalto University, Otakaari, FI, Finland
| |
Collapse
|
28
|
Nguyen DH, Nguyen CH, Mamitsuka H. SIMPLE: Sparse Interaction Model over Peaks of moLEcules for fast, interpretable metabolite identification from tandem mass spectra. Bioinformatics 2019; 34:i323-i332. [PMID: 29950009 PMCID: PMC6022642 DOI: 10.1093/bioinformatics/bty252] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Motivation Recent success in metabolite identification from tandem mass spectra has been led by machine learning, which has two stages: mapping mass spectra to molecular fingerprint vectors and then retrieving candidate molecules from the database. In the first stage, i.e. fingerprint prediction, spectrum peaks are features and considering their interactions would be reasonable for more accurate identification of unknown metabolites. Existing approaches of fingerprint prediction are based on only individual peaks in the spectra, without explicitly considering the peak interactions. Also the current cutting-edge method is based on kernels, which are computationally heavy and difficult to interpret. Results We propose two learning models that allow to incorporate peak interactions for fingerprint prediction. First, we extend the state-of-the-art kernel learning method by developing kernels for peak interactions to combine with kernels for peaks through multiple kernel learning (MKL). Second, we formulate a sparse interaction model for metabolite peaks, which we call SIMPLE, which is computationally light and interpretable for fingerprint prediction. The formulation of SIMPLE is convex and guarantees global optimization, for which we develop an alternating direction method of multipliers (ADMM) algorithm. Experiments using the MassBank dataset show that both models achieved comparative prediction accuracy with the current top-performance kernel method. Furthermore SIMPLE clearly revealed individual peaks and peak interactions which contribute to enhancing the performance of fingerprint prediction. Availability and implementation The code will be accessed through http://mamitsukalab.org/tools/SIMPLE/.
Collapse
Affiliation(s)
- Dai Hai Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
| | - Canh Hao Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan
| | - Hiroshi Mamitsuka
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Japan.,Department of Computer Science, Alato University, Espoo, Finland
| |
Collapse
|
29
|
Koopman J, Grimme S. Calculation of Electron Ionization Mass Spectra with Semiempirical GFNn-xTB Methods. ACS OMEGA 2019; 4:15120-15133. [PMID: 31552357 PMCID: PMC6751715 DOI: 10.1021/acsomega.9b02011] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Accepted: 08/05/2019] [Indexed: 05/31/2023]
Abstract
In this work, we have tested two different extended tight-binding methods in the framework of the quantum chemistry electron ionization mass spectrometry (QCEIMS) program to calculate electron ionization mass spectra. The QCEIMS approach provides reasonable, first-principles computed spectra, which can be directly compared to experiment. Furthermore, it provides detailed insight into the reaction mechanisms of mass spectrometry experiments. It sheds light upon the complicated fragmentation procedures of bond breakage and structural rearrangements that are difficult to derive otherwise. The required accuracy and computational demands for successful reproduction of a mass spectrum in relation to the underlying quantum chemical method are discussed. To validate the new GFN2-xTB approach, we conduct simulations for 15 organic, transition-metal, and main-group inorganic systems. Major fragmentation patterns are analyzed, and the entire calculated spectra are directly compared to experimental data taken from the literature. We discuss the computational costs and the robustness (outliers) of several calculation protocols presented. Overall, the new, theoretically more sophisticated semiempirical method GFN2-xTB performs well and robustly for a wide range of organic, inorganic, and organometallic systems.
Collapse
Affiliation(s)
- Jeroen Koopman
- Mulliken Center for Theoretical
Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Stefan Grimme
- Mulliken Center for Theoretical
Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| |
Collapse
|
30
|
Hautbergue T, Jamin EL, Debrauwer L, Puel O, Oswald IP. From genomics to metabolomics, moving toward an integrated strategy for the discovery of fungal secondary metabolites. Nat Prod Rep 2019; 35:147-173. [PMID: 29384544 DOI: 10.1039/c7np00032d] [Citation(s) in RCA: 96] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Fungal secondary metabolites are defined by bioactive properties that ensure adaptation of the fungus to its environment. Although some of these natural products are promising sources of new lead compounds especially for the pharmaceutical industry, others pose risks to human and animal health. The identification of secondary metabolites is critical to assessing both the utility and risks of these compounds. Since fungi present biological specificities different from other microorganisms, this review covers the different strategies specifically used in fungal studies to perform this critical identification. Strategies focused on the direct detection of the secondary metabolites are firstly reported. Particularly, advances in high-throughput untargeted metabolomics have led to the generation of large datasets whose exploitation and interpretation generally require bioinformatics tools. Then, the genome-based methods used to study the entire fungal metabolic potential are reported. Transcriptomic and proteomic tools used in the discovery of fungal secondary metabolites are presented as links between genomic methods and metabolomic experiments. Finally, the influence of the culture environment on the synthesis of secondary metabolites by fungi is highlighted as a major factor to consider in research on fungal secondary metabolites. Through this review, we seek to emphasize that the discovery of natural products should integrate all of these valuable tools. Attention is also drawn to emerging technologies that will certainly revolutionize fungal research and to the use of computational tools that are necessary but whose results should be interpreted carefully.
Collapse
Affiliation(s)
- T Hautbergue
- Toxalim (Research Centre in Food Toxicology) Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, F-31027 Toulouse, France.
| | | | | | | | | |
Collapse
|
31
|
Wang S, Alseekh S, Fernie AR, Luo J. The Structure and Function of Major Plant Metabolite Modifications. MOLECULAR PLANT 2019; 12:899-919. [PMID: 31200079 DOI: 10.1016/j.molp.2019.06.001] [Citation(s) in RCA: 193] [Impact Index Per Article: 38.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 05/27/2019] [Accepted: 06/04/2019] [Indexed: 05/23/2023]
Abstract
Plants produce a myriad of structurally and functionally diverse metabolites that play many different roles in plant growth and development and in plant response to continually changing environmental conditions as well as abiotic and biotic stresses. This metabolic diversity is, to a large extent, due to chemical modification of the basic skeletons of metabolites. Here, we review the major known plant metabolite modifications and summarize the progress that has been achieved and the challenges we are facing in the field. We focus on discussing both technical and functional aspects in studying the influences that various modifications have on biosynthesis, degradation, transport, and storage of metabolites, as well as their bioactivity and toxicity. Finally, we discuss some emerging insights into the evolution of metabolic pathways and metabolite functionality.
Collapse
Affiliation(s)
- Shouchuang Wang
- Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresource, College of Tropical Crops, Hainan University, Haikou 572208, China
| | - Saleh Alseekh
- Max Planck Institute of Molecular Plant Physiology, Potsdam-Golm 14476, Germany; Centre of Plant Systems Biology and Biotechnology, Plovdiv 4000, Bulgaria
| | - Alisdair R Fernie
- Max Planck Institute of Molecular Plant Physiology, Potsdam-Golm 14476, Germany; Centre of Plant Systems Biology and Biotechnology, Plovdiv 4000, Bulgaria.
| | - Jie Luo
- Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresource, College of Tropical Crops, Hainan University, Haikou 572208, China; National Key Laboratory of Crop Genetic Improvement and National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan 430070, China.
| |
Collapse
|
32
|
Abouzeid S, Beutling U, Selmar D. Stress-induced modification of indole alkaloids:Phytomodificines as a new category of specialized metabolites. PHYTOCHEMISTRY 2019; 159:102-107. [PMID: 30605851 DOI: 10.1016/j.phytochem.2018.12.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 12/19/2018] [Accepted: 12/22/2018] [Indexed: 06/09/2023]
Abstract
This study focuses on the elucidation of the stress-induced reverse changes of major indole alkaloids in Vinca minor, primarily on the postulated conversion of vincamine and vincadifformine to yield 9-methoxyvincamine, minovincine, and minovincinine, respectively. By applying the P450 enzyme inhibitors, naproxen and resveratrol, it was shown that the oxidative reaction involved in the postulated conversion of vincamine and vincadifformine is catalyzed by cytochrome P450 enzymes. In combination with the identification of 9-hydroxyvincamine as a postulated intermediate, this result confirms that the observed stress-induced reverse changes in the alkaloid pattern are caused by modifications of the alkaloids which regularly accumulate in the healthy Vinca minor plants. Up to now, just two main types of defense compounds are distinguished: phytoalexins, which are synthesized de novo from primary metabolites and phytoanticipins, which are constitutively present in plants - either intrinsically active or are activated after cell death by hydrolysis or oxidation of the precursors. In contrast, the results presented in this paper demonstrate that indole alkaloids, representing typical phytoanticipins, are just slightly modified in response to a stress-related elicitation. Accordingly, these modified alkaloids neither represent classical phytoalexins (being synthesized de novo), nor can they be classified as phytoanticipins, since modification does not occur postmortem. Consequently, we propose a new category for these modified alkaloids that we call phytomodificines.
Collapse
Affiliation(s)
- Sara Abouzeid
- Institute for Plant Biology, TU Braunschweig, Mendelssohnsstr. 4, 38106, Braunschweig, Germany; Pharmacognosy Department, Faculty of Pharmacy, Mansoura University, Mansoura, 35516, Egypt
| | - Ulrike Beutling
- Department of Chemical Biology, Helmholtz Centre for Infection Research, Inhoffenstraße7, 38124, Braunschweig, Germany
| | - Dirk Selmar
- Institute for Plant Biology, TU Braunschweig, Mendelssohnsstr. 4, 38106, Braunschweig, Germany.
| |
Collapse
|
33
|
Schüler JA, Neumann S, Müller-Hannemann M, Brandt W. ChemFrag: Chemically meaningful annotation of fragment ion mass spectra. JOURNAL OF MASS SPECTROMETRY : JMS 2018; 53:1104-1115. [PMID: 30103269 DOI: 10.1002/jms.4278] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2018] [Revised: 07/23/2018] [Accepted: 07/25/2018] [Indexed: 05/26/2023]
Abstract
Identification and structural determination of small molecules by mass spectrometry is an important step in chemistry and biochemistry. However, the chemically realistic annotation of a fragment ion spectrum can be a difficult challenge. We developed ChemFrag, for the detection of fragmentation pathways and the annotation of fragment ions with chemically reasonable structures. ChemFrag combines a quantum chemical with a rule-based approach. For different doping substances as test instances, ChemFrag correctly annotates fragment ions. In most cases, the predicted fragments are chemically more realistic than those from purely combinatorial approaches, or approaches based on machine learning. The annotation generated by ChemFrag often coincides with spectra that have been manually annotated by experts. This is a major advance in peak annotation and allows a more precise automatic interpretation of mass spectra.
Collapse
Affiliation(s)
- Jördis-Ann Schüler
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Von-Seckendorff-Platz 1, Halle (Saale), 06120, Germany
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry, Weinberg 3, Halle (Saale), 06120, Germany
| | - Steffen Neumann
- Department of Stress and Development Biology, Leibniz Institute of Plant Biochemistry, Weinberg 3, Halle (Saale), 06120, Germany
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Deutscher Platz 5e, Leipzig, 04103, Germany
| | - Matthias Müller-Hannemann
- Institute of Computer Science, Martin Luther University Halle-Wittenberg, Von-Seckendorff-Platz 1, Halle (Saale), 06120, Germany
| | - Wolfgang Brandt
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry, Weinberg 3, Halle (Saale), 06120, Germany
| |
Collapse
|
34
|
Zuber J, Rathsack P, Otto M. Structural Characterization of Acidic Compounds in Pyrolysis Liquids Using Collision-Induced Dissociation and Fourier Transform Ion Cyclotron Resonance Mass Spectrometry. Anal Chem 2018; 90:12655-12662. [PMID: 30280888 DOI: 10.1021/acs.analchem.8b02873] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In this study, a novel approach to characterize and identify acidic oil compounds utilizing the fragmentational behavior of their corresponding precursor ions is presented. Precursor ions of seven analyzed pyrolysis oils that were generated from pyrolysis educts of different origins and degrees of coalification were produced by electrospray ionization in the negative ion mode (ESI(-)). Following a fragmentation of all ions in the ion cloud by collision-induced dissociation (CID), the precursor and product ions were subsequently detected by ultrahigh resolving Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS). The ESI(-)-CID data sets were evaluated by applying either a targeted classification or untargeted clustering approach. In the case of the targeted classification, 10% of the ionized precursor ions of the analyzed pyrolysis liquid samples could be classified into one of 11 compound classes utilizing theoretical fragmentation pathways of these classes. In contrast, theoretical fragmentation pathways were not necessary for the untargeted clustering approach, making it the more transmittable method. Results from both approaches were verified by analyzing standard compounds of known structure. The analysis and data evaluation methods presented in this work can be used to characterize complex organic mixtures, such as pyrolysis oils, and their compounds in-depth on a structural level.
Collapse
Affiliation(s)
- Jan Zuber
- Institute of Analytical Chemistry , TU Bergakademie Freiberg , Leipziger Straße 29 , 09599 Freiberg , Germany
| | - Philipp Rathsack
- Institute of Analytical Chemistry , TU Bergakademie Freiberg , Leipziger Straße 29 , 09599 Freiberg , Germany.,German Centre for Energy Resources , Reiche Zeche , Fuchsmuehlenweg 9 , 09599 Freiberg , Germany
| | - Matthias Otto
- Institute of Analytical Chemistry , TU Bergakademie Freiberg , Leipziger Straße 29 , 09599 Freiberg , Germany
| |
Collapse
|
35
|
Mohimani H, Gurevich A, Shlemov A, Mikheenko A, Korobeynikov A, Cao L, Shcherbin E, Nothias LF, Dorrestein PC, Pevzner PA. Dereplication of microbial metabolites through database search of mass spectra. Nat Commun 2018; 9:4035. [PMID: 30279420 PMCID: PMC6168521 DOI: 10.1038/s41467-018-06082-8] [Citation(s) in RCA: 174] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 08/14/2018] [Indexed: 12/24/2022] Open
Abstract
Natural products have traditionally been rich sources for drug discovery. In order to clear the road toward the discovery of unknown natural products, biologists need dereplication strategies that identify known ones. Here we report DEREPLICATOR+, an algorithm that improves on the previous approaches for identifying peptidic natural products, and extends them for identification of polyketides, terpenes, benzenoids, alkaloids, flavonoids, and other classes of natural products. We show that DEREPLICATOR+ can search all spectra in the recently launched Global Natural Products Social molecular network and identify an order of magnitude more natural products than previous dereplication efforts. We further demonstrate that DEREPLICATOR+ enables cross-validation of genome-mining and peptidogenomics/glycogenomics results.
Collapse
Affiliation(s)
- Hosein Mohimani
- Computational Biology Department, School of Computer Sciences, Carnegie Mellon University, Pittsburgh, PA, USA.
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA.
| | - Alexey Gurevich
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Alexander Shlemov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Anton Korobeynikov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
- Department of Statistical Modelling, St. Petersburg State University, St. Petersburg, Russia
| | - Liu Cao
- Computational Biology Department, School of Computer Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Egor Shcherbin
- National Research University Higher School of Economics, St. Petersburg, Russia
| | - Louis-Felix Nothias
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, USA
- Department of Pharmacology and Pediatrics, University of California, San Diego, La Jolla, CA, USA
| | - Pavel A Pevzner
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| |
Collapse
|
36
|
Zhang K, Kurita KL, Venkatramani C, Russell D. Seeking universal detectors for analytical characterizations. J Pharm Biomed Anal 2018; 162:192-204. [PMID: 30265979 DOI: 10.1016/j.jpba.2018.09.029] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Revised: 09/14/2018] [Accepted: 09/15/2018] [Indexed: 12/11/2022]
Abstract
It is highly desirable to have a universal detector that can detect all types of compounds and give a uniform response regardless of the physiochemical properties of the compounds. With such a universal detector, all components in a sample can be accurately quantified without the need for individual standards. This is especially needed for the characterization of unknowns and for non-targeted analysis, or for samples that have no isolated standards available for each component. Over the years, much effort has been put into seeking a universal detection technology. In this review, we discuss the commonly used detectors for analytical characterization, including UV, RI, ELSD, CAD, CLND, FID, VUV, MS, NMR, and hyphenated detection, with the focuses on the "universal" features of these detectors regarding the types of molecules they can detect and the uniformity of responses.
Collapse
Affiliation(s)
- Kelly Zhang
- Genentech Inc., 1 DNA Way, South San Francisco, CA 94080, United States.
| | - Kenji L Kurita
- Genentech Inc., 1 DNA Way, South San Francisco, CA 94080, United States
| | | | - David Russell
- Genentech Inc., 1 DNA Way, South San Francisco, CA 94080, United States
| |
Collapse
|
37
|
De Vijlder T, Valkenborg D, Lemière F, Romijn EP, Laukens K, Cuyckens F. A tutorial in small molecule identification via electrospray ionization-mass spectrometry: The practical art of structural elucidation. MASS SPECTROMETRY REVIEWS 2018; 37:607-629. [PMID: 29120505 PMCID: PMC6099382 DOI: 10.1002/mas.21551] [Citation(s) in RCA: 127] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 10/03/2017] [Indexed: 05/10/2023]
Abstract
The identification of unknown molecules has been one of the cornerstone applications of mass spectrometry for decades. This tutorial reviews the basics of the interpretation of electrospray ionization-based MS and MS/MS spectra in order to identify small-molecule analytes (typically below 2000 Da). Most of what is discussed in this tutorial also applies to other atmospheric pressure ionization methods like atmospheric pressure chemical/photoionization. We focus primarily on the fundamental steps of MS-based structural elucidation of individual unknown compounds, rather than describing strategies for large-scale identification in complex samples. We critically discuss topics like the detection of protonated and deprotonated ions ([M + H]+ and [M - H]- ) as well as other adduct ions, the determination of the molecular formula, and provide some basic rules on the interpretation of product ion spectra. Our tutorial focuses primarily on the fundamental steps of MS-based structural elucidation of individual unknown compounds (eg, contaminants in chemical production, pharmacological alteration of drugs), rather than describing strategies for large-scale identification in complex samples. This tutorial also discusses strategies to obtain useful orthogonal information (UV/Vis, H/D exchange, chemical derivatization, etc) and offers an overview of the different informatics tools and approaches that can be used for structural elucidation of small molecules. It is primarily intended for beginning mass spectrometrists and researchers from other mass spectrometry sub-disciplines that want to get acquainted with structural elucidation are interested in some practical tips and tricks.
Collapse
Affiliation(s)
- Thomas De Vijlder
- Pharmaceutical Development & Manufacturing Sciences (PDMS)Janssen Research & DevelopmentBeerseBelgium
| | - Dirk Valkenborg
- Interuniversity Institute for Biostatistics and Statistical BioinformaticsHasselt UniversityDiepenbeekBelgium
- Center for Proteomics (CFP)University of AntwerpAntwerpBelgium
- Flemish Institute for Technological Research (VITO)MolBelgium
| | - Filip Lemière
- Center for Proteomics (CFP)University of AntwerpAntwerpBelgium
- Department of Chemistry, Biomolecular and Analytical Mass SpectrometryUniversity of AntwerpAntwerpBelgium
| | - Edwin P. Romijn
- Pharmaceutical Development & Manufacturing Sciences (PDMS)Janssen Research & DevelopmentBeerseBelgium
| | - Kris Laukens
- Department of Mathematics and Computer Science, Advanced Database Research and Modelling (ADReM)University of AntwerpAntwerpBelgium
- Biomedical Informatics Network Antwerp (Biomina)University of AntwerpAntwerpBelgium
| | - Filip Cuyckens
- Pharmacokinetics, Dynamics & MetabolismJanssen Research & DevelopmentBeerseBelgium
| |
Collapse
|
38
|
Fu Y, Zhang Y, Zhou Z, Lu X, Lin X, Zhao C, Xu G. Screening and Determination of Potential Risk Substances Based on Liquid Chromatography–High-Resolution Mass Spectrometry. Anal Chem 2018; 90:8454-8461. [DOI: 10.1021/acs.analchem.8b01153] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Yanqing Fu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanhui Zhang
- School of Computer Science & Technology, Dalian University of Technology, Dalian 116023, China
| | - Zhihui Zhou
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xin Lu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaohui Lin
- School of Computer Science & Technology, Dalian University of Technology, Dalian 116023, China
| | - Chunxia Zhao
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guowang Xu
- CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
39
|
Blaženović I, Kind T, Ji J, Fiehn O. Software Tools and Approaches for Compound Identification of LC-MS/MS Data in Metabolomics. Metabolites 2018; 8:E31. [PMID: 29748461 PMCID: PMC6027441 DOI: 10.3390/metabo8020031] [Citation(s) in RCA: 402] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2018] [Revised: 04/26/2018] [Accepted: 05/06/2018] [Indexed: 01/17/2023] Open
Abstract
The annotation of small molecules remains a major challenge in untargeted mass spectrometry-based metabolomics. We here critically discuss structured elucidation approaches and software that are designed to help during the annotation of unknown compounds. Only by elucidating unknown metabolites first is it possible to biologically interpret complex systems, to map compounds to pathways and to create reliable predictive metabolic models for translational and clinical research. These strategies include the construction and quality of tandem mass spectral databases such as the coalition of MassBank repositories and investigations of MS/MS matching confidence. We present in silico fragmentation tools such as MS-FINDER, CFM-ID, MetFrag, ChemDistiller and CSI:FingerID that can annotate compounds from existing structure databases and that have been used in the CASMI (critical assessment of small molecule identification) contests. Furthermore, the use of retention time models from liquid chromatography and the utility of collision cross-section modelling from ion mobility experiments are covered. Workflows and published examples of successfully annotated unknown compounds are included.
Collapse
Affiliation(s)
- Ivana Blaženović
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Tobias Kind
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
| | - Jian Ji
- State Key Laboratory of Food Science and Technology, School of Food Science of Jiangnan University, School of Food Science Synergetic Innovation Center of Food Safety and Nutrition, Wuxi 214122, China.
| | - Oliver Fiehn
- NIH West Coast Metabolomics Center, UC Davis Genome Center, University of California, Davis, CA 95616, USA.
- Department of Biochemistry, Faculty of Sciences, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
| |
Collapse
|
40
|
Cai T, Guo ZQ, Xu XY, Wu ZJ. Recent (2000-2015) developments in the analysis of minor unknown natural products based on characteristic fragment information using LC-MS. MASS SPECTROMETRY REVIEWS 2018; 37:202-216. [PMID: 27341181 DOI: 10.1002/mas.21514] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 06/02/2016] [Indexed: 06/06/2023]
Abstract
Liquid chromatography-Mass Spectrometry (LC-MS) has been widely used in natural product analysis. Global detection and identification of nontargeted components are desirable in natural product research, for example, in quality control of Chinese herbal medicine. Nontargeted components analysis continues to expand to exciting life science application domains such as metabonomics. With this background, the present review summarizes recent developments in the analysis of minor unknown natural products using LC-MS and mainly focuses on the determination of the molecular formulae, selection of precursor ions, and characteristic fragmentation patterns of the known compounds. This review consists of three parts. Firstly, the methods used to determine unique molecular formula of unknown compounds such as accurate mass measurements, MSn spectra, or relative isotopic abundance information, are introduced. Secondly, the methods improving signal-to-noise ratio of MS/MS spectra by manual-MS/MS or workflow targeting-only signals were elucidated; pure precursor ions can be selected by changing the precursor ion isolated window. Lastly, characteristic fragmentation patterns such as Retro-Diels-Alder (RDA), McLafferty rearrangements, "internal residue loss," and so on, occurring in the molecular ions of natural products are summarized. Classical application of characteristic fragmentation patterns in identifying unknown compounds in extracts and relevant fragmentation mechanisms are presented (RDA reactions occurring readily in the molecular ions of flavanones or isoflavanones, McLafferty-type fragmentation reactions of some natural products such as epipolythiodioxopiperazines; fragmentation by "internal residue loss" possibly involving ion-neutral complex intermediates). © 2016 Wiley Periodicals, Inc. Mass Spec Rev 37:202-216, 2018.
Collapse
Affiliation(s)
- Tian Cai
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
- Chengdu Institute of Organic Chemistry, Chinese Academy of Sciences, Chengdu, 610041, China
- University of Chinese Academy of Sciences, Beijing, 100039, China
| | - Ze-Qin Guo
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
- College of Bioengineering, Chongqing University, Chongqing, 400044, China
| | - Xiao-Ying Xu
- Chengdu Institute of Organic Chemistry, Chinese Academy of Sciences, Chengdu, 610041, China
| | - Zhi-Jun Wu
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041, China
| |
Collapse
|
41
|
Scheubert K, Hufsky F, Petras D, Wang M, Nothias LF, Dührkop K, Bandeira N, Dorrestein PC, Böcker S. Significance estimation for large scale metabolomics annotations by spectral matching. Nat Commun 2017; 8:1494. [PMID: 29133785 PMCID: PMC5684233 DOI: 10.1038/s41467-017-01318-5] [Citation(s) in RCA: 106] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 09/08/2017] [Indexed: 12/17/2022] Open
Abstract
The annotation of small molecules in untargeted mass spectrometry relies on the matching of fragment spectra to reference library spectra. While various spectrum-spectrum match scores exist, the field lacks statistical methods for estimating the false discovery rates (FDR) of these annotations. We present empirical Bayes and target-decoy based methods to estimate the false discovery rate (FDR) for 70 public metabolomics data sets. We show that the spectral matching settings need to be adjusted for each project. By adjusting the scoring parameters and thresholds, the number of annotations rose, on average, by +139% (ranging from -92 up to +5705%) when compared with a default parameter set available at GNPS. The FDR estimation methods presented will enable a user to assess the scoring criteria for large scale analysis of mass spectrometry based metabolomics data that has been essential in the advancement of proteomics, transcriptomics, and genomics science.
Collapse
Affiliation(s)
- Kerstin Scheubert
- Chair for Bioinformatics, Friedrich Schiller University Jena, Jena, 07743, Germany
| | - Franziska Hufsky
- Chair for Bioinformatics, Friedrich Schiller University Jena, Jena, 07743, Germany
- RNA Bioinformatics and High Throughput Analysis, Friedrich Schiller University Jena, Jena, 07743, Germany
| | - Daniel Petras
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
| | - Mingxun Wang
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
| | - Louis-Félix Nothias
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
| | - Kai Dührkop
- Chair for Bioinformatics, Friedrich Schiller University Jena, Jena, 07743, Germany
| | - Nuno Bandeira
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, CA, 92093, USA
| | - Sebastian Böcker
- Chair for Bioinformatics, Friedrich Schiller University Jena, Jena, 07743, Germany.
| |
Collapse
|
42
|
Fu Y, Zhao C, Lu X, Xu G. Nontargeted screening of chemical contaminants and illegal additives in food based on liquid chromatography–high resolution mass spectrometry. Trends Analyt Chem 2017. [DOI: 10.1016/j.trac.2017.07.014] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
43
|
Hufsky F, Böcker S. Mining molecular structure databases: Identification of small molecules based on fragmentation mass spectrometry data. MASS SPECTROMETRY REVIEWS 2017; 36:624-633. [PMID: 26763615 DOI: 10.1002/mas.21489] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Accepted: 12/18/2015] [Indexed: 06/05/2023]
Abstract
Mass spectrometry (MS) is a key technology for the analysis of small molecules. For the identification and structural elucidation of novel molecules, new approaches beyond straightforward spectral comparison are required. In this review, we will cover computational methods that help with the identification of small molecules by analyzing fragmentation MS data. We focus on the four main approaches to mine a database of metabolite structures, that is rule-based fragmentation spectrum prediction, combinatorial fragmentation, competitive fragmentation modeling, and molecular fingerprint prediction. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:624-633, 2017.
Collapse
Affiliation(s)
- Franziska Hufsky
- Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Ernst-Abbe-Platz 2, Jena, 07743, Germany
- Bioinformatik für Hochdurchsatzverfahren, Friedrich-Schiller-Universität Jena, Leutragraben 1, Jena, 07743, Germany
| | - Sebastian Böcker
- Lehrstuhl für Bioinformatik, Friedrich-Schiller-Universität Jena, Ernst-Abbe-Platz 2, Jena, 07743, Germany
| |
Collapse
|
44
|
Ásgeirsson V, Bauer CA, Grimme S. Quantum chemical calculation of electron ionization mass spectra for general organic and inorganic molecules. Chem Sci 2017; 8:4879-4895. [PMID: 28959412 PMCID: PMC5603848 DOI: 10.1039/c7sc00601b] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 05/04/2017] [Indexed: 01/03/2023] Open
Abstract
We introduce a fully stand-alone version of the Quantum Chemistry Electron Ionization Mass Spectra (QCEIMS) program [S. Grimme, Angew. Chem. Int. Ed., 2013, 52, 6306] allowing efficient simulations for molecules composed of elements with atomic numbers up to Z = 86. The recently developed extended tight-binding semi-empirical method GFN-xTB has been combined with QCEIMS, thereby eliminating dependencies on third-party electronic structure software. Furthermore, for reasonable calculations of ionization potentials, as required by the method, a second tight-binding variant, IPEA-xTB, is introduced here. This novel combination of methods allows the automatic, fast and reasonably accurate computation of electron ionization mass spectra for structurally different molecules across the periodic table. In order to validate and inspect the transferability of the method, we perform large-scale simulations for some representative organic, organometallic, and main-group inorganic systems. Theoretical spectra for 23 molecules are compared directly to experimental data taken from standard databases. For the first time, realistic quantum chemistry based EI-MS for organometallic systems like ferrocene or copper(ii)acetylacetonate are presented. Compared to previously used semiempirical methods, GFN-xTB is faster, more robust, and yields overall higher quality spectra. The partially analysed theoretical reaction and fragmentation mechanisms are chemically reasonable and reveal in unprecedented detail the extreme complexity of high energy gas phase ion chemistry including complicated rearrangement reactions prior to dissociation.
Collapse
Affiliation(s)
- Vilhjálmur Ásgeirsson
- Mulliken Center for Theoretical Chemistry , Institute of Physical and Theoretical Chemistry , University of Bonn , Beringstr. 4 , 53115 Bonn , Germany . ; Tel: +49 228 73 2351
- Faculty of Physical Sciences and Science Institute , University of Iceland , 107 Reykjavík , Iceland
| | - Christoph A Bauer
- Mulliken Center for Theoretical Chemistry , Institute of Physical and Theoretical Chemistry , University of Bonn , Beringstr. 4 , 53115 Bonn , Germany . ; Tel: +49 228 73 2351
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry , Institute of Physical and Theoretical Chemistry , University of Bonn , Beringstr. 4 , 53115 Bonn , Germany . ; Tel: +49 228 73 2351
| |
Collapse
|
45
|
What can we do to refine the redundant data in LC–MS and GC–MS based metabolomics? Bioanalysis 2017; 9:235-238. [DOI: 10.4155/bio-2016-0272] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
|
46
|
Meusel M, Hufsky F, Panter F, Krug D, Müller R, Böcker S. Predicting the Presence of Uncommon Elements in Unknown Biomolecules from Isotope Patterns. Anal Chem 2016; 88:7556-66. [DOI: 10.1021/acs.analchem.6b01015] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Marvin Meusel
- Chair
for Bioinformatics, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Franziska Hufsky
- Chair
for Bioinformatics, Friedrich Schiller University Jena, 07743 Jena, Germany
- RNA
Bioinformatics and High Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Fabian Panter
- Department
of Microbial Natural Products, Helmholtz-Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, 66123 Saarbrücken, Germany
| | - Daniel Krug
- Department
of Microbial Natural Products, Helmholtz-Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, 66123 Saarbrücken, Germany
| | - Rolf Müller
- Department
of Microbial Natural Products, Helmholtz-Institute for Pharmaceutical
Research Saarland, Helmholtz Centre for Infection Research and Pharmaceutical
Biotechnology, Saarland University, 66123 Saarbrücken, Germany
| | - Sebastian Böcker
- Chair
for Bioinformatics, Friedrich Schiller University Jena, 07743 Jena, Germany
| |
Collapse
|
47
|
Bauer CA, Grimme S. How to Compute Electron Ionization Mass Spectra from First Principles. J Phys Chem A 2016; 120:3755-66. [DOI: 10.1021/acs.jpca.6b02907] [Citation(s) in RCA: 72] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Christoph Alexander Bauer
- Mulliken Center for Theoretical
Chemistry, Institut für Physikalische und Theoretische Chemie
der Rheinischen Friedrich-Wilhelms, Universität Bonn, Beringstr. 4, D-53115 Bonn, Germany
| | - Stefan Grimme
- Mulliken Center for Theoretical
Chemistry, Institut für Physikalische und Theoretische Chemie
der Rheinischen Friedrich-Wilhelms, Universität Bonn, Beringstr. 4, D-53115 Bonn, Germany
| |
Collapse
|
48
|
Yi L, Dong N, Yun Y, Deng B, Ren D, Liu S, Liang Y. Chemometric methods in data processing of mass spectrometry-based metabolomics: A review. Anal Chim Acta 2016; 914:17-34. [PMID: 26965324 DOI: 10.1016/j.aca.2016.02.001] [Citation(s) in RCA: 159] [Impact Index Per Article: 19.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 01/28/2016] [Accepted: 02/01/2016] [Indexed: 01/03/2023]
Abstract
This review focuses on recent and potential advances in chemometric methods in relation to data processing in metabolomics, especially for data generated from mass spectrometric techniques. Metabolomics is gradually being regarded a valuable and promising biotechnology rather than an ambitious advancement. Herein, we outline significant developments in metabolomics, especially in the combination with modern chemical analysis techniques, and dedicated statistical, and chemometric data analytical strategies. Advanced skills in the preprocessing of raw data, identification of metabolites, variable selection, and modeling are illustrated. We believe that insights from these developments will help narrow the gap between the original dataset and current biological knowledge. We also discuss the limitations and perspectives of extracting information from high-throughput datasets.
Collapse
Affiliation(s)
- Lunzhao Yi
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming, 650500, China.
| | - Naiping Dong
- Department of Applied Biology and Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, 999077, China
| | - Yonghuan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China
| | - Baichuan Deng
- College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Dabing Ren
- Yunnan Food Safety Research Institute, Kunming University of Science and Technology, Kunming, 650500, China
| | - Shao Liu
- Xiangya Hospital, Central South University, Changsha, 410008, China
| | - Yizeng Liang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China
| |
Collapse
|
49
|
Brack W, Ait-Aissa S, Burgess RM, Busch W, Creusot N, Di Paolo C, Escher BI, Mark Hewitt L, Hilscherova K, Hollender J, Hollert H, Jonker W, Kool J, Lamoree M, Muschket M, Neumann S, Rostkowski P, Ruttkies C, Schollee J, Schymanski EL, Schulze T, Seiler TB, Tindall AJ, De Aragão Umbuzeiro G, Vrana B, Krauss M. Effect-directed analysis supporting monitoring of aquatic environments--An in-depth overview. THE SCIENCE OF THE TOTAL ENVIRONMENT 2016; 544:1073-118. [PMID: 26779957 DOI: 10.1016/j.scitotenv.2015.11.102] [Citation(s) in RCA: 237] [Impact Index Per Article: 29.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Revised: 11/20/2015] [Accepted: 11/20/2015] [Indexed: 05/18/2023]
Abstract
Aquatic environments are often contaminated with complex mixtures of chemicals that may pose a risk to ecosystems and human health. This contamination cannot be addressed with target analysis alone but tools are required to reduce this complexity and identify those chemicals that might cause adverse effects. Effect-directed analysis (EDA) is designed to meet this challenge and faces increasing interest in water and sediment quality monitoring. Thus, the present paper summarizes current experience with the EDA approach and the tools required, and provides practical advice on their application. The paper highlights the need for proper problem formulation and gives general advice for study design. As the EDA approach is directed by toxicity, basic principles for the selection of bioassays are given as well as a comprehensive compilation of appropriate assays, including their strengths and weaknesses. A specific focus is given to strategies for sampling, extraction and bioassay dosing since they strongly impact prioritization of toxicants in EDA. Reduction of sample complexity mainly relies on fractionation procedures, which are discussed in this paper, including quality assurance and quality control. Automated combinations of fractionation, biotesting and chemical analysis using so-called hyphenated tools can enhance the throughput and might reduce the risk of artifacts in laboratory work. The key to determining the chemical structures causing effects is analytical toxicant identification. The latest approaches, tools, software and databases for target-, suspect and non-target screening as well as unknown identification are discussed together with analytical and toxicological confirmation approaches. A better understanding of optimal use and combination of EDA tools will help to design efficient and successful toxicant identification studies in the context of quality monitoring in multiply stressed environments.
Collapse
Affiliation(s)
- Werner Brack
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany; RWTH Aachen University, Worringerweg 1, 52074 Aachen, Germany
| | - Selim Ait-Aissa
- Institut National de l'Environnement Industriel et des Risques INERIS, BP2, 60550 Verneuil-en-Halatte, France
| | - Robert M Burgess
- US Environmental Protection Agency, Office of Research and Development, National Health and Environmental Effects Research Laboratory, Atlantic Ecology Division, Narragansett, RI, USA
| | - Wibke Busch
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| | - Nicolas Creusot
- Institut National de l'Environnement Industriel et des Risques INERIS, BP2, 60550 Verneuil-en-Halatte, France
| | | | - Beate I Escher
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany; Eberhard Karls University Tübingen, 72074 Tübingen, Germany
| | - L Mark Hewitt
- Water Science and Technology Directorate, Environment Canada, 867 Lakeshore Road, Burlington, Ontario L7S 1A1, Canada
| | - Klara Hilscherova
- Masaryk University, Research Centre for Toxic Compounds in the Environment (RECETOX), Kamenice 753/5, 625 00 Brno, Czech Republic
| | - Juliane Hollender
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600 Dübendorf, Switzerland
| | - Henner Hollert
- RWTH Aachen University, Worringerweg 1, 52074 Aachen, Germany
| | - Willem Jonker
- VU University, BioMolecular Analysis Group, Amsterdam, The Netherlands
| | - Jeroen Kool
- VU University, BioMolecular Analysis Group, Amsterdam, The Netherlands
| | - Marja Lamoree
- VU Amsterdam, Institute for Environmental Studies, Amsterdam, The Netherlands
| | - Matthias Muschket
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| | - Steffen Neumann
- Leibniz Institute of Plant Biochemistry, Halle (Saale), Germany
| | - Pawel Rostkowski
- NILU - Norwegian Institute for Air Research, Instituttveien 18, 2007 Kjeller, Norway
| | | | - Jennifer Schollee
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600 Dübendorf, Switzerland
| | - Emma L Schymanski
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600 Dübendorf, Switzerland
| | - Tobias Schulze
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| | | | - Andrew J Tindall
- WatchFrag, Bâtiment Genavenir 3, 1 Rue Pierre Fontaine, 91000 Evry, France
| | | | - Branislav Vrana
- Masaryk University, Research Centre for Toxic Compounds in the Environment (RECETOX), Kamenice 753/5, 625 00 Brno, Czech Republic
| | - Martin Krauss
- UFZ Helmholtz Centre for Environmental Research, Permoserstraße 15, 04318 Leipzig, Germany
| |
Collapse
|
50
|
Abstract
BACKGROUND Untargeted metabolomics commonly uses liquid chromatography mass spectrometry to measure abundances of metabolites; subsequent tandem mass spectrometry is used to derive information about individual compounds. One of the bottlenecks in this experimental setup is the interpretation of fragmentation spectra to accurately and efficiently identify compounds. Fragmentation trees have become a powerful tool for the interpretation of tandem mass spectrometry data of small molecules. These trees are determined from the data using combinatorial optimization, and aim at explaining the experimental data via fragmentation cascades. Fragmentation tree computation does not require spectral or structural databases. To obtain biochemically meaningful trees, one needs an elaborate optimization function (scoring). RESULTS We present a new scoring for computing fragmentation trees, transforming the combinatorial optimization into a Maximum A Posteriori estimator. We demonstrate the superiority of the new scoring for two tasks: both for the de novo identification of molecular formulas of unknown compounds, and for searching a database for structurally similar compounds, our method SIRIUS 3, performs significantly better than the previous version of our method, as well as other methods for this task. CONCLUSION SIRIUS 3 can be a part of an untargeted metabolomics workflow, allowing researchers to investigate unknowns using automated computational methods.Graphical abstractWe present a new scoring for computing fragmentation trees from tandem mass spectrometry data based on Bayesian statistics. The best scoring fragmentation tree most likely explains the molecular formula of the measured parent ion.
Collapse
Affiliation(s)
- Sebastian Böcker
- Friedrich-Schiller-University, Ernst-Abbe-Platz 2, 07743 Jena, Germany
| | - Kai Dührkop
- Friedrich-Schiller-University, Ernst-Abbe-Platz 2, 07743 Jena, Germany
| |
Collapse
|