1
|
Borges R, Colby SM, Das S, Edison AS, Fiehn O, Kind T, Lee J, Merrill AT, Merz KM, Metz TO, Nunez JR, Tantillo DJ, Wang LP, Wang S, Renslow RS. Quantum Chemistry Calculations for Metabolomics. Chem Rev 2021; 121:5633-5670. [PMID: 33979149 PMCID: PMC8161423 DOI: 10.1021/acs.chemrev.0c00901] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Indexed: 02/07/2023]
Abstract
A primary goal of metabolomics studies is to fully characterize the small-molecule composition of complex biological and environmental samples. However, despite advances in analytical technologies over the past two decades, the majority of small molecules in complex samples are not readily identifiable due to the immense structural and chemical diversity present within the metabolome. Current gold-standard identification methods rely on reference libraries built using authentic chemical materials ("standards"), which are not available for most molecules. Computational quantum chemistry methods, which can be used to calculate chemical properties that are then measured by analytical platforms, offer an alternative route for building reference libraries, i.e., in silico libraries for "standards-free" identification. In this review, we cover the major roadblocks currently facing metabolomics and discuss applications where quantum chemistry calculations offer a solution. Several successful examples for nuclear magnetic resonance spectroscopy, ion mobility spectrometry, infrared spectroscopy, and mass spectrometry methods are reviewed. Finally, we consider current best practices, sources of error, and provide an outlook for quantum chemistry calculations in metabolomics studies. We expect this review will inspire researchers in the field of small-molecule identification to accelerate adoption of in silico methods for generation of reference libraries and to add quantum chemistry calculations as another tool at their disposal to characterize complex samples.
Collapse
Affiliation(s)
- Ricardo
M. Borges
- Walter
Mors Institute of Research on Natural Products, Federal University of Rio de Janeiro, Rio de Janeiro 21941-901, Brazil
| | - Sean M. Colby
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Susanta Das
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
| | - Arthur S. Edison
- Departments
of Genetics and Biochemistry and Molecular Biology, Complex Carbohydrate
Research Center and Institute of Bioinformatics, University of Georgia, Athens, Georgia 30602, United States
| | - Oliver Fiehn
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
| | - Tobias Kind
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
| | - Jesi Lee
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Amy T. Merrill
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Kenneth M. Merz
- Department
of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
| | - Thomas O. Metz
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Jamie R. Nunez
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| | - Dean J. Tantillo
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Lee-Ping Wang
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Shunyang Wang
- West
Coast Metabolomics Center for Compound Identification, UC Davis Genome
Center, University of California, Davis, California 95616, United States
- Department
of Chemistry, University of California, Davis, California 95616, United States
| | - Ryan S. Renslow
- Biological
Science Division, Pacific Northwest National
Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
2
|
Abstract
A major challenge for metabolomic analysis is to obtain an unambiguous identification of the metabolites detected in a sample. Among metabolomics techniques, NMR spectroscopy is a sophisticated, powerful, and generally applicable spectroscopic tool that can be used to ascertain the correct structure of newly isolated biogenic molecules. However, accurate structure prediction using computational NMR techniques depends on how much of the relevant conformational space of a particular compound is considered. It is intrinsically challenging to calculate NMR chemical shifts using high-level DFT when the conformational space of a metabolite is extensive. In this work, we developed NMR chemical shift calculation protocols using a machine learning model in conjunction with standard DFT methods. The pipeline encompasses the following steps: (1) conformation generation using a force field (FF)-based method, (2) filtering the FF generated conformations using the ASE-ANI machine learning model, (3) clustering of the optimized conformations based on structural similarity to identify chemically unique conformations, (4) DFT structural optimization of the unique conformations, and (5) DFT NMR chemical shift calculation. This protocol can calculate the NMR chemical shifts of a set of molecules using any available combination of DFT theory, solvent model, and NMR-active nuclei, using both user-selected reference compounds and/or linear regression methods. Our protocol reduces the overall computational time by 2 orders of magnitude over methods that optimize the conformations using fully ab initio methods, while still producing good agreement with experimental observations. The complete protocol is designed in such a manner that makes the computation of chemical shifts tractable for a large number of conformationally flexible metabolites.
Collapse
Affiliation(s)
- Susanta Das
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, USA
| | - Arthur S. Edison
- Departments of Genetics and Biochemistry, Institute of Bioinformatics and Complex Carbohydrate Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, USA
| | - Kenneth M. Merz
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, USA
| |
Collapse
|
3
|
Socha O, Dračínský M. Dimerization of Acetic Acid in the Gas Phase-NMR Experiments and Quantum-Chemical Calculations. Molecules 2020; 25:molecules25092150. [PMID: 32375390 PMCID: PMC7248931 DOI: 10.3390/molecules25092150] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 04/28/2020] [Accepted: 04/30/2020] [Indexed: 11/24/2022] Open
Abstract
Due to the nature of the carboxylic group, acetic acid can serve as both a donor and acceptor of a hydrogen bond. Gaseous acetic acid is known to form cyclic dimers with two strong hydrogen bonds. However, trimeric and various oligomeric structures have also been hypothesized to exist in both the gas and liquid phases of acetic acid. In this work, a combination of gas-phase NMR experiments and advanced computational approaches were employed in order to validate the basic dimerization model of gaseous acetic acid. The gas-phase experiments performed in a glass tube revealed interactions of acetic acid with the glass surface. On the other hand, variable-temperature and variable-pressure NMR parameters obtained for acetic acid in a polymer insert provided thermodynamic parameters that were in excellent agreement with the MP2 (the second order Møller–Plesset perturbation theory) and CCSD(T) (coupled cluster with single, double and perturbative triple excitation) calculations based on the basic dimerization model. A slight disparity between the theoretical dimerization model and the experimental data was revealed only at low temperatures. This observation might indicate the presence of other, entropically disfavored, supramolecular structures at low temperatures.
Collapse
Affiliation(s)
- Ondřej Socha
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Flemingovo nám. 2, 166 10 Prague, Czech Republic;
- Faculty of Mathematics and Physics, Charles University, Ke Karlovu 3, 166 10 Prague, Czech Republic
| | - Martin Dračínský
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Flemingovo nám. 2, 166 10 Prague, Czech Republic;
- Correspondence: ; Tel./Fax: +42-02-2018-3139
| |
Collapse
|
4
|
Tantillo DJ. Questions in natural products synthesis research that can (and cannot) be answered using computational chemistry. Chem Soc Rev 2018; 47:7845-7850. [PMID: 29900461 PMCID: PMC6205925 DOI: 10.1039/c8cs00298c] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Questions of relevance to those working in the field of natural products synthesis that can be answered, at least in part, using computational chemistry approaches are described. Illustrative examples are provided, as are descriptions of limitations.
Collapse
Affiliation(s)
- Dean J Tantillo
- Department of Chemistry, University of California - Davis, Davis, CA 95616, USA.
| |
Collapse
|
5
|
Pereira F, Aires-de-Sousa J. Computational Methodologies in the Exploration of Marine Natural Product Leads. Mar Drugs 2018; 16:md16070236. [PMID: 30011882 PMCID: PMC6070892 DOI: 10.3390/md16070236] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Revised: 07/02/2018] [Accepted: 07/06/2018] [Indexed: 12/18/2022] Open
Abstract
Computational methodologies are assisting the exploration of marine natural products (MNPs) to make the discovery of new leads more efficient, to repurpose known MNPs, to target new metabolites on the basis of genome analysis, to reveal mechanisms of action, and to optimize leads. In silico efforts in drug discovery of NPs have mainly focused on two tasks: dereplication and prediction of bioactivities. The exploration of new chemical spaces and the application of predicted spectral data must be included in new approaches to select species, extracts, and growth conditions with maximum probabilities of medicinal chemistry novelty. In this review, the most relevant current computational dereplication methodologies are highlighted. Structure-based (SB) and ligand-based (LB) chemoinformatics approaches have become essential tools for the virtual screening of NPs either in small datasets of isolated compounds or in large-scale databases. The most common LB techniques include Quantitative Structure–Activity Relationships (QSAR), estimation of drug likeness, prediction of adsorption, distribution, metabolism, excretion, and toxicity (ADMET) properties, similarity searching, and pharmacophore identification. Analogously, molecular dynamics, docking and binding cavity analysis have been used in SB approaches. Their significance and achievements are the main focus of this review.
Collapse
Affiliation(s)
- Florbela Pereira
- LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2829-516 Caparica, Portugal.
| | - Joao Aires-de-Sousa
- LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, 2829-516 Caparica, Portugal.
| |
Collapse
|
6
|
Seitz T, Millán RE, Lentz D, Jiménez C, Rodríguez J, Christmann M. Synthesis of Thelepamide via Catalyst-Controlled 1,4-Addition of Cysteine Derivatives and Structure Revision of Thelepamide. Org Lett 2018; 20:594-597. [DOI: 10.1021/acs.orglett.7b03706] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Tobias Seitz
- Freie Universität Berlin, Institute of Chemistry
and Biochemistry, Takustraße
3, 14195 Berlin, Germany
| | - Ramón E. Millán
- Centro
de Investigaciones Cientı́ficas Avanzadas (CICA), Departamento
de Quı́mica, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
| | - Dieter Lentz
- Freie Universität Berlin, Institute of Chemistry
and Biochemistry, Takustraße
3, 14195 Berlin, Germany
| | - Carlos Jiménez
- Centro
de Investigaciones Cientı́ficas Avanzadas (CICA), Departamento
de Quı́mica, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
| | - Jaime Rodríguez
- Centro
de Investigaciones Cientı́ficas Avanzadas (CICA), Departamento
de Quı́mica, Facultade de Ciencias, Universidade da Coruña, 15071 A Coruña, Spain
| | - Mathias Christmann
- Freie Universität Berlin, Institute of Chemistry
and Biochemistry, Takustraße
3, 14195 Berlin, Germany
| |
Collapse
|