1
|
Laplaza R, Wodrich MD, Corminboeuf C. Overcoming the Pitfalls of Computing Reaction Selectivity from Ensembles of Transition States. J Phys Chem Lett 2024; 15:7363-7370. [PMID: 38990895 DOI: 10.1021/acs.jpclett.4c01657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
The prediction of reaction selectivity is a challenging task for computational chemistry, not only because many molecules adopt multiple conformations but also due to the exponential relationship between effective activation energies and rate constants. To account for molecular flexibility, an increasing number of methods exist that generate conformational ensembles of transition state (TS) structures. Typically, these TS ensembles are Boltzmann weighted and used to compute selectivity assuming Curtin-Hammett conditions. This strategy, however, can lead to erroneous predictions if the appropriate filtering of the conformer ensembles is not conducted. Here, we demonstrate how any possible selectivity can be obtained by processing the same sets of TS ensembles for a model reaction. To address the burdensome filtering task in a consistent and automated way, we introduce marc, a tool for the modular analysis of representative conformers that aids in avoiding human errors while minimizing the number of reoptimization computations needed to obtain correct reaction selectivity.
Collapse
Affiliation(s)
- Ruben Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Matthew D Wodrich
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland
| |
Collapse
|
2
|
Weymuth T, Unsleber JP, Türtscher PL, Steiner M, Sobez JG, Müller CH, Mörchen M, Klasovita V, Grimmel SA, Eckhoff M, Csizi KS, Bosia F, Bensberg M, Reiher M. SCINE-Software for chemical interaction networks. J Chem Phys 2024; 160:222501. [PMID: 38857173 DOI: 10.1063/5.0206974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 05/09/2024] [Indexed: 06/12/2024] Open
Abstract
The software for chemical interaction networks (SCINE) project aims at pushing the frontier of quantum chemical calculations on molecular structures to a new level. While calculations on individual structures as well as on simple relations between them have become routine in chemistry, new developments have pushed the frontier in the field to high-throughput calculations. Chemical relations may be created by a search for specific molecular properties in a molecular design attempt, or they can be defined by a set of elementary reaction steps that form a chemical reaction network. The software modules of SCINE have been designed to facilitate such studies. The features of the modules are (i) general applicability of the applied methodologies ranging from electronic structure (no restriction to specific elements of the periodic table) to microkinetic modeling (with little restrictions on molecularity), full modularity so that SCINE modules can also be applied as stand-alone programs or be exchanged for external software packages that fulfill a similar purpose (to increase options for computational campaigns and to provide alternatives in case of tasks that are hard or impossible to accomplish with certain programs), (ii) high stability and autonomous operations so that control and steering by an operator are as easy as possible, and (iii) easy embedding into complex heterogeneous environments for molecular structures taken individually or in the context of a reaction network. A graphical user interface unites all modules and ensures interoperability. All components of the software have been made available as open source and free of charge.
Collapse
Affiliation(s)
- Thomas Weymuth
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Jan P Unsleber
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Paul L Türtscher
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Miguel Steiner
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Jan-Grimo Sobez
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Charlotte H Müller
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Maximilian Mörchen
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Veronika Klasovita
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Stephanie A Grimmel
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Marco Eckhoff
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Katja-Sophia Csizi
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Francesco Bosia
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Moritz Bensberg
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Markus Reiher
- ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|
3
|
Sigmund LM, S SS, Albers A, Erdmann P, Paton RS, Greb L. Predicting Lewis Acidity: Machine Learning the Fluoride Ion Affinity of p-Block-Atom-Based Molecules. Angew Chem Int Ed Engl 2024; 63:e202401084. [PMID: 38452299 DOI: 10.1002/anie.202401084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/01/2024] [Accepted: 03/04/2024] [Indexed: 03/09/2024]
Abstract
"How strong is this Lewis acid?" is a question researchers often approach by calculating its fluoride ion affinity (FIA) with quantum chemistry. Here, we present FIA49k, an extensive FIA dataset with 48,986 data points calculated at the RI-DSD-BLYP-D3(BJ)/def2-QZVPP//PBEh-3c level of theory, including 13 different p-block atoms as the fluoride accepting site. The FIA49k dataset was used to train FIA-GNN, two message-passing graph neural networks, which predict gas and solution phase FIA values of molecules excluded from training with a mean absolute error of 14 kJ mol-1 (r2=0.93) from the SMILES string of the Lewis acid as the only input. The level of accuracy is notable, given the wide energetic range of 750 kJ mol-1 spanned by FIA49k. The model's value was demonstrated with four case studies, including predictions for molecules extracted from the Cambridge Structural Database and by reproducing results from catalysis research available in the literature. Weaknesses of the model are evaluated and interpreted chemically. FIA-GNN and the FIA49k dataset can be reached via a free web app (www.grebgroup.de/fia-gnn).
Collapse
Affiliation(s)
- Lukas M Sigmund
- Anorganisch-Chemisches Institut, Ruprecht-Karls-Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
- Department of Chemistry, Colorado State University, 1301 Center Avenue, Fort Collins, CO, 80523, USA
| | - Shree Sowndarya S
- Department of Chemistry, Colorado State University, 1301 Center Avenue, Fort Collins, CO, 80523, USA
| | - Andreas Albers
- Anorganisch-Chemisches Institut, Ruprecht-Karls-Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Philipp Erdmann
- Anorganisch-Chemisches Institut, Ruprecht-Karls-Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| | - Robert S Paton
- Department of Chemistry, Colorado State University, 1301 Center Avenue, Fort Collins, CO, 80523, USA
| | - Lutz Greb
- Anorganisch-Chemisches Institut, Ruprecht-Karls-Universität Heidelberg, Im Neuenheimer Feld 270, 69120, Heidelberg, Germany
| |
Collapse
|
4
|
Seidel T, Permann C, Wieder O, Kohlbacher SM, Langer T. High-Quality Conformer Generation with CONFORGE: Algorithm and Performance Assessment. J Chem Inf Model 2023; 63:5549-5570. [PMID: 37624145 PMCID: PMC10498443 DOI: 10.1021/acs.jcim.3c00563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Indexed: 08/26/2023]
Abstract
Knowledge of the putative bound-state conformation of a molecule is an essential prerequisite for the successful application of many computer-aided drug design methods that aim to assess or predict its capability to bind to a particular target receptor. An established approach to predict bioactive conformers in the absence of receptor structure information is to sample the low-energy conformational space of the investigated molecules and derive representative conformer ensembles that can be expected to comprise members closely resembling possible bound-state ligand conformations. The high relevance of such conformer generation functionality led to the development of a wide panel of dedicated commercial and open-source software tools throughout the last decades. Several published benchmarking studies have shown that open-source tools usually lag behind their commercial competitors in many key aspects. In this work, we introduce the open-source conformer ensemble generator CONFORGE, which aims at delivering state-of-the-art performance for all types of organic molecules in drug-like chemical space. The ability of CONFORGE and several well-known commercial and open-source conformer ensemble generators to reproduce experimental 3D structures as well as their computational efficiency and robustness has been assessed thoroughly for both typical drug-like molecules and macrocyclic structures. For small molecules, CONFORGE clearly outperformed all other tested open-source conformer generators and performed at least equally well as the evaluated commercial generators in terms of both processing speed and accuracy. In the case of macrocyclic structures, CONFORGE achieved the best average accuracy among all benchmarked generators, with RDKit's generator coming close in second place.
Collapse
Affiliation(s)
- Thomas Seidel
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Christian Permann
- NeGeMac
Research Platform, Department of Pharmaceutical Sciences, Division
of Pharmaceutical Chemistry, University
of Vienna, Josef-Holaubek-Platz
2, 1090 Vienna, Austria
| | - Oliver Wieder
- Christian
Doppler Laboratory for Molecular Informatics in the Biosciences, Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Stefan M. Kohlbacher
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Thierry Langer
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
- NeGeMac
Research Platform, Department of Pharmaceutical Sciences, Division
of Pharmaceutical Chemistry, University
of Vienna, Josef-Holaubek-Platz
2, 1090 Vienna, Austria
- Christian
Doppler Laboratory for Molecular Informatics in the Biosciences, Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| |
Collapse
|
5
|
Besel V, Todorović M, Kurtén T, Rinke P, Vehkamäki H. Atomic structures, conformers and thermodynamic properties of 32k atmospheric molecules. Sci Data 2023; 10:450. [PMID: 37438370 DOI: 10.1038/s41597-023-02366-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 07/05/2023] [Indexed: 07/14/2023] Open
Abstract
Low-volatile organic compounds (LVOCs) drive key atmospheric processes, such as new particle formation (NPF) and growth. Machine learning tools can accelerate studies of these phenomena, but extensive and versatile LVOC datasets relevant for the atmospheric research community are lacking. We present the GeckoQ dataset with atomic structures of 31,637 atmospherically relevant molecules resulting from the oxidation of α-pinene, toluene and decane. For each molecule, we performed comprehensive conformer sampling with the COSMOconf program and calculated thermodynamic properties with density functional theory (DFT) using the Conductor-like Screening Model (COSMO). Our dataset contains the geometries of the 7 Mio. conformers we found and their corresponding structural and thermodynamic properties, including saturation vapor pressures (pSat), chemical potentials and free energies. The pSat were compared to values calculated with the group contribution method SIMPOL. To validate the dataset, we explored the relationship between structural and thermodynamic properties, and then demonstrated a first machine-learning application with Gaussian process regression.
Collapse
Affiliation(s)
- Vitus Besel
- University of Helsinki, Institute for Atmospheric and Earth System Research, Helsinki, 00014, Finland.
| | - Milica Todorović
- University of Turku, Dept. Mechanical and Materials Engineering, Turku, FI-20014, Finland
| | - Theo Kurtén
- University of Helsinki, Institute for Atmospheric and Earth System Research, Helsinki, 00014, Finland
| | - Patrick Rinke
- Aalto University, Dept. of Applied Physics, P.O. Box 11100, FI-00076 Aalto, Espoo, Finland
| | - Hanna Vehkamäki
- University of Helsinki, Institute for Atmospheric and Earth System Research, Helsinki, 00014, Finland
| |
Collapse
|
6
|
Alexandrov V, Kirpich A, Kantidze O, Gankin Y. A multi-reference poly-conformational method for in silico design, optimization, and repositioning of pharmaceutical compounds illustrated for selected SARS-CoV-2 ligands. PeerJ 2022; 10:e14252. [PMID: 36447514 PMCID: PMC9701500 DOI: 10.7717/peerj.14252] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 09/26/2022] [Indexed: 11/25/2022] Open
Abstract
Background This work presents a novel computational multi-reference poly-conformational algorithm for design, optimization, and repositioning of pharmaceutical compounds. Methods The algorithm searches for candidates by comparing similarities between conformers of the same compound and identifies target compounds, whose conformers are collectively close to the conformers of each compound in the reference set. Reference compounds may possess highly variable MoAs, which directly, and simultaneously, shape the properties of target candidate compounds. Results The algorithm functionality has been case study validated in silico, by scoring ChEMBL drugs against FDA-approved reference compounds that either have the highest predicted binding affinity to our chosen SARS-CoV-2 targets or are confirmed to be inhibiting such targets in-vivo. All our top scoring ChEMBL compounds also turned out to be either high-affinity ligands to the chosen targets (as confirmed in separate studies) or show significant efficacy, in-vivo, against those selected targets. In addition to method case study validation, in silico search for new compounds within two virtual libraries from the Enamine database is presented. The library's virtual compounds have been compared to the same set of reference drugs that we used for case study validation: Olaparib, Tadalafil, Ergotamine and Remdesivir. The large reference set of four potential SARS-CoV-2 compounds has been selected, since no drug has been identified to be 100% effective against the virus so far, possibly because each candidate drug was targeting only one, particular MoA. The goal here was to introduce a new methodology for identifying potential candidate(s) that cover multiple MoA-s presented within a set of reference compounds.
Collapse
Affiliation(s)
- Vadim Alexandrov
- Liquid Algo LLC, Hopewell Junction, NY, United States of America
| | - Alexander Kirpich
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, United States of America
| | | | - Yuriy Gankin
- Quantori, Cambridge, MA, United States of America
| |
Collapse
|
7
|
Wu J, Li J, Li Y, Ma X, Zhang W, Hao Y, Cai W, Liu Z, Gong M. Steering the Glycerol Electro‐Reforming Selectivity via Cation–Intermediate Interactions. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.202113362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Jianxiang Wu
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Jili Li
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Yefei Li
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
- Key Laboratory of Computational Physical Science Fudan University Shanghai 200438 P. R. China
| | - Xian‐Yin Ma
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Wei‐Yi Zhang
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Yaming Hao
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Wen‐Bin Cai
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| | - Zhi‐Pan Liu
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
- Key Laboratory of Computational Physical Science Fudan University Shanghai 200438 P. R. China
| | - Ming Gong
- Department of Chemistry Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials Fudan University Shanghai 200438 P. R. China
| |
Collapse
|
8
|
Wang S, Krummenacher K, Landrum GA, Sellers BD, Di Lello P, Robinson SJ, Martin B, Holden JK, Tom JYK, Murthy AC, Popovych N, Riniker S. Incorporating NOE-Derived Distances in Conformer Generation of Cyclic Peptides with Distance Geometry. J Chem Inf Model 2022; 62:472-485. [PMID: 35029985 DOI: 10.1021/acs.jcim.1c01165] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Nuclear magnetic resonance (NMR) data from NOESY (nuclear Overhauser enhancement spectroscopy) and ROESY (rotating frame Overhauser enhancement spectroscopy) experiments can easily be combined with distance geometry (DG) based conformer generators by modifying the molecular distance bounds matrix. In this work, we extend the modern DG based conformer generator ETKDG, which has been shown to reproduce experimental crystal structures from small molecules to large macrocycles well, to include NOE-derived interproton distances. In noeETKDG, the experimentally derived interproton distances are incorporated into the distance bounds matrix as loose upper (or lower) bounds to generate large conformer sets. Various subselection techniques can subsequently be applied to yield a conformer bundle that best reproduces the NOE data. The approach is benchmarked using a set of 24 (mostly) cyclic peptides for which NOE-derived distances as well as reference solution structures obtained by other software are available. With respect to other packages currently available, the advantages of noeETKDG are its speed and that no prior force-field parametrization is required, which is especially useful for peptides with unnatural amino acids. The resulting conformer bundles can be further processed with the use of structural refinement techniques to improve the modeling of the intramolecular nonbonded interactions. The noeETKDG code is released as a fully open-source software package available at www.github.com/rinikerlab/customETKDG.
Collapse
Affiliation(s)
- Shuzhe Wang
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Kajo Krummenacher
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Gregory A Landrum
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Benjamin D Sellers
- Department of Discovery Chemistry, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Paola Di Lello
- Department of Structural Biology, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Sarah J Robinson
- Department of Discovery Chemistry, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Bryan Martin
- Department of Structural Biology, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Jeffrey K Holden
- Department of Early Discovery Biochemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Jeffrey Y K Tom
- Department of Early Discovery Biochemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Anastasia C Murthy
- Department of Early Discovery Biochemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Nataliya Popovych
- Department of Early Discovery Biochemistry, Genentech, Inc., South San Francisco, California 94080, United States
| | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
9
|
OUP accepted manuscript. Mutagenesis 2022; 37:191-202. [DOI: 10.1093/mutage/geac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 04/09/2022] [Indexed: 11/14/2022] Open
|
10
|
Wu J, Li J, Li Y, Ma XY, Zhang WY, Hao Y, Cai WB, Liu Z, Gong M. Steering the Glycerol Electro-Reforming Selectivity via Cation-Intermediate Interactions. Angew Chem Int Ed Engl 2021; 61:e202113362. [PMID: 34957665 DOI: 10.1002/anie.202113362] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Indexed: 11/11/2022]
Abstract
Electro-reforming of renewable biomass resources is an alternative technology for sustainable pure H2 production. Herein, we discovered an unconventional cation effect on the concurrent formate and H2 production via glycerol electro-reforming. In stark contrast to the cation effect via forming the double layers in cathodic reactions, the presence of residual cations at the anode were discovered to interact with the glycerol oxidation intermediates to steer its product selectivity. Through a combination of product analysis, transient kinetics, crown ether trapping experiments, in situ IRRAS spectroscopy and DFT calculation, the aldehyde intermediates were discovered to be stabilized by the Li+ cations to favor the non-oxidative C-C cleavage for formate production. The maximal formate efficiency could reach 81.3% under ~ 60 mA/cm2 in LiOH. This work emphasizes the significance of engineering the microenvironment at the electrode-electrolyte interface for efficient electrolytic processes.
Collapse
Affiliation(s)
- Jianxiang Wu
- Fudan University, Department of Chemistry, CHINA
| | - Jili Li
- Fudan University, Department of Chemistry, CHINA
| | - Yefei Li
- Fudan University, Department of Chemistry, CHINA
| | - Xian-Yin Ma
- Fudan University, Department of Chemistry, CHINA
| | - Wei-Yi Zhang
- Fudan University, Department of Chemistry, CHINA
| | - Yaming Hao
- Fudan University, Department of Chemistry, CHINA
| | - Wen-Bin Cai
- Fudan University, Department of Chemistry, CHINA
| | - Zhipan Liu
- Fudan University, Department of Chemistry, CHINA
| | - Ming Gong
- Fudan University, Chemistry, No.2005, Songhu Rd., 200438, Shanghai, CHINA
| |
Collapse
|
11
|
Yoshida Y, Sato H. Distance as coordinate: A distance geometry study on isomerizations of small Lennard-Jones and Au6+ clusters. Chem Phys Lett 2021. [DOI: 10.1016/j.cplett.2021.138942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
12
|
Sobez JG, Reiher M. Molassembler: Molecular Graph Construction, Modification, and Conformer Generation for Inorganic and Organic Molecules. J Chem Inf Model 2020; 60:3884-3900. [DOI: 10.1021/acs.jcim.0c00503] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jan-Grimo Sobez
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Markus Reiher
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|
13
|
Wang S, Witek J, Landrum GA, Riniker S. Improving Conformer Generation for Small Rings and Macrocycles Based on Distance Geometry and Experimental Torsional-Angle Preferences. J Chem Inf Model 2020; 60:2044-2058. [PMID: 32155061 DOI: 10.1021/acs.jcim.0c00025] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
The conformer generator ETKDG is a stochastic search method that utilizes distance geometry together with knowledge derived from experimental crystal structures. It has been shown to generate good conformers for acyclic, flexible molecules. This work builds on ETKDG to improve conformer generation of molecules containing small or large aliphatic (i.e., non-aromatic) rings. For one, we devise additional torsional-angle potentials to describe small aliphatic rings and adapt the previously developed potentials for acyclic bonds to facilitate the sampling of macrocycles. However, due to the larger number of degrees of freedom of macrocycles, the conformational space to sample is much broader than for small molecules, creating a challenge for conformer generators. We therefore introduce different heuristics to restrict the search space of macrocycles and bias the sampling toward more experimentally relevant structures. Specifically, we show the usage of elliptical geometry and customizable Coulombic interactions as heuristics. The performance of the improved ETKDG is demonstrated on test sets of diverse macrocycles and cyclic peptides. The code developed here will be incorporated into the 2020.03 release of the open-source cheminformatics library RDKit.
Collapse
Affiliation(s)
- Shuzhe Wang
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Jagna Witek
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | | | - Sereina Riniker
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|
14
|
Bonanno E, Ebejer JP. Applying Machine Learning to Ultrafast Shape Recognition in Ligand-Based Virtual Screening. Front Pharmacol 2020; 10:1675. [PMID: 32140104 PMCID: PMC7042174 DOI: 10.3389/fphar.2019.01675] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 12/23/2019] [Indexed: 11/13/2022] Open
Abstract
Ultrafast Shape Recognition (USR), along with its derivatives, are Ligand-Based Virtual Screening (LBVS) methods that condense 3-dimensional information about molecular shape, as well as other properties, into a small set of numeric descriptors. These can be used to efficiently compute a measure of similarity between pairs of molecules using a simple inverse Manhattan Distance metric. In this study we explore the use of suitable Machine Learning techniques that can be trained using USR descriptors, so as to improve the similarity detection of potential new leads. We use molecules from the Directory for Useful Decoys-Enhanced to construct machine learning models based on three different algorithms: Gaussian Mixture Models (GMMs), Isolation Forests and Artificial Neural Networks (ANNs). We train models based on full molecule conformer models, as well as the Lowest Energy Conformations (LECs) only. We also investigate the performance of our models when trained on smaller datasets so as to model virtual screening scenarios when only a small number of actives are known a priori. Our results indicate significant performance gains over a state of the art USR-derived method, ElectroShape 5D, with GMMs obtaining a mean performance up to 430% better than that of ElectroShape 5D in terms of Enrichment Factor with a maximum improvement of up to 940%. Additionally, we demonstrate that our models are capable of maintaining their performance, in terms of enrichment factor, within 10% of the mean as the size of the training dataset is successively reduced. Furthermore, we also demonstrate that running times for retrospective screening using the machine learning models we selected are faster than standard USR, on average by a factor of 10, including the time required for training. Our results show that machine learning techniques can significantly improve the virtual screening performance and efficiency of the USR family of methods.
Collapse
Affiliation(s)
- Etienne Bonanno
- Department of Artificial Intelligence, University of Malta, Msida, Malta
| | - Jean-Paul Ebejer
- Centre for Molecular Medicine and Biobanking, University of Malta, Msida, Malta
| |
Collapse
|
15
|
Mansimov E, Mahmood O, Kang S, Cho K. Molecular Geometry Prediction using a Deep Generative Graph Neural Network. Sci Rep 2019; 9:20381. [PMID: 31892716 PMCID: PMC6938476 DOI: 10.1038/s41598-019-56773-5] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Accepted: 12/16/2019] [Indexed: 11/25/2022] Open
Abstract
A molecule's geometry, also known as conformation, is one of a molecule's most important properties, determining the reactions it participates in, the bonds it forms, and the interactions it has with other molecules. Conventional conformation generation methods minimize hand-designed molecular force field energy functions that are often not well correlated with the true energy function of a molecule observed in nature. They generate geometrically diverse sets of conformations, some of which are very similar to the lowest-energy conformations and others of which are very different. In this paper, we propose a conditional deep generative graph neural network that learns an energy function by directly learning to generate molecular conformations that are energetically favorable and more likely to be observed experimentally in data-driven manner. On three large-scale datasets containing small molecules, we show that our method generates a set of conformations that on average is far more likely to be close to the corresponding reference conformations than are those obtained from conventional force field methods. Our method maintains geometrical diversity by generating conformations that are not too similar to each other, and is also computationally faster. We also show that our method can be used to provide initial coordinates for conventional force field methods. On one of the evaluated datasets we show that this combination allows us to combine the best of both methods, yielding generated conformations that are on average close to reference conformations with some very similar to reference conformations.
Collapse
Affiliation(s)
- Elman Mansimov
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, 60 5th Avenue, New York, New York, 10011, United States
| | - Omar Mahmood
- Center for Data Science, New York University, 60 5th Avenue, New York, New York, 10011, United States
| | - Seokho Kang
- Department of Systems Management Engineering, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon, 16419, Republic of Korea
| | - Kyunghyun Cho
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, 60 5th Avenue, New York, New York, 10011, United States.
- Center for Data Science, New York University, 60 5th Avenue, New York, New York, 10011, United States.
- Facebook AI Research, 770 Broadway, New York, New York, 10003, United States.
- CIFAR Azrieli Global Scholar, Canadian Institute for Advanced Research, 661 University Avenue, Toronto, ON, M5G 1M1, Canada.
| |
Collapse
|
16
|
Yoshikawa N, Hutchison GR. Fast, efficient fragment-based coordinate generation for Open Babel. J Cheminform 2019; 11:49. [PMID: 31372768 PMCID: PMC6676618 DOI: 10.1186/s13321-019-0372-5] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Accepted: 07/23/2019] [Indexed: 12/19/2022] Open
Abstract
Rapidly predicting an accurate three dimensional geometry of a molecule is a crucial task for cheminformatics and across a wide range of molecular modeling. Consequently, developing a fast, accurate, and open implementation of structure prediction is necessary for reproducible cheminformatics research. We introduce a fragment-based coordinate generation implementation for Open Babel, a widely-used open source toolkit for cheminformatics. The new implementation improves speed and stereochemical accuracy, while retaining or improving accuracy of bond lengths, bond angles, and dihedral torsions. Input molecules are broken into fragments by cutting at rotatable bonds. The coordinates of fragments are set according to a fragment library, prepared from open crystallographic databases. Since the coordinates of multiple atoms are decided at once, coordinate prediction is accelerated over the previous rules-based implementation in Open Babel, as well as the widely-used distance geometry methods in RDKit. This new implementation will be beneficial for a wide range of applications, including computational property prediction in polymers, molecular materials and drug design.
Collapse
Affiliation(s)
- Naruki Yoshikawa
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
| | - Geoffrey R Hutchison
- Department of Chemistry and Chemical Engineering, University of Pittsburgh, 219 Parkman Avenue, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
17
|
Integrated Chemoinformatics Approaches Toward Epigenetic Drug Discovery. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2019. [DOI: 10.1007/978-3-030-05282-9_8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
18
|
Friedrich NO, Meyder A, de Bruyn Kops C, Sommer K, Flachsenberg F, Rarey M, Kirchmair J. High-Quality Dataset of Protein-Bound Ligand Conformations and Its Application to Benchmarking Conformer Ensemble Generators. J Chem Inf Model 2017; 57:529-539. [PMID: 28206754 DOI: 10.1021/acs.jcim.6b00613] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
We developed a cheminformatics pipeline for the fully automated selection and extraction of high-quality protein-bound ligand conformations from X-ray structural data. The pipeline evaluates the validity and accuracy of the 3D structures of small molecules according to multiple criteria, including their fit to the electron density and their physicochemical and structural properties. Using this approach, we compiled two high-quality datasets from the Protein Data Bank (PDB): a comprehensive dataset and a diversified subset of 4626 and 2912 structures, respectively. The datasets were applied to benchmarking seven freely available conformer ensemble generators: Balloon (two different algorithms), the RDKit standard conformer ensemble generator, the Experimental-Torsion basic Knowledge Distance Geometry (ETKDG) algorithm, Confab, Frog2 and Multiconf-DOCK. Substantial differences in the performance of the individual algorithms were observed, with RDKit and ETKDG generally achieving a favorable balance of accuracy, ensemble size and runtime. The Platinum datasets are available for download from http://www.zbh.uni-hamburg.de/platinum_dataset .
Collapse
Affiliation(s)
- Nils-Ole Friedrich
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Agnes Meyder
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Christina de Bruyn Kops
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Kai Sommer
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Florian Flachsenberg
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Matthias Rarey
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| | - Johannes Kirchmair
- University of Hamburg , ZBH - Center for Bioinformatics, Bundesstraße 43, Hamburg 20146, Germany
| |
Collapse
|
19
|
Wang T, Wu MB, Lin JP, Yang LR. Quantitative structure–activity relationship: promising advances in drug discovery platforms. Expert Opin Drug Discov 2015; 10:1283-300. [DOI: 10.1517/17460441.2015.1083006] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
20
|
Loharch S, Bhutani I, Jain K, Gupta P, Sahoo DK, Parkesh R. EpiDBase: a manually curated database for small molecule modulators of epigenetic landscape. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav013. [PMID: 25776023 PMCID: PMC4360624 DOI: 10.1093/database/bav013] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
We have developed EpiDBase (www.epidbase.org), an interactive database of small molecule ligands of epigenetic protein families by bringing together experimental, structural and chemoinformatic data in one place. Currently, EpiDBase encompasses 5784 unique ligands (11 422 entries) of various epigenetic markers such as writers, erasers and readers. The EpiDBase includes experimental IC50 values, ligand molecular weight, hydrogen bond donor and acceptor count, XlogP, number of rotatable bonds, number of aromatic rings, InChIKey, two-dimensional and three-dimensional (3D) chemical structures. A catalog of all epidbase ligands based on the molecular weight is also provided. A structure editor is provided for 3D visualization of ligands. EpiDBase is integrated with tools like text search, disease-specific search, advanced search, substructure, and similarity analysis. Advanced analysis can be performed using substructure and OpenBabel-based chemical similarity fingerprints. The EpiDBase is curated to identify unique molecular scaffolds. Initially, molecules were selected by removing peptides, macrocycles and other complex structures and then processed for conformational sampling by generating 3D conformers. Subsequent filtering through Zinc Is Not Commercial (ZINC: a free database of commercially available compounds for virtual screening) and Lilly MedChem regular rules retained many distinctive drug-like molecules. These molecules were then analyzed for physicochemical properties using OpenBabel descriptors and clustered using various methods such as hierarchical clustering, binning partition and multidimensional scaling. EpiDBase provides comprehensive resources for further design, development and refinement of small molecule modulators of epigenetic markers. Database URL:www.epidbase.org
Collapse
Affiliation(s)
- Saurabh Loharch
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| | - Isha Bhutani
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| | - Kamal Jain
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| | - Pawan Gupta
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| | - Debendra K Sahoo
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| | - Raman Parkesh
- Department of Advanced Protein Science, Institute of Microbial Technology, Chandigarh 160036, India
| |
Collapse
|
21
|
Ebejer JP, Morris GM, Deane CM. Freely Available Conformer Generation Methods: How Good Are They? J Chem Inf Model 2012; 52:1146-58. [DOI: 10.1021/ci2004658] [Citation(s) in RCA: 139] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jean-Paul Ebejer
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, U.K.,
and
- InhibOx Limited, Oxford Centre for Innovation, New Road, Oxford, OX1
1BY, U.K
| | - Garrett M. Morris
- InhibOx Limited, Oxford Centre for Innovation, New Road, Oxford, OX1
1BY, U.K
| | - Charlotte M. Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, U.K.,
and
| |
Collapse
|
22
|
|
23
|
Reichling S, Huttner G. How to Find Correlations Between Molecular Shape and Packing in a Molecular Crystal: Application of a Novel Strategy to Recognizen-Point Polyhedra in Three-Dimensional Space. Eur J Inorg Chem 2000. [DOI: 10.1002/(sici)1099-0682(200005)2000:5%3c857::aid-ejic857%3e3.0.co;2-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
24
|
Reichling S, Huttner G. How to Find Correlations Between Molecular Shape and Packing in a Molecular Crystal: Application of a Novel Strategy to Recognizen-Point Polyhedra in Three-Dimensional Space. Eur J Inorg Chem 2000. [DOI: 10.1002/(sici)1099-0682(200005)2000:5<857::aid-ejic857>3.0.co;2-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
25
|
Beyreuther S, Hunger J, Cunskis S, Diercks T, Frick A, Planker E, Huttner G. How to Predict Conformations Accessible to a Molecule in Solution: Validation of a Force Field-Based Prediction of NOE Distances by Comparison with the Experimental Data for the Series of Compounds CH3C[CH2P(Bzl)R]3Mo(CO)3 (R = Ph,m-Xyl). Eur J Inorg Chem 1998. [DOI: 10.1002/(sici)1099-0682(199811)1998:11<1641::aid-ejic1641>3.0.co;2-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
26
|
Eyermann CJ, Jadhav P, Nicholas Hodge C, Chang CH, Rodgers JD, Y.S.L P. The role of computer-aided and structure-based design techniques in the discovery and optimization of cyclic urea inhibitors of hiv protease. ADVANCES IN AMINO ACID MIMETICS AND PEPTIDOMIMETICS 1997. [DOI: 10.1016/s1874-5113(97)80003-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
|
27
|
Lau WF, Tabernero L, Sack JS, Iwanowicz EJ. Molecular modeling studies of novel retro-binding tripeptide active-site inhibitors of thrombin. Bioorg Med Chem 1995; 3:1039-48. [PMID: 7582978 DOI: 10.1016/0968-0896(95)00100-u] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
A novel series of retro-binding tripeptide thrombin active-site inhibitors was recently developed (Iwanowicz, E. I. et al. J. Med. Chem. 1994, 37, 2111(1)). It was hypothesized that the binding mode for these inhibitors is similar to that of the first three N-terminal residues of hirudin. This binding hypothesis was subsequently verified when the crystal structure of a member of this series, BMS-183,507 (N-[N-[N-[4-(Aminoiminomethyl)amino[-1-oxobutyl]-L- phenylalanyl]-L-allo-threonyl]-L-phenylalanine, methyl ester), was determined (Taberno, L.J. Mol. Biol. 1995, 246, 14). The methodology for developing the binding models of these inhibitors, the structure-activity relationships (SAR) and modeling studies that led to the elucidation of the proposed binding mode is described. The crystal structure of BMS-183,507/human alpha-thrombin is compared with the crystal structure of hirudin/human alpha-thrombin (Rydel, T.J. et al. Science 1990, 249,227; Rydel, T.J. et al. J. Mol Biol. 1991, 221, 583; Grutter, M.G. et al. EMBO J. 1990, 9, 2361) and with the computational binding model of BMS-183,507.
Collapse
Affiliation(s)
- W F Lau
- Bristol-Myers Squibb Pharmaceutical Research Institute, Princeton, NJ 08543-4000, USA
| | | | | | | |
Collapse
|