1
|
Kuznetsov M, Ryabov F, Schutski R, Shayakhmetov R, Lin YC, Aliper A, Polykovskiy D. COSMIC: Molecular Conformation Space Modeling in Internal Coordinates with an Adversarial Framework. J Chem Inf Model 2024; 64:3610-3620. [PMID: 38668753 PMCID: PMC11094738 DOI: 10.1021/acs.jcim.3c00989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 03/29/2024] [Accepted: 04/02/2024] [Indexed: 05/14/2024]
Abstract
The fast and accurate conformation space modeling is an essential part of computational approaches for solving ligand and structure-based drug discovery problems. Recent state-of-the-art diffusion models for molecular conformation generation show promising distribution coverage and physical plausibility metrics but suffer from a slow sampling procedure. We propose a novel adversarial generative framework, COSMIC, that shows comparable generative performance but provides a time-efficient sampling and training procedure. Given a molecular graph and random noise, the generator produces a conformation in two stages. First, it constructs a conformation in a rotation and translation invariant representation─internal coordinates. In the second step, the model predicts the distances between neighboring atoms and performs a few fast optimization steps to refine the initial conformation. The proposed model considers conformation energy, achieving comparable space coverage, and diversity metrics results.
Collapse
Affiliation(s)
- Maksim Kuznetsov
- Insilico
Medicine Canada Inc., 1250 René-Lévesque Ouest, Suite 3710, Montréal, Québec H3B 4W8, Canada
| | - Fedor Ryabov
- Insilico
Medicine Hong Kong Ltd., Unit 310, 3/F, Building 8W, Phase 2, Hong Kong Science Park, Pak
Shek Kok, New Territories, Hong Kong 999077, China
| | - Roman Schutski
- Insilico
Medicine Hong Kong Ltd., Unit 310, 3/F, Building 8W, Phase 2, Hong Kong Science Park, Pak
Shek Kok, New Territories, Hong Kong 999077, China
| | - Rim Shayakhmetov
- Insilico
Medicine Canada Inc., 1250 René-Lévesque Ouest, Suite 3710, Montréal, Québec H3B 4W8, Canada
| | - Yen-Chu Lin
- Insilico
Medicine Taiwan Ltd., Taipei City 110208, Taiwan
| | - Alex Aliper
- Insilico
Medicine Hong Kong Ltd., Unit 310, 3/F, Building 8W, Phase 2, Hong Kong Science Park, Pak
Shek Kok, New Territories, Hong Kong 999077, China
| | - Daniil Polykovskiy
- Insilico
Medicine Canada Inc., 1250 René-Lévesque Ouest, Suite 3710, Montréal, Québec H3B 4W8, Canada
| |
Collapse
|
2
|
Chandrasekhar V, Sharma N, Schaub J, Steinbeck C, Rajan K. Cheminformatics Microservice: unifying access to open cheminformatics toolkits. J Cheminform 2023; 15:98. [PMID: 37845745 PMCID: PMC10577930 DOI: 10.1186/s13321-023-00762-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 09/19/2023] [Indexed: 10/18/2023] Open
Abstract
In recent years, cheminformatics has experienced significant advancements through the development of new open-source software tools based on various cheminformatics programming toolkits. However, adopting these toolkits presents challenges, including proper installation, setup, deployment, and compatibility management. In this work, we present the Cheminformatics Microservice. This open-source solution provides a unified interface for accessing commonly used functionalities of multiple cheminformatics toolkits, namely RDKit, Chemistry Development Kit (CDK), and Open Babel. In addition, more advanced functionalities like structure generation and Optical Chemical Structure Recognition (OCSR) are made available through the Cheminformatics Microservice based on pre-existing tools. The software service also enables developers to extend the functionalities easily and to seamlessly integrate them with existing workflows and applications. It is built on FastAPI and containerized using Docker, making it highly scalable. An instance of the microservice is publicly available at https://api.naturalproducts.net . The source code is publicly accessible on GitHub, accompanied by comprehensive documentation, version control, and continuous integration and deployment workflows. All resources can be found at the following link: https://github.com/Steinbeck-Lab/cheminformatics-microservice .
Collapse
Affiliation(s)
- Venkata Chandrasekhar
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany
| | - Nisha Sharma
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany
| | - Jonas Schaub
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany
| | - Christoph Steinbeck
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany
| | - Kohulan Rajan
- Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany.
| |
Collapse
|
3
|
Peng Y, Yu W, Feng X, Xu T, Früchtl H, van Mourik T, Kirk SR, Jenkins S. The Cis-Effect Explained Using Next-Generation QTAIM. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27186099. [PMID: 36144830 PMCID: PMC9506152 DOI: 10.3390/molecules27186099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/05/2022] [Accepted: 09/14/2022] [Indexed: 11/22/2022]
Abstract
We used next-generation QTAIM (NG-QTAIM) to explain the cis-effect for two families of molecules: C2X2 (X = H, F, Cl) and N2X2 (X = H, F, Cl). We explained why the cis-effect is the exception rather than the rule. This was undertaken by tracking the motion of the bond critical point (BCP) of the stress tensor trajectories Tσ(s) used to sample the Uσ-space cis- and trans-characteristics. The Tσ(s) were constructed by subjecting the C1-C2 BCP and N1-N2 BCP to torsions ± θ and summing all possible Tσ(s) from the bonding environment. During this process, care was taken to fully account for multi-reference effects. We associated bond-bending and bond-twisting components of the Tσ(s) with cis- and trans-characteristics, respectively, based on the relative ease of motion of the electronic charge density ρ(rb). Qualitative agreement is found with existing experimental data and predictions are made where experimental data is not available.
Collapse
Affiliation(s)
- Yuting Peng
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
| | - Wenjing Yu
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
| | - Xinxin Feng
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
| | - Tianlv Xu
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
| | - Herbert Früchtl
- EaStCHEM School of Chemistry, University of St Andrews, North Haugh, St Andrews, Fife KY16 9ST, Scotland, UK
| | - Tanja van Mourik
- EaStCHEM School of Chemistry, University of St Andrews, North Haugh, St Andrews, Fife KY16 9ST, Scotland, UK
| | - Steven R. Kirk
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
- Correspondence: (S.R.K.); (S.J.)
| | - Samantha Jenkins
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China
- Correspondence: (S.R.K.); (S.J.)
| |
Collapse
|
4
|
Mixed chiral and achiral character in substituted ethane: A next generation QTAIM perspective. Chem Phys Lett 2022. [DOI: 10.1016/j.cplett.2022.139762] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
5
|
Xu T, Nie X, Li S, Yang Y, Früchtl H, van Mourik T, Kirk SR, Paterson MJ, Shigeta Y, Jenkins S. Chirality without Stereoisomers: Insight from the Helical Response of Bond Electrons. Chemphyschem 2021; 22:1989-1995. [PMID: 34269504 DOI: 10.1002/cphc.202100397] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 07/14/2021] [Indexed: 11/05/2022]
Abstract
The association between molecular chirality and helical characteristics known as the chirality-helicity equivalence is determined for the first time by quantifying a chirality-helicity measure consistent with photoexcitation circular dichroism experiments. This is demonstrated using a formally achiral SN 2 reaction and a chiral SN 2 reaction. Both the achiral and chiral SN 2 reactions possess significant values of the chirality-helicity measure and display a Walden inversion, i. e. an inversion of the chirality between the reactant and product. We also track the chirality-helicity measure along the reaction path and discover the presence of chirality at the transition state and two intermediate structures for both reactions. We demonstrate the need for the chirality-helicity measure to differentiate between steric effects due to eclipsed conformations and chiral behaviors in formally achiral species. We explain the significance of this work for asymmetric synthetic reactions including the intermediate structures where the Cahn-Ingold-Prelog (CIP) rules cannot be used.
Collapse
Affiliation(s)
- Tianlv Xu
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Xing Nie
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Shuman Li
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Yong Yang
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Herbert Früchtl
- EaStCHEM School of Chemistry, University of Saint Andrews, North Haugh, St Andrews, Fife, KY16 9ST, Scotland, United Kingdom
| | - Tanja van Mourik
- EaStCHEM School of Chemistry, University of Saint Andrews, North Haugh, St Andrews, Fife, KY16 9ST, Scotland, United Kingdom
| | - Steven R Kirk
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| | - Martin J Paterson
- Institute of Chemical Sciences, School of Engineering and Physical Sciences, Heriot-Watt University, Edinburgh, EH14 4AS, UK
| | - Yasuteru Shigeta
- Center for Computational Sciences, University of Tsukuba, Tsukuba, 305-8577, Japan
| | - Samantha Jenkins
- Key Laboratory of Chemical Biology and Traditional Chinese Medicine Research and Key Laboratory of Resource, National and Local Joint Engineering Laboratory for New Petro-chemical Materials and Fine Utilization of Resources, College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha, Hunan, 410081, China
| |
Collapse
|
6
|
Goodman JM, Pletnev I, Thiessen P, Bolton E, Heller SR. InChI version 1.06: now more than 99.99% reliable. J Cheminform 2021; 13:40. [PMID: 34030732 PMCID: PMC8147039 DOI: 10.1186/s13321-021-00517-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 05/04/2021] [Indexed: 12/19/2022] Open
Abstract
The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integration of large chemical databases. InChI version 1.05 was released in January 2017 and version 1.06 in December 2020. In this paper, we report on the current state of the InChI Software, the details of the improvements in the v1.06 release, and the results of a test of the InChI run on PubChem, a database of more than a hundred million molecules. The upgrade introduces significant new features, including support for pseudo-element atoms and an improved description of polymers. We expect that few, if any, applications using the standard InChI will need to change as a result of the changes in version 1.06. Numerical instability was discovered for 0.002% of this database, and a small number of other molecules were discovered for which the algorithm did not run smoothly. On the basis of PubChem data, we can demonstrate that InChI version 1.05 was 99.996% accurate, and InChI version 1.06 represents a step closer to perfection. Finally, we look forward to future releases and extensions for the InChI Chemical identifier.
Collapse
Affiliation(s)
- Jonathan M Goodman
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Igor Pletnev
- InChI Trust, Cambridge, UK.,Department of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia
| | - Paul Thiessen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Evan Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Stephen R Heller
- InChI Trust, Cambridge, UK. .,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
7
|
Yahiaoui O, Patel HD, Chinner KS, Pašteka LF, Fallon T. Stereomutation of Substituted Bullvalenes. Org Lett 2021; 23:1157-1162. [PMID: 33146538 DOI: 10.1021/acs.orglett.0c03470] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The stereomutation of substituted bullvalenes is an inevitable consequence of the valence isomerism that automerizes this unique fluxional hydrocarbon. The introduction of external stereogenicity in the substituents expands the reaction graphs and leads to a wealth of complex diastereochemical relationships. In this communication, we explore these possibilities and prepare a range of stereochemically rich substituted bullvalenes. This includes a series of disubstituted bullvalenes with two external stereocenters as a platform for fluxional, shape-diverse compound libraries. We also prepare a tethered bisbullvalene with central stereogenicity in the tether as an ensemble of 900 unique isomers that are completely stereomutable.
Collapse
Affiliation(s)
- Oussama Yahiaoui
- Department of Chemistry, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Harshal D Patel
- Department of Chemistry, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Kylie S Chinner
- Department of Chemistry, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Lukáš F Pašteka
- Department of Physical and Theoretical Chemistry, Faculty of Natural Sciences, Comenius University, 81806 Bratislava, Slovakia
| | - Thomas Fallon
- Department of Chemistry, The University of Adelaide, Adelaide, SA 5005, Australia
| |
Collapse
|
8
|
Sobez JG, Reiher M. Molassembler: Molecular Graph Construction, Modification, and Conformer Generation for Inorganic and Organic Molecules. J Chem Inf Model 2020; 60:3884-3900. [DOI: 10.1021/acs.jcim.0c00503] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Jan-Grimo Sobez
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| | - Markus Reiher
- Laboratory of Physical Chemistry, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland
| |
Collapse
|
9
|
García-Jacas CR, Marrero-Ponce Y, Vivas-Reyes R, Suárez-Lezcano J, Martinez-Rios F, Terán JE, Aguilera-Mendoza L. Distributed and multicore QuBiLS-MIDAS software v2.0: Computing chiral, fuzzy, weighted and truncated geometrical molecular descriptors based on tensor algebra. J Comput Chem 2020; 41:1209-1227. [PMID: 32058625 DOI: 10.1002/jcc.26167] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 01/22/2020] [Accepted: 01/26/2020] [Indexed: 12/12/2022]
Abstract
Advances to the distributed, multi-core and fully cross-platform QuBiLS-MIDAS software v2.0 (http://tomocomd.com/qubils-midas) are reported in this article since the v1.0 release. The QuBiLS-MIDAS software is the only one that computes atom-pair and alignment-free geometrical MDs (3D-MDs) from several distance metrics other than the Euclidean distance, as well as alignment-free 3D-MDs that codify structural information regarding the relations among three and four atoms of a molecule. The most recent features added to the QuBiLS-MIDAS software v2.0 are related (a) to the calculation of atomic weightings from indices based on the vertex-degree invariant (e.g., Alikhanidi index); (b) to consider central chirality during the molecular encoding; (c) to use measures based on clustering methods and statistical functions to codify structural information among more than two atoms; (d) to the use of a novel method based on fuzzy membership functions to spherically truncate inter-atomic relations; and (e) to the use of weighted and fuzzy aggregation operators to compute global 3D-MDs according to the importance and/or interrelation of the atoms of a molecule during the molecular encoding. Moreover, a novel module to compute QuBiLS-MIDAS 3D-MDs from their headings was also developed. This module can be used either by the graphical user interface or by means of the software library. By using the library, both the predictive models built with the QuBiLS-MIDAS 3D-MDs and the QuBiLS-MIDAS 3D-MDs calculation can be embedded in other tools. A set of predefined QuBiLS-MIDAS 3D-MDs with high information content and low redundancy on a set comprised of 20,469 compounds is also provided to be employed in further cheminformatics tasks. This set of predefined 3D-MDs evidenced better performance than all the universe of Dragon (v5.5) and PaDEL 0D-to-3D MDs in variability studies, whereas a linear independence study proved that these QuBiLS-MIDAS 3D-MDs codify chemical information orthogonal to the Dragon 0D-to-3D MDs. This set of predefined 3D-MDs would be periodically updated as long as new results be achieved. In general, this report highlights our continued efforts to provide a better tool for a most suitable characterization of compounds, and in this way, to contribute to obtaining better outcomes in future applications.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras Conacyt - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja, California, Mexico
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas, Quito, Pichincha, Ecuador.,Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito, Pichincha, Ecuador.,Grupo GINUMED, Corporacion Universitaria Rafael Nuñez, Facultad de Salud, Programa de Medicina, Cartagena, Colombia.,Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Spain
| | - Ricardo Vivas-Reyes
- Grupo de Química Cuántica y Teórica de la Universidad de Cartagena - Facultad de Ciencias Exactas y Naturales. Programa de Química. Campus de San Pablo, Cartagena, Colombia.,Grupo CipTec, Facultad de Ingenierias. Fundacion Universitaria Tecnologico Comfenalco - Cartagena, Cartagena, Bolívar, Colombia
| | - José Suárez-Lezcano
- Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador
| | | | - Julio E Terán
- Department of Textile Engineering, Chemistry and Science, College of Textiles, NorthCarolina State University, Raleigh, NC, USA
| | - Longendri Aguilera-Mendoza
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California, Mexico
| |
Collapse
|