1
|
Ries B, Alibay I, Swenson DWH, Baumann HM, Henry MM, Eastwood JRB, Gowers RJ. Kartograf: A Geometrically Accurate Atom Mapper for Hybrid-Topology Relative Free Energy Calculations. J Chem Theory Comput 2024; 20:1862-1877. [PMID: 38330251 PMCID: PMC10941767 DOI: 10.1021/acs.jctc.3c01206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 01/17/2024] [Accepted: 01/18/2024] [Indexed: 02/10/2024]
Abstract
Relative binding free energy (RBFE) calculations have emerged as a powerful tool that supports ligand optimization in drug discovery. Despite many successes, the use of RBFEs can often be limited by automation problems, in particular, the setup of such calculations. Atom mapping algorithms are an essential component in setting up automatic large-scale hybrid-topology RBFE calculation campaigns. Traditional algorithms typically employ a 2D subgraph isomorphism solver (SIS) in order to estimate the maximum common substructure. SIS-based approaches can be limited by time-intensive operations and issues with capturing geometry-linked chemical properties, potentially leading to suboptimal solutions. To overcome these limitations, we have developed Kartograf, a geometric-graph-based algorithm that uses primarily the 3D coordinates of atoms to find a mapping between two ligands. In free energy approaches, the ligand conformations are usually derived from docking or other previous modeling approaches, giving the coordinates a certain importance. By considering the spatial relationships between atoms related to the molecule coordinates, our algorithm bypasses the computationally complex subgraph matching of SIS-based approaches and reduces the problem to a much simpler bipartite graph matching problem. Moreover, Kartograf effectively circumvents typical mapping issues induced by molecule symmetry and stereoisomerism, making it a more robust approach for atom mapping from a geometric perspective. To validate our method, we calculated mappings with our novel approach using a diverse set of small molecules and used the mappings in relative hydration and binding free energy calculations. The comparison with two SIS-based algorithms showed that Kartograf offers a fast alternative approach. The code for Kartograf is freely available on GitHub (https://github.com/OpenFreeEnergy/kartograf). While developed for the OpenFE ecosystem, Kartograf can also be utilized as a standalone Python package.
Collapse
Affiliation(s)
- Benjamin Ries
- Medicinal
Chemistry, Boehringer Ingelheim Pharma GmbH
& Co KG, Birkendorfer Str 65, 88397 Biberach an der Riss, Germany
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| | - Irfan Alibay
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| | - David W. H. Swenson
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| | - Hannah M. Baumann
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| | - Michael M. Henry
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
- Computational
and Systems Biology Program, Sloan Kettering
Institute, Memorial Sloan Kettering Cancer Center, New York, 1275 New York, United States
| | - James R. B. Eastwood
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| | - Richard J. Gowers
- Open
Free Energy, Open Molecular Software Foundation, Davis, 95616 California, United States
| |
Collapse
|
2
|
Stroet M, Caron B, Engler MS, van der Woning J, Kauffmann A, van Dijk M, El-Kebir M, Visscher KM, Holownia J, Macfarlane C, Bennion BJ, Gelpi-Dominguez S, Lightstone FC, van der Storm T, Geerke DP, Mark AE, Klau GW. OFraMP: a fragment-based tool to facilitate the parametrization of large molecules. J Comput Aided Mol Des 2023:10.1007/s10822-023-00511-7. [PMID: 37310542 DOI: 10.1007/s10822-023-00511-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 05/29/2023] [Indexed: 06/14/2023]
Abstract
An Online tool for Fragment-based Molecule Parametrization (OFraMP) is described. OFraMP is a web application for assigning atomic interaction parameters to large molecules by matching sub-fragments within the target molecule to equivalent sub-fragments within the Automated Topology Builder (ATB, atb.uq.edu.au) database. OFraMP identifies and compares alternative molecular fragments from the ATB database, which contains over 890,000 pre-parameterized molecules, using a novel hierarchical matching procedure. Atoms are considered within the context of an extended local environment (buffer region) with the degree of similarity between an atom in the target molecule and that in the proposed match controlled by varying the size of the buffer region. Adjacent matching atoms are combined into progressively larger matched sub-structures. The user then selects the most appropriate match. OFraMP also allows users to manually alter interaction parameters and automates the submission of missing substructures to the ATB in order to generate parameters for atoms in environments not represented in the existing database. The utility of OFraMP is illustrated using the anti-cancer agent paclitaxel and a dendrimer used in organic semiconductor devices. OFraMP applied to paclitaxel (ATB ID 35922).
Collapse
Affiliation(s)
- Martin Stroet
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Bertrand Caron
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Martin S Engler
- Centrum Wiskunde & Informatica, Science Park 123, 1098 XG, Amsterdam, The Netherlands
- Algorithmic Bioinformatics, Heinrich Heine University Düsseldorf, Universitätsstr. 1, 40225, Düsseldorf, Germany
| | - Jimi van der Woning
- Centrum Wiskunde & Informatica, Science Park 123, 1098 XG, Amsterdam, The Netherlands
| | - Aude Kauffmann
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Marc van Dijk
- Department of Chemistry and Pharmaceutical Sciences, Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ, Amsterdam, the Netherlands
| | - Mohammed El-Kebir
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Koen M Visscher
- Department of Chemistry and Pharmaceutical Sciences, Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ, Amsterdam, the Netherlands
| | - Josef Holownia
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Callum Macfarlane
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia
| | - Brian J Bennion
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA, 94552, USA
| | - Svetlana Gelpi-Dominguez
- Department of Chemistry, University of Connecticut, 55 North Eagleville Road, Storrs, CT, 06269, USA
| | - Felice C Lightstone
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA, 94552, USA
| | - Tijs van der Storm
- Centrum Wiskunde & Informatica, Science Park 123, 1098 XG, Amsterdam, The Netherlands
- Faculty of Science and Engineering, University of Groningen, Nijenborgh 4, 9747 AG, Groningen, The Netherlands
| | - Daan P Geerke
- Department of Chemistry and Pharmaceutical Sciences, Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, De Boelelaan 1108, 1081 HZ, Amsterdam, the Netherlands
| | - Alan E Mark
- School of Chemistry & Molecular Biosciences, The University of Queensland, Brisbane, QLD, 4072, Australia.
| | - Gunnar W Klau
- Algorithmic Bioinformatics, Heinrich Heine University Düsseldorf, Universitätsstr. 1, 40225, Düsseldorf, Germany
| |
Collapse
|
3
|
Schietgat L, Cuissart B, De Grave K, Efthymiadis K, Bureau R, Crémilleux B, Ramon J, Lepailleur A. Automated detection of toxicophores and prediction of mutagenicity using PMCSFG algorithm. Mol Inform 2023; 42:e2200232. [PMID: 36529710 DOI: 10.1002/minf.202200232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/13/2022] [Accepted: 12/18/2022] [Indexed: 12/23/2022]
Abstract
Maximum common substructures (MCS) have received a lot of attention in the chemoinformatics community. They are typically used as a similarity measure between molecules, showing high predictive performance when used in classification tasks, while being easily explainable substructures. In the present work, we applied the Pairwise Maximum Common Subgraph Feature Generation (PMCSFG) algorithm to automatically detect toxicophores (structural alerts) and to compute fingerprints based on MCS. We present a comparison between our MCS-based fingerprints and 12 well-known chemical fingerprints when used as features in machine learning models. We provide an experimental evaluation and discuss the usefulness of the different methods on mutagenicity data. The features generated by the MCS method have a state-of-the-art performance when predicting mutagenicity, while they are more interpretable than the traditional chemical fingerprints.
Collapse
Affiliation(s)
- Leander Schietgat
- Artificial Intelligence Lab, Vrije Universiteit Brussel, Brussel, Belgium.,Department of Computer Science, KU Leuven, Leuven, Belgium
| | - Bertrand Cuissart
- Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen, UNICAEN, ENSICAEN, CNRS - UMR GREYC, Normandie Univ., Caen, France
| | | | | | - Ronan Bureau
- Centre d'Etudes et de Recherche sur le Médicament de Normandie, UNICAEN, CERMN, Normandie Univ., Caen, France
| | - Bruno Crémilleux
- Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen, UNICAEN, ENSICAEN, CNRS - UMR GREYC, Normandie Univ., Caen, France
| | - Jan Ramon
- INRIA Lille Nord Europe, Lille, France
| | - Alban Lepailleur
- Centre d'Etudes et de Recherche sur le Médicament de Normandie, UNICAEN, CERMN, Normandie Univ., Caen, France
| |
Collapse
|
4
|
Abstract
Alignments of discrete objects can be constructed in a very general setting as super-objects from which the constituent objects are recovered by means of projections. Here, we focus on contact maps, i.e. undirected graphs with an ordered set of vertices. These serve as natural discretizations of RNA and protein structures. In the general case, the alignment problem for vertex-ordered graphs is NP-complete. In the special case of RNA secondary structures, i.e. crossing-free matchings, however, the alignments have a recursive structure. The alignment problem then can be solved by a variant of the Sankoff algorithm in polynomial time. Moreover, the tree or forest alignments of RNA secondary structure can be understood as the alignments of ordered edge sets.
Collapse
Affiliation(s)
- Peter F Stadler
- Bioinformatics Group, Department of Computer Science and Interdisciplinary Centre for Bioinformatics, Universität Leipzig, Härtelstraße 16-18, 04107 Leipzig, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Centre for Scalable Data Services and Solutions Dresden-Leipzig, Leipzig Research Centre for Civilization Diseases, and Centre for Biotechnology and Biomedicine at Leipzig University, Universität Leipzig, Leipzig, Germany.,Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, 04103 Leipzig, Germany.,Institute for Theoretical Chemistry, University of Vienna, Währingerstrasse 17, 1090 Wien, Austria.,Facultad de Ciencias, Universidad National de Colombia, Bogotá, Colombia.,Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| |
Collapse
|
5
|
Schmidt R, Krull F, Heinzke AL, Rarey M. Disconnected Maximum Common Substructures under Constraints. J Chem Inf Model 2020; 61:167-178. [PMID: 33325698 DOI: 10.1021/acs.jcim.0c00741] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The maximum common substructure (MCS) problem is an important, well-studied problem in cheminformatics. It is applied in several application scenarios like molecule superimposition and scaffold detection or as a similarity measure in virtual screening and clustering. In many cases, the connected MCS is preferred since it is faster to calculate and a highly fragmented MCS is not very meaningful from a chemical point of view. Nevertheless, a disconnected MCS (dMCS) can be very instructive if it consists of reasonably sized molecular parts connected by variable groups. We present a new algorithm named RIMACS, which is able to calculate the dMCS under constraints. We can control the maximum number of connected components and their minimal size using a modified local substructure mapping approach. A formal proof of correctness is provided as well as extended runtime evaluations on chemical data. The evaluation of RIMACS shows that a small number of connected components helps us to improve MCS similarity in a meaningful way while keeping the runtime requirements in a reasonable range.
Collapse
Affiliation(s)
- Robert Schmidt
- Universität Hamburäg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Florian Krull
- Universität Hamburäg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Anna Lina Heinzke
- Universität Hamburäg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- Universität Hamburäg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| |
Collapse
|