1
|
Li J, Guan X, Zhang O, Sun K, Wang Y, Bagni D, Head-Gordon T. Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction. ARXIV 2024:arXiv:2308.09639v2. [PMID: 37645037 PMCID: PMC10462179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Many physics-based and machine-learned scoring functions (SFs) used to predict protein-ligand binding free energies have been trained on the PDBBind dataset. However, it is controversial as to whether new SFs are actually improving since the general, refined, and core datasets of PDBBind are cross-contaminated with proteins and ligands with high similarity, and hence they may not perform comparably well in binding prediction of new protein-ligand complexes. In this work we have carefully prepared a cleaned PDBBind data set of non-covalent binders that are split into training, validation, and test datasets to control for data leakage, defined as proteins and ligands with high sequence and structural similarity. The resulting leak-proof (LP)-PDBBind data is used to retrain four popular SFs: AutoDock Vina, Random Forest (RF)-Score, InteractionGraphNet (IGN), and DeepDTA, to better test their capabilities when applied to new protein-ligand complexes. In particular we have formulated a new independent data set, BDB2020+, by matching high quality binding free energies from BindingDB with co-crystalized ligand-protein complexes from the PDB that have been deposited since 2020. Based on all the benchmark results, the retrained models using LP-PDBBind consistently perform better, with IGN especially being recommended for scoring and ranking applications for new protein-ligand systems.
Collapse
|
2
|
Xu X, Han W, Ning X, Zang C, Xu C, Zeng C, Pu C, Zhang Y, Chen Y, Liu H. Constructing Innovative Covalent and Noncovalent Compound Libraries: Insights from 3D Protein-Ligand Interactions. J Chem Inf Model 2024; 64:1543-1559. [PMID: 38381562 DOI: 10.1021/acs.jcim.3c01689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Noncovalent interactions between small-molecule drugs and protein targets assume a pivotal role in drug design. Moreover, the design of covalent inhibitors, forming covalent bonds with amino acid residues, requires rational reactivity for their covalent warheads, presenting a key challenge as well. Understanding the intricacies of these interactions provides a more comprehensive understanding of molecular binding mechanisms, thereby guiding the rational design of potent inhibitors. In this study, we adopted the fragment-based drug design approach, introducing a novel methodology to extract noncovalent and covalent fragments according to distinct three-dimensional (3D) interaction modes from noncovalent and covalent compound libraries. Additionally, we systematically replaced existing ligands with rational fragment substitutions, based on the spatial orientation of fragments in 3D space. Furthermore, we adopted a molecular generation approach to create innovative covalent inhibitors. This process resulted in the recombination of a noncovalent compound library and several covalent compound libraries, constructed by two commonly encountered covalent amino acids: cysteine and serine. We utilized noncovalent ligands in KLIFS and covalent ligands in CovBinderInPDB as examples to recombine noncovalent and covalent libraries. These recombined compound libraries cover a substantial portion of the chemical space present in the original compound libraries and exhibit superior performance in terms of molecular scaffold diversity compared to the original compound libraries and other 11 commercial libraries. We also recombined BTK-focused libraries, and 23 compounds within our libraries have been validated by former researchers to possess potential biological activity. The establishment of these compound libraries provides valuable resources for virtual screening of covalent and noncovalent drugs targeting similar molecular targets.
Collapse
Affiliation(s)
- Xiaohe Xu
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Weijie Han
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Xiangzhen Ning
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Chengdong Zang
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Chengcheng Xu
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Chen Zeng
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Chengtao Pu
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Yanmin Zhang
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Yadong Chen
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| | - Haichun Liu
- Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, 639 Longmian Avenue, Nanjing 211198, China
| |
Collapse
|
3
|
Goullieux M, Zoete V, Röhrig UF. Two-Step Covalent Docking with Attracting Cavities. J Chem Inf Model 2023; 63:7847-7859. [PMID: 38049143 PMCID: PMC10751798 DOI: 10.1021/acs.jcim.3c01055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 11/07/2023] [Accepted: 11/13/2023] [Indexed: 12/06/2023]
Abstract
Due to their various advantages, interest in the development of covalent drugs has been renewed in the past few years. It is therefore important to accurately describe and predict their interactions with biological targets by computer-aided drug design tools such as docking algorithms. Here, we report a covalent docking procedure for our in-house docking code Attracting Cavities (AC), which mimics the two-step mechanism of covalent ligand binding. Ligand binding to the protein cavity is driven by nonbonded interactions, followed by the formation of a covalent bond between the ligand and the protein through a chemical reaction. To test the performance of this method, we developed a diverse, high-quality, openly accessible re-docking benchmark set of 95 covalent complexes bound by 8 chemical reactions to 5 different reactive amino acids. Combination with structures from previous studies resulted in a set of 304 complexes, on which AC obtained a success rate (rmsd ≤ 2 Å) of 78%, outperforming two state-of-the-art covalent docking codes, genetic optimization for ligand docking (GOLD (66%)) and AutoDock (AD (35%)). Using a more stringent success criterion (rmsd ≤ 1.5 Å), AC reached a success rate of 71 vs 55% for GOLD and 26% for AD. We additionally assessed the cross-docking performance of AC on a set of 76 covalent complexes of the SARS-CoV-2 main protease. On this challenging test set of mainly small and highly solvent-exposed ligands, AC yielded success rates of 58 and 28% for re-docking and cross-docking, respectively, compared to 45 and 17% for GOLD.
Collapse
Affiliation(s)
- Mathilde Goullieux
- SIB
Swiss Institute of Bioinformatics, Molecular Modeling Group, CH-1015 Lausanne, Switzerland
| | - Vincent Zoete
- SIB
Swiss Institute of Bioinformatics, Molecular Modeling Group, CH-1015 Lausanne, Switzerland
- Department
of Oncology UNIL-CHUV, Lausanne University, Ludwig Institute for Cancer Research
Lausanne Branch, CH-1066 Epalinges, Switzerland
| | - Ute F. Röhrig
- SIB
Swiss Institute of Bioinformatics, Molecular Modeling Group, CH-1015 Lausanne, Switzerland
| |
Collapse
|
4
|
Bugnon M, Goullieux M, Röhrig UF, Perez MAS, Daina A, Michielin O, Zoete V. SwissParam 2023: A Modern Web-Based Tool for Efficient Small Molecule Parametrization. J Chem Inf Model 2023; 63:6469-6475. [PMID: 37853543 PMCID: PMC10649791 DOI: 10.1021/acs.jcim.3c01053] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Indexed: 10/20/2023]
Abstract
Most steps of drug discovery are now routinely supported and accelerated by computer-aided drug design tools. Among them, structure-based approaches use the three-dimensional structure of the targeted biomacromolecule as a major source of information. When it comes to calculating the interactions of small molecules with proteins using the equations of molecular mechanics, topologies, atom typing, and force field parameters are required. However, generating parameters for small molecules remains challenging due to the large number of existing chemical groups. The SwissParam web tool was first released in 2011 with the aim of generating parameters and topologies for small molecules based on the Merck molecular force field (MMFF) while being compatible with the CHARMM22/27 force field. Here, we present an updated version of SwissParam, providing various new features, including the possibility to setup covalent ligands. Molecules can now be imported from different file formats or via a molecular sketcher. The MMFF-based approach has been updated to provide parameters and topologies compatible with the CHARMM36 force field. An option was added to generate small molecule parametrizations following the CHARMM General Force Field via the multipurpose atom-typer for CHARMM (MATCH) approach. Additionally, SwissParam now generates information on probable alternative tautomers and protonation states of the query molecule so that the user can consider all microspecies relevant to its compound. The new version of SwissParam is freely available at www.swissparam.ch and can also be accessed through a newly implemented command-line interface.
Collapse
Affiliation(s)
- Marine Bugnon
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Mathilde Goullieux
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Ute F. Röhrig
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Marta A. S. Perez
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Antoine Daina
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Olivier Michielin
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Department
of Oncology, Geneva University Hospital
(HUG), CH-1205 Genève, Switzerland
| | - Vincent Zoete
- Molecular
Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Department
of Oncology UNIL-CHUV, Ludwig Institute for Cancer Research, Lausanne Branch, University of Lausanne (UNIL), CH-1015 Lausanne, Switzerland
| |
Collapse
|
5
|
Yu W, Weber DJ, MacKerell AD. Integrated Covalent Drug Design Workflow Using Site Identification by Ligand Competitive Saturation. J Chem Theory Comput 2023; 19:3007-3021. [PMID: 37115781 PMCID: PMC10205696 DOI: 10.1021/acs.jctc.3c00232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Covalent drug design is an important component in drug discovery. Traditional drugs interact with their target in a reversible equilibrium, while irreversible covalent drugs increase the drug-target interaction duration by forming a covalent bond with targeted residues and thus may offer a more effective therapeutic approach. To facilitate the design of this class of ligands, computational methods can be used to help identify reactive nucleophilic residues, frequently cysteines, on a target protein for covalent binding, to test various warhead groups for their potential reactivities, and to predict noncovalent contributions to binding that can facilitate drug-target interactions that are important for binding specificity. To further aid covalent drug design, we extended a functional group mapping approach based on explicit solvent all-atom molecular simulations (SILCS: site identification by ligand competitive saturation) that intrinsically considers protein flexibility, functional group, and protein desolvation along with functional group-protein interactions. Through docking of a library of representative warhead fragments using SILCS-Monte Carlo (SILCS-MC), reactive cysteines can be correctly identified for proteins being tested. Furthermore, a machine learning model was trained to quantify the effectiveness of various warhead groups for proteins using metrics from SILCS-MC as well as experimental model compound warhead reactivity data. The ability to rank covalent molecular binders with similar warheads using SILCS ligand grid free energy (LGFE) ranking was also tested for several proteins. Based on these tools, an integrated SILCS-based workflow was developed, named SILCS-Covalent, which can both qualitatively and quantitatively inform covalent drug discovery.
Collapse
Affiliation(s)
- Wenbo Yu
- Computer-Aided Drug Design Center, Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland Baltimore, Baltimore, Maryland 21201, United States
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, Maryland 20850, United States
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland Baltimore, Baltimore, Maryland 21201, United States
| | - David J. Weber
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, Maryland 20850, United States
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland Baltimore, Baltimore, Maryland 21201, United States
| | - Alexander D. MacKerell
- Computer-Aided Drug Design Center, Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland Baltimore, Baltimore, Maryland 21201, United States
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, Maryland 20850, United States
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland Baltimore, Baltimore, Maryland 21201, United States
| |
Collapse
|