1
|
Song RX, Nicklaus MC, Tarasova NI. Correlation of protein binding pocket properties with hits' chemistries used in generation of ultra-large virtual libraries. J Comput Aided Mol Des 2024; 38:22. [PMID: 38753096 PMCID: PMC11098933 DOI: 10.1007/s10822-024-00562-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/22/2024] [Indexed: 05/19/2024]
Abstract
Although the size of virtual libraries of synthesizable compounds is growing rapidly, we are still enumerating only tiny fractions of the drug-like chemical universe. Our capability to mine these newly generated libraries also lags their growth. That is why fragment-based approaches that utilize on-demand virtual combinatorial libraries are gaining popularity in drug discovery. These à la carte libraries utilize synthetic blocks found to be effective binders in parts of target protein pockets and a variety of reliable chemistries to connect them. There is, however, no data on the potential impact of the chemistries used for making on-demand libraries on the hit rates during virtual screening. There are also no rules to guide in the selection of these synthetic methods for production of custom libraries. We have used the SAVI (Synthetically Accessible Virtual Inventory) library, constructed using 53 reliable reaction types (transforms), to evaluate the impact of these chemistries on docking hit rates for 40 well-characterized protein pockets. The data shows that the virtual hit rates differ significantly for different chemistries with cross coupling reactions such as Sonogashira, Suzuki-Miyaura, Hiyama and Liebeskind-Srogl coupling producing the highest hit rates. Virtual hit rates appear to depend not only on the property of the formed chemical bond but also on the diversity of available building blocks and the scope of the reaction. The data identifies reactions that deserve wider use through increasing the number of corresponding building blocks and suggests the reactions that are more effective for pockets with certain physical and hydrogen bond-forming properties.
Collapse
Affiliation(s)
- Robert X Song
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, 21702, USA
| | - Marc C Nicklaus
- Computer-Aided Drug Design Group, Chemical Biology Laboratory, Center for Cancer Research, National Cancer Institute, NIH, Frederick, MD, 21702, USA
| | - Nadya I Tarasova
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, 21702, USA.
| |
Collapse
|
2
|
Bonilla PA, Hoop CL, Stefanisko K, Tarasov SG, Sinha S, Nicklaus MC, Tarasova NI. Virtual screening of ultra-large chemical libraries identifies cell-permeable small-molecule inhibitors of a "non-druggable" target, STAT3 N-terminal domain. Front Oncol 2023; 13:1144153. [PMID: 37182134 PMCID: PMC10167007 DOI: 10.3389/fonc.2023.1144153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 03/23/2023] [Indexed: 05/16/2023] Open
Abstract
STAT3 N-terminal domain is a promising molecular target for cancer treatment and modulation of immune responses. However, STAT3 is localized in the cytoplasm, mitochondria, and nuclei, and thus, is inaccessible to therapeutic antibodies. Its N-terminal domain lacks deep pockets on the surface and represents a typical "non-druggable" protein. In order to successfully identify potent and selective inhibitors of the domain, we have used virtual screening of billion structure-sized virtual libraries of make-on-demand screening samples. The results suggest that the expansion of accessible chemical space by cutting-edge ultra-large virtual compound databases can lead to successful development of small molecule drugs for hard-to-target intracellular proteins.
Collapse
Affiliation(s)
- Pedro Andrade Bonilla
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, United States
| | - Cody L. Hoop
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, United States
| | - Karen Stefanisko
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, United States
| | - Sergey G. Tarasov
- Center for Structural Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, United States
| | | | - Marc C. Nicklaus
- Computer-Aided Drug Design Group, Chemical Biology Laboratory, Center for Cancer Research, National Cancer Institute, National Institute of Health (NIH), Frederick, MD, United States
| | - Nadya I. Tarasova
- Cancer Innovation Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD, United States
| |
Collapse
|
3
|
Ajala A, Uzairu A, Shallangwa GA, Abechi SE. QSAR, simulation techniques, and ADMET/pharmacokinetics assessment of a set of compounds that target MAO-B as anti-Alzheimer agent. FUTURE JOURNAL OF PHARMACEUTICAL SCIENCES 2023. [DOI: 10.1186/s43094-022-00452-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Abstract
Background
Alzheimer’s disease (AD), the most common cause of dementia in the elderly, is a progressive neurodegenerative disorder that gradually affects cognitive function and eventually causes death. Most approved drugs can only treat the disease alleviating the disease symptoms; therefore, there is a need to develop drugs that can treat this illness holistically. The medical community is searching for new drugs and new drug targets to cure this disease. In this study, QSAR, molecular docking evaluation, and ADMET/pharmacokinetics assessment were used as modeling methods to identify the compounds with outstanding physicochemical properties.
Results
The 37 MAO-B compounds were screened using the aforementioned methods and yielded a model with the following molecular properties: AATS1v, AATS3v, GATS4m, and GATS6e. Good statistical values were R2train = 0.69, R2adj = 0.63, R2pred = 0.57, LOF = 0.23, and RMSE = 0.38. The model was validated using an evaluation set that confirmed its robustness. The molecular docking was also utilized using crystal structure of human monoamine oxidase B in complex with chlorophenylchromone-carboxamide with ID code of 6FW0, and three compounds were identified with outstanding high binding affinity (13 = − 30.51 kcal mol−1, 31 = − 31.85 kcal mol−1, and 33 = − 33.70 kcal mol−1), and better than the Eldepryl (referenced) drug (− 11.40 kcal mol−1).
Conclusions
These three compounds (13, 31, and 33) were analyzed for ADMET/pharmacokinetics evaluation and found worthy of further analysis as promising drug candidates to cure AD and could also serve as a template to design several monoamine oxidase B inhibitors in the future to cure AD.
Collapse
|
4
|
Yang C, Zhang Y. Delta Machine Learning to Improve Scoring-Ranking-Screening Performances of Protein-Ligand Scoring Functions. J Chem Inf Model 2022; 62:2696-2712. [PMID: 35579568 DOI: 10.1021/acs.jcim.2c00485] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Protein-ligand scoring functions are widely used in structure-based drug design for fast evaluation of protein-ligand interactions, and it is of strong interest to develop scoring functions with machine-learning approaches. In this work, by expanding the training set, developing physically meaningful features, employing our recently developed linear empirical scoring function Lin_F9 (Yang, C. J. Chem. Inf. Model. 2021, 61, 4630-4644) as the baseline, and applying extreme gradient boosting (XGBoost) with Δ-machine learning, we have further improved the robustness and applicability of machine-learning scoring functions. Besides the top performances for scoring-ranking-screening power tests of the CASF-2016 benchmark, the new scoring function ΔLin_F9XGB also achieves superior scoring and ranking performances in different structure types that mimic real docking applications. The scoring powers of ΔLin_F9XGB for locally optimized poses, flexible redocked poses, and ensemble docked poses of the CASF-2016 core set achieve Pearson's correlation coefficient (R) values of 0.853, 0.839, and 0.813, respectively. In addition, the large-scale docking-based virtual screening test on the LIT-PCBA data set demonstrates the reliability and robustness of ΔLin_F9XGB in virtual screening application. The ΔLin_F9XGB scoring function and its code are freely available on the web at (https://yzhang.hpc.nyu.edu/Delta_LinF9_XGB).
Collapse
Affiliation(s)
- Chao Yang
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Yingkai Zhang
- Department of Chemistry, New York University, New York, New York 10003, United States.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
5
|
Abstract
Molecular docking is one of the most widely used computational tools in structure-based drug design and is critically dependent on accuracy and robustness of the scoring function. In this work, we introduce a new scoring function Lin_F9, which is a linear combination of nine empirical terms, including a unified metal bond term to specifically describe metal-ligand interactions. Parameters in Lin_F9 are obtained with a multistage fitting protocol using explicit water-included structures. For the CASF-2016 benchmark test set, Lin_F9 achieves the top scoring power among all 34 classical scoring functions for both original crystal poses and locally optimized poses with Pearson correlation coefficients (R) of 0.680 and 0.687, respectively. Meanwhile, in comparison with Vina, Lin_F9 achieves consistently better scoring power and ranking power with various types of protein-ligand complex structures that mimic real docking applications, including end-to-end flexible docking for the CASF-2016 benchmark test set using a single or an ensemble of protein receptor structures, as well as for D3R Grand Challenge (GC4) test sets. Lin_F9 has been implemented in a fork of Smina as an optional built-in scoring function that can be used for docking applications as well as for further improvement of scoring functions and docking protocols. Lin_F9 is accessible through https://yzhang.hpc.nyu.edu/Lin_F9/.
Collapse
Affiliation(s)
- Chao Yang
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Yingkai Zhang
- Department of Chemistry, New York University, New York, New York 10003, United States
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
6
|
Eberhardt J, Santos-Martins D, Tillack AF, Forli S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J Chem Inf Model 2021; 61:3891-3898. [PMID: 34278794 PMCID: PMC10683950 DOI: 10.1021/acs.jcim.1c00203] [Citation(s) in RCA: 1925] [Impact Index Per Article: 481.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
AutoDock Vina is arguably one of the fastest and most widely used open-source programs for molecular docking. However, compared to other programs in the AutoDock Suite, it lacks support for modeling specific features such as macrocycles or explicit water molecules. Here, we describe the implementation of this functionality in AutoDock Vina 1.2.0. Additionally, AutoDock Vina 1.2.0 supports the AutoDock4.2 scoring function, simultaneous docking of multiple ligands, and a batch mode for docking a large number of ligands. Furthermore, we implemented Python bindings to facilitate scripting and the development of docking workflows. This work is an effort toward the unification of the features of the AutoDock4 and AutoDock Vina programs. The source code is available at https://github.com/ccsb-scripps/AutoDock-Vina.
Collapse
Affiliation(s)
- Jerome Eberhardt
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, 92037 California, United States
| | - Diogo Santos-Martins
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, 92037 California, United States
| | - Andreas F Tillack
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, 92037 California, United States
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, 92037 California, United States
| |
Collapse
|
7
|
Alekseenko A, Kotelnikov S, Ignatov M, Egbert M, Kholodov Y, Vajda S, Kozakov D. ClusPro LigTBM: Automated Template-based Small Molecule Docking. J Mol Biol 2019; 432:3404-3410. [PMID: 31863748 DOI: 10.1016/j.jmb.2019.12.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 12/03/2019] [Accepted: 12/04/2019] [Indexed: 12/31/2022]
Abstract
The template-based approach has been essential for achieving high-quality models in the recent rounds of blind protein-protein docking competition CAPRI (Critical Assessment of Predicted Interactions). However, few such automated methods exist for protein-small molecule docking. In this paper, we present an algorithm for template-based docking of small molecules. It searches for known complexes with ligands that have partial coverage of the target ligand, performs conformational sampling and template-guided energy refinement to produce a variety of possible poses, and then scores the refined poses. The algorithm is available as the automated ClusPro LigTBM server. It allows the user to specify the target protein as a PDB file and the ligand as a SMILES string. The server then searches for templates and uses them for docking, presenting the user with top-scoring poses and their confidence scores. The method is tested on the Astex Diverse benchmark, as well as on the targets from the last round of the D3R (Drug Design Data Resource) Grand Challenge. The server is publicly available as part of the ClusPro docking server suite at https://ligtbm.cluspro.org/.
Collapse
Affiliation(s)
- Andrey Alekseenko
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794 Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, 11794 Stony Brook, NY, USA
| | - Sergei Kotelnikov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794 Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, 11794 Stony Brook, NY, USA; Innopolis University, 420500, Innopolis, Russia
| | - Mikhail Ignatov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794 Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, 11794 Stony Brook, NY, USA; Institute for Advanced Computational Sciences, Stony Brook University, 11794, Stony Brook, NY, USA
| | - Megan Egbert
- Department of Biomedical Engineering, Boston University, 02215, Boston, MA, USA
| | | | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, 02215, Boston, MA, USA; Department of Chemistry, Boston University, 02215, Boston, MA, USA
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, 11794 Stony Brook, NY, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, 11794 Stony Brook, NY, USA; Institute for Advanced Computational Sciences, Stony Brook University, 11794, Stony Brook, NY, USA.
| |
Collapse
|