1
|
Gómez-Sacristán P, Simeon S, Tran-Nguyen VK, Patil S, Ballester PJ. Inactive-enriched machine-learning models exploiting patent data improve structure-based virtual screening for PDL1 dimerizers. J Adv Res 2025; 67:185-196. [PMID: 38280715 PMCID: PMC11725107 DOI: 10.1016/j.jare.2024.01.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 12/01/2023] [Accepted: 01/21/2024] [Indexed: 01/29/2024] Open
Abstract
INTRODUCTION Small-molecule Programmable Cell Death Protein 1/Programmable Death-Ligand 1 (PD1/PDL1) inhibition via PDL1 dimerization has the potential to lead to inexpensive drugs with better cancer patient outcomes and milder side effects. However, this therapeutic approach has proven challenging, with only one PDL1 dimerizer reaching early clinical trials so far. There is hence a need for fast and accurate methods to develop alternative PDL1 dimerizers. OBJECTIVES We aim to show that structure-based virtual screening (SBVS) based on PDL1-specific machine-learning (ML) scoring functions (SFs) is a powerful drug design tool for detecting PD1/PDL1 inhibitors via PDL1 dimerization. METHODS By incorporating the latest MLSF advances, we generated and evaluated PDL1-specific MLSFs (classifiers and inactive-enriched regressors) on two demanding test sets. RESULTS 60 PDL1-specific MLSFs (30 classifiers and 30 regressors) were generated. Our large-scale analysis provides highly predictive PDL1-specific MLSFs that benefitted from training with large volumes of docked inactives and enabling inactive-enriched regression. CONCLUSION PDL1-specific MLSFs strongly outperformed generic SFs of various types on this target and are released here without restrictions.
Collapse
Affiliation(s)
| | - Saw Simeon
- Centre de Recherche en Cancérologie de Marseille, Marseille 13009, France
| | | | - Sachin Patil
- NanoBio Laboratory, Widener University, Chester, PA 19013, USA
| | - Pedro J Ballester
- Department of Bioengineering, Imperial College London, London SW7 2AZ, UK.
| |
Collapse
|
2
|
Noor F, Junaid M, Almalki AH, Almaghrabi M, Ghazanfar S, Tahir Ul Qamar M. Deep learning pipeline for accelerating virtual screening in drug discovery. Sci Rep 2024; 14:28321. [PMID: 39550439 PMCID: PMC11569207 DOI: 10.1038/s41598-024-79799-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Accepted: 11/12/2024] [Indexed: 11/18/2024] Open
Abstract
In the race to combat ever-evolving diseases, the drug discovery process often faces the hurdles of high-cost and time-consuming procedures. To tackle these challenges and enhance the efficiency of identifying new therapeutic agents, we introduce VirtuDockDL, which is a streamlined Python-based web platform utilizing deep learning for drug discovery. This pipeline employs a Graph Neural Network to analyze and predict the effectiveness of various compounds as potential drug candidates. During the validation phase, VirtuDockDL was instrumental in identifying non-covalent inhibitors against the VP35 protein of the Marburg virus, a critical target given the virus's high fatality rate and limited treatment options. Further, in benchmarking, VirtuDockDL achieved 99% accuracy, an F1 score of 0.992, and an AUC of 0.99 on the HER2 dataset, surpassing DeepChem (89% accuracy) and AutoDock Vina (82% accuracy). Compared to RosettaVS, MzDOCK, and PyRMD, VirtuDockDL outperformed them by combining both ligand- and structure-based screening with deep learning. While RosettaVS excels in accurate docking but lacks high-throughput screening, and PyRMD focuses on ligand-based methods without AI integration, VirtuDockDL offers superior predictive accuracy and full automation for large-scale datasets, making it ideal for comprehensive drug discovery workflows. These results underscore the tool's capability to identify high-affinity inhibitors accurately across various targets, including the HER2 protein for cancer therapy, TEM-1 beta-lactamase for bacterial infections, and the CYP51 enzyme for fungal infections like Candidiasis. To sum up, VirtuDockDL combines user-friendly interface design with powerful computational capabilities to facilitate rapid, cost-effective drug discovery and development. The integration of AI in drug discovery could potentially transform the landscape of pharmaceutical research, providing faster responses to global health challenges. The VirtuDockDL is available at https://github.com/FatimaNoor74/VirtuDockDL .
Collapse
Affiliation(s)
- Fatima Noor
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore, 35000, Pakistan
- Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan
| | - Muhammad Junaid
- Institute for Advanced Study, Shenzhen University, Shenzhen, 518060, China
| | - Atiah H Almalki
- Department of Pharmaceutical Chemistry, College of Pharmacy, Taif University, P.O. Box 11099, 21944, Taif, Saudi Arabia
- Addiction and Neuroscience Research Unit, College of Pharmacy, Taif University, Al-Hawiah, 21944, Taif, Saudi Arabia
| | - Mohammed Almaghrabi
- Pharmacognosy and Pharmaceutical Chemistry Department, Faculty of Pharmacy, Taibah University, 30001, Al Madinah Al Munawarah, Saudi Arabia
| | - Shakira Ghazanfar
- National Institute for Genomics and Advanced Biotechnology (NIGAB), National Agricultural Research Center (NARC), Islamabad, Pakistan
| | - Muhammad Tahir Ul Qamar
- Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad, 38000, Pakistan.
| |
Collapse
|
3
|
Wang XY, Chai X, Shan LH, Xu XH, Xu L, Hou TJ, Sun HY, Li D. A potent new-scaffold androgen receptor antagonist discovered on the basis of a MIEC-SVM model. Acta Pharmacol Sin 2024; 45:1978-1991. [PMID: 38750073 PMCID: PMC11335958 DOI: 10.1038/s41401-024-01284-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 04/03/2024] [Indexed: 08/22/2024] Open
Abstract
Prostate cancer (PCa) is the second most prevalent malignancy among men worldwide. The aberrant activation of androgen receptor (AR) signaling has been recognized as a crucial oncogenic driver for PCa and AR antagonists are widely used in PCa therapy. To develop novel AR antagonist, a machine-learning MIEC-SVM model was established for the virtual screening and 51 candidates were selected and submitted for bioactivity evaluation. To our surprise, a new-scaffold AR antagonist C2 with comparable bioactivity with Enz was identified at the initial round of screening. C2 showed pronounced inhibition on the transcriptional function (IC50 = 0.63 μM) and nuclear translocation of AR and significant antiproliferative and antimetastatic activity on PCa cell line of LNCaP. In addition, C2 exhibited a stronger ability to block the cell cycle of LNCaP than Enz at lower dose and superior AR specificity. Our study highlights the success of MIEC-SVM in discovering AR antagonists, and compound C2 presents a promising new scaffold for the development of AR-targeted therapeutics.
Collapse
Affiliation(s)
- Xin-Yue Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xin Chai
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Lu-Hu Shan
- Institute of Cancer Research and Basic Medical Sciences of Chinese Academy of Sciences, Cancer Hospital of University of Chinese Academy of Sciences, Zhejiang Cancer Hospital, Hangzhou, 310022, China
| | - Xiao-Hong Xu
- Institute of Cancer Research and Basic Medical Sciences of Chinese Academy of Sciences, Cancer Hospital of University of Chinese Academy of Sciences, Zhejiang Cancer Hospital, Hangzhou, 310022, China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, 213001, China
| | - Ting-Jun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
- State Key Lab of CAD&CG, Zhejiang University, Hangzhou, 310058, China
| | - Hui-Yong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing, 210009, China.
| | - Dan Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- Jinhua Institute of Zhejiang University, Jinhua, 321000, China.
| |
Collapse
|
4
|
Deng Q, Wang Z, Xiang S, Wang Q, Liu Y, Hou T, Sun H. RLpMIEC: High-Affinity Peptide Generation Targeting Major Histocompatibility Complex-I Guided and Interpreted by Interaction Spectrum-Navigated Reinforcement Learning. J Chem Inf Model 2024; 64:6432-6449. [PMID: 39118363 DOI: 10.1021/acs.jcim.4c01153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2024]
Abstract
Major histocompatibility complex (MHC) plays a vital role in presenting epitopes (short peptides from pathogenic proteins) to T-cell receptors (TCRs) to trigger the subsequent immune responses. Vaccine design targeting MHC generally aims to find epitopes with a high binding affinity for MHC presentation. Nevertheless, to find novel epitopes usually requires high-throughput screening of bulk peptide database, which is time-consuming, labor-intensive, more unaffordable, and very expensive. Excitingly, the past several years have witnessed the great success of artificial intelligence (AI) in various fields, such as natural language processing (NLP, e.g., GPT-4), protein structure prediction and engineering (e.g., AlphaFold2), and so on. Therefore, herein, we propose a deep reinforcement-learning (RL)-based generative algorithm, RLpMIEC, to quantitatively design peptide targeting MHC-I systems. Specifically, RLpMIEC combines the energetic spectrum (namely, the molecular interaction energy component, MIEC) based on the peptide-MHC interaction and the sequence information to generate peptides with strong binding affinity and precise MIEC spectra to accelerate the discovery of candidate peptide vaccines. RLpMIEC performs well in all the generative capability evaluations and can generate peptides with strong binding affinities and precise MIECs and, moreover, with high interpretability, demonstrating its powerful capability in participation for accelerating peptide-based vaccine development.
Collapse
Affiliation(s)
- Qirui Deng
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009 Jiangsu, P. R. China
| | - Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058 Zhejiang, P. R. China
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, P. R. China
| | - Sutong Xiang
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009 Jiangsu, P. R. China
| | - Qinghua Wang
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009 Jiangsu, P. R. China
| | - Yifei Liu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058 Zhejiang, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058 Zhejiang, P. R. China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009 Jiangsu, P. R. China
| |
Collapse
|
5
|
Caba K, Tran-Nguyen VK, Rahman T, Ballester PJ. Comprehensive machine learning boosts structure-based virtual screening for PARP1 inhibitors. J Cheminform 2024; 16:40. [PMID: 38582911 PMCID: PMC10999096 DOI: 10.1186/s13321-024-00832-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 03/23/2024] [Indexed: 04/08/2024] Open
Abstract
Poly ADP-ribose polymerase 1 (PARP1) is an attractive therapeutic target for cancer treatment. Machine-learning scoring functions constitute a promising approach to discovering novel PARP1 inhibitors. Cutting-edge PARP1-specific machine-learning scoring functions were investigated using semi-synthetic training data from docking activity-labelled molecules: known PARP1 inhibitors, hard-to-discriminate decoys property-matched to them with generative graph neural networks and confirmed inactives. We further made test sets harder by including only molecules dissimilar to those in the training set. Comprehensive analysis of these datasets using five supervised learning algorithms, and protein-ligand fingerprints extracted from docking poses and ligand only features revealed one highly predictive scoring function. This is the PARP1-specific support vector machine-based regressor, when employing PLEC fingerprints, which achieved a high Normalized Enrichment Factor at the top 1% on the hardest test set (NEF1% = 0.588, median of 10 repetitions), and was more predictive than any other investigated scoring function, especially the classical scoring function employed as baseline.
Collapse
Affiliation(s)
- Klaudia Caba
- Department of Bioengineering, Imperial College London, London, SW7 2AZ, UK
| | - Viet-Khoa Tran-Nguyen
- Unité de Biologie Fonctionnelle et Adaptative (BFA), UFR Sciences du Vivant, Université Paris Cité, 75013, Paris, France
| | - Taufiq Rahman
- Department of Pharmacology, University of Cambridge, Cambridge, CB2 1PD, UK
| | - Pedro J Ballester
- Department of Bioengineering, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
6
|
Tran-Nguyen VK, Junaid M, Simeon S, Ballester PJ. A practical guide to machine-learning scoring for structure-based virtual screening. Nat Protoc 2023; 18:3460-3511. [PMID: 37845361 DOI: 10.1038/s41596-023-00885-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 07/03/2023] [Indexed: 10/18/2023]
Abstract
Structure-based virtual screening (SBVS) via docking has been used to discover active molecules for a range of therapeutic targets. Chemical and protein data sets that contain integrated bioactivity information have increased both in number and in size. Artificial intelligence and, more concretely, its machine-learning (ML) branch, including deep learning, have effectively exploited these data sets to build scoring functions (SFs) for SBVS against targets with an atomic-resolution 3D model (e.g., generated by X-ray crystallography or predicted by AlphaFold2). Often outperforming their generic and non-ML counterparts, target-specific ML-based SFs represent the state of the art for SBVS. Here, we present a comprehensive and user-friendly protocol to build and rigorously evaluate these new SFs for SBVS. This protocol is organized into four sections: (i) using a public benchmark of a given target to evaluate an existing generic SF; (ii) preparing experimental data for a target from public repositories; (iii) partitioning data into a training set and a test set for subsequent target-specific ML modeling; and (iv) generating and evaluating target-specific ML SFs by using the prepared training-test partitions. All necessary code and input/output data related to three example targets (acetylcholinesterase, HMG-CoA reductase, and peroxisome proliferator-activated receptor-α) are available at https://github.com/vktrannguyen/MLSF-protocol , can be run by using a single computer within 1 week and make use of easily accessible software/programs (e.g., Smina, CNN-Score, RF-Score-VS and DeepCoy) and web resources. Our aim is to provide practical guidance on how to augment training data to enhance SBVS performance, how to identify the most suitable supervised learning algorithm for a data set, and how to build an SF with the highest likelihood of discovering target-active molecules within a given compound library.
Collapse
Affiliation(s)
| | - Muhammad Junaid
- Centre de Recherche en Cancérologie de Marseille, Marseille, France
| | - Saw Simeon
- Centre de Recherche en Cancérologie de Marseille, Marseille, France
| | | |
Collapse
|
7
|
Pratap Reddy Gajulapalli V. Development of Kinase-Centric Drugs: A Computational Perspective. ChemMedChem 2023; 18:e202200693. [PMID: 37442809 DOI: 10.1002/cmdc.202200693] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 07/12/2023] [Accepted: 07/12/2023] [Indexed: 07/15/2023]
Abstract
Kinases are prominent drug targets in the pharmaceutical and research community due to their involvement in signal transduction, physiological responses, and upon dysregulation, in diseases such as cancer, neurological and autoimmune disorders. Several FDA-approved small-molecule drugs have been developed to combat human diseases since Gleevec was approved for the treatment of chronic myelogenous leukemia. Kinases were considered "undruggable" in the beginning. Several FDA-approved small-molecule drugs have become available in recent years. Most of these drugs target ATP-binding sites, but a few target allosteric sites. Among kinases that belong to the same family, the catalytic domain shows high structural and sequence conservation. Inhibitors of ATP-binding sites can cause off-target binding. Because members of the same family have similar sequences and structural patterns, often complex relationships between kinases and inhibitors are observed. To design and develop drugs with desired selectivity, it is essential to understand the target selectivity for kinase inhibitors. To create new inhibitors with the desired selectivity, several experimental methods have been designed to profile the kinase selectivity of small molecules. Experimental approaches are often expensive, laborious, time-consuming, and limited by the available kinases. Researchers have used computational methodologies to address these limitations in the design and development of effective therapeutics. Many computational methods have been developed over the last few decades, either to complement experimental findings or to forecast kinase inhibitor activity and selectivity. The purpose of this review is to provide insight into recent advances in theoretical/computational approaches for the design of new kinase inhibitors with the desired selectivity and optimization of existing inhibitors.
Collapse
|
8
|
Yu Y, Wang Z, Wang L, Wang Q, Tang R, Xiang S, Deng Q, Hou T, Sun H. Deciphering the Shared and Specific Drug Resistance Mechanisms of Anaplastic Lymphoma Kinase via Binding Free Energy Computation. RESEARCH (WASHINGTON, D.C.) 2023; 6:0170. [PMID: 37342628 PMCID: PMC10278961 DOI: 10.34133/research.0170] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 05/25/2023] [Indexed: 06/23/2023]
Abstract
Anaplastic lymphoma kinase (ALK), a tyrosine receptor kinase, has been proven to be associated with the occurrence of numerous malignancies. Although there have been already at least 3 generations of ALK inhibitors approved by FDA or in clinical trials, the occurrence of various mutations seriously attenuates the effectiveness of the drugs. Unfortunately, most of the drug resistance mechanisms still remain obscure. Therefore, it is necessary to reveal the bottom reasons of the drug resistance mechanisms caused by the mutations. In this work, on the basis of verifying the accuracy of 2 main kinds of binding free energy calculation methodologies [end-point method of Molecular Mechanics with Poisson-Boltzmann/Generalized Born and Surface Area (MM/PB(GB)SA) and alchemical method of Thermodynamic Integration (TI)], we performed a systematic analysis on the ALK systems to explore the underlying shared and specific drug resistance mechanisms, covering the one-drug-multiple-mutation and multiple-drug-one-mutation cases. Through conventional molecular dynamics (cMD) simulation in conjunction with MM/PB(GB)SA and umbrella sampling (US) in conjunction with contact network analysis (CNA), the resistance mechanisms of the in-pocket, out-pocket, and multiple-site mutations were revealed. Especially for the out-pocket mutation, a possible transfer chain of the mutation effect was revealed, and the reason why different drugs exhibited various sensitivities to the same mutation was also uncovered. The proposed mechanisms may be prevalent in various drug resistance cases.
Collapse
Affiliation(s)
- Yang Yu
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Lingling Wang
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Qinghua Wang
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Rongfan Tang
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Sutong Xiang
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Qirui Deng
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine ofZhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Huiyong Sun
- Department of Medicinal Chemistry,
China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| |
Collapse
|
9
|
Tran-Nguyen VK, Ballester PJ. Beware of Simple Methods for Structure-Based Virtual Screening: The Critical Importance of Broader Comparisons. J Chem Inf Model 2023; 63:1401-1405. [PMID: 36848585 PMCID: PMC10015451 DOI: 10.1021/acs.jcim.3c00218] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
Abstract
We discuss how data unbiasing and simple methods such as protein-ligand Interaction FingerPrint (IFP) can overestimate virtual screening performance. We also show that IFP is strongly outperformed by target-specific machine-learning scoring functions, which were not considered in a recent report concluding that simple methods were better than machine-learning scoring functions at virtual screening.
Collapse
Affiliation(s)
| | - Pedro J Ballester
- Department of Bioengineering, Imperial College London, London SW7 2AZ, U.K
| |
Collapse
|
10
|
Wang L, Shi SH, Li H, Zeng XX, Liu SY, Liu ZQ, Deng YF, Lu AP, Hou TJ, Cao DS. Reducing false positive rate of docking-based virtual screening by active learning. Brief Bioinform 2023; 24:6987822. [PMID: 36642412 DOI: 10.1093/bib/bbac626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 12/10/2022] [Accepted: 12/20/2022] [Indexed: 01/17/2023] Open
Abstract
Machine learning-based scoring functions (MLSFs) have become a very favorable alternative to classical scoring functions because of their potential superior screening performance. However, the information of negative data used to construct MLSFs was rarely reported in the literature, and meanwhile the putative inactive molecules recorded in existing databases usually have obvious bias from active molecules. Here we proposed an easy-to-use method named AMLSF that combines active learning using negative molecular selection strategies with MLSF, which can iteratively improve the quality of inactive sets and thus reduce the false positive rate of virtual screening. We chose energy auxiliary terms learning as the MLSF and validated our method on eight targets in the diverse subset of DUD-E. For each target, we screened the IterBioScreen database by AMLSF and compared the screening results with those of the four control models. The results illustrate that the number of active molecules in the top 1000 molecules identified by AMLSF was significantly higher than those identified by the control models. In addition, the free energy calculation results for the top 10 molecules screened out by the AMLSF, null model and control models based on DUD-E also proved that more active molecules can be identified, and the false positive rate can be reduced by AMLSF.
Collapse
Affiliation(s)
- Lei Wang
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Shao-Hua Shi
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China
| | - Hui Li
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Xiang-Xiang Zeng
- Department of Computer Science, Hunan University, Changsha 410082, Hunan, China
| | - Su-You Liu
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Zhao-Qian Liu
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Ya-Feng Deng
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, Zhejiang 310018, China
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China
| | - Ting-Jun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China.,Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China
| |
Collapse
|
11
|
Liao Y, Cao P, Luo L. Identification of Novel Arachidonic Acid 15-Lipoxygenase Inhibitors Based on the Bayesian Classifier Model and Computer-Aided High-Throughput Virtual Screening. Pharmaceuticals (Basel) 2022; 15:1440. [PMID: 36422570 PMCID: PMC9695033 DOI: 10.3390/ph15111440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 11/11/2022] [Accepted: 11/16/2022] [Indexed: 08/29/2023] Open
Abstract
Ferroptosis is an iron-dependent lipid peroxidative form of cell death that is distinct from apoptosis and necrosis. ALOX15, also known as arachidonic acid 15-lipoxygenase, promotes ferroptosis by converting intracellular unsaturated lipids into oxidized lipid intermediates and is an important ferroptosis target. In this study, a naive Bayesian machine learning classifier with a structure-based, high-throughput screening approach and a molecular docking program were combined to screen for three compounds with excellent target-binding potential. In the absorption, distribution, metabolism, excretion, and toxicity characterization, three candidate molecules were predicted to exhibit drug-like properties. The subsequent molecular dynamics simulations confirmed their stable binding to the targets. The findings indicated that the compounds exhibited excellent potential ALOX15 inhibitor capacity, thereby providing novel candidates for the treatment of inflammatory ischemia-related diseases caused by ferroptosis.
Collapse
Affiliation(s)
- Yinglin Liao
- The First Clinical College, Guangdong Medical University, Zhanjiang 524023, China
| | - Peng Cao
- Department of Pharmacy, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Lianxiang Luo
- The Marine Biomedical Research Institute, Guangdong Medical University, Zhanjiang 524023, China
- The Marine Biomedical Research Institute of Guangdong Zhanjiang, Zhanjiang 524023, China
| |
Collapse
|
12
|
Wang Q, Wang Z, Tian S, Wang L, Tang R, Yu Y, Ge J, Hou T, Hao H, Sun H. Determination of Molecule Category of Ligands Targeting the Ligand-Binding Pocket of Nuclear Receptors with Structural Elucidation and Machine Learning. J Chem Inf Model 2022; 62:3993-4007. [PMID: 36040137 DOI: 10.1021/acs.jcim.2c00851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The mechanism of transcriptional activation/repression of the nuclear receptors (NRs) involves two main conformations of the NR protein, namely, the active (agonistic) and inactive (antagonistic) conformations. Binding of agonists or antagonists to the ligand-binding pocket (LBP) of NRs can regulate the downstream signaling pathways with different physiological effects. However, it is still hard to determine the molecular type of a LBP-bound ligand because both the agonists and antagonists bind to the same position of the protein. Therefore, it is necessary to develop precise and efficient methods to facilitate the discrimination of agonists and antagonists targeting the LBP of NRs. Here, combining structural and energetic analyses with machine-learning (ML) algorithms, we constructed a series of structure-based ML models to determine the molecular category of the LBP-bound ligands. We show that the proposed models work robustly and with high accuracy (ACC > 0.9) for determining the category of molecules derived from docking-based and crystallized poses. Furthermore, the models are also capable of determining the molecular category of ligands with dual opposite functions on different NRs (i.e., working as an agonist in one NR target, whereas functioning as an antagonist in another) with reasonable accuracy. The proposed method is expected to facilitate the determination of the molecular properties of ligands targeting the LBP of NRs with structural interpretation.
Collapse
Affiliation(s)
- Qinghua Wang
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Sheng Tian
- Department of Medicinal Chemistry, College of Pharmaceutical Sciences, Soochow University, Suzhou 215123, P. R. China
| | - Lingling Wang
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Rongfan Tang
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Yang Yu
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| | - Jingxuan Ge
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Haiping Hao
- State Key Laboratory of Natural Medicines, Key Lab of Drug Metabolism and Pharmacokinetics, China Pharmaceutical University, 210009 Nanjing, China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, P. R. China
| |
Collapse
|
13
|
Wang Z, Pan H, Sun H, Kang Y, Liu H, Cao D, Hou T. fastDRH: a webserver to predict and analyze protein-ligand complexes based on molecular docking and MM/PB(GB)SA computation. Brief Bioinform 2022; 23:6587180. [PMID: 35580866 DOI: 10.1093/bib/bbac201] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 04/25/2022] [Accepted: 04/28/2022] [Indexed: 01/12/2023] Open
Abstract
Predicting the native or near-native binding pose of a small molecule within a protein binding pocket is an extremely important task in structure-based drug design, especially in the hit-to-lead and lead optimization phases. In this study, fastDRH, a free and open accessed web server, was developed to predict and analyze protein-ligand complex structures. In fastDRH server, AutoDock Vina and AutoDock-GPU docking engines, structure-truncated MM/PB(GB)SA free energy calculation procedures and multiple poses based per-residue energy decomposition analysis were well integrated into a user-friendly and multifunctional online platform. Benefit from the modular architecture, users can flexibly use one or more of three features, including molecular docking, docking pose rescoring and hotspot residue prediction, to obtain the key information clearly based on a result analysis panel supported by 3Dmol.js and Apache ECharts. In terms of protein-ligand binding mode prediction, the integrated structure-truncated MM/PB(GB)SA rescoring procedures exhibit a success rate of >80% in benchmark, which is much better than the AutoDock Vina (~70%). For hotspot residue identification, our multiple poses based per-residue energy decomposition analysis strategy is a more reliable solution than the one using only a single pose, and the performance of our solution has been experimentally validated in several drug discovery projects. To summarize, the fastDRH server is a useful tool for predicting the ligand binding mode and the hotspot residue of protein for ligand binding. The fastDRH server is accessible free of charge at http://cadd.zju.edu.cn/fastdrh/.
Collapse
Affiliation(s)
- Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Hong Pan
- Day Surgery Center, Sir Run-Run Shaw Hospital, Zhejiang University School of Medicine, 310016, Hangzhou, China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, Jiangsu, China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Huanxiang Liu
- Faculty of Applied Science, Macao Polytechnic University, Macao, SAR, China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences and Cancer Center, Zhejiang University, Hangzhou, Zhejiang 310058, China
| |
Collapse
|
14
|
Shen Y, Liu C, Chi K, Gao Q, Bai X, Xu Y, Guo N. Development of a machine learning-based predictor for identifying and discovering antioxidant peptides based on a new strategy. Food Control 2022. [DOI: 10.1016/j.foodcont.2021.108439] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
15
|
Wang DD, Chan MT, Yan H. Structure-based protein-ligand interaction fingerprints for binding affinity prediction. Comput Struct Biotechnol J 2021; 19:6291-6300. [PMID: 34900139 PMCID: PMC8637032 DOI: 10.1016/j.csbj.2021.11.018] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/09/2021] [Accepted: 11/13/2021] [Indexed: 11/17/2022] Open
Abstract
Binding affinity prediction (BAP) using protein–ligand complex structures is crucial to computer-aided drug design, but remains a challenging problem. To achieve efficient and accurate BAP, machine-learning scoring functions (SFs) based on a wide range of descriptors have been developed. Among those descriptors, protein–ligand interaction fingerprints (IFPs) are competitive due to their simple representations, elaborate profiles of key interactions and easy collaborations with machine-learning algorithms. In this paper, we have adopted a building-block-based taxonomy to review a broad range of IFP models, and compared representative IFP-based SFs in target-specific and generic scoring tasks. Atom-pair-counts-based and substructure-based IFPs show great potential in these tasks.
Collapse
Affiliation(s)
- Debby D Wang
- School of Health Science and Engineering, University of Shanghai for Science and Technology, 516 Jungong Rd, Shanghai 200093, China
| | - Moon-Tong Chan
- School of Science and Technology, Hong Kong Metropolitan University, 30 Good Shepherd St, Ho Man Tin, Hong Kong
| | - Hong Yan
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
| |
Collapse
|
16
|
Vijayan RSK, Kihlberg J, Cross JB, Poongavanam V. Enhancing preclinical drug discovery with artificial intelligence. Drug Discov Today 2021; 27:967-984. [PMID: 34838731 DOI: 10.1016/j.drudis.2021.11.023] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 10/15/2021] [Accepted: 11/19/2021] [Indexed: 12/14/2022]
Abstract
Artificial intelligence (AI) is becoming an integral part of drug discovery. It has the potential to deliver across the drug discovery and development value chain, starting from target identification and reaching through clinical development. In this review, we provide an overview of current AI technologies and a glimpse of how AI is reimagining preclinical drug discovery by highlighting examples where AI has made a real impact. Considering the excitement and hyperbole surrounding AI in drug discovery, we aim to present a realistic view by discussing both opportunities and challenges in adopting AI in drug discovery.
Collapse
Affiliation(s)
- R S K Vijayan
- Institute for Applied Cancer Science, MD Anderson Cancer Center, Houston, TX, USA
| | - Jan Kihlberg
- Department of Chemistry-BMC, Uppsala University, Uppsala, Sweden
| | - Jason B Cross
- Institute for Applied Cancer Science, MD Anderson Cancer Center, Houston, TX, USA.
| | | |
Collapse
|
17
|
Shen C, Hu X, Gao J, Zhang X, Zhong H, Wang Z, Xu L, Kang Y, Cao D, Hou T. The impact of cross-docked poses on performance of machine learning classifier for protein-ligand binding pose prediction. J Cheminform 2021; 13:81. [PMID: 34656169 PMCID: PMC8520186 DOI: 10.1186/s13321-021-00560-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Accepted: 10/05/2021] [Indexed: 02/06/2023] Open
Abstract
Structure-based drug design depends on the detailed knowledge of the three-dimensional (3D) structures of protein-ligand binding complexes, but accurate prediction of ligand-binding poses is still a major challenge for molecular docking due to deficiency of scoring functions (SFs) and ignorance of protein flexibility upon ligand binding. In this study, based on a cross-docking dataset dedicatedly constructed from the PDBbind database, we developed several XGBoost-trained classifiers to discriminate the near-native binding poses from decoys, and systematically assessed their performance with/without the involvement of the cross-docked poses in the training/test sets. The calculation results illustrate that using Extended Connectivity Interaction Features (ECIF), Vina energy terms and docking pose ranks as the features can achieve the best performance, according to the validation through the random splitting or refined-core splitting and the testing on the re-docked or cross-docked poses. Besides, it is found that, despite the significant decrease of the performance for the threefold clustered cross-validation, the inclusion of the Vina energy terms can effectively ensure the lower limit of the performance of the models and thus improve their generalization capability. Furthermore, our calculation results also highlight the importance of the incorporation of the cross-docked poses into the training of the SFs with wide application domain and high robustness for binding pose prediction. The source code and the newly-developed cross-docking datasets can be freely available at https://github.com/sc8668/ml_pose_prediction and https://zenodo.org/record/5525936 , respectively, under an open-source license. We believe that our study may provide valuable guidance for the development and assessment of new machine learning-based SFs (MLSFs) for the predictions of protein-ligand binding poses.
Collapse
Affiliation(s)
- Chao Shen
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China.,State Key Lab of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Xueping Hu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Junbo Gao
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Xujun Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Haiyang Zhong
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Zhe Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, 213001, China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China.
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, Hunan, 410013, People's Republic of China.
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China. .,State Key Lab of CAD&CG, Zhejiang University, Hangzhou, Zhejiang, 310058, People's Republic of China.
| |
Collapse
|
18
|
Discovery of potential inhibitors targeting the kinase domain of polynucleotide kinase/phosphatase (PNKP): Homology modeling, virtual screening based on multiple conformations, and molecular dynamics simulation. Comput Biol Chem 2021; 94:107517. [PMID: 34456161 DOI: 10.1016/j.compbiolchem.2021.107517] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 05/11/2021] [Accepted: 05/16/2021] [Indexed: 12/15/2022]
Abstract
In recent years, the level of interest has been increased in developing the DNA-repair inhibitors, to enhance the cytotoxic effects in the treatment of cancers. Polynucleotide kinase/phosphatase (PNKP) is a critical human DNA repair enzyme that repairs DNA strand breaks by catalyzing the restoration of 5'-phosphate and 3'-hydroxyl termini that are required for subsequent processing by DNA ligases and polymerases. PNKP is the only protein that repairs the 3'-hydroxyl group and 5'-phosphate group, which depicts PNKP as a potential therapeutic target. Besides, PNKP is the only DNA-repair enzyme that contains the 5'-kinase activity, therefore, targeting this kinase domain would motivate the development of novel PNKP-specific inhibitors. However, there are neither crystal structures of human PNKP nor the kinase inhibitors reported so far. Thus, in this present study, a sequential molecular docking-based virtual screening with multiple PNKP conformations integrating homology modeling, molecular dynamics simulation, and binding free energy calculation was developed to discover novel PNKP kinase inhibitors, and the top-scored molecule was finally submitted to molecular dynamics simulation to reveal the binding mechanism between the inhibitor and PNKP. Taken together, the current study could provide some guidance for the molecular docking based-virtual screening of novel PNKP kinase inhibitors.
Collapse
|
19
|
Xiong G, Shen C, Yang Z, Jiang D, Liu S, Lu A, Chen X, Hou T, Cao D. Featurization strategies for protein–ligand interactions and their applications in scoring function development. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1567] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Guoli Xiong
- Xiangya School of Pharmaceutical Sciences Central South University Changsha China
| | - Chao Shen
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences Zhejiang University Hangzhou China
| | - Ziyi Yang
- Xiangya School of Pharmaceutical Sciences Central South University Changsha China
| | - Dejun Jiang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences Zhejiang University Hangzhou China
- College of Computer Science and Technology Zhejiang University Hangzhou China
| | - Shao Liu
- Department of Pharmacy Xiangya Hospital, Central South University Changsha China
| | - Aiping Lu
- Institute for Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine Hong Kong Baptist University Hong Kong SAR China
| | - Xiang Chen
- Department of Dermatology, Hunan Engineering Research Center of Skin Health and Disease, Hunan Key Laboratory of Skin Cancer and Psoriasis Xiangya Hospital, Central South University Changsha China
| | - Tingjun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences Zhejiang University Hangzhou China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences Central South University Changsha China
- Institute for Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine Hong Kong Baptist University Hong Kong SAR China
| |
Collapse
|
20
|
Xie L, Xu L, Chang S, Xu X, Meng L. Multitask deep networks with grid featurization achieve improved scoring performance for protein-ligand binding. Chem Biol Drug Des 2021; 96:973-983. [PMID: 33058459 DOI: 10.1111/cbdd.13648] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 10/11/2019] [Accepted: 10/27/2019] [Indexed: 01/10/2023]
Abstract
Deep learning-based methods have been extensively developed to improve scoring performance in structure-based drug discovery. Extending multitask deep networks in addressing pharmaceutical problems shows remarkable improvements over single task network. Recently, grid featurization has been introduced to convert protein-ligand complex co-ordinates into fingerprints with the advantage of incorporating inter- and intra-molecular information. The combination of grid featurization with multitask deep networks would hold great potential to boost the scoring performance. We examined the performance of three novel multitask deep networks (standard multitask, bypass, and progressive network) in reproducing the binding affinities of protein-ligand complexes in comparison with AutoDock Vina docking and MM/GBSA method. Among five evaluated methods, progressive network combined with grid featurization provided the best Pearson correlation coefficient (0.74) and least mean absolute average error (0.98) for the overall scoring performance. Moreover, all networks increased screening ability for the re-docking pose and progressive network even achieved AUC of 0.87 over 0.52 of AutoDock Vina. Our results demonstrated that progressive network combined with grid featurization would be one powerful rescoring approach to strengthen screening results after obtaining protein-ligand complex in the conventional docking software.
Collapse
Affiliation(s)
- Liangxu Xie
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Shan Chang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Li Meng
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| |
Collapse
|
21
|
Bai Q, Tan S, Xu T, Liu H, Huang J, Yao X. MolAICal: a soft tool for 3D drug design of protein targets by artificial intelligence and classical algorithm. Brief Bioinform 2021; 22:5890512. [PMID: 32778891 PMCID: PMC7454275 DOI: 10.1093/bib/bbaa161] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Revised: 06/23/2020] [Accepted: 06/26/2020] [Indexed: 12/27/2022] Open
Abstract
Deep learning is an important branch of artificial intelligence that has been successfully applied into medicine and two-dimensional ligand design. The three-dimensional (3D) ligand generation in the 3D pocket of protein target is an interesting and challenging issue for drug design by deep learning. Here, the MolAICal software is introduced to supply a way for generating 3D drugs in the 3D pocket of protein targets by combining with merits of deep learning model and classical algorithm. The MolAICal software mainly contains two modules for 3D drug design. In the first module of MolAICal, it employs the genetic algorithm, deep learning model trained by FDA-approved drug fragments and Vinardo score fitting on the basis of PDBbind database for drug design. In the second module, it uses deep learning generative model trained by drug-like molecules of ZINC database and molecular docking invoked by Autodock Vina automatically. Besides, the Lipinski's rule of five, Pan-assay interference compounds (PAINS), synthetic accessibility (SA) and other user-defined rules are introduced for filtering out unwanted ligands in MolAICal. To show the drug design modules of MolAICal, the membrane protein glucagon receptor and non-membrane protein SARS-CoV-2 main protease are chosen as the investigative drug targets. The results show MolAICal can generate the various and novel ligands with good binding scores and appropriate XLOGP values. We believe that MolAICal can use the advantages of deep learning model and classical programming for designing 3D drugs in protein pocket. MolAICal is freely for any nonprofit purpose and accessible at https://molaical.github.io.
Collapse
|
22
|
Ghislat G, Rahman T, Ballester PJ. Recent progress on the prospective application of machine learning to structure-based virtual screening. Curr Opin Chem Biol 2021; 65:28-34. [PMID: 34052776 DOI: 10.1016/j.cbpa.2021.04.009] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 04/13/2021] [Accepted: 04/23/2021] [Indexed: 12/30/2022]
Abstract
As more bioactivity and protein structure data become available, scoring functions (SFs) using machine learning (ML) to leverage these data sets continue to gain further accuracy and broader applicability. Advances in our understanding of the optimal ways to train and evaluate these ML-based SFs have introduced further improvements. One of these advances is how to select the most suitable decoys (molecules assumed inactive) to train or test an ML-based SF on a given target. We also review the latest applications of ML-based SFs for prospective structure-based virtual screening (SBVS), with a focus on the observed improvement over those using classical SFs. Finally, we provide recommendations for future prospective SBVS studies based on the findings of recent methodological studies.
Collapse
Affiliation(s)
- Ghita Ghislat
- U1104, CNRS UMR7280, Centre D'Immunologie de Marseille-Luminy, Inserm, Marseille, France
| | - Taufiq Rahman
- Department of Pharmacology, University of Cambridge, Cambridge, CB2 1PD, UK
| | - Pedro J Ballester
- Centre de Recherche en Cancérologie de Marseille (CRCM), Inserm, U1068, Marseille, F-13009, France; CNRS, UMR7258, Marseille, F-13009, France; Institut Paoli-Calmettes, Marseille, F-13009, France; Aix-Marseille University, UM 105, F-13284, Marseille, France.
| |
Collapse
|
23
|
Vatansever S, Schlessinger A, Wacker D, Kaniskan HÜ, Jin J, Zhou M, Zhang B. Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions. Med Res Rev 2021; 41:1427-1473. [PMID: 33295676 PMCID: PMC8043990 DOI: 10.1002/med.21764] [Citation(s) in RCA: 131] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 10/30/2020] [Accepted: 11/20/2020] [Indexed: 01/11/2023]
Abstract
Neurological disorders significantly outnumber diseases in other therapeutic areas. However, developing drugs for central nervous system (CNS) disorders remains the most challenging area in drug discovery, accompanied with the long timelines and high attrition rates. With the rapid growth of biomedical data enabled by advanced experimental technologies, artificial intelligence (AI) and machine learning (ML) have emerged as an indispensable tool to draw meaningful insights and improve decision making in drug discovery. Thanks to the advancements in AI and ML algorithms, now the AI/ML-driven solutions have an unprecedented potential to accelerate the process of CNS drug discovery with better success rate. In this review, we comprehensively summarize AI/ML-powered pharmaceutical discovery efforts and their implementations in the CNS area. After introducing the AI/ML models as well as the conceptualization and data preparation, we outline the applications of AI/ML technologies to several key procedures in drug discovery, including target identification, compound screening, hit/lead generation and optimization, drug response and synergy prediction, de novo drug design, and drug repurposing. We review the current state-of-the-art of AI/ML-guided CNS drug discovery, focusing on blood-brain barrier permeability prediction and implementation into therapeutic discovery for neurological diseases. Finally, we discuss the major challenges and limitations of current approaches and possible future directions that may provide resolutions to these difficulties.
Collapse
Affiliation(s)
- Sezen Vatansever
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Avner Schlessinger
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Daniel Wacker
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of NeuroscienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - H. Ümit Kaniskan
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Jian Jin
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Ming‐Ming Zhou
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Bin Zhang
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| |
Collapse
|
24
|
Ji B, He X, Zhai J, Zhang Y, Man VH, Wang J. Machine learning on ligand-residue interaction profiles to significantly improve binding affinity prediction. Brief Bioinform 2021; 22:6184410. [PMID: 33758923 DOI: 10.1093/bib/bbab054] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 01/06/2021] [Accepted: 02/02/2021] [Indexed: 01/01/2023] Open
Abstract
Structure-based virtual screenings (SBVSs) play an important role in drug discovery projects. However, it is still a challenge to accurately predict the binding affinity of an arbitrary molecule binds to a drug target and prioritize top ligands from an SBVS. In this study, we developed a novel method, using ligand-residue interaction profiles (IPs) to construct machine learning (ML)-based prediction models, to significantly improve the screening performance in SBVSs. Such a kind of the prediction model is called an IP scoring function (IP-SF). We systematically investigated how to improve the performance of IP-SFs from many perspectives, including the sampling methods before interaction energy calculation and different ML algorithms. Using six drug targets with each having hundreds of known ligands, we conducted a critical evaluation on the developed IP-SFs. The IP-SFs employing a gradient boosting decision tree (GBDT) algorithm in conjunction with the MIN + GB simulation protocol achieved the best overall performance. Its scoring power, ranking power and screening power significantly outperformed the Glide SF. First, compared with Glide, the average values of mean absolute error and root mean square error of GBDT/MIN + GB decreased about 38 and 36%, respectively. Second, the mean values of squared correlation coefficient and predictive index increased about 225 and 73%, respectively. Third, more encouragingly, the average value of the areas under the curve of receiver operating characteristic for six targets by GBDT, 0.87, is significantly better than that by Glide, which is only 0.71. Thus, we expected IP-SFs to have broad and promising applications in SBVSs.
Collapse
Affiliation(s)
- Beihong Ji
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Xibing He
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Jingchen Zhai
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yuzhao Zhang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Viet Hoang Man
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Junmei Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, PA 15261, USA
| |
Collapse
|
25
|
Stein RM, Yang Y, Balius TE, O'Meara MJ, Lyu J, Young J, Tang K, Shoichet BK, Irwin JJ. Property-Unmatched Decoys in Docking Benchmarks. J Chem Inf Model 2021; 61:699-714. [PMID: 33494610 PMCID: PMC7913603 DOI: 10.1021/acs.jcim.0c00598] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Enrichment of ligands versus property-matched decoys is widely used to test and optimize docking library screens. However, the unconstrained optimization of enrichment alone can mislead, leading to false confidence in prospective performance. This can arise by over-optimizing for enrichment against property-matched decoys, without considering the full spectrum of molecules to be found in a true large library screen. Adding decoys representing charge extrema helps mitigate over-optimizing for electrostatic interactions. Adding decoys that represent the overall characteristics of the library to be docked allows one to sample molecules not represented by ligands and property-matched decoys but that one will encounter in a prospective screen. An optimized version of the DUD-E set (DUDE-Z), as well as Extrema and sets representing broad features of the library (Goldilocks), is developed here. We also explore the variability that one can encounter in enrichment calculations and how that can temper one's confidence in small enrichment differences. The new tools and new decoy sets are freely available at http://tldr.docking.org and http://dudez.docking.org.
Collapse
Affiliation(s)
- Reed M Stein
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Ying Yang
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Trent E Balius
- Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc., P.O. Box B, Frederick, Maryland 21702, United States
| | - Matt J O'Meara
- Department of Computational Medicine and Bioinformatics, University of Michigan, Palmer Commons, 100 Washtenaw Ave. #2017, Ann Arbor, Michigan 48109, United States
| | - Jiankun Lyu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Jennifer Young
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Khanh Tang
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| | - John J Irwin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California 94158, United States
| |
Collapse
|
26
|
Imrie F, Bradley AR, Deane CM. Generating Property-Matched Decoy Molecules Using Deep Learning. Bioinformatics 2021; 37:2134-2141. [PMID: 33532838 PMCID: PMC8352508 DOI: 10.1093/bioinformatics/btab080] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 01/05/2021] [Accepted: 01/28/2021] [Indexed: 01/30/2023] Open
Abstract
MOTIVATION An essential step in the development of virtual screening methods is the use of established sets of actives and decoys for benchmarking and training. However, the decoy molecules in commonly used sets are biased meaning that methods often exploit these biases to separate actives and decoys, and do not necessarily learn to perform molecular recognition. This fundamental issue prevents generalisation and hinders virtual screening method development. RESULTS We have developed a deep learning method (DeepCoy) that generates decoys to a user's preferred specification in order to remove such biases or construct sets with a defined bias.We validated DeepCoy using two established benchmarks, DUD-E and DEKOIS 2.0. For all 102 DUD-E targets and 80 of the 81 DEKOIS 2.0 targets, our generated decoy molecules more closely matched the active molecules' physicochemical properties while introducing no discernible additional risk of false negatives. The DeepCoy decoys improved the Deviation from Optimal Embedding (DOE) score by an average of 81% and 66%, respectively, decreasing from 0.166 to 0.032 for DUD-E and from 0.109 to 0.038 for DEKOIS 2.0. Further, the generated decoys are harder to distinguish than the original decoy molecules via docking with Autodock Vina, with virtual screening performance falling from an AUC ROC of 0.70 to 0.63. AVAILABILITY AND IMPLEMENTATION The code is available at https://github.com/oxpig/DeepCoy. Generated molecules can be downloaded from http://opig.stats.ox.ac.uk/resources. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Fergus Imrie
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | | | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| |
Collapse
|
27
|
Wang X, Li W. Comparative Study of Interactions between Human cGAS and Inhibitors: Insights from Molecular Dynamics and MM/PBSA Studies. Int J Mol Sci 2021; 22:ijms22031164. [PMID: 33503880 PMCID: PMC7865699 DOI: 10.3390/ijms22031164] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 01/20/2021] [Accepted: 01/21/2021] [Indexed: 11/20/2022] Open
Abstract
Recent studies have identified cyclic GMP-AMP synthase (cGAS) as an important target for treating autoimmune diseases, and several inhibitors of human cGAS (hcGAS) and their structures in complexation with hcGAS have been reported. However, the mechanisms via which these inhibitors interact with hcGAS are not completely understood. Here, we aimed to assess the performance of molecular mechanics/Poisson–Boltzmann solvent-accessible surface area (MM/PBSA) in evaluating the binding affinity of various hcGAS inhibitors and to elucidate their detailed interactions with hcGAS from an energetic viewpoint. Using molecular dynamics (MD) simulation and MM/PBSA approaches, the estimated free energies were in good agreement with the experimental ones, with a Pearson’s correlation coefficient and Spearman’s rank coefficient of 0.67 and 0.46, respectively. In per-residue energy decomposition analysis, four residues, K362, R376, Y436, and K439 in hcGAS were found to contribute significantly to the binding with inhibitors via hydrogen bonding, salt bridges, and various π interactions, such as π· · ·π stacking, cation· · ·π, hydroxyl· · ·π, and alkyl· · ·π interactions. In addition, we discussed other key interactions between specific residues and ligands, in particular, between H363 and JUJ, F379 and 9BY, and H437 and 8ZM. The sandwiched structures of the inhibitor bound to the guanidinium group of R376 and the phenyl ring of Y436 were also consistent with the experimental data. The results indicated that MM/PBSA in combination with other virtual screening methods, could be a reliable approach to discover new hcGAS inhibitors and thus is valuable for potential treatments of cGAS-dependent inflammatory diseases.
Collapse
Affiliation(s)
- Xiaowen Wang
- Institute for Advanced Study, Shenzhen University, Shenzhen 518060, China;
- College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen 518060, China
| | - Wenjin Li
- Institute for Advanced Study, Shenzhen University, Shenzhen 518060, China;
- Correspondence: ; Tel.: +86-755-2694-2336
| |
Collapse
|
28
|
Shen C, Weng G, Zhang X, Leung ELH, Yao X, Pang J, Chai X, Li D, Wang E, Cao D, Hou T. Accuracy or novelty: what can we gain from target-specific machine-learning-based scoring functions in virtual screening? Brief Bioinform 2021; 22:6070382. [PMID: 33418562 DOI: 10.1093/bib/bbaa410] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 11/26/2020] [Accepted: 12/12/2020] [Indexed: 12/13/2022] Open
Abstract
Machine-learning (ML)-based scoring functions (MLSFs) have gradually emerged as a promising alternative for protein-ligand binding affinity prediction and structure-based virtual screening. However, clouds of doubts have still been raised against the benefits of this novel type of scoring functions (SFs). In this study, to benchmark the performance of target-specific MLSFs on a relatively unbiased dataset, the MLSFs trained from three representative protein-ligand interaction representations were assessed on the LIT-PCBA dataset, and the classical Glide SP SF and three types of ligand-based quantitative structure-activity relationship (QSAR) models were also utilized for comparison. Two major aspects in virtual screening campaigns, including prediction accuracy and hit novelty, were systematically explored. The calculation results illustrate that the tested target-specific MLSFs yielded generally superior performance over the classical Glide SP SF, but they could hardly outperform the 2D fingerprint-based QSAR models. Although substantial improvements could be achieved by integrating multiple types of protein-ligand interaction features, the MLSFs were still not sufficient to exceed MACCS-based QSAR models. In terms of the correlations between the hit ranks or the structures of the top-ranked hits, the MLSFs developed by different featurization strategies would have the ability to identify quite different hits. Nevertheless, it seems that target-specific MLSFs do not have the intrinsic attributes of a traditional SF and may not be a substitute for classical SFs. In contrast, MLSFs can be regarded as a new derivative of ligand-based QSAR models. It is expected that our study may provide valuable guidance for the assessment and further development of target-specific MLSFs.
Collapse
Affiliation(s)
- Chao Shen
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Gaoqi Weng
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Xujun Zhang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Elaine Lai-Han Leung
- State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, SAR, China
| | - Xiaojun Yao
- State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, SAR, China
| | - Jinping Pang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Xin Chai
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Dan Li
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Ercheng Wang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410013, Hunan, P. R. China
| | - Tingjun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China
| |
Collapse
|
29
|
Vázquez J, López M, Gibert E, Herrero E, Luque FJ. Merging Ligand-Based and Structure-Based Methods in Drug Discovery: An Overview of Combined Virtual Screening Approaches. Molecules 2020; 25:E4723. [PMID: 33076254 PMCID: PMC7587536 DOI: 10.3390/molecules25204723] [Citation(s) in RCA: 97] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 10/06/2020] [Accepted: 10/11/2020] [Indexed: 12/20/2022] Open
Abstract
Virtual screening (VS) is an outstanding cornerstone in the drug discovery pipeline. A variety of computational approaches, which are generally classified as ligand-based (LB) and structure-based (SB) techniques, exploit key structural and physicochemical properties of ligands and targets to enable the screening of virtual libraries in the search of active compounds. Though LB and SB methods have found widespread application in the discovery of novel drug-like candidates, their complementary natures have stimulated continued efforts toward the development of hybrid strategies that combine LB and SB techniques, integrating them in a holistic computational framework that exploits the available information of both ligand and target to enhance the success of drug discovery projects. In this review, we analyze the main strategies and concepts that have emerged in the last years for defining hybrid LB + SB computational schemes in VS studies. Particularly, attention is focused on the combination of molecular similarity and docking, illustrating them with selected applications taken from the literature.
Collapse
Affiliation(s)
- Javier Vázquez
- Pharmacelera, Plaça Pau Vila, 1, Sector C 2a, Edificio Palau de Mar, 08039 Barcelona, Spain;
- Department of Nutrition, Food Science and Gastronomy, Faculty of Pharmacy and Food Sciences, Institute of Biomedicine (IBUB), and Institute of Theoretical and Computational Chemistry (IQTC-UB), University of Barcelona, Av. Prat de la Riba 171, E-08921 Santa Coloma de Gramanet, Spain
| | - Manel López
- AB Science, Parc Scientifique de Luminy, Zone Luminy Enterprise, Case 922, 163 Av. de Luminy, 13288 Marseille, France;
| | - Enric Gibert
- Pharmacelera, Plaça Pau Vila, 1, Sector C 2a, Edificio Palau de Mar, 08039 Barcelona, Spain;
| | - Enric Herrero
- Pharmacelera, Plaça Pau Vila, 1, Sector C 2a, Edificio Palau de Mar, 08039 Barcelona, Spain;
| | - F. Javier Luque
- Department of Nutrition, Food Science and Gastronomy, Faculty of Pharmacy and Food Sciences, Institute of Biomedicine (IBUB), and Institute of Theoretical and Computational Chemistry (IQTC-UB), University of Barcelona, Av. Prat de la Riba 171, E-08921 Santa Coloma de Gramanet, Spain
| |
Collapse
|
30
|
Selecting machine-learning scoring functions for structure-based virtual screening. DRUG DISCOVERY TODAY. TECHNOLOGIES 2020; 32-33:81-87. [PMID: 33386098 DOI: 10.1016/j.ddtec.2020.09.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 09/02/2020] [Accepted: 09/07/2020] [Indexed: 12/27/2022]
Abstract
Interest in docking technologies has grown parallel to the ever increasing number and diversity of 3D models for macromolecular therapeutic targets. Structure-Based Virtual Screening (SBVS) aims at leveraging these experimental structures to discover the necessary starting points for the drug discovery process. It is now established that Machine Learning (ML) can strongly enhance the predictive accuracy of scoring functions for SBVS by exploiting large datasets from targets, molecules and their associations. However, with greater choice, the question of which ML-based scoring function is the most suitable for prospective use on a given target has gained importance. Here we analyse two approaches to select an existing scoring function for the target along with a third approach consisting in generating a scoring function tailored to the target. These analyses required discussing the limitations of popular SBVS benchmarks, the alternatives to benchmark scoring functions for SBVS and how to generate them or use them using freely-available software.
Collapse
|
31
|
Fresnais L, Ballester PJ. The impact of compound library size on the performance of scoring functions for structure-based virtual screening. Brief Bioinform 2020; 22:5855396. [PMID: 32568385 DOI: 10.1093/bib/bbaa095] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Revised: 04/20/2020] [Accepted: 04/28/2020] [Indexed: 12/20/2022] Open
Abstract
Larger training datasets have been shown to improve the accuracy of machine learning (ML)-based scoring functions (SFs) for structure-based virtual screening (SBVS). In addition, massive test sets for SBVS, known as ultra-large compound libraries, have been demonstrated to enable the fast discovery of selective drug leads with low-nanomolar potency. This proof-of-concept was carried out on two targets using a single docking tool along with its SF. It is thus unclear whether this high level of performance would generalise to other targets, docking tools and SFs. We found that screening a larger compound library results in more potent actives being identified in all six additional targets using a different docking tool along with its classical SF. Furthermore, we established that a way to improve the potency of the retrieved molecules further is to rank them with more accurate ML-based SFs (we found this to be true in four of the six targets; the difference was not significant in the remaining two targets). A 3-fold increase in average hit rate across targets was also achieved by the ML-based SFs. Lastly, we observed that classical and ML-based SFs often find different actives, which supports using both types of SFs on those targets.
Collapse
|
32
|
Xiong GL, Ye WL, Shen C, Lu AP, Hou TJ, Cao DS. Improving structure-based virtual screening performance via learning from scoring function components. Brief Bioinform 2020; 22:5851268. [PMID: 32496540 DOI: 10.1093/bib/bbaa094] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 03/30/2020] [Accepted: 04/28/2020] [Indexed: 11/12/2022] Open
Abstract
Scoring functions (SFs) based on complex machine learning (ML) algorithms have gradually emerged as a promising alternative to overcome the weaknesses of classical SFs. However, extensive efforts have been devoted to the development of SFs based on new protein-ligand interaction representations and advanced alternative ML algorithms instead of the energy components obtained by the decomposition of existing SFs. Here, we propose a new method named energy auxiliary terms learning (EATL), in which the scoring components are extracted and used as the input for the development of three levels of ML SFs including EATL SFs, docking-EATL SFs and comprehensive SFs with ascending VS performance. The EATL approach not only outperforms classical SFs for the absolute performance (ROC) and initial enrichment (BEDROC) but also yields comparable performance compared with other advanced ML-based methods on the diverse subset of Directory of Useful Decoys: Enhanced (DUD-E). The test on the relatively unbiased actives as decoys (AD) dataset also proved the effectiveness of EATL. Furthermore, the idea of learning from SF components to yield improved screening power can also be extended to other docking programs and SFs available.
Collapse
|
33
|
Shen C, Hu Y, Wang Z, Zhang X, Pang J, Wang G, Zhong H, Xu L, Cao D, Hou T. Beware of the generic machine learning-based scoring functions in structure-based virtual screening. Brief Bioinform 2020; 22:5850047. [PMID: 32484221 DOI: 10.1093/bib/bbaa070] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 04/17/2020] [Accepted: 03/30/2020] [Indexed: 12/14/2022] Open
Abstract
Machine learning-based scoring functions (MLSFs) have attracted extensive attention recently and are expected to be potential rescoring tools for structure-based virtual screening (SBVS). However, a major concern nowadays is whether MLSFs trained for generic uses rather than a given target can consistently be applicable for VS. In this study, a systematic assessment was carried out to re-evaluate the effectiveness of 14 reported MLSFs in VS. Overall, most of these MLSFs could hardly achieve satisfactory results for any dataset, and they could even not outperform the baseline of classical SFs such as Glide SP. An exception was observed for RFscore-VS trained on the Directory of Useful Decoys-Enhanced dataset, which showed its superiority for most targets. However, in most cases, it clearly illustrated rather limited performance on the targets that were dissimilar to the proteins in the corresponding training sets. We also used the top three docking poses rather than the top one for rescoring and retrained the models with the updated versions of the training set, but only minor improvements were observed. Taken together, generic MLSFs may have poor generalization capabilities to be applicable for the real VS campaigns. Therefore, it should be quite cautious to use this type of methods for VS.
Collapse
Affiliation(s)
| | - Ye Hu
- Central South University, China
| | | | | | | | | | | | - Lei Xu
- Central South University, China
| | | | | |
Collapse
|
34
|
Li H, Sze K, Lu G, Ballester PJ. Machine‐learning scoring functions for structure‐based virtual screening. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1478] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Hongjian Li
- Cancer Research Center of Marseille (INSERM U1068, Institut Paoli‐Calmettes, Aix‐Marseille Université UM105, CNRS UMR7258) Marseille France
- CUHK‐SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences Chinese University of Hong Kong Shatin Hong Kong
| | - Kam‐Heung Sze
- CUHK‐SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences Chinese University of Hong Kong Shatin Hong Kong
| | - Gang Lu
- CUHK‐SDU Joint Laboratory on Reproductive Genetics, School of Biomedical Sciences Chinese University of Hong Kong Shatin Hong Kong
| | - Pedro J. Ballester
- Cancer Research Center of Marseille (INSERM U1068, Institut Paoli‐Calmettes, Aix‐Marseille Université UM105, CNRS UMR7258) Marseille France
| |
Collapse
|
35
|
Su M, Feng G, Liu Z, Li Y, Wang R. Tapping on the Black Box: How Is the Scoring Power of a Machine-Learning Scoring Function Dependent on the Training Set? J Chem Inf Model 2020; 60:1122-1136. [DOI: 10.1021/acs.jcim.9b00714] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Minyi Su
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Guoqin Feng
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- University of Chinese Academy of Sciences, Beijing 100049, People’s Republic of China
| | - Zhihai Liu
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
| | - Yan Li
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- Department of Medicinal Chemistry, School of Pharmacy, Fudan University, 826 Zhangheng Road, Shanghai 201203, People’s Republic of China
| | - Renxiao Wang
- State Key Laboratory of Bioorganic and Natural Products Chemistry, Center for Excellence in Molecular Synthesis, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 Lingling Road, Shanghai 200032, People’s Republic of China
- Department of Medicinal Chemistry, School of Pharmacy, Fudan University, 826 Zhangheng Road, Shanghai 201203, People’s Republic of China
- Shanxi Key Laboratory of Innovative Drugs for the Treatment of Serious Diseases Basing on Chronic Inflammation, College of Traditional Chinese Medicines, Shanxi University of Chinese Medicine, Taiyuan, Shanxi 030619, People’s Republic of China
| |
Collapse
|
36
|
Zhang X, Fu T, He Q, Gao X, Luo Y. Glucose-6-Phosphate Upregulates Txnip Expression by Interacting With MondoA. Front Mol Biosci 2020; 6:147. [PMID: 31993438 PMCID: PMC6962712 DOI: 10.3389/fmolb.2019.00147] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 12/03/2019] [Indexed: 11/13/2022] Open
Abstract
The major metabolic fates of glucose in cells are glycolysis and the pentose phosphate pathway, and they share the first step: converting glucose to glucose-6-phosphate (G6P). Here, we show that G6P can be sensed by the transcription factor MondoA/Mlx to modulate Txnip expression. Endogenous knockdown and EMSA (gel migration assay) analyses both confirmed that G6P is the metabolic intermediate that activates the heterocomplex MondoA/Mlx to elicit the expression of Txnip. Additionally, the three-dimensional structure of MondoA is modeled, and the binding mode of G6P to MondoA is also predicted by in silico molecular docking and binding free energy calculation. Finally, free energy decomposition and mutational analyses suggest that certain residues in MondoA, GKL139-141 in particular, mediate its binding with G6P to activate MondoA, which signals the upregulation of the expression of Txnip.
Collapse
Affiliation(s)
- Xueyun Zhang
- Department of Biochemistry, School of Medicine, Cancer Institute of the Second Affiliated Hospital, Zhejiang University, Hangzhou, China.,Key Laboratory of Cancer Prevention and Intervention of China National Ministry of Education, Hangzhou, China
| | - Tao Fu
- Department of Biochemistry, School of Medicine, Cancer Institute of the Second Affiliated Hospital, Zhejiang University, Hangzhou, China.,Key Laboratory of Cancer Prevention and Intervention of China National Ministry of Education, Hangzhou, China
| | - Qian He
- Department of Biochemistry, School of Medicine, Cancer Institute of the Second Affiliated Hospital, Zhejiang University, Hangzhou, China.,Key Laboratory of Cancer Prevention and Intervention of China National Ministry of Education, Hangzhou, China
| | - Xiang Gao
- Department of Biochemistry, School of Medicine, Cancer Institute of the Second Affiliated Hospital, Zhejiang University, Hangzhou, China.,Key Laboratory of Cancer Prevention and Intervention of China National Ministry of Education, Hangzhou, China
| | - Yan Luo
- Department of Biochemistry, School of Medicine, Cancer Institute of the Second Affiliated Hospital, Zhejiang University, Hangzhou, China.,Key Laboratory of Cancer Prevention and Intervention of China National Ministry of Education, Hangzhou, China
| |
Collapse
|
37
|
Wang Z, Sun H, Shen C, Hu X, Gao J, Li D, Cao D, Hou T. Combined strategies in structure-based virtual screening. Phys Chem Chem Phys 2020; 22:3149-3159. [PMID: 31995074 DOI: 10.1039/c9cp06303j] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The identification and optimization of lead compounds are inalienable components in drug design and discovery pipelines. As a powerful computational approach for the identification of hits with novel structural scaffolds, structure-based virtual screening (SBVS) has exhibited a remarkably increasing influence in the early stages of drug discovery. During the past decade, a variety of techniques and algorithms have been proposed and tested with different purposes in the scope of SBVS. Although SBVS has been a common and proven technology, it still shows some challenges and problems that are needed to be addressed, where the negative influence regardless of protein flexibility and the inaccurate prediction of binding affinity are the two major challenges. Here, focusing on these difficulties, we summarize a series of combined strategies or workflows developed by our group and others. Furthermore, several representative successful applications from recent publications are also discussed to demonstrate the effectiveness of the combined SBVS strategies in drug discovery campaigns.
Collapse
Affiliation(s)
- Zhe Wang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Huiyong Sun
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Chao Shen
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Xueping Hu
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Junbo Gao
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Dan Li
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410004, Hunan, P. R. China.
| | - Tingjun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
| |
Collapse
|
38
|
Li Z, Zhang Z, Sun H, Xu L, Jiang C. Identification of novel peptidomimetics targeting the polo-box domain of polo-like kinase 1. Bioorg Chem 2019; 91:103148. [PMID: 31376784 DOI: 10.1016/j.bioorg.2019.103148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2019] [Revised: 07/09/2019] [Accepted: 07/23/2019] [Indexed: 12/26/2022]
Abstract
A series of new peptidomimetics targeting the polo-box domain (PBD) of polo-like kinase 1 (Plk1) was identified based on the potent and selective pentapeptide Plk1 PBD inhibitor PLHSpT. Unnatural amino acid residues were introduced to the newly designed compound and the N-terminal substituent of the peptidomimetic was investigated. The optimized compound 9 inhibited the Plk1 PBD with IC50 of 0.267 μM and showed almost no inhibition to Plk2 PBD or Plk3 PBD at 100 μM. Biolayer interferometry studies demonstrated that compound 9 showed potent binding affinity to Plk1 with a Kd value of 0.164 μM, while no Kd were detected against Plk2 and Plk3. Compound 9 showed improved stability in rat plasma compared to PLHSpT. Binding mode analysis was performed and in agreement with the observed experimental results. There are only two natural amino acids remained in the chemical structure of 9. This study may provide new information for further research on Plk1 PBD inhibitors.
Collapse
Affiliation(s)
- Zhiyan Li
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Zhenguo Zhang
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Lili Xu
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China.
| | - Cheng Jiang
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China.
| |
Collapse
|
39
|
Yu E, Xu Y, Shi Y, Yu Q, Liu J, Xu L. Discovery of novel natural compound inhibitors targeting estrogen receptor α by an integrated virtual screening strategy. J Mol Model 2019; 25:278. [PMID: 31463793 DOI: 10.1007/s00894-019-4156-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Accepted: 08/14/2019] [Indexed: 12/20/2022]
Abstract
Estrogen receptor (ER) is a nuclear hormone receptor and plays an important role in mediating the cellular effects of estrogen. ER can be classified into two receptors: estrogen receptor alpha (ERα) and beta (ERβ), and the former is expressed in 50~80% of breast tumors and has been extensively investigated in breast cancer for decades. Excessive exposure to estrogen can obviously stimulate the growth of breast cancers primarily mediated by ERα, and thus anti-estrogen therapies by small molecules are of concern to clinicians and pharmaceutical industry in the treatment of ERα-positive breast cancers. Although a series of estrogen receptor modulators have been developed, these drugs can lead to resistance and side effects. Therefore, the development of small molecule inhibitors with high target specificity has been intensified. In this pursuit, an integrated computer-aided virtual screening technique, including molecular docking and pharmacophore model screening, was used to screen traditional Chinese medicine (TCM) databases. The compounds with high docking scores and fit values were subjected to ADME (adsorption, distribution, metabolism, excretion) and toxicity prediction, and ten hits were identified as potential inhibitors targeting ERα. Molecular docking was used to investigate the binding modes between ERα and three most potent hits, and molecular dynamic simulations were chosen to explore the stability of these complexes. The rank of the predicted binding free energies evaluated by MM/GBSA is consistent with the docking score. These novel scaffolds discovered in the present study can be used as critical starting point in the drug discovery process for treating ERα-positive breast cancer. Graphical abstract .
Collapse
Affiliation(s)
- Enguang Yu
- Department of Chinese Surgery, Jiaxing University Affiliated Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing, 314000, Zhejiang, People's Republic of China
| | - Yueping Xu
- Department of Nursing, Jiaxing University Affiliated Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing, 314000, Zhejiang, People's Republic of China
| | - Yanbo Shi
- Central Laboratory of Molecular Medicine Research Center, Jiaxing University Affiliated Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing, 314000, Zhejiang, People's Republic of China
| | - Qiuyan Yu
- Department of Breast Surgery, Jiaxing University Affiliated Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing, 314000, Zhejiang, People's Republic of China
| | - Jie Liu
- Department of Traditional Chinese Medicine Oncology, Jiaxing University Affiliated Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing, 314000, Zhejiang, People's Republic of China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, 213001, Jiangsu, China.
| |
Collapse
|
40
|
Yang X, Wang Y, Byrne R, Schneider G, Yang S. Concepts of Artificial Intelligence for Computer-Assisted Drug Discovery. Chem Rev 2019; 119:10520-10594. [PMID: 31294972 DOI: 10.1021/acs.chemrev.8b00728] [Citation(s) in RCA: 369] [Impact Index Per Article: 61.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Artificial intelligence (AI), and, in particular, deep learning as a subcategory of AI, provides opportunities for the discovery and development of innovative drugs. Various machine learning approaches have recently (re)emerged, some of which may be considered instances of domain-specific AI which have been successfully employed for drug discovery and design. This review provides a comprehensive portrayal of these machine learning techniques and of their applications in medicinal chemistry. After introducing the basic principles, alongside some application notes, of the various machine learning algorithms, the current state-of-the art of AI-assisted pharmaceutical discovery is discussed, including applications in structure- and ligand-based virtual screening, de novo drug design, physicochemical and pharmacokinetic property prediction, drug repurposing, and related aspects. Finally, several challenges and limitations of the current methods are summarized, with a view to potential future directions for AI-assisted drug discovery and design.
Collapse
Affiliation(s)
- Xin Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| | - Ryan Byrne
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Gisbert Schneider
- ETH Zurich , Department of Chemistry and Applied Biosciences , Vladimir-Prelog-Weg 4 , CH-8093 Zurich , Switzerland
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital , Sichuan University , Chengdu , Sichuan 610041 , China
| |
Collapse
|
41
|
Shen C, Ding J, Wang Z, Cao D, Ding X, Hou T. From machine learning to deep learning: Advances in scoring functions for protein–ligand docking. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2019. [DOI: 10.1002/wcms.1429] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Chao Shen
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University Hangzhou P. R. China
| | - Junjie Ding
- Beijing Institute of Pharmaceutical Chemistry Beijing P. R. China
| | - Zhe Wang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University Hangzhou P. R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University Changsha P. R. China
| | - Xiaoqin Ding
- Beijing Institute of Pharmaceutical Chemistry Beijing P. R. China
| | - Tingjun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University Hangzhou P. R. China
| |
Collapse
|
42
|
Wang E, Sun H, Wang J, Wang Z, Liu H, Zhang JZH, Hou T. End-Point Binding Free Energy Calculation with MM/PBSA and MM/GBSA: Strategies and Applications in Drug Design. Chem Rev 2019; 119:9478-9508. [DOI: 10.1021/acs.chemrev.9b00055] [Citation(s) in RCA: 578] [Impact Index Per Article: 96.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Ercheng Wang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Huiyong Sun
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Junmei Wang
- Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Zhe Wang
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Hui Liu
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - John Z. H. Zhang
- Shanghai Engineering Research Center of Molecular Therapeutics & New Drug Development, Shanghai Key Laboratory of Green Chemistry & Chemical Process, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU−ECNU Center for Computational Chemistry, NYU Shanghai, Shanghai 200122, China
- Department of Chemistry, New York University, New York, New York 10003, United States
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| | - Tingjun Hou
- Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| |
Collapse
|
43
|
Slater O, Kontoyianni M. The compromise of virtual screening and its impact on drug discovery. Expert Opin Drug Discov 2019; 14:619-637. [PMID: 31025886 DOI: 10.1080/17460441.2019.1604677] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Introduction: Docking and structure-based virtual screening (VS) have been standard approaches in structure-based design for over two decades. However, our understanding of the limitations, potential, and strength of these techniques has enhanced, raising expectations. Areas covered: Based on a survey of reports in the past five years, we assess whether VS: (1) predicts binding poses in agreement with crystallographic data (when available); (2) is a superior screening tool, as often claimed; (3) is successful in identifying chemical scaffolds that can be starting points for subsequent lead optimization cycles. Data shows that knowledge of the target and its chemotypes in postprocessing lead to viable hits in early drug discovery endeavors. Expert opinion: VS is capable of accurate placements in the pocket for the most part, but does not consistently score screening collections accurately. What matters is capitalization on available resources to get closer to a viable lead or optimizable series. Integration of approaches, subjective hit selection guided by knowledge of the receptor or endogenous ligand, libraries driven by experimental guides, validation studies to identify the best docking/scoring that reproduces experimental findings, constraints regarding receptor-ligand interactions, thoroughly designed methodologies, and predefined cutoff scoring criteria strengthen VS's position in pharmaceutical research.
Collapse
Affiliation(s)
- Olivia Slater
- a Department of Pharmaceutical Sciences , Southern Illinois University Edwardsville , Edwardsville , IL , USA
| | - Maria Kontoyianni
- a Department of Pharmaceutical Sciences , Southern Illinois University Edwardsville , Edwardsville , IL , USA
| |
Collapse
|
44
|
Li H, Peng J, Sidorov P, Leung Y, Leung KS, Wong MH, Lu G, Ballester PJ. Classical scoring functions for docking are unable to exploit large volumes of structural and interaction data. Bioinformatics 2019; 35:3989-3995. [DOI: 10.1093/bioinformatics/btz183] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/04/2019] [Accepted: 03/13/2019] [Indexed: 12/15/2022] Open
Abstract
Abstract
Motivation
Studies have shown that the accuracy of random forest (RF)-based scoring functions (SFs), such as RF-Score-v3, increases with more training samples, whereas that of classical SFs, such as X-Score, does not. Nevertheless, the impact of the similarity between training and test samples on this matter has not been studied in a systematic manner. It is therefore unclear how these SFs would perform when only trained on protein-ligand complexes that are highly dissimilar or highly similar to the test set. It is also unclear whether SFs based on machine learning algorithms other than RF can also improve accuracy with increasing training set size and to what extent they learn from dissimilar or similar training complexes.
Results
We present a systematic study to investigate how the accuracy of classical and machine-learning SFs varies with protein-ligand complex similarities between training and test sets. We considered three types of similarity metrics, based on the comparison of either protein structures, protein sequences or ligand structures. Regardless of the similarity metric, we found that incorporating a larger proportion of similar complexes to the training set did not make classical SFs more accurate. In contrast, RF-Score-v3 was able to outperform X-Score even when trained on just 32% of the most dissimilar complexes, showing that its superior performance owes considerably to learning from dissimilar training complexes to those in the test set. In addition, we generated the first SF employing Extreme Gradient Boosting (XGBoost), XGB-Score, and observed that it also improves with training set size while outperforming the rest of SFs. Given the continuous growth of training datasets, the development of machine-learning SFs has become very appealing.
Availability and implementation
https://github.com/HongjianLi/MLSF
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hongjian Li
- SDIVF R&D Centre, Hong Kong Science Park, Sha Tin, New Territories, Hong Kong
- CUHK-SDU Joint Laboratory on Reproductive Genetics School of Biomedical Sciences, The Chinese University of Hong Kong, Sha Tin, New Territories, Hong Kong
| | - Jiangjun Peng
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi’an, China
| | - Pavel Sidorov
- Cancer Research Center of Marseille CRCM, INSERM, Institut Paoli-Calmettes, Aix-Marseille University, CNRS, F-13009 Marseille, France
| | | | - Kwong-Sak Leung
- Institute of Future Cities
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, New Territories, Hong Kong
| | - Man-Hon Wong
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Sha Tin, New Territories, Hong Kong
| | - Gang Lu
- CUHK-SDU Joint Laboratory on Reproductive Genetics School of Biomedical Sciences, The Chinese University of Hong Kong, Sha Tin, New Territories, Hong Kong
| | - Pedro J Ballester
- Cancer Research Center of Marseille CRCM, INSERM, Institut Paoli-Calmettes, Aix-Marseille University, CNRS, F-13009 Marseille, France
| |
Collapse
|
45
|
Akher FB, Farrokhzadeh A, Soliman MES. Covalent vs. Non-Covalent Inhibition: Tackling Drug Resistance in EGFR - A Thorough Dynamic Perspective. Chem Biodivers 2019; 16:e1800518. [PMID: 30548188 DOI: 10.1002/cbdv.201800518] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 12/10/2018] [Indexed: 12/20/2022]
Abstract
A persistent challenge in the treatment of non-small cell lung cancer (NSCLC) with EGFR is the emergence of drug-resistant caused by somatic mutations. The EGFR L858R/T790 M double mutant (EGFRDM ) was found to be the most alarming variant. Despite the development of a wide range of inhibitors, none of them could inhibit EGFRDM effectively. Recently, 11h and 45a, have been found to be potent inhibitors against EGFRDM through two distinctive mechanisms, non-covalent and covalent binding, respectively. However, the structural and dynamic implications of the two modes of inhibitions remain unexplored. Herein, two molecular dynamics simulation protocols, coupled with free-energy calculations, were applied to gain insight into the atomistic nature of each binding mode. The comparative analysis confirmed that there is a significant difference in the binding free energy between 11h and 45a (ΔΔGbind =-21.17 kcal/mol). The main binding force that governs the binding of both inhibitors is vdW, with a higher contribution for 45a. Two residues ARG841 and THR854 were found to have curtailed role in the binding of 45a to EGFRDM by stabilizing its flexible alcohol chain. The 45a binding to EGFRDM induces structural rearrangement in the active site to allow easier accessibility of 45a to target residue CYS797. The findings of this work can substantially shed light on new strategies for developing novel classes of covalent and non-covalent inhibitors with increased specificity and potency.
Collapse
Affiliation(s)
- Farideh Badichi Akher
- Bio-computation and Drug Design Laboratory, School of Health Sciences, University of KwaZulu-Natal, Westville Campus, Durban, 4001, South Africa
| | - Abdolkarim Farrokhzadeh
- Bio-computation and Drug Design Laboratory, School of Health Sciences, University of KwaZulu-Natal, Westville Campus, Durban, 4001, South Africa
| | - Mahmoud E S Soliman
- Bio-computation and Drug Design Laboratory, School of Health Sciences, University of KwaZulu-Natal, Westville Campus, Durban, 4001, South Africa
| |
Collapse
|
46
|
Zhang S, Li Y, Qin Z, Tu G, Chen G, Yan A. SAR study on inhibitors of GIIA secreted phospholipase A
2
using machine learning methods. Chem Biol Drug Des 2019; 93:666-684. [DOI: 10.1111/cbdd.13470] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 12/14/2018] [Accepted: 12/17/2018] [Indexed: 12/24/2022]
Affiliation(s)
- Shengde Zhang
- State Key Laboratory of Chemical Resource EngineeringDepartment of Pharmaceutical EngineeringBeijing University of Chemical Technology Beijing China
| | - Yang Li
- Institute of Science and TechnologyShandong University of Traditional Chinese Medicine Ji'nan Shandong China
| | - Zijian Qin
- State Key Laboratory of Chemical Resource EngineeringDepartment of Pharmaceutical EngineeringBeijing University of Chemical Technology Beijing China
| | - Guipin Tu
- State Key Laboratory of Chemical Resource EngineeringDepartment of Pharmaceutical EngineeringBeijing University of Chemical Technology Beijing China
| | - Guang Chen
- College of Life Science and TechnologyBeijing University of Chemical Technology Beijing China
| | - Aixia Yan
- State Key Laboratory of Chemical Resource EngineeringDepartment of Pharmaceutical EngineeringBeijing University of Chemical Technology Beijing China
| |
Collapse
|
47
|
Wójcikowski M, Siedlecki P, Ballester PJ. Building Machine-Learning Scoring Functions for Structure-Based Prediction of Intermolecular Binding Affinity. Methods Mol Biol 2019; 2053:1-12. [PMID: 31452095 DOI: 10.1007/978-1-4939-9752-7_1] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Molecular docking enables large-scale prediction of whether and how small molecules bind to a macromolecular target. Machine-learning scoring functions are particularly well suited to predict the strength of this interaction. Here we describe how to build RF-Score, a scoring function utilizing the machine-learning technique known as Random Forest (RF). We also point out how to use different data, features, and regression models using either R or Python programming languages.
Collapse
Affiliation(s)
| | - Pawel Siedlecki
- Institute of Biochemistry and Biophysics PAS, Warsaw, Poland
- Department of Systems Biology, Institute of Experimental Plant Biology and Biotechnology, University of Warsaw, Warsaw, Poland
| | - Pedro J Ballester
- Cancer Research Center of Marseille, INSERM U1068, Marseille, France.
- Institut Paoli-Calmettes, Marseille, France.
- Aix-Marseille Université, Marseille, France.
- CNRS UMR7258, Marseille, France.
| |
Collapse
|
48
|
Chen Y, Li Z, Liu Y, Lin T, Sun H, Yang D, Jiang C. Identification of novel and selective non-peptide inhibitors targeting the polo-box domain of polo-like kinase 1. Bioorg Chem 2018; 81:278-288. [PMID: 30170276 DOI: 10.1016/j.bioorg.2018.08.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2018] [Revised: 08/15/2018] [Accepted: 08/21/2018] [Indexed: 01/05/2023]
Abstract
A series of non-peptide inhibitors targeting the polo-box domain (PBD) of polo-like kinase 1 (Plk1) was designed based on the potent and selective minimal tripeptide Plk1 PBD inhibitor. Seven compounds were designed, synthesized and evaluated for fluorescence polarization (FP) assay. The most promising compound 10 bound to Plk1 PBD with IC50 of 3.37 μM and had no binding to Plk2 PBD or Plk3 PBD at 100 μM. Molecular docking study was performed and possible binding mode was proposed. MM/GBSA binding free energy calculation were in agreement with the observed experimental results. These novel non-peptide selective Plk1 PBD inhibitors provided new lead compounds for further optimization.
Collapse
Affiliation(s)
- Yanhong Chen
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Zhiyan Li
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Yu Liu
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Tongyuan Lin
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory of Drug Quality Control and Pharmacovigilance, China Pharmaceutical University, Nanjing 210009, China
| | - Huiyong Sun
- Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China
| | - Dasong Yang
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory of Drug Quality Control and Pharmacovigilance, China Pharmaceutical University, Nanjing 210009, China.
| | - Cheng Jiang
- Jiangsu Key Laboratory of Drug Design and Optimization, China Pharmaceutical University, Nanjing 210009, China; Key Laboratory on Protein Chemistry and Structural Biology, China Pharmaceutical University, Nanjing 210009, China; Department of Medicinal Chemistry, China Pharmaceutical University, Nanjing 210009, China.
| |
Collapse
|
49
|
Huang X, Xu P, Cao Y, Liu L, Song G, Xu L. Exploring the binding mechanisms of PDE5 with chromeno[2,3- c]pyrrol-9(2 H)-one by theoretical approaches. RSC Adv 2018; 8:30481-30490. [PMID: 35546827 PMCID: PMC9085377 DOI: 10.1039/c8ra06405a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 08/17/2018] [Indexed: 01/10/2023] Open
Abstract
Cyclic nucleotide phosphodiesterase type 5 (PDE5), exclusively specific for the cyclic guanosine monophosphate (cGMP), is an important drug target for the treatment of erectile dysfunction and pulmonary arterial hypertension (PAH). Although many PDE5 inhibitors have been approved, such as sildenafil, vardenafil, tadalafil and so on, extensive studies have reported some side effects, such as vision disturbance and hearing loss as a result of the amino acid sequence and the secondary structural similarity of other PDEs to the catalytic domain of PDE5. In this study, multiple docking strategies, molecular dynamics (MD) simulations, free energy calculations and decomposition were employed to explore the structural determinants of PDE5 with a series of chromeno[2,3-c]pyrrol-9(2H)-one derivatives. First, reliable docking results were obtained using quantum mechanics/molecular mechanics (QM/MM) docking. Then, MD simulations and MM/GBSA free energy calculations were used to explore the dynamic binding process and characterize the binding modes of the inhibitors with different activities. The predicted binding free energies are in good agreement with the experimental data, and the MM/GBSA free energy decomposition analysis sheds light on the importance of hydrogen bonds with Gln817, π-π stacks against Phe820 and hydrophobic residues for the PDE5 binding of the studied inhibitors. The structural and energetic insights obtained here are useful for understanding the molecular mechanism of ligand binding and designing novel potent and selective PDE5 inhibitors with new scaffolds.
Collapse
Affiliation(s)
- Xianfeng Huang
- School of Pharmaceutical Engineering and Life Sciences, Changzhou University Changzhou Jiangsu 213164 PR China +86-519-86334598 +86-519-86330600
| | - Peng Xu
- Department of Orthopedics, Second Military Medical University Affiliated Changzheng Hospital Shanghai 200003 China
| | - Yijing Cao
- School of Pharmaceutical Engineering and Life Sciences, Changzhou University Changzhou Jiangsu 213164 PR China +86-519-86334598 +86-519-86330600
| | - Li Liu
- School of Pharmaceutical Engineering and Life Sciences, Changzhou University Changzhou Jiangsu 213164 PR China +86-519-86334598 +86-519-86330600
| | - Guoqiang Song
- School of Pharmaceutical Engineering and Life Sciences, Changzhou University Changzhou Jiangsu 213164 PR China +86-519-86334598 +86-519-86330600
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology Changzhou 213001 China +86-519-86953223 +86-519-86953220
| |
Collapse
|
50
|
Sun H, Duan L, Chen F, Liu H, Wang Z, Pan P, Zhu F, Zhang JZH, Hou T. Assessing the performance of MM/PBSA and MM/GBSA methods. 7. Entropy effects on the performance of end-point binding free energy calculation approaches. Phys Chem Chem Phys 2018; 20:14450-14460. [PMID: 29785435 DOI: 10.1039/c7cp07623a] [Citation(s) in RCA: 232] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Entropy effects play an important role in drug-target interactions, but the entropic contribution to ligand-binding affinity is often neglected by end-point binding free energy calculation methods, such as MM/GBSA and MM/PBSA, due to the expensive computational cost of normal mode analysis (NMA). Here, we systematically investigated entropy effects on the prediction power of MM/GBSA and MM/PBSA using >1500 protein-ligand systems and six representative AMBER force fields. Two computationally efficient methods, including NMA based on truncated structures and the interaction entropy approach, were used to estimate the entropic contributions to ligand-target binding free energies. In terms of the overall accuracy, we found that, for the minimized structures, in most cases the inclusion of the conformational entropies predicted by truncated NMA (enthalpynmode_min_9Å) compromises the overall accuracy of MM/GBSA and MM/PBSA compared with the enthalpies calculated based on the minimized structures (enthalpymin). However, for the MD trajectories, the binding free energies can be improved by the inclusion of the conformation entropies predicted by either truncated-NMA for a relatively high dielectric constant (εin = 4) or the interaction entropy method for εin = 1-4. In terms of reproducing the absolute binding free energies, the binding free energies estimated by including the truncated-NMA entropies based on the MD trajectories (ΔGnmode_md_9Å) give the lowest average absolute deviations against the experimental data among all the tested strategies for both MM/GBSA and MM/PBSA. Although the inclusion of the truncated NMA based on the MD trajectories (ΔGnmode_md_9Å) for a relatively high dielectric constant gave the overall best result and the lowest average absolute deviations against the experimental data (for the ff03 force field), it needs too much computational time. Alternatively, considering that the interaction entropy method does not incur any additional computational cost and can give comparable (at high dielectric constant, εin = 4) or even better (at low dielectric constant, εin = 1-2) results than the truncated-NMA entropy (ΔGnmode_md_9Å), the interaction entropy approach is recommended to estimate the entropic component for MM/GBSA and MM/PBSA based on MD trajectories, especially for a diverse dataset. Furthermore, we compared the predictions of MM/GBSA with six different AMBER force fields. The results show that the ff03 force field (ff03 for proteins and gaff with AM1-BCC charges for ligands) performs the best, but the predictions given by the tested force fields are comparable, implying that the MM/GBSA predictions are not very sensitive to force fields.
Collapse
Affiliation(s)
- Huiyong Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
| | | | | | | | | | | | | | | | | |
Collapse
|