1
|
Khachatryan H, Matevosyan M, Harutyunyan V, Gevorgyan S, Shavina A, Tirosyan I, Gabrielyan Y, Ayvazyan M, Bozdaganyan M, Fakhar Z, Gharaghani S, Zakaryan H. Computational evaluation and benchmark study of 342 crystallographic holo-structures of SARS-CoV-2 Mpro enzyme. Sci Rep 2024; 14:14255. [PMID: 38902397 PMCID: PMC11189913 DOI: 10.1038/s41598-024-65228-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 06/18/2024] [Indexed: 06/22/2024] Open
Abstract
The coronavirus disease 19 pandemic, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has led to a global health crisis with millions of confirmed cases and related deaths. The main protease (Mpro) of SARS-CoV-2 is crucial for viral replication and presents an attractive target for drug development. Despite the approval of some drugs, the search for effective treatments continues. In this study, we systematically evaluated 342 holo-crystal structures of Mpro to identify optimal conformations for structure-based virtual screening (SBVS). Our analysis revealed limited structural flexibility among the structures. Three docking programs, AutoDock Vina, rDock, and Glide were employed to assess the efficiency of virtual screening, revealing diverse performances across selected Mpro structures. We found that the structures 5RHE, 7DDC, and 7DPU (PDB Ids) consistently displayed the lowest EF, AUC, and BEDROCK scores. Furthermore, these structures demonstrated the worst pose prediction results in all docking programs. Two structural differences contribute to variations in docking performance: the absence of the S1 subsite in 7DDC and 7DPU, and the presence of a subpocket in the S2 subsite of 7DDC, 7DPU, and 5RHE. These findings underscore the importance of selecting appropriate Mpro conformations for SBVS, providing valuable insights for advancing drug discovery efforts.
Collapse
Affiliation(s)
- Hamlet Khachatryan
- Denovo Sciences Inc, 0060, Yerevan, Armenia.
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia.
| | - Mher Matevosyan
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Vardan Harutyunyan
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Smbat Gevorgyan
- Denovo Sciences Inc, 0060, Yerevan, Armenia
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Anastasiya Shavina
- Denovo Sciences Inc, 0060, Yerevan, Armenia
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Irina Tirosyan
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Yeva Gabrielyan
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | - Marusya Ayvazyan
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia
| | | | - Zeynab Fakhar
- Laboratory of Bioinformatics and Drug Design (LBD), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Sajjad Gharaghani
- Laboratory of Bioinformatics and Drug Design (LBD), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Hovakim Zakaryan
- Denovo Sciences Inc, 0060, Yerevan, Armenia.
- Laboratory of Antiviral Drug Discovery, Institute of Molecular Biology of NAS, Hasratyan 7, 0014, Yerevan, Armenia.
| |
Collapse
|
2
|
Meller A, Kelly D, Smith LG, Bowman GR. Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants. Protein Sci 2024; 33:e4902. [PMID: 38358129 PMCID: PMC10868452 DOI: 10.1002/pro.4902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/01/2023] [Accepted: 01/04/2024] [Indexed: 02/16/2024]
Abstract
The goal of precision medicine is to utilize our knowledge of the molecular causes of disease to better diagnose and treat patients. However, there is a substantial mismatch between the small number of food and drug administration (FDA)-approved drugs and annotated coding variants compared to the needs of precision medicine. This review introduces the concept of physics-based precision medicine, a scalable framework that promises to improve our understanding of sequence-function relationships and accelerate drug discovery. We show that accounting for the ensemble of structures a protein adopts in solution with computer simulations overcomes many of the limitations imposed by assuming a single protein structure. We highlight studies of protein dynamics and recent methods for the analysis of structural ensembles. These studies demonstrate that differences in conformational distributions predict functional differences within protein families and between variants. Thanks to new computational tools that are providing unprecedented access to protein structural ensembles, this insight may enable accurate predictions of variant pathogenicity for entire libraries of variants. We further show that explicitly accounting for protein ensembles, with methods like alchemical free energy calculations or docking to Markov state models, can uncover novel lead compounds. To conclude, we demonstrate that cryptic pockets, or cavities absent in experimental structures, provide an avenue to target proteins that are currently considered undruggable. Taken together, our review provides a roadmap for the field of protein science to accelerate precision medicine.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular BiophysicsWashington University in St. LouisSt. LouisMissouriUSA
- Medical Scientist Training ProgramWashington University in St. LouisSt. LouisMissouriUSA
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Devin Kelly
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Louis G. Smith
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Gregory R. Bowman
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| |
Collapse
|
3
|
Leemann M, Sagasta A, Eberhardt J, Schwede T, Robin X, Durairaj J. Automated benchmarking of combined protein structure and ligand conformation prediction. Proteins 2023; 91:1912-1924. [PMID: 37885318 DOI: 10.1002/prot.26605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 09/15/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023]
Abstract
The prediction of protein-ligand complexes (PLC), using both experimental and predicted structures, is an active and important area of research, underscored by the inclusion of the Protein-Ligand Interaction category in the latest round of the Critical Assessment of Protein Structure Prediction experiment CASP15. The prediction task in CASP15 consisted of predicting both the three-dimensional structure of the receptor protein as well as the position and conformation of the ligand. This paper addresses the challenges and proposed solutions for devising automated benchmarking techniques for PLC prediction. The reliability of experimentally solved PLC as ground truth reference structures is assessed using various validation criteria. Similarity of PLC to previously released complexes are employed to judge PLC diversity and the difficulty of a PLC as a prediction target. We show that the commonly used PDBBind time-split test-set is inappropriate for comprehensive PLC evaluation, with state-of-the-art tools showing conflicting results on a more representative and high quality dataset constructed for benchmarking purposes. We also show that redocking on crystal structures is a much simpler task than docking into predicted protein models, demonstrated by the two PLC-prediction-specific scoring metrics created. Finally, we introduce a fully automated pipeline that predicts PLC and evaluates the accuracy of the protein structure, ligand pose, and protein-ligand interactions.
Collapse
Affiliation(s)
- Michèle Leemann
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Ander Sagasta
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Jerome Eberhardt
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Xavier Robin
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Janani Durairaj
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
4
|
Dong T, Yang Z, Zhou J, Chen CYC. Equivariant Flexible Modeling of the Protein-Ligand Binding Pose with Geometric Deep Learning. J Chem Theory Comput 2023; 19:8446-8459. [PMID: 37938978 DOI: 10.1021/acs.jctc.3c00273] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]
Abstract
Flexible modeling of the protein-ligand complex structure is a fundamental challenge for in silico drug development. Recent studies have improved commonly used docking tools by incorporating extra-deep learning-based steps. However, such strategies limit their accuracy and efficiency because they retain massive sampling pressure and lack consideration for flexible biomolecular changes. In this study, we propose FlexPose, a geometric graph network capable of direct flexible modeling of complex structures in Euclidean space without the following conventional sampling and scoring strategies. Our model adopts two key designs: scalar-vector dual feature representation and SE(3)-equivariant network, to manage dynamic structural changes, as well as two strategies: conformation-aware pretraining and weakly supervised learning, to boost model generalizability in unseen chemical space. Benefiting from these paradigms, our model dramatically outperforms all tested popular docking tools and recently advanced deep learning methods, especially in tasks involving protein conformation changes. We further investigate the impact of protein and ligand similarity on the model performance with two conformation-aware strategies. Moreover, FlexPose provides an affinity estimation and model confidence for postanalysis.
Collapse
Affiliation(s)
- Tiejun Dong
- Intelligent Medical Research Center, School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, Guangdong 510275, China
| | - Ziduo Yang
- Intelligent Medical Research Center, School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, Guangdong 510275, China
| | - Jun Zhou
- Intelligent Medical Research Center, School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, Guangdong 510275, China
| | - Calvin Yu-Chian Chen
- Intelligent Medical Research Center, School of Intelligent Systems Engineering, Sun Yat-sen University, Shenzhen, Guangdong 510275, China
- AI for Science (AI4S)-Preferred Program, Peking University Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
- School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, Guangdong 518055, China
- Department of Medical Research, China Medical University Hospital, Taichung 40447, Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung 41354, Taiwan
| |
Collapse
|
5
|
Parijat P, Attili S, Hoare Z, Shattock M, Kenyon V, Kampourakis T. Discovery of a novel cardiac-specific myosin modulator using artificial intelligence-based virtual screening. Nat Commun 2023; 14:7692. [PMID: 38001148 PMCID: PMC10673995 DOI: 10.1038/s41467-023-43538-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 11/13/2023] [Indexed: 11/26/2023] Open
Abstract
Direct modulation of cardiac myosin function has emerged as a therapeutic target for both heart disease and heart failure. However, the development of myosin-based therapeutics has been hampered by the lack of targeted in vitro screening assays. In this study we use Artificial Intelligence-based virtual high throughput screening (vHTS) to identify novel small molecule effectors of human β-cardiac myosin. We test the top scoring compounds from vHTS in biochemical counter-screens and identify a novel chemical scaffold called 'F10' as a cardiac-specific low-micromolar myosin inhibitor. Biochemical and biophysical characterization in both isolated proteins and muscle fibers show that F10 stabilizes both the biochemical (i.e. super-relaxed state) and structural (i.e. interacting heads motif) OFF state of cardiac myosin, and reduces force and left ventricular pressure development in isolated myofilaments and Langendorff-perfused hearts, respectively. F10 is a tunable scaffold for the further development of a novel class of myosin modulators.
Collapse
Affiliation(s)
- Priyanka Parijat
- Randall Centre for Cell and Molecular Biophysics; and British Heart Foundation Centre of Research Excellence, King's College London, London, SE1 1UL, United Kingdom
| | - Seetharamaiah Attili
- Randall Centre for Cell and Molecular Biophysics; and British Heart Foundation Centre of Research Excellence, King's College London, London, SE1 1UL, United Kingdom
| | - Zoe Hoare
- School of Cardiovascular and Metabolic Medicine and Sciences; Rayne Institute and British Heart Foundation Centre of Research Excellence, King's College London, London, SE5 9NU, United Kingdom
| | - Michael Shattock
- School of Cardiovascular and Metabolic Medicine and Sciences; Rayne Institute and British Heart Foundation Centre of Research Excellence, King's College London, London, SE5 9NU, United Kingdom
| | | | - Thomas Kampourakis
- Randall Centre for Cell and Molecular Biophysics; and British Heart Foundation Centre of Research Excellence, King's College London, London, SE1 1UL, United Kingdom.
| |
Collapse
|
6
|
Meller A, de Oliveira S, Davtyan A, Abramyan T, Bowman GR, van den Bedem H. Discovery of a cryptic pocket in the AI-predicted structure of PPM1D phosphatase explains the binding site and potency of its allosteric inhibitors. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.22.533829. [PMID: 36993233 PMCID: PMC10055338 DOI: 10.1101/2023.03.22.533829] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Virtual screening is a widely used tool for drug discovery, but its predictive power can vary dramatically depending on how much structural data is available. In the best case, crystal structures of a ligand-bound protein can help find more potent ligands. However, virtual screens tend to be less predictive when only ligand-free crystal structures are available, and even less predictive if a homology model or other predicted structure must be used. Here, we explore the possibility that this situation can be improved by better accounting for protein dynamics, as simulations started from a single structure have a reasonable chance of sampling nearby structures that are more compatible with ligand binding. As a specific example, we consider the cancer drug target PPM1D/Wip1 phosphatase, a protein that lacks crystal structures. High-throughput screens have led to the discovery of several allosteric inhibitors of PPM1D, but their binding mode remains unknown. To enable further drug discovery efforts, we assessed the predictive power of an AlphaFold-predicted structure of PPM1D and a Markov state model (MSM) built from molecular dynamics simulations initiated from that structure. Our simulations reveal a cryptic pocket at the interface between two important structural elements, the flap and hinge regions. Using deep learning to predict the pose quality of each docked compound for the active site and cryptic pocket suggests that the inhibitors strongly prefer binding to the cryptic pocket, consistent with their allosteric effect. The predicted affinities for the dynamically uncovered cryptic pocket also recapitulate the relative potencies of the compounds (τ b =0.70) better than the predicted affinities for the static AlphaFold-predicted structure (τ b =0.42). Taken together, these results suggest that targeting the cryptic pocket is a good strategy for drugging PPM1D and, more generally, that conformations selected from simulation can improve virtual screening when limited structural data is available.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S Euclid Ave, St. Louis, MO, 63110
- Medical Scientist Training Program, Washington University in St. Louis, 660 S Euclid Ave., St. Louis, MO, 63110
| | - Saulo de Oliveira
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103
| | - Aram Davtyan
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103
| | - Tigran Abramyan
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103
| | - Gregory R. Bowman
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA, 19104
| | - Henry van den Bedem
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94158
| |
Collapse
|
7
|
Meller A, De Oliveira S, Davtyan A, Abramyan T, Bowman GR, van den Bedem H. Discovery of a cryptic pocket in the AI-predicted structure of PPM1D phosphatase explains the binding site and potency of its allosteric inhibitors. Front Mol Biosci 2023; 10:1171143. [PMID: 37143823 PMCID: PMC10151774 DOI: 10.3389/fmolb.2023.1171143] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 04/07/2023] [Indexed: 05/06/2023] Open
Abstract
Virtual screening is a widely used tool for drug discovery, but its predictive power can vary dramatically depending on how much structural data is available. In the best case, crystal structures of a ligand-bound protein can help find more potent ligands. However, virtual screens tend to be less predictive when only ligand-free crystal structures are available, and even less predictive if a homology model or other predicted structure must be used. Here, we explore the possibility that this situation can be improved by better accounting for protein dynamics, as simulations started from a single structure have a reasonable chance of sampling nearby structures that are more compatible with ligand binding. As a specific example, we consider the cancer drug target PPM1D/Wip1 phosphatase, a protein that lacks crystal structures. High-throughput screens have led to the discovery of several allosteric inhibitors of PPM1D, but their binding mode remains unknown. To enable further drug discovery efforts, we assessed the predictive power of an AlphaFold-predicted structure of PPM1D and a Markov state model (MSM) built from molecular dynamics simulations initiated from that structure. Our simulations reveal a cryptic pocket at the interface between two important structural elements, the flap and hinge regions. Using deep learning to predict the pose quality of each docked compound for the active site and cryptic pocket suggests that the inhibitors strongly prefer binding to the cryptic pocket, consistent with their allosteric effect. The predicted affinities for the dynamically uncovered cryptic pocket also recapitulate the relative potencies of the compounds (τb = 0.70) better than the predicted affinities for the static AlphaFold-predicted structure (τb = 0.42). Taken together, these results suggest that targeting the cryptic pocket is a good strategy for drugging PPM1D and, more generally, that conformations selected from simulation can improve virtual screening when limited structural data is available.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, St. Louis, MO, United States
- Medical Scientist Training Program, Washington University in St. Louis, St. Louis, MO, United States
| | | | - Aram Davtyan
- Atomwise, Inc., San Francisco, CA, United States
| | | | - Gregory R. Bowman
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, PA, United States
- *Correspondence: Gregory R. Bowman, ; Henry van den Bedem,
| | - Henry van den Bedem
- Atomwise, Inc., San Francisco, CA, United States
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
- *Correspondence: Gregory R. Bowman, ; Henry van den Bedem,
| |
Collapse
|
8
|
Maddah M, Hoseinian N, Pourfath M. An ensemble docking-based virtual screening according to different TRPV1 pore states toward identifying phytochemical activators. NEW J CHEM 2023. [DOI: 10.1039/d2nj04918j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Identifying phytochemical activators for TRPV1 using ensemble-based virtual screening, machine learning, and MD simulation.
Collapse
Affiliation(s)
- Mina Maddah
- School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran
| | - Nadia Hoseinian
- School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran
| | - Mahdi Pourfath
- School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran
| |
Collapse
|
9
|
Blanes-Mira C, Fernández-Aguado P, de Andrés-López J, Fernández-Carvajal A, Ferrer-Montiel A, Fernández-Ballester G. Comprehensive Survey of Consensus Docking for High-Throughput Virtual Screening. Molecules 2022; 28:molecules28010175. [PMID: 36615367 PMCID: PMC9821981 DOI: 10.3390/molecules28010175] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 12/19/2022] [Accepted: 12/21/2022] [Indexed: 12/28/2022] Open
Abstract
The rapid advances of 3D techniques for the structural determination of proteins and the development of numerous computational methods and strategies have led to identifying highly active compounds in computer drug design. Molecular docking is a method widely used in high-throughput virtual screening campaigns to filter potential ligands targeted to proteins. A great variety of docking programs are currently available, which differ in the algorithms and approaches used to predict the binding mode and the affinity of the ligand. All programs heavily rely on scoring functions to accurately predict ligand binding affinity, and despite differences in performance, none of these docking programs is preferable to the others. To overcome this problem, consensus scoring methods improve the outcome of virtual screening by averaging the rank or score of individual molecules obtained from different docking programs. The successful application of consensus docking in high-throughput virtual screening highlights the need to optimize the predictive power of molecular docking methods.
Collapse
|
10
|
Hantz ER, Lindert S. Actives-Based Receptor Selection Strongly Increases the Success Rate in Structure-Based Drug Design and Leads to Identification of 22 Potent Cancer Inhibitors. J Chem Inf Model 2022; 62:5675-5687. [PMID: 36321808 DOI: 10.1021/acs.jcim.2c00848] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Computer-aided drug design, an important component of the early stages of the drug discovery pipeline, routinely identifies large numbers of false positive hits that are subsequently confirmed to be experimentally inactive compounds. We have developed a methodology to improve true positive prediction rates in structure-based drug design and have successfully applied the protocol to twenty target systems and identified the top three performing conformers for each of the targets. Receptor performance was evaluated based on the area under the curve of the receiver operating characteristic curve for two independent sets of known actives. For a subset of five diverse cancer-related disease targets, we validated our approach through experimental testing of the top 50 compounds from a blind screening of a small molecule library containing hundreds of thousands of compounds. Our methods of receptor and compound selection resulted in the identification of 22 novel inhibitors in the low μM-nM range, with the most potent being an EGFR inhibitor with an IC50 value of 7.96 nM. Additionally, for a subset of five independent target systems, we demonstrated the utility of Gaussian accelerated molecular dynamics to thoroughly explore a target system's potential energy surface and generate highly predictive receptor conformations.
Collapse
Affiliation(s)
- Eric R Hantz
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio43210, United States
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, Ohio43210, United States
| |
Collapse
|
11
|
Yang C, Chen EA, Zhang Y. Protein-Ligand Docking in the Machine-Learning Era. Molecules 2022; 27:4568. [PMID: 35889440 PMCID: PMC9323102 DOI: 10.3390/molecules27144568] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 07/14/2022] [Indexed: 11/16/2022] Open
Abstract
Molecular docking plays a significant role in early-stage drug discovery, from structure-based virtual screening (VS) to hit-to-lead optimization, and its capability and predictive power is critically dependent on the protein-ligand scoring function. In this review, we give a broad overview of recent scoring function development, as well as the docking-based applications in drug discovery. We outline the strategies and resources available for structure-based VS and discuss the assessment and development of classical and machine learning protein-ligand scoring functions. In particular, we highlight the recent progress of machine learning scoring function ranging from descriptor-based models to deep learning approaches. We also discuss the general workflow and docking protocols of structure-based VS, such as structure preparation, binding site detection, docking strategies, and post-docking filter/re-scoring, as well as a case study on the large-scale docking-based VS test on the LIT-PCBA data set.
Collapse
Affiliation(s)
- Chao Yang
- Department of Chemistry, New York University, New York, NY 10003, USA; (C.Y.); (E.A.C.)
| | - Eric Anthony Chen
- Department of Chemistry, New York University, New York, NY 10003, USA; (C.Y.); (E.A.C.)
| | - Yingkai Zhang
- Department of Chemistry, New York University, New York, NY 10003, USA; (C.Y.); (E.A.C.)
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
12
|
Miñarro-Lleonar M, Ruiz-Carmona S, Alvarez-Garcia D, Schmidtke P, Barril X. Development of an Automatic Pipeline for Participation in the CELPP Challenge. Int J Mol Sci 2022; 23:ijms23094756. [PMID: 35563148 PMCID: PMC9105952 DOI: 10.3390/ijms23094756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 04/20/2022] [Accepted: 04/21/2022] [Indexed: 12/01/2022] Open
Abstract
The prediction of how a ligand binds to its target is an essential step for Structure-Based Drug Design (SBDD) methods. Molecular docking is a standard tool to predict the binding mode of a ligand to its macromolecular receptor and to quantify their mutual complementarity, with multiple applications in drug design. However, docking programs do not always find correct solutions, either because they are not sampled or due to inaccuracies in the scoring functions. Quantifying the docking performance in real scenarios is essential to understanding their limitations, managing expectations and guiding future developments. Here, we present a fully automated pipeline for pose prediction validated by participating in the Continuous Evaluation of Ligand Pose Prediction (CELPP) Challenge. Acknowledging the intrinsic limitations of the docking method, we devised a strategy to automatically mine and exploit pre-existing data, defining—whenever possible—empirical restraints to guide the docking process. We prove that the pipeline is able to generate predictions for most of the proposed targets as well as obtain poses with low RMSD values when compared to the crystal structure. All things considered, our pipeline highlights some major challenges in the automatic prediction of protein–ligand complexes, which will be addressed in future versions of the pipeline.
Collapse
Affiliation(s)
- Marina Miñarro-Lleonar
- Pharmacy Faculty, University of Barcelona, Av. de Joan XXIII 27-31, 08028 Barcelona, Spain;
| | | | - Daniel Alvarez-Garcia
- GAIN Therapeutics, Parc Cientific de Barcelona, Baldiri i Reixac 10, 08029 Barcelona, Spain;
| | - Peter Schmidtke
- Discngine S.A.S., 79 Avenue Ledru Rollin, 75012 Paris, France;
| | - Xavier Barril
- Pharmacy Faculty, University of Barcelona, Av. de Joan XXIII 27-31, 08028 Barcelona, Spain;
- GAIN Therapeutics, Parc Cientific de Barcelona, Baldiri i Reixac 10, 08029 Barcelona, Spain;
- Catalan Institute for Research and Advanced Studies (ICREA), Passeig de Lluis Companys 23, 08010 Barcelona, Spain
- Correspondence:
| |
Collapse
|
13
|
Stafford KA, Anderson BM, Sorenson J, van den Bedem H. AtomNet PoseRanker: Enriching Ligand Pose Quality for Dynamic Proteins in Virtual High-Throughput Screens. J Chem Inf Model 2022; 62:1178-1189. [PMID: 35235748 PMCID: PMC8924924 DOI: 10.1021/acs.jcim.1c01250] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Structure-based, virtual High-Throughput Screening (vHTS) methods for predicting ligand activity in drug discovery are important when there are no or relatively few known compounds that interact with a therapeutic target of interest. State-of-the-art computational vHTS necessarily relies on effective methods for pose sampling and docking and generating an accurate affinity score from the docked poses. However, proteins are dynamic; in vivo ligands bind to a conformational ensemble. In silico docking to the single conformation represented by a crystal structure can adversely affect the pose quality. Here, we introduce AtomNet PoseRanker (ANPR), a graph convolutional network trained to identify and rerank crystal-like ligand poses from a sampled ensemble of protein conformations and ligand poses. In contrast to conventional vHTS methods that incorporate receptor flexibility, a deep learning approach can internalize valid cognate and noncognate binding modes corresponding to distinct receptor conformations, thereby learning to infer and account for receptor flexibility even on single conformations. ANPR significantly enriched pose quality in docking to cognate and noncognate receptors of the PDBbind v2019 data set. Improved pose rankings that better represent experimentally observed ligand binding modes improve hit rates in vHTS campaigns and thereby advance computational drug discovery, especially for novel therapeutic targets or novel binding sites.
Collapse
Affiliation(s)
- Kate A Stafford
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103, United States
| | - Brandon M Anderson
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103, United States
| | - Jon Sorenson
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103, United States
| | - Henry van den Bedem
- Atomwise, Inc., 717 Market Street, Suite 800, San Francisco, California 94103, United States.,Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94158, United States
| |
Collapse
|
14
|
Janežič M, Valjavec K, Loboda KB, Herlah B, Ogris I, Kozorog M, Podobnik M, Grdadolnik SG, Wolber G, Perdih A. Dynophore-Based Approach in Virtual Screening: A Case of Human DNA Topoisomerase IIα. Int J Mol Sci 2021; 22:ijms222413474. [PMID: 34948269 PMCID: PMC8703789 DOI: 10.3390/ijms222413474] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 11/23/2021] [Accepted: 12/10/2021] [Indexed: 12/04/2022] Open
Abstract
In this study, we utilized human DNA topoisomerase IIα as a model target to outline a dynophore-based approach to catalytic inhibitor design. Based on MD simulations of a known catalytic inhibitor and the native ATP ligand analog, AMP-PNP, we derived a joint dynophore model that supplements the static structure-based-pharmacophore information with a dynamic component. Subsequently, derived pharmacophore models were employed in a virtual screening campaign of a library of natural compounds. Experimental evaluation identified flavonoid compounds with promising topoisomerase IIα catalytic inhibition and binding studies confirmed interaction with the ATPase domain. We constructed a binding model through docking and extensively investigated it with molecular dynamics MD simulations, essential dynamics, and MM-GBSA free energy calculations, thus reconnecting the new results to the initial dynophore-based screening model. We not only demonstrate a new design strategy that incorporates a dynamic component of molecular recognition, but also highlight new derivates in the established flavonoid class of topoisomerase II inhibitors.
Collapse
Affiliation(s)
- Matej Janežič
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
- Laboratory for Structural Bioinformatics, RIKEN Center for Biosystems Dynamics Research, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan
| | - Katja Valjavec
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
| | - Kaja Bergant Loboda
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
- Faculty of Pharmacy, University of Ljubljana, Aškerčeva 7, SI-1000 Ljubljana, Slovenia
| | - Barbara Herlah
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
- Faculty of Pharmacy, University of Ljubljana, Aškerčeva 7, SI-1000 Ljubljana, Slovenia
| | - Iza Ogris
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
- Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana, Slovenia
| | - Mirijam Kozorog
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
| | - Marjetka Podobnik
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
| | - Simona Golič Grdadolnik
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
| | - Gerhard Wolber
- Institute of Pharmacy, Freie Universität Berlin, Königin-Luise-Straße 2-4, 14195 Berlin, Germany;
| | - Andrej Perdih
- National Institute of Chemistry, Hajdrihova 19, SI-1000 Ljubljana, Slovenia; (M.J.); (K.V.); (K.B.L.); (B.H.); (I.O.); (M.K.); (M.P.); (S.G.G.)
- Faculty of Pharmacy, University of Ljubljana, Aškerčeva 7, SI-1000 Ljubljana, Slovenia
- Correspondence: ; Tel.: +386-1-4760-376
| |
Collapse
|
15
|
Hall-Swan S, Devaurs D, Rigo MM, Antunes DA, Kavraki LE, Zanatta G. DINC-COVID: A webserver for ensemble docking with flexible SARS-CoV-2 proteins. Comput Biol Med 2021; 139:104943. [PMID: 34717233 PMCID: PMC8518241 DOI: 10.1016/j.compbiomed.2021.104943] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 09/27/2021] [Accepted: 10/11/2021] [Indexed: 12/16/2022]
Abstract
An unprecedented research effort has been undertaken in response to the ongoing COVID-19 pandemic. This has included the determination of hundreds of crystallographic structures of SARS-CoV-2 proteins, and numerous virtual screening projects searching large compound libraries for potential drug inhibitors. Unfortunately, these initiatives have had very limited success in producing effective inhibitors against SARS-CoV-2 proteins. A reason might be an often overlooked factor in these computational efforts: receptor flexibility. To address this issue we have implemented a computational tool for ensemble docking with SARS-CoV-2 proteins. We have extracted representative ensembles of protein conformations from the Protein Data Bank and from in silico molecular dynamics simulations. Twelve pre-computed ensembles of SARS-CoV-2 protein conformations have now been made available for ensemble docking via a user-friendly webserver called DINC-COVID (dinc-covid.kavrakilab.org). We have validated DINC-COVID using data on tested inhibitors of two SARS-CoV-2 proteins, obtaining good correlations between docking-derived binding energies and experimentally-determined binding affinities. Some of the best results have been obtained on a dataset of large ligands resolved via room temperature crystallography, and therefore capturing alternative receptor conformations. In addition, we have shown that the ensembles available in DINC-COVID capture different ranges of receptor flexibility, and that this diversity is useful in finding alternative binding modes of ligands. Overall, our work highlights the importance of accounting for receptor flexibility in docking studies, and provides a platform for the identification of new inhibitors against SARS-CoV-2 proteins.
Collapse
Affiliation(s)
- Sarah Hall-Swan
- Department of Computer Science, Rice University, Houston, 77005, Texas, United States
| | - Didier Devaurs
- MRC Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, United Kingdom
| | - Mauricio M. Rigo
- Department of Computer Science, Rice University, Houston, 77005, Texas, United States
| | - Dinler A. Antunes
- Department of Computer Science, Rice University, Houston, 77005, Texas, United States,Department of Biology and Biochemistry, University of Houston, Houston, 77005, Texas, United States,Corresponding author. Department of Computer Science, Rice University, Houston, 77005, Texas, United States
| | - Lydia E. Kavraki
- Department of Computer Science, Rice University, Houston, 77005, Texas, United States,Corresponding author
| | - Geancarlo Zanatta
- Department of Physics, Federal University of Ceará, Fortaleza, CE, Brazil,Corresponding author
| |
Collapse
|
16
|
Ricci-Lopez J, Aguila SA, Gilson MK, Brizuela CA. Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning. J Chem Inf Model 2021; 61:5362-5376. [PMID: 34652141 DOI: 10.1021/acs.jcim.1c00511] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
One of the main challenges of structure-based virtual screening (SBVS) is the incorporation of the receptor's flexibility, as its explicit representation in every docking run implies a high computational cost. Therefore, a common alternative to include the receptor's flexibility is the approach known as ensemble docking. Ensemble docking consists of using a set of receptor conformations and performing the docking assays over each of them. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies exhibit slight improvement regarding the single-structure approach. Here, we claim that using machine learning (ML) methodologies over the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as study cases: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best performance case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML to improve the ensemble docking performance.
Collapse
Affiliation(s)
- Joel Ricci-Lopez
- Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California C.P. 22860, Mexico.,Centro de Nanociencias y Nanotecnología, Universidad Nacional Autónoma de México (UNAM), Ensenada, Baja California C.P. 22860, Mexico
| | - Sergio A Aguila
- Centro de Nanociencias y Nanotecnología, Universidad Nacional Autónoma de México (UNAM), Ensenada, Baja California C.P. 22860, Mexico
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, La Jolla, San Diego, California 92093, United States
| | - Carlos A Brizuela
- Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California C.P. 22860, Mexico
| |
Collapse
|
17
|
Bender BJ, Gahbauer S, Luttens A, Lyu J, Webb CM, Stein RM, Fink EA, Balius TE, Carlsson J, Irwin JJ, Shoichet BK. A practical guide to large-scale docking. Nat Protoc 2021; 16:4799-4832. [PMID: 34561691 PMCID: PMC8522653 DOI: 10.1038/s41596-021-00597-z] [Citation(s) in RCA: 184] [Impact Index Per Article: 61.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 06/22/2021] [Indexed: 02/08/2023]
Abstract
Structure-based docking screens of large compound libraries have become common in early drug and probe discovery. As computer efficiency has improved and compound libraries have grown, the ability to screen hundreds of millions, and even billions, of compounds has become feasible for modest-sized computer clusters. This allows the rapid and cost-effective exploration and categorization of vast chemical space into a subset enriched with potential hits for a given target. To accomplish this goal at speed, approximations are used that result in undersampling of possible configurations and inaccurate predictions of absolute binding energies. Accordingly, it is important to establish controls, as are common in other fields, to enhance the likelihood of success in spite of these challenges. Here we outline best practices and control docking calculations that help evaluate docking parameters for a given target prior to undertaking a large-scale prospective screen, with exemplification in one particular target, the melatonin receptor, where following this procedure led to direct docking hits with activities in the subnanomolar range. Additional controls are suggested to ensure specific activity for experimentally validated hit compounds. These guidelines should be useful regardless of the docking software used. Docking software described in the outlined protocol (DOCK3.7) is made freely available for academic research to explore new hits for a range of targets.
Collapse
Affiliation(s)
- Brian J Bender
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Stefan Gahbauer
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Andreas Luttens
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Jiankun Lyu
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Chase M Webb
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Reed M Stein
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Elissa A Fink
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Trent E Balius
- NCI RAS Initiative, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Leidos Biomedical Research, Inc, Frederick, MD, USA
| | - Jens Carlsson
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - John J Irwin
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA
| | - Brian K Shoichet
- Department of Pharmaceutical Chemistry, University of California-San Francisco, San Francisco, CA, USA.
| |
Collapse
|
18
|
Stanzione F, Giangreco I, Cole JC. Use of molecular docking computational tools in drug discovery. PROGRESS IN MEDICINAL CHEMISTRY 2021; 60:273-343. [PMID: 34147204 DOI: 10.1016/bs.pmch.2021.01.004] [Citation(s) in RCA: 125] [Impact Index Per Article: 41.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Molecular docking has become an important component of the drug discovery process. Since first being developed in the 1980s, advancements in the power of computer hardware and the increasing number of and ease of access to small molecule and protein structures have contributed to the development of improved methods, making docking more popular in both industrial and academic settings. Over the years, the modalities by which docking is used to assist the different tasks of drug discovery have changed. Although initially developed and used as a standalone method, docking is now mostly employed in combination with other computational approaches within integrated workflows. Despite its invaluable contribution to the drug discovery process, molecular docking is still far from perfect. In this chapter we will provide an introduction to molecular docking and to the different docking procedures with a focus on several considerations and protocols, including protonation states, active site waters and consensus, that can greatly improve the docking results.
Collapse
Affiliation(s)
| | - Ilenia Giangreco
- Cambridge Crystallographic Data Centre, Cambridge, United Kingdom
| | - Jason C Cole
- Cambridge Crystallographic Data Centre, Cambridge, United Kingdom
| |
Collapse
|
19
|
Chakraborti S, Chakraborty M, Bose A, Srinivasan N, Visweswariah SS. Identification of Potential Binders of Mtb Universal Stress Protein (Rv1636) Through an in silico Approach and Insights Into Compound Selection for Experimental Validation. Front Mol Biosci 2021; 8:599221. [PMID: 34012976 PMCID: PMC8126637 DOI: 10.3389/fmolb.2021.599221] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 03/01/2021] [Indexed: 12/28/2022] Open
Abstract
Millions of deaths caused by Mycobacterium tuberculosis (Mtb) are reported worldwide every year. Treatment of tuberculosis (TB) involves the use of multiple antibiotics over a prolonged period. However, the emergence of resistance leading to multidrug-resistant TB (MDR-TB) and extensively drug-resistant TB (XDR-TB) is the most challenging aspect of TB treatment. Therefore, there is a constant need to search for novel therapeutic strategies that could tackle the growing problem of drug resistance. One such strategy could be perturbing the functions of novel targets in Mtb, such as universal stress protein (USP, Rv1636), which binds to cAMP with a higher affinity than ATP. Orthologs of these proteins are conserved in all mycobacteria and act as “sink” for cAMP, facilitating the availability of this second messenger for signaling when required. Here, we have used the cAMP-bound crystal structure of USP from Mycobacterium smegmatis, a closely related homolog of Mtb, to conduct a structure-guided hunt for potential binders of Rv1636, primarily employing molecular docking approach. A library of 1.9 million compounds was subjected to virtual screening to obtain an initial set of ~2,000 hits. An integrative strategy that uses the available experimental data and consensus indications from other computational analyses has been employed to prioritize 22 potential binders of Rv1636 for experimental validations. Binding affinities of a few compounds among the 22 prioritized compounds were tested through microscale thermophoresis assays, and two compounds of natural origin showed promising binding affinities with Rv1636. We believe that this study provides an important initial guidance to medicinal chemists and biochemists to synthesize and test an enriched set of compounds that have the potential to inhibit Mtb USP (Rv1636), thereby aiding the development of novel antitubercular lead candidates.
Collapse
Affiliation(s)
- Sohini Chakraborti
- Molecular Biophysics Unit, Indian Institute of Science, Bengaluru, India
| | - Moubani Chakraborty
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science, Bengaluru, India
| | - Avipsa Bose
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science, Bengaluru, India
| | | | - Sandhya S Visweswariah
- Department of Molecular Reproduction, Development and Genetics, Indian Institute of Science, Bengaluru, India
| |
Collapse
|
20
|
Djokovic N, Ruzic D, Djikic T, Cvijic S, Ignjatovic J, Ibric S, Baralic K, Buha Djordjevic A, Curcic M, Djukic‐Cosic D, Nikolic K. An Integrative in silico Drug Repurposing Approach for Identification of Potential Inhibitors of SARS-CoV-2 Main Protease. Mol Inform 2021; 40:e2000187. [PMID: 33787066 PMCID: PMC8250230 DOI: 10.1002/minf.202000187] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 02/15/2021] [Indexed: 12/25/2022]
Abstract
Considering the urgent need for novel therapeutics in ongoing COVID-19 pandemic, drug repurposing approach might offer rapid solutions comparing to de novo drug design. In this study, we designed an integrative in silico drug repurposing approach for rapid selection of potential candidates against SARS-CoV-2 Main Protease (Mpro ). To screen FDA-approved drugs, we implemented structure-based molecular modelling techniques, physiologically-based pharmacokinetic (PBPK) modelling of drugs disposition and data mining analysis of drug-gene-COVID-19 association. Through presented approach, we selected the most promising FDA approved drugs for further COVID-19 drug development campaigns and analysed them in context of available experimental data. To the best of our knowledge, this is unique in silico study which integrates structure-based molecular modeling of Mpro inhibitors with predictions of their tissue disposition, drug-gene-COVID-19 associations and prediction of pleiotropic effects of selected candidates.
Collapse
Affiliation(s)
- Nemanja Djokovic
- Department of Pharmaceutical ChemistryFaculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Dusan Ruzic
- Department of Pharmaceutical ChemistryFaculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Teodora Djikic
- Department of Pharmaceutical ChemistryFaculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Sandra Cvijic
- Department of Pharmaceutical Technology and CosmetologyUniversity of BelgradeFaculty of PharmacyVojvode Stepe 45011221BelgradeSerbia
| | - Jelisaveta Ignjatovic
- Department of Pharmaceutical Technology and CosmetologyUniversity of BelgradeFaculty of PharmacyVojvode Stepe 45011221BelgradeSerbia
| | - Svetlana Ibric
- Department of Pharmaceutical Technology and CosmetologyUniversity of BelgradeFaculty of PharmacyVojvode Stepe 45011221BelgradeSerbia
| | - Katarina Baralic
- Department of Toxicology “Akademik Danilo Soldatovic”Faculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Aleksandra Buha Djordjevic
- Department of Toxicology “Akademik Danilo Soldatovic”Faculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Marijana Curcic
- Department of Toxicology “Akademik Danilo Soldatovic”Faculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Danijela Djukic‐Cosic
- Department of Toxicology “Akademik Danilo Soldatovic”Faculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| | - Katarina Nikolic
- Department of Pharmaceutical ChemistryFaculty of PharmacyUniversity of BelgradeVojvode Stepe 45011221BelgradeSerbia
| |
Collapse
|
21
|
Chandak T, Wong CF. EDock-ML: A web server for using ensemble docking with machine learning to aid drug discovery. Protein Sci 2021; 30:1087-1097. [PMID: 33733530 DOI: 10.1002/pro.4065] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 03/07/2021] [Accepted: 03/15/2021] [Indexed: 01/08/2023]
Abstract
EDock-ML is a web server that facilitates the use of ensemble docking with machine learning to help decide whether a compound is worthwhile to be considered further in a drug discovery process. Ensemble docking provides an economical way to account for receptor flexibility in molecular docking. Machine learning improves the use of the resulting docking scores to evaluate whether a compound is likely to be useful. EDock-ML takes a bottom-up approach in which machine-learning models are developed one protein at a time to improve predictions for the proteins included in its database. Because the machine-learning models are intended to be used without changing the docking and model parameters with which the models were trained, novice users can use it directly without worrying about what parameters to choose. A user simply submits a compound specified by an ID from the ZINC database (Sterling, T.; Irwin, J. J., J Chem Inf Model 2015, 55[11], 2,324-2,337.) or upload a file prepared by a chemical drawing program and receives an output helping the user decide the likelihood of the compound to be active or inactive for a drug target. EDock-ML can be accessed freely at edock-ml.umsl.edu.
Collapse
Affiliation(s)
- Tanay Chandak
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri, USA
| | - Chung F Wong
- Department of Chemistry and Biochemistry, University of Missouri-St. Louis, St. Louis, Missouri, USA
| |
Collapse
|
22
|
Fischer A, Smieško M, Sellner M, Lill MA. Decision Making in Structure-Based Drug Discovery: Visual Inspection of Docking Results. J Med Chem 2021; 64:2489-2500. [PMID: 33617246 DOI: 10.1021/acs.jmedchem.0c02227] [Citation(s) in RCA: 85] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Molecular docking is a computational method widely used in drug discovery. Due to the inherent inaccuracies of molecular docking, visual inspection of binding modes is a crucial routine in the decision making process of computational medicinal chemists. Despite its apparent importance for medicinal chemistry projects, guidelines for the visual docking pose assessment have been hardly discussed in the literature. Here, we review the medicinal chemistry literature with the aim of identifying consistent principles for visual inspection, highlighting cases of its successful application, and discussing its limitations. In this context, we conducted a survey reaching experts in both academia and the pharmaceutical industry, which also included a challenge to distinguish native from incorrect poses. We were able to collect 93 expert opinions that offer valuable insights into visually supported decision-making processes. This perspective shall motivate discussions among experienced computational medicinal chemists and guide young scientists new to the field to stratify their compounds.
Collapse
Affiliation(s)
- André Fischer
- Computational Pharmacy, Department of Pharmaceutical Sciences, University of Basel, Klingelbergstrasse 61, 4056 Basel, Switzerland
| | - Martin Smieško
- Computational Pharmacy, Department of Pharmaceutical Sciences, University of Basel, Klingelbergstrasse 61, 4056 Basel, Switzerland
| | - Manuel Sellner
- Computational Pharmacy, Department of Pharmaceutical Sciences, University of Basel, Klingelbergstrasse 61, 4056 Basel, Switzerland
| | - Markus A Lill
- Computational Pharmacy, Department of Pharmaceutical Sciences, University of Basel, Klingelbergstrasse 61, 4056 Basel, Switzerland
| |
Collapse
|
23
|
Azhagiya Singam ER, Tachachartvanich P, Fourches D, Soshilov A, Hsieh JCY, La Merrill MA, Smith MT, Durkin KA. Structure-based virtual screening of perfluoroalkyl and polyfluoroalkyl substances (PFASs) as endocrine disruptors of androgen receptor activity using molecular docking and machine learning. ENVIRONMENTAL RESEARCH 2020; 190:109920. [PMID: 32795691 DOI: 10.1016/j.envres.2020.109920] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 07/04/2020] [Indexed: 06/11/2023]
Abstract
Perfluoroalkyl and polyfluoroalkyl substances (PFASs) pose a substantial threat as endocrine disruptors, and thus early identification of those that may interact with steroid hormone receptors, such as the androgen receptor (AR), is critical. In this study we screened 5,206 PFASs from the CompTox database against the different binding sites on the AR using both molecular docking and machine learning techniques. We developed support vector machine models trained on Tox21 data to classify the active and inactive PFASs for AR using different chemical fingerprints as features. The maximum accuracy was 95.01% and Matthew's correlation coefficient (MCC) was 0.76 respectively, based on MACCS fingerprints (MACCSFP). The combination of docking-based screening and machine learning models identified 29 PFASs that have strong potential for activity against the AR and should be considered priority chemicals for biological toxicity testing.
Collapse
Affiliation(s)
| | | | - Denis Fourches
- Department of Chemistry, North Carolina State University, Raleigh, NC, USA
| | - Anatoly Soshilov
- Division of Scientific Programs, Pesticide and Environmental Toxicology Branch, Water Toxicology Section, Office of Environmental Health Hazard Assessment, California Environmental Protection Agency, USA
| | - Jennifer C Y Hsieh
- Division of Scientific Programs, Reproductive and Cancer Hazard Assessment Branch, Cancer Toxicology and Epidemiology Section, Office of Environmental Health Hazard Assessment, California Environmental Protection Agency, USA
| | - Michele A La Merrill
- Department of Environmental Toxicology, University of California, Davis, CA, USA
| | - Martyn T Smith
- Division of Environmental Health Sciences, School of Public Health, University of California, Berkeley, CA, USA.
| | - Kathleen A Durkin
- Molecular Graphics and Computation Facility, College of Chemistry, University of California, Berkeley, CA, USA.
| |
Collapse
|
24
|
Singh N, Decroly E, Khatib AM, Villoutreix BO. Structure-based drug repositioning over the human TMPRSS2 protease domain: search for chemical probes able to repress SARS-CoV-2 Spike protein cleavages. Eur J Pharm Sci 2020; 153:105495. [PMID: 32730844 PMCID: PMC7384984 DOI: 10.1016/j.ejps.2020.105495] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 07/16/2020] [Accepted: 07/27/2020] [Indexed: 12/28/2022]
Abstract
In December 2019, a new coronavirus was identified in the Hubei province of central china and named SARS-CoV-2. This new virus induces COVID-19, a severe respiratory disease with high death rate. A putative target to interfere with the virus is the host transmembrane serine protease family member II (TMPRSS2). This enzyme is critical for the entry of coronaviruses into human cells by cleaving and activating the spike protein (S) of SARS-CoV-2. Repositioning approved, investigational and experimental drugs on the serine protease domain of TMPRSS2 could thus be valuable. There is no experimental structure for TMPRSS2 but it is possible to develop quality structural models for the serine protease domain using comparative modeling strategies as such domains are highly structurally conserved. Beside the TMPRSS2 catalytic site, we predicted on our structural models a main exosite that could be important for the binding of protein partners and/or substrates. To block the catalytic site or the exosite of TMPRSS2 we used structure-based virtual screening computations and two different collections of approved, investigational and experimental drugs. We propose a list of 156 molecules that could bind to the catalytic site and 100 compounds that may interact with the exosite. These small molecules should now be tested in vitro to gain novel insights over the roles of TMPRSS2 or as starting point for the development of second generation analogs.
Collapse
Affiliation(s)
- Natesh Singh
- Univ. Lille, INSERM, Institut Pasteur de Lille, U1177, F-59000 Lille, France
| | | | - Abdel-Majid Khatib
- Univ. Bordeaux, Allée Geoffroy St Hilaire, 33615 Pessac, France
- INSERM, LAMC, UMR 1029, Allée Geoffroy St Hilaire, 33615 Pessac, France
- Corresponding authors.
| | - Bruno O. Villoutreix
- Univ. Lille, INSERM, Institut Pasteur de Lille, U1177, F-59000 Lille, France
- Corresponding authors.
| |
Collapse
|
25
|
Saikia S, Bordoloi M. Molecular Docking: Challenges, Advances and its Use in Drug Discovery Perspective. Curr Drug Targets 2020; 20:501-521. [PMID: 30360733 DOI: 10.2174/1389450119666181022153016] [Citation(s) in RCA: 203] [Impact Index Per Article: 50.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2018] [Revised: 06/08/2018] [Accepted: 08/28/2018] [Indexed: 01/21/2023]
Abstract
Molecular docking is a process through which small molecules are docked into the macromolecular structures for scoring its complementary values at the binding sites. It is a vibrant research area with dynamic utility in structure-based drug-designing, lead optimization, biochemical pathway and for drug designing being the most attractive tools. Two pillars for a successful docking experiment are correct pose and affinity prediction. Each program has its own advantages and drawbacks with respect to their docking accuracy, ranking accuracy and time consumption so a general conclusion cannot be drawn. Moreover, users don't always consider sufficient diversity in their test sets which results in certain programs to outperform others. In this review, the prime focus has been laid on the challenges of docking and troubleshooters in existing programs, underlying algorithmic background of docking, preferences regarding the use of docking programs for best results illustrated with examples, comparison of performance for existing tools and algorithms, state of art in docking, recent trends of diseases and current drug industries, evidence from clinical trials and post-marketing surveillance are discussed. These aspects of the molecular drug designing paradigm are quite controversial and challenging and this review would be an asset to the bioinformatics and drug designing communities.
Collapse
Affiliation(s)
- Surovi Saikia
- Natural Products Chemistry Group, CSIR North East Institute of Science & Technology, Jorhat-785006, Assam, India
| | - Manobjyoti Bordoloi
- Natural Products Chemistry Group, CSIR North East Institute of Science & Technology, Jorhat-785006, Assam, India
| |
Collapse
|
26
|
Chandak T, Mayginnes JP, Mayes H, Wong CF. Using machine learning to improve ensemble docking for drug discovery. Proteins 2020; 88:1263-1270. [PMID: 32401384 DOI: 10.1002/prot.25899] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 04/09/2020] [Accepted: 05/07/2020] [Indexed: 01/26/2023]
Abstract
Ensemble docking has provided an inexpensive method to account for receptor flexibility in molecular docking for virtual screening. Unfortunately, as there is no rigorous theory to connect the docking scores from multiple structures to measured activity, researchers have not yet come up with effective ways to use these scores to classify compounds into actives and inactives. This shortcoming has led to the decrease, rather than an increase in the performance of classifying compounds when more structures are added to the ensemble. Previously, we suggested machine learning, implemented in the form of a naïve Bayesian model could alleviate this problem. However, the naïve Bayesian model assumed that the probabilities of observing the docking scores to different structures to be independent. This approximation might prevent it from achieving even higher performance. In the work presented in this paper, we have relaxed this approximation when using several other machine learning methods-k nearest neighbor, logistic regression, support vector machine, and random forest-to improve ensemble docking. We found significant improvement.
Collapse
Affiliation(s)
- Tanay Chandak
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri-St. Louis, Saint Louis, Missouri, USA
| | - John P Mayginnes
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri-St. Louis, Saint Louis, Missouri, USA
| | - Howard Mayes
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri-St. Louis, Saint Louis, Missouri, USA
| | - Chung F Wong
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri-St. Louis, Saint Louis, Missouri, USA
| |
Collapse
|
27
|
Ensemble docking-based virtual screening toward identifying inhibitors against Wee1 kinase. Future Med Chem 2020; 11:1889-1906. [PMID: 31517534 DOI: 10.4155/fmc-2019-0022] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Aim: Wee1 kinase plays a key role in the arrest of G2/M checkpoint that prevents mitotic entry in response to DNA damage. This work is to discover potent Wee1 inhibitors which can be considered valuable. Materials & Methods: Herein, Ensemble docking using multiple crystal structures was considered an effective strategy in the virtual screening. The performance of 17 scoring functions obtained from different docking software was evaluated for molecular docking. Results: Two novel compounds B1 and A2 were identified as Wee1 inhibitors with IC50 values of 10.23 ± 0.505 and 8.72 ± 0.323 μM, respectively. Further cell viability assay demonstrated that the two active compounds exhibited good anticancer activities. Conclusion: This provides a meaningful starting point for further structure optimization to discover more potent Wee1 inhibitors.
Collapse
|
28
|
Vásquez AF, Reyes Muñoz A, Duitama J, González Barrios A. Discovery of new potential CDK2/VEGFR2 type II inhibitors by fragmentation and virtual screening of natural products. J Biomol Struct Dyn 2020; 39:3285-3299. [DOI: 10.1080/07391102.2020.1763839] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Andrés Felipe Vásquez
- Grupo de Diseño de Productos y Procesos (GDPP), Department of Chemical Engineering, Universidad de los Andes, Bogotá, Colombia
| | - Alejandro Reyes Muñoz
- Grupo de Biología Computacional Ecología Microbiana (BCEM), Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia
- Max Planck Tandem Group in Computational Biology, Universidad de los Andes, Bogotá, Colombia
| | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de los Andes, Bogotá, Colombia
| | - Andrés González Barrios
- Grupo de Diseño de Productos y Procesos (GDPP), Department of Chemical Engineering, Universidad de los Andes, Bogotá, Colombia
| |
Collapse
|
29
|
Rallabandi HR, Ganesan P, Kim YJ. Targeting the C-Terminal Domain Small Phosphatase 1. Life (Basel) 2020; 10:life10050057. [PMID: 32397221 PMCID: PMC7281111 DOI: 10.3390/life10050057] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Revised: 05/05/2020] [Accepted: 05/07/2020] [Indexed: 12/15/2022] Open
Abstract
The human C-terminal domain small phosphatase 1 (CTDSP1/SCP1) is a protein phosphatase with a conserved catalytic site of DXDXT/V. CTDSP1’s major activity has been identified as dephosphorylation of the 5th Ser residue of the tandem heptad repeat of the RNA polymerase II C-terminal domain (RNAP II CTD). It is also implicated in various pivotal biological activities, such as acting as a driving factor in repressor element 1 (RE-1)-silencing transcription factor (REST) complex, which silences the neuronal genes in non-neuronal cells, G1/S phase transition, and osteoblast differentiation. Recent findings have denoted that negative regulation of CTDSP1 results in suppression of cancer invasion in neuroglioma cells. Several researchers have focused on the development of regulating materials of CTDSP1, due to the significant roles it has in various biological activities. In this review, we focused on this emerging target and explored the biological significance, challenges, and opportunities in targeting CTDSP1 from a drug designing perspective.
Collapse
|
30
|
Santos KB, Guedes IA, Karl ALM, Dardenne LE. Highly Flexible Ligand Docking: Benchmarking of the DockThor Program on the LEADS-PEP Protein-Peptide Data Set. J Chem Inf Model 2020; 60:667-683. [PMID: 31922754 DOI: 10.1021/acs.jcim.9b00905] [Citation(s) in RCA: 104] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Protein-peptide interactions play a crucial role in many cellular and biological functions, which justify the increasing interest in the development of peptide-based drugs. However, predicting experimental binding modes and affinities in protein-peptide docking remains a great challenge for most docking programs due to some particularities of this class of ligands, such as the high degree of flexibility. In this paper, we present the performance of the DockThor program on the LEADS-PEP data set, a benchmarking set composed of 53 diverse protein-peptide complexes with peptides ranging from 3 to 12 residues and with up to 51 rotatable bonds. The DockThor performance for pose prediction on redocking studies was compared with some state-of-the-art docking programs that were also evaluated on the LEADS-PEP data set, AutoDock, AutoDock Vina, Surflex, GOLD, Glide, rDock, and DINC, as well as with the task-specific docking protocol HPepDock. Our results indicate that DockThor could dock 40% of the cases with an overall backbone RMSD below 2.5 Å when the top-scored docking pose was considered, exhibiting similar results to Glide and outperforming other protein-ligand docking programs, whereas rDock and HPepDock achieved superior results. Assessing the docking poses closest to the crystal structure (i.e., best-RMSD pose), DockThor achieved a success rate of 60% in pose prediction. Due to the great overall performance of handling peptidic compounds, the DockThor program can be considered as suitable for docking highly flexible and challenging ligands, with up to 40 rotatable bonds. DockThor is freely available as a virtual screening Web server at https://www.dockthor.lncc.br/ .
Collapse
Affiliation(s)
- Karina B Santos
- National Laboratory for Scientific Computing - LNCC , Petrópolis , Rio de Janeiro 25651-075 , Brazil
| | - Isabella A Guedes
- National Laboratory for Scientific Computing - LNCC , Petrópolis , Rio de Janeiro 25651-075 , Brazil
| | - Ana L M Karl
- National Laboratory for Scientific Computing - LNCC , Petrópolis , Rio de Janeiro 25651-075 , Brazil
| | - Laurent E Dardenne
- National Laboratory for Scientific Computing - LNCC , Petrópolis , Rio de Janeiro 25651-075 , Brazil
| |
Collapse
|
31
|
Fan N, Bauer CA, Stork C, de Bruyn Kops C, Kirchmair J. ALADDIN: Docking Approach Augmented by Machine Learning for Protein Structure Selection Yields Superior Virtual Screening Performance. Mol Inform 2019; 39:e1900103. [PMID: 31663691 PMCID: PMC7187304 DOI: 10.1002/minf.201900103] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 10/28/2019] [Indexed: 01/16/2023]
Abstract
Protein flexibility and solvation pose major challenges to docking algorithms and scoring functions. One established strategy for addressing these challenges is to use multiple protein conformations for docking (all-against-all ensemble docking). Recent studies have shown that the performance of ensemble docking can be improved by selecting the most relevant protein structures for docking. In search for a robust approach to protein structure selection, we have come up with an integrated mAchine Learning AnD DockINg approach (ALADDIN). ALADDIN employs a battery of random forest classifiers to select, individually for each compound of interest, from an ensemble of protein structures, the single most suitable protein structure for docking. ALADDIN outperformed the best single-structure docking runs, ensemble docking and a similarity-based docking approach on three out of four investigated targets, with up to 0.15, 0.11 and 0.16 higher area under the receiver operating characteristic curve (AUC) values, respectively. Only in the case of cytochrome P450 3A4, ALADDIN, like any of the other tested approaches, failed to obtain decent performance. ALADDIN can be particularly useful for structure-based virtual screening of malleable proteins, including kinases, some viral enzymes and anti-targets.
Collapse
Affiliation(s)
- Ningning Fan
- Universität Hamburg, Faculty of Mathematics, Informatics and Natural Sciences, Department of Informatics, Center for Bioinformatics, 20146, Hamburg, Germany
| | - Christoph A Bauer
- University of Bergen, Department of Chemistry, N-5020, Bergen, Norway.,University of Bergen, Computational Biology Unit (CBU), N-5020, Bergen, Norway
| | - Conrad Stork
- Universität Hamburg, Faculty of Mathematics, Informatics and Natural Sciences, Department of Informatics, Center for Bioinformatics, 20146, Hamburg, Germany
| | - Christina de Bruyn Kops
- Universität Hamburg, Faculty of Mathematics, Informatics and Natural Sciences, Department of Informatics, Center for Bioinformatics, 20146, Hamburg, Germany
| | - Johannes Kirchmair
- Universität Hamburg, Faculty of Mathematics, Informatics and Natural Sciences, Department of Informatics, Center for Bioinformatics, 20146, Hamburg, Germany.,University of Bergen, Department of Chemistry, N-5020, Bergen, Norway.,University of Bergen, Computational Biology Unit (CBU), N-5020, Bergen, Norway
| |
Collapse
|
32
|
Preto J, Gentile F. Assessing and improving the performance of consensus docking strategies using the DockBox package. J Comput Aided Mol Des 2019; 33:817-829. [DOI: 10.1007/s10822-019-00227-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 09/26/2019] [Indexed: 10/25/2022]
|
33
|
Wong CF. Improving ensemble docking for drug discovery by machine learning. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2019. [DOI: 10.1142/s0219633619200013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Ensemble docking has provided an inexpensive method to account for receptor flexibility in molecular docking. However, it is still unclear how best to use the docking scores from multiple structures to classify compounds into actives and inactives. Previous studies have also found that the performance of classification could decrease rather than increase with the number of structures included in the ensemble. Machine learning could help to alleviate these problems.
Collapse
Affiliation(s)
- Chung F. Wong
- Department of Chemistry and Biochemistry and Center for Nanoscience, University of Missouri-St. Louis, Saint Louis, MO 63121, USA
| |
Collapse
|
34
|
Slater O, Kontoyianni M. The compromise of virtual screening and its impact on drug discovery. Expert Opin Drug Discov 2019; 14:619-637. [PMID: 31025886 DOI: 10.1080/17460441.2019.1604677] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Introduction: Docking and structure-based virtual screening (VS) have been standard approaches in structure-based design for over two decades. However, our understanding of the limitations, potential, and strength of these techniques has enhanced, raising expectations. Areas covered: Based on a survey of reports in the past five years, we assess whether VS: (1) predicts binding poses in agreement with crystallographic data (when available); (2) is a superior screening tool, as often claimed; (3) is successful in identifying chemical scaffolds that can be starting points for subsequent lead optimization cycles. Data shows that knowledge of the target and its chemotypes in postprocessing lead to viable hits in early drug discovery endeavors. Expert opinion: VS is capable of accurate placements in the pocket for the most part, but does not consistently score screening collections accurately. What matters is capitalization on available resources to get closer to a viable lead or optimizable series. Integration of approaches, subjective hit selection guided by knowledge of the receptor or endogenous ligand, libraries driven by experimental guides, validation studies to identify the best docking/scoring that reproduces experimental findings, constraints regarding receptor-ligand interactions, thoroughly designed methodologies, and predefined cutoff scoring criteria strengthen VS's position in pharmaceutical research.
Collapse
Affiliation(s)
- Olivia Slater
- a Department of Pharmaceutical Sciences , Southern Illinois University Edwardsville , Edwardsville , IL , USA
| | - Maria Kontoyianni
- a Department of Pharmaceutical Sciences , Southern Illinois University Edwardsville , Edwardsville , IL , USA
| |
Collapse
|
35
|
Palacio-Rodríguez K, Lans I, Cavasotto CN, Cossio P. Exponential consensus ranking improves the outcome in docking and receptor ensemble docking. Sci Rep 2019; 9:5142. [PMID: 30914702 PMCID: PMC6435795 DOI: 10.1038/s41598-019-41594-3] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Accepted: 03/04/2019] [Indexed: 12/21/2022] Open
Abstract
Consensus-scoring methods are commonly used with molecular docking in virtual screening campaigns to filter potential ligands for a protein target. Traditional consensus methods combine results from different docking programs by averaging the score or rank of each molecule obtained from individual programs. Unfortunately, these methods fail if one of the docking programs has poor performance, which is likely to occur due to training-set dependencies and scoring-function parameterization. In this work, we introduce a novel consensus method that overcomes these limitations. We combine the results from individual docking programs using a sum of exponential distributions as a function of the molecule rank for each program. We test the method over several benchmark systems using individual and ensembles of target structures from diverse protein families with challenging decoy/ligand datasets. The results demonstrate that the novel method outperforms the best traditional consensus strategies over a wide range of systems. Moreover, because the novel method is based on the rank rather than the score, it is independent of the score units, scales and offsets, which can hinder the combination of results from different structures or programs. Our method is simple and robust, providing a theoretical basis not only for molecular docking but also for any consensus strategy in general.
Collapse
Affiliation(s)
- Karen Palacio-Rodríguez
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia, Medellín, Colombia
| | - Isaias Lans
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia, Medellín, Colombia
| | - Claudio N Cavasotto
- Computational Drug Design and Drug Discovery Informatics Laboratory, Translational Medicine Research Institute (IIMT), CONICET-Universidad Austral, Pilar-Derqui, Buenos Aires, Argentina. .,Facultad de Ciencias Biomédicas, Universidad Austral, Pilar-Derqui, Buenos Aires, Argentina. .,Facultad de Ingeniería, Universidad Austral, Pilar-Derqui, Buenos Aires, Argentina.
| | - Pilar Cossio
- Biophysics of Tropical Diseases Max Planck Tandem Group, University of Antioquia, Medellín, Colombia. .,Department of Theoretical Biophysics, Max Planck Institute of Biophysics, 60438, Frankfurt am Main, Germany.
| |
Collapse
|
36
|
Abstract
Computational methods, applied at the early stages of the drug design process, use current technology to provide valuable insights into the understanding of chemical systems in a virtual manner, complementing experimental analysis. Molecular docking is an in silico method employed to foresee binding modes of small compounds or macromolecules in contact with a receptor and to predict their molecular interactions. Moreover, the methodology opens up the possibility of ranking these compounds according to a hierarchy determined using particular scoring functions. Docking protocols assign many approximations, and most of them lack receptor flexibility. Therefore, the reliability of the resulting protein-ligand complexes is uncertain. The association with the costly but more accurate MD techniques provides significant complementary with docking. MD simulations can be used before docking since a series of "new" and broader protein conformations can be extracted from the processing of the resulting trajectory and employed as targets for docking. They also can be utilized a posteriori to optimize the structures of the final complexes from docking, calculate more detailed interaction energies, and provide information about the ligand binding mechanism. Here, we focus on protocols that offer the docking-MD combination as a logical approach to improving the drug discovery process.
Collapse
|
37
|
Mavrogeni ME, Pronios F, Zareifi D, Vasilakaki S, Lozach O, Alexopoulos L, Meijer L, Myrianthopoulos V, Mikros E. A facile consensus ranking approach enhances virtual screening robustness and identifies a cell-active DYRK1α inhibitor. Future Med Chem 2018; 10:2411-2430. [PMID: 30325204 PMCID: PMC6479281 DOI: 10.4155/fmc-2018-0198] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2018] [Accepted: 08/16/2018] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Virtual screening is vital for contemporary drug discovery but striking performance fluctuations are commonly encountered, thus hampering error-free use. Results and Methodology: A conceptual framework is suggested for combining screening algorithms characterized by orthogonality (docking-scoring calculations, 3D shape similarity, 2D fingerprint similarity) into a simple, efficient and expansible python-based consensus ranking scheme. An original experimental dataset is created for comparing individual screening methods versus the novel approach. Its utilization leads to identification and phosphoproteomic evaluation of a cell-active DYRK1α inhibitor. CONCLUSION Consensus ranking considerably stabilizes screening performance at reasonable computational cost, whereas individual screens are heavily dependent on calculation settings. Results indicate that the novel approach, currently available as a free online tool, is highly suitable for prospective screening by nonexperts.
Collapse
Affiliation(s)
- Maria E Mavrogeni
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece
| | - Filippos Pronios
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece
| | - Danae Zareifi
- ProtATonce Ltd, Dimokritos Science Park, Agia Paraskevi, 153 43 Athens, Greece
| | - Sofia Vasilakaki
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece
| | - Olivier Lozach
- Laboratoire Chimie Electrochimie Moléculaires et Chimie Analytique, University of Brest, 29238 Brest, France
| | - Leonidas Alexopoulos
- School of Mechanical Engineering, National Technical University of Athens, 157 80 Athens, Greece
| | - Laurent Meijer
- ManRos Therapeutics, Perharidy Research Center, 29680 Roscoff, Bretagne, France
| | - Vassilios Myrianthopoulos
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece
- ‘Athena’ Research & Innovation Center, 151 25 Athens, Greece
| | - Emmanuel Mikros
- Department of Pharmaceutical Chemistry, School of Pharmacy, University of Athens, Panepistimiopolis Zografou, 157 71 Athens, Greece
- ‘Athena’ Research & Innovation Center, 151 25 Athens, Greece
| |
Collapse
|
38
|
Xie B, Clark JD, Minh DDL. Efficiency of Stratification for Ensemble Docking Using Reduced Ensembles. J Chem Inf Model 2018; 58:1915-1925. [PMID: 30114370 PMCID: PMC6338335 DOI: 10.1021/acs.jcim.8b00314] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Molecular docking can account for receptor flexibility by combining the docking score over multiple rigid receptor conformations, such as snapshots from a molecular dynamics simulation. Here, we evaluate a number of common snapshot selection strategies using a quality metric from stratified sampling, the efficiency of stratification, which compares the variance of a selection strategy to simple random sampling. We also extend the metric to estimators of exponential averages (which involve an exponential transformation, averaging, and inverse transformation) and minima. For docking sets of over 500 ligands to four different proteins of varying flexibility, we observe that, for estimating ensemble averages and exponential averages, many clustering algorithms have similar performance trends: for a few snapshots (less than 25), medoids are the most efficient, while, for a larger number, optimal (the allocation that minimizes the variance) and proportional (to the size of each cluster) allocation become more efficient. Proportional allocation appears to be the most consistently efficient for estimating minima.
Collapse
Affiliation(s)
- Bing Xie
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| | - John D. Clark
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| | - David D. L. Minh
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| |
Collapse
|
39
|
Lagarde N, Rey J, Gyulkhandanyan A, Tufféry P, Miteva MA, Villoutreix BO. Online structure-based screening of purchasable approved drugs and natural compounds: retrospective examples of drug repositioning on cancer targets. Oncotarget 2018; 9:32346-32361. [PMID: 30190791 PMCID: PMC6122352 DOI: 10.18632/oncotarget.25966] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2018] [Accepted: 07/31/2018] [Indexed: 12/11/2022] Open
Abstract
Drug discovery is a long and difficult process that benefits from the integration of virtual screening methods in experimental screening campaigns such as to generate testable hypotheses, accelerate and/or reduce the cost of drug development. Current drug attrition rate is still a major issue in all therapeutic areas and especially in the field of cancer. Drug repositioning as well as the screening of natural compounds constitute promising approaches to accelerate and improve the success rate of drug discovery. We developed three compounds libraries of purchasable compounds: Drugs-lib, FOOD-lib and NP-lib that contain approved drugs, food constituents and natural products, respectively, that are optimized for structure-based virtual screening studies. The three compounds libraries are implemented in the MTiOpenScreen web server that allows users to perform structure-based virtual screening computations on their selected protein targets. The server outputs a list of 1,500 molecules with predicted binding scores that can then be processed further by the users and purchased for experimental validation. To illustrate the potential of our service for drug repositioning endeavours, we selected five recently published drugs that have been repositioned in vitro and/or in vivo on cancer targets. For each drug, we used the MTiOpenScreen service to screen the Drugs-lib collection against the corresponding anti-cancer target and we show that our protocol is able to rank these drugs within the top ranked compounds. This web server should assist the discovery of promising molecules that could benefit patients, with faster development times, and reduced costs and risk.
Collapse
Affiliation(s)
- Nathalie Lagarde
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| | - Julien Rey
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| | - Aram Gyulkhandanyan
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| | - Pierre Tufféry
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| | - Maria A. Miteva
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| | - Bruno O. Villoutreix
- Université Paris Diderot, Sorbonne Paris Cité, Molécules Thérapeutiques In Silico, INSERM UMR-S 973, Paris, France
- INSERM, U973, Paris, France
| |
Collapse
|
40
|
Pinzi L, Caporuscio F, Rastelli G. Selection of protein conformations for structure-based polypharmacology studies. Drug Discov Today 2018; 23:1889-1896. [PMID: 30099123 DOI: 10.1016/j.drudis.2018.08.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 08/03/2018] [Accepted: 08/06/2018] [Indexed: 11/29/2022]
Abstract
Several drugs exert their therapeutic effect through the modulation of multiple targets. Structure-based approaches hold great promise for identifying compounds with the desired polypharmacological profiles. These methods use knowledge of the protein binding sites to identify stereoelectronically complementary ligands. The selection of the most suitable protein conformations to be used in the design process is vital, especially for multitarget drug design in which the same ligand has to be accommodated in multiple binding pockets. Herein, we focus on currently available techniques for the selection of the most suitable protein conformations for multitarget drug design, compare the potential advantages and limitations of each method, and comment on how their combination could help in polypharmacology drug design.
Collapse
Affiliation(s)
- Luca Pinzi
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125, Modena, Italy
| | - Fabiana Caporuscio
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125, Modena, Italy
| | - Giulio Rastelli
- Department of Life Sciences, University of Modena and Reggio Emilia, Via Giuseppe Campi 103, 41125, Modena, Italy.
| |
Collapse
|
41
|
Strecker C, Meyer B. Plasticity of the Binding Site of Renin: Optimized Selection of Protein Structures for Ensemble Docking. J Chem Inf Model 2018; 58:1121-1131. [PMID: 29683661 DOI: 10.1021/acs.jcim.8b00010] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Protein flexibility poses a major challenge to docking of potential ligands in that the binding site can adopt different shapes. Docking algorithms usually keep the protein rigid and only allow the ligand to be treated as flexible. However, a wrong assessment of the shape of the binding pocket can prevent a ligand from adapting a correct pose. Ensemble docking is a simple yet promising method to solve this problem: Ligands are docked into multiple structures, and the results are subsequently merged. Selection of protein structures is a significant factor for this approach. In this work we perform a comprehensive and comparative study evaluating the impact of structure selection on ensemble docking. We perform ensemble docking with several crystal structures and with structures derived from molecular dynamics simulations of renin, an attractive target for antihypertensive drugs. Here, 500 ns of MD simulations revealed binding site shapes not found in any available crystal structure. We evaluate the importance of structure selection for ensemble docking by comparing binding pose prediction, ability to rank actives above nonactives (screening utility), and scoring accuracy. As a result, for ensemble definition k-means clustering appears to be better suited than hierarchical clustering with average linkage. The best performing ensemble consists of four crystal structures and is able to reproduce the native ligand poses better than any individual crystal structure. Moreover this ensemble outperforms 88% of all individual crystal structures in terms of screening utility as well as scoring accuracy. Similarly, ensembles of MD-derived structures perform on average better than 75% of any individual crystal structure in terms of scoring accuracy at all inspected ensembles sizes.
Collapse
Affiliation(s)
- Claas Strecker
- Department of Chemistry , University of Hamburg , Martin-Luther-King-Platz 6 , 20146 Hamburg , Germany
| | - Bernd Meyer
- Department of Chemistry , University of Hamburg , Martin-Luther-King-Platz 6 , 20146 Hamburg , Germany
| |
Collapse
|
42
|
Ganser LR, Lee J, Rangadurai A, Merriman DK, Kelly ML, Kansal AD, Sathyamoorthy B, Al-Hashimi HM. High-performance virtual screening by targeting a high-resolution RNA dynamic ensemble. Nat Struct Mol Biol 2018; 25:425-434. [PMID: 29728655 PMCID: PMC5942591 DOI: 10.1038/s41594-018-0062-4] [Citation(s) in RCA: 65] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 03/27/2018] [Indexed: 12/22/2022]
Abstract
Dynamic ensembles hold great promise in advancing RNA-targeted drug discovery. Here we subjected the transactivation response element (TAR) RNA from human immunodeficiency virus type-1 to experimental high-throughput screening against ~100,000 drug-like small molecules. Results were augmented with 170 known TAR-binding molecules and used to generate sublibraries optimized for evaluating enrichment when virtually screening a dynamic ensemble of TAR determined by combining NMR spectroscopy data and molecular dynamics simulations. Ensemble-based virtual screening scores molecules with an area under the receiver operator characteristic curve of ~0.85-0.94 and with ~40-75% of all hits falling within the top 2% of scored molecules. The enrichment decreased significantly for ensembles generated from the same molecular dynamics simulations without input NMR data and for other control ensembles. The results demonstrate that experimentally determined RNA ensembles can significantly enrich libraries with true hits and that the degree of enrichment is dependent on the accuracy of the ensemble.
Collapse
Affiliation(s)
- Laura R Ganser
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | - Janghyun Lee
- Department of Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Atul Rangadurai
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | | | - Megan L Kelly
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | - Aman D Kansal
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA
| | | | - Hashim M Al-Hashimi
- Department of Biochemistry, Duke University School of Medicine, Durham, NC, USA.
- Department of Chemistry, Duke University, Durham, NC, USA.
| |
Collapse
|
43
|
Gioia D, Bertazzo M, Recanatini M, Masetti M, Cavalli A. Dynamic Docking: A Paradigm Shift in Computational Drug Discovery. Molecules 2017; 22:molecules22112029. [PMID: 29165360 PMCID: PMC6150405 DOI: 10.3390/molecules22112029] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Revised: 11/18/2017] [Accepted: 11/19/2017] [Indexed: 12/18/2022] Open
Abstract
Molecular docking is the methodology of choice for studying in silico protein-ligand binding and for prioritizing compounds to discover new lead candidates. Traditional docking simulations suffer from major limitations, mostly related to the static or semi-flexible treatment of ligands and targets. They also neglect solvation and entropic effects, which strongly limits their predictive power. During the last decade, methods based on full atomistic molecular dynamics (MD) have emerged as a valid alternative for simulating macromolecular complexes. In principle, compared to traditional docking, MD allows the full exploration of drug-target recognition and binding from both the mechanistic and energetic points of view (dynamic docking). Binding and unbinding kinetic constants can also be determined. While dynamic docking is still too computationally expensive to be routinely used in fast-paced drug discovery programs, the advent of faster computing architectures and advanced simulation methodologies are changing this scenario. It is feasible that dynamic docking will replace static docking approaches in the near future, leading to a major paradigm shift in in silico drug discovery. Against this background, we review the key achievements that have paved the way for this progress.
Collapse
Affiliation(s)
- Dario Gioia
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-Universita' di Bologna, via Belmeloro 6, I-40126 Bologna, Italy.
| | - Martina Bertazzo
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-Universita' di Bologna, via Belmeloro 6, I-40126 Bologna, Italy.
- Computational Sciences, Istituto Italiano di Tecnologia, via Morego 30, 16163 Genova, Italy.
| | - Maurizio Recanatini
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-Universita' di Bologna, via Belmeloro 6, I-40126 Bologna, Italy.
| | - Matteo Masetti
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-Universita' di Bologna, via Belmeloro 6, I-40126 Bologna, Italy.
| | - Andrea Cavalli
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-Universita' di Bologna, via Belmeloro 6, I-40126 Bologna, Italy.
- Computational Sciences, Istituto Italiano di Tecnologia, via Morego 30, 16163 Genova, Italy.
| |
Collapse
|
44
|
Wang N, Wang L, Xie XQ. ProSelection: A Novel Algorithm to Select Proper Protein Structure Subsets for in Silico Target Identification and Drug Discovery Research. J Chem Inf Model 2017; 57:2686-2698. [PMID: 29016123 DOI: 10.1021/acs.jcim.7b00277] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Molecular docking is widely applied to computer-aided drug design and has become relatively mature in the recent decades. Application of docking in modeling varies from single lead compound optimization to large-scale virtual screening. The performance of molecular docking is highly dependent on the protein structures selected. It is especially challenging for large-scale target prediction research when multiple structures are available for a single target. Therefore, we have established ProSelection, a docking preferred-protein selection algorithm, in order to generate the proper structure subset(s). By the ProSelection algorithm, protein structures of "weak selectors" are filtered out whereas structures of "strong selectors" are kept. Specifically, the structure which has a good statistical performance of distinguishing active ligands from inactive ligands is defined as a strong selector. In this study, 249 protein structures of 14 autophagy-related targets are investigated. Surflex-dock was used as the docking engine to distinguish active and inactive compounds against these protein structures. Both t test and Mann-Whitney U test were used to distinguish the strong from the weak selectors based on the normality of the docking score distribution. The suggested docking score threshold for active ligands (SDA) was generated for each strong selector structure according to the receiver operating characteristic (ROC) curve. The performance of ProSelection was further validated by predicting the potential off-targets of 43 U.S. Federal Drug Administration approved small molecule antineoplastic drugs. Overall, ProSelection will accelerate the computational work in protein structure selection and could be a useful tool for molecular docking, target prediction, and protein-chemical database establishment research.
Collapse
Affiliation(s)
- Nanyi Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy; NIH National Center of Excellence for Computational Drug Abuse Research; Drug Discovery Institute, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| | - Lirong Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy; NIH National Center of Excellence for Computational Drug Abuse Research; Drug Discovery Institute, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| | - Xiang-Qun Xie
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy; NIH National Center of Excellence for Computational Drug Abuse Research; Drug Discovery Institute, University of Pittsburgh , Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
45
|
Lam PCH, Abagyan R, Totrov M. Ligand-biased ensemble receptor docking (LigBEnD): a hybrid ligand/receptor structure-based approach. J Comput Aided Mol Des 2017; 32:187-198. [PMID: 28887659 PMCID: PMC5767200 DOI: 10.1007/s10822-017-0058-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Accepted: 08/30/2017] [Indexed: 11/29/2022]
Abstract
Ligand docking to flexible protein molecules can be efficiently carried out through ensemble docking to multiple protein conformations, either from experimental X-ray structures or from in silico simulations. The success of ensemble docking often requires the careful selection of complementary protein conformations, through docking and scoring of known co-crystallized ligands. False positives, in which a ligand in a wrong pose achieves a better docking score than that of native pose, arise as additional protein conformations are added. In the current study, we developed a new ligand-biased ensemble receptor docking method and composite scoring function which combine the use of ligand-based atomic property field (APF) method with receptor structure-based docking. This method helps us to correctly dock 30 out of 36 ligands presented by the D3R docking challenge. For the six mis-docked ligands, the cognate receptor structures prove to be too different from the 40 available experimental Pocketome conformations used for docking and could be identified only by receptor sampling beyond experimentally explored conformational subspace.
Collapse
Affiliation(s)
- Polo C-H Lam
- Molsoft L.L.C., 11199 Sorrento Valley Road, S209, San Diego, CA, 92121, USA
| | - Ruben Abagyan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, 92093, USA
| | - Maxim Totrov
- Molsoft L.L.C., 11199 Sorrento Valley Road, S209, San Diego, CA, 92121, USA.
| |
Collapse
|
46
|
Kumar A, Zhang KYJ. A cross docking pipeline for improving pose prediction and virtual screening performance. J Comput Aided Mol Des 2017; 32:163-173. [PMID: 28836076 DOI: 10.1007/s10822-017-0048-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Accepted: 08/18/2017] [Indexed: 02/02/2023]
Abstract
Pose prediction and virtual screening performance of a molecular docking method depend on the choice of protein structures used for docking. Multiple structures for a target protein are often used to take into account the receptor flexibility and problems associated with a single receptor structure. However, the use of multiple receptor structures is computationally expensive when docking a large library of small molecules. Here, we propose a new cross-docking pipeline suitable to dock a large library of molecules while taking advantage of multiple target protein structures. Our method involves the selection of a suitable receptor for each ligand in a screening library utilizing ligand 3D shape similarity with crystallographic ligands. We have prospectively evaluated our method in D3R Grand Challenge 2 and demonstrated that our cross-docking pipeline can achieve similar or better performance than using either single or multiple-receptor structures. Moreover, our method displayed not only decent pose prediction performance but also better virtual screening performance over several other methods.
Collapse
Affiliation(s)
- Ashutosh Kumar
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Center for Life Science Technologies, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan.
| |
Collapse
|
47
|
Identification of Novel Non-secosteroidal Vitamin D Receptor Agonists with Potent Cardioprotective Effects and devoid of Hypercalcemia. Sci Rep 2017; 7:8427. [PMID: 28814738 PMCID: PMC5559458 DOI: 10.1038/s41598-017-08670-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Accepted: 07/12/2017] [Indexed: 12/22/2022] Open
Abstract
Vitamin D regulates many biological processes, but its clinical utility is limited by its hypercalcemic effect. Using a virtual screening platform to search novel chemical probes that activate the vitamin D signaling, we report discovery of novel non-steroidal small-molecule compounds that activate the vitamin D receptor (VDR), but are devoid of hypercalcemia. A lead compound (known as VDR 4-1) demonstrated potent transcriptional activities in a VDR reporter gene assay, and significantly ameliorated cardiac hypertrophy in cell culture studies and in animal models. VDR 4-1 also effectively suppressed secondary hyperparathyroidism in 1α-hydroxylase knockout mice. In contrast to 1α,25-dihydroxyvitamin D3 (1,25-D3 or calcitriol), a naturally occurring VDR agonist, VDR 4-1 therapy even at high doses did not induce hypercalcemia. These findings were accompanied by a lack of upregulation of calcium transport genes in kidney and in the gut providing a mechanism for the lack of hypercalcemia. Furthermore, VDR 4-1 therapy significantly suppressed cardiac hypertrophy and progression to heart failure in both vitamin D deficient and normal mice without inducing significant hypercalcemia. In conclusion, we have identified a unique VDR agonist compound with beneficial effects in mouse models of hyperparathyroidism and heart failure without inducing significant hypercalcemia.
Collapse
|
48
|
De Vivo M, Cavalli A. Recent advances in dynamic docking for drug discovery. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2017. [DOI: 10.1002/wcms.1320] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Affiliation(s)
- Marco De Vivo
- Laboratory of Molecular Modeling and Drug DiscoveryIstituto Italiano di TecnologiaGenoaItaly
- IAS‐S/INM‐9 Computational BiomedicineForschungszentrum JülichJülichGermany
| | - Andrea Cavalli
- CompunetIstituto Italiano di TecnologiaGenoaItaly
- Department of Pharmacy and Biotechnology, Alma Mater StudiorumUniversity of BolognaBolognaItaly
| |
Collapse
|
49
|
Krishna S, Shukla S, Lakra AD, Meeran SM, Siddiqi MI. Identification of potent inhibitors of DNA methyltransferase 1 (DNMT1) through a pharmacophore-based virtual screening approach. J Mol Graph Model 2017; 75:174-188. [PMID: 28582695 DOI: 10.1016/j.jmgm.2017.05.014] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 05/20/2017] [Accepted: 05/22/2017] [Indexed: 02/06/2023]
Abstract
DNA methylation is an epigenetic change that results in the addition of a methyl group at the carbon-5 position of cytosine residues. DNA methyltransferase (DNMT) inhibitors can suppress tumour growth and have significant therapeutic value. However, the established inhibitors are limited in their application due to their substantial cytotoxicity. Additionally, the standard drugs for DNMT inhibition are non-selective cytosine analogues with considerable cytotoxic side-effects. In the present study, we have designed a workflow by integrating various ligand-based and structure-based approaches to discover new agents active against DNMT1. We have derived a pharmacophore model with the help of available DNMT1 inhibitors. Utilising this model, we performed the virtual screening of Maybridge chemical library and the identified hits were then subsequently filtered based on the Naïve Bayesian classification model. The molecules that have returned from this classification model were subjected to ensemble based docking. We have selected 10 molecules for the biological assay by inspecting the interactions portrayed by these molecules. Three out of the ten tested compounds have shown DNMT1 inhibitory activity. These compounds were also found to demonstrate potential inhibition of cellular proliferation in human breast cancer MDA-MB-231 cells. In the present study, we have utilized a multi-step virtual screening protocol to identify inhibitors of DNMT1, which offers a starting point to develop more potent DNMT1 inhibitors as anti-cancer agents.
Collapse
Affiliation(s)
- Shagun Krishna
- Molecular & Structural Biology Division, CSIR-Central Drug Research Institute, Lucknow, 260031, India
| | - Samriddhi Shukla
- Endocrinology Division, CSIR-Central Drug Research Institute, Lucknow, 260031, India
| | - Amar Deep Lakra
- Endocrinology Division, CSIR-Central Drug Research Institute, Lucknow, 260031, India
| | - Syed Musthapa Meeran
- Endocrinology Division, CSIR-Central Drug Research Institute, Lucknow, 260031, India
| | - Mohammad Imran Siddiqi
- Molecular & Structural Biology Division, CSIR-Central Drug Research Institute, Lucknow, 260031, India.
| |
Collapse
|
50
|
Uehara S, Tanaka S. Cosolvent-Based Molecular Dynamics for Ensemble Docking: Practical Method for Generating Druggable Protein Conformations. J Chem Inf Model 2017; 57:742-756. [PMID: 28388074 DOI: 10.1021/acs.jcim.6b00791] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Protein flexibility is a major hurdle in current structure-based virtual screening (VS). In spite of the recent advances in high-performance computing, protein-ligand docking methods still demand tremendous computational cost to take into account the full degree of protein flexibility. In this context, ensemble docking has proven its utility and efficiency for VS studies, but it still needs a rational and efficient method to select and/or generate multiple protein conformations. Molecular dynamics (MD) simulations are useful to produce distinct protein conformations without abundant experimental structures. In this study, we present a novel strategy that makes use of cosolvent-based molecular dynamics (CMD) simulations for ensemble docking. By mixing small organic molecules into a solvent, CMD can stimulate dynamic protein motions and induce partial conformational changes of binding pocket residues appropriate for the binding of diverse ligands. The present method has been applied to six diverse target proteins and assessed by VS experiments using many actives and decoys of DEKOIS 2.0. The simulation results have revealed that the CMD is beneficial for ensemble docking. Utilizing cosolvent simulation allows the generation of druggable protein conformations, improving the VS performance compared with the use of a single experimental structure or ensemble docking by standard MD with pure water as the solvent.
Collapse
Affiliation(s)
- Shota Uehara
- Department of Computational Science, Graduate School of System Informatics, Kobe University , 1-1 Rokkodai, Nada, Kobe, Hyogo 657-8501, Japan
| | - Shigenori Tanaka
- Department of Computational Science, Graduate School of System Informatics, Kobe University , 1-1 Rokkodai, Nada, Kobe, Hyogo 657-8501, Japan
| |
Collapse
|