1
|
Clyde A. Ultrahigh Throughput Protein-Ligand Docking with Deep Learning. Methods Mol Biol 2022; 2390:301-319. [PMID: 34731475 DOI: 10.1007/978-1-0716-1787-8_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Ultrahigh-throughput virtual screening (uHTVS) is an emerging field linking together classical docking techniques with high-throughput AI methods. We outline mechanistic docking models' goals and successes. We present different AI accelerated workflows for uHTVS, mainly through surrogate docking models. We showcase a novel feature representation technique, molecular depictions (images), as a surrogate model for docking. Along with a discussion on analyzing screens using regression enrichment surfaces at the tens of billion scale, we outline a future for uHTVS screening pipelines with deep learning.
Collapse
Affiliation(s)
- Austin Clyde
- Department of Computer Science, University of Chicago, Chicago, IL, USA.
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL, USA.
| |
Collapse
|
2
|
Wu Y, Brooks CL. Flexible CDOCKER: Hybrid Searching Algorithm and Scoring Function with Side Chain Conformational Entropy. J Chem Inf Model 2021; 61:5535-5549. [PMID: 34704754 PMCID: PMC8684595 DOI: 10.1021/acs.jcim.1c01078] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The binding of small-molecule ligands to protein or nucleic acid targets is important to numerous biological processes. Accurate prediction of the binding modes between a ligand and a macromolecule is of fundamental importance in structure-based structure-function exploration. When multiple ligands with different sizes are docked to a target receptor, it is reasonable to assume that the residues in the binding pocket may adopt alternative conformations upon interacting with the different ligands. In addition, it has been suggested that the entropic contribution to binding can be important. However, only a few attempts to include the side chain conformational entropy upon binding within the application of flexible receptor docking methodology exist. Here, we propose a new physics-based scoring function that includes both enthalpic and entropic contributions upon binding by considering the conformational variability of the flexible side chains within the ensemble of docked poses. We also describe a novel hybrid searching algorithm that combines both molecular dynamics (MD)-based simulated annealing and genetic algorithm crossovers to address the enhanced sampling of the increased search space. We demonstrate improved accuracy in flexible cross-docking experiments compared with rigid cross-docking. We test our developments by considering five protein targets, thrombin, dihydrofolate reductase(DHFR), T4 L99A, T4 L99A/M102Q, and PDE10A, which belong to different enzyme classes with different binding pocket environments, as a representative set of diverse ligands and receptors. Each target contains dozens of different ligands bound to the same binding pocket. We also demonstrate that this flexible docking algorithm may be applicable to RNA docking with a representative riboswitch example. Our findings show significant improvements in top ranking accuracy across this set, with the largest improvement relative to rigid, 23.64%, occurring for ligands binding to DHFR. We then evaluate the ability to identify lead compounds among a large chemical space for the proposed flexible receptor docking algorithm using a subset of the DUD-E containing receptor targets MCR, GCR, and ANDR. We demonstrate that our new algorithms show improved performance in modeling flexible binding site residues compared to DOCK. Finally, we select the T4 L99A and T4 L99A/M102Q decoy sets, containing dozens of binders and experimentally validated nonbinders, to test our approach in distinguishing binders from nonbinders. We illustrate that our new algorithms for searching and scoring have superior performance to rigid receptor CDOCKER as well as AutoDock Vina. Finally, we suggest that flexible CDOCKER is sufficiently fast to be utilized in high-throughput docking screens in the context of hierarchical approaches.
Collapse
Affiliation(s)
- Yujin Wu
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Charles L Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Biophysics Program, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
3
|
Farhadi Z, Farhadi T, Hashemian SM. Virtual screening for potential inhibitors of β(1,3)-D-glucan synthase as drug candidates against fungal cell wall. J Drug Assess 2020; 9:52-59. [PMID: 32284908 PMCID: PMC7144292 DOI: 10.1080/21556660.2020.1734010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 02/07/2020] [Indexed: 01/17/2023] Open
Abstract
Background To enhance the outcome in patients with invasive candidiasis, initiation of an efficient antifungal treatment in a suitable dosage is necessary. Echinocandins (e.g. caspofungin) inhibit the enzyme β(1,3)-D-glucan synthase of the fungal cell wall. Compared to azoles and other antifungal agents, echinocandins have lower adverse effects and toxicity in humans. Echinocandins are available in injectable (intravenous) form. Methods In this study, to identify the novel oral drug-like compounds that affect the fungal cell wall, downloaded oral drug-like compounds from the ZINC database were processed with a virtual screening procedure. The docking free energies were calculated and compared with the known inhibitor caspofungin. Four molecules were selected as the most potent ligands and subjected to hydrogen bonds analysis. Results Considering the hydrogen bond analysis, two compounds (ZINC71336662 and ZINC40910772) were predicted to better interact with the active site of β(1,3)-D-glucan synthase compared with caspofungin. Conclusion The introduced compound in this study may be valuable to analyze experimentally as a novel oral drug candidate targeting fungal cell walls.
Collapse
Affiliation(s)
- Zinat Farhadi
- Chronic Respiratory Diseases Research Center (CRDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran.,Behavioral Disease Counseling Center, Marvdasht Health Center, Shiraz University of Medical Sciences, Shiraz, Iran.,Department of Microbiology, Shiraz Branch, Islamic Azad University, Shiraz, Iran
| | - Tayebeh Farhadi
- Chronic Respiratory Diseases Research Center (CRDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Seyed MohammadReza Hashemian
- Chronic Respiratory Diseases Research Center (CRDRC), National Research Institute of Tuberculosis and Lung Diseases (NRITLD), Shahid Beheshti University of Medical Sciences, Tehran, Iran.,Critical Care Department, Farhikhtegan Hospital, Tehran Medical Branch, Islamic Azad University, Tehran, Iran
| |
Collapse
|
4
|
A 3D-QSAR assisted activity prediction strategy for expanding substrate spectra of an aldehyde ketone reductase. MOLECULAR CATALYSIS 2018. [DOI: 10.1016/j.mcat.2018.06.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
5
|
Nguyen TH, Zhou HX, Minh DDL. Using the fast fourier transform in binding free energy calculations. J Comput Chem 2017; 39:621-636. [PMID: 29270990 DOI: 10.1002/jcc.25139] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 09/23/2017] [Accepted: 11/27/2017] [Indexed: 12/21/2022]
Abstract
According to implicit ligand theory, the standard binding free energy is an exponential average of the binding potential of mean force (BPMF), an exponential average of the interaction energy between the unbound ligand ensemble and a rigid receptor. Here, we use the fast Fourier transform (FFT) to efficiently evaluate BPMFs by calculating interaction energies when rigid ligand configurations from the unbound ensemble are discretely translated across rigid receptor conformations. Results for standard binding free energies between T4 lysozyme and 141 small organic molecules are in good agreement with previous alchemical calculations based on (1) a flexible complex ( R≈0.9 for 24 systems) and (2) flexible ligand with multiple rigid receptor configurations ( R≈0.8 for 141 systems). While the FFT is routinely used for molecular docking, to our knowledge this is the first time that the algorithm has been used for rigorous binding free energy calculations. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Trung Hai Nguyen
- Department of Chemistry, Illinois Institute of Technology, Chicago, Illinois, 60616
| | - Huan-Xiang Zhou
- Departments of Chemistry and Physics, University of Illinois at Chicago, Chicago, Illinois, 60607
| | - David D L Minh
- Department of Chemistry, Illinois Institute of Technology, Chicago, Illinois, 60616
| |
Collapse
|
6
|
Xie B, Nguyen TH, Minh DDL. Absolute Binding Free Energies between T4 Lysozyme and 141 Small Molecules: Calculations Based on Multiple Rigid Receptor Configurations. J Chem Theory Comput 2017; 13:2930-2944. [PMID: 28430432 PMCID: PMC5612505 DOI: 10.1021/acs.jctc.6b01183] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We demonstrate the feasibility of estimating protein-ligand binding free energies using multiple rigid receptor configurations. On the basis of T4 lysozyme snapshots extracted from six alchemical binding free energy calculations with a flexible receptor, binding free energies were estimated for a total of 141 ligands. For 24 ligands, the calculations reproduced flexible-receptor estimates with a correlation coefficient of 0.90 and a root-mean-square error of 1.59 kcal/mol. The accuracy of calculations based on Poisson-Boltzmann/surface area implicit solvent was comparable to that of previously reported free energy calculations.
Collapse
Affiliation(s)
- Bing Xie
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| | - Trung Hai Nguyen
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| | - David D. L. Minh
- Department of Chemistry, Illinois Institute of Technology, Chicago, IL 60616, USA
| |
Collapse
|
7
|
Virtual Screening for Potential Inhibitors of CTX-M-15 Protein of Klebsiella pneumoniae. Interdiscip Sci 2017; 10:694-703. [PMID: 28374117 DOI: 10.1007/s12539-017-0222-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Revised: 02/28/2017] [Accepted: 03/09/2017] [Indexed: 12/31/2022]
Abstract
The Gram-negative bacterium Klebsiella pneumoniae, responsible for a wide variety of nosocomial infections in immuno-deficient patients, involves the respiratory, urinary and gastrointestinal tract infections and septicemia. Extended spectrum β-lactamases (ESBL) belong to β-lactamases capable of conferring antibiotic resistance in Gram-negative bacteria. CTX-M-15, a prevalent ESBL reported from Enterobacteriaceae including K. pneumoniae, was selected as a potent anti-bacterial target. To identify the novel drug-like compounds, structure-based screening procedure was employed against downloaded drug-like compounds from ZINC database. An acronym for "ZINC" is not commercial. The docking free energy values were investigated and compared to the known inhibitor Avibactam. Six best novel drug-like compounds were selected and their hydrogen bindings with the receptor were determined. Based on the binding efficiency mode, three among these six identified most potential inhibitors, ZINC21811621, ZINC93091917 and ZINC19488569, were predicted as potential competitive inhibitors against CTX-M-15 compared to Avibactam. These three inhibitors may provide a framework for the experimental studies to develop anti-Klebsiella novel drug candidates targeting CTX-M-15.
Collapse
|
8
|
Pavadai E, El Mazouni F, Wittlin S, de Kock C, Phillips MA, Chibale K. Identification of New Human Malaria Parasite Plasmodium falciparum Dihydroorotate Dehydrogenase Inhibitors by Pharmacophore and Structure-Based Virtual Screening. J Chem Inf Model 2016; 56:548-62. [PMID: 26915022 DOI: 10.1021/acs.jcim.5b00680] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Plasmodium falciparum dihydroorotate dehydrogenase (PfDHODH), a key enzyme in the de novo pyrimidine biosynthesis pathway, which the Plasmodium falciparum relies on exclusively for survival, has emerged as a promising target for antimalarial drugs. In an effort to discover new and potent PfDHODH inhibitors, 3D-QSAR pharmacophore models were developed based on the structures of known PfDHODH inhibitors and the validated Hypo1 model was used as a 3D search query for virtual screening of the National Cancer Institute database. The virtual hit compounds were further filtered based on molecular docking and Molecular Mechanics/Generalized Born Surface Area binding energy calculations. The combination of the pharmacophore and structure-based virtual screening resulted in the identification of nine new compounds that showed >25% inhibition of PfDHODH at a concentration of 10 μM, three of which exhibited IC50 values in the range of 0.38-20 μM. The most active compound, NSC336047, displayed species-selectivity for PfDHODH over human DHODH and inhibited parasite growth with an IC50 of 26 μM. In addition to this, 13 compounds inhibited parasite growth with IC50 values of ≤ 50 μM, 4 of which showed IC50 values in the range of 5-12 μM. These compounds could be further explored in the identification and development of more potent PfDHODH and parasite growth inhibitors.
Collapse
Affiliation(s)
| | - Farah El Mazouni
- Departments of Pharmacology, University of Texas Southwestern Medical Center at Dallas , 6001 Forest Park Blvd, Dallas, Texas 75390-9041, United States
| | - Sergio Wittlin
- Swiss Tropical and Public Health Institute , Socinstrasse 57, 4002 Basel, Switzerland.,University of Basel , 4002 Basel, Switzerland
| | - Carmen de Kock
- Division of Clinical Pharmacology, Department of Medicine, University of Cape Town , Observatory 7925, South Africa
| | - Margaret A Phillips
- Departments of Pharmacology, University of Texas Southwestern Medical Center at Dallas , 6001 Forest Park Blvd, Dallas, Texas 75390-9041, United States
| | | |
Collapse
|
9
|
Forli S. Charting a Path to Success in Virtual Screening. Molecules 2015; 20:18732-58. [PMID: 26501243 PMCID: PMC4630810 DOI: 10.3390/molecules201018732] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 10/07/2015] [Accepted: 10/12/2015] [Indexed: 12/27/2022] Open
Abstract
Docking is commonly applied to drug design efforts, especially high-throughput virtual screenings of small molecules, to identify new compounds that bind to a given target. Despite great advances and successful applications in recent years, a number of issues remain unsolved. Most of the challenges and problems faced when running docking experiments are independent of the specific software used, and can be ascribed to either improper input preparation or to the simplified approaches applied to achieve high-throughput speed. Being aware of approximations and limitations of such methods is essential to prevent errors, deal with misleading results, and increase the success rate of virtual screening campaigns. In this review, best practices and most common issues of docking and virtual screening will be discussed, covering the journey from the design of the virtual experiment to the hit identification.
Collapse
Affiliation(s)
- Stefano Forli
- Molecular Graphics Laboratory, Department of Integrative Structural and Computational Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA.
| |
Collapse
|
10
|
|
11
|
Meng XY, Zhang HX, Mezei M, Cui M. Molecular docking: a powerful approach for structure-based drug discovery. Curr Comput Aided Drug Des 2011; 7:146-57. [PMID: 21534921 DOI: 10.2174/157340911795677602] [Citation(s) in RCA: 1521] [Impact Index Per Article: 117.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2010] [Accepted: 09/29/2010] [Indexed: 11/22/2022]
Abstract
Molecular docking has become an increasingly important tool for drug discovery. In this review, we present a brief introduction of the available molecular docking methods, and their development and applications in drug discovery. The relevant basic theories, including sampling algorithms and scoring functions, are summarized. The differences in and performance of available docking software are also discussed. Flexible receptor molecular docking approaches, especially those including backbone flexibility in receptors, are a challenge for available docking methods. A recently developed Local Move Monte Carlo (LMMC) based approach is introduced as a potential solution to flexible receptor docking problems. Three application examples of molecular docking approaches for drug discovery are provided.
Collapse
Affiliation(s)
- Xuan-Yu Meng
- State Key Laboratory of Theoretical and Computational Chemistry, Institute of Theoretical Chemistry, Jilin University, Changchun, China
| | | | | | | |
Collapse
|
12
|
Mysinger MM, Shoichet BK. Rapid context-dependent ligand desolvation in molecular docking. J Chem Inf Model 2011; 50:1561-73. [PMID: 20735049 DOI: 10.1021/ci100214a] [Citation(s) in RCA: 241] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In structure-based screens for new ligands, a molecular docking algorithm must rapidly score many molecules in multiple configurations, accounting for both the ligand's interactions with receptor and its competing interactions with solvent. Here we explore a context-dependent ligand desolvation scoring term for molecular docking. We relate the Generalized-Born effective Born radii for every ligand atom to a fractional desolvation and then use this fraction to scale an atom-by-atom decomposition of the full transfer free energy. The fractional desolvation is precomputed on a scoring grid by numerically integrating over the volume of receptor proximal to a ligand atom, weighted by distance. To test this method's performance, we dock ligands versus property-matched decoys over 40 DUD targets. Context-dependent desolvation better enriches ligands compared to both the raw full transfer free energy penalty and compared to ignoring desolvation altogether, though the improvement is modest. More compellingly, the new method improves docking performance across receptor types. Thus, whereas entirely ignoring desolvation works best for charged sites and overpenalizing with full desolvation works well for neutral sites, the physically more correct context-dependent ligand desolvation is competitive across both types of targets. The method also reliably discriminates ligands from highly charged molecules, where ignoring desolvation performs poorly. Since this context-dependent ligand desolvation may be precalculated, it improves docking reliability with minimal cost to calculation time and may be readily incorporated into any physics-based docking program.
Collapse
Affiliation(s)
- Michael M Mysinger
- Department of Pharmaceutical Chemistry, University of California, San Francisco, California, USA
| | | |
Collapse
|
13
|
Vijayan RSK, Prabu M, Mascarenhas NM, Ghoshal N. Hybrid structure-based virtual screening protocol for the identification of novel BACE1 inhibitors. J Chem Inf Model 2009; 49:647-57. [PMID: 19434899 DOI: 10.1021/ci800386v] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
BACE1, also called beta-secretase or memapsin 2, is an extensively studied aspartic protease, involved in etiopathogenesis and progression of Alzheimer's disease (AD). We report herein a modified structure-based virtual screening protocol that augments the lead identification process against BACE1 during virtual screening endeavors. A hybrid structure-based virtual screening protocol that incorporates elements from both ligand-based and structure-based techniques was used for the identification of prospective small molecule inhibitors. Virtual screening, using an active-site-derived pharmacophore, followed by ROCS (rapid overlay of chemical structures)-based GOLD (genetic optimization in ligand docking) docking was used to identify a library of focused candidates. The efficacy of the ROCS-based GOLD docking method together with our customized weighted consensus scoring function was evaluated against conventional docking methods for its ability to discern true positives from a screening library. An in-depth structural analysis of the binding mode of the top-ranking molecules reveals that emulation of the curial interaction patterns deemed necessary for BACE1 inhibition. The results obtained from our validation study ensure the superiority of our docking methodology over conventional docking methods in yielding higher enrichment rates.
Collapse
Affiliation(s)
- R S K Vijayan
- Structural Biology and Bioinformatics Division, Indian Institute of Chemical Biology (A unit of CSIR), Kolkata 700032, India
| | | | | | | |
Collapse
|
14
|
Huggins DJ, Altman MD, Tidor B. Evaluation of an inverse molecular design algorithm in a model binding site. Proteins 2009; 75:168-86. [PMID: 18831031 DOI: 10.1002/prot.22226] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Computational molecular design is a useful tool in modern drug discovery. Virtual screening is an approach that docks and then scores individual members of compound libraries. In contrast to this forward approach, inverse approaches construct compounds from fragments, such that the computed affinity, or a combination of relevant properties, is optimized. We have recently developed a new inverse approach to drug design based on the dead-end elimination and A* algorithms employing a physical potential function. This approach has been applied to combinatorially constructed libraries of small-molecule ligands to design high-affinity HIV-1 protease inhibitors (Altman et al., J Am Chem Soc 2008;130:6099-6013). Here we have evaluated the new method using the well-studied W191G mutant of cytochrome c peroxidase. This mutant possesses a charged binding pocket and has been used to evaluate other design approaches. The results show that overall the new inverse approach does an excellent job of separating binders from nonbinders. For a few individual cases, scoring inaccuracies led to false positives. The majority of these involve erroneous solvation energy estimation for charged amines, anilinium ions, and phenols, which has been observed previously for a variety of scoring algorithms. Interestingly, although inverse approaches are generally expected to identify some but not all binders in a library, due to limited conformational searching, these results show excellent coverage of the known binders while still showing strong discrimination of the nonbinders.
Collapse
Affiliation(s)
- David J Huggins
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | | | | |
Collapse
|
15
|
Machicado C, López-Llano J, Cuesta-López S, Bueno M, Sancho J. Design of ligand binding to an engineered protein cavity using virtual screening and thermal up-shift evaluation. J Comput Aided Mol Des 2008; 19:421-43. [PMID: 16231201 DOI: 10.1007/s10822-005-7969-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2005] [Accepted: 05/25/2005] [Indexed: 11/29/2022]
Abstract
Proteins could be used to carry and deliver small compounds. As a tool for designing ligand binding sites in protein cores, a three-step virtual screening method is presented that has been optimised using existing data on T4 lysozyme complexes and tested in a newly engineered cavity in flavodoxin. The method can pinpoint, in large databases, ligands of specific protein cavities. In the first step, physico-chemical filters are used to screen the library and discard a majority of compounds. In the second step, a flexible, fast docking procedure is used to score and select a smaller number of compounds as potential binders. In the third step, a finer method is used to dock promising molecules of the hit list into the protein cavity, and an optimised free energy function allows discarding the few false positives by calculating the affinity of the modelled complexes. To demonstrate the portability of the method, several cavities have been designed and engineered in the flavodoxin from Anabaena PCC 7119, and the W66F/L44A double mutant has been selected as a suitable host protein. The NCI database has then been screened for potential binders, and the binding to the engineered cavity of five promising compounds and three tentative non-binders has been experimentally tested by thermal up-shift assays and spectroscopic titrations. The five tentative binders (some apolar and some polar), unlike the three tentative non-binders, are shown to bind to the host mutant and, importantly, not to bind to the wild type protein. The three-step virtual screening method developed can thus be used to identify ligands of buried protein cavities. We anticipate that the method could also be used, in a reverse manner, to identify natural or engineerable protein cavities for the hosting of ligands of interest.
Collapse
Affiliation(s)
- Claudia Machicado
- Departamento de Bioquímica y Biología Molecular y Celular, , Universidad de Zaragoza, 50009, Zaragoza, Spain
| | | | | | | | | |
Collapse
|
16
|
Grabowski K, Baringhaus KH, Schneider G. Scaffold diversity of natural products: inspiration for combinatorial library design. Nat Prod Rep 2008; 25:892-904. [PMID: 18820757 DOI: 10.1039/b715668p] [Citation(s) in RCA: 162] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Natural products contain scaffold structures that can be systematically exploited for the design of combinatorial compound libraries with druglike properties. We review approaches for scaffold identification, and compare properties and pharmacophoric features of drugs and natural products. In particular, an application of the self-organizing map technique is presented for natural product-derived compound and library design.
Collapse
Affiliation(s)
- Kristina Grabowski
- Institute of Organic Chemistry and Chemical Biology, ZAFES/CMP, Goethe-University, Siesmayerstrasse 70, Frankfurt a.M., Germany
| | | | | |
Collapse
|
17
|
Barreiro G, Guimarães CRW, Tubert-Brohman I, Lyons TM, Tirado-Rives J, Jorgensen WL. Search for non-nucleoside inhibitors of HIV-1 reverse transcriptase using chemical similarity, molecular docking, and MM-GB/SA scoring. J Chem Inf Model 2007; 47:2416-28. [PMID: 17949071 PMCID: PMC2564819 DOI: 10.1021/ci700271z] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A virtual screening protocol has been applied to seek non-nucleoside inhibitors of HIV-1 reverse transcriptase (NNRTIs) and its K103N mutant. First, a chemical similarity search on the Maybridge library was performed using known NNRTIs as reference structures. The top-ranked molecules obtained from this procedure plus 26 known NNRTIs were then docked into the binding sites of the wild-type reverse transcriptase (HIV-RT) and its K103N variant (K103N-RT) using Glide 3.5. The top-ranked 100 compounds from the docking for both proteins were post-scored with a procedure using molecular mechanics and continuum solvation (MM-GB/SA). The validity of the virtual screening protocol was supported by (i) testing of the MM-GB/SA procedure, (ii) agreement between predicted and crystallographic binding poses, (iii) recovery of known potent NNRTIs at the top of both rankings, and (iv) identification of top-scoring library compounds that are close in structure to recently reported NNRTI HTS hits. However, purchase and assaying of selected top-scoring compounds from the library failed to yield active anti-HIV agents. Nevertheless, the highest-ranked database compound, S10087, was pursued as containing a potentially viable core. Subsequent synthesis and assaying of S10087 analogues proposed by further computational analysis yielded anti-HIV agents with EC50 values as low as 310 nM. Thus, with the aid of computational tools, it was possible to evolve a false positive into a true active.
Collapse
Affiliation(s)
- Gabriela Barreiro
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520−8107
| | | | - Ivan Tubert-Brohman
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520−8107
| | - Theresa M. Lyons
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520−8107
| | - Julian Tirado-Rives
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520−8107
| | - William L. Jorgensen
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520−8107
| |
Collapse
|
18
|
Favia AD, Nobeli I, Glaser F, Thornton JM. Molecular docking for substrate identification: the short-chain dehydrogenases/reductases. J Mol Biol 2007; 375:855-74. [PMID: 18036612 DOI: 10.1016/j.jmb.2007.10.065] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2007] [Accepted: 10/24/2007] [Indexed: 10/22/2022]
Abstract
Protein ligand docking has recently been investigated as a tool for protein function identification, with some success in identifying both known and unknown substrates of proteins. However, identifying a protein's substrate when cross-docking a large number of enzymes and their cognate ligands remains a challenge. To explore a more limited yet practically important and timely problem in more detail, we have used docking for identifying the substrates of a single protein family with remarkable substrate diversity, the short-chain dehydrogenases/reductases. We examine different protocols for identifying candidate substrates for 27 short-chain dehydrogenase/reductase proteins of known catalytic function. We present the results of docking >900 metabolites from the human metabolome to each of these proteins together with their known cognate substrates and products, and we investigate the ability of docking to (a) reproduce a viable binding mode for the substrate and (b) to rank the substrate highly amongst the dataset of other metabolites. In addition, we examine whether our docking results provide information about the nature of the substrate, based on the best-scoring metabolites in the dataset. We compare two different docking methods and two alternative scoring functions for one of the docking methods, and we attempt to rationalise both successes and failures. Finally, we introduce a new protocol, whereby we dock only a set of representative structures (medoids) to each of the proteins, in the hope of characterising each binding site in terms of its ligand preferences, with a reduced computational cost. We compare the results from this protocol with our original docking experiments, and we find that although the rank of the representatives correlates well with the mean rank of the clusters to which they belong, a simple structure-based clustering is too naive for the purpose of substrate identification. Many clusters comprise ligands with widely varying affinities for the same protein; hence important candidates can be missed if a single representative is used.
Collapse
Affiliation(s)
- Angelo D Favia
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | | | | | |
Collapse
|
19
|
Schellhammer I, Rarey M. TrixX: structure-based molecule indexing for large-scale virtual screening in sublinear time. J Comput Aided Mol Des 2007; 21:223-38. [PMID: 17294247 DOI: 10.1007/s10822-007-9103-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2006] [Accepted: 01/05/2007] [Indexed: 11/25/2022]
Abstract
Structure-based virtual screening today is basically organized as a sequential process where the molecules of a screening library are evaluated for instance with respect to their fit with a biological target. In this paper, we present a novel structure-based screening paradigm avoiding sequential searching and therefore enabling sublinear runtime behavior. We implemented the novel paradigm in the virtual screening tool TrixX and successfully applied it in screening experiments on four targets from relevant therapeutic areas. With the screening paradigm implemented in TrixX, we propose some important extensions and modifications to traditional virtual screening approaches: Instead of processing all compounds in the screening library sequentially, TrixX first analyzes the geometric and physicochemical binding site characteristics and then draws compounds with matching features from a compound catalog. The catalog organizes the compounds by their physicochemical and geometric features making use of relational database technology with indexed tables in order to support efficient queries for compounds with specific features. A key element of the compound catalog is a highly selective geometric descriptor that carries information on the type of functional groups of the compound, their Euclidian distance, the preferred interaction direction of each functional group, and the location of steric bulk around the triangle. In a re-docking experiment with 200 protein-ligand complexes, we could show that TrixX is able to correctly predict the location of ligand functional groups in co-crystallized complexes. In a retrospective virtual screening experiment for four different targets, the enrichment factors of TrixX are comparable to the enrichment factors of FlexX and FlexX-Scan. With computing times clearly below one second per compound, TrixX counts among the fastest virtual screening tools currently available and is nearly two orders of magnitude faster than standard FlexX.
Collapse
Affiliation(s)
- Ingo Schellhammer
- Center for Bioinformatics, Research Group for Computational Molecular Design, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| | | |
Collapse
|
20
|
Franceschi F, Duffy EM. Structure-based drug design meets the ribosome. Biochem Pharmacol 2006; 71:1016-25. [PMID: 16443192 DOI: 10.1016/j.bcp.2005.12.026] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2005] [Revised: 12/08/2005] [Accepted: 12/13/2005] [Indexed: 11/19/2022]
Abstract
The high-resolution structures of the bacterial ribosomal subunits and those of their complexes with antibiotics have advanced significantly our understanding of small-molecule interactions with RNA. The wealth of RNA structural data generated by these structures has allowed computational chemists to employ a drug discovery paradigm focused on RNA-based targets. The structures also show how target-based resistance affects antibiotics acting at the level of the ribosome. Not only are the sites pinpointed where different classes of antibiotics inhibit protein synthesis, but their orientations, relative dispositions, and unique mechanisms of action are also revealed at the atomic level. Both the 30S and the 50S ribosomal subunits have been shown to be "targets of targets", offering several adjacent, functionally relevant binding pockets for antibiotics. It is the detailed knowledge of these validated locations, or ribofunctional loci, plus the mapping of the resistance hot-spots that allow the rational design of next-generation antibacterials. When the structural information is combined with a data-driven computational toolkit able to describe and predict molecular properties appropriate for bacterial cell penetration and drug-likeness, a structure-based drug design approach for novel antibacterials shows great promise.
Collapse
Affiliation(s)
- François Franceschi
- Rib-X Pharmaceuticals, Inc., 300 George Street, Suite 301, New Haven, CT 06511, USA.
| | | |
Collapse
|
21
|
Abstract
Molecular docking is widely used to predict novel lead compounds for drug discovery. Success depends on the quality of the docking scoring function, among other factors. An imperfect scoring function can mislead by predicting incorrect ligand geometries or by selecting nonbinding molecules over true ligands. These false-positive hits may be considered "decoys". Although these decoys are frustrating, they potentially provide important tests for a docking algorithm; the more subtle the decoy, the more rigorous the test. Indeed, decoy databases have been used to improve protein structure prediction algorithms and protein-protein docking algorithms. Here, we describe 20 geometric decoys in five enzymes and 166 "hit list" decoys-i.e., molecules predicted to bind by our docking program that were tested and found not to do so-for beta-lactamase and two cavity sites in lysozyme. Especially in the cavity sites, which are very simple, these decoys highlight particular weaknesses in our scoring function. We also consider the performance of five other widely used docking scoring functions against our geometric and hit list decoys. Intriguingly, whereas many of these other scoring functions performed better on the geometric decoys, they typically performed worse on the hit list decoys, often highly ranking molecules that seemed to poorly complement the model sites. Several of these "hits"from the other scoring functions were tested experimentally and found, in fact, to be decoys. Collectively, these decoys provide a tool for the development and improvement of molecular docking scoring functions. Such improvements may, in turn, be rapidly tested experimentally against these and related experimental systems, which are well-behaved in assays and for structure determination.
Collapse
Affiliation(s)
| | | | - Brian K. Shoichet
- * To whom correspondence should be addressed. Tel: 415-514-4126. Fax: 415-514-1460. E-mail:
| |
Collapse
|
22
|
Joseph-McCarthy D. Chapter 12 Structure-Based Lead Optimization. ACTA ACUST UNITED AC 2005. [DOI: 10.1016/s1574-1400(05)01012-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
|
23
|
Chema D, Eren D, Yayon A, Goldblum A, Zaliani A. Identifying the binding mode of a molecular scaffold. J Comput Aided Mol Des 2004; 18:23-40. [PMID: 15143801 DOI: 10.1023/b:jcam.0000022561.76694.5b] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We describe a method for docking of a scaffold-based series and present its advantages over docking of individual ligands, for determining the binding mode of a molecular scaffold in a binding site. The method has been applied to eight different scaffolds of protein kinase inhibitors (PKI). A single analog of each of these eight scaffolds was previously crystallized with different protein kinases. We have used FlexX to dock a set of molecules that share the same scaffold, rather than docking a single molecule. The main mode of binding is determined by the mode of binding of the largest cluster among the docked molecules that share a scaffold. Clustering is based on our 'nearest single neighbor' method [J. Chem. Inf. Comput. Sci., 43 (2003) 208-217]. Additional criteria are applied in those cases in which more than one significant binding mode is found. Using the proposed method, most of the crystallographic binding modes of these scaffolds were reconstructed. Alternative modes, that have not been detected yet by experiments, could also be identified. The method was applied to predict the binding mode of an additional molecular scaffold that was not yet reported and the predicted binding mode has been found to be very similar to experimental results for a closely related scaffold. We suggest that this approach be used as a virtual screening tool for scaffold-based design processes.
Collapse
Affiliation(s)
- Doron Chema
- Department of Medicinal Chemistry, David R. Bloom Center for Pharmacy, School of Pharmacy, Hebrew University of Jerusalem 91120, Israel
| | | | | | | | | |
Collapse
|
24
|
Kitchen DB, Decornez H, Furr JR, Bajorath J. Docking and scoring in virtual screening for drug discovery: methods and applications. Nat Rev Drug Discov 2004; 3:935-49. [PMID: 15520816 DOI: 10.1038/nrd1549] [Citation(s) in RCA: 2052] [Impact Index Per Article: 102.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Computational approaches that 'dock' small molecules into the structures of macromolecular targets and 'score' their potential complementarity to binding sites are widely used in hit identification and lead optimization. Indeed, there are now a number of drugs whose development was heavily influenced by or based on structure-based design and screening strategies, such as HIV protease inhibitors. Nevertheless, there remain significant challenges in the application of these approaches, in particular in relation to current scoring schemes. Here, we review key concepts and specific features of small-molecule-protein docking methods, highlight selected applications and discuss recent advances that aim to address the acknowledged limitations of established approaches.
Collapse
Affiliation(s)
- Douglas B Kitchen
- Department of Computer-Aided Drug Discovery, Albany Molecular Research, Inc., 21 Corporate Circle, Albany, New York 12212-5098, USA
| | | | | | | |
Collapse
|
25
|
Ferrari AM, Wei BQ, Costantino L, Shoichet BK. Soft docking and multiple receptor conformations in virtual screening. J Med Chem 2004; 47:5076-84. [PMID: 15456251 PMCID: PMC1413506 DOI: 10.1021/jm049756p] [Citation(s) in RCA: 164] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein conformational change is an important consideration in ligand-docking screens, but it is difficult to predict. A simple way to account for protein flexibility is to soften the criterion for steric fit between ligand and receptor. A more comprehensive but more expensive method would be to sample multiple receptor conformations explicitly. Here, these two approaches are compared. A "soft" scoring function was created by attenuating the repulsive term in the Lennard-Jones potential, allowing for a closer approach between ligand and protein. The standard, "hard" Lennard-Jones potential was used for docking to multiple receptor conformations. The Available Chemicals Directory (ACD) was screened against two cavity sites in the T4 lysozyme. These sites undergo small but significant conformational changes on ligand binding, making them good systems for soft docking. The ACD was also screened against the drug target aldose reductase, which can undergo large conformational changes on ligand binding. We evaluated the ability of the scoring functions to identify known ligands from among the over 200 000 decoy molecules in the database. The soft potential was always better at identifying known ligands than the hard scoring function when only a single receptor conformation was used. Conversely, the soft function was worse at identifying known leads than the hard function when multiple receptor conformations were used. This was true even for the cavity sites and was especially true for aldose reductase. To test the multiple-conformation method predictively, we screened the ACD for molecules that preferentially docked to the expanded conformation of aldose reductase, known to bind larger ligands. Six novel molecules that ranked among the top 0.66% of hits from the multiple-conformation calculation, but ranked relatively poorly in the soft docking calculation, were tested experimentally for enzyme inhibition. Four of these six inhibited the enzyme, the best with an IC(50) of 8 microM. Although ligands can get better scores in soft docking, the same is also true for decoys. The improved ranking of such decoys can come at the expense of true ligands.
Collapse
Affiliation(s)
| | | | | | - Brian K. Shoichet
- * Corresponding author. Phone: 415-514-4126. Fax 415-502-1411. E-mail:
| |
Collapse
|
26
|
Wei BQ, Weaver LH, Ferrari AM, Matthews BW, Shoichet BK. Testing a Flexible-receptor Docking Algorithm in a Model Binding Site. J Mol Biol 2004; 337:1161-82. [PMID: 15046985 DOI: 10.1016/j.jmb.2004.02.015] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2003] [Revised: 12/08/2003] [Accepted: 02/05/2004] [Indexed: 11/18/2022]
Abstract
Sampling receptor flexibility is challenging for database docking. We consider a method that treats multiple flexible regions of the binding site independently, recombining them to generate different discrete conformations. This algorithm scales linearly rather than exponentially with the receptor's degrees of freedom. The method was first evaluated for its ability to identify known ligands of a hydrophobic cavity mutant of T4 lysozyme (L99A). Some 200000 molecules of the Available Chemical Directory (ACD) were docked against an ensemble of cavity conformations. Surprisingly, the enrichment of known ligands from among a much larger number of decoys in the ACD was worse than simply docking to the apo conformation alone. Large decoys, accommodated in the larger cavity conformations sampled in the ensemble, were ranked better than known small ligands. The calculation was redone with an energy correction term that considered the cost of forming the larger cavity conformations. Enrichment improved, as did the balance between high-ranking large and small ligands. In a second retrospective test, the ACD was docked against a conformational ensemble of thymidylate synthase. Compared to docking against individual enzyme conformations, the flexible receptor docking approach improved enrichment of known ligands. Including a receptor conformational energy weighting term improved enrichment further. To test the method prospectively, the ACD database was docked against another cavity mutant of lysozyme (L99A/M102Q). A total of 18 new compounds predicted to bind this polar cavity and to change its conformation were tested experimentally; 14 were found to bind. The bound structures for seven ligands were determined by X-ray crystallography. The predicted geometries of these ligands all corresponded to the observed geometries to within 0.7A RMSD or better. Significant conformational changes of the cavity were observed in all seven complexes. In five structures, part of the observed accommodations were correctly predicted; in two structures, the receptor conformational changes were unanticipated and thus never sampled. These results suggest that although sampling receptor flexibility can lead to novel ligands that would have been missed when docking a rigid structure, it is also important to consider receptor conformational energy.
Collapse
Affiliation(s)
- Binqing Q Wei
- Department of Pharmaceutical Chemistry, University of California, 600 16th St, San Francisco, CA 94143-2240, USA
| | | | | | | | | |
Collapse
|
27
|
Joseph-McCarthy D, Thomas BE, Belmarsh M, Moustakas D, Alvarez JC. Pharmacophore-based molecular docking to account for ligand flexibility. Proteins 2003; 51:172-88. [PMID: 12660987 DOI: 10.1002/prot.10266] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Rapid computational mining of large 3D molecular databases is central to generating new drug leads. Accurate virtual screening of large 3D molecular databases requires consideration of the conformational flexibility of the ligand molecules. Ligand flexibility can be included without prohibitively increasing the search time by docking ensembles of precomputed conformers from a conformationally expanded database. A pharmacophore-based docking method whereby conformers of the same or different molecules are overlaid by their largest 3D pharmacophore and simultaneously docked by partial matches to that pharmacophore is presented. The method is implemented in DOCK 4.0.
Collapse
Affiliation(s)
- Diane Joseph-McCarthy
- Wyeth Research, Biological Chemistry Department, Cambridge, Massachusetts 02140, USA.
| | | | | | | | | |
Collapse
|
28
|
Atreya CE, Johnson EF, Irwin JJ, Dow A, Massimine KM, Coppens I, Stempliuk V, Beverley S, Joiner KA, Shoichet BK, Anderson KS. A molecular docking strategy identifies Eosin B as a non-active site inhibitor of protozoal bifunctional thymidylate synthase-dihydrofolate reductase. J Biol Chem 2003; 278:14092-100. [PMID: 12556445 DOI: 10.1074/jbc.m212690200] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Protozoal parasites are unusual in that their thymidylate synthase (TS) and dihydrofolate reductase (DHFR) enzymes exist on a single polypeptide. In an effort to probe the possibility of substrate channeling between the TS and DHFR active sites and to identify inhibitors specific for bifunctional TS-DHFR, we used molecular docking to screen for inhibitors targeting the shallow groove connecting the two active sites. Eosin B is a 100 microm non-active site inhibitor of Leishmania major TS-DHFR identified by molecular docking. Eosin B slows both the TS and DHFR reaction rates. When Arg-283, a key residue to which eosin B is predicted to bind, is mutated to glutamate, however, eosin B only minimally inhibits the TS-DHFR reaction. Additionally, eosin B was found to be a 180 microm inhibitor of Toxoplasma gondii in both biochemical and cell culture assays.
Collapse
Affiliation(s)
- Chloé E Atreya
- Department of Pharmacology, Yale University School of Medicine, New Haven, Connecticut 06520, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Wei BQ, Baase WA, Weaver LH, Matthews BW, Shoichet BK. A model binding site for testing scoring functions in molecular docking. J Mol Biol 2002; 322:339-55. [PMID: 12217695 DOI: 10.1016/s0022-2836(02)00777-5] [Citation(s) in RCA: 172] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Prediction of interaction energies between ligands and their receptors remains a major challenge for structure-based inhibitor discovery. Much effort has been devoted to developing scoring schemes that can successfully rank the affinities of a diverse set of possible ligands to a binding site for which the structure is known. To test these scoring functions, well-characterized experimental systems can be very useful. Here, mutation-created binding sites in T4 lysozyme were used to investigate how the quality of atomic charges and solvation energies affects molecular docking. Atomic charges and solvation energies were calculated for 172,118 molecules in the Available Chemicals Directory using a semi-empirical quantum mechanical approach by the program AMSOL. The database was first screened against the apolar cavity site created by the mutation Leu99Ala (L99A). Compared to the electronegativity-based charges that are widely used, the new charges and desolvation energies improved ranking of known apolar ligands, and better distinguished them from more polar isosteres that are not observed to bind. To investigate whether the new charges had predictive value, the non-polar residue Met102, which forms part of the binding site, was changed to the polar residue glutamine. The structure of the resulting Leu99Ala and Met102Gln double mutant of T4 lysozyme (L99A/M102Q) was determined and the docking calculation was repeated for the new site. Seven representative polar molecules that preferentially docked to the polar versus the apolar binding site were tested experimentally. All seven bind to the polar cavity (L99A/M102Q) but do not detectably bind to the apolar cavity (L99A). Five ligand-bound structures of L99A/M102Q were determined by X-ray crystallography. Docking predictions corresponded to the crystallographic results to within 0.4A RMSD. Improved treatment of partial atomic charges and desolvation energies in database docking appears feasible and leads to better distinction of true ligands. Simple model binding sites, such as L99A and its more polar variants, may find broad use in the development and testing of docking algorithms.
Collapse
Affiliation(s)
- Binqing Q Wei
- Department of Molecular Pharmacology and Biological Chemistry, Northwestern University School of Medicine, Chicago, IL 60611-3008, USA
| | | | | | | | | |
Collapse
|
30
|
Stahura FL, Bajorath J. Bio- and chemo-informatics beyond data management: crucial challenges and future opportunities. Drug Discov Today 2002; 7:S41-7. [PMID: 12047879 DOI: 10.1016/s1359-6446(02)02271-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Bio- and chemo-informatics are now thought to be crucial to the success and integration of biotechnology and drug discovery. Research in this area has expanded to go beyond data- and information-management. Here, we review exemplary areas, such as target identification and validation, virtual screening, and prediction of downstream characteristics of leads, where further research will play a key role in progressing the field.
Collapse
Affiliation(s)
- Florence L Stahura
- Albany Molecular Research, Bothell Research Center, (AMRI-BRC), 18804 North Creek Parkway, Bothell, WA 98011, USA
| | | |
Collapse
|
31
|
Halperin I, Ma B, Wolfson H, Nussinov R. Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins 2002; 47:409-43. [PMID: 12001221 DOI: 10.1002/prot.10115] [Citation(s) in RCA: 771] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
The docking field has come of age. The time is ripe to present the principles of docking, reviewing the current state of the field. Two reasons are largely responsible for the maturity of the computational docking area. First, the early optimism that the very presence of the "correct" native conformation within the list of predicted docked conformations signals a near solution to the docking problem, has been replaced by the stark realization of the extreme difficulty of the next scoring/ranking step. Second, in the last couple of years more realistic approaches to handling molecular flexibility in docking schemes have emerged. As in folding, these derive from concepts abstracted from statistical mechanics, namely, populations. Docking and folding are interrelated. From the purely physical standpoint, binding and folding are analogous processes, with similar underlying principles. Computationally, the tools developed for docking will be tremendously useful for folding. For large, multidomain proteins, domain docking is probably the only rational way, mimicking the hierarchical nature of protein folding. The complexity of the problem is huge. Here we divide the computational docking problem into its two separate components. As in folding, solving the docking problem involves efficient search (and matching) algorithms, which cover the relevant conformational space, and selective scoring functions, which are both efficient and effectively discriminate between native and non-native solutions. It is universally recognized that docking of drugs is immensely important. However, protein-protein docking is equally so, relating to recognition, cellular pathways, and macromolecular assemblies. Proteins function when they are bound to other molecules. Consequently, we present the review from both the computational and the biological points of view. Although large, it covers only partially the extensive body of literature, relating to small (drug) and to large protein-protein molecule docking, to rigid and to flexible. Unfortunately, when reviewing these, a major difficulty in assessing the results is the non-uniformity in the formats in which they are presented in the literature. Consequently, we further propose a way to rectify it here.
Collapse
Affiliation(s)
- Inbal Halperin
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | | | | | | |
Collapse
|
32
|
Verkhivker GM, Bouzida D, Gehlhaar DK, Rejto PA, Freer ST, Rose PW. Complexity and simplicity of ligand-macromolecule interactions: the energy landscape perspective. Curr Opin Struct Biol 2002; 12:197-203. [PMID: 11959497 DOI: 10.1016/s0959-440x(02)00310-x] [Citation(s) in RCA: 84] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The energy landscape approach has contributed to recent progress in understanding the complexity and simplicity of ligand-macromolecule interactions. Significant advances in computational structure prediction of ligand-protein complexes have been made using approaches that include the effects of protein flexibility and incorporate a hierarchy of energy functions. The results suggest that the complexity of structure prediction in molecular recognition may be determined by low-resolution properties of the underlying binding energy landscapes and by the nature of the energy funnels near the native structures of the complexes.
Collapse
Affiliation(s)
- Gennady M Verkhivker
- Department of Computational Chemistry, Agouron Pharmaceuticals Inc, A Pfizer Company, 10777 Science Center Drive, San Diego, California 92121-1111, USA.
| | | | | | | | | | | |
Collapse
|
33
|
Abstract
Recent improvements in flexible docking technology may lead to a bigger role for computational methods in lead discovery. Although fast and accurate computational prediction of binding affinities for an arbitrary molecule is still beyond the limits of current methods, the docking and screening procedures can select small sets of likely lead candidates from large libraries of either commercially or synthetically available compounds.
Collapse
Affiliation(s)
- R Abagyan
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines, TCP-28, La Jolla, CA 92037, USA.
| | | |
Collapse
|