Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Horvath D, Marcou G, Varnek A. Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors. J Chem Inf Model 2013;53:1543-62. [PMID: 23731338 DOI: 10.1021/ci400106g] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

For:	Horvath D, Marcou G, Varnek A. Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors. J Chem Inf Model 2013;53:1543-62. [PMID: 23731338 DOI: 10.1021/ci400106g] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Wahl J. PheSA: An Open-Source Tool for Pharmacophore-Enhanced Shape Alignment. J Chem Inf Model 2024;64:5944-5953. [PMID: 39092495 DOI: 10.1021/acs.jcim.4c00516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]

Koukos PI, Réau M, Bonvin AMJJ. Shape-Restrained Modeling of Protein-Small-Molecule Complexes with High Ambiguity Driven DOCKing. J Chem Inf Model 2021;61:4807-4818. [PMID: 34436890 PMCID: PMC8479858 DOI: 10.1021/acs.jcim.1c00796] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Cleves AE, Johnson SR, Jain AN. Electrostatic-field and surface-shape similarity for virtual screening and pose prediction. J Comput Aided Mol Des 2019;33:865-886. [PMID: 31650386 PMCID: PMC6856045 DOI: 10.1007/s10822-019-00236-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 10/11/2019] [Indexed: 02/04/2023]

Abstract

We introduce a new method for rapid computation of 3D molecular similarity that combines electrostatic field comparison with comparison of molecular surface-shape and directional hydrogen-bonding preferences (called "eSim"). Rather than employing heuristic "colors" or user-defined molecular feature types to represent conformation-dependent molecular electrostatics, eSim calculates the similarity of the electrostatic fields of two molecules (in addition to shape and hydrogen-bonding). We present detailed virtual screening performance data on the standard 102 target DUD-E set. In its moderately fast screening mode, eSim running on a single computing core is capable of processing over 60 molecules per second. In this mode, eSim performed significantly better than all alternate methods for which full DUD-E data were available (mean ROC area of 0.74, p [Formula: see text], by paired t-test, compared with the best performing alternate method). In addition, for 92 targets of the DUD-E set where multiple ligand-bound crystal structures were available, screening performance was assessed using alternate ligands or sets thereof (in their bound poses) as similarity targets. Using the joint alignment of five ligands for each protein target, mean ROC area exceeded 0.82 for the 92 targets. Design-focused application of ligand similarity methods depends on accurate predictions of geometric molecular relationships. We comprehensively assessed pose prediction accuracy by curating nearly 400,000 bound ligand pose pairs across the DUD-E targets. Overall, beginning from agnostic initial poses, we observed an 80% success rate for RMSD [Formula: see text] Å among the top 20 predicted eSim poses. These examples were split roughly 50/50 into cases with high direct atomic overlap (where a shared scaffold exists between a pair) and low direct atomic overlap (where where a ligand pair has dissimilar scaffolds but largely occupies the same space). Within the high direct atomic overlap subset, the pose prediction success rate was 93%. For the more challenging subset (where dissimilar scaffolds are to be aligned), the success rate was 70%. The eSim approach enables both large-scale screening and rational design of ligands and is rooted in physically meaningful, non-heuristic, molecular comparisons.

Collapse

Laufkötter O, Miyao T, Bajorath J. Large-Scale Comparison of Alternative Similarity Search Strategies with Varying Chemical Information Contents. ACS OMEGA 2019;4:15304-15311. [PMID: 31552377 PMCID: PMC6751733 DOI: 10.1021/acsomega.9b02470] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Accepted: 08/23/2019] [Indexed: 06/10/2023]

Kumar A, Zhang KYJ. Shape similarity guided pose prediction: lessons from D3R Grand Challenge 3. J Comput Aided Mol Des 2018;33:47-59. [PMID: 30084081 DOI: 10.1007/s10822-018-0142-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Accepted: 08/01/2018] [Indexed: 12/15/2022]

Abstract

To extend the utility of ligand 3D shape similarity into pose prediction and virtual screening, we have previously developed CDVS and PoPSS methods. Both of them utilize ligand 3D shape similarity with the crystallographic ligands to improve pose prediction. While CDVS utilizes shape similarity to select suitable receptor structures for molecular docking, PoPSS places a ligand conformation of the highest shape similarity with crystal ligands into the target protein binding pocket which is then refined by side-chain repacking and Monte Carlo energy minimization. Analyses of PoPSS revealed some drawbacks in ligand conformation generation and the scoring scheme used. Moreover, as PoPSS does not sample the ligand conformation after placing it in the binding pocket, it relies solely on conformation generation methods to produce native like conformations. To address these limitations of PoPSS method, we report here a modified approach named as PoPSS-Lite, where side-chain repacking was replaced by a simple grid-based energy minimization. This modification also allowed the sampling of terminal functional groups while keeping the core scaffold fixed. Furthermore, shape similarity calculations were improved by increasing the number of ligand conformations and using a different similarity metric. The performance of PoPSS-Lite was prospectively evaluated in D3R GC3. Comparison of PoPSS-Lite demonstrated superior performance over PoPSS and CDVS with lower mean and median RMSDs. Furthermore, comparison with other D3R GC3 pose prediction submissions revealed top performance for PoPSS-Lite. Our D3R GC3 result extends our perspective that ligand 3D shape similarity with known crystallographic information can be successfully used to predict the binding pose of ligands with unknown binding modes. Our D3R GC3 results further highlight the necessity for improvement in conformer generation methods in order to improve shape similarity guided pose prediction.

Collapse

Assessment of tautomer distribution using the condensed reaction graph approach. J Comput Aided Mol Des 2018;32:401-414. [DOI: 10.1007/s10822-018-0101-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 01/18/2018] [Indexed: 02/07/2023]

Wong LWY, Tam GSS, Chen X, So FTK, Soecipto A, Sheong FK, Sung HHY, Lin Z, Williams ID. A chiral spiroborate anion from diphenyl-l-tartramide [B{l-Tar(NHPh)2}2]−applied to some challenging resolutions. CrystEngComm 2018. [DOI: 10.1039/c8ce00855h] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

QSAR modeling and chemical space analysis of antimalarial compounds. J Comput Aided Mol Des 2017;31:441-451. [PMID: 28374255 DOI: 10.1007/s10822-017-0019-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 03/18/2017] [Indexed: 10/19/2022]

O'Hagan S, Kell DB. Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures. J Cheminform 2017;9:18. [PMID: 28316656 PMCID: PMC5344883 DOI: 10.1186/s13321-017-0198-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 02/09/2017] [Indexed: 12/21/2022] Open

Horvath D, Marcou G, Varnek A. Generative Topographic Mapping Approach to Chemical Space Analysis. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2017. [DOI: 10.1007/978-3-319-56850-8_6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]

Kearnes S, Pande V. ROCS-derived features for virtual screening. J Comput Aided Mol Des 2016;30:609-17. [PMID: 27624668 DOI: 10.1007/s10822-016-9959-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 08/31/2016] [Indexed: 10/21/2022]

Kunimoto R, Vogt M, Bajorath J. Maximum common substructure-based Tversky index: an asymmetric hybrid similarity measure. J Comput Aided Mol Des 2016;30:523-31. [PMID: 27515428 DOI: 10.1007/s10822-016-9935-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2016] [Accepted: 08/04/2016] [Indexed: 12/01/2022]

Johnson DK, Karanicolas J. Ultra-High-Throughput Structure-Based Virtual Screening for Small-Molecule Inhibitors of Protein-Protein Interactions. J Chem Inf Model 2016;56:399-411. [PMID: 26726827 DOI: 10.1021/acs.jcim.5b00572] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Abstract

Protein-protein interactions play important roles in virtually all cellular processes, making them enticing targets for modulation by small-molecule therapeutics: specific examples have been well validated in diseases ranging from cancer and autoimmune disorders, to bacterial and viral infections. Despite several notable successes, however, overall these remain a very challenging target class. Protein interaction sites are especially challenging for computational approaches, because the target protein surface often undergoes a conformational change to enable ligand binding: this confounds traditional approaches for virtual screening. Through previous studies, we demonstrated that biased "pocket optimization" simulations could be used to build collections of low-energy pocket-containing conformations, starting from an unbound protein structure. Here, we demonstrate that these pockets can further be used to identify ligands that complement the protein surface. To do so, we first build from a given pocket its "exemplar": a perfect, but nonphysical, pseudoligand that would optimally match the shape and chemical features of the pocket. In our previous studies, we used these exemplars to quantitatively compare protein surface pockets to one another. Here, we now introduce this exemplar as a template for pharmacophore-based screening of chemical libraries. Through a series of benchmark experiments, we demonstrate that this approach exhibits comparable performance as traditional docking methods for identifying known inhibitors acting at protein interaction sites. However, because this approach is predicated on ligand/exemplar overlays, and thus does not require explicit calculation of protein-ligand interactions, exemplar screening provides a tremendous speed advantage over docking: 6 million compounds can be screened in about 15 min on a single 16-core, dual-GPU computer. The extreme speed at which large compound libraries can be traversed easily enables screening against a "pocket-optimized" ensemble of protein conformations, which in turn facilitates identification of more diverse classes of active compounds for a given protein target.

Collapse

Muegge I, Mukherjee P. An overview of molecular fingerprint similarity search in virtual screening. Expert Opin Drug Discov 2015;11:137-48. [PMID: 26558489 DOI: 10.1517/17460441.2016.1117070] [Citation(s) in RCA: 119] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Sidorov P, Gaspar H, Marcou G, Varnek A, Horvath D. Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds. J Comput Aided Mol Des 2015;29:1087-108. [PMID: 26564142 DOI: 10.1007/s10822-015-9882-z] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Accepted: 11/06/2015] [Indexed: 11/30/2022]

Abstract

Intuitive, visual rendering--mapping--of high-dimensional chemical spaces (CS), is an important topic in chemoinformatics. Such maps were so far dedicated to specific compound collections--either limited series of known activities, or large, even exhaustive enumerations of molecules, but without associated property data. Typically, they were challenged to answer some classification problem with respect to those same molecules, admired for their aesthetical virtues and then forgotten--because they were set-specific constructs. This work wishes to address the question whether a general, compound set-independent map can be generated, and the claim of "universality" quantitatively justified, with respect to all the structure-activity information available so far--or, more realistically, an exploitable but significant fraction thereof. The "universal" CS map is expected to project molecules from the initial CS into a lower-dimensional space that is neighborhood behavior-compliant with respect to a large panel of ligand properties. Such map should be able to discriminate actives from inactives, or even support quantitative neighborhood-based, parameter-free property prediction (regression) models, for a wide panel of targets and target families. It should be polypharmacologically competent, without requiring any target-specific parameter fitting. This work describes an evolutionary growth procedure of such maps, based on generative topographic mapping, followed by the validation of their polypharmacological competence. Validation was achieved with respect to a maximum of exploitable structure-activity information, covering all of Homo sapiens proteins of the ChEMBL database, antiparasitic and antiviral data, etc. Five evolved maps satisfactorily solved hundreds of activity-based ligand classification challenges for targets, and even in vivo properties independent from training data. They also stood chemogenomics-related challenges, as cumulated responsibility vectors obtained by mapping of target-specific ligand collections were shown to represent validated target descriptors, complying with currently accepted target classification in biology. Therefore, they represent, in our opinion, a robust and well documented answer to the key question "What is a good CS map?"

Collapse

Gadhe CG, Lee E, Kim MH. Finding new scaffolds of JAK3 inhibitors in public database: 3D-QSAR models & shape-based screening. Arch Pharm Res 2015;38:2008-19. [PMID: 25956696 DOI: 10.1007/s12272-015-0607-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2014] [Accepted: 04/20/2015] [Indexed: 01/02/2023]

Duesbury E, Holliday J, Willett P. Maximum Common Substructure-Based Data Fusion in Similarity Searching. J Chem Inf Model 2015;55:222-30. [DOI: 10.1021/ci5005702] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Gan S, Cosgrove DA, Gardiner EJ, Gillet VJ. Investigation of the use of spectral clustering for the analysis of molecular data. J Chem Inf Model 2014;54:3302-19. [PMID: 25379955 DOI: 10.1021/ci500480b] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Computational chemogenomics: is it more than inductive transfer? J Comput Aided Mol Des 2014;28:597-618. [PMID: 24771144 DOI: 10.1007/s10822-014-9743-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2014] [Accepted: 04/11/2014] [Indexed: 10/25/2022]

Abstract

High-throughput assays challenge us to extract knowledge from multi-ligand, multi-target activity data. In QSAR, weights are statically fitted to each ligand descriptor with respect to a single endpoint or target. However, computational chemogenomics (CG) has demonstrated benefits of learning from entire grids of data at once, rather than building target-specific QSARs. A possible reason for this is the emergence of inductive knowledge transfer (IT) between targets, providing statistical robustness to the model, with no assumption about the structure of the targets. Relevant protein descriptors in CG should allow one to learn how to dynamically adjust ligand attribute weights with respect to protein structure. Hence, models built through explicit learning (EL) by including protein information, while benefitting from IT enhancement, should provide additional predictive capability, notably for protein deorphanization. This interplay between IT and EL in CG modeling is not sufficiently studied. While IT is likely to occur irrespective of the injected target information, it is not clear whether and when boosting due to EL may occur. EL is only possible if protein description is appropriate to the target set under investigation. The key issue here is the search for evidence of genuine EL exceeding expectations based on pure IT. We explore the problem in the context of Support Vector Regression, using more than 9,400 pKi values of 31 GPCRs, where compound-protein interactions are represented by the concatenation of vectorial descriptions of compounds and proteins. This provides a unified framework to generate both IT-enhanced and potentially EL-enabled models, where the difference is toggled by supplied protein information. For EL-enabled models, protein information includes genuine protein descriptors such as typical sequence-based terms, but also the experimentally determined affinity cross-correlation fingerprints. These latter benchmark the expected behavior of a quasi-ideal descriptor capturing the actual functional protein-protein relatedness, and therefore thought to be the most likely to enable EL. EL- and IT-based methods were benchmarked alongside classical QSAR, with respect to cross-validation and deorphanization challenges. A rational method for projecting benchmarked methodologies into a strategy space is given, in the aims that the projection will provide directions for the types of molecule designs possible using a given methodology. While EL-enabled strategies outperform classical QSARs and favorably compare to similar published results, they are, in all respects evaluated herein, not strongly distinguished from IT-enhanced models. Moreover, EL-enabled strategies failed to prove superior in deorphanization challenges. Therefore, this paper raises caution that, contrary to common belief and intuitive expectation, the benefits of chemogenomics models over classical QSAR are quite possibly due less to the injection of protein-related information, and rather impacted more by the effect of inductive transfer, due to simultaneous learning from all of the modeled endpoints. These results show that the field of protein descriptor research needs further improvements to truly realize the expected benefit of EL.

Collapse