1
|
Duda J, Podlewska S. Prediction of probability distributions of molecular properties: towards more efficient virtual screening and better understanding of compound representations. Mol Divers 2024; 28:437-448. [PMID: 36586082 DOI: 10.1007/s11030-022-10589-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/18/2022] [Indexed: 01/01/2023]
Abstract
Various in silico approaches to predict activity and properties of chemical compounds constitute nowadays the basis of computer-aided drug design. While there is a general focus on the predictions of values, mathematically more appropriate is the prognosis of probability distributions, which offers additional possibilities, such as the evaluation of uncertainty, higher moments, and quantiles. In this study, we applied the Hierarchical Correlation Reconstruction approach to assess several ADMET properties of chemical compounds. It uses multiple linear regression to independently assess multiple moments, which are then finally combined into predicted probability distribution. The method enables inexpensive selection of compounds with properties nearly certain to fall into the particular range during virtual screening and automatic rejection of predictions characterized by high rate of uncertainty; however, unlike to the currently used virtual screening methods, it focuses on the prediction of the property distribution, not its actual value. Moreover, the presented protocol enables detection of structural features, which should be carefully considered when optimizing compounds towards particular property, as well as it provides deeper understanding of the examined compound representations.
Collapse
Affiliation(s)
- Jarosław Duda
- Faculty of Mathematics and Computer Science, Jagiellonian University, Łojasiewicza 6, 30-348, Kraków, Poland
| | - Sabina Podlewska
- Department of Medicinal Chemistry, Maj Institute of Pharmacology, Polish Academy of Sciences, Smętna Street 12, 31-343, Kraków, Poland.
| |
Collapse
|
2
|
Gu J, Peng RK, Guo CL, Zhang M, Yang J, Yan X, Zhou Q, Li H, Wang N, Zhu J, Ouyang Q. Construction of a synthetic methodology-based library and its application in identifying a GIT/PIX protein-protein interaction inhibitor. Nat Commun 2022; 13:7176. [PMID: 36418900 PMCID: PMC9684509 DOI: 10.1038/s41467-022-34598-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 10/24/2022] [Indexed: 11/24/2022] Open
Abstract
In recent years, the flourishing of synthetic methodology studies has provided concise access to numerous molecules with new chemical space. These compounds form a large library with unique scaffolds, but their application in hit discovery is not systematically evaluated. In this work, we establish a synthetic methodology-based compound library (SMBL), integrated with compounds obtained from our synthetic researches, as well as their virtual derivatives in significantly larger scale. We screen the library and identify small-molecule inhibitors to interrupt the protein-protein interaction (PPI) of GIT1/β-Pix complex, an unrevealed target involved in gastric cancer metastasis. The inhibitor 14-5-18 with a spiro[bicyclo[2.2.1]heptane-2,3'-indolin]-2'-one scaffold, considerably retards gastric cancer metastasis in vitro and in vivo. Since the PPI targets are considered undruggable as they are hard to target, the successful application illustrates the structural specificity of SMBL, demonstrating its potential to be utilized as compound source for more challenging targets.
Collapse
Affiliation(s)
- Jing Gu
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Rui-Kun Peng
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Chun-Ling Guo
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Meng Zhang
- grid.16821.3c0000 0004 0368 8293Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Jie Yang
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Xiao Yan
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Qian Zhou
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Hongwei Li
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Na Wang
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| | - Jinwei Zhu
- grid.16821.3c0000 0004 0368 8293Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China
| | - Qin Ouyang
- grid.410570.70000 0004 1760 6682Department of Medicinal Chemistry, College of Pharmacy, Third Military Medical University, 400038 Chongqing, China
| |
Collapse
|
3
|
Abdulhakeem Mansour Alhasbary A, Hashimah Ahamed Hassain Malim N. Turbo Similarity Searching: Effect of Partial Ranking and Fusion Rules on ChEMBL Database. Mol Inform 2021; 41:e2100106. [PMID: 34878229 DOI: 10.1002/minf.202100106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 11/25/2021] [Indexed: 11/08/2022]
Abstract
Turbo Similarity Searching (TSS) is the simplest and most recent chemical similarity searching (SS) approach, which improves the effectiveness of SS by performing a multi-target searching. TSS has four important elements, namely structural representation, similarity coefficient, number of nearest neighbours (NNs), and fusion rule, and any changes in these elements could affect the TSS results. A previous study suggested the advantage of using large numbers of reference compounds with small fractions of the database structures to obtain a better recall in group fusion. Therefore, this study aims to investigate the effect of partial ranking on TSS utilising different fusion rules and different numbers of NNs on the ChEMBL database and to evaluate whether these observations hold in TSS. Furthermore, the objective is to observe the effect of the indirect relationship feature of TSS on the partial ranking investigation. The results showed that the effect of using partial ranking on TSS was significant. This study also found that the performance of TSS improved as the database proportions used in the fusion process decreased and by using a small number of NNs. In addition, fusion rules based on reciprocal rank positions (RKP), maximum similarity score (sMAX), and sMNZ were superior to all the other fusion rules.
Collapse
|
4
|
Miranda-Quintana RA, Rácz A, Bajusz D, Héberger K. Extended similarity indices: the benefits of comparing more than two objects simultaneously. Part 2: speed, consistency, diversity selection. J Cheminform 2021; 13:33. [PMID: 33892799 PMCID: PMC8067665 DOI: 10.1186/s13321-021-00504-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 03/12/2021] [Indexed: 11/10/2022] Open
Abstract
Despite being a central concept in cheminformatics, molecular similarity has so far been limited to the simultaneous comparison of only two molecules at a time and using one index, generally the Tanimoto coefficent. In a recent contribution we have not only introduced a complete mathematical framework for extended similarity calculations, (i.e. comparisons of more than two molecules at a time) but defined a series of novel idices. Part 1 is a detailed analysis of the effects of various parameters on the similarity values calculated by the extended formulas. Their features were revealed by sum of ranking differences and ANOVA. Here, in addition to characterizing several important aspects of the newly introduced similarity metrics, we will highlight their applicability and utility in real-life scenarios using datasets with popular molecular fingerprints. Remarkably, for large datasets, the use of extended similarity measures provides an unprecedented speed-up over “traditional” pairwise similarity matrix calculations. We also provide illustrative examples of a more direct algorithm based on the extended Tanimoto similarity to select diverse compound sets, resulting in much higher levels of diversity than traditional approaches. We discuss the inner and outer consistency of our indices, which are key in practical applications, showing whether the n-ary and binary indices rank the data in the same way. We demonstrate the use of the new n-ary similarity metrics on t-distributed stochastic neighbor embedding (t-SNE) plots of datasets of varying diversity, or corresponding to ligands of different pharmaceutical targets, which show that our indices provide a better measure of set compactness than standard binary measures. We also present a conceptual example of the applicability of our indices in agglomerative hierarchical algorithms. The Python code for calculating the extended similarity metrics is freely available at: https://github.com/ramirandaq/MultipleComparisons
Collapse
Affiliation(s)
| | - Anita Rácz
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - Dávid Bajusz
- Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary
| | - Károly Héberger
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, 1117, Budapest, Hungary.
| |
Collapse
|