Reference Citation Analysis (RCA)

For: Rao H, Li Z, Li X, Ma X, Ung C, Li H, Liu X, Chen Y. Identification of small molecule aggregators from large compound libraries by support vector machines. J Comput Chem 2010;31:752-63. [PMID: 19569201 DOI: 10.1002/jcc.21347] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Indexed: 01/04/2023]

Cited by Other Article(s)

Abou Hajal A, Bryce RA, Amor BB, Atatreh N, Ghattas MA. Boosting the Accuracy and Chemical Space Coverage of the Detection of Small Colloidal Aggregating Molecules Using the BAD Molecule Filter. J Chem Inf Model 2024;64:4991-5005. [PMID: 38920403 DOI: 10.1021/acs.jcim.4c00363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/27/2024]

Abstract

The ability to conduct effective high-throughput screening (HTS) campaigns in drug discovery is often hampered by the detection of false positives in these assays due to small colloidally aggregating molecules (SCAMs). SCAMs can produce artifactual hits in HTS by nonspecific inhibition of the protein target. In this work, we present a new computational prediction tool for detecting SCAMs based on their 2D chemical structure. The tool, called the boosted aggregation detection (BAD) molecule filter, employs decision tree ensemble methods, namely, the CatBoost classifier and the light gradient-boosting machine, to significantly improve the detection of SCAMs. In developing the filter, we explore three approaches, each tailored for specific drug discovery needs: models trained on individual data sets, a consensus approach using these models, and a merged data set approach. The individual data set method emerged as most effective, achieving 93% sensitivity and 90% specificity, outperforming existing state-of-the-art models by 20 and 5%, respectively. The consensus models offer broader chemical space coverage, exceeding 90% for all testing sets. This feature is particularly important for early-stage medicinal chemistry projects and provides information on the applicability domain. Meanwhile, the merged data set models demonstrated robust performance, with a notable sensitivity of 79% in the comprehensive 10-fold cross-validation test set. A SHAP analysis of model features indicates the importance of hydrophobicity and molecular complexity as primary factors influencing the aggregation propensity. The BAD molecule filter is publicly accessible at https://molmodlab-aau.com/Tools.html. This filter provides a new, more robust tool for aggregate prediction in the early stages of drug discovery to optimize hit rates and reduce associated testing and validation overheads.
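The sensitivity and specificity figures quoted in this abstract follow the standard confusion-matrix definitions. A minimal Python sketch of that arithmetic is given here for orientation only; the function name and the toy labels are assumptions made for illustration and are not taken from the BAD molecule filter or its data.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, 1 = aggregator, 0 = non-aggregator."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Count the four confusion-matrix cells.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")

    sensitivity = tp / (tp + fn)  # fraction of true aggregators that were flagged
    specificity = tn / (tn + fp)  # fraction of non-aggregators that were cleared
    return sensitivity, specificity


# Hypothetical toy labels, not data from the paper.
sens, spec = sensitivity_specificity([1, 1, 1, 0, 0], [1, 1, 0, 0, 1])
```

High sensitivity means few aggregators slip through the filter, while high specificity means few genuine hits are discarded, which is why the abstract reports both.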

Kombo DC, Stepp JD, Lim S, Elshorst B, Li Y, Cato L, Shomali M, Fink D, LaMarche MJ. Predictions of Colloidal Molecular Aggregation Using AI/ML Models. ACS Omega 2024;9:28691-28706. [PMID: 38973835 PMCID: PMC11223200 DOI: 10.1021/acsomega.4c02886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/28/2024] [Revised: 06/10/2024] [Accepted: 06/12/2024] [Indexed: 07/09/2024]

Molina C, Ait-Ouarab L, Minoux H. Isometric Stratified Ensembles: A Partial and Incremental Adaptive Applicability Domain and Consensus-Based Classification Strategy for Highly Imbalanced Data Sets with Application to Colloidal Aggregation. J Chem Inf Model 2022;62:1849-1862. [PMID: 35357194 DOI: 10.1021/acs.jcim.2c00293] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/30/2022]

Sun J, Zhong H, Wang K, Li N, Chen L. Gains from no real PAINS: Where 'Fair Trial Strategy' stands in the development of multi-target ligands. Acta Pharm Sin B 2021;11:3417-3432. [PMID: 34900527 PMCID: PMC8642439 DOI: 10.1016/j.apsb.2021.02.023] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 12/24/2020] [Revised: 02/15/2021] [Accepted: 02/25/2021] [Indexed: 12/26/2022]

Kaya I, Colmenarejo G. Analysis of Nuisance Substructures and Aggregators in a Comprehensive Database of Food Chemical Compounds. J Agric Food Chem 2020;68:8812-8824. [PMID: 32687707 DOI: 10.1021/acs.jafc.0c02521] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Indexed: 06/11/2023]

Alves VM, Capuzzi SJ, Braga RC, Korn D, Hochuli JE, Bowler KH, Yasgar A, Rai G, Simeonov A, Muratov EN, Zakharov AV, Tropsha A. SCAM Detective: Accurate Predictor of Small, Colloidally Aggregating Molecules. J Chem Inf Model 2020;60:4056-4063. [PMID: 32678597 DOI: 10.1021/acs.jcim.0c00415] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Indexed: 11/29/2022]

Affiliation(s)

Vinicius M Alves: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
Stephen J Capuzzi: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
Rodolpho C Braga: InsilicAll, São Paulo, São Paulo 04363-090, Brazil
Daniel Korn: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
Joshua E Hochuli: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
Kyle H Bowler: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States
Adam Yasgar: National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
Ganesha Rai: National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
Anton Simeonov: National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
Eugene N Muratov: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States; Department of Pharmaceutical Sciences, Federal University of Paraiba, João Pessoa, Paraíba 58059, Brazil
Alexey V Zakharov: National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, 9800 Medical Center Drive, Rockville, Maryland 20850, United States
Alexander Tropsha: Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, United States

Yang ZY, He JH, Lu AP, Hou TJ, Cao DS. Application of Negative Design To Design a More Desirable Virtual Screening Library. J Med Chem 2020;63:4411-4429. [DOI: 10.1021/acs.jmedchem.9b01476] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Indexed: 12/25/2022]

Ayotte Y, Marando VM, Vaillancourt L, Bouchard P, Heffron G, Coote PW, Larda ST, LaPlante SR. Exposing Small-Molecule Nanoentities by a Nuclear Magnetic Resonance Relaxation Assay. J Med Chem 2019;62:7885-7896. [PMID: 31422659 DOI: 10.1021/acs.jmedchem.9b00653] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 01/10/2023]

Yang ZY, Yang ZJ, Dong J, Wang LL, Zhang LX, Ding JJ, Ding XQ, Lu AP, Hou TJ, Cao DS. Structural Analysis and Identification of Colloidal Aggregators in Drug Discovery. J Chem Inf Model 2019;59:3714-3726. [DOI: 10.1021/acs.jcim.9b00541] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Indexed: 12/21/2022]

Kim S. Getting the most out of PubChem for virtual screening. Expert Opin Drug Discov 2016;11:843-55. [PMID: 27454129 PMCID: PMC5045798 DOI: 10.1080/17460441.2016.1216967] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Indexed: 01/21/2023]

Mining Chemical Activity Status from High-Throughput Screening Assays. PLoS One 2015;10:e0144426. [PMID: 26658480 PMCID: PMC4682830 DOI: 10.1371/journal.pone.0144426] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Received: 09/07/2015] [Accepted: 11/18/2015] [Indexed: 01/20/2023]

Irwin JJ, Duan D, Torosyan H, Doak AK, Ziebart KT, Sterling T, Tumanian G, Shoichet BK. An Aggregation Advisor for Ligand Discovery. J Med Chem 2015;58:7076-87. [PMID: 26295373 DOI: 10.1021/acs.jmedchem.5b01105] [Citation(s) in RCA: 309] [Impact Index Per Article: 34.3] [Indexed: 01/08/2023]

Abstract

Colloidal aggregation of organic molecules is the dominant mechanism for artifactual inhibition of proteins, and controls against it are widely deployed. Notwithstanding an increasingly detailed understanding of this phenomenon, a method to reliably predict aggregation has remained elusive. Correspondingly, active molecules that act via aggregation continue to be found in early discovery campaigns and remain common in the literature. Over the past decade, over 12 thousand aggregating organic molecules have been identified, potentially enabling a precedent-based approach to match known aggregators with new molecules that may be expected to aggregate and lead to artifacts. We investigate an approach that uses lipophilicity, affinity, and similarity to known aggregators to advise on the likelihood that a candidate compound is an aggregator. In prospective experimental testing, five of seven new molecules with Tanimoto coefficients (Tc's) between 0.95 and 0.99 to known aggregators aggregated at relevant concentrations. Ten of 19 with Tc's between 0.94 and 0.90 and three of seven with Tc's between 0.89 and 0.85 also aggregated. Another three of the predicted compounds aggregated at higher concentrations. This method finds that 61 827 or 5.1% of the ligands acting in the 0.1 to 10 μM range in the medicinal chemistry literature are at least 85% similar to a known aggregator with these physical properties and may aggregate at relevant concentrations. Intriguingly, only 0.73% of all drug-like commercially available compounds resemble the known aggregators, suggesting that colloidal aggregators are enriched in the literature. As a percentage of the literature, aggregator-like compounds have increased 9-fold since 1995, partly reflecting the advent of high-throughput and virtual screens against molecular targets. 
Emerging from this study is an aggregator advisor database and tool (http://advisor.bkslab.org), free to the community, that may help distinguish between fruitful and artifactual screening hits acting by this mechanism.
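The similarity criterion in this abstract (a Tanimoto coefficient of at least 0.85 to a known aggregator) can be illustrated with a short Python sketch. It assumes each molecule has already been reduced to a fingerprint represented as a set of on-bit indices; the function names and that representation are hypothetical, and the lipophilicity and affinity criteria the abstract also mentions are not modeled. This is not the Aggregator Advisor implementation.

```python
from typing import Iterable, Set


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto coefficient: shared on-bits divided by the union of on-bits."""
    if not a and not b:
        return 0.0  # convention chosen here for two empty fingerprints
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def max_similarity(candidate: Set[int], known: Iterable[Set[int]]) -> float:
    """Highest Tanimoto coefficient between the candidate and any known aggregator."""
    return max((tanimoto(candidate, k) for k in known), default=0.0)


def resembles_known_aggregator(
    candidate: Set[int],
    known: Iterable[Set[int]],
    threshold: float = 0.85,  # the 85% similarity cutoff quoted in the abstract
) -> bool:
    """Flag a candidate whose nearest known aggregator meets the similarity threshold."""
    return max_similarity(candidate, known) >= threshold


# Hypothetical toy fingerprints: 6 shared bits out of 7 in the union, Tc = 6/7.
known_aggregators = [{1, 2, 3, 4, 5, 6}]
flagged = resembles_known_aggregator({1, 2, 3, 4, 5, 6, 7}, known_aggregators)
```

In practice the fingerprints would come from a cheminformatics toolkit, and the resulting coefficient depends on the fingerprint type chosen, so the 0.85 cutoff is only meaningful relative to the fingerprint used in the original study.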

Xie XQ. Exploiting PubChem for Virtual Screening. Expert Opin Drug Discov 2010;5:1205-1220. [PMID: 21691435 PMCID: PMC3117665 DOI: 10.1517/17460441.2010.524924] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Indexed: 11/05/2022]