Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Martin E, Cao E. Euclidean chemical spaces from molecular fingerprints: Hamming distance and Hempel's ravens. J Comput Aided Mol Des 2014;29:387-95. [PMID: 25475496 DOI: 10.1007/s10822-014-9819-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 11/24/2014] [Indexed: 10/24/2022]

For:	Martin E, Cao E. Euclidean chemical spaces from molecular fingerprints: Hamming distance and Hempel's ravens. J Comput Aided Mol Des 2014;29:387-95. [PMID: 25475496 DOI: 10.1007/s10822-014-9819-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 11/24/2014] [Indexed: 10/24/2022]

Number

Cited by Other Article(s)

Akgüller Ö, Balcı MA, Cioca G. Network Models of BACE-1 Inhibitors: Exploring Structural and Biochemical Relationships. Int J Mol Sci 2024;25:6890. [PMID: 38999999 PMCID: PMC11240958 DOI: 10.3390/ijms25136890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/14/2024] [Accepted: 06/21/2024] [Indexed: 07/14/2024] Open

Abstract

This study investigates the clustering patterns of human β-secretase 1 (BACE-1) inhibitors using complex network methodologies based on various distance functions, including Euclidean, Tanimoto, Hamming, and Levenshtein distances. Molecular descriptor vectors such as molecular mass, Merck Molecular Force Field (MMFF) energy, Crippen partition coefficient (ClogP), Crippen molar refractivity (MR), eccentricity, Kappa indices, Synthetic Accessibility Score, Topological Polar Surface Area (TPSA), and 2D/3D autocorrelation entropies are employed to capture the diverse properties of these inhibitors. The Euclidean distance network demonstrates the most reliable clustering results, with strong agreement metrics and minimal information loss, indicating its robustness in capturing essential structural and physicochemical properties. Tanimoto and Hamming distance networks yield valuable clustering outcomes, albeit with moderate performance, while the Levenshtein distance network shows significant discrepancies. The analysis of eigenvector centrality across different networks identifies key inhibitors acting as hubs, which are likely critical in biochemical pathways. Community detection results highlight distinct clustering patterns, with well-defined communities providing insights into the functional and structural groupings of BACE-1 inhibitors. The study also conducts non-parametric tests, revealing significant differences in molecular descriptors, validating the clustering methodology. Despite its limitations, including reliance on specific descriptors and computational complexity, this study offers a comprehensive framework for understanding molecular interactions and guiding therapeutic interventions. Future research could integrate additional descriptors, advanced machine learning techniques, and dynamic network analysis to enhance clustering accuracy and applicability.

Collapse

Xue X, Sun H, Yang M, Liu X, Hu HY, Deng Y, Wang X. Advances in the Application of Artificial Intelligence-Based Spectral Data Interpretation: A Perspective. Anal Chem 2023;95:13733-13745. [PMID: 37688541 DOI: 10.1021/acs.analchem.3c02540] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2023]

Affiliation(s)

Xi Xue State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
Hanyu Sun State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
Minjian Yang State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China Beijing Key Laboratory of Active Substances Discovery and Drugability Evaluation, Department of Medicinal Chemistry, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, P. R. China
Xue Liu State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
Hai-Yu Hu State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China
Yafeng Deng CarbonSilicon AI Technology Co., Ltd. Beijing 100080, China Department of Automation, Tsinghua University, Beijing 100084, China
Xiaojian Wang State Key Laboratory of Bioactive Substances and Functions of Natural Medicines, Institute of Materia Medica, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100050, China CarbonSilicon AI Technology Co., Ltd. Beijing 100080, China

Collapse

Dost K, Pullar-Strecker Z, Brydon L, Zhang K, Hafner J, Riddle PJ, Wicker JS. Combatting over-specialization bias in growing chemical databases. J Cheminform 2023;15:53. [PMID: 37208694 DOI: 10.1186/s13321-023-00716-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 03/25/2023] [Indexed: 05/21/2023] Open

Abstract

BACKGROUND

Predicting in advance the behavior of new chemical compounds can support the design process of new products by directing the research toward the most promising candidates and ruling out others. Such predictive models can be data-driven using Machine Learning or based on researchers' experience and depend on the collection of past results. In either case: models (or researchers) can only make reliable assumptions about compounds that are similar to what they have seen before. Therefore, consequent usage of these predictive models shapes the dataset and causes a continuous specialization shrinking the applicability domain of all trained models on this dataset in the future, and increasingly harming model-based exploration of the space.

PROPOSED SOLUTION

In this paper, we propose CANCELS (CounterActiNg Compound spEciaLization biaS), a technique that helps to break the dataset specialization spiral. Aiming for a smooth distribution of the compounds in the dataset, we identify areas in the space that fall short and suggest additional experiments that help bridge the gap. Thereby, we generally improve the dataset quality in an entirely unsupervised manner and create awareness of potential flaws in the data. CANCELS does not aim to cover the entire compound space and hence retains a desirable degree of specialization to a specified research domain.

RESULTS

An extensive set of experiments on the use-case of biodegradation pathway prediction not only reveals that the bias spiral can indeed be observed but also that CANCELS produces meaningful results. Additionally, we demonstrate that mitigating the observed bias is crucial as it cannot only intervene with the continuous specialization process, but also significantly improves a predictor's performance while reducing the number of required experiments. Overall, we believe that CANCELS can support researchers in their experimentation process to not only better understand their data and potential flaws, but also to grow the dataset in a sustainable way. All code is available under github.com/KatDost/Cancels .

Collapse

Dekker T, Janssen MAC, Sutherland C, Aben RWM, Scheeren HW, Blanco-Ania D, Rutjes FPJT, Wijtmans M, de Esch IJP. An Automated, Open-Source Workflow for the Generation of (3D) Fragment Libraries. ACS Med Chem Lett 2023;14:583-590. [PMID: 37197454 PMCID: PMC10184156 DOI: 10.1021/acsmedchemlett.2c00503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 04/27/2023] [Indexed: 05/19/2023] Open

Cameron AR, Proud AJ, Pearson JK. Machine Learned Composite Methods for Electronic Structure Theory. J Chem Theory Comput 2023;19:51-60. [PMID: 36507875 DOI: 10.1021/acs.jctc.2c00564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Abstract

Because of the prohibitive scaling of ab initio techniques for modeling chemical species with high accuracy, they are not generally tractable for large systems. It is therefore of considerable interest to develop high-accuracy computational models with low computational cost that can afford predictions of electronic structure and properties of macromolecular species. Composite methods, as first introduced by Pople [Pople, J. A.; Head-Gordon, M.; Fox, D. J.; Raghavachari, K.; Curtiss, L. A. J. Chem. Phys.1989, 90, 5622.], are an intuitive solution to this problem as they seek to systematically increase accuracy in model chemistries by taking advantage of favorable error cancellation among reasonably low-cost models. By linearly combining a series of carefully chosen model chemistries, the result of a prohibitive-scaling correlated model chemistry with a large basis set may be approximated with relatively good fidelity. However, the full extent to which the choice of low-cost models dictates the predictive accuracy of composite methods is not known, and a full exploration of all model chemistries would be advantageous for the design and validation of a generalizable composite method for widespread application. Here, we show that remarkable accuracy can be generally achieved with composite methods that are more judiciously constructed, leading to increased accuracy with significantly reduced computational cost. By designing a systematic procedure for the automated generation and assessment of over 10 billion unique composite methods, we have extensively explored the space of modern model chemistries to elucidate important design principles in the construction of reliable composite procedures. We anticipate our work to be the starting point in the pursuit of creative approaches to modeling large chemical systems with high accuracy by using novel combinatorial modeling.

Collapse

Rehioui H, Cuissart B, Ouali A, Lepailleur A, Lamotte JL, Bureau R, Zimmermann A. New Pharmacophore Fingerprints and Weight-matrix Learning for Virtual Screening. Application to Bcr-Abl Data. Mol Inform 2023;42:e2200210. [PMID: 36221998 DOI: 10.1002/minf.202200210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 10/11/2022] [Indexed: 01/20/2023]

Bosc N, Felix E, Arcila R, Mendez D, Saunders MR, Green DVS, Ochoada J, Shelat AA, Martin EJ, Iyer P, Engkvist O, Verras A, Duffy J, Burrows J, Gardner JMF, Leach AR. MAIP: a web service for predicting blood-stage malaria inhibitors. J Cheminform 2021;13:13. [PMID: 33618772 PMCID: PMC7898753 DOI: 10.1186/s13321-021-00487-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 01/20/2021] [Indexed: 12/17/2022] Open

Affiliation(s)

Nicolas Bosc European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom.
Eloy Felix European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
Ricardo Arcila European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
David Mendez European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
Martin R Saunders Department of Molecular Design, Data and Computational Sciences, GlaxoSmithKline, Gunnels Wood Road, Hertfordshire, SG1 2NY, Stevenage, UK
Darren V S Green Department of Molecular Design, Data and Computational Sciences, GlaxoSmithKline, Gunnels Wood Road, Hertfordshire, SG1 2NY, Stevenage, UK
Jason Ochoada Department of Chemical Biology and Therapeutics, St. Jude Children's Research Hospital, 262 Danny Thomas Place, Tennessee, 38105, Memphis, USA
Anang A Shelat Department of Chemical Biology and Therapeutics, St. Jude Children's Research Hospital, 262 Danny Thomas Place, Tennessee, 38105, Memphis, USA
Eric J Martin Novartis Institute for Biomedical Research, 5300 Chiron Way, California, 94608- 2916, Emeryville, USA
Preeti Iyer Hit Discovery, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
Ola Engkvist Hit Discovery, Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
Andreas Verras Schrodinger Inc, 120 West 45th Street, 10036-4041, New York, NY, USA
James Duffy Medicines for Malaria Ventures Discovery, 1215, Geneva, Switzerland
Jeremy Burrows Medicines for Malaria Ventures Discovery, 1215, Geneva, Switzerland
J Mark F Gardner AMG Consultants Ltd, Discovery Park House, Discovery Park, Ramsgate Road, CT13 9ND, Sandwich, Kent, UK
Andrew R Leach European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom.

Collapse

Martin EJ, Jansen JM. Biased Diversity for Effective Virtual Screening. J Chem Inf Model 2020;60:4116-4119. [PMID: 32026691 DOI: 10.1021/acs.jcim.9b01155] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Voicu A, Duteanu N, Voicu M, Vlad D, Dumitrascu V. The rcdk and cluster R packages applied to drug candidate selection. J Cheminform 2020;12:3. [PMID: 33430987 PMCID: PMC6970292 DOI: 10.1186/s13321-019-0405-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 12/20/2019] [Indexed: 11/10/2022] Open

Ehrt C, Brinkjost T, Koch O. Binding site characterization - similarity, promiscuity, and druggability. MEDCHEMCOMM 2019;10:1145-1159. [PMID: 31391887 DOI: 10.1039/c9md00102f] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 05/31/2019] [Indexed: 12/19/2022]

Analysis of Solar Irradiation Time Series Complexity and Predictability by Combining Kolmogorov Measures and Hamming Distance for La Reunion (France). ENTROPY 2018;20:e20080570. [PMID: 33265658 PMCID: PMC7513096 DOI: 10.3390/e20080570] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 07/28/2018] [Accepted: 07/30/2018] [Indexed: 11/16/2022]

Nicholls A. Statistics in molecular modeling: a summary. J Comput Aided Mol Des 2016;30:279-80. [PMID: 27001050 DOI: 10.1007/s10822-016-9907-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 03/02/2016] [Indexed: 10/22/2022]

Awale M, Reymond JL. Similarity Mapplet: Interactive Visualization of the Directory of Useful Decoys and ChEMBL in High Dimensional Chemical Spaces. J Chem Inf Model 2015. [PMID: 26207526 DOI: 10.1021/acs.jcim.5b00182] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Osolodkin DI, Radchenko EV, Orlov AA, Voronkov AE, Palyulin VA, Zefirov NS. Progress in visual representations of chemical space. Expert Opin Drug Discov 2015;10:959-73. [DOI: 10.1517/17460441.2015.1060216] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]