Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Carracedo-Reboredo P, Liñares-Blanco J, Rodríguez-Fernández N, Cedrón F, Novoa FJ, Carballal A, Maojo V, Pazos A, Fernandez-Lozano C. A review on machine learning approaches and trends in drug discovery. Comput Struct Biotechnol J 2021;19:4538-4558. [PMID: 34471498 PMCID: PMC8387781 DOI: 10.1016/j.csbj.2021.08.011] [Citation(s) in RCA: 125] [Impact Index Per Article: 41.7] [Received: 03/01/2021] [Revised: 08/06/2021] [Accepted: 08/06/2021] [Indexed: 12/30/2022] Open

Cited by Other Article(s)

Galati S, Di Stefano M, Bertini S, Granchi C, Giordano A, Gado F, Macchia M, Tuccinardi T, Poli G. Identification of New GSK3β Inhibitors through a Consensus Machine Learning-Based Virtual Screening. Int J Mol Sci 2023;24:17233. [PMID: 38139062 PMCID: PMC10743990 DOI: 10.3390/ijms242417233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/13/2023] [Revised: 12/05/2023] [Accepted: 12/06/2023] [Indexed: 12/24/2023] Open

Wang Y, Wang G, Zhao Y, Wang C, Chen C, Ding Y, Lin J, You J, Gao S, Pang X. A deep learning model for predicting multidrug-resistant organism infection in critically ill patients. J Intensive Care 2023;11:49. [PMID: 37941079 PMCID: PMC10633993 DOI: 10.1186/s40560-023-00695-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2023] [Accepted: 10/12/2023] [Indexed: 11/10/2023] Open

Rodríguez-Belenguer P, March-Vila E, Pastor M, Mangas-Sanjuan V, Soria-Olivas E. Usage of model combination in computational toxicology. Toxicol Lett 2023;389:34-44. [PMID: 37890682 DOI: 10.1016/j.toxlet.2023.10.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/08/2023] [Revised: 10/17/2023] [Accepted: 10/24/2023] [Indexed: 10/29/2023]

Abstract

New Approach Methodologies (NAMs) have ushered in a new era in the field of toxicology, aiming to replace animal testing. However, despite these advancements, they are not exempt from the inherent complexities associated with the study's endpoint. In this review, we have identified three major groups of complexities: mechanistic, chemical space, and methodological. The mechanistic complexity arises from interconnected biological processes within a network that are challenging to model in a single step. In the second group, chemical space complexity exhibits significant dissimilarity between compounds in the training and test series. The third group encompasses algorithmic and molecular descriptor limitations and typical class imbalance problems. To address these complexities, this work provides a guide to the usage of a combination of predictive Quantitative Structure-Activity Relationship (QSAR) models, known as metamodels. This combination of low-level models (LLMs) enables a more precise approach to the problem by focusing on different sub-mechanisms or sub-processes. For mechanistic complexity, multiple Molecular Initiating Events (MIEs) or levels of information are combined to form a mechanistic-based metamodel. Regarding the complexity arising from chemical space, two types of approaches were reviewed to construct a fragment-based chemical space metamodel: those with and without structure sharing. Metamodels with structure sharing utilize unsupervised strategies to identify data patterns and build low-level models for each cluster, which are then combined. For situations without structure sharing due to pharmaceutical industry intellectual property, the use of prediction sharing and federated learning approaches has been reviewed.
Lastly, to tackle methodological complexity, various algorithms are combined to overcome their limitations, diverse descriptors are employed to enhance problem definition and balanced dataset combinations are used to address class imbalance issues (methodological-based metamodels). Remarkably, metamodels consistently outperformed classical QSAR models across all cases, highlighting the importance of alternatives to classical QSAR models when faced with such complexities.
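The idea of combining low-level models into a metamodel can be illustrated with a minimal sketch. It assumes each low-level model outputs a probability for the same endpoint and that the combination rule is a plain average with a 0.5 decision threshold; the function name, the example probabilities, and the averaging rule are illustrative assumptions, not the combination schemes reviewed in the cited paper.

```python
from statistics import mean


def metamodel_predict(low_level_probs, threshold=0.5):
    """Average the probabilities of several low-level models (hypothetical rule).

    Returns the combined score and the resulting binary call.
    """
    if not low_level_probs:
        raise ValueError("at least one low-level model prediction is required")
    combined = mean(low_level_probs)
    return combined, combined >= threshold


# Hypothetical outputs of three low-level models for one compound.
probs = [0.9, 0.4, 0.7]
score, is_positive = metamodel_predict(probs)
print(round(score, 3), is_positive)
```

Other combination rules (majority voting, weighted averaging, or a trained second-level model) fit the same interface; simple averaging is used here only because it is the shortest to show.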

Haddad S, Oktay L, Erol I, Şahin K, Durdagi S. Utilizing Heteroatom Types and Numbers from Extensive Ligand Libraries to Develop Novel hERG Blocker QSAR Models Using Machine Learning-Based Classifiers. ACS Omega 2023;8:40864-40877. [PMID: 37929100 PMCID: PMC10620895 DOI: 10.1021/acsomega.3c06074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/16/2023] [Accepted: 09/13/2023] [Indexed: 11/07/2023]

Liu W, Hopkins AM, Yan P, Du S, Luyt LG, Li Y, Hou J. Can machine learning 'transform' peptides/peptidomimetics into small molecules? A case study with ghrelin receptor ligands. Mol Divers 2023;27:2239-2255. [PMID: 36331785 DOI: 10.1007/s11030-022-10555-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2022] [Accepted: 10/19/2022] [Indexed: 11/06/2022]

Luo L, Wu A, Shu X, Liu L, Feng Z, Zeng Q, Wang Z, Hu T, Cao Y, Tu Y, Li Z. Hub gene identification and molecular subtype construction for Helicobacter pylori in gastric cancer via machine learning methods and NMF algorithm. Aging (Albany NY) 2023;15:11782-11810. [PMID: 37768204 PMCID: PMC10683617 DOI: 10.18632/aging.205053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2023] [Accepted: 07/19/2023] [Indexed: 09/29/2023]

Abstract

Helicobacter pylori (HP) is a gram-negative and spiral-shaped bacterium colonizing the human stomach and has been recognized as the risk factor of gastritis, peptic ulcer disease, and gastric cancer (GC). Moreover, it was recently identified as a class I carcinogen, which affects the occurrence and progression of GC via inducing various oncogenic pathways. Therefore, identifying the HP-related key genes is crucial for understanding the oncogenic mechanisms and improving the outcomes of GC patients. We retrieved the list of HP-related gene sets from the Molecular Signatures Database. Based on the HP-related genes, unsupervised non-negative matrix factorization (NMF) clustering method was conducted to stratify TCGA-STAD, GSE15459, GSE84433 samples into two clusters with distinct clinical outcomes and immune infiltration characterization. Subsequently, two machine learning (ML) strategies, including support vector machine-recursive feature elimination (SVM-RFE) and random forest (RF), were employed to determine twelve hub HP-related genes. Beyond that, receiver operating characteristic and Kaplan-Meier curves further confirmed the diagnostic value and prognostic significance of hub genes. Finally, expression of HP-related hub genes was tested by qRT-PCR array and immunohistochemical images. Additionally, functional pathway enrichment analysis indicated that these hub genes were implicated in the genesis and progression of GC by activating or inhibiting the classical cancer-associated pathways, such as epithelial-mesenchymal transition, cell cycle, apoptosis, RAS/MAPK, etc. In the present study, we constructed a novel HP-related tumor classification in different datasets, and screened out twelve hub genes via performing the ML algorithms, which may contribute to the molecular diagnosis and personalized therapy of GC.

Affiliation(s)

Lianghua Luo: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Ahao Wu: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Xufeng Shu: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Li Liu: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Zongfeng Feng: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Qingwen Zeng: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Zhonghao Wang: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Tengcheng Hu: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China; Medical Innovation Center, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Yi Cao: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Yi Tu: Department of Pathology, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China
Zhengrong Li: Department of General Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, Jiangxi, China

Khondkaryan L, Tevosyan A, Navasardyan H, Khachatrian H, Tadevosyan G, Apresyan L, Chilingaryan G, Navoyan Z, Stopper H, Babayan N. Datasets Construction and Development of QSAR Models for Predicting Micronucleus In Vitro and In Vivo Assay Outcomes. Toxics 2023;11:785. [PMID: 37755795 PMCID: PMC10537630 DOI: 10.3390/toxics11090785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/16/2023] [Revised: 09/07/2023] [Accepted: 09/11/2023] [Indexed: 09/28/2023]

Hosseini-Gerami L, Hernansaiz Ballesteros R, Liu A, Broughton H, Collier DA, Bender A. MAVEN: compound mechanism of action analysis and visualisation using transcriptomics and compound structure data in R/Shiny. BMC Bioinformatics 2023;24:344. [PMID: 37715141 PMCID: PMC10502988 DOI: 10.1186/s12859-023-05416-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/27/2022] [Accepted: 07/18/2023] [Indexed: 09/17/2023] Open

Rampogu S, Shaik MR, Khan M, Khan M, Oh TH, Shaik B. CBPDdb: a curated database of compounds derived from Coumarin-Benzothiazole-Pyrazole. Database (Oxford) 2023;2023:baad062. [PMID: 37702993 PMCID: PMC10498939 DOI: 10.1093/database/baad062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Revised: 08/01/2023] [Accepted: 08/26/2023] [Indexed: 09/14/2023]

Han R, Yoon H, Kim G, Lee H, Lee Y. Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery. Pharmaceuticals (Basel) 2023;16:1259. [PMID: 37765069 PMCID: PMC10537003 DOI: 10.3390/ph16091259] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/27/2023] [Revised: 08/24/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open

Wang Z, Zhou L, Hao W, Liu Y, Xiao X, Shan X, Zhang C, Wei B. Comparative antioxidant activity and untargeted metabolomic analyses of cherry extracts of two Chinese cherry species based on UPLC-QTOF/MS and machine learning algorithms. Food Res Int 2023;171:113059. [PMID: 37330825 DOI: 10.1016/j.foodres.2023.113059] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/20/2023] [Revised: 05/03/2023] [Accepted: 05/26/2023] [Indexed: 06/19/2023]

Gorostiola González M, van den Broek RL, Braun TGM, Chatzopoulou M, Jespers W, IJzerman AP, Heitman LH, van Westen GJP. 3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors. J Cheminform 2023;15:74. [PMID: 37641107 PMCID: PMC10463931 DOI: 10.1186/s13321-023-00745-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/11/2023] [Accepted: 08/10/2023] [Indexed: 08/31/2023] Open

Abstract

Proteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In PCM features are computed to describe small molecules and proteins, which directly impact the quality of the predictive models. State-of-the-art protein descriptors, however, are calculated from the protein sequence and neglect the dynamic nature of proteins. This dynamic nature can be computationally simulated with molecular dynamics (MD). Here, novel 3D dynamic protein descriptors (3DDPDs) were designed to be applied in bioactivity prediction tasks with PCM models. As a test case, publicly available G protein-coupled receptor (GPCR) MD data from GPCRmd was used. GPCRs are membrane-bound proteins, which are activated by hormones and neurotransmitters, and constitute an important target family for drug discovery. GPCRs exist in different conformational states that allow the transmission of diverse signals and that can be modified by ligand interactions, among other factors. To translate the MD-encoded protein dynamics two types of 3DDPDs were considered: one-hot encoded residue-specific (rs) and embedding-like protein-specific (ps) 3DDPDs. The descriptors were developed by calculating distributions of trajectory coordinates and partial charges, applying dimensionality reduction, and subsequently condensing them into vectors per residue or protein, respectively. 3DDPDs were benchmarked on several PCM tasks against state-of-the-art non-dynamic protein descriptors. Our rs- and ps3DDPDs outperformed non-dynamic descriptors in regression tasks using a temporal split and showed comparable performance with a random split and in all classification tasks. Combinations of non-dynamic descriptors with 3DDPDs did not result in increased performance. Finally, the power of 3DDPDs to capture dynamic fluctuations in mutant GPCRs was explored. 
The results presented here show the potential of including protein dynamic information on machine learning tasks, specifically bioactivity prediction, and open opportunities for applications in drug discovery, including oncology.

Kong X, Lin K, Wu G, Tao X, Zhai X, Lv L, Dong D, Zhu Y, Yang S. Machine Learning Techniques Applied to the Study of Drug Transporters. Molecules 2023;28:5936. [PMID: 37630188 PMCID: PMC10459831 DOI: 10.3390/molecules28165936] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 06/30/2023] [Revised: 07/27/2023] [Accepted: 08/02/2023] [Indexed: 08/27/2023] Open

Elkashlan M, Ahmad RM, Hajar M, Al Jasmi F, Corchado JM, Nasarudin NA, Mohamad MS. A review of SARS-CoV-2 drug repurposing: databases and machine learning models. Front Pharmacol 2023;14:1182465. [PMID: 37601065 PMCID: PMC10436567 DOI: 10.3389/fphar.2023.1182465] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 03/21/2023] [Accepted: 07/06/2023] [Indexed: 08/22/2023] Open

Du Y, Hua Z, Liu C, Lv R, Jia W, Su M. ATR-FTIR combined with machine learning for the fast non-targeted screening of new psychoactive substances. Forensic Sci Int 2023;349:111761. [PMID: 37327724 DOI: 10.1016/j.forsciint.2023.111761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/07/2023] [Revised: 05/15/2023] [Accepted: 06/06/2023] [Indexed: 06/18/2023]

Zhao J, Shi X, Wang Z, Xiong S, Lin Y, Wei X, Li Y, Tang X. Hepatotoxicity assessment investigations on PFASs targeting L-FABP using binding affinity data and machine learning-based QSAR model. Ecotoxicology and Environmental Safety 2023;262:115310. [PMID: 37523843 DOI: 10.1016/j.ecoenv.2023.115310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2023] [Revised: 07/23/2023] [Accepted: 07/27/2023] [Indexed: 08/02/2023]

Zhu Q, Gao S, Xiao B, He Z, Hu S. Plasmer: an Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-mers and Genomic Features. Microbiol Spectr 2023;11:e0464522. [PMID: 37191574 PMCID: PMC10269668 DOI: 10.1128/spectrum.04645-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/14/2022] [Accepted: 04/26/2023] [Indexed: 05/17/2023] Open

Abstract

Identification of plasmids in bacterial genomes is critical for many factors, including horizontal gene transfer, antibiotic resistance genes, host-microbe interactions, cloning vectors, and industrial production. There are several in silico methods to predict plasmid sequences in assembled genomes. However, existing methods have evident shortcomings, such as an imbalance between sensitivity and specificity, dependency on species-specific models, and reduced performance on sequences shorter than 10 kb, which have limited their scope of applicability. In this work, we proposed Plasmer, a novel plasmid predictor based on machine learning of shared k-mers and genomic features. Unlike existing k-mer or genomic-feature based methods, Plasmer employs the random forest algorithm to make predictions using the percent of shared k-mers with plasmid and chromosome databases combined with other genomic features, including alignment E value and replicon distribution scores (RDS). Plasmer can predict on multiple species and has achieved an average area under the curve (AUC) of 0.996 with an accuracy of 98.4%. Compared to existing methods, tests of both sliding sequences and simulated and de novo assemblies have consistently shown that Plasmer has superior accuracy and stable performance across long and short contigs above 500 bp, demonstrating its applicability for fragmented assemblies. Plasmer also has excellent and balanced performance on both sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which has eliminated the bias on sensitivity or specificity that was common in existing methods. Plasmer also provides taxonomy classification to help identify the origin of plasmids.
IMPORTANCE: In this study, we proposed a novel plasmid prediction tool named Plasmer. Technically, unlike existing k-mer or genomic features-based methods, Plasmer is the first tool to combine the advantages of the percent of shared k-mers and the alignment score of genomic features. This has given Plasmer (i) evident improvement in performance compared to other methods, with the best F1-score and accuracy on sliding sequences, simulated contigs, and de novo assemblies; (ii) applicability for contigs above 500 bp with highest accuracy, enabling plasmid prediction in fragmented short-read assemblies; (iii) excellent and balanced performance between sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which eliminated the bias on sensitivity or specificity that commonly existed in other methods; and (iv) no dependency on species-specific training models. We believe that Plasmer provides a more reliable alternative for plasmid prediction in bacterial genome assemblies.
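The "percent of shared k-mers" feature mentioned in this abstract can be sketched roughly as follows. This is an illustrative toy, not Plasmer's implementation: the value of k, the tiny reference sequences, the function names, and the choice to count distinct k-mers on one strand only are all assumptions made for the example.

```python
def kmers(seq, k):
    """Return the set of distinct k-mers in a sequence (single strand only)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def percent_shared_kmers(query, reference_kmers, k):
    """Percentage of the query's distinct k-mers that occur in a reference set."""
    query_kmers = kmers(query, k)
    if not query_kmers:
        return 0.0
    shared = query_kmers & reference_kmers
    return 100.0 * len(shared) / len(query_kmers)


# Hypothetical toy "databases"; real ones would hold many genomes.
K = 4
plasmid_db = kmers("ATGCGTACGTTAGC", K)
chromosome_db = kmers("GGGCCCAAATTTGGG", K)

contig = "GCGTACGT"
plasmid_pct = percent_shared_kmers(contig, plasmid_db, K)
chromosome_pct = percent_shared_kmers(contig, chromosome_db, K)
print(plasmid_pct, chromosome_pct)
```

In a classifier such as the random forest described in the abstract, the two percentages would be two input features among several; the other features (alignment E value, replicon distribution scores) are not reproduced here.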

Bo T, Lin Y, Han J, Hao Z, Liu J. Machine learning-assisted data filtering and QSAR models for prediction of chemical acute toxicity on rat and mouse. Journal of Hazardous Materials 2023;452:131344. [PMID: 37027914 DOI: 10.1016/j.jhazmat.2023.131344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2022] [Revised: 03/20/2023] [Accepted: 03/31/2023] [Indexed: 05/03/2023]

Lunghini F, Fava A, Pisapia V, Sacco F, Iaconis D, Beccari AR. ProfhEX: AI-based platform for small molecules liability profiling. J Cheminform 2023;15:60. [PMID: 37296454 DOI: 10.1186/s13321-023-00728-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2022] [Accepted: 05/28/2023] [Indexed: 06/12/2023] Open

Abstract

Off-target drug interactions are a major reason for candidate failure in the drug discovery process. Anticipating a drug's potential adverse effects in the early stages is necessary to minimize health risks to patients, animal testing, and economic costs. With the constantly increasing size of virtual screening libraries, AI-driven methods can be exploited as first-tier screening tools to provide liability estimation for drug candidates. In this work we present ProfhEX, an AI-driven suite of 46 OECD-compliant machine learning models that can profile small molecules on 7 relevant liability groups: cardiovascular, central nervous system, gastrointestinal, endocrine, renal, pulmonary and immune system toxicities. Experimental affinity data was collected from public and commercial data sources. The entire chemical space comprised 289,202 activity data for a total of 210,116 unique compounds, spanning 46 targets with dataset sizes ranging from 819 to 18,896. Gradient boosting and random forest algorithms were initially employed and ensembled for the selection of a champion model. Models were validated according to the OECD principles, including robust internal (cross validation, bootstrap, y-scrambling) and external validation. Champion models achieved an average Pearson correlation coefficient of 0.84 (SD of 0.05), an R² determination coefficient of 0.68 (SD of 0.1) and a root mean squared error of 0.69 (SD of 0.08). All liability groups showed good hit-detection power with an average enrichment factor at 5% of 13.1 (SD of 4.5) and AUC of 0.92 (SD of 0.05). Benchmarking against already existing tools demonstrated the predictive power of ProfhEX models for large-scale liability profiling. This platform will be further expanded with the inclusion of new targets and through complementary modelling approaches, such as structure and pharmacophore-based models. ProfhEX is freely accessible at the following address: https://profhex.exscalate.eu/ .
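The "enrichment factor at 5%" reported in this abstract is a standard virtual-screening metric: the hit rate among the top-ranked 5% of compounds divided by the hit rate in the whole set. A minimal sketch follows, assuming higher scores mean "more likely active" and that the top fraction is rounded to a whole number of compounds; the function name and the example data are hypothetical and are not taken from ProfhEX.

```python
def enrichment_factor(scores, labels, fraction=0.05):
    """Hit rate in the top `fraction` of ranked compounds over the overall hit rate.

    labels are 1 for actives and 0 for inactives; higher score ranks first.
    """
    if len(scores) != len(labels) or not scores:
        raise ValueError("scores and labels must be non-empty and equally long")
    total_hits = sum(labels)
    if total_hits == 0:
        raise ValueError("enrichment factor is undefined without any actives")
    n_total = len(scores)
    n_top = max(1, round(fraction * n_total))
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_hits = sum(label for _, label in ranked[:n_top])
    # (top_hits / n_top) / (total_hits / n_total), rearranged to one division.
    return (top_hits * n_total) / (n_top * total_hits)


# Hypothetical screen: 100 compounds, 10 actives, 4 of them ranked in the top 5.
example_scores = [100 - i for i in range(100)]
active_ranks = {0, 1, 2, 3, 50, 51, 52, 53, 54, 55}
example_labels = [1 if i in active_ranks else 0 for i in range(100)]
ef5 = enrichment_factor(example_scores, example_labels, fraction=0.05)
print(ef5)
```

A value of 1 means the ranking is no better than random selection; the maximum attainable value depends on the fraction and on the proportion of actives in the set.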

Hou R, Xie C, Gui Y, Li G, Li X. Machine-Learning-Based Data Analysis Method for Cell-Based Selection of DNA-Encoded Libraries. ACS Omega 2023;8:19057-19071. [PMID: 37273617 PMCID: PMC10233830 DOI: 10.1021/acsomega.3c02152] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 06/06/2023]

Abstract

DNA-encoded library (DEL) is a powerful ligand discovery technology that has been widely adopted in the pharmaceutical industry. DEL selections are typically performed with a purified protein target immobilized on a matrix or in solution phase. Recently, DELs have also been used to interrogate the targets in the complex biological environment, such as membrane proteins on live cells. However, due to the complex landscape of the cell surface, the selection inevitably involves significant nonspecific interactions, and the selection data are much noisier than the ones with purified proteins, making reliable hit identification highly challenging. Researchers have developed several approaches to denoise DEL datasets, but it remains unclear whether they are suitable for cell-based DEL selections. Here, we report the proof-of-principle of a new machine-learning (ML)-based approach to process cell-based DEL selection datasets by using a Maximum A Posteriori (MAP) estimation loss function, a probabilistic framework that can account for and quantify uncertainties of noisy data. We applied the approach to a DEL selection dataset, where a library of 7,721,415 compounds was selected against a purified carbonic anhydrase 2 (CA-2) and a cell line expressing the membrane protein carbonic anhydrase 12 (CA-12). The extended-connectivity fingerprint (ECFP)-based regression model using the MAP loss function was able to identify true binders and also reliable structure-activity relationship (SAR) from the noisy cell-based selection datasets. In addition, the regularized enrichment metric (known as MAP enrichment) could also be calculated directly without involving the specific machine-learning model, effectively suppressing low-confidence outliers and enhancing the signal-to-noise ratio. Future applications of this method will focus on de novo ligand discovery from cell-based DEL selections.

Guha R, Velegol D. Harnessing Shannon entropy-based descriptors in machine learning models to enhance the prediction accuracy of molecular properties. J Cheminform 2023;15:54. [PMID: 37211605 DOI: 10.1186/s13321-023-00712-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/07/2022] [Accepted: 03/18/2023] [Indexed: 05/23/2023] Open

Kao PY, Yang YC, Chiang WY, Hsiao JY, Cao Y, Aliper A, Ren F, Aspuru-Guzik A, Zhavoronkov A, Hsieh MH, Lin YC. Exploring the Advantages of Quantum Generative Adversarial Networks in Generative Chemistry. J Chem Inf Model 2023. [PMID: 37171372 DOI: 10.1021/acs.jcim.3c00562] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Indexed: 05/13/2023]

Abstract

De novo drug design with desired biological activities is crucial for developing novel therapeutics for patients. The drug development process is time- and resource-consuming, and it has a low probability of success. Recent advances in machine learning and deep learning technology have reduced the time and cost of the discovery process and therefore, improved pharmaceutical research and development. In this paper, we explore the combination of two rapidly developing fields with lead candidate discovery in the drug development process. First, artificial intelligence has already been demonstrated to successfully accelerate conventional drug design approaches. Second, quantum computing has demonstrated promising potential in different applications, such as quantum chemistry, combinatorial optimizations, and machine learning. This article explores hybrid quantum-classical generative adversarial networks (GAN) for small molecule discovery. We substituted each element of GAN with a variational quantum circuit (VQC) and demonstrated the quantum advantages in the small drug discovery. Utilizing a VQC in the noise generator of a GAN to generate small molecules achieves better physicochemical properties and performance in the goal-directed benchmark than the classical counterpart. Moreover, we demonstrate the potential of a VQC with only tens of learnable parameters in the generator of GAN to generate small molecules. We also demonstrate the quantum advantage of a VQC in the discriminator of GAN. In this hybrid model, the number of learnable parameters is significantly less than the classical ones, and it can still generate valid molecules. The hybrid model with only tens of training parameters in the quantum discriminator outperforms the MLP-based one in terms of both generated molecule properties and the achieved KL divergence. However, the hybrid quantum-classical GANs still face challenges in generating unique and valid molecules compared to their classical counterparts.
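The abstract cites KL divergence as one of the measures used to compare generators. As a reminder of what that quantity computes, here is a minimal sketch for two discrete distributions, for example binned histograms of a molecular property in a reference set and in a generated set. The direction D(P || Q) with P as the reference, the natural logarithm, and the example histograms are assumptions for illustration; the paper's exact benchmark definition is not reproduced here.

```python
import math


def kl_divergence(p, q):
    """Discrete Kullback-Leibler divergence D(P || Q) in nats.

    p and q are sequences of probabilities over the same bins.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same number of bins")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue  # a zero-probability bin in P contributes nothing
        if q_i == 0:
            return math.inf  # P puts mass where Q has none
        total += p_i * math.log(p_i / q_i)
    return total


# Hypothetical two-bin property histograms.
reference = [0.5, 0.5]
generated = [0.25, 0.75]
divergence = kl_divergence(reference, generated)
print(round(divergence, 4))
```

The divergence is zero only when the two distributions match, and it is not symmetric, so swapping the arguments generally gives a different value.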

Li X, Wang H, Jiang M, Ding M, Xu X, Xu B, Zou Y, Yu Y, Yang W. Collision Cross Section Prediction Based on Machine Learning. Molecules 2023;28:4050. [PMID: 37241791 DOI: 10.3390/molecules28104050] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 04/13/2023] [Revised: 05/10/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open

Affiliation(s)

Xiaohang Li: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Hongda Wang: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Meiting Jiang: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Mengxiang Ding: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Xiaoyan Xu: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Bei Xu: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Yadan Zou: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Yuetong Yu: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China
Wenzhi Yang: State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China; Haihe Laboratory of Modern Chinese Medicine, Tianjin University of Traditional Chinese Medicine, 10 Poyanghu Road, Tianjin 301617, China

Nemoto S, Mizuno T, Kusuhara H. Investigation of chemical structure recognition by encoder-decoder models in learning progress. J Cheminform 2023;15:45. [PMID: 37046349 PMCID: PMC10100163 DOI: 10.1186/s13321-023-00713-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 03/18/2023] [Indexed: 04/14/2023] Open

Gholampour M, Seradj H, Sakhteman A. Structure-Selectivity Relationship Prediction of Tau Imaging Tracers Using Machine Learning-Assisted QSAR Models and Interaction Fingerprint Map. ACS Chem Neurosci 2023. [PMID: 37037183 DOI: 10.1021/acschemneuro.3c00038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2023] Open

Abstract

Protein aggregates composed of tau fibrils are major pathologic findings in different tauopathies. An ideal agent for imaging tau fibrils must be highly selective. The molecular basis for the binding of currently available compounds to tau aggregates is not well understood. Herein, we provide insights into previously studied positron emission tomography tracers using various computational methods, including machine learning-based quantitative structure-activity relationship (QSAR) classification, docking, and molecular dynamics (MD) simulations to investigate the structural basis of selective tau aggregate binding for potential compounds. The QSAR classification model based on the Random Forest algorithm with an accuracy of 96.6% for the selective and 97.6% for the nonselective class of compounds revealed essential selective moieties. The combination of molecular docking, MD simulations, and molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) binding free-energy calculation showed superior binding energy of ligand 63 toward tau and PHF6, a key hexapeptide in tau aggregation, as the most selective compound in the data set. Dissecting the binding properties of ligand 63 and ligand 8 (the least selective compound) within tau and Aβ structures confirmed that these two compounds favor different binding sites of tau; however, the preferential binding site in Aβ was similar for both with lower binding energies calculated for ligand 8. Results revealed that the number of N-heterocycles, the position of nitrogen atoms, and the presence of a tertiary amine are important components of selective binding moieties, and they should be maintained in molecules for selective binding to tau aggregates. The predicted structure-selectivity relationship will facilitate the rational design and further development of selective tau imaging agents.

Zhang T, Mo Q, Jiang N, Wu Y, Yang X, Chen W, Li Q, Yang S, Yang J, Zeng J, Huang F, Huang Q, Luo J, Wu J, Wang L. The combination of machine learning and transcriptomics reveals a novel megakaryopoiesis inducer, MO-A, that promotes thrombopoiesis by activating FGF1/FGFR1/PI3K/Akt/NF-κB signaling. Eur J Pharmacol 2023;944:175604. [PMID: 36804544 DOI: 10.1016/j.ejphar.2023.175604] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 01/20/2023] [Accepted: 02/16/2023] [Indexed: 02/19/2023]

Affiliation(s)

Ting Zhang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Qi Mo Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Nan Jiang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Yuesong Wu Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Xin Yang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Wang Chen Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Qinyao Li Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Shuo Yang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Jing Yang Department of Pharmacy, Chengdu Fifth People's Hospital, Chengdu University of Traditional Chinese Medicine, Chengdu, Sichuan, 611137, China
Jing Zeng Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Feihong Huang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Qianqian Huang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China
Jiesi Luo Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China.
Jianming Wu Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China; School of Basic Medical Sciences, Southwest Medical University, Luzhou, Sichuan, 646000, China; Education Ministry Key Laboratory of Medical Electrophysiology, Sichuan Key Medical Laboratory of New Drug Discovery and Druggability Evaluation, Luzhou Key Laboratory of Activity Screening and Druggability Evaluation for Chinese Materia Medica, Southwest Medical University, Luzhou, Sichuan, 646000, China.
Long Wang Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, 646000, China.

Jaramillo DN, Millán D, Guevara-Pulido J. Design, synthesis and cytotoxic evaluation of a selective serotonin reuptake inhibitor (SSRI) by virtual screening. Eur J Pharm Sci 2023;183:106403. [PMID: 36758772 DOI: 10.1016/j.ejps.2023.106403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 01/24/2023] [Accepted: 02/06/2023] [Indexed: 02/11/2023]

Lien ST, Lin TE, Hsieh JH, Sung TY, Chen JH, Hsu KC. Establishment of extensive artificial intelligence models for kinase inhibitor prediction: Identification of novel PDGFRB inhibitors. Comput Biol Med 2023;156:106722. [PMID: 36878123 DOI: 10.1016/j.compbiomed.2023.106722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 02/16/2023] [Accepted: 02/26/2023] [Indexed: 03/06/2023]

Verhaegen F, Butterworth KT, Chalmers AJ, Coppes RP, de Ruysscher D, Dobiasch S, Fenwick JD, Granton PV, Heijmans SHJ, Hill MA, Koumenis C, Lauber K, Marples B, Parodi K, Persoon LCGG, Staut N, Subiel A, Vaes RDW, van Hoof S, Verginadis IL, Wilkens JJ, Williams KJ, Wilson GD, Dubois LJ. Roadmap for precision preclinical x-ray radiation studies. Phys Med Biol 2023;68:06RM01. [PMID: 36584393 DOI: 10.1088/1361-6560/acaf45] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 12/30/2022] [Indexed: 12/31/2022]

Affiliation(s)

Frank Verhaegen MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands SmART Scientific Solutions BV, Maastricht, The Netherlands
Karl T Butterworth Patrick G. Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, Northern Ireland, United Kingdom
Anthony J Chalmers School of Cancer Sciences, University of Glasgow, Glasgow G61 1QH, United Kingdom
Rob P Coppes Departments of Biomedical Sciences of Cells & Systems, Section Molecular Cell Biology and Radiation Oncology, University Medical Center Groningen, University of Groningen, 9700 AD Groningen, The Netherlands
Dirk de Ruysscher MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands
Sophie Dobiasch Department of Radiation Oncology, Technical University of Munich (TUM), School of Medicine and Klinikum rechts der Isar, Germany Department of Medical Physics, Institute of Radiation Medicine (IRM), Department of Radiation Sciences (DRS), Helmholtz Zentrum München, Germany
John D Fenwick Department of Medical Physics & Biomedical Engineering, University College London, Malet Place Engineering Building, London WC1E 6BT, United Kingdom
Patrick V Granton Department of Radiation Therapy, ErasmusMC, The Netherlands
Stefan H J Heijmans Demcon Advanced Mechatronics Best B.V., Best, The Netherlands
Mark A Hill MRC Oxford Institute for Radiation Oncology, University of Oxford, ORCRB Roosevelt Drive, Oxford OX3 7DQ, United Kingdom
Constantinos Koumenis Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Kirsten Lauber Department of Radiation Oncology, University Hospital, LMU München, Munich, Germany German Cancer Consortium (DKTK), Partner site Munich, Germany
Brian Marples Department of Radiation Oncology, University of Rochester, NY, United States of America
Katia Parodi German Cancer Consortium (DKTK), Partner site Munich, Germany Department of Medical Physics, Faculty of Physics, Ludwig-Maximilians-Universität München, Garching b. Munich, Germany
Lucas C G G Persoon Demcon Advanced Mechatronics Best B.V., Best, The Netherlands
Nick Staut SmART Scientific Solutions BV, Maastricht, The Netherlands
Anna Subiel National Physical Laboratory, Medical Radiation Science Hampton Road, Teddington, Middlesex, TW11 0LW, United Kingdom
Rianne D W Vaes MAASTRO Clinic, Radiotherapy Division, GROW-School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, The Netherlands
Stefan van Hoof SmART Scientific Solutions BV, Maastricht, The Netherlands
Ioannis L Verginadis Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
Jan J Wilkens Department of Radiation Oncology, Technical University of Munich (TUM), School of Medicine and Klinikum rechts der Isar, Germany Physics Department, Technical University of Munich (TUM), Germany
Kaye J Williams Division of Pharmacy and Optometry, University of Manchester, Manchester, United Kingdom
George D Wilson Department of Radiation Oncology, Beaumont Health, MI, United States of America Henry Ford Health, Detroit, MI, United States of America
Ludwig J Dubois The M-Lab, Department of Precision Medicine, GROW-School for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands

Zheng W, Chen Q, Yao L, Zhuang J, Huang J, Hu Y, Yu S, Chen T, Wei N, Zeng Y, Zhang Y, Fan C, Wang Y. Prediction Models for Sleep Quality Among College Students During the COVID-19 Outbreak: Cross-sectional Study Based on the Internet New Media. J Med Internet Res 2023;25:e45721. [PMID: 36961495 PMCID: PMC10131726 DOI: 10.2196/45721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 02/18/2023] Open

Abstract

BACKGROUND

COVID-19 has been reported to affect the sleep quality of Chinese residents; however, the epidemic's effects on the sleep quality of college students during closed-loop management remain unclear, and a screening tool is lacking.

OBJECTIVE

This study aimed to understand the sleep quality of college students in Fujian Province during the epidemic and determine sensitive variables, in order to develop an efficient prediction model for the early screening of sleep problems in college students.

METHODS

From April 5 to 16, 2022, a cross-sectional internet-based survey was conducted. The Pittsburgh Sleep Quality Index (PSQI) scale, a self-designed general data questionnaire, and the sleep quality influencing factor questionnaire were used to understand the sleep quality of respondents in the previous month. A chi-square test and a multivariate unconditioned logistic regression analysis were performed, and influencing factors obtained were applied to develop prediction models. The data were divided into a training-testing set (n=14,451, 70%) and an independent validation set (n=6194, 30%) by stratified sampling. Four models using logistic regression, an artificial neural network, random forest, and naïve Bayes were developed and validated.

RESULTS

In total, 20,645 subjects were included in this survey, with a mean global PSQI score of 6.02 (SD 3.112). The sleep disturbance rate was 28.9% (n=5972, defined as a global PSQI score >7 points). A total of 11 variables related to sleep quality were taken as parameters of the prediction models, including age, gender, residence, specialty, respiratory history, coffee consumption, staying up late, long hours on the internet, sudden changes, fear of infection, and impatience with closed-loop management. Among the generated models, the artificial neural network model proved to be the best, with an area under the curve, accuracy, sensitivity, specificity, positive predictive value, and negative predictive value of 0.713, 73.52%, 25.51%, 92.58%, 57.71%, and 75.79%, respectively. It is noteworthy that the logistic regression, random forest, and naïve Bayes models achieved high specificities of 94.41%, 94.77%, and 86.40%, respectively.

CONCLUSIONS

The COVID-19 containment measures affected the sleep quality of college students on multiple levels, indicating that targeted university management and social support are desirable. The artificial neural network model presented excellent predictive efficiency and supports implementing measures earlier in order to improve present conditions.

Mirza Z, Karim S. Structure-Based Profiling of Potential Phytomolecules with AKT1 a Key Cancer Drug Target. Molecules 2023;28:2597. [PMID: 36985568 PMCID: PMC10051420 DOI: 10.3390/molecules28062597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/07/2023] [Accepted: 03/09/2023] [Indexed: 03/14/2023] Open

Yu T, Nantasenamat C, Kachenton S, Anuwongcharoen N, Piacham T. Cheminformatic Analysis and Machine Learning Modeling to Investigate Androgen Receptor Antagonists to Combat Prostate Cancer. ACS Omega 2023;8:6729-6742. [PMID: 36844574 PMCID: PMC9948163 DOI: 10.1021/acsomega.2c07346] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 02/01/2023] [Indexed: 06/18/2023]

Abstract

Prostate cancer (PCa) is a leading cause of cancer mortality among males. There have been numerous studies to develop antagonists against the androgen receptor (AR), a crucial therapeutic target for PCa. This study is a systematic cheminformatic analysis and machine learning modeling to study the chemical space, scaffolds, structure-activity relationship, and landscape of human AR antagonists. The final data set contains 1678 molecules. Chemical space visualization by physicochemical properties has demonstrated that molecules from the potent/active class generally have a mildly smaller molecular weight (MW), octanol-water partition coefficient (log P), number of hydrogen-bond acceptors (nHA), number of rotatable bonds (nRot), and topological polar surface area (TPSA) than molecules from the intermediate/inactive class. The chemical space visualization in the principal component analysis (PCA) plot shows significant overlapping distributions between potent/active class molecules and intermediate/inactive class molecules; potent/active class molecules are intensively distributed, while intermediate/inactive class molecules are widely and sparsely distributed. Murcko scaffold analysis has shown low scaffold diversity in general, and scaffold diversity of potent/active class molecules is even lower than that of intermediate/inactive class molecules, indicating the necessity for developing molecules with novel scaffolds. Furthermore, scaffold visualization has identified 16 representative Murcko scaffolds. Among them, scaffolds 1, 2, 3, 4, 7, 8, 10, 11, 15, and 16 are highly favorable scaffolds due to their high scaffold enrichment factor values. Based on scaffold analysis, their local structure-activity relationships (SARs) were investigated and summarized. In addition, the global SAR landscape was explored by quantitative structure-activity relationship (QSAR) modeling and structure-activity landscape visualization. A QSAR classification model incorporating all of the 1678 molecules stands out as the best model from a total of 12 candidate models for AR antagonists (built on PubChem fingerprint, extra trees algorithm, accuracy for training set: 0.935, 10-fold cross-validation set: 0.735 and test set: 0.756). Deeper insights into the structure-activity landscape highlighted a total of seven significant activity cliff (AC) generators (ChEMBL molecule IDs: 160257, 418198, 4082265, 348918, 390728, 4080698, and 6530), which provide valuable SAR information for medicinal chemistry. The findings in this study provide new insights and guidelines for hit identification and lead optimization for the development of novel AR antagonists.

Comparative Studies on Resampling Techniques in Machine Learning and Deep Learning Models for Drug-Target Interaction Prediction. Molecules 2023;28:1663. [PMID: 36838652 PMCID: PMC9964614 DOI: 10.3390/molecules28041663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 02/12/2023] Open

Exploring the Chemical Space of CYP17A1 Inhibitors Using Cheminformatics and Machine Learning. Molecules 2023;28:1679. [PMID: 36838665 PMCID: PMC9966999 DOI: 10.3390/molecules28041679] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/01/2023] [Accepted: 01/12/2023] [Indexed: 02/12/2023] Open

A deep learning-based framework for automatic detection of drug resistance in tuberculosis patients. Egyptian Informatics Journal 2023. [DOI: 10.1016/j.eij.2023.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Firooz A, Funkhouser AT, Martin JC, Edenfield WJ, Valafar H, Blenda AV. Comprehensive and User-Analytics-Friendly Cancer Patient Database for Physicians and Researchers. arXiv 2023:arXiv:2302.01337v1. [PMID: 36776819 PMCID: PMC9915752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]

Mirzaei M, Furxhi I, Murphy F, Mullins M. Employing Supervised Algorithms for the Prediction of Nanomaterial's Antioxidant Efficiency. Int J Mol Sci 2023;24:2792. [PMID: 36769135 PMCID: PMC9918003 DOI: 10.3390/ijms24032792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 01/25/2023] [Accepted: 01/29/2023] [Indexed: 02/05/2023] Open

Vemula D, Jayasurya P, Sushmitha V, Kumar YN, Bhandari V. CADD, AI and ML in drug discovery: A comprehensive review. Eur J Pharm Sci 2023;181:106324. [PMID: 36347444 DOI: 10.1016/j.ejps.2022.106324] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 10/26/2022] [Accepted: 11/03/2022] [Indexed: 11/06/2022]

McNair D. Artificial Intelligence and Machine Learning for Lead-to-Candidate Decision-Making and Beyond. Annu Rev Pharmacol Toxicol 2023;63:77-97. [PMID: 35679624 DOI: 10.1146/annurev-pharmtox-051921-023255] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Yeh KB, Parekh FK, Mombo I, Leimer J, Hewson R, Olinger G, Fair JM, Sun Y, Hay J. Climate change and infectious disease: A prologue on multidisciplinary cooperation and predictive analytics. Front Public Health 2023;11:1018293. [PMID: 36741948 PMCID: PMC9895942 DOI: 10.3389/fpubh.2023.1018293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 01/02/2023] [Indexed: 01/22/2023] Open

Djokovic N, Rahnasto-Rilla M, Lougiakis N, Lahtela-Kakkonen M, Nikolic K. SIRT2i_Predictor: A Machine Learning-Based Tool to Facilitate the Discovery of Novel SIRT2 Inhibitors. Pharmaceuticals (Basel) 2023;16:127. [PMID: 36678624 PMCID: PMC9864763 DOI: 10.3390/ph16010127] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 01/10/2023] [Accepted: 01/11/2023] [Indexed: 01/17/2023] Open

López Barreiro D, Folch-Fortuny A, Muntz I, Thies JC, Sagt CM, Koenderink GH. Sequence Control of the Self-Assembly of Elastin-Like Polypeptides into Hydrogels with Bespoke Viscoelastic and Structural Properties. Biomacromolecules 2023;24:489-501. [PMID: 36516874 PMCID: PMC9832484 DOI: 10.1021/acs.biomac.2c01405] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Lerksuthirat T, Chitphuk S, Stitchantrakul W, Dejsuphong D, Malik AA, Nantasenamat C. PARP1pred: a web server for screening the bioactivity of inhibitors against DNA repair enzyme PARP-1. EXCLI Journal 2023;22:84-107. [PMID: 36814851 PMCID: PMC9939779 DOI: 10.17179/excli2022-5602] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 12/23/2022] [Indexed: 02/24/2023]

Wang CC, Hung YT, Chou CY, Hsuan SL, Chen ZW, Chang PY, Jan TR, Tung CW. Using random forest to predict antimicrobial minimum inhibitory concentrations of nontyphoidal Salmonella in Taiwan. Vet Res 2023;54:11. [PMID: 36747286 PMCID: PMC9903507 DOI: 10.1186/s13567-023-01141-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 01/13/2023] [Indexed: 02/08/2023] Open

Sobańska AW. Immobilized artificial membrane-chromatographic and computational descriptors in studies of soil-water partition of environmentally relevant compounds. Environ Sci Pollut Res Int 2023;30:6192-6200. [PMID: 35994147 PMCID: PMC9895004 DOI: 10.1007/s11356-022-22514-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2022] [Indexed: 05/27/2023]

Ogawa K, Sakamoto D, Hosoki R. Computer Science Technology in Natural Products Research: A Review of Its Applications and Implications. Chem Pharm Bull (Tokyo) 2023;71:486-494. [PMID: 37394596 DOI: 10.1248/cpb.c23-00039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]

Mou M, Pan Z, Lu M, Sun H, Wang Y, Luo Y, Zhu F. Application of Machine Learning in Spatial Proteomics. J Chem Inf Model 2022;62:5875-5895. [PMID: 36378082 DOI: 10.1021/acs.jcim.2c01161] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Coutinho MG, Câmara GB, Barbosa RDM, Fernandes MA. SARS-CoV-2 virus classification based on stacked sparse autoencoder. Comput Struct Biotechnol J 2022;21:284-298. [PMID: 36530948 PMCID: PMC9742810 DOI: 10.1016/j.csbj.2022.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 12/04/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022] Open

Baručić D, Kaushik S, Kybic J, Stanková J, Džubák P, Hajdúch M. Characterization of drug effects on cell cultures from phase-contrast microscopy images. Comput Biol Med 2022;151:106171. [PMID: 36306582 DOI: 10.1016/j.compbiomed.2022.106171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 08/30/2022] [Accepted: 10/01/2022] [Indexed: 12/27/2022]

Machine learning and structure-based modeling for the prediction of UDP-glucuronosyltransferase inhibition. iScience 2022;25:105290. [PMID: 36304105 PMCID: PMC9593791 DOI: 10.1016/j.isci.2022.105290] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 09/05/2022] [Accepted: 10/03/2022] [Indexed: 11/23/2022] Open