Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Rodríguez-Pérez R, Vogt M, Bajorath J. Support Vector Machine Classification and Regression Prioritize Different Structural Features for Binary Compound Activity and Potency Value Prediction. ACS Omega 2017;2:6371-6379. [PMID: 30023518 PMCID: PMC6045367 DOI: 10.1021/acsomega.7b01079] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Received: 07/27/2017] [Accepted: 09/22/2017] [Indexed: 05/15/2023]

Cited by Other Article(s)

Liu Q, He D, Fan M, Wang J, Cui Z, Wang H, Mi Y, Li N, Meng Q, Hou Y. Prediction and Interpretation Microglia Cytotoxicity by Machine Learning. J Chem Inf Model 2024. [PMID: 38949724 DOI: 10.1021/acs.jcim.4c00366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/02/2024]

Abstract

Ameliorating microglia-mediated neuroinflammation is a crucial strategy in developing new drugs for neurodegenerative diseases. Plant compounds are an important screening target for the discovery of drugs for the treatment of neurodegenerative diseases. However, due to the spatial complexity of phytochemicals, it is particularly important to evaluate the effectiveness of compounds while avoiding the inclusion of cytotoxic substances in the early stages of compound screening. Traditional high-throughput screening methods suffer from high cost and low efficiency. A computational model based on machine learning provides a novel avenue for cytotoxicity determination. In this study, a microglia cytotoxicity classifier was developed using a machine learning approach. First, we proposed a data splitting strategy based on the Murcko generic scaffold of each molecule. Under this condition, three machine learning approaches were coupled with three kinds of molecular representation methods to construct microglia cytotoxicity classifiers, which were then compared and assessed by predictive accuracy (ACC), balanced accuracy (BA), F1-score, and Matthews correlation coefficient (MCC). Then, a dimension reduction method, recursive feature elimination integrated with a support vector machine (RFE-SVC), was applied to high-dimensional molecular fingerprints to further improve model performance. Among all the microglia cytotoxicity classifiers, the SVM coupled with the ECFP4 fingerprint after feature selection (ECFP4-RFE-SVM) obtained the most accurate classification for the test set (ACC of 0.99, BA of 0.99, F1-score of 0.99, MCC of 0.97). Finally, the Shapley additive explanations (SHAP) method was used to interpret the microglia cytotoxicity classifier, and key substructure SMARTS were identified as structural alerts. Experimental results show that ECFP4-RFE-SVM has reliable classification capability for microglia cytotoxicity, and that SHAP can not only provide a rational explanation for microglia cytotoxicity predictions but also offer a guideline for subsequent molecular cytotoxicity modifications.
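The four scores reported in this abstract (ACC, BA, F1-score, MCC) all follow from the counts of a binary confusion matrix. The sketch below shows the standard formulas only; the function name and the example counts are hypothetical and are not taken from the study.

```python
import math

def classification_metrics(tp, fp, tn, fn):
    """Return ACC, BA, F1 and MCC from binary confusion-matrix counts."""
    total = tp + fp + tn + fn
    acc = (tp + tn) / total
    # Balanced accuracy is the mean of sensitivity and specificity.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    ba = (sensitivity + specificity) / 2
    # F1 is the harmonic mean of precision and recall.
    f1_denominator = 2 * tp + fp + fn
    f1 = 2 * tp / f1_denominator if f1_denominator else 0.0
    # MCC is defined as 0 here when any marginal count is zero.
    mcc_denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / mcc_denominator if mcc_denominator else 0.0
    return {"ACC": acc, "BA": ba, "F1": f1, "MCC": mcc}

# Hypothetical counts, for illustration only (not data from the study).
example = classification_metrics(tp=90, fp=5, tn=95, fn=10)
```

On imbalanced test sets, ACC alone can look strong while BA and MCC drop, which is presumably why the abstract reports all four.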

Affiliation(s)

Qing Liu College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Dakuo He College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Mengmeng Fan College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Jinpeng Wang College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Zeyu Cui College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Hao Wang College of Information Science and Engineering, State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, P. R. China
Yan Mi Key Laboratory of Bioresource Research and Development of Liaoning Province, College of Life and Health Sciences, National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Key Laboratory of Data Analytics and Optimization for Smart Industry, Ministry of Education, Northeastern University, Shenyang 110169, P. R. China
Ning Li School of Traditional Chinese Materia Medica, Key Laboratory for TCM Material Basis Study and Innovative Drug Development of Shenyang City, Shenyang Pharmaceutical University, Shenyang 110016, P. R. China
Qingqi Meng Key Laboratory of Bioresource Research and Development of Liaoning Province, College of Life and Health Sciences, National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Key Laboratory of Data Analytics and Optimization for Smart Industry, Ministry of Education, Northeastern University, Shenyang 110169, P. R. China
Yue Hou Key Laboratory of Bioresource Research and Development of Liaoning Province, College of Life and Health Sciences, National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Key Laboratory of Data Analytics and Optimization for Smart Industry, Ministry of Education, Northeastern University, Shenyang 110169, P. R. China

Julkaew S, Wongsirichot T, Damkliang K, Sangthawan P. Improving accuracy of vascular access quality classification in hemodialysis patients using deep learning with K highest score feature selection. J Int Med Res 2024;52:3000605241232519. [PMID: 38573764 PMCID: PMC10996358 DOI: 10.1177/03000605241232519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 01/26/2024] [Indexed: 04/05/2024] Open

Lai J, Chen Z, Liu J, Zhu C, Huang H, Yi Y, Cai G, Liao N. A radiogenomic multimodal and whole-transcriptome sequencing for preoperative prediction of axillary lymph node metastasis and drug therapeutic response in breast cancer: a retrospective, machine learning and international multicohort study. Int J Surg 2024;110:2162-2177. [PMID: 38215256 PMCID: PMC11019980 DOI: 10.1097/js9.0000000000001082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/02/2023] [Accepted: 12/27/2023] [Indexed: 01/14/2024]

Abstract

BACKGROUND

Axillary lymph node (ALN) status serves as a crucial prognostic indicator in breast cancer (BC). The aim of this study was to construct a radiogenomic multimodal model, based on machine learning and whole-transcriptome sequencing (WTS), to accurately evaluate the risk of ALN metastasis (ALNM) and drug therapeutic response, and to avoid unnecessary axillary surgery in BC patients.

METHODS

This study conducted a retrospective analysis of 1078 BC patients from The Cancer Genome Atlas (TCGA), The Cancer Imaging Archive (TCIA), and the Foshan cohort. These patients were divided into the TCIA cohort (N = 103), TCIA validation cohort (N = 51), Duke cohort (N = 138), Foshan cohort (N = 106), and TCGA cohort (N = 680). Radiological features were extracted from BC radiological images and differentially expressed gene expression was calibrated using technology. A support vector machine model was employed to screen radiological and genetic features, and a multimodal model was established based on radiogenomic and clinical pathological features to predict ALNM. The accuracy of the model predictions was assessed using the area under the curve (AUC), and the clinical benefit was measured using decision curve analysis. Risk stratification analysis of BC patients was performed by gene set enrichment analysis, differential comparison of immune checkpoint gene expression, and drug sensitivity testing.

RESULTS

For the prediction of ALNM, the rad-score was able to significantly differentiate between ALN- and ALN+ patients in both the Duke and Foshan cohorts (P < 0.05). Similarly, the gene-score was able to significantly differentiate between ALN- and ALN+ patients in the TCGA cohort (P < 0.05). The radiogenomic multimodal nomogram demonstrated satisfactory performance in the TCIA cohort (AUC 0.82, 95% CI: 0.74-0.91) and the TCIA validation cohort (AUC 0.77, 95% CI: 0.63-0.91). In the risk sub-stratification analysis, there were significant differences in gene pathway enrichment between high- and low-risk groups (P < 0.05). Additionally, different risk groups may exhibit varying treatment responses (P < 0.05).

CONCLUSION

Overall, the radiogenomic multimodal model employs multimodal data, including radiological images, genetic data, and clinicopathological typing. The radiogenomic multimodal nomogram can precisely predict ALNM and drug therapeutic response in BC patients.
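The AUC values quoted in this abstract can be read as the probability that a randomly chosen positive case receives a higher model score than a randomly chosen negative case. A minimal sketch of that pairwise definition follows; the function name and the toy labels and scores are hypothetical and unrelated to the study cohorts.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    Tied scores count as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative sample")
    correct = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(positives) * len(negatives))

# Toy example: 1 = metastasis present, 0 = absent (made-up values).
toy_auc = roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

The double loop is quadratic in the number of samples; it is written this way for readability, and rank-based formulas give the same value faster on large cohorts.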

Cabral L, Calabro FJ, Foran W, Parr AC, Ojha A, Rasmussen J, Ceschin R, Panigrahy A, Luna B. Multivariate and regional age-related change in basal ganglia iron in neonates. Cereb Cortex 2024;34:bhad456. [PMID: 38059685 DOI: 10.1093/cercor/bhad456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/05/2023] [Revised: 10/31/2023] [Accepted: 11/01/2023] [Indexed: 12/08/2023] Open

Ferdous J, Rahman ME, Sraboni FS, Dutta AK, Rahman MS, Ali MR, Sikdar B, Khan A, Hasan MF. Assessment of the hypoglycemic and anti-hemostasis effects of Paederia foetida (L.) in controlling diabetes and thrombophilia combining in vivo and computational analysis. Comput Biol Chem 2023;107:107954. [PMID: 37738820 DOI: 10.1016/j.compbiolchem.2023.107954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Revised: 09/03/2023] [Accepted: 09/05/2023] [Indexed: 09/24/2023]

Moreira-Filho JT, Neves BJ, Cajas RA, Moraes JD, Andrade CH. Artificial intelligence-guided approach for efficient virtual screening of hits against Schistosoma mansoni. Future Med Chem 2023;15:2033-2050. [PMID: 37937522 DOI: 10.4155/fmc-2023-0152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/25/2023] [Accepted: 10/06/2023] [Indexed: 11/09/2023] Open

Han R, Yoon H, Kim G, Lee H, Lee Y. Revolutionizing Medicinal Chemistry: The Application of Artificial Intelligence (AI) in Early Drug Discovery. Pharmaceuticals (Basel) 2023;16:1259. [PMID: 37765069 PMCID: PMC10537003 DOI: 10.3390/ph16091259] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/27/2023] [Revised: 08/24/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open

Houssein EH, Hassan HN, Samee NA, Jamjoom MM. A Novel Hybrid Runge Kutta Optimizer with Support Vector Machine on Gene Expression Data for Cancer Classification. Diagnostics (Basel) 2023;13:diagnostics13091621. [PMID: 37175012 PMCID: PMC10178557 DOI: 10.3390/diagnostics13091621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/08/2023] [Revised: 03/05/2023] [Accepted: 04/18/2023] [Indexed: 05/15/2023] Open

Dutschmann TM, Kinzel L, Ter Laak A, Baumann K. Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation. J Cheminform 2023;15:49. [PMID: 37118768 PMCID: PMC10142532 DOI: 10.1186/s13321-023-00709-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 09/03/2022] [Accepted: 03/10/2023] [Indexed: 04/30/2023] Open

Chadha A, Dara R, Pearl DL, Sharif S, Poljak Z. Predictive analysis for pathogenicity classification of H5Nx avian influenza strains using machine learning techniques. Prev Vet Med 2023;216:105924. [PMID: 37224663 DOI: 10.1016/j.prevetmed.2023.105924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/25/2022] [Revised: 03/17/2023] [Accepted: 04/21/2023] [Indexed: 05/26/2023]

Abstract

Over the past decades, avian influenza (AI) outbreaks have been reported across different parts of the globe, resulting in large-scale economic and livestock loss and, in some cases, raising concerns about their zoonotic potential. The virulence and pathogenicity of H5Nx (e.g., H5N1, H5N2) AI strains for poultry can be inferred through various approaches, and this has frequently been done by detecting certain pathogenicity markers in their haemagglutinin (HA) gene. The utilization of predictive modeling methods represents a possible approach to exploring this genotypic-phenotypic relationship for assisting experts in determining the pathogenicity of circulating AI viruses. Therefore, the main objective of this study was to evaluate the predictive performance of different machine learning (ML) techniques for in-silico prediction of pathogenicity of H5Nx viruses in poultry, using complete genetic sequences of the HA gene. We annotated 2137 H5Nx HA gene sequences based on the presence of the polybasic HA cleavage site (HACS), with 46.33% and 53.67% of sequences previously identified as highly pathogenic (HP) and low pathogenic (LP), respectively. We compared the performance of different ML classifiers (e.g., logistic regression (LR) with lasso and ridge regularization, random forest (RF), K-nearest neighbor (KNN), Naïve Bayes (NB), support vector machine (SVM), and convolutional neural network (CNN)) for pathogenicity classification of raw H5Nx nucleotide and protein sequences using a 10-fold cross-validation technique. We found that different ML techniques can be successfully used for the pathogenicity classification of H5 sequences with ∼99% classification accuracy. Our results indicate that for pathogenicity classification of (1) aligned deoxyribonucleic acid (DNA) and protein sequences, the NB classifier had the lowest accuracies of 98.41% (±0.89) and 98.31% (±1.06), respectively; (2) aligned DNA and protein sequences, the LR (L1/L2), KNN, SVM (radial basis function (RBF)) and CNN classifiers had the highest accuracies of 99.20% (±0.54) and 99.20% (±0.38), respectively; (3) unaligned DNA and protein sequences, CNNs achieved accuracies of 98.54% (±0.68) and 99.20% (±0.50), respectively. ML methods show potential for regular classification of H5Nx virus pathogenicity for poultry species, particularly when sequences containing regular markers were frequently present in the training dataset.
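The accuracies in this abstract come from 10-fold cross-validation and are reported as a central value with a spread. The sketch below illustrates the general procedure: deal the samples into k folds, score each held-out fold, then summarize. All names are hypothetical, the evaluator is a placeholder, and using the sample standard deviation for the spread is an assumption, since the abstract does not say what its parenthesized values represent.

```python
import random
import statistics

def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and deal them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_validate(n_samples, evaluate_fold, k=10, seed=0):
    """Score each fold with evaluate_fold(train_idx, test_idx).

    Returns the mean and sample standard deviation of the k fold scores.
    """
    folds = k_fold_indices(n_samples, k, seed)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except the held-out one forms the training set.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        scores.append(evaluate_fold(train_idx, test_idx))
    return statistics.mean(scores), statistics.stdev(scores)

def dummy_evaluate(train_idx, test_idx):
    # Placeholder: a real evaluator would fit a classifier on train_idx
    # and return its accuracy on test_idx.
    return len(test_idx) / (len(train_idx) + len(test_idx))

# 2137 matches the number of sequences mentioned in the abstract.
folds = k_fold_indices(2137, k=10)
mean_score, sd_score = cross_validate(2137, dummy_evaluate, k=10)
```

A stratified variant, which keeps the HP/LP ratio roughly constant in every fold, is a common refinement when classes are unevenly represented.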

Astray G, Soria-Lopez A, Barreiro E, Mejuto JC, Cid-Samamed A. Machine Learning to Predict the Adsorption Capacity of Microplastics. Nanomaterials (Basel, Switzerland) 2023;13:1061. [PMID: 36985954 PMCID: PMC10051191 DOI: 10.3390/nano13061061] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 02/14/2023] [Revised: 03/10/2023] [Accepted: 03/11/2023] [Indexed: 06/18/2023]

Kovačević S, Banjac MK, Podunavac-Kuzmanović S, Ajduković J, Salaković B, Rárová L, Đorđević M, Ivanov M. Local QSAR modeling of cytotoxic activity of newly designed androstane 3-oximes towards malignant melanoma cells. J Mol Struct 2023. [DOI: 10.1016/j.molstruc.2023.135272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/06/2023]

Shi M, Huang Z, Xiao G, Xu B, Ren Q, Zhao H. Estimating the Depth of Anesthesia from EEG Signals Based on a Deep Residual Shrinkage Network. Sensors (Basel, Switzerland) 2023;23:1008. [PMID: 36679805 PMCID: PMC9865536 DOI: 10.3390/s23021008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2022] [Revised: 01/11/2023] [Accepted: 01/12/2023] [Indexed: 06/17/2023]

Machine Learning Models to Predict Protein-Protein Interaction Inhibitors. Molecules (Basel, Switzerland) 2022;27:molecules27227986. [PMID: 36432086 PMCID: PMC9694076 DOI: 10.3390/molecules27227986] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/20/2022] [Revised: 11/09/2022] [Accepted: 11/16/2022] [Indexed: 11/19/2022]

Houssein EH, Hosney ME, Mohamed WM, Ali AA, Younis EMG. Fuzzy-based hunger games search algorithm for global optimization and feature selection using medical data. Neural Comput Appl 2022;35:5251-5275. [PMID: 36340595 PMCID: PMC9628476 DOI: 10.1007/s00521-022-07916-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/10/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022]

Deep Transfer Learning for Question Classification Based on Semantic Information Features of Category Labels. Computational Intelligence and Neuroscience 2022;2022:7178818. [PMID: 36211009 PMCID: PMC9546665 DOI: 10.1155/2022/7178818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/17/2021] [Revised: 08/29/2022] [Accepted: 09/06/2022] [Indexed: 11/25/2022]

Zhang X, Bai Y, Ngando FJ, Qu H, Shang Y, Ren L, Guo Y. Predicting the Weathering Time by the Empty Puparium of Sarcophaga peregrina (Diptera: Sarcophagidae) with the ANN Models. Insects 2022;13:insects13090808. [PMID: 36135509 PMCID: PMC9502838 DOI: 10.3390/insects13090808] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 07/13/2022] [Revised: 08/20/2022] [Accepted: 08/30/2022] [Indexed: 06/01/2023]

Asahara R, Miyao T. Extended Connectivity Fingerprints as a Chemical Reaction Representation for Enantioselective Organophosphorus-Catalyzed Asymmetric Reaction Prediction. ACS Omega 2022;7:26952-26964. [PMID: 35936487 PMCID: PMC9352214 DOI: 10.1021/acsomega.2c03812] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 06/18/2022] [Accepted: 07/07/2022] [Indexed: 06/15/2023]

Ikemoto K, Akiyoshi M, Mio T, Nishioka K, Sato S, Isobe H. Synthesis of a Negatively Curved Nanocarbon Molecule with an Octagonal Omphalos via Design-of-Experiments Optimizations Supplemented by Machine Learning. Angew Chem Int Ed Engl 2022;61:e202204035. [PMID: 35603558 DOI: 10.1002/anie.202204035] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 03/17/2022] [Indexed: 12/16/2022]

Ikemoto K, Akiyoshi M, Mio T, Nishioka K, Sato S, Isobe H. Synthesis of a Negatively Curved Nanocarbon Molecule with an Octagonal Omphalos via Design-of-Experiments Optimizations Supplemented by Machine Learning. Angew Chem Int Ed Engl 2022. [DOI: 10.1002/ange.202204035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/11/2022]

Cho BH, Kim YH, Lee KB, Hong YK, Kim KC. Potential of Snapshot-Type Hyperspectral Imagery Using Support Vector Classifier for the Classification of Tomatoes Maturity. Sensors 2022;22:s22124378. [PMID: 35746159 PMCID: PMC9227650 DOI: 10.3390/s22124378] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 05/13/2022] [Revised: 06/07/2022] [Accepted: 06/07/2022] [Indexed: 02/01/2023]

Packwood D, Nguyen LTH, Cesana P, Zhang G, Staykov A, Fukumoto Y, Nguyen DH. Machine Learning in Materials Chemistry: An Invitation. Machine Learning with Applications 2022. [DOI: 10.1016/j.mlwa.2022.100265] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 01/05/2023] Open

Rodríguez-Pérez R, Miljković F, Bajorath J. Machine Learning in Chemoinformatics and Medicinal Chemistry. Annu Rev Biomed Data Sci 2022;5:43-65. [PMID: 35440144 DOI: 10.1146/annurev-biodatasci-122120-124216] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 11/09/2022]

Rodríguez-Pérez R, Bajorath J. Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery. J Comput Aided Mol Des 2022;36:355-362. [PMID: 35304657 PMCID: PMC9325859 DOI: 10.1007/s10822-022-00442-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Received: 02/08/2022] [Accepted: 02/15/2022] [Indexed: 11/05/2022]

Comparison of Rainfall-Runoff Simulation between Support Vector Regression and HEC-HMS for a Rural Watershed in Taiwan. Water 2022. [DOI: 10.3390/w14020191] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Indexed: 11/16/2022]

Rodríguez-Pérez R, Bajorath J. Explainable Machine Learning for Property Predictions in Compound Optimization. J Med Chem 2021;64:17744-17752. [PMID: 34902252 DOI: 10.1021/acs.jmedchem.1c01789] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Indexed: 11/28/2022]

Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines. Arabian Journal for Science and Engineering 2021. [DOI: 10.1007/s13369-021-06102-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 12/01/2022]

Pourashraf T, Shokri S, Yousefi M, Ahmadi A, Azar PA. Implementing Machine Learning in Laboratory Synthesis by Hybrid of SVR Model and Optimization Algorithms. Advanced Theory and Simulations 2021. [DOI: 10.1002/adts.202100225] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 12/21/2022]

Jesus B, Cassani R, McGeown WJ, Cecchi M, Fadem KC, Falk TH. Multimodal Prediction of Alzheimer's Disease Severity Level Based on Resting-State EEG and Structural MRI. Front Hum Neurosci 2021;15:700627. [PMID: 34566600 PMCID: PMC8458963 DOI: 10.3389/fnhum.2021.700627] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/26/2021] [Accepted: 08/05/2021] [Indexed: 11/13/2022] Open

Harnessing artificial intelligence for the next generation of 3D printed medicines. Adv Drug Deliv Rev 2021;175:113805. [PMID: 34019957 DOI: 10.1016/j.addr.2021.05.015] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Received: 02/19/2021] [Revised: 05/02/2021] [Accepted: 05/13/2021] [Indexed: 02/06/2023]

Modeling and Predicting the Cell Migration Properties from Scratch Wound Healing Assay on Cisplatin-Resistant Ovarian Cancer Cell Lines Using Artificial Neural Network. Healthcare (Basel) 2021;9:healthcare9070911. [PMID: 34356289 PMCID: PMC8305856 DOI: 10.3390/healthcare9070911] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 06/04/2021] [Revised: 07/14/2021] [Accepted: 07/14/2021] [Indexed: 01/04/2023] Open

Mekni N, Coronnello C, Langer T, Rosa MD, Perricone U. Support Vector Machine as a Supervised Learning for the Prioritization of Novel Potential SARS-CoV-2 Main Protease Inhibitors. Int J Mol Sci 2021;22:7714. [PMID: 34299333 PMCID: PMC8305792 DOI: 10.3390/ijms22147714] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 07/02/2021] [Revised: 07/14/2021] [Accepted: 07/15/2021] [Indexed: 12/04/2022] Open

Cordero JA, He K, Janya K, Echigo S, Itoh S. Predicting formation of haloacetic acids by chlorination of organic compounds using machine-learning-assisted quantitative structure-activity relationships. Journal of Hazardous Materials 2021;408:124466. [PMID: 33191030 DOI: 10.1016/j.jhazmat.2020.124466] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 07/24/2020] [Revised: 10/30/2020] [Accepted: 10/31/2020] [Indexed: 06/11/2023]

Evaluation of multi-target deep neural network models for compound potency prediction under increasingly challenging test conditions. J Comput Aided Mol Des 2021;35:285-295. [PMID: 33598870 PMCID: PMC7982389 DOI: 10.1007/s10822-021-00376-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/10/2020] [Accepted: 02/03/2021] [Indexed: 11/25/2022]

Galati S, Yonchev D, Rodríguez-Pérez R, Vogt M, Tuccinardi T, Bajorath J. Predicting Isoform-Selective Carbonic Anhydrase Inhibitors via Machine Learning and Rationalizing Structural Features Important for Selectivity. ACS Omega 2021;6:4080-4089. [PMID: 33585783 PMCID: PMC7876851 DOI: 10.1021/acsomega.0c06153] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 12/17/2020] [Accepted: 01/14/2021] [Indexed: 05/03/2023]

Shibayama S, Funatsu K. Industrial Case Study: Identification of Important Substructures and Exploration of Monomers for the Rapid Design of Novel Network Polymers with Distributed Representation. Bulletin of the Chemical Society of Japan 2021. [DOI: 10.1246/bcsj.20200220] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 11/12/2022]

Houssein EH, Hosney ME, Elhoseny M, Oliva D, Mohamed WM, Hassaballah M. Hybrid Harris hawks optimization with cuckoo search for drug design and discovery in chemoinformatics. Sci Rep 2020;10:14439. [PMID: 32879410 PMCID: PMC7468137 DOI: 10.1038/s41598-020-71502-z] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Received: 12/27/2019] [Accepted: 07/23/2020] [Indexed: 11/09/2022] Open

Abstract

One of the major drawbacks of cheminformatics is the large amount of information present in the datasets. In the majority of cases, this information contains redundant instances that affect the analysis of similarity measurements with respect to drug design and discovery. Therefore, using classical methods such as the protein bank database and quantum mechanical calculations is insufficient owing to the dimensionality of search spaces. In this paper, we introduce a hybrid metaheuristic algorithm called CHHO-CS, which combines the Harris hawks optimizer (HHO) with two operators: cuckoo search (CS) and chaotic maps. The role of CS is to control the main position vectors of the HHO algorithm to maintain the balance between exploitation and exploration phases, while the chaotic maps are used to update the control energy parameters to avoid falling into local optima and premature convergence. Feature selection (FS) is a tool that reduces the dimensionality of the dataset by removing redundant and undesired information, which makes FS very helpful in cheminformatics. FS methods employ a classifier to identify the best subset of features. Support vector machines (SVMs) are then used by the proposed CHHO-CS as an objective function for the classification process in FS. The CHHO-CS-SVM is tested in the selection of appropriate chemical descriptors and compound activities. Various datasets are used to validate the efficiency of the proposed CHHO-CS-SVM approach, including ten from the UCI machine learning repository. Additionally, two chemical datasets (i.e., quantitative structure-activity relation biodegradation and monoamine oxidase) were utilized for selecting the most significant chemical descriptors and chemical compound activities. The extensive experimental and statistical analyses show that the suggested CHHO-CS method achieved much better trade-off solutions than the competitor algorithms reported in the literature, including HHO, CS, particle swarm optimization, moth-flame optimization, grey wolf optimizer, salp swarm algorithm, and sine-cosine algorithm. The experimental results indicate that the complexity associated with cheminformatics can be handled using chaotic maps and hybridizing the meta-heuristic methods.

Interpretation of machine learning models using Shapley values: application to compound potency and multi-target activity predictions. J Comput Aided Mol Des 2020;34:1013-1026. [PMID: 32361862 PMCID: PMC7449951 DOI: 10.1007/s10822-020-00314-0] [Citation(s) in RCA: 146] [Impact Index Per Article: 36.5] [Received: 03/06/2020] [Accepted: 04/24/2020] [Indexed: 02/07/2023]

Houssein EH, Hosney ME, Oliva D, Mohamed WM, Hassaballah M. A novel hybrid Harris hawks optimization and support vector machines for drug design and discovery. Comput Chem Eng 2020. [DOI: 10.1016/j.compchemeng.2019.106656] [Citation(s) in RCA: 124] [Impact Index Per Article: 31.0] [Indexed: 02/04/2023]

Luchi A, Villafañe RN, Gómez Chávez JL, Bogado ML, Angelina EL, Peruchena NM. Combining Charge Density Analysis with Machine Learning Tools To Investigate the Cruzain Inhibition Mechanism. ACS Omega 2019;4:19582-19594. [PMID: 31788588 PMCID: PMC6881835 DOI: 10.1021/acsomega.9b01934] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 06/27/2019] [Accepted: 10/18/2019] [Indexed: 05/28/2023]

Abstract

Trypanosoma cruzi, a flagellate protozoan parasite, is responsible for Chagas disease. The parasite's major cysteine protease, cruzain (Cz), plays a vital role at every stage of its life cycle, and the active-site region of the enzyme, similar to those of other members of the papain superfamily, is well characterized. Taking advantage of structural information available in public databases about Cz bound to known covalent inhibitors, along with their corresponding activity annotations, we performed in this work a deep analysis of the molecular interactions at the Cz binding cleft in order to investigate the enzyme inhibition mechanism. Our toolbox for this study consisted of charge density topological analysis of the complexes to extract the molecular interactions and machine learning classification models to relate the interactions to biological activity. More precisely, this combination was useful for classifying molecular interactions as "active-like" or "inactive-like" according to whether they are prevalent in the most active or the less active complexes, respectively. Further analysis of the interactions with unsupervised learning tools also helped to clarify how these interactions act together to trigger the enzyme into a particular conformational state. The most active inhibitors induce conformational changes within the enzyme that lead to an overall better fit of the inhibitor into the binding cleft. Curiously, some of these conformational changes can be considered a hallmark of the substrate recognition event, which means that the most active inhibitors are likely recognized by the enzyme as if they were its own substrate, so that the catalytic machinery is arranged as if it were about to break the substrate scissile bond. Overall, these results contribute to a better understanding of the enzyme inhibition mechanism. Moreover, the information about the main interactions extracted in this work is already being used in our lab to guide docking solutions in ongoing prospective virtual screening campaigns to search for novel noncovalent cruzain inhibitors.

Chemogenomic Analysis of the Druggable Kinome and Its Application to Repositioning and Lead Identification Studies. Cell Chem Biol 2019;26:1608-1622.e6. [DOI: 10.1016/j.chembiol.2019.08.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 03/13/2019] [Revised: 07/18/2019] [Accepted: 08/21/2019] [Indexed: 02/06/2023]

Rodríguez-Pérez R, Bajorath J. Interpretation of Compound Activity Predictions from Complex Machine Learning Models Using Local Approximations and Shapley Values. J Med Chem 2019;63:8761-8777. [PMID: 31512867 DOI: 10.1021/acs.jmedchem.9b01101] [Citation(s) in RCA: 137] [Impact Index Per Article: 27.4] [Indexed: 01/27/2023]

Jabeen A, Ranganathan S. Applications of machine learning in GPCR bioactive ligand discovery. Curr Opin Struct Biol 2019;55:66-76. [PMID: 31005679 DOI: 10.1016/j.sbi.2019.03.022] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Received: 02/05/2019] [Revised: 03/14/2019] [Accepted: 03/14/2019] [Indexed: 12/17/2022]

Zheng S, Wang Y, Liu H, Chang W, Xu Y, Lin F. Prediction of Hemolytic Toxicity for Saponins by Machine-Learning Methods. Chem Res Toxicol 2019;32:1014-1026. [DOI: 10.1021/acs.chemrestox.8b00347] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 11/29/2022]

Zheng S, Chang W, Xu W, Xu Y, Lin F. e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness. Front Chem 2019;7:35. [PMID: 30761295 PMCID: PMC6363693 DOI: 10.3389/fchem.2019.00035] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 08/22/2018] [Accepted: 01/14/2019] [Indexed: 11/23/2022]

Abstract

Artificial sweeteners (AS) can elicit a strong sweet sensation with low or zero calories and are widely used to replace nutritive sugar in the food and beverage industry. However, the safety of current AS is still controversial. Thus, it is imperative to develop safer and more potent AS. Because experimental screening of AS is costly and laborious, in silico sweetener/sweetness prediction could provide a good avenue to identify potential sweetener candidates before experiment. In this work, we curate the largest dataset of 530 sweeteners and 850 non-sweeteners, and collect the second largest dataset of 352 sweeteners with relative sweetness (RS) from the literature. Using these experimental datasets, we adopt five machine-learning methods and conformation-independent molecular fingerprints to derive classification and regression models for the prediction of sweeteners and their RS, respectively, via a consensus strategy. Our best classification model achieves 95% confidence intervals for the accuracy (0.91 ± 0.01), precision (0.90 ± 0.01), specificity (0.94 ± 0.01), sensitivity (0.86 ± 0.01), F1-score (0.88 ± 0.01), and NER (non-error rate: 0.90 ± 0.01) on the test set, which outperforms the model of Rojas et al. (NER = 0.85) in terms of NER. Our best regression model gives 95% confidence intervals for R²(test set) and ΔR² [referring to |R²(test set) - R²(cross-validation)|] of 0.77 ± 0.01 and 0.03 ± 0.01, respectively, which is also better than other works based on conformation-independent 2D descriptors (e.g., 2D Dragon) according to R²(test set) and ΔR². Our models are obtained by averaging over nineteen data-splitting schemes and fully comply with the guidelines of the Organisation for Economic Co-operation and Development (OECD), which are not completely followed by the previous relevant works, all of which are based on only one random data-splitting scheme for the cross-validation set and test set. Finally, we develop a user-friendly platform, “e-Sweet”, for the automatic prediction of sweeteners and their corresponding RS. To the best of our knowledge, it is the first free platform that enables experimental food scientists to exploit current machine-learning methods to boost the discovery of more AS with low or zero calorie content.
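The evaluation metrics named in this abstract can all be derived from a binary confusion matrix, and ΔR² is a simple absolute difference. The sketch below uses standard textbook definitions, with NER taken as the mean of sensitivity and specificity. The function names and the confusion-matrix counts are hypothetical, chosen only so that the sensitivity and specificity resemble the reported point estimates; they are not the paper's data.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard binary classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    ner = (sensitivity + specificity) / 2  # non-error rate (balanced accuracy)
    return {
        "accuracy": accuracy,
        "precision": precision,
        "specificity": specificity,
        "sensitivity": sensitivity,
        "f1": f1,
        "ner": ner,
    }


def delta_r2(r2_test, r2_cv):
    """Absolute gap between test-set R^2 and cross-validation R^2."""
    return abs(r2_test - r2_cv)


# Hypothetical counts for illustration only (not taken from the paper).
metrics = classification_metrics(tp=86, fp=6, tn=94, fn=14)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Under this definition, a sensitivity of 0.86 and a specificity of 0.94 average to an NER of 0.90, which is consistent with the values quoted in the abstract. A small ΔR² indicates that test-set performance stays close to cross-validation performance.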

Zheng S, Chang W, Liu W, Liang G, Xu Y, Lin F. Computational Prediction of a New ADMET Endpoint for Small Molecules: Anticommensal Effect on Human Gut Microbiota. J Chem Inf Model 2018;59:1215-1220. [PMID: 30352151 DOI: 10.1021/acs.jcim.8b00600] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 12/17/2022]

Zheng S, Jiang M, Zhao C, Zhu R, Hu Z, Xu Y, Lin F. e-Bitter: Bitterant Prediction by the Consensus Voting From the Machine-Learning Methods. Front Chem 2018;6:82. [PMID: 29651416 PMCID: PMC5885771 DOI: 10.3389/fchem.2018.00082] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Received: 12/17/2017] [Accepted: 03/12/2018] [Indexed: 11/25/2022]