1
|
Feng C, Qiao C, Ji W, Pang H, Wang L, Feng Q, Ge Y, Rui M. In silico screening and in vivo experimental validation of 15-PGDH inhibitors from traditional Chinese medicine promoting liver regeneration. Int J Biol Macromol 2024; 274:133263. [PMID: 38901515 DOI: 10.1016/j.ijbiomac.2024.133263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 05/25/2024] [Accepted: 06/17/2024] [Indexed: 06/22/2024]
Abstract
The enzyme 15-hydroxyprostaglandin dehydrogenase (15-PGDH), which acts as a negative regulator of prostaglandin E2 (PGE2) levels and activity, represents a promising pharmacological target for promoting liver regeneration. In this study, we collected data on 15-PGDH homologous family proteins, their inhibitors, and traditional Chinese medicine (TCM) compounds. Leveraging machine learning and molecular docking techniques, we constructed a prediction model for virtual screening of 15-PGDH inhibitors from TCM compound library and successfully screened genistein as a potential 15-PGDH inhibitor. Through further validation, it was discovered that genistein considerably enhances liver regeneration by inhibiting 15-PGDH, resulting in a significant increase in the PGE2 level. Genistein's effectiveness suggests its potential as a novel therapeutic agent for liver diseases, highlighting this study's contribution to expanding the clinical applications of TCM.
Collapse
Affiliation(s)
- Chunlai Feng
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Chunxue Qiao
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Wei Ji
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Hui Pang
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Li Wang
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Qiuqi Feng
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Yingying Ge
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China
| | - Mengjie Rui
- Department of Pharmaceutics, School of Pharmacy, Jiangsu University, Zhenjiang, PR China.
| |
Collapse
|
2
|
Matúška J, Bucinsky L, Gall M, Pitoňák M, Štekláč M. SchNetPack Hyperparameter Optimization for a More Reliable Top Docking Scores Prediction. J Phys Chem B 2024; 128:4943-4951. [PMID: 38733335 DOI: 10.1021/acs.jpcb.4c00296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2024]
Abstract
Options to improve the extrapolation power of the neural network designed using the SchNetPack package with respect to top docking scores prediction are presented. It is shown that hyperparameter tuning of the atomistic model representation (in the schnetpack.representation) improves the prediction of the top scoring compounds, which have characteristically a low incidence in randomized data sets for training of machine learning models. The prediction robustness is evaluated according to the mean square error (MSE) and the entropy of the average loss landscape decrease. Admittedly, the improvement of the top scoring compounds' prediction accuracy comes with the penalty of worsening the overall prediction power. It is revealed that the most impactful hyperparameter is the cutoff (5 Å is reported as the optimal choice). Other parameters (e.g., number of radial basis functions, number of interaction layers of the neural network, feature vector size or its batch size) are found to not affect the prediction robustness of the top scoring compounds in any comparable way relative to the cutoff. The MSE of the best docking score prediction (below -13 kcal/mol) improves from ca. 3.5 to 0.9 kcal/mol, while the prediction of less potent compounds (-13 to -11 kcal/mol) shows a lesser improvement, i.e., a decrease of MSE from 1.6 to 1.3 kcal/mol. Additionally, oversampling and undersampling of the training set with respect to the top scoring compounds' abundance is presented. The results indicate that the cutoff choice performs better than over- or undersampling of the training set, with undersampling performing better than oversampling.
Collapse
Affiliation(s)
- Ján Matúška
- Institute of Physical Chemistry and Chemical Physics, Faculty of Chemical and Food Technology, Slovak University of Technology in Bratislava, Radlinského 9, SK-81237 Bratislava, Slovak Republic
| | - Lukas Bucinsky
- Institute of Physical Chemistry and Chemical Physics, Faculty of Chemical and Food Technology, Slovak University of Technology in Bratislava, Radlinského 9, SK-81237 Bratislava, Slovak Republic
| | - Marián Gall
- Institute of Information Engineering, Automation and Mathematics, Faculty of Chemical and Food Technology, Slovak University of Technology in Bratislava, Radlinského 9, SK-81237 Bratislava, Slovak Republic
- National SuperComputing Center, Dúbravská cesta č. 9, SK-84104 Bratislava, Slovak Republic
| | - Michal Pitoňák
- National SuperComputing Center, Dúbravská cesta č. 9, SK-84104 Bratislava, Slovak Republic
- Department of Physical and Theoretical Chemistry, Faculty of Natural Sciences, Comenius University in Bratislava, Mlynská dolina Ilkovičova 6, SK-84215 Bratislava, Slovak Republic
| | - Marek Štekláč
- Institute of Physical Chemistry and Chemical Physics, Faculty of Chemical and Food Technology, Slovak University of Technology in Bratislava, Radlinského 9, SK-81237 Bratislava, Slovak Republic
- Computing Centre, Centre of Operations of the Slovak Academy of Sciences, Dúbravská cesta č. 9, SK-84535 Bratislava, Slovak Republic
| |
Collapse
|
3
|
Khylyuk D. Protein-Protein Docking Approach to GPCR Oligomerization. Methods Mol Biol 2024; 2780:281-287. [PMID: 38987473 DOI: 10.1007/978-1-0716-3985-6_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
G-protein-coupled receptors (GPCRs), the largest family of human membrane proteins, play a crucial role in cellular control and are the target of approximately one-third of all drugs on the market. Targeting these complexes with selectivity or formulating small molecules capable of modulating receptor-receptor interactions could potentially offer novel avenues for drug discovery, fostering the development of more refined and safer pharmacotherapies. Due to the lack of experimentally derived X-ray crystallography spectra of GPCR oligomers, there is growing evidence supporting the development of new in silico approaches for predicting GPCR self-assembling structures. The significance of GPCR oligomerization, the challenges in modeling these structures, and the potential of protein-protein docking algorithms to address these challenges are discussed. The study also underscores the use of various software solutions for modeling GPCR oligomeric structures and presents practical cases where these techniques have been successfully applied.
Collapse
Affiliation(s)
- Dmytro Khylyuk
- Chair and Department of Organic Chemistry , Medical University of Lublin, Lublin, Poland.
| |
Collapse
|
4
|
Feng H, Wang F, Li N, Xu Q, Zheng G, Sun X, Hu M, Li X, Xing G, Zhang G. Use of tree-based machine learning methods to screen affinitive peptides based on docking data. Mol Inform 2023; 42:e202300143. [PMID: 37696773 DOI: 10.1002/minf.202300143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 09/03/2023] [Accepted: 09/11/2023] [Indexed: 09/13/2023]
Abstract
Screening peptides with good affinity is an important step in peptide-drug discovery. Recent advancement in computer and data science have made machine learning a useful tool in accurately affinitive-peptide screening. In current study, four different tree-based algorithms, including Classification and regression trees (CART), C5.0 decision tree (C50), Bagged CART (BAG) and Random Forest (RF), were employed to explore the relationship between experimental peptide affinities and virtual docking data, and the performance of each model was also compared in parallel. All four algorithms showed better performances on dataset pre-scaled, -centered and -PCA than other pre-processed dataset. After model re-built and hyperparameter optimization, the optimal C50 model (C50O) showed the best performances in terms of Accuracy, Kappa, Sensitivity, Specificity, F1, MCC and AUC when validated on test data and an unknown PEDV datasets evaluation (Accuracy=80.4 %). BAG and RFO (the optimal RF), as two best models during training process, did not performed as expecting during in testing and unknown dataset validations. Furthermore, the high correlation of the predictions of RFO and BAG to C50O implied the high stability and robustness of their prediction. Whereas although the good performance on unknown dataset, the poor performance in test data validation and correlation analysis indicated CARTO could not be used for future data prediction. To accurately evaluate the peptide affinity, the current study firstly gave a tree-model competition on affinitive peptide prediction by using virtual docking data, which would expand the application of machine learning algorithms in studying PepPIs and benefit the development of peptide therapeutics.
Collapse
Affiliation(s)
- Hua Feng
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Fangyu Wang
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Ning Li
- College of Food Science and Technology, Henan Agricultural University, Zhengzhou, China
| | - Qian Xu
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Guanming Zheng
- Public Health and Preventive Medicine Teaching and Research Center, Henan University of Chinese Medicine, Zhengzhou, Henan, China
| | - Xuefeng Sun
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Man Hu
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Xuewu Li
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Guangxu Xing
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
| | - Gaiping Zhang
- Henan Key Laboratory of Animal Immunology, Henan Academy of Agricultural Sciences, Zhengzhou, China
- Longhu Modern Immunology Laboratory, Zhengzhou, China
- School of Advanced Agricultural sciences, Peking University, Beijing, China
- Jiangsu Co-Innovation Center for the Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Yangzhou University, Yangzhou, Jiangsu, China
| |
Collapse
|
5
|
Cheng Z, Bhave M, Hwang SS, Rahman T, Chee XW. Identification of Potential p38γ Inhibitors via In Silico Screening, In Vitro Bioassay and Molecular Dynamics Simulation Studies. Int J Mol Sci 2023; 24:ijms24087360. [PMID: 37108523 PMCID: PMC10139033 DOI: 10.3390/ijms24087360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 04/06/2023] [Accepted: 04/13/2023] [Indexed: 04/29/2023] Open
Abstract
Protein kinase p38γ is an attractive target against cancer because it plays a pivotal role in cancer cell proliferation by phosphorylating the retinoblastoma tumour suppressor protein. Therefore, inhibition of p38γ with active small molecules represents an attractive alternative for developing anti-cancer drugs. In this work, we present a rigorous and systematic virtual screening framework to identify potential p38γ inhibitors against cancer. We combined the use of machine learning-based quantitative structure activity relationship modelling with conventional computer-aided drug discovery techniques, namely molecular docking and ligand-based methods, to identify potential p38γ inhibitors. The hit compounds were filtered using negative design techniques and then assessed for their binding stability with p38γ through molecular dynamics simulations. To this end, we identified a promising compound that inhibits p38γ activity at nanomolar concentrations and hepatocellular carcinoma cell growth in vitro in the low micromolar range. This hit compound could serve as a potential scaffold for further development of a potent p38γ inhibitor against cancer.
Collapse
Affiliation(s)
- Zixuan Cheng
- School of Engineering and Science, Swinburne University of Technology Sarawak, Kuching 93350, Malaysia
| | - Mrinal Bhave
- Department of Chemistry and Biotechnology, Swinburne University of Technology, Melbourne, VIC 3122, Australia
| | - Siaw San Hwang
- School of Engineering and Science, Swinburne University of Technology Sarawak, Kuching 93350, Malaysia
| | - Taufiq Rahman
- Department of Pharmacology, University of Cambridge, Cambridge CB2 1PD, UK
| | - Xavier Wezen Chee
- School of Engineering and Science, Swinburne University of Technology Sarawak, Kuching 93350, Malaysia
| |
Collapse
|
6
|
Lin Y, Zhang Y, Wang D, Yang B, Shen YQ. Computer especially AI-assisted drug virtual screening and design in traditional Chinese medicine. PHYTOMEDICINE : INTERNATIONAL JOURNAL OF PHYTOTHERAPY AND PHYTOPHARMACOLOGY 2022; 107:154481. [PMID: 36215788 DOI: 10.1016/j.phymed.2022.154481] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 09/14/2022] [Accepted: 09/27/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND Traditional Chinese medicine (TCM), as a significant part of the global pharmaceutical science, the abundant molecular compounds it contains is a valuable potential source of designing and screening new drugs. However, due to the un-estimated quantity of the natural molecular compounds and diversity of the related problems drug discovery such as precise screening of molecular compounds or the evaluation of efficacy, physicochemical properties and pharmacokinetics, it is arduous for researchers to design or screen applicable compounds through old methods. With the rapid development of computer technology recently, especially artificial intelligence (AI), its innovation in the field of virtual screening contributes to an increasing efficiency and accuracy in the process of discovering new drugs. PURPOSE This study systematically reviewed the application of computational approaches and artificial intelligence in drug virtual filtering and devising of TCM and presented the potential perspective of computer-aided TCM development. STUDY DESIGN We made a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Then screening the most typical articles for our research. METHODS The systematic review was performed by following the PRISMA guidelines. The databases PubMed, EMBASE, Web of Science, CNKI were used to search for publications that focused on computer-aided drug virtual screening and design in TCM. RESULT Totally, 42 corresponding articles were included in literature reviewing. Aforementioned studies were of great significance to the treatment and cost control of many challenging diseases such as COVID-19, diabetes, Alzheimer's Disease (AD), etc. Computational approaches and AI were widely used in virtual screening in the process of TCM advancing, which include structure-based virtual screening (SBVS) and ligand-based virtual screening (LBVS). Besides, computational technologies were also extensively applied in absorption, distribution, metabolism, excretion and toxicity (ADMET) prediction of candidate drugs and new drug design in crucial course of drug discovery. CONCLUSIONS The applications of computer and AI play an important role in the drug virtual screening and design in the field of TCM, with huge application prospects.
Collapse
Affiliation(s)
- Yumeng Lin
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - You Zhang
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Dongyang Wang
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Bowen Yang
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Ying-Qiang Shen
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China.
| |
Collapse
|
7
|
Yang B, Bao W, Chen B. Disease-Ligand Identification Based on Flexible Neural Tree. Front Microbiol 2022; 13:912145. [PMID: 35733966 PMCID: PMC9207514 DOI: 10.3389/fmicb.2022.912145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 05/06/2022] [Indexed: 12/04/2022] Open
Abstract
In order to screen the disease-related compounds of a traditional Chinese medicine prescription in network pharmacology research accurately, a new virtual screening method based on flexible neural tree (FNT) model, hybrid evolutionary method and negative sample selection algorithm is proposed. A novel hybrid evolutionary algorithm based on the Grammar-guided genetic programming and salp swarm algorithm is proposed to infer the optimal FNT. According to hypertension, diabetes, and Corona Virus Disease 2019, disease-related compounds are collected from the up-to-date literatures. The unrelated compounds are chosen by negative sample selection algorithm. ECFP6, MACCS, Macrocycle, and RDKit are utilized to numerically characterize the chemical structure of each compound collected, respectively. The experiment results show that our proposed method performs better than classical classifiers [Support Vector Machine (SVM), random forest (RF), AdaBoost, decision tree (DT), Gradient Boosting Decision Tree (GBDT), KNN, logic regression (LR), and Naive Bayes (NB)], up-to-date classifier (gcForest), and deep learning method (forgeNet) in terms of AUC, ROC, TPR, FPR, Precision, Specificity, and F1. MACCS method is suitable for the maximum number of classifiers. All methods perform poorly with ECFP6 molecular descriptor.
Collapse
Affiliation(s)
- Bin Yang
- School of Information Science and Engineering, Zaozhuang University, Zaozhuang, China
| | - Wenzheng Bao
- School of Information and Electrical Engineering, Xuzhou University of Technology, Xuzhou, China
- *Correspondence: Wenzheng Bao,
| | | |
Collapse
|
8
|
Ji Y, Li R, Tian Y, Chen G, Yan A. Classification models and SAR analysis on thromboxane A 2 synthase inhibitors by machine learning methods. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022; 33:429-462. [PMID: 35678125 DOI: 10.1080/1062936x.2022.2078880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Accepted: 05/11/2022] [Indexed: 06/15/2023]
Abstract
Thromboxane A2 synthase (TXS) is a promising drug target for cardiovascular diseases and cancer. In this work, we conducted a structure-activity relationship (SAR) study on 526 TXS inhibitors for bioactivity prediction. Three types of descriptors (MACCS fingerprints, ECFP4 fingerprints, and MOE descriptors) were utilized to characterize inhibitors, 24 classification models were developed by support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and deep neural networks (DNN). Then we reduced the number of fingerprints according to the contribution of descriptors to the models, and constructed 16 extra models on simplified fingerprints. In general, Model_4D built by DNN algorithm and 67 bits MACCS fingerprints performs best. The prediction accuracy of the model on the test set is 0.969, and Matthews correlation coefficient (MCC) is 0.936. The distance between compound and model (dSTD-PRO) was used to characterize the application domain of the model. In the test set of Model_4D, dSTD-PRO of 91.5% compounds is lower than the corresponding training set threshold (threshold0.90 = 0.1055), and the accuracy of these compounds is 0.983. In addition, the important descriptors were summarized and further analyzed. It showed that aromatic nitrogenous heterocyclic groups were beneficial to improve the bioactivity of TXS inhibitors.
Collapse
Affiliation(s)
- Y Ji
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China
| | - R Li
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China
| | - Y Tian
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China
| | - G Chen
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing, China
| | - A Yan
- State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, Beijing, P. R. China
| |
Collapse
|
9
|
In silico design of novel PIN1 inhibitors by combined of 3D-QSAR, molecular docking, molecular dynamic simulation and ADMET studies. J Mol Struct 2022. [DOI: 10.1016/j.molstruc.2021.132291] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
10
|
Torkamannia A, Omidi Y, Ferdousi R. A review of machine learning approaches for drug synergy prediction in cancer. Brief Bioinform 2022; 23:6552269. [PMID: 35323854 DOI: 10.1093/bib/bbac075] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 01/19/2022] [Accepted: 02/14/2022] [Indexed: 02/06/2023] Open
Abstract
Combinational pharmacotherapy with the synergistic/additive effect is a powerful treatment strategy for complex diseases such as malignancies. Identifying synergistic combinations with various compounds and structures requires testing a large number of compound combinations. However, in practice, examining different compounds by in vivo and in vitro approaches is costly, infeasible and challenging. In the last decades, significant success has been achieved by expanding computational methods in different pharmacological and bioinformatics domains. As promising tools, computational approaches such as machine learning algorithms (MLAs) are used for prioritizing combinational pharmacotherapies. This review aims to provide the models developed to predict synergistic drug combinations in cancer by MLAs with various information, including gene expression, protein-protein interactions, metabolite interactions, pathways and pharmaceutical information such as chemical structure, molecular descriptor and drug-target interactions.
Collapse
Affiliation(s)
- Anna Torkamannia
- Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Yadollah Omidi
- Department of Pharmaceutical Sciences, College of Pharmacy, Nova Southeastern University, Fort Lauderdale, Florida, United States
| | - Reza Ferdousi
- Department of Health Information Technology, School of Management and Medical Informatics, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
11
|
Bucinsky L, Bortňák D, Gall M, Matúška J, Milata V, Pitoňák M, Štekláč M, Végh D, Zajaček D. Machine learning prediction of 3CLpro SARS-CoV-2 docking scores. Comput Biol Chem 2022; 98:107656. [PMID: 35288359 PMCID: PMC8881816 DOI: 10.1016/j.compbiolchem.2022.107656] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 02/23/2022] [Accepted: 02/24/2022] [Indexed: 12/14/2022]
Abstract
Molecular docking results of two training sets containing 866 and 8,696 compounds were used to train three different machine learning (ML) approaches. Neural network approaches according to Keras and TensorFlow libraries and the gradient boosted decision trees approach of XGBoost were used with DScribe’s Smooth Overlap of Atomic Positions molecular descriptors. In addition, neural networks using the SchNetPack library and descriptors were used. The ML performance was tested on three different sets, including compounds for future organic synthesis. The final evaluation of the ML predicted docking scores was based on the ZINC in vivo set, from which 1,200 compounds were randomly selected with respect to their size. The results obtained showed a consistent ML prediction capability of docking scores, and even though compounds with more than 60 atoms were found slightly overestimated they remain valid for a subsequent evaluation of their drug repurposing suitability.
Collapse
|
12
|
Ebenezer O, Damoyi N, Shapi M. Predicting New Anti-Norovirus Inhibitor With the Help of Machine Learning Algorithms and Molecular Dynamics Simulation-Based Model. Front Chem 2021; 9:753427. [PMID: 34869204 PMCID: PMC8636098 DOI: 10.3389/fchem.2021.753427] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 10/13/2021] [Indexed: 12/30/2022] Open
Abstract
Hepatitis C virus (HCV) inhibitors are essential in the treatment of human norovirus (HuNoV). This study aimed to map out HCV NS5B RNA-dependent RNA polymerase inhibitors that could potentially be responsible for the inhibitory activity of HuNoV RdRp. It is necessary to develop robust machine learning and in silico methods to predict HuNoV RdRp compounds. In this study, Naïve Bayesian and random forest models were built to categorize norovirus RdRp inhibitors from the non-inhibitors using their molecular descriptors and PubChem fingerprints. The best model observed had accuracy, specificity, and sensitivity values of 98.40%, 97.62%, and 97.62%, respectively. Meanwhile, an external test set was used to validate model performance before applicability to the screened HCV compounds database. As a result, 775 compounds were predicted as NoV RdRp inhibitors. The pharmacokinetics calculations were used to filter out the inhibitors that lack drug-likeness properties. Molecular docking and molecular dynamics simulation investigated the inhibitors' binding modes and residues critical for the HuNoV RdRp receptor. The most active compound, CHEMBL167790, closely binds to the binding pocket of the RdRp enzyme and depicted stable binding with RMSD 0.8-3.2 Å, and the RMSF profile peak was between 1.0-4.0 Å, and the conformational fluctuations were at 450-460 residues. Moreover, the dynamic residue cross-correlation plot also showed the pairwise correlation between the binding residues 300-510 of the HuNoV RdRp receptor and CHEMBL167790. The principal component analysis depicted the enhanced movement of protein atoms. Moreover, additional residues such as Glu510 and Asn505 interacted with CHEMBL167790 via water bridge and established H-bond interactions after the simulation. http://zinc15.docking.org/substances/ZINC000013589565.
Collapse
Affiliation(s)
- Oluwakemi Ebenezer
- Department of Chemistry, Faculty of Natural Science, Mangosuthu University of Technology, Durban, South Africa
| | | | | |
Collapse
|
13
|
Molecular machine based on Rotaxane@Tricyclic antidepressant carrier: Theoretical molecular dynamic simulation. COMPUT THEOR CHEM 2021. [DOI: 10.1016/j.comptc.2020.113138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
14
|
Shi C, Dong F, Zhao G, Zhu N, Lao X, Zheng H. Applications of machine-learning methods for the discovery of NDM-1 inhibitors. Chem Biol Drug Des 2020; 96:1232-1243. [PMID: 32418370 DOI: 10.1111/cbdd.13708] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 04/25/2020] [Accepted: 05/06/2020] [Indexed: 12/11/2022]
Abstract
The emergence of New Delhi metal beta-lactamase (NDM-1)-producing bacteria and their worldwide spread pose great challenges for the treatment of drug-resistant bacterial infections. These bacteria can hydrolyze most β-lactam antibacterials. Unfortunately, there are no clinically useful NDM-1 inhibitors. In the current work, we manually collected NDM-1 inhibitors reported in the past decade and established the first NDM-1 inhibitor database. Four machine-learning models were constructed using the structural and property characteristics of the collected compounds as input training set to discover potential NDM-1 inhibitors. In order to distinguish between high active inhibitors and putative positive drugs, a three-classification strategy was introduced in our study. In detail, the commonly used positive and negative divisions are converted into strongly active, weakly active, and inactive. The accuracy of the best prediction model designed based on this strategy reached 90.5%, compared with 69.14% achieved by the traditional docking-based virtual screening method. Consequently, the best model was used to virtually screen a natural product library. The safety of the selected compounds was analyzed by the ADMET prediction model based on machine learning. Seven novel NDM-1 inhibitors were identified, which will provide valuable clues for the discovery of NDM-1 inhibitors.
Collapse
Affiliation(s)
- Cheng Shi
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Fanyi Dong
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Guiling Zhao
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Ning Zhu
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Xingzhen Lao
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| | - Heng Zheng
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, China
| |
Collapse
|
15
|
Huang X, Ribeiro JD, Franklin JC. The Differences Between Individuals Engaging in Nonsuicidal Self-Injury and Suicide Attempt Are Complex ( vs. Complicated or Simple). Front Psychiatry 2020; 11:239. [PMID: 32317991 PMCID: PMC7154073 DOI: 10.3389/fpsyt.2020.00239] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 03/11/2020] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Why do some people engage in nonsuicidal self-injury (NSSI) while others attempt suicide? One way to advance knowledge about this question is to shed light on the differences between people who engage in NSSI and people who attempt suicide. These groups could differ in three broad ways. First, these two groups may differ in a simple way, such that one or a small set of factors is both necessary and sufficient to accurately distinguish the two groups. Second, they might differ in a complicated way, meaning that a specific set of a large number of factors is both necessary and sufficient to accurately classify them. Third, they might differ in a complex way, with no necessary factor combinations and potentially no sufficient factor combinations. In this scenario, at the group level, complicated algorithms would either be insufficient (i.e., no complicated algorithm produces good accuracy) or unnecessary (i.e., many complicated algorithms produce good accuracy) to distinguish between groups. This study directly tested these three possibilities in a sample of people with a history of NSSI and/or suicide attempt. METHOD A total of 954 participants who have either engaged in NSSI and/or suicide attempt in their lifetime were recruited from online forums. Participants completed a series of measures on factors commonly associated with NSSI and suicide attempt. To test for simple differences, univariate logistic regressions were conducted. One theoretically informed multiple logistic regression model with suicidal desire, capability for suicide, and their interaction term was considered as well. To examine complicated and complex differences, multiple logistic regression and machine learning analyses were conducted. RESULTS No simple algorithm (i.e., single factor or small set of factors) accurately distinguished between groups. Complicated algorithms constructed with cross-validation methods produced fair accuracy; complicated algorithms constructed with bootstrap optimism methods produced good accuracy, but multiple different algorithms with this method produced similar results. CONCLUSIONS Findings were consistent with complex differences between people who engage in NSSI and suicide attempts. Specific complicated algorithms were either insufficient (cross-validation results) or unnecessary (bootstrap optimism results) to distinguish between these groups with high accuracy.
Collapse
Affiliation(s)
| | | | - Joseph C. Franklin
- Department of Psychology, Florida State University, Tallahassee, FL, United States
| |
Collapse
|
16
|
Yu JW, Yuan HW, Bao LD, Si LG. Interaction between piperine and genes associated with sciatica and its mechanism based on molecular docking technology and network pharmacology. Mol Divers 2020; 25:233-248. [PMID: 32130644 PMCID: PMC7870775 DOI: 10.1007/s11030-020-10055-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Accepted: 02/17/2020] [Indexed: 12/15/2022]
Abstract
Abstract Piperine is the main active component of Piper longum L., which is also the main component of anti-sciatica Mongolian medicine Naru Sanwei pill. It has many pharmacological activities such as anti-inflammatory and immune regulation.
This paper aims to preliminarily explore the potential mechanism of piperine in the treatment of sciatica through network pharmacology and molecular docking. TCMSP, ETCM database and literature mining were used to collect the active compounds of Piper longum L. Swiss TargetPrediction and SuperPred server were used to find the targets of compounds. At the same time, CTD database was used to collect the targets of sciatica. Then the above targets were compared and analyzed to select the targets of anti-sciatica in Piper longum L. The Go (gene ontology) annotation and KEGG pathway of the targets were enriched and analyzed by Metascape database platform. The molecular docking between the effective components and the targets was verified by Autodock. After that, the sciatica model of rats was established and treated with piperine. The expression level of inflammatory factors and proteins in the serum and tissues of rat sciatic nerve were detected by ELISA and Western blot. HE staining and immunohistochemistry were carried out on the sciatica tissues of rats. The results showed that Piper longum L. can regulate the development of sciatica and affect the expressions of PPARG and NF-kB1 through its active ingredient piperine, and there is endogenous interaction between PPARG and NF-kB1. Graphic abstract ![]()
Collapse
Affiliation(s)
- Jiu-Wang Yu
- Department of Pharmacy, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, 010059, Inner Mongolia, People's Republic of China
| | - Hong-Wei Yuan
- Department of Pathology, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, 010059, Inner Mongolia, People's Republic of China
| | - Li-Dao Bao
- Department of Pharmacy, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, 010059, Inner Mongolia, People's Republic of China.
| | - Leng-Ge Si
- Mongolia Medical School, Inner Mongolia Medical University, Hohhot, 010110, Inner Mongolia, People's Republic of China
| |
Collapse
|
17
|
Xue M, Su Y, Li C, Wang S, Yao H. Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework. J Diabetes Res 2020; 2020:6873891. [PMID: 33029536 PMCID: PMC7532405 DOI: 10.1155/2020/6873891] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Revised: 08/01/2020] [Accepted: 09/02/2020] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND An estimated 425 million people globally have diabetes, accounting for 12% of the world's health expenditures, and the number continues to grow, placing a huge burden on the healthcare system, especially in those remote, underserved areas. METHODS A total of 584,168 adult subjects who have participated in the national physical examination were enrolled in this study. The risk factors for type II diabetes mellitus (T2DM) were identified by p values and odds ratio, using logistic regression (LR) based on variables of physical measurement and a questionnaire. Combined with the risk factors selected by LR, we used a decision tree, a random forest, AdaBoost with a decision tree (AdaBoost), and an extreme gradient boosting decision tree (XGBoost) to identify individuals with T2DM, compared the performance of the four machine learning classifiers, and used the best-performing classifier to output the degree of variables' importance scores of T2DM. RESULTS The results indicated that XGBoost had the best performance (accuracy = 0.906, precision = 0.910, recall = 0.902, F-1 = 0.906, and AUC = 0.968). The degree of variables' importance scores in XGBoost showed that BMI was the most significant feature, followed by age, waist circumference, systolic pressure, ethnicity, smoking amount, fatty liver, hypertension, physical activity, drinking status, dietary ratio (meat to vegetables), drink amount, smoking status, and diet habit (oil loving). CONCLUSIONS We proposed a classifier based on LR-XGBoost which used fourteen variables of patients which are easily obtained and noninvasive as predictor variables to identify potential incidents of T2DM. The classifier can accurately screen the risk of diabetes in the early phrase, and the degree of variables' importance scores gives a clue to prevent diabetes occurrence.
Collapse
Affiliation(s)
- Mingyue Xue
- Hospital of Traditional Chinese Medicine Affiliated to the Fourth Clinical Medical College of Xinjiang Medical University, Urumqi, China
- College of Public Health, Xinjiang Medical University, Urumqi, China
| | - Yinxia Su
- College of Public Health, Xinjiang Medical University, Urumqi, China
| | - Chen Li
- The First Affiliated Hospital of Xinjiang Medical University, Urumqi, China
| | - Shuxia Wang
- Center of Health Management, The First Affiliated Hospital, Xinjiang Medical University, Urumqi, China
| | - Hua Yao
- Center of Health Management, The First Affiliated Hospital, Xinjiang Medical University, Urumqi, China
| |
Collapse
|
18
|
Kadioglu O, Efferth T. A Machine Learning-Based Prediction Platform for P-Glycoprotein Modulators and Its Validation by Molecular Docking. Cells 2019; 8:E1286. [PMID: 31640190 PMCID: PMC6829872 DOI: 10.3390/cells8101286] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 10/18/2019] [Accepted: 10/20/2019] [Indexed: 12/20/2022] Open
Abstract
P-glycoprotein (P-gp) is an important determinant of multidrug resistance (MDR) because its overexpression is associated with increased efflux of various established chemotherapy drugs in many clinically resistant and refractory tumors. This leads to insufficient therapeutic targeting of tumor populations, representing a major drawback of cancer chemotherapy. Therefore, P-gp is a target for pharmacological inhibitors to overcome MDR. In the present study, we utilized machine learning strategies to establish a model for P-gp modulators to predict whether a given compound would behave as substrate or inhibitor of P-gp. Random forest feature selection algorithm-based leave-one-out random sampling was used. Testing the model with an external validation set revealed high performance scores. A P-gp modulator list of compounds from the ChEMBL database was used to test the performance, and predictions from both substrate and inhibitor classes were selected for the last step of validation with molecular docking. Predicted substrates revealed similar docking poses than that of doxorubicin, and predicted inhibitors revealed similar docking poses than that of the known P-gp inhibitor elacridar, implying the validity of the predictions. We conclude that the machine-learning approach introduced in this investigation may serve as a tool for the rapid detection of P-gp substrates and inhibitors in large chemical libraries.
Collapse
Affiliation(s)
- Onat Kadioglu
- Department of Pharmaceutical Biology, Institute of Pharmacy and Biochemistry, Johannes Gutenberg University, 55128 Mainz, Germany.
| | - Thomas Efferth
- Department of Pharmaceutical Biology, Institute of Pharmacy and Biochemistry, Johannes Gutenberg University, 55128 Mainz, Germany.
| |
Collapse
|
19
|
Computational Health Engineering Applied to Model Infectious Diseases and Antimicrobial Resistance Spread. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9122486] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Infectious diseases are the primary cause of mortality worldwide. The dangers of infectious disease are compounded with antimicrobial resistance, which remains the greatest concern for human health. Although novel approaches are under investigation, the World Health Organization predicts that by 2050, septicaemia caused by antimicrobial resistant bacteria could result in 10 million deaths per year. One of the main challenges in medical microbiology is to develop novel experimental approaches, which enable a better understanding of bacterial infections and antimicrobial resistance. After the introduction of whole genome sequencing, there was a great improvement in bacterial detection and identification, which also enabled the characterization of virulence factors and antimicrobial resistance genes. Today, the use of in silico experiments jointly with computational and machine learning offer an in depth understanding of systems biology, allowing us to use this knowledge for the prevention, prediction, and control of infectious disease. Herein, the aim of this review is to discuss the latest advances in human health engineering and their applicability in the control of infectious diseases. An in-depth knowledge of host–pathogen–protein interactions, combined with a better understanding of a host’s immune response and bacterial fitness, are key determinants for halting infectious diseases and antimicrobial resistance dissemination.
Collapse
|