Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sun H. A Naive Bayes Classifier for Prediction of Multidrug Resistance Reversal Activity on the Basis of Atom Typing. J Med Chem 2005;48:4031-9. [PMID: 15943476 DOI: 10.1021/jm050180t] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

For:	Sun H. A Naive Bayes Classifier for Prediction of Multidrug Resistance Reversal Activity on the Basis of Atom Typing. J Med Chem 2005;48:4031-9. [PMID: 15943476 DOI: 10.1021/jm050180t] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Yu S, Liao B, Zhu W, Peng D, Wu F. Accurate prediction and key protein sequence feature identification of cyclins. Brief Funct Genomics 2023;22:411-419. [PMID: 37118891 DOI: 10.1093/bfgp/elad014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Revised: 03/03/2023] [Accepted: 03/17/2023] [Indexed: 04/30/2023] Open

Win ZM, Cheong AMY, Hopkins WS. Using Machine Learning To Predict Partition Coefficient (Log P) and Distribution Coefficient (Log D) with Molecular Descriptors and Liquid Chromatography Retention Time. J Chem Inf Model 2023;63:1906-1913. [PMID: 36926888 DOI: 10.1021/acs.jcim.2c01373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]

Hao Y, Fan T, Sun G, Li F, Zhang N, Zhao L, Zhong R. Environmental toxicity risk evaluation of nitroaromatic compounds: Machine learning driven binary/multiple classification and design of safe alternatives. Food Chem Toxicol 2022;170:113461. [PMID: 36243219 DOI: 10.1016/j.fct.2022.113461] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 09/11/2022] [Accepted: 10/04/2022] [Indexed: 11/06/2022]

Chen S, Li T, Yang L, Zhai F, Jiang X, Xiang R, Ling G. Artificial intelligence-driven prediction of multiple drug interactions. Brief Bioinform 2022;23:6720429. [PMID: 36168896 DOI: 10.1093/bib/bbac427] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 12/14/2022] Open

Yu S, Peng D, Zhu W, Liao B, Wang P, Yang D, Wu F. Hybrid_DBP: Prediction of DNA-binding proteins using hybrid features and convolutional neural networks. Front Pharmacol 2022;13:1031759. [PMID: 36299898 PMCID: PMC9589247 DOI: 10.3389/fphar.2022.1031759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 09/27/2022] [Indexed: 11/21/2022] Open

Affiliation(s)

Shaoyou Yu Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China
Dejun Peng Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China
Wen Zhu Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China *Correspondence: Wen Zhu,
Bo Liao Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China
Peng Wang Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China
Dongxuan Yang Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China
Fangxiang Wu Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China School of Mathematics and Statistics, Hainan Normal University, Haikou, China

Collapse

Wang Y, Michael S, Yang SM, Huang R, Cruz-Gutierrez K, Zhang Y, Zhao J, Xia M, Shinn P, Sun H. Retro Drug Design: From Target Properties to Molecular Structures. J Chem Inf Model 2022;62:2659-2669. [PMID: 35653613 PMCID: PMC9198977 DOI: 10.1021/acs.jcim.2c00123] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Abstract

To deliver more therapeutics to more patients more quickly and economically is the ultimate goal of pharmaceutical researchers. The advent and rapid development of artificial intelligence (AI), in combination with other powerful computational methods in drug discovery, makes this goal more practical than ever before. Here, we describe a new strategy, retro drug design, or RDD, to create novel small-molecule drugs from scratch to meet multiple predefined requirements, including biological activity against a drug target and optimal range of physicochemical and ADMET properties. The molecular structure was represented by an atom typing based molecular descriptor system, optATP, which was further transformed to the space of loading vectors from principal component analysis. Traditional predictive models were trained over experimental data for the target properties using optATP and shallow machine learning methods. The Monte Carlo sampling algorithm was then utilized to find the solutions in the space of loading vectors that have the target properties. Finally, a deep learning model was employed to decode molecular structures from the solutions. To test the feasibility of the algorithm, we challenged RDD to generate novel kinase inhibitors from random numbers with five different ADMET properties optimized at the same time. The best Tanimoto similarity score between the generated valid structures and the available 4,314 kinase inhibitors was < 0.50, indicating a high extent of novelty of the generated compounds. From the 3,040 structures that met all six target properties, 20 were selected for synthesis and experimental measurement of inhibition activity over 97 representative kinases and the ADMET properties. Fifteen and eight compounds were determined to be hits or strong hits, respectively. Five of the six strong kinase inhibitors have excellent experimental ADMET properties. The results presented in this paper illustrate that RDD has the potential to significantly improve the current drug discovery process.

Collapse

Tang W, Liu W, Wang Z, Hong H, Chen J. Machine learning models on chemical inhibitors of mitochondrial electron transport chain. JOURNAL OF HAZARDOUS MATERIALS 2022;426:128067. [PMID: 34920224 DOI: 10.1016/j.jhazmat.2021.128067] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 12/05/2021] [Accepted: 12/08/2021] [Indexed: 06/14/2023]

Wang Y, Michael S, Huang R, Zhao J, Recabo K, Bougie D, Shu Q, Shinn P, Sun H. Retro Drug Design: From Target Properties to Molecular Structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 34013260 PMCID: PMC8132216 DOI: 10.1101/2021.05.11.442656] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Abstract

To generate drug molecules of desired properties with computational methods is the holy grail in pharmaceutical research. Here we describe an AI strategy, retro drug design, or RDD, to generate novel small molecule drugs from scratch to meet predefined requirements, including but not limited to biological activity against a drug target, and optimal range of physicochemical and ADMET properties. Traditional predictive models were first trained over experimental data for the target properties, using an atom typing based molecular descriptor system, ATP. Monte Carlo sampling algorithm was then utilized to find the solutions in the ATP space defined by the target properties, and the deep learning model of Seq2Seq was employed to decode molecular structures from the solutions. To test feasibility of the algorithm, we challenged RDD to generate novel drugs that can activate μ opioid receptor (MOR) and penetrate blood brain barrier (BBB). Starting from vectors of random numbers, RDD generated 180,000 chemical structures, of which 78% were chemically valid. About 42,000 (31%) of the valid structures fell into the property space defined by MOR activity and BBB permeability. Out of the 42,000 structures, only 267 chemicals were commercially available, indicating a high extent of novelty of the AI-generated compounds. We purchased and assayed 96 compounds, and 25 of which were found to be MOR agonists. These compounds also have excellent BBB scores. The results presented in this paper illustrate that RDD has potential to revolutionize the current drug discovery process and create novel structures with multiple desired properties, including biological functions and ADMET properties. Availability of an AI-enabled fast track in drug discovery is essential to cope with emergent public health threat, such as pandemic of COVID-19.

Collapse

Ye Z, Yang W, Yang Y, Ouyang D. Interpretable machine learning methods for in vitro pharmaceutical formulation development. FOOD FRONTIERS 2021. [DOI: 10.1002/fft2.78] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Patel L, Shukla T, Huang X, Ussery DW, Wang S. Machine Learning Methods in Drug Discovery. Molecules 2020;25:E5277. [PMID: 33198233 PMCID: PMC7696134 DOI: 10.3390/molecules25225277] [Citation(s) in RCA: 118] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 11/04/2020] [Accepted: 11/09/2020] [Indexed: 12/30/2022] Open

Tang W, Chen J, Hong H. Development of classification models for predicting inhibition of mitochondrial fusion and fission using machine learning methods. CHEMOSPHERE 2020;273:128567. [PMID: 34756375 DOI: 10.1016/j.chemosphere.2020.128567] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 10/03/2020] [Accepted: 10/06/2020] [Indexed: 06/13/2023]

Tang W, Chen J, Hong H. Discriminant models on mitochondrial toxicity improved by consensus modeling and resolving imbalance in training. CHEMOSPHERE 2020;253:126768. [PMID: 32464767 DOI: 10.1016/j.chemosphere.2020.126768] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/08/2020] [Accepted: 04/08/2020] [Indexed: 06/11/2023]

Korkmaz S. Deep Learning-Based Imbalanced Data Classification for Drug Discovery. J Chem Inf Model 2020;60:4180-4190. [DOI: 10.1021/acs.jcim.9b01162] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Hao Y, Sun G, Fan T, Sun X, Liu Y, Zhang N, Zhao L, Zhong R, Peng Y. Prediction on the mutagenicity of nitroaromatic compounds using quantum chemistry descriptors based QSAR and machine learning derived classification methods. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2019;186:109822. [PMID: 31634658 DOI: 10.1016/j.ecoenv.2019.109822] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 10/11/2019] [Accepted: 10/14/2019] [Indexed: 06/10/2023]

Hinge VK, Roy D, Kovalenko A. Prediction of P-glycoprotein inhibitors with machine learning classification models and 3D-RISM-KH theory based solvation energy descriptors. J Comput Aided Mol Des 2019;33:965-971. [PMID: 31745705 DOI: 10.1007/s10822-019-00253-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 11/14/2019] [Indexed: 11/24/2022]

Morita S. Chemometrics and Related Fields in Python. ANAL SCI 2019;36:107-112. [PMID: 31735763 DOI: 10.2116/analsci.19r006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Jiang C, Yang H, Di P, Li W, Tang Y, Liu G. In silico prediction of chemical reproductive toxicity using machine learning. J Appl Toxicol 2019;39:844-854. [DOI: 10.1002/jat.3772] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Revised: 12/05/2018] [Accepted: 12/15/2018] [Indexed: 12/30/2022]

Sun G, Fan T, Sun X, Hao Y, Cui X, Zhao L, Ren T, Zhou Y, Zhong R, Peng Y. In Silico Prediction of O⁶-Methylguanine-DNA Methyltransferase Inhibitory Potency of Base Analogs with QSAR and Machine Learning Methods. Molecules 2018;23:E2892. [PMID: 30404161 PMCID: PMC6278368 DOI: 10.3390/molecules23112892] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Revised: 11/04/2018] [Accepted: 11/06/2018] [Indexed: 12/24/2022] Open

Abstract

O⁶-methylguanine-DNA methyltransferase (MGMT), a unique DNA repair enzyme, can confer resistance to DNA anticancer alkylating agents that modify the O⁶-position of guanine. Thus, inhibition of MGMT activity in tumors has a great interest for cancer researchers because it can significantly improve the anticancer efficacy of such alkylating agents. In this study, we performed a quantitative structure activity relationship (QSAR) and classification study based on a total of 134 base analogs related to their ED50 values (50% inhibitory concentration) against MGMT. Molecular information of all compounds were described by quantum chemical descriptors and Dragon descriptors. Genetic algorithm (GA) and multiple linear regression (MLR) analysis were combined to develop QSAR models. Classification models were generated by seven machine-learning methods based on six types of molecular fingerprints. Performances of all developed models were assessed by internal and external validation techniques. The best QSAR model was obtained with Q²Loo = 0.83, R² = 0.87, Q²ext = 0.67, and R²ext = 0.69 based on 84 compounds. The results from QSAR studies indicated topological charge indices, polarizability, ionization potential (IP), and number of primary aromatic amines are main contributors for MGMT inhibition of base analogs. For classification studies, the accuracies of 10-fold cross-validation ranged from 0.750 to 0.885 for top ten models. The range of accuracy for the external test set ranged from 0.800 to 0.880 except for PubChem-Tree model, suggesting a satisfactory predictive ability. Three models (Ext-SVM, Ext-Tree and Graph-RF) showed high and reliable predictive accuracy for both training and external test sets. In addition, several representative substructures for characterizing MGMT inhibitors were identified by information gain and substructure frequency analysis method. Our studies might be useful for further study to design and rapidly identify potential MGMT inhibitors.

Collapse

Fan T, Sun G, Zhao L, Cui X, Zhong R. QSAR and Classification Study on Prediction of Acute Oral Toxicity of N-Nitroso Compounds. Int J Mol Sci 2018;19:E3015. [PMID: 30282923 PMCID: PMC6213880 DOI: 10.3390/ijms19103015] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 09/29/2018] [Accepted: 09/30/2018] [Indexed: 12/30/2022] Open

Kaiser TM, Burger PB, Butch CJ, Pelly SC, Liotta DC. A Machine Learning Approach for Predicting HIV Reverse Transcriptase Mutation Susceptibility of Biologically Active Compounds. J Chem Inf Model 2018;58:1544-1552. [PMID: 29953819 DOI: 10.1021/acs.jcim.7b00475] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Volpe DA, Qosa H. Challenges with the precise prediction of ABC-transporter interactions for improved drug discovery. Expert Opin Drug Discov 2018;13:697-707. [PMID: 29943645 DOI: 10.1080/17460441.2018.1493454] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Ghasemi F, Mehridehnavi A, Pérez-Garrido A, Pérez-Sánchez H. Neural network and deep-learning algorithms used in QSAR studies: merits and drawbacks. Drug Discov Today 2018;23:1784-1790. [PMID: 29936244 DOI: 10.1016/j.drudis.2018.06.016] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2017] [Revised: 06/05/2018] [Accepted: 06/14/2018] [Indexed: 10/28/2022]

Fan D, Yang H, Li F, Sun L, Di P, Li W, Tang Y, Liu G. In silico prediction of chemical genotoxicity using machine learning methods and structural alerts. Toxicol Res (Camb) 2018;7:211-220. [PMID: 30090576 PMCID: PMC6062245 DOI: 10.1039/c7tx00259a] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/14/2017] [Indexed: 01/19/2023] Open

Yang M, Chen J, Xu L, Shi X, Zhou X, Xi Z, An R, Wang X. A novel adaptive ensemble classification framework for ADME prediction. RSC Adv 2018;8:11661-11683. [PMID: 35542768 PMCID: PMC9079056 DOI: 10.1039/c8ra01206g] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2018] [Accepted: 03/20/2018] [Indexed: 12/20/2022] Open

Sun H, Huang R, Xia M, Shahane S, Southall N, Wang Y. Prediction of hERG Liability - Using SVM Classification, Bootstrapping and Jackknifing. Mol Inform 2017;36:10.1002/minf.201600126. [PMID: 28000393 PMCID: PMC5382096 DOI: 10.1002/minf.201600126] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Accepted: 11/14/2016] [Indexed: 12/11/2022]

Wang Q, Li X, Yang H, Cai Y, Wang Y, Wang Z, Li W, Tang Y, Liu G. In silico prediction of serious eye irritation or corrosion potential of chemicals. RSC Adv 2017. [DOI: 10.1039/c6ra25267b] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Sun H, Nguyen K, Kerns E, Yan Z, Yu KR, Shah P, Jadhav A, Xu X. Highly predictive and interpretable models for PAMPA permeability. Bioorg Med Chem 2016;25:1266-1276. [PMID: 28082071 DOI: 10.1016/j.bmc.2016.12.049] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Revised: 12/22/2016] [Accepted: 12/27/2016] [Indexed: 11/28/2022]

Ngo TD, Tran TD, Le MT, Thai KM. Machine learning-, rule- and pharmacophore-based classification on the inhibition of P-glycoprotein and NorA. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2016;27:747-780. [PMID: 27667641 DOI: 10.1080/1062936x.2016.1233137] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2016] [Accepted: 09/02/2016] [Indexed: 06/06/2023]

Niu AQ, Xie LJ, Wang H, Zhu B, Wang SQ. Prediction of selective estrogen receptor beta agonist using open data and machine learning approach. Drug Des Devel Ther 2016;10:2323-31. [PMID: 27486309 PMCID: PMC4958355 DOI: 10.2147/dddt.s110603] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

Abstract

Background

Estrogen receptors (ERs) are nuclear transcription factors that are involved in the regulation of many complex physiological processes in humans. ERs have been validated as important drug targets for the treatment of various diseases, including breast cancer, ovarian cancer, osteoporosis, and cardiovascular disease. ERs have two subtypes, ER-α and ER-β. Emerging data suggest that the development of subtype-selective ligands that specifically target ER-β could be a more optimal approach to elicit beneficial estrogen-like activities and reduce side effects.

Methods

Herein, we focused on ER-β and developed its in silico quantitative structure-activity relationship models using machine learning (ML) methods.

Results

The chemical structures and ER-β bioactivity data were extracted from public chemogenomics databases. Four types of popular fingerprint generation methods including MACCS fingerprint, PubChem fingerprint, 2D atom pairs, and Chemistry Development Kit extended fingerprint were used as descriptors. Four ML methods including Naïve Bayesian classifier, k-nearest neighbor, random forest, and support vector machine were used to train the models. The range of classification accuracies was 77.10% to 88.34%, and the range of area under the ROC (receiver operating characteristic) curve values was 0.8151 to 0.9475, evaluated by the 5-fold cross-validation. Comparison analysis suggests that both the random forest and the support vector machine are superior for the classification of selective ER-β agonists. Chemistry Development Kit extended fingerprints and MACCS fingerprint performed better in structural representation between active and inactive agonists.

Conclusion

These results demonstrate that combining the fingerprint and ML approaches leads to robust ER-β agonist prediction models, which are potentially applicable to the identification of selective ER-β agonists.

Collapse

Zhang C, Cheng F, Li W, Liu G, Lee PW, Tang Y. In silico Prediction of Drug Induced Liver Toxicity Using Substructure Pattern Recognition Method. Mol Inform 2016;35:136-44. [PMID: 27491923 DOI: 10.1002/minf.201500055] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2015] [Accepted: 12/14/2015] [Indexed: 02/05/2023]

Zhang C, Zhou Y, Gu S, Wu Z, Wu W, Liu C, Wang K, Liu G, Li W, Lee PW, Tang Y. In silico prediction of hERG potassium channel blockage by chemical category approaches. Toxicol Res (Camb) 2016;5:570-582. [PMID: 30090371 DOI: 10.1039/c5tx00294j] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 01/13/2016] [Indexed: 12/18/2022] Open

Affiliation(s)

Chen Zhang Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Yuan Zhou Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Shikai Gu Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Zengrui Wu Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Wenjie Wu Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Changming Liu Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Kaidong Wang Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Guixia Liu Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Weihua Li Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Philip W Lee Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052
Yun Tang Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , 130 Meilong Road , Shanghai 200237 , China . ; ; Tel: +86-21-64251052

Collapse

Yang M, Chen J, Shi X, Xu L, Xi Z, You L, An R, Wang X. Development of in Silico Models for Predicting P-Glycoprotein Inhibitors Based on a Two-Step Approach for Feature Selection and Its Application to Chinese Herbal Medicine Screening. Mol Pharm 2015;12:3691-713. [PMID: 26376206 DOI: 10.1021/acs.molpharmaceut.5b00465] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]

Abstract

P-glycoprotein (P-gp) is regarded as an important factor in determining the ADMET (absorption, distribution, metabolism, elimination, and toxicity) characteristics of drugs and drug candidates. Successful prediction of P-gp inhibitors can thus lead to an improved understanding of the underlying mechanisms of both changes in the pharmacokinetics of drugs and drug-drug interactions. Therefore, there has been considerable interest in the development of in silico modeling of P-gp inhibitors in recent years. Considering that a large number of molecular descriptors are used to characterize diverse structural moleculars, efficient feature selection methods are required to extract the most informative predictors. In this work, we constructed an extensive available data set of 2428 molecules that includes 1518 P-gp inhibitors and 910 P-gp noninhibitors from multiple resources. Importantly, a two-step feature selection approach based on a genetic algorithm and a greedy forward-searching algorithm was employed to select the minimum set of the most informative descriptors that contribute to the prediction of P-gp inhibitors. To determine the best machine learning algorithm, 18 classifiers coupled with the feature selection method were compared. The top three best-performing models (flexible discriminant analysis, support vector machine, and random forest) and their ensemble model using respectively only 3, 9, 7, and 14 descriptors achieve an overall accuracy of 83.2%-86.7% for the training set containing 1040 compounds, an overall accuracy of 82.3%-85.5% for the test set containing 1039 compounds, and a prediction accuracy of 77.4%-79.9% for the external validation set containing 349 compounds. The models were further extensively validated by DrugBank database (1890 compounds). The proposed models are competitive with and in some cases better than other published models in terms of prediction accuracy and minimum number of descriptors. Applicability domain then was addressed by developing an ensemble classification model to obtain more reliable predictions. Finally, we employed these models as a virtual screening tool for identifying potential P-gp inhibitors in Traditional Chinese Medicine Systems Pharmacology (TCMSP) database containing a total of 13 051 unique compounds from 498 herbs, resulting in 875 potential P-gp inhibitors and 15 inhibitor-rich herbs. These predictions were partly supported by a literature search and are valuable not only to develop novel P-gp inhibitors from TCM in the early stages of drug development, but also to optimize the use of herbal remedies.

Collapse

Korkmaz S, Zararsiz G, Goksuluk D. MLViS: A Web Tool for Machine Learning-Based Virtual Screening in Early-Phase of Drug Discovery and Development. PLoS One 2015;10:e0124600. [PMID: 25928885 PMCID: PMC4415797 DOI: 10.1371/journal.pone.0124600] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2015] [Accepted: 03/03/2015] [Indexed: 12/18/2022] Open

Subhani S, Jayaraman A, Jamil K. Homology modelling and molecular docking of MDR1 with chemotherapeutic agents in non-small cell lung cancer. Biomed Pharmacother 2015;71:37-45. [PMID: 25960213 DOI: 10.1016/j.biopha.2015.02.009] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 02/09/2015] [Indexed: 10/24/2022] Open

Thai KM, Huynh NT, Ngo TD, Mai TT, Nguyen TH, Tran TD. Three- and four-class classification models for P-glycoprotein inhibitors using counter-propagation neural networks. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2015;26:139-163. [PMID: 25588022 DOI: 10.1080/1062936x.2014.995701] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Erić S, Kalinić M, Ilić K, Zloh M. Computational classification models for predicting the interaction of drugs with P-glycoprotein and breast cancer resistance protein. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2014;25:939-966. [PMID: 25435255 DOI: 10.1080/1062936x.2014.976265] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2014] [Accepted: 08/13/2014] [Indexed: 06/04/2023]

Balfer J, Bajorath J. Introduction of a methodology for visualization and graphical interpretation of Bayesian classification models. J Chem Inf Model 2014;54:2451-68. [PMID: 25137527 DOI: 10.1021/ci500410g] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Liu Z, Zheng M, Yan X, Gu Q, Gasteiger J, Tijhuis J, Maas P, Li J, Xu J. ChemStable: a web server for rule-embedded naïve Bayesian learning approach to predict compound stability. J Comput Aided Mol Des 2014;28:941-50. [PMID: 25031075 DOI: 10.1007/s10822-014-9778-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2014] [Accepted: 07/09/2014] [Indexed: 11/26/2022]

Li D, Chen L, Li Y, Tian S, Sun H, Hou T. ADMET Evaluation in Drug Discovery. 13. Development of in Silico Prediction Models for P-Glycoprotein Substrates. Mol Pharm 2014;11:716-26. [DOI: 10.1021/mp400450m] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Xu C, Cheng F, Chen L, Du Z, Li W, Liu G, Lee PW, Tang Y. In silico Prediction of Chemical Ames Mutagenicity. J Chem Inf Model 2012;52:2840-7. [DOI: 10.1021/ci300400a] [Citation(s) in RCA: 114] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Modi S, Li J, Malcomber S, Moore C, Scott A, White A, Carmichael P. Integrated in silico approaches for the prediction of Ames test mutagenicity. J Comput Aided Mol Des 2012;26:1017-33. [DOI: 10.1007/s10822-012-9595-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Accepted: 08/09/2012] [Indexed: 02/04/2023]

Chen L, Li Y, Yu H, Zhang L, Hou T. Computational models for predicting substrates or inhibitors of P-glycoprotein. Drug Discov Today 2012;17:343-51. [DOI: 10.1016/j.drudis.2011.11.003] [Citation(s) in RCA: 108] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2011] [Revised: 10/24/2011] [Accepted: 11/10/2011] [Indexed: 01/11/2023]

Carbon-Mangels M, Hutter MC. Selecting Relevant Descriptors for Classification by Bayesian Estimates: A Comparison with Decision Trees and Support Vector Machines Approaches for Disparate Data Sets. Mol Inform 2011;30:885-95. [PMID: 27468108 DOI: 10.1002/minf.201100069] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Accepted: 08/19/2011] [Indexed: 11/12/2022]

Hao M, Li Y, Wang Y, Zhang S. A classification study of human β 3-adrenergic receptor agonists using BCUT descriptors. Mol Divers 2011;15:877-87. [DOI: 10.1007/s11030-011-9321-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2011] [Accepted: 05/17/2011] [Indexed: 10/18/2022]

Chen L, Li Y, Zhao Q, Peng H, Hou T. ADME Evaluation in Drug Discovery. 10. Predictions of P-Glycoprotein Inhibitors Using Recursive Partitioning and Naive Bayesian Classification Techniques. Mol Pharm 2011;8:889-900. [DOI: 10.1021/mp100465q] [Citation(s) in RCA: 127] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Li Y, Du G, Cai W, Shao X. Classification and Quantitative Analysis of Azithromycin Tablets by Raman Spectroscopy and Chemometrics. ACTA ACUST UNITED AC 2011. [DOI: 10.4236/ajac.2011.22015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Rogers D, Hahn M. Extended-connectivity fingerprints. J Chem Inf Model 2010;50:742-54. [PMID: 20426451 DOI: 10.1021/ci100050t] [Citation(s) in RCA: 3748] [Impact Index Per Article: 267.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Cucurull-Sanchez L. Successful identification of key chemical structure modifications that lead to improved ADME profiles. J Comput Aided Mol Des 2010;24:449-58. [DOI: 10.1007/s10822-010-9361-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2009] [Accepted: 04/26/2010] [Indexed: 11/28/2022]

Lead Discovery Using Virtual Screening. TOPICS IN MEDICINAL CHEMISTRY 2009. [PMCID: PMC7176223 DOI: 10.1007/7355_2009_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Chen X, Liang YZ, Yuan DL, Xu QS. A modified uncorrelated linear discriminant analysis model coupled with recursive feature elimination for the prediction of bioactivity. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2009;20:1-26. [PMID: 19343582 DOI: 10.1080/10629360902724127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]