1
|
Matsuzaka Y, Uesawa Y. Optimization of a Deep-Learning Method Based on the Classification of Images Generated by Parameterized Deep Snap a Novel Molecular-Image-Input Technique for Quantitative Structure-Activity Relationship (QSAR) Analysis. Front Bioeng Biotechnol 2019; 7:65. [PMID: 30984753 PMCID: PMC6447703 DOI: 10.3389/fbioe.2019.00065] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 03/07/2019] [Indexed: 12/22/2022] Open
Abstract
Numerous chemical compounds are distributed around the world and may affect the homeostasis of the endocrine system by disrupting the normal functions of hormone receptors. Although the risks associated with these compounds have been evaluated by acute toxicity testing in mammalian models, the chronic toxicity of many chemicals remains due to high cost of the compounds and the testing, etc. However, computational approaches may be promising alternatives and reduce these evaluations. Recently, deep learning (DL) has been shown to be promising prediction models with high accuracy for recognition of images, speech, signals, and videos since it greatly benefits from large datasets. Recently, a novel DL-based technique called DeepSnap was developed to conduct QSAR analysis using three-dimensional images of chemical structures. It can be used to predict the potential toxicity of many different chemicals to various receptors without extraction of descriptors. DeepSnap has been shown to have a very high capacity in tests using Tox21 quantitative qHTP datasets. Numerous parameters must be adjusted to use the DeepSnap method but they have not been optimized. In this study, the effects of these parameters on the performance of the DL prediction model were evaluated in terms of the loss in validation as an indicator for evaluating the performance of the DL using the toxicity information in the Tox21 qHTP database. The relations of the parameters of DeepSnap such as (1) number of molecules per SDF split into (2) zoom factor percentage, (3) atom size for van der waals percentage, (4) bond radius, (5) minimum bond distance, and (6) bond tolerance, with the validation loss following quadratic function curves, which suggests that optimal thresholds exist to attain the best performance with these prediction models. Using the parameter values set with the best performance, the prediction model of chemical compounds for CAR agonist was built using 64 images, at 105° angle, with AUC of 0.791. Thus, based on these parameters, the proposed DeepSnap-DL approach will be highly reliable and beneficial to establish models to assess the risk associated with various chemicals.
Collapse
Affiliation(s)
| | - Yoshihiro Uesawa
- Department of Medical Molecular Informatics, Meiji Pharmaceutical University, Tokyo, Japan
| |
Collapse
|
2
|
Uesawa Y. Quantitative structure-activity relationship analysis using deep learning based on a novel molecular image input technique. Bioorg Med Chem Lett 2018; 28:3400-3403. [PMID: 30177377 DOI: 10.1016/j.bmcl.2018.08.032] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 08/17/2018] [Accepted: 08/25/2018] [Indexed: 11/17/2022]
Abstract
Quantitative structure-activity relationship (QSAR) analysis uses structural, quantum chemical, and physicochemical features calculated from molecular geometry as explanatory variables predicting physiological activity. Recently, deep learning based on advanced artificial neural networks has demonstrated excellent performance in the discipline of QSAR research. While it has properties of feature representation learning that directly calculate feature values from molecular structure, the use of this potential function is limited in QSAR modeling. The present study applied this function of feature representation learning to QSAR analysis by incorporating 360° images of molecular conformations into deep learning. Accordingly, I successfully constructed a highly versatile identification model for chemical compounds that induce mitochondrial membrane potential disruption with the external validation area under the receiver operating characteristic curve of ≥0.9.
Collapse
Affiliation(s)
- Yoshihiro Uesawa
- Department of Medical Molecular Informatics, Meiji Pharmaceutical University, 2-522-1 Noshio, Kiyose, Tokyo 204-8588, Japan.
| |
Collapse
|
3
|
Todorov M. Recent advances in computational chemistry for identification of ligands for biological receptors: interdisciplinary aspects. MEDICAL SCIENCE PULSE 2018. [DOI: 10.5604/01.3001.0011.6670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Background: Computational (in silico) methods, such as quantitative structure-activity relationships (QSARs)
are already well recognized and used in many screening programs related to environmental, industrial and medical
chemistry. The main idea of the QSAR is that there is a relationship between molecular structure and ultimate
biological effect caused by a chemical compound. In this respect the approach could be used successfully for
prediction of various biological endpoints caused by chemical compounds including receptor binding affinity.
Aim of the study: In the current study the capabilities for structure-activity modelling incorporated in noncommercial
software tool have been employed for investigating the binding effect of xenobiotics toward estrogen
and human pregnane X receptor.
Material and methods: The analysis was performed by making use of the non-commercial software platform
QSAR Toolbox. This system allows application of a set of built-in models for different biological effects, and also
allows incorporation of new models for other endpoints.
Results: Two models have been applied for predicting the binding effect toward estrogen and human pregnane
X receptors of a large number of chemicals collected in a single database of high practical concern. The results
show that there are many chemicals which are able to bind the investigated receptors. Since those chemicals are
encountered in the environment, they could be considered as potential threat for society.
Conclusions: The obtained results could be used as initial step for further experimental testing of those chemicals
in order to confirm their potential to harm biological systems in the body.
Collapse
Affiliation(s)
- Milen Todorov
- University “Prof. Dr. Assen Zlatarov”, Burgas, Bulgaria A
| |
Collapse
|
4
|
Mota K, Lima Neto J, Lima Costa A, Oliveira J, Bezerra K, Albuquerque E, Caetano E, Freire V, Fulco U. A quantum biochemistry model of the interaction between the estrogen receptor and the two antagonists used in breast cancer treatment. COMPUT THEOR CHEM 2016. [DOI: 10.1016/j.comptc.2016.05.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
5
|
Shi H, Tian S, Li Y, Li D, Yu H, Zhen X, Hou T. Absorption, Distribution, Metabolism, Excretion, and Toxicity Evaluation in Drug Discovery. 14. Prediction of Human Pregnane X Receptor Activators by Using Naive Bayesian Classification Technique. Chem Res Toxicol 2014; 28:116-25. [DOI: 10.1021/tx500389q] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Huali Shi
- Institute
of Functional Nano and Soft Materials (FUNSOM), Soochow University, Suzhou, Jiangsu 215123, People’s Republic of China
| | - Sheng Tian
- College
of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215123, People’s Republic of China
| | - Youyong Li
- Institute
of Functional Nano and Soft Materials (FUNSOM), Soochow University, Suzhou, Jiangsu 215123, People’s Republic of China
| | - Dan Li
- College
of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, People’s Republic of China
| | - Huidong Yu
- Crystal Pharmatech Inc., 707
Alexander Road, Building 2, Suite 208, Princeton, New Jersey 08540, United States
| | - Xuechu Zhen
- College
of Pharmaceutical Sciences, Soochow University, Suzhou, Jiangsu 215123, People’s Republic of China
| | - Tingjun Hou
- Institute
of Functional Nano and Soft Materials (FUNSOM), Soochow University, Suzhou, Jiangsu 215123, People’s Republic of China
- College
of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, People’s Republic of China
| |
Collapse
|
6
|
Rapado LN, Freitas GC, Polpo A, Rojas-Cardozo M, Rincón JV, Scotti MT, Kato MJ, Nakano E, Yamaguchi LF. A benzoic acid derivative and flavokawains from Piper species as schistosomiasis vector controls. Molecules 2014; 19:5205-18. [PMID: 24762961 PMCID: PMC6271750 DOI: 10.3390/molecules19045205] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Revised: 04/11/2014] [Accepted: 04/15/2014] [Indexed: 11/16/2022] Open
Abstract
The search of alternative compounds to control tropical diseases such as schistosomiasis has pointed to secondary metabolites derived from natural sources. Piper species are candidates in strategies to control the transmission of schistosomiasis due to their production of molluscicidal compounds. A new benzoic acid derivative and three flavokawains from Piper diospyrifolium, P. cumanense and P. gaudichaudianum displayed significant activities against Biomphalaria glabrata snails. Additionally, "in silico" studies were performed using docking assays and Molecular Interaction Fields to evaluate the physical-chemical differences among the compounds in order to characterize the observed activities of the test compounds against Biomphalaria glabrata snails.
Collapse
Affiliation(s)
- Ludmila N. Rapado
- Laboratório de Parasitologia, Instituto Butantan, Av. Vital Brasil, 1500, São Paulo, SP, CEP 05503-900, Brazil; E-Mail:
- Instituto de Ciências Biomédicas, Universidade de São Paulo, Av. Prof. Lineu Prestes, 1374, São Paulo, SP, CEP 05508-000, Brazil
- Authors to whom correspondence should be addressed; E-Mails: (L.N.R.); (L.F.Y.); Tel.: +55-11-3091-7335 (L.N.R.); +55-11-3091-3813 (L.F.Y.)
| | - Giovana C. Freitas
- Research Support Center in Diversity of Natural Products, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, sala 1124, São Paulo, SP, CEP 05508-000, Brazil; E-Mails: (G.C.F.); (M.J.K.)
| | - Adriano Polpo
- Departamento de Estatística, Centro de Ciências Exatas e de Tecnologia, Universidade Federal de São Carlos, Via Washington Luís, km 235, Sao Carlos, SP, Caixa-postal 676, CEP 13565-905, Brazil; E-Mail:
| | - Maritza Rojas-Cardozo
- Department of Pharmacy, Faculty of Sciences, Universidad Nacional de Colombia, Kr 30 45-03, Bogotá, Colombia; E-Mails: (M.R.-C.); (J.V.R.)
| | - Javier V. Rincón
- Department of Pharmacy, Faculty of Sciences, Universidad Nacional de Colombia, Kr 30 45-03, Bogotá, Colombia; E-Mails: (M.R.-C.); (J.V.R.)
| | - Marcus T. Scotti
- Centro de Ciências Aplicadas e Educação, Universidade Federal da Paraíba, Campus IV, Rua da Mangueira, s/n, Rio Tinto, PB, CEP 5829-7000, Brazil; E-Mail:
| | - Massuo J. Kato
- Research Support Center in Diversity of Natural Products, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, sala 1124, São Paulo, SP, CEP 05508-000, Brazil; E-Mails: (G.C.F.); (M.J.K.)
| | - Eliana Nakano
- Laboratório de Parasitologia, Instituto Butantan, Av. Vital Brasil, 1500, São Paulo, SP, CEP 05503-900, Brazil; E-Mail:
| | - Lydia F. Yamaguchi
- Research Support Center in Diversity of Natural Products, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, sala 1124, São Paulo, SP, CEP 05508-000, Brazil; E-Mails: (G.C.F.); (M.J.K.)
- Authors to whom correspondence should be addressed; E-Mails: (L.N.R.); (L.F.Y.); Tel.: +55-11-3091-7335 (L.N.R.); +55-11-3091-3813 (L.F.Y.)
| |
Collapse
|
7
|
Nendza M, Gabbert S, Kühne R, Lombardo A, Roncaglioni A, Benfenati E, Benigni R, Bossa C, Strempel S, Scheringer M, Fernández A, Rallo R, Giralt F, Dimitrov S, Mekenyan O, Bringezu F, Schüürmann G. A comparative survey of chemistry-driven in silico methods to identify hazardous substances under REACH. Regul Toxicol Pharmacol 2013; 66:301-14. [DOI: 10.1016/j.yrtph.2013.05.007] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2012] [Revised: 05/09/2013] [Accepted: 05/11/2013] [Indexed: 11/29/2022]
|
8
|
Yang W, Shen S, Mu L, Yu H. Structure-activity relationship study on the binding of PBDEs with thyroxine transport proteins. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2011; 30:2431-2439. [PMID: 21842493 DOI: 10.1002/etc.645] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Revised: 07/01/2011] [Accepted: 07/25/2011] [Indexed: 05/31/2023]
Abstract
Molecular docking and three-dimensional quantitative structure-activity relationships (3D-QSAR) were used to develop models to predict binding affinity of polybrominated diphenyl ether (PBDE) compounds to the human transthyretin (TTR). Based on the molecular conformations derived from the molecular docking, predictive comparative molecular similarity indices analysis (CoMSIA) models were developed. The results of CoMSIA models were as follows: leave-one-out (LOO) cross-validated squared coefficient q² (LOO) = 0.827 (full model, for all 28 compounds); q² (LOO) = 0.752 (split model, for 22 compounds in the training set); leave-many-out (LMO) cross-validated squared coefficient q² (LMO, two groups) = 0.723 ± 0.100 (full model, for all 28 compounds); q² (LMO, five groups) = 0.795 ± 0.030 (full model, for all 28 compounds); and the predictive squared correlation coefficient r²(pred) = 0.928 (for six compounds in the test set). The developed CoMSIA models can be used to infer the activities of compounds with similar structural characteristics. In addition, the interaction mechanism between hydroxylated polybrominated diphenyl ethers (HO-PBDEs) and the TTR was explored. Hydrogen bonding with amino acid residues Asp74, Ala29, and Asn27 may be an important determinant for HO-PBDEs binding to TTR. Among them, forming hydrogen bonds with amino acid residues Asp74 might exert a more important function.
Collapse
Affiliation(s)
- Weihua Yang
- Xuzhou Normal University, Jiangsu Key Laboratory of Green Synthetic Chemistry for Functional Materials, Xuzhou, Peoples Republic of China.
| | | | | | | |
Collapse
|
9
|
Yang W, Wei S, Liu H, Yu H. Insights into the structural and conformational requirements of polybrominated diphenyl ethers and metabolites as potential estrogens based on molecular docking. CHEMOSPHERE 2011; 84:328-35. [PMID: 21601234 DOI: 10.1016/j.chemosphere.2011.04.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2010] [Revised: 03/21/2011] [Accepted: 04/01/2011] [Indexed: 05/22/2023]
Abstract
PBDEs and their metabolites are of concern due to their increasing concentrations in the environment and their toxic effects. Knowledge about the toxicological mechanisms of PBDEs and metabolites is urgently needed for further screening. The objective of the present study was to explore the structural and conformational requirements of PBDE compounds as human estrogen receptor alpha (hERα) agonists, and further screened out hERα agonists from PBDE compounds. Molecular docking and postdocking analysis were adopted to attain the aim. The obtained results revealed that PBDEs can be primarily screened for their estrogenicity using score values, hydrogen bonds interaction with amino acid residues Glu353 and/or Arg394 might be important for HO-PBDEs' estrogenicity. For most MeO-PBDEs, hydrophobic interaction might be the key factor affecting their estrogenic activity. The current study suggested that molecular docking and postdocking analysis can serve as an efficient pre-screening technique for identifying potential estrogens.
Collapse
Affiliation(s)
- Weihua Yang
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210093, PR China
| | | | | | | |
Collapse
|
10
|
Recent Advances in the Molecular Modeling of Estrogen Receptor-Mediated Toxicity. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2011. [DOI: 10.1016/b978-0-12-386485-7.00006-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register]
|
11
|
Stojić N, Erić S, Kuzmanovski I. Prediction of toxicity and data exploratory analysis of estrogen-active endocrine disruptors using counter-propagation artificial neural networks. J Mol Graph Model 2010; 29:450-60. [PMID: 20952233 DOI: 10.1016/j.jmgm.2010.09.001] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2010] [Revised: 09/05/2010] [Accepted: 09/09/2010] [Indexed: 11/29/2022]
Abstract
In this work, a novel algorithm for optimization of counter-propagation artificial neural networks has been used for development of quantitative structure-activity relationships model for prediction of the estrogenic activity of endocrine-disrupting chemicals. The search for the best model was performed using genetic algorithms. Genetic algorithms were used not only for selection of the most suitable descriptors for modeling, but also for automatic adjustment of their relative importance. Using our recently developed algorithm for automatic adjustment of the relative importance of the input variables, we have developed simple models with very good generalization performances using only few interpretable descriptors. One of the developed models is in details discussed in this article. The simplicity of the chosen descriptors and their relative importance for this model helped us in performing a detailed data exploratory analysis which gave us an insight in the structural features required for the activity of the estrogenic endocrine-disrupting chemicals.
Collapse
Affiliation(s)
- Nataša Stojić
- Institut za Hemija, PMF, Univerzitet "Sv. Kiril i Metodij", PO Box 162, 1001 Skopje, Macedonia
| | | | | |
Collapse
|
12
|
Gramatica P. Chemometric Methods and Theoretical Molecular Descriptors in Predictive QSAR Modeling of the Environmental Behavior of Organic Pollutants. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2010. [DOI: 10.1007/978-1-4020-9783-6_12] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
13
|
Rabinowitz JR, Little SB, Laws SC, Goldsmith MR. Molecular Modeling for Screening Environmental Chemicals for Estrogenicity: Use of the Toxicant-Target Approach. Chem Res Toxicol 2009; 22:1594-602. [DOI: 10.1021/tx900135x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- James R. Rabinowitz
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina
| | - Stephen B. Little
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina
| | - Susan C. Laws
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina
| | - Michael-Rock Goldsmith
- National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina
| |
Collapse
|
14
|
Cronin M, Worth A. (Q)SARs for Predicting Effects Relating to Reproductive Toxicity. ACTA ACUST UNITED AC 2008. [DOI: 10.1002/qsar.200710118] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
15
|
Korhonen SP, Tuppurainen K, Asikainen A, Laatikainen R, Peräkylä M. SOMFA on Large Diverse Xenoestrogen Dataset: The Effect of Superposition Algorithms and External Regression Tools. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/qsar.200610003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
16
|
Liu H, Papa E, Walker JD, Gramatica P. In silico screening of estrogen-like chemicals based on different nonlinear classification models. J Mol Graph Model 2007; 26:135-44. [PMID: 17293141 DOI: 10.1016/j.jmgm.2007.01.003] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2006] [Revised: 01/10/2007] [Accepted: 01/12/2007] [Indexed: 01/28/2023]
Abstract
Increasing concern is being shown by the scientific community, government regulators, and the public about endocrine-disrupting chemicals that are adversely affecting human and wildlife health through a variety of mechanisms. There is a great need for an effective means of rapidly assessing endocrine-disrupting activity, especially estrogen-simulating activity, because of the large number of such chemicals in the environment. In this study, quantitative structure activity relationship (QSAR) models were developed to quickly and effectively identify possible estrogen-like chemicals based on 232 structurally-diverse chemicals (training set) by using several nonlinear classification methodologies (least-square support vector machine (LS-SVM), counter-propagation artificial neural network (CP-ANN), and k nearest neighbour (kNN)) based on molecular structural descriptors. The models were externally validated by 87 chemicals (prediction set) not included in the training set. All three methods can give satisfactory prediction results both for training and prediction sets, and the most accurate model was obtained by the LS-SVM approach through the comparison of performance. In addition, our model was also applied to about 58,000 discrete organic chemicals; about 76% were predicted not to bind to Estrogen Receptor. The obtained results indicate that the proposed QSAR models are robust, widely applicable and could provide a feasible and practical tool for the rapid screening of potential estrogens.
Collapse
Affiliation(s)
- Huanxiang Liu
- Department of Structural and Functional Biology, QSAR Research Unit in Environmental Chemistry and Ecotoxicology, University of Insubria, via Dunant 3, 21100 Varese, Italy
| | | | | | | |
Collapse
|
17
|
Honório KM, Garratt RC, Polikarpov I, Andricopulo AD. 3D QSAR comparative molecular field analysis on nonsteroidal farnesoid X receptor activators. J Mol Graph Model 2006; 25:921-7. [PMID: 17055759 DOI: 10.1016/j.jmgm.2006.09.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2006] [Revised: 09/08/2006] [Accepted: 09/10/2006] [Indexed: 11/23/2022]
Abstract
Three-dimensional quantitative structure-activity relationships (3D QSAR) were performed for a series of farnesoid X receptor activators using comparative molecular field analysis (CoMFA). A training set containing 77 compounds served to establish the models. The best statistical results among all models were obtained with region focusing weighted by a S.D. x coefficient values of 0.8 and a grid spacing of 1.0 (r2=0.963, SEE=0.097; q2=0.742, SEP=0.255). The model was used to predict the potency of 20 test set compounds that were not included in the training set, and the predicted values were in good agreement with the experimental results. The final CoMFA model along with the information obtained from 3D contour maps should be useful for the design of novel FXR ligands having improved potency.
Collapse
Affiliation(s)
- Káthia M Honório
- Laboratório de Química Medicinal e Computacional, Centro de Biotecnologia Molecular Estrutural, Instituto de Física de São Carlos, Universidade de São Paulo, Av. Trabalhador São-Carlense, 400, 13560-970 São Carlos, SP, Brazil
| | | | | | | |
Collapse
|
18
|
Saliner AG, Netzeva TI, Worth AP. Prediction of estrogenicity: validation of a classification model. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2006; 17:195-223. [PMID: 16644558 DOI: 10.1080/10659360600636022] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
(Q)SAR models can be used to reduce animal testing as well as to minimise the testing costs. In particular, classification models have been widely used for estimating endpoints with binary activity. The aim of the present study was to develop and validate a classification-based quantitative structure-activity relationship (QSAR) model for endocrine disruption, based on interpretable mechanistic descriptors related to estrogenic gene activation. The model predicts the presence or absence of estrogenic activity according to a pre-defined cut-off in activity as determined in a recombinant yeast assay. The experimental data was obtained from the literature. A two-descriptor classification model was developed that has the form of a decision tree. The predictivity of the model was evaluated by using an external test set and by taking into account the limitations associated with the applicability domain (AD) of the model. The AD was determined as coverage of the model descriptor space. After removing the compounds present in the training set and the compounds outside of the AD, the overall accuracy of classification of the test chemicals was used to assess the predictivity of the model. In addition, the model was shown to meet the OECD Principles for (Q)SAR Validation, making it potentially useful for regulatory purposes.
Collapse
Affiliation(s)
- A Gallegos Saliner
- European Chemicals Bureau (ECB), Institute for Health and Consumer Protection, Joint Research Centre, European Commission, 21020 Ispra (VA), Italy.
| | | | | |
Collapse
|
19
|
Honorio KM, Garratt RC, Andricopulo AD. Hologram quantitative structure–activity relationships for a series of farnesoid X receptor activators. Bioorg Med Chem Lett 2005; 15:3119-25. [PMID: 15893927 DOI: 10.1016/j.bmcl.2005.04.017] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2005] [Revised: 04/07/2005] [Accepted: 04/11/2005] [Indexed: 10/25/2022]
Abstract
The farnesoid X receptor (FXR) is an attractive drug target for the development of novel therapeutic agents for the treatment of dyslipidemia and cholestasis. Hologram quantitative structure-activity relationship (HQSAR) studies were conducted on a series of potent FXR activators originated from natural product-like libraries. A training set containing 82 compounds served to establish the models. The best HQSAR model was generated using atoms, bonds, connections, chirality, and donor and acceptor as fragment distinction and fragment size default (4-7) with six components. The model was used to predict the potency of 20 test set compounds that were not included in the training set, and the predicted values were in good agreement with the experimental results. The final HQSAR model and the information obtained from HQSAR 2D contribution maps should be useful for the design of novel FXR ligands having improved potency.
Collapse
Affiliation(s)
- Kathia M Honorio
- Laboratório de Química Medicinal e Computacional, Centro de Biotecnologia Molecular Estrutural, Instituto de Física de São Carlos, Universidade de São Paulo, Av. Trabalhador São-Carlense 400, 13560-970 São Carlos-SP, Brazil
| | | | | |
Collapse
|
20
|
Asikainen AH, Ruuskanen J, Tuppurainen KA. Consensus kNN QSAR: a versatile method for predicting the estrogenic activity of organic compounds in silico. A comparative study with five estrogen receptors and a large, diverse set of ligands. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2004; 38:6724-6729. [PMID: 15669333 DOI: 10.1021/es049665h] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Quantitative structure-activity relationships (QSARs) have proved increasingly useful for predicting the biological activities of molecules (e.g., their binding affinities to different receptors) and can be used in environmental chemistry as a preliminary tool for screening the activities of untested molecules, producing valuable information on which compounds should be tested more thoroughly with experimental affinity assays or in animals. The predictive ability of the consensus kNN QSAR method is corroborated here using a diverse set of 245 compounds, which have been assayed for their relative binding affinities to the estrogen receptor of four species: human (ER alpha and ER beta), calf, mouse, and rat. Leave-one-out cross-validation (LOO-CV) and gamma-randomization tests were applied to the QSAR models for internal validation, and separate training and test sets were used for external validation. The internal predictive abilities of the consensus models for all five data sets were convincing, with cross-validated correlation coefficients (LOO-CV q2 values) varying from 0.69 (human ER beta data) to 0.79 (human ER alpha data). The external predictive abilities were also encouraging, as the predictive r2 scores (pr-r2 values) varied from 0.62 (human ER beta data) to 0.77 (calf and mouse data). The results indicate that consensus kNN QSAR is a feasible method for rapid screening of the estrogenic activity of organic compounds.
Collapse
Affiliation(s)
- Arja H Asikainen
- Department of Environmental Sciences, University of Kuopio, P.O. Box 1627, FIN-70211 Kuopio, Finland.
| | | | | |
Collapse
|
21
|
Tong W, Xie Q, Hong H, Shi L, Fang H, Perkins R. Assessment of prediction confidence and domain extrapolation of two structure-activity relationship models for predicting estrogen receptor binding activity. ENVIRONMENTAL HEALTH PERSPECTIVES 2004; 112:1249-1254. [PMID: 15345371 PMCID: PMC1277118 DOI: 10.1289/txg.7125] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2001] [Accepted: 07/15/2004] [Indexed: 05/24/2023]
Abstract
Quantitative structure-activity relationship (QSAR) methods have been widely applied in drug discovery, lead optimization, toxicity prediction, and regulatory decisions. Despite major advances in algorithms and software, QSAR models have inherent limitations associated with a size and chemical-structure diversity of the training set, experimental error, and many characteristics of structure representation and correlation algorithms. Whereas excellent fit to the training data may be readily attainable, often models fail to predict accurately chemicals that are outside their domain of applicability. A QSAR's utility and, in the case of regulatory decisions, justification for usage increasingly depend on the ability to quantify a model's potential for predicting unknown chemicals with some known degree of certainty. It is never possible to predict an unknown chemical with absolute certainty. Here we report on two QSAR models based on different data sets for classification of chemicals according to their ability to bind to the estrogen receptor. The models were developed by using a novel QSAR method, Decision Forest, which combines the results of multiple heterogeneous but comparable Decision Tree models to produce a consensus prediction. We used an extensive cross-validation process to define an applicability domain for model predictions based on two quantitative measures: prediction confidence and domain extrapolation. Together, these measures quantify the accuracy of each prediction within and outside of the training domain. Despite being based on large and diverse training sets, both QSAR models had poor accuracy for chemicals within the domain of low confidence, whereas good accuracy was obtained for those within the domain of high confidence. For prediction in the high confidence domain, accuracy was inversely proportional to the degree of domain extrapolation. The model with a larger training set of 1,092, compared with 232 for the other, was more accurate in predicting chemicals at larger domain extrapolation, and could be particularly useful for rapidly prioritizing potential endocrine disruptors from large chemical universe.
Collapse
Affiliation(s)
- Weida Tong
- Center for Toxicoinformatics, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas 72079, USA.
| | | | | | | | | | | |
Collapse
|
22
|
Ai N, DeLisle RK, Yu SJ, Welsh WJ. Computational models for predicting the binding affinities of ligands for the wild-type androgen receptor and a mutated variant associated with human prostate cancer. Chem Res Toxicol 2004; 16:1652-60. [PMID: 14680380 DOI: 10.1021/tx034168k] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In the present study, values of the binding energy (BE) were calculated for the rat androgen receptor on a data set of 25 steroidal and nonsteroidal compounds for which published values of the observed binding affinity (K(i)) are available. A correlation between BE and pK(i) was evident (r(2) = 0.50) for the entire data set and became more pronounced when the steroids and nonsteroids were plotted separately (r(2) congruent with 0.76). Including BE as an additional descriptor to supplement the default steric-electrostatic descriptors in comparative molecular field analysis dramatically improved the predictive ability of the resulting three-dimensional quantitative structure-activity relationship models. We also demonstrate that the observed loss in ligand specificity between the wild-type (wt) AR and the T877A mutant AR associated with androgen-independent prostate cancer is reflected in decreased BE values (i.e., higher binding affinity) for the antiandrogen pharmaceutical hydroxyflutamide and for several nonandrogenic endogenous steroids, most notably cortisol, corticosterone, 17beta-estradiol, progesterone, and 17alpha-hydroxyprogesterone.
Collapse
Affiliation(s)
- Ni Ai
- Department of Pharmacology, University of Medicine & Dentistry of New Jersey, Robert Wood Johnson Medical School, 675 Hoes Lane, Piscataway, New Jersey 08854, USA
| | | | | | | |
Collapse
|
23
|
Asikainen AH, Ruuskanen J, Tuppurainen KA. Performance of (consensus) kNN QSAR for predicting estrogenic activity in a large diverse set of organic compounds. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2004; 15:19-32. [PMID: 15113066 DOI: 10.1080/1062936032000169642] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A novel method (in the context of quantitative structure-activity relationship (QSAR)) based on the k nearest neighbour (kNN) principle, has recently been introduced for the derivation of predictive structure-activity relationships. Its performance has been tested for estimating the estrogen binding affinity of a diverse set of 142 organic molecules. Highly predictive models have been obtained. Moreover, it has been demonstrated that consensus-type kNN QSAR models, derived from the arithmetic mean of individual QSAR models were statistically robust and provided more accurate predictions than the great majority of the individual QSAR models. Finally, the consensus QSAR method was tested with 3D QSAR and log P data from a widely used steroid benchmark data set.
Collapse
Affiliation(s)
- A H Asikainen
- Department of Environmental Sciences, University of Kuopio, P.O. Box 1627, FIN-70211 Kuopio, Finland
| | | | | |
Collapse
|
24
|
Asikainen A, Ruuskanen J, Tuppurainen K. Spectroscopic QSAR Methods and Self-Organizing Molecular Field Analysis for Relating Molecular Structure and Estrogenic Activity. ACTA ACUST UNITED AC 2003; 43:1974-81. [PMID: 14632448 DOI: 10.1021/ci034110b] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The performance of three "spectroscopic" quantitative structure-activity relationship (QSAR) methods (eigenvalue (EVA), electronic eigenvalue (EEVA), and comparative spectra analysis (CoSA)) for relating molecular structure and estrogenic activity are critically evaluated. The methods were tested with respect to the relative binding affinities (RBA) of a diverse set of 36 estrogens previously examined in detail by the comparative molecular field analysis method. The CoSA method with (13)C chemical shifts appears to provide a predictive QSAR model for this data set. EEVA (i.e., molecular orbital energy in this context) is a borderline case, whereas the performances of EVA (i.e., vibrational normal mode) and CoSA with (1)H shifts are substandard and only semiquantitative. The CoSA method with (13)C chemical shifts provides an alternative and supplement to conventional 3D QSAR methods for rationalizing and predicting the estrogenic activity of molecules. If CoSA is to be applied to large data sets, however, it is desirable that the chemical shifts are available from common databases or, alternatively, that they can be estimated with sufficient accuracy using fast prediction schemes. Calculations of NMR chemical shifts by quantum mechanical methods, as in this case study, seem to be too time-consuming at this moment, but the situation is changing rapidly. An inherent shortcoming common to all spectroscopic QSAR methods is that they cannot take the chirality of molecules into account, at least as formulated at present. Moreover, the symmetry of molecules may cause additional problems. There are three pairs of enantiomers and nine symmetric (C(2) or C(2)(v)) molecules present in the data set, so that the predictive ability of full 3D QSAR methods is expected to be better than that of spectroscopic methods. This is demonstrated with SOMFA (self-organizing molecular field analysis). In general, the use of external test sets with randomized data is encouraged as a validation tool in QSAR studies.
Collapse
Affiliation(s)
- Arja Asikainen
- Department of Environmental Sciences, University of Kuopio, PO Box 1627, FIN-70211, Kuopio, Finland
| | | | | |
Collapse
|
25
|
Hong H, Fang H, Xie Q, Perkins R, Sheehan DM, Tong W. Comparative molecular field analysis (CoMFA) model using a large diverse set of natural, synthetic and environmental chemicals for binding to the androgen receptor. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2003; 14:373-88. [PMID: 14758981 DOI: 10.1080/10629360310001623962] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
A large number of natural, synthetic and environmental chemicals are capable of disrupting the endocrine systems of experimental animals, wildlife and humans. These so-called endocrine disrupting chemicals (EDCs), some mimic the functions of the endogenous androgens, have become a concern to the public health. Androgens play an important role in many physiological processes, including the development and maintenance of male sexual characteristics. A common mechanism for androgen to produce both normal and adverse effects is binding to the androgen receptor (AR). In this study, we used Comparative Molecular Field Analysis (CoMFA), a three-dimensional quantitative structure-activity relationship (3D-QSAR) technique, to examine AR-ligand binding affinities. A CoMFA model with r2 = 0.902 and q2 = 0.571 was developed using a large training data set containing 146 structurally diverse natural, synthetic, and environmental chemicals with a 10(6)-fold range of relative binding affinity (RBA). By comparing the binding characteristics derived from the CoMFA contour map with these observed in a human AR crystal structure, we found that the steric and electrostatic properties encoded in this training data set are necessary and sufficient to describe the RBA of AR ligands. Finally, the CoMFA model was challenged with an external test data set; the predicted results were close to the actual values with average difference of 0.637 logRBA. This study demonstrates the utility of this CoMFA model for real-world use in predicting the AR binding affinities of structurally diverse chemicals over a wide RBA range.
Collapse
Affiliation(s)
- H Hong
- Northrop Grumman Information Technology, Jefferson, AR 72079, USA
| | | | | | | | | | | |
Collapse
|
26
|
Hamblen EL, Cronin MTD, Schultz TW. Estrogenicity and acute toxicity of selected anilines using a recombinant yeast assay. CHEMOSPHERE 2003; 52:1173-1181. [PMID: 12820998 DOI: 10.1016/s0045-6535(03)00333-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Suspected estrogen modulators include industrial organic chemicals (i.e., xenoestrogens), and have been shown to consist of alkylphenols, bisphenols, biphenylols, and some hydroxy-substituted polycyclic aromatic hydrocarbons. The most prominent structural feature identified to be important for estrogenic activity is a polar group capable of donating hydrogen bonds (i.e., hydroxyl) on an aromatic system. The present study was undertaken to explore the estrogenic activity and acute toxicity of chemicals containing a weaker hydrogen bond donor group on aromatic systems, i.e., the amino substituent. There is a great deal of chemical similarity between aromatic amines (anilines) and aromatic alcohols (phenols). The chemicals chosen for the current study contained an amino-substituted benzene ring with hydrophobic constituents varying in size and shape. Thus, 37 substituted aromatic amines were assayed for estrogenic activity EC50 and acute toxicity LC50 using the Saccharomyces cerevisiae recombinant yeast assay. While the EC50 of 17-beta-estradiol occurs at the 10(-10) range, the aniline with the greatest activity had an EC50 of 10(-6) M. Thus, anilines, in general, are capable only of very weak estrogenic activity in this assay. A comparison of estrogenic potency between the present group of anilines and a set of previously tested analogous phenols indicated that anilines are consistently less estrogenic than phenols. A comparison of hazard indices (EC50/LC50) of these chemicals revealed that, for the vast majority of anilines, the EC50 and LC50 were in the same order of magnitude. More specifically, estrogenic activity of para-substituted alkylanilines increases with alkyl group size up to 5 carbons in length, after which the acute toxicity of the larger alkyl-substituents precluded the ability of the compound to induce the estrogenic response.
Collapse
Affiliation(s)
- Elizabeth L Hamblen
- Department of Ecology and Evolutionary Biology, The University of Tennessee, 2407 River Drive, Knoxville, TN 37996-4543, USA
| | | | | |
Collapse
|