1
|
Petrescu AM, Ilia G. Molecular docking study to evaluate the carcinogenic potential and mammalian toxicity of thiophosphonate pesticides by cluster and discriminant analysis. ENVIRONMENTAL TOXICOLOGY AND PHARMACOLOGY 2016; 47:62-78. [PMID: 27636985 DOI: 10.1016/j.etap.2016.09.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2016] [Revised: 09/03/2016] [Accepted: 09/06/2016] [Indexed: 06/06/2023]
Abstract
In this paper, the carcinogenic potential and mammalian toxicity on rodents, based on the quantitative relationship models between structure and biological activity (QSAR), were evaluated. The carcinogenicity and acute toxicity were evaluated by docking molecular physicochemical descriptors, on a series of 33 thiophosphonates. These properties, mainly hydrophobicity, electronic distribution, hydrogen bonding characteristics, molecule size and flexibility, and the presence of various pharmacophoric features, influence the behavior of molecule in a living organism, including bioavailability, transport properties, affinity to proteins, reactivity, toxicity, metabolic stability and many others. The model was validated using linear regression methods: principal component analysis (PCA), partial least squares (PLS) and multiple linear regression (MLR); non-linear regression methods: cluster analysis (CA) and discriminant analysis (DA); and neural network analysis: probabilistic neural network (PNN), identifying the best predictor.
Collapse
Affiliation(s)
- Alina-Maria Petrescu
- West University of Timisoara, Faculty of Chemistry, Biology, Geography, Dept. of Biology-Chemistry, 16 Pestalozzi Street, 300115 Timisoara, Romania
| | - Gheorghe Ilia
- West University of Timisoara, Faculty of Chemistry, Biology, Geography, Dept. of Biology-Chemistry, 16 Pestalozzi Street, 300115 Timisoara, Romania; Institute of Chemistry Timisoara of Romanian Academy, 24 Mihai Viteazu Bvd., 300223 Timisoara, Romania.
| |
Collapse
|
2
|
Hong H, Rua D, Sakkiah S, Selvaraj C, Ge W, Tong W. Consensus Modeling for Prediction of Estrogenic Activity of Ingredients Commonly Used in Sunscreen Products. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2016; 13:ijerph13100958. [PMID: 27690075 PMCID: PMC5086697 DOI: 10.3390/ijerph13100958] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Revised: 09/16/2016] [Accepted: 09/20/2016] [Indexed: 11/16/2022]
Abstract
Sunscreen products are predominantly regulated as over-the-counter (OTC) drugs by the US FDA. The "active" ingredients function as ultraviolet filters. Once a sunscreen product is generally recognized as safe and effective (GRASE) via an OTC drug review process, new formulations using these ingredients do not require FDA review and approval, however, the majority of ingredients have never been tested to uncover any potential endocrine activity and their ability to interact with the estrogen receptor (ER) is unknown, despite the fact that this is a very extensively studied target related to endocrine activity. Consequently, we have developed an in silico model to prioritize single ingredient estrogen receptor activity for use when actual animal data are inadequate, equivocal, or absent. It relies on consensus modeling to qualitatively and quantitatively predict ER binding activity. As proof of concept, the model was applied to ingredients commonly used in sunscreen products worldwide and a few reference chemicals. Of the 32 chemicals with unknown ER binding activity that were evaluated, seven were predicted to be active estrogenic compounds. Five of the seven were confirmed by the published data. Further experimental data is needed to confirm the other two predictions.
Collapse
Affiliation(s)
- Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| | - Diego Rua
- Division of Nonprescription Drug Products, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, 10903 New Hampshire Avenue, Silver Spring, MD 20993, USA.
| | - Sugunadevi Sakkiah
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| | - Chandrabose Selvaraj
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| | - Weigong Ge
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR 72079, USA.
| |
Collapse
|
3
|
Experimental Data Extraction and in Silico Prediction of the Estrogenic Activity of Renewable Replacements for Bisphenol A. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2016; 13:ijerph13070705. [PMID: 27420082 PMCID: PMC4962246 DOI: 10.3390/ijerph13070705] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 07/01/2016] [Accepted: 07/05/2016] [Indexed: 01/23/2023]
Abstract
Bisphenol A (BPA) is a ubiquitous compound used in polymer manufacturing for a wide array of applications; however, increasing evidence has shown that BPA causes significant endocrine disruption and this has raised public concerns over safety and exposure limits. The use of renewable materials as polymer feedstocks provides an opportunity to develop replacement compounds for BPA that are sustainable and exhibit unique properties due to their diverse structures. As new bio-based materials are developed and tested, it is important to consider the impacts of both monomers and polymers on human health. Molecular docking simulations using the Estrogenic Activity Database in conjunction with the decision forest were performed as part of a two-tier in silico model to predict the activity of 29 bio-based platform chemicals in the estrogen receptor-α (ERα). Fifteen of the candidates were predicted as ER binders and fifteen as non-binders. Gaining insight into the estrogenic activity of the bio-based BPA replacements aids in the sustainable development of new polymeric materials.
Collapse
|
4
|
Abstract
Quantitative structure-activity relationship (QSAR) has been used in the scientific research community for many decades and applied to drug discovery and development in the industry. QSAR technologies are advancing fast and attracting possible applications in regulatory science. To facilitate the development of reliable QSAR models, the FDA had invested a lot of efforts in constructing chemical databases with a variety of efficacy and safety endpoint data, as well as in the development of computational algorithms. In this chapter, we briefly describe some of the often used databases developed at the FDA such as EDKB (Endocrine Disruptor Knowledge Base), EADB (Estrogenic Activity Database), LTKB (Liver Toxicity Knowledge Base), and CERES (Chemical Evaluation and Risk Estimation System) and the technologies adopted by the agency such as Mold(2) program for calculation of a large and diverse set of molecular descriptors and decision forest algorithm for QSAR model development. We also summarize some QSAR models that have been developed for safety evaluation of the FDA-regulated products.
Collapse
Affiliation(s)
- Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR, 72079, USA.
| | - Minjun Chen
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR, 72079, USA
| | - Hui Wen Ng
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR, 72079, USA
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, 3900 NCTR Road, Jefferson, AR, 72079, USA
| |
Collapse
|
5
|
Ng HW, Doughty SW, Luo H, Ye H, Ge W, Tong W, Hong H. Development and Validation of Decision Forest Model for Estrogen Receptor Binding Prediction of Chemicals Using Large Data Sets. Chem Res Toxicol 2015; 28:2343-51. [PMID: 26524122 DOI: 10.1021/acs.chemrestox.5b00358] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Some chemicals in the environment possess the potential to interact with the endocrine system in the human body. Multiple receptors are involved in the endocrine system; estrogen receptor α (ERα) plays very important roles in endocrine activity and is the most studied receptor. Understanding and predicting estrogenic activity of chemicals facilitates the evaluation of their endocrine activity. Hence, we have developed a decision forest classification model to predict chemical binding to ERα using a large training data set of 3308 chemicals obtained from the U.S. Food and Drug Administration's Estrogenic Activity Database. We tested the model using cross validations and external data sets of 1641 chemicals obtained from the U.S. Environmental Protection Agency's ToxCast project. The model showed good performance in both internal (92% accuracy) and external validations (∼ 70-89% relative balanced accuracies), where the latter involved the validations of the model across different ER pathway-related assays in ToxCast. The important features that contribute to the prediction ability of the model were identified through informative descriptor analysis and were related to current knowledge of ER binding. Prediction confidence analysis revealed that the model had both high prediction confidence and accuracy for most predicted chemicals. The results demonstrated that the model constructed based on the large training data set is more accurate and robust for predicting ER binding of chemicals than the published models that have been developed using much smaller data sets. The model could be useful for the evaluation of ERα-mediated endocrine activity potential of environmental chemicals.
Collapse
Affiliation(s)
- Hui Wen Ng
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Stephen W Doughty
- School of Pharmacy, University of Nottingham Malaysia Campus , Jalan Broga, 43500 Semenyih, Selangor, Malaysia
| | - Heng Luo
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Hao Ye
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Weigong Ge
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| | - Huixiao Hong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration , 3900 NCTR Road, Jefferson, Arkansas 72079, United States
| |
Collapse
|
6
|
Ng HW, Zhang W, Shu M, Luo H, Ge W, Perkins R, Tong W, Hong H. Competitive molecular docking approach for predicting estrogen receptor subtype α agonists and antagonists. BMC Bioinformatics 2014; 15 Suppl 11:S4. [PMID: 25349983 PMCID: PMC4251048 DOI: 10.1186/1471-2105-15-s11-s4] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Background Endocrine disrupting chemicals (EDCs) are exogenous compounds that interfere with the endocrine system of vertebrates, often through direct or indirect interactions with nuclear receptor proteins. Estrogen receptors (ERs) are particularly important protein targets and many EDCs are ER binders, capable of altering normal homeostatic transcription and signaling pathways. An estrogenic xenobiotic can bind ER as either an agonist or antagonist to increase or inhibit transcription, respectively. The receptor conformations in the complexes of ER bound with agonists and antagonists are different and dependent on interactions with co-regulator proteins that vary across tissue type. Assessment of chemical endocrine disruption potential depends not only on binding affinity to ERs, but also on changes that may alter the receptor conformation and its ability to subsequently bind DNA response elements and initiate transcription. Using both agonist and antagonist conformations of the ERα, we developed an in silico approach that can be used to differentiate agonist versus antagonist status of potential binders. Methods The approach combined separate molecular docking models for ER agonist and antagonist conformations. The ability of this approach to differentiate agonists and antagonists was first evaluated using true agonists and antagonists extracted from the crystal structures available in the protein data bank (PDB), and then further validated using a larger set of ligands from the literature. The usefulness of the approach was demonstrated with enrichment analysis in data sets with a large number of decoy ligands. Results The performance of individual agonist and antagonist docking models was found comparable to similar models in the literature. When combined in a competitive docking approach, they provided the ability to discriminate agonists from antagonists with good accuracy, as well as the ability to efficiently select true agonists and antagonists from decoys during enrichment analysis. Conclusion This approach enables evaluation of potential ER biological function changes caused by chemicals bound to the receptor which, in turn, allows the assessment of a chemical's endocrine disrupting potential. The approach can be used not only by regulatory authorities to perform risk assessments on potential EDCs but also by the industry in drug discovery projects to screen for potential agonists and antagonists.
Collapse
|
7
|
Gramatica P. Chemometric Methods and Theoretical Molecular Descriptors in Predictive QSAR Modeling of the Environmental Behavior of Organic Pollutants. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2010. [DOI: 10.1007/978-1-4020-9783-6_12] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
8
|
Ghosh P, Bagchi M. Comparative QSAR studies of nitrofuranyl amide derivatives using theoretical structural properties. MOLECULAR SIMULATION 2009. [DOI: 10.1080/08927020903033141] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
9
|
Stenberg M, Linusson A, Tysklind M, Andersson PL. A multivariate chemical map of industrial chemicals--assessment of various protocols for identification of chemicals of potential concern. CHEMOSPHERE 2009; 76:878-884. [PMID: 19515399 DOI: 10.1016/j.chemosphere.2009.05.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2009] [Revised: 05/07/2009] [Accepted: 05/13/2009] [Indexed: 05/27/2023]
Abstract
In present study the Industrial chemical map was created, and investigated. Molecular descriptors were calculated for 56072 organic substances from the European inventory of existing commercial chemical substances (EINECS). The resulting multivariate dataset was subjected to principal component analysis (PCA), giving five principal components, mainly reflecting size, hydrophobicity, flexibility, halogenation and electronical properties. It is these five PCs that form the basis of the map of organic, industrial chemicals, the Industrial chemical map. The similarities and diversity in chemical characteristics of the substances in relation to their persistence (P), bioaccumulation (B) and long-range transport potential were then examined, by superimposing five sets of entries obtained from other relevant databases onto the Industrial chemical map. These sets displayed very similar diversity patterns in the map, although with a spread in all five PC vectors. Substances listed by the United Nations Environment Program as persistent organic pollutants (UNEP POPs) were on the other hand clearly grouped with respect to each of the five PCs. Illustrating similarities and differences in chemical properties are one of the strengths of the multivariate data analysis method, and to be able to make predictions of, and investigate new chemicals. Further, the results demonstrate that non-testing methods as read-across, based on molecular similarities, can reduce the requirements to test industrial chemicals, provided that they are applied carefully, in combination with sound chemical knowledge.
Collapse
Affiliation(s)
- Mia Stenberg
- Department of Chemistry, Umeå University, SE-901 87 Umeå, Sweden.
| | | | | | | |
Collapse
|
10
|
Ahlers J, Stock F, Werschkun B. Integrated testing and intelligent assessment-new challenges under REACH. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2008; 15:565-572. [PMID: 18818964 DOI: 10.1007/s11356-008-0043-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2008] [Accepted: 09/10/2008] [Indexed: 05/26/2023]
Abstract
BACKGROUND, AIM AND SCOPE Due to a number of drawbacks associated with the previous regime for the assessment of new and existing chemicals, the European Union established a new regulation concerning the registration, evaluation, authorisation and restriction of chemicals (REACH). All relevant industrial chemicals must now be assessed. Instead of the authorities, industry itself is responsible for the risk assessment. To achieve better and more efficient assessments while reducing animal testing, all information-standard, non-standard and non-testing-has to be used in an integrated manner. To meet these challenges, the current technical guidance documents for risk assessment of new and existing chemicals had to be updated and extended considerably. This was done by experts in a number of REACH Implementation Projects. This paper presents the most relevant results of the expert Endpoint Working Group on Aquatic Toxicity in order to illustrate the change of paradigm in the future assessment of hazards to the aquatic environment by chemical substances. MAIN FEATURES AND CHALLENGES REACH sets certain minimum data requirements in order to achieve a high level of protection for human health and the environment. It encourages the assessor to use alternative information instead of or in addition to standard one. This information has to be equivalent to the standard information requirement and adequate to draw overall conclusions with respect to the regulatory endpoints classification and labelling, persistent, bioaccumulative and toxic (PBT) assessment and predicted no-effect concentrations (PNEC) derivation. The main task of the expert working group was to develop guidance on how to evaluate the toxicity of a substance based on integration of information from different sources and of various degrees of uncertainty in a weight of evidence approach. INTEGRATED TESTING AND INTELLIGENT ASSESSMENT In order to verify the equivalence and adequacy of different types of information, a flexible sequence of steps was proposed, covering characterisation of the substance, analysis of modes of action, identification of possible analogues, evaluation of existing in vivo and in vitro testing data as well as of QSAR results. Finally, all available data from the different steps have to be integrated to come to an overall conclusion on the toxicity of the substance. This weight of evidence approach is the basis for the development of integrated testing strategies (ITS), in that the available evidence can help to determine subsequent testing steps and is essential for an optimal assessment. Its flexibility helps to meet the different requirements for drawing conclusions on the endpoints classification and labelling, PNEC derivation as well as PBT assessment. The integration of all kinds of additional information in a multi-criteria assessment reduces the uncertainties involved with extrapolation to the ecosystem level. The weight of evidence approach is illustrated by practical examples. CONCLUSIONS AND PERSPECTIVES REACH leads to higher challenges in order to make sound decisions with fewer resources, i.e. to move away from extensive standard testing to an intelligent substance-tailored approach. Expert judgement and integrated thinking are key elements of the weight of evidence concept and ITS, potentially leading to better risk assessments. Important sub-lethal effects such as endocrine disruption, which are not covered by the current procedure, can be considered. Conclusions have to be fully substantiated: Risk communication will be an important aspect of future assessments.
Collapse
Affiliation(s)
- Jan Ahlers
- Umweltbundesamt, Ahrenshooper Zeile 1A, 14129 Berlin, Germany.
| | | | | |
Collapse
|
11
|
Progress and perspectives of quantitative structure-activity relationships used for ecological risk assessment of toxic organic compounds. ACTA ACUST UNITED AC 2008. [DOI: 10.1007/s11426-008-0076-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
12
|
QSAR study on estrogenic activity of structurally diverse compounds using generalized regression neural network. ACTA ACUST UNITED AC 2008. [DOI: 10.1007/s11426-008-0070-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
13
|
Cronin M, Worth A. (Q)SARs for Predicting Effects Relating to Reproductive Toxicity. ACTA ACUST UNITED AC 2008. [DOI: 10.1002/qsar.200710118] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
14
|
Lo Piparo E, Smiesko M, Mazzatorta P, Benfenati E, Idinger J, Blümel S. Preliminary analysis of toxicity of benzoxazinones and their metabolites for folsomia Candida. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2006; 54:1099-104. [PMID: 16478222 DOI: 10.1021/jf050916v] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
The overall objective of this study was to explore the toxicity of benzoxazinone allelochemicals and their metabolites to Folsomia candida (Collembola: Isotomidae) (Willem, 1902). Experimental tests showed transformation products to have more pronounced toxicity than parent compounds. The underlying relationship between the chemical structure and toxicity was then studied using three-dimensional QSAR approaches, and results highlighted the role of the steric contribution.
Collapse
Affiliation(s)
- Elena Lo Piparo
- Istituto di Ricerche Farmacologiche "Mario Negri" Milano, Via Eritrea 62, 20157 Milano, Italy.
| | | | | | | | | | | |
Collapse
|
15
|
Tong W, Xie Q, Hong H, Shi L, Fang H, Perkins R. Assessment of prediction confidence and domain extrapolation of two structure-activity relationship models for predicting estrogen receptor binding activity. ENVIRONMENTAL HEALTH PERSPECTIVES 2004; 112:1249-1254. [PMID: 15345371 PMCID: PMC1277118 DOI: 10.1289/txg.7125] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2001] [Accepted: 07/15/2004] [Indexed: 05/24/2023]
Abstract
Quantitative structure-activity relationship (QSAR) methods have been widely applied in drug discovery, lead optimization, toxicity prediction, and regulatory decisions. Despite major advances in algorithms and software, QSAR models have inherent limitations associated with a size and chemical-structure diversity of the training set, experimental error, and many characteristics of structure representation and correlation algorithms. Whereas excellent fit to the training data may be readily attainable, often models fail to predict accurately chemicals that are outside their domain of applicability. A QSAR's utility and, in the case of regulatory decisions, justification for usage increasingly depend on the ability to quantify a model's potential for predicting unknown chemicals with some known degree of certainty. It is never possible to predict an unknown chemical with absolute certainty. Here we report on two QSAR models based on different data sets for classification of chemicals according to their ability to bind to the estrogen receptor. The models were developed by using a novel QSAR method, Decision Forest, which combines the results of multiple heterogeneous but comparable Decision Tree models to produce a consensus prediction. We used an extensive cross-validation process to define an applicability domain for model predictions based on two quantitative measures: prediction confidence and domain extrapolation. Together, these measures quantify the accuracy of each prediction within and outside of the training domain. Despite being based on large and diverse training sets, both QSAR models had poor accuracy for chemicals within the domain of low confidence, whereas good accuracy was obtained for those within the domain of high confidence. For prediction in the high confidence domain, accuracy was inversely proportional to the degree of domain extrapolation. The model with a larger training set of 1,092, compared with 232 for the other, was more accurate in predicting chemicals at larger domain extrapolation, and could be particularly useful for rapidly prioritizing potential endocrine disruptors from large chemical universe.
Collapse
Affiliation(s)
- Weida Tong
- Center for Toxicoinformatics, National Center for Toxicological Research, Food and Drug Administration, Jefferson, Arkansas 72079, USA.
| | | | | | | | | | | |
Collapse
|