1
|
Yang JR, Chen Q, Wang H, Hu XY, Guo YM, Chen JZ. Reliable CA-(Q)SAR generation based on entropy weight optimized by grid search and correction factors. Comput Biol Med 2022; 146:105573. [DOI: 10.1016/j.compbiomed.2022.105573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 03/31/2022] [Accepted: 04/26/2022] [Indexed: 11/03/2022]
|
2
|
Vian M, Raitano G, Roncaglioni A, Benfenati E. In silico model for mutagenicity (Ames test), taking into account metabolism. Mutagenesis 2019; 34:41-48. [PMID: 30715441 DOI: 10.1093/mutage/gey045] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Bacterial reverse mutation test is one of the most common methods used to address genotoxicity. The experimental test is designed to include a step to simulate mammalian metabolism. The most common metabolic activation system is the incubation with S9 fraction prepared from the livers of rodents. Usually, in silico models addressing this endpoint are developed on the basis of an overall call disregarding the fact that the toxic effect was observed before or after metabolic activation. Here, we present a new in silico model to predict mutagenicity as measured by activity in the bacterial reverse mutation test, bearing in mind the role of S9 activation to stimulate metabolism. We applied the software SARpy, which identifies structural alerts associated with the effect. Different rules codified by these structural alerts were found in case of positive or negative mutagenicity, observed in the presence or absence of the S9 fraction. These rules can be used to understand the role of metabolism in mutagenicity better. We also identified a possible association of the results from these models with carcinogenicity.
Collapse
Affiliation(s)
- Matteo Vian
- Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Giuseppe La Masa, Milan, Italy
| | | | - Alessandra Roncaglioni
- Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Giuseppe La Masa, Milan, Italy
| | - Emilio Benfenati
- Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via Giuseppe La Masa, Milan, Italy
| |
Collapse
|
3
|
Li X, Du Z, Wang J, Wu Z, Li W, Liu G, Shen X, Tang Y. In Silico Estimation of Chemical Carcinogenicity with Binary and Ternary Classification Methods. Mol Inform 2015; 34:228-35. [PMID: 27490168 DOI: 10.1002/minf.201400127] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 01/11/2015] [Indexed: 11/07/2022]
Abstract
Carcinogenicity is one of the most concerned properties of chemicals to human health, thus it is important to identify chemical carcinogenicity as early as possible. In this study, 829 diverse compounds with rat carcinogenicity were collected from Carcinogenic Potency Database (CPDB). Using six types of fingerprints to represent the molecules, 30 binary and ternary classification models were generated to predict chemical carcinogenicity by five machine learning methods. The models were evaluated by an external validation set containing 87 chemicals from ISSCAN database. The best binary model was developed by MACCS keys and kNN algorithm with predictive accuracy at 83.91 %, while the best ternary model was also generated by MACCS keys and kNN algorithm with overall accuracy at 80.46 %. Furthermore, the best binary and ternary classification models were used to estimate carcinogenicity of tobacco smoke components containing 2251 compounds. 981 ones were predicted as carcinogens by binary classification model, while 110 compounds were predicted as strong carcinogens and 807 ones as weak carcinogens by ternary classification model. The results indicated that our models would be helpful for prediction of chemical carcinogenicity.
Collapse
Affiliation(s)
- Xiao Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033.,Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, P. R. China
| | - Zheng Du
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033
| | - Jie Wang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033
| | - Zengrui Wu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033
| | - Xu Shen
- Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, P. R. China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, P. R. China phone: +86-21-6425-1052; fax: +86-21-6425-1033. .,Key Laboratory of Cigarette Smoke, Technical Center, Shanghai Tobacco Group Co. Ltd. Shanghai 200082, P. R. China.
| |
Collapse
|
4
|
Rybacka A, Rudén C, Andersson PL. On the Use ofIn SilicoTools for Prioritising Toxicity Testing of the Low-Volume Industrial Chemicals in REACH. Basic Clin Pharmacol Toxicol 2014; 115:77-87. [DOI: 10.1111/bcpt.12193] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Accepted: 01/02/2014] [Indexed: 11/29/2022]
Affiliation(s)
| | - Christina Rudén
- Department of Applied Environmental Science; Stockholm University; Stockholm Sweden
| | | |
Collapse
|
5
|
Cassano A, Raitano G, Mombelli E, Fernández A, Cester J, Roncaglioni A, Benfenati E. Evaluation of QSAR models for the prediction of ames genotoxicity: a retrospective exercise on the chemical substances registered under the EU REACH regulation. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS 2014; 32:273-298. [PMID: 25226221 DOI: 10.1080/10590501.2014.938955] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
We evaluated the performance of seven freely available quantitative structure-activity relationship models predicting Ames genotoxicity thanks to a dataset of chemicals that were registered under the EU Registration, Evaluation, Authorization and Restriction of Chemicals (REACH) regulation. The performance of the models was estimated according to Cooper's statistics and Matthew's Correlation Coefficients (MCC). The Benigni/Bossa rule base originally implemented in Toxtree and re-implemented within the Virtual models for property Evaluation of chemicals within a Global Architecture (VEGA) platform displayed the best performance (accuracy = 92%, sensitivity = 83%, specificity = 93%, MCC = 0.68) indicating that this rule base provides a reliable tool for the identification of genotoxic chemicals. Finally, we elaborated a consensus model that outperformed the accuracy of the individual models.
Collapse
Affiliation(s)
- Antonio Cassano
- a Unité Modèles pour l'Ecotoxicologie et la Toxicologie (METO) , Institut National de l'Environnement Industriel et des Risques (INERIS) , Verneuil en Halatte , France
| | | | | | | | | | | | | |
Collapse
|
6
|
Fioravanzo E, Bassan A, Pavan M, Mostrag-Szlichtyng A, Worth AP. Role of in silico genotoxicity tools in the regulatory assessment of pharmaceutical impurities. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012; 23:257-277. [PMID: 22369620 DOI: 10.1080/1062936x.2012.657236] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The toxicological assessment of genotoxic impurities is important in the regulatory framework for pharmaceuticals. In this context, the application of promising computational methods (e.g. Quantitative Structure-Activity Relationships (QSARs), Structure-Activity Relationships (SARs) and/or expert systems) for the evaluation of genotoxicity is needed, especially when very limited information on impurities is available. To gain an overview of how computational methods are used internationally in the regulatory assessment of pharmaceutical impurities, the current regulatory documents were reviewed. The software recommended in the guidelines (e.g. MCASE, MC4PC, Derek for Windows) or used practically by various regulatory agencies (e.g. US Food and Drug Administration, US and Danish Environmental Protection Agencies), as well as other existing programs were analysed. Both statistically based and knowledge-based (expert system) tools were analysed. The overall conclusions on the available in silico tools for genotoxicity and carcinogenicity prediction are quite optimistic, and the regulatory application of QSAR methods is constantly growing. For regulatory purposes, it is recommended that predictions of genotoxicity/carcinogenicity should be based on a battery of models, combining high-sensitivity models (low rate of false negatives) with high-specificity ones (low rate of false positives) and in vitro assays in an integrated manner.
Collapse
|
7
|
Doktorova TY, Pauwels M, Vinken M, Vanhaecke T, Rogiers V. Opportunities for an alternative integrating testing strategy for carcinogen hazard assessment? Crit Rev Toxicol 2011; 42:91-106. [DOI: 10.3109/10408444.2011.623151] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
8
|
Enoch SJ, Cronin MT, Ellison CM. The Use of a Chemistry-based Profiler for Covalent DNA Binding in the Development of Chemical Categories for Read-across for Genotoxicity. Altern Lab Anim 2011; 39:131-45. [DOI: 10.1177/026119291103900206] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
An important molecular initiating event for genotoxicity is the ability of a compound to bind covalently with DNA. However, not all compounds that can undergo covalent binding mechanisms will result in genotoxicity. One approach to solving this problem, when in silico prediction techniques are being used, is to develop tools that allow chemicals to be grouped into categories based on their ability to bind covalently to DNA. For this analysis to take place, compounds need to be placed within categories where the trend in toxicity can be explained by simple descriptors, such as hydrophobicity. However, this can occur only when the compounds within a category are structurally and mechanistically similar. Chemistry-based profilers have the ability to screen compounds and highlight those with similar structures to a target compound, and are thus likely to act via a similar mechanism of action. Here, examples are reported to highlight how structure-based profilers can be used to form categories and hence fill data gaps. The importance of developing a well-defined and robust category is discussed in terms of both mechanisms of action and structural similarity.
Collapse
Affiliation(s)
- Steven J. Enoch
- School of Pharmacy and Chemistry, Liverpool John Moores University, Liverpool, UK
| | - Mark T.D. Cronin
- School of Pharmacy and Chemistry, Liverpool John Moores University, Liverpool, UK
| | - Claire M. Ellison
- School of Pharmacy and Chemistry, Liverpool John Moores University, Liverpool, UK
| |
Collapse
|
9
|
Devillers J, Mombelli E, Samsera R. Structural alerts for estimating the carcinogenicity of pesticides and biocides. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2011; 22:89-106. [PMID: 21391143 DOI: 10.1080/1062936x.2010.548349] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
More than 20 years ago, Ashby and Tennant showed the interest of structural alerts for the prediction of the carcinogenicity of chemicals. These structural alerts are functional groups or structural features of various sizes that are linked to the level of carcinogenicity of chemicals. Since this pioneering work it has been possible to refine the alerts over time, as more experimental results have become available and additional mechanistic insights have been gained. To date, one of the most advanced lists of structural alerts for evaluating the carcinogenic potential of chemicals is the list proposed by Benigni and Bossa and that is implemented as a rule-based system in Toxtree and in the OECD QSAR Application Toolbox. In order to gain insight into the applicability of this system to the detection of potential carcinogens we screened about 200 pesticides and biocides showing a high structural diversity. Prediction results were compared with experimental data retrieved from an extensive bibliographical review. The prediction correctness was only equal to 60.14%. Attempts were made to analyse the sources of mispredictions.
Collapse
|
10
|
Abstract
IMPORTANCE OF THE FIELD: PubChem is a public molecular information repository, a scientific showcase of the NIH Roadmap Initiative. The PubChem database holds over 27 million records of unique chemical structures of compounds (CID) derived from nearly 70 million substance depositions (SID), and contains more than 449,000 bioassay records with over thousands of in vitro biochemical and cell-based screening bioassays established, with targeting more than 7000 proteins and genes linking to over 1.8 million of substances. AREAS COVERED IN THIS REVIEW: This review builds on recent PubChem-related computational chemistry research reported by other authors while providing readers with an overview of the PubChem database, focusing on its increasing role in cheminformatics, virtual screening and toxicity prediction modeling. WHAT THE READER WILL GAIN: These publicly available datasets in PubChem provide great opportunities for scientists to perform cheminformatics and virtual screening research for computer-aided drug design. However, the high volume and complexity of the datasets, in particular the bioassay-associated false positives/negatives and highly imbalanced datasets in PubChem, also creates major challenges. Several approaches regarding the modeling of PubChem datasets and development of virtual screening models for bioactivity and toxicity predictions are also reviewed. TAKE HOME MESSAGE: Novel data-mining cheminformatics tools and virtual screening algorithms are being developed and used to retrieve, annotate and analyze the large-scale and highly complex PubChem biological screening data for drug design.
Collapse
Affiliation(s)
- Xiang-Qun Xie
- Department of Pharmaceutical Sciences, School of Pharmacy; Drug Discovery Institute/Pittsburgh Molecular Library Screening Center (PMLSC); Pittsburgh Chemical Methodologies & Library Development (PCMLD) Center; Departments of Computational Biology and Structural Biology; University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
11
|
Abstract
Expert systems offer the facility to predict a toxicity endpoint, as well sometimes as additional relevant information, simply by inputting the chemical structure of a compound. There is now a number of expert systems available, mostly on a commercial basis although a few are free to use or download. This chapter discusses nineteen currently available expert systems, and their performances (if known). Published studies of consensus predictions with these expert systems indicate that these give better results than do individual expert systems.
A test set of compounds with Tetrahymena pyriformis toxicities has been run through the two expert systems known to predict these toxicities; the predictions were quite good, with standard errors of prediction of 0.395 and 0.433 log unit. A further test set of compounds with local lymph node assay skin sensitisation data has been run through seven expert systems, and it was found that consensus predictions were better than were those from any individual expert system.
Collapse
Affiliation(s)
- J. C. Dearden
- School of Pharmacy and Chemistry, Liverpool John Moores University Byrom Street Liverpool L3 3AF UK
| |
Collapse
|
12
|
Tanabe K, Lučić B, Amić D, Kurita T, Kaihara M, Onodera N, Suzuki T. Prediction of carcinogenicity for diverse chemicals based on substructure grouping and SVM modeling. Mol Divers 2010; 14:789-802. [PMID: 20186479 DOI: 10.1007/s11030-010-9232-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2009] [Accepted: 02/05/2010] [Indexed: 01/22/2023]
Abstract
The Carcinogenicity Reliability Database (CRDB) was constructed by collecting experimental carcinogenicity data on about 1,500 chemicals from six sources, including IARC, and NTP databases, and then by ranking their reliabilities into six unified categories. A wide variety of 911 organic chemicals were selected from the database for QSAR modeling, and 1,504 kinds of different molecular descriptors were calculated, based on their 3D molecular structures as modeled by the Dragon software. Positive (carcinogenic) and negative (non-carcinogenic) chemicals containing various substructures were counted using atom and functional group count descriptors, and the statistical significance of ratios of positives to negatives was tested for those substructures. Very few were judged to be strongly related to carcinogenicity, among substructures known to be responsible for carcinogens as revealed from biomedical studies. In order to develop QSAR models for the prediction of the carcinogenicities of a wide variety of chemicals with a satisfactory performance level, the relationship between the carcinogenicity data with improved reliability and a subset of significant descriptors selected from 1,504 Dragon descriptors was analyzed with a support vector machine (SVM) method: the classification function (SVC) for weighted data in LIBSVM program was used to classify chemicals into two carcinogenic categories (positive or negative), where weights were set depending on the reliabilities of the carcinogenicity data. The quality and stability of the models presented were tested by performing a dual cross-validation procedure. A single SVM model as the first step was developed for all the 911 chemicals using 250 selected descriptors, achieving an overall accuracy level, i.e., positive and negative correct estimate, of about 70%. In order to improve the accuracy of the final model, the 911 chemicals were classified into 20 mutually overlapping subgroups according to contained substructures, a specific SVM model was optimized for each subgroup, and the predicted carcinogenicities of the 911 chemicals were determined by the majorities of the outputs of the corresponding SVM models. The model developed on the basis of grouping of chemicals into 20 substructures predicts the carcinogenicities of a wide variety of chemicals with a satisfactory overall accuracy of approximately 80%.
Collapse
Affiliation(s)
- Kazutoshi Tanabe
- Neuroscience Research Institute, National Institute of Advanced Industrial Science and Technology, Umezono 1-1-1, Tsukuba, 305-8568, Japan.
| | | | | | | | | | | | | |
Collapse
|
13
|
Arvidson KB, Valerio LG, Diaz M, Chanderbhan RF. In Silico Toxicological Screening of Natural Products. Toxicol Mech Methods 2008; 18:229-42. [DOI: 10.1080/15376510701856991] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
14
|
Marchant CA, Briggs KA, Long A. In Silico Tools for Sharing Data and Knowledge on Toxicity and Metabolism: Derek for Windows, Meteor, and Vitic. Toxicol Mech Methods 2008; 18:177-87. [DOI: 10.1080/15376510701857320] [Citation(s) in RCA: 178] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
15
|
Benigni R, Bossa C. Predictivity and Reliability of QSAR Models: The Case of Mutagens and Carcinogens. Toxicol Mech Methods 2008; 18:137-47. [PMID: 20020910 DOI: 10.1080/15376510701857056] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
16
|
Abstract
A range of good quality, local QSARs for mutagenicity and carcinogenicity have been assessed and challenged for their predictivity in respect to real external test sets (i.e., chemicals never considered by the authors while developing their models). The QSARs for potency (applicable only to toxic chemicals) generated predictions 30-70% correct, whereas the QSARs for discriminating between active and inactive chemicals were 70-100% correct in their external predictions: thus the latter can be used with good reliability for applicative purposes. On the other hand internal, statistical validation methods, which are often assumed to be good diagnostics for predictivity, did not correlate well with the predictivity of the QSARs when challenged in external prediction tests. Nonlocal models for noncongeneric chemicals were considered as well, pointing to the critical role of an adequate definition of the applicability domain.
Collapse
Affiliation(s)
- Romualdo Benigni
- Environment and Health Department, Istituto Superiore di Sanita', Viale Regina Elena 299, 00161 Rome, Italy.
| | | |
Collapse
|
17
|
Utilizing high throughput screening data for predictive toxicology models: protocols and application to MLSCN assays. J Comput Aided Mol Des 2008; 22:367-84. [DOI: 10.1007/s10822-008-9192-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2007] [Accepted: 01/30/2008] [Indexed: 01/27/2023]
|