1
|
Ghosh V, Bhattacharjee A, Kumar A, Ojha PK. q-RASTR modelling for prediction of diverse toxic chemicals towards T. pyriformis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024; 35:11-30. [PMID: 38193248 DOI: 10.1080/1062936x.2023.2298452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 12/16/2023] [Indexed: 01/10/2024]
Abstract
A series of diverse organic compounds impose serious detrimental effects on the health of living organisms and the environment. Determination of the structural aspects of compounds that impart toxicity and evaluation of the same is crucial before public usage. The present study aims to determine the structural characteristics of compounds for Tetrahymena pyriformis toxicity using the q-RASTR (Quantitative Read Across Structure-Toxicity Relationship) model. It was developed using RASTR and 2-D descriptors for a dataset of 1792 compounds with defined endpoint (pIGC50) against a model organism, T. pyriformis. For the current study, the whole dataset was divided based on activity/property into the training and test sets, and the q-RASTR model was developed employing six descriptors (three latent variables) having r2, Q2F1 and Q2 values of 0.739, 0.767, and 0.735, respectively. The generated model was thoroughly validated using internationally recognized internal and external validation criteria to assess the model's dependability and predictability. It was highlighted that high molecular weight, aromatic hydroxyls, nitrogen, double bonds, and hydrophobicity increase the toxicity of organic compounds. The current study demonstrates the applicability of the RASTR algorithm in QSTR model development for the prediction of toxic chemicals (pIGC50) towards T. pyriformis.
Collapse
Affiliation(s)
- V Ghosh
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - A Bhattacharjee
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - A Kumar
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - P K Ojha
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| |
Collapse
|
2
|
Fang Z, Yu X, Zeng Q. Random forest algorithm-based accurate prediction of chemical toxicity to Tetrahymena pyriformis. Toxicology 2022; 480:153325. [PMID: 36115645 DOI: 10.1016/j.tox.2022.153325] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 09/09/2022] [Accepted: 09/13/2022] [Indexed: 12/01/2022]
Abstract
The random forest (RF) algorithm, together with ten Dragon descriptors, was used to develop a quantitative structure-toxicity/activity relationship (QSTR/QSAR) model for a larger data set of 1792 chemical toxicity pIGC50 towards Tetrahymena pyriformis. The optimal RF (ntree =300 and mtry =3) model yielded root mean square (rms) errors of 0.261 for the training set (1434 chemicals) and 0.348 for the test set (358 chemicals). Compared with other QSTR models reported in the literature, the optimal RF model in this paper is more accurate. The feasibility of applying the RF algorithm to predict chemical toxicity pIGC50 towards Tetrahymena pyriformis has been verified.
Collapse
Affiliation(s)
- Zhengjun Fang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan 411104, China
| | - Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan 411104, China.
| | - Qun Zeng
- Department of Neurosurgery, Xiangtan Central Hospital, Xiangtan, Hunan 411100, China
| |
Collapse
|
3
|
Ali A. Development of antidiabetic drugs from benzamide derivatives as glucokinase activator: A computational approach. Saudi J Biol Sci 2022; 29:3313-3325. [PMID: 35844378 PMCID: PMC9280248 DOI: 10.1016/j.sjbs.2022.01.058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Revised: 01/27/2022] [Accepted: 01/30/2022] [Indexed: 11/24/2022] Open
Abstract
Hyperglycemia is a condition known for the impairment of insulin secretion and is responsible for diabetes mellitus. Various small molecule inhibitors have been discovered as glucokinase activators. Recent studies on benzamide derivatives showed their importance in the treatment of diabetes as glucokinase activator. The present manuscript showed a computation study on benzamide derivatives to help in the production of potent glucokinase activators. In the present study, pharmacophore development, 3D-QSAR, and docking studies were performed on benzamide derivatives to find out the important features required for the development of a potential glucokinase activator. The generated pharmacophore hypothesis ADRR_1 consisted of essential features required for the activity. The resultant statistical data showed high significant values with R2 > 0.99; 0.98 for the training set and Q2 > 0.52; 0.71 for test set based on atom-based and field-based models, respectively. The potent compound 15b of the series showed a good docking score via binding with different amino acid residues such as (NH…ARG63), (SO2…ARG250, THR65), and π-π staking with (phenyl……TYR214). The virtual screening study used 3563 compounds from ZINC database and screened hit compound ZINC08974524, binds with similar amino acids as shown by compound 15b and crystal ligand with docking scores SP (-11.17 kcal/mol) and XP (-8.43 kcal/mol). Compounds were further evaluated by ADME and MMGBSA parameters. Ligands and ZINC hits showed no violation of Lipinski rules. All the screened compounds showed good synthetic accessibility. The present study may be used by researchers for the development of novel benzamide derivatives as glucokinase activator.
Collapse
|
4
|
Yu X. Prediction of chemical toxicity to Tetrahymena pyriformis with four-descriptor models. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2020; 190:110146. [PMID: 31923753 DOI: 10.1016/j.ecoenv.2019.110146] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Revised: 12/27/2019] [Accepted: 12/28/2019] [Indexed: 06/10/2023]
Abstract
A quantitative structure-toxicity relationship (QSTR) model based on four descriptors was successfully developed for 1163 chemical toxicants against Tetrahymena pyriformis by applying general regression neural network (GRNN). The training set consisting of 600 organic compounds was used to train GRNN models that were evaluated with the test set of 563 compounds. For the optimal GRNN model, the training set possesses the coefficient of determination R2 of 0.86 and root mean square (rms) error of 0.41, and the test set has R2 of 0.80 and rms of 0.41. Investigated results indicate that the optimal GRNN model is accurate, although the GRNN model has only four descriptor and more samples in the test set.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Chemistry and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| |
Collapse
|
5
|
Ask the experts: computational chemistry. Future Med Chem 2018; 10:1521-1524. [DOI: 10.4155/fmc-2018-0176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
6
|
An Investigation on the Quantitative Structure-Activity Relationships of the Anti-Inflammatory Activity of Diterpenoid Alkaloids. Molecules 2017; 22:molecules22030363. [PMID: 28264454 PMCID: PMC6155234 DOI: 10.3390/molecules22030363] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2016] [Revised: 02/21/2017] [Accepted: 02/23/2017] [Indexed: 12/18/2022] Open
Abstract
Diterpenoid alkaloids are extracted from plants. These compounds have broad biological activities, including effects on the cardiovascular system, anti-inflammatory and analgesic actions, and anti-tumor activity. The anti-inflammatory activity was determined by carrageenan-induced rat paw edema and experimental trauma in rats. The number of studies focused on the determination, quantitation and pharmacological properties of these alkaloids has increased dramatically during the past few years. In this work we built a dataset composed of 15 diterpenoid alkaloid compounds with diverse structures, of which 11 compounds were included in the training set and the remaining compounds were included in the test set. The quantitative chemistry parameters of the 15 diterpenoid alkaloids compound were calculated using the HyperChem software, and the quantitative structure-activity relationship (QSAR) of these diterpenoid alkaloid compounds were assessed in an anti-inflammation model based on half maximal effective concentration (EC50) measurements obtained from rat paw edema data. The QSAR prediction model is as follows: log ( E C 50 ) = - 0.0260 × SAA + 0.0086 × SAG + 0.0011 × VOL - 0.0641 × HE - 0.2628 × LogP - 0.5594 × REF - 0.2211 × POL - 0.1964 × MASS + 0.088 × BE + 0.1398 × HF (R² = 0.981, Q² = 0.92). The validated consensus EC50 for the QSAR model, developed from the rat paw edema anti-inflammation model used in this study, indicate that this model was capable of effective prediction and can be used as a reliable computational predictor of diterpenoid alkaloid activity.
Collapse
|
7
|
Tong J, Li L, Bai M, Li K. A New Descriptor of Amino Acids-SVGER and its Applications in Peptide QSAR. Mol Inform 2016; 36. [PMID: 27739658 DOI: 10.1002/minf.201501023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Accepted: 09/27/2016] [Indexed: 11/10/2022]
Abstract
In the study of peptide quantitative structure activity relationship (QSAR), a new descriptor of amino acids (SVGER) was calculated. It was applied in two peptides which are angiotensin converting enzyme inhibitors and bitter tasting threshold of di-peptide. QSAR models were built by stepwise multiple regression-multiple linear regression (SMR-MLR) and stepwise multiple regression-partial least square regression (SMR-PLS). In the SMR-MLR models for angiotensin converting enzyme inhibitors, the squared cross-validation correlation coefficient (QLOO2 ) was 0.907, squared correlation coefficient between predicted and observed activities (Rcum2 ) was 0.977 and external multiple correlation coefficient (Qext2 ) was 0.867. The corresponding data for the bitter tasting threshold of di-peptide were 0.802, 0.966, 0.719. While in the SMR-PLS model, QLOO2 , Rcum2 and Qext2 were 0.804, 0.915, 0.858 for angiotensin converting enzyme inhibitors and 0.782, 0.881, 0.747 for bitter tasting threshold of di-peptide. Our results showed that descriptor SVGER can afford good account of relationships between activity and structure of peptide drugs.
Collapse
Affiliation(s)
- Jianbo Tong
- Shaanxi University of Science & Technology, Xi'an, PR China
| | - Lingxiao Li
- Shaanxi University of Science & Technology, Xi'an, PR China
| | - Min Bai
- Shaanxi University of Science & Technology, Xi'an, PR China
| | - Kangnan Li
- Shaanxi University of Science & Technology, Xi'an, PR China
| |
Collapse
|
8
|
Geometry optimization method versus predictive ability in QSPR modeling for ionic liquids. J Comput Aided Mol Des 2016; 30:165-76. [DOI: 10.1007/s10822-016-9894-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Accepted: 01/13/2016] [Indexed: 01/03/2023]
|
9
|
Matta* CF. Modeling biophysical and biological properties from the characteristics of the molecular electron density, electron localization and delocalization matrices, and the electrostatic potential. J Comput Chem 2014; 35:1165-98. [PMID: 24777743 PMCID: PMC4368384 DOI: 10.1002/jcc.23608] [Citation(s) in RCA: 109] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2013] [Revised: 03/16/2014] [Accepted: 03/21/2014] [Indexed: 11/11/2022]
Abstract
The electron density and the electrostatic potential are fundamentally related to the molecular hamiltonian, and hence are the ultimate source of all properties in the ground- and excited-states. The advantages of using molecular descriptors derived from these fundamental scalar fields, both accessible from theory and from experiment, in the formulation of quantitative structure-to-activity and structure-to-property relationships, collectively abbreviated as QSAR, are discussed. A few such descriptors encode for a wide variety of properties including, for example, electronic transition energies, pK(a)'s, rates of ester hydrolysis, NMR chemical shifts, DNA dimers binding energies, π-stacking energies, toxicological indices, cytotoxicities, hepatotoxicities, carcinogenicities, partial molar volumes, partition coefficients (log P), hydrogen bond donor capacities, enzyme-substrate complementarities, bioisosterism, and regularities in the genetic code. Electronic fingerprinting from the topological analysis of the electron density is shown to be comparable and possibly superior to Hammett constants and can be used in conjunction with traditional bulk and liposolubility descriptors to accurately predict biological activities. A new class of descriptors obtained from the quantum theory of atoms in molecules' (QTAIM) localization and delocalization indices and bond properties, cast in matrix format, is shown to quantify transferability and molecular similarity meaningfully. Properties such as "interacting quantum atoms (IQA)" energies which are expressible into an interaction matrix of two body terms (and diagonal one body "self" terms, as IQA energies) can be used in the same manner. The proposed QSAR-type studies based on similarity distances derived from such matrix representatives of molecular structure necessitate extensive investigation before their utility is unequivocally established.
Collapse
Affiliation(s)
- Chérif F Matta*
- Department of Chemistry and Physics, Mount Saint Vincent UniversityHalifax, Nova Scotia, Canada, B3M 2J6
- Department of Chemistry, Dalhousie UniversityHalifax, Nova Scotia, Canada, B3H 4J3
- Department of Chemistry, Saint Mary's UniversityHalifax, Nova Scotia, Canada, B3H 3C3
| |
Collapse
|
10
|
Kar S, Gajewicz A, Puzyn T, Roy K. Nano-quantitative structure-activity relationship modeling using easily computable and interpretable descriptors for uptake of magnetofluorescent engineered nanoparticles in pancreatic cancer cells. Toxicol In Vitro 2014; 28:600-6. [PMID: 24412539 DOI: 10.1016/j.tiv.2013.12.018] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2013] [Revised: 12/23/2013] [Accepted: 12/27/2013] [Indexed: 01/28/2023]
Abstract
As experimental evaluation of the safety of nanoparticles (NPs) is expensive and time-consuming, computational approaches have been found to be an efficient alternative for predicting the potential toxicity of new NPs before mass production. In this background, we have developed here a regression-based nano quantitative structure-activity relationship (nano-QSAR) model to establish statistically significant relationships between the measured cellular uptakes of 109 magnetofluorescent NPs in pancreatic cancer cells with their physical, chemical, and structural properties encoded within easily computable, interpretable and reproducible descriptors. The developed model was rigorously validated internally as well as externally with the application of the principles of Organization for Economic Cooperation and Development (OECD). The test for domain of applicability was also carried out for checking reliability of the predictions. Important fragments contributing to higher/lower cellular uptake of NPs were identified through critical analysis and interpretation of the developed model. Considering all these identified structural attributes, one can choose or design safe, economical and suitable surface modifiers for NPs. The presented approach provides rich information in the context of virtual screening of relevant NP libraries.
Collapse
Affiliation(s)
- Supratik Kar
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Agnieszka Gajewicz
- Laboratory of Environmental Chemometrics, Institute for Environmental and Human Health Protection, Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| | - Tomasz Puzyn
- Laboratory of Environmental Chemometrics, Institute for Environmental and Human Health Protection, Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India.
| |
Collapse
|
11
|
Kar S, Roy K. Predictive Chemometric Modeling and Three-Dimensional Toxicophore Mapping of Diverse Organic Chemicals Causing Bioluminescent Repression of the Bacterium Genus Pseudomonas. Ind Eng Chem Res 2013. [DOI: 10.1021/ie402803h] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- Supratik Kar
- Drug Theoretics and Cheminformatics
Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics
Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700032, India
| |
Collapse
|
12
|
Tong J, Xu X, Liu S, Che T, Li Y, Hu Z, Meng Y. Prediction of glass transition temperature of polyacrylate using a quantitative structure property relationship model. POLYMER SCIENCE SERIES A 2013. [DOI: 10.1134/s0965545x13080075] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
13
|
Griffiths MZ, Alkorta I, Popelier PLA. Predicting pKa Values in Aqueous Solution for the Guanidine Functional Group from Gas Phase Ab Initio Bond Lengths. Mol Inform 2013; 32:363-76. [PMID: 27481593 DOI: 10.1002/minf.201300008] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Accepted: 03/11/2013] [Indexed: 11/06/2022]
Abstract
Here we applied a novel method1a to predict pKa values of the guanidine functional group, which is a notoriously difficult. This method, which was developed in our lab, uses only one ab initio bond length obtained at a low level of theory. The method is shown to work for drug molecules, delivers prediction errors of less than 0.5 log units, successfully treats tautomerisation in close relation with experiment, and demonstrates strong correlations with only a few data points. The high structural content of the ab initio bond length makes a given data set essentially divide itself into high correlation subsets. One then observes that molecules within a subset possess a common substructure. Each high correlation subset exists in its own region of chemical space. The high correlation subset method is explored with respect to this position in chemical space, in particular tautomerisation. The proposed method is able to distinguish between different tautomeric forms and the preferred tautomeric form emerges naturally, in agreement with experiment.
Collapse
Affiliation(s)
- Mark Z Griffiths
- Manchester Institute of Biotechnology (MIB), 131 Princess Street, M1 7DN, GB.,School of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, GB
| | - Ibon Alkorta
- Instituto de Química Médica (IQM-CSIC), Juan de la Cierva, 3, 28006 Madrid, Spain
| | - Paul L A Popelier
- Manchester Institute of Biotechnology (MIB), 131 Princess Street, M1 7DN, GB. .,School of Chemistry, University of Manchester, Oxford Road, Manchester M13 9PL, GB.
| |
Collapse
|
14
|
Cao C, Wang Z, Niu C, Desneux N, Gao X. Transcriptome profiling of Chironomus kiinensis under phenol stress using Solexa sequencing technology. PLoS One 2013; 8:e58914. [PMID: 23527048 PMCID: PMC3604134 DOI: 10.1371/journal.pone.0058914] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2012] [Accepted: 02/08/2013] [Indexed: 12/25/2022] Open
Abstract
Phenol is a major pollutant in aquatic ecosystems due to its chemical stability, water solubility and environmental mobility. To date, little is known about the molecular modifications of invertebrates under phenol stress. In the present study, we used Solexa sequencing technology to investigate the transcriptome and differentially expressed genes (DEGs) of midges (Chironomus kiinensis) in response to phenol stress. A total of 51,518,972 and 51,150,832 clean reads in the phenol-treated and control libraries, respectively, were obtained and assembled into 51,014 non-redundant (Nr) consensus sequences. A total of 6,032 unigenes were classified by Gene Ontology (GO), and 18,366 unigenes were categorized into 238 Kyoto Encyclopedia of Genes and Genomes (KEGG) categories. These genes included representatives from almost all functional categories. A total of 10,724 differentially expressed genes (P value <0.05) were detected in a comparative analysis of the expression profiles between phenol-treated and control C. kiinensis including 8,390 upregulated and 2,334 downregulated genes. The expression levels of 20 differentially expressed genes were confirmed by real-time RT-PCR, and the trends in gene expression that were observed matched the Solexa expression profiles, although the magnitude of the variations was different. Through pathway enrichment analysis, significantly enriched pathways were identified for the DEGs, including metabolic pathways, aryl hydrocarbon receptor (AhR), pancreatic secretion and neuroactive ligand-receptor interaction pathways, which may be associated with the phenol responses of C. kiinensis. Using Solexa sequencing technology, we identified several groups of key candidate genes as well as important biological pathways involved in the molecular modifications of chironomids under phenol stress.
Collapse
Affiliation(s)
- Chuanwang Cao
- Department of Forest Protection, Northeast Forestry University, Harbin, China
| | - Zhiying Wang
- Department of Forest Protection, Northeast Forestry University, Harbin, China
| | - Changying Niu
- Hubei Key Laboratory of Utilization of Insect Resources and Sustainable Control of Pests, Huazhong Agricultural University, Wuhan, China
| | - Nicolas Desneux
- French National Institute for Agricultural Research (INRA), Sophia-Antipolis, France
| | - Xiwu Gao
- Department of Entomology, China Agricultural University, Beijing, China
- * E-mail:
| |
Collapse
|
15
|
Kar S, Roy K. First report on predictive chemometric modeling, 3D-toxicophore mapping and in silico screening of in vitro basal cytotoxicity of diverse organic chemicals. Toxicol In Vitro 2013; 27:597-608. [DOI: 10.1016/j.tiv.2012.10.015] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2012] [Revised: 10/19/2012] [Accepted: 10/24/2012] [Indexed: 12/12/2022]
|
16
|
|
17
|
Prediction of hERG Potassium Channel Blocking Actions Using Combination of Classification and Regression Based Models: A Mixed Descriptors Approach. Mol Inform 2012; 31:879-94. [DOI: 10.1002/minf.201200039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Accepted: 11/15/2012] [Indexed: 11/07/2022]
|
18
|
In silico prediction of toxic action mechanisms of phenols for imbalanced data with Random Forest learner. J Mol Graph Model 2012; 35:21-7. [DOI: 10.1016/j.jmgm.2012.01.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2011] [Revised: 01/07/2012] [Accepted: 01/09/2012] [Indexed: 11/20/2022]
|
19
|
Kar S, Roy K. First report on development of quantitative interspecies structure-carcinogenicity relationship models and exploring discriminatory features for rodent carcinogenicity of diverse organic chemicals using OECD guidelines. CHEMOSPHERE 2012; 87:339-355. [PMID: 22225702 DOI: 10.1016/j.chemosphere.2011.12.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2011] [Revised: 12/08/2011] [Accepted: 12/08/2011] [Indexed: 05/31/2023]
Abstract
Different regulatory agencies in food and drug administration and environmental protection worldwide are employing quantitative structure-activity relationship (QSAR) models to fill the data gaps related with properties of chemicals affecting the environment and human health. Carcinogenicity is a toxicity endpoint of major concern in recent times. Interspecies toxicity correlations may provide a tool for estimating sensitivity towards toxic chemical exposure with known levels of uncertainty for a diversity of wildlife species. In this background, we have developed quantitative interspecies structure-carcinogenicity correlation models for rat and mouse [rodent species according to the Organization for Economic Cooperation and Development (OECD) guidelines] based on the carcinogenic potential of 166 organic chemicals with wide diversity of molecular structures, spanning a large number of chemical classes and biological mechanisms. All the developed models have been assessed according to the OECD principles for the validation of QSAR models. Consensus predictions for carcinogenicity of the individual compounds are presented here for any one species when the data for the other species are available. Informative illustrations of the contributing structural fragments of chemicals which are responsible for specific carcinogenicity endpoints are identified by the developed models. The models have also been used to predict mouse carcinogenicities of 247 organic chemicals (for which rat carcinogenicities are present) and rat carcinogenicities of 150 chemicals (for which mouse carcinogenicities are present). Discriminatory features for rat and mouse carcinogenicity values have also been explored.
Collapse
Affiliation(s)
- Supratik Kar
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700 032, India
| | | |
Collapse
|
20
|
Devillers J, Doucet JP, Doucet-Panaye A, Decourtye A, Aupinel P. Linear and non-linear QSAR modelling of juvenile hormone esterase inhibitors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012; 23:357-369. [PMID: 22443267 DOI: 10.1080/1062936x.2012.664562] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
A tight control of juvenile hormone (JH) titre is crucial during the life cycle of a holometabolous insect. JH metabolism is made through the action of enzymes, particularly the juvenile hormone esterase (JHE). Trifluoromethylketones (TFKs) are able to inhibit this enzyme to disrupt the endocrine function of the targeted insect. In this context, a set of 96 TFKs, tested on Trichoplusia ni for their JHE inhibition, was split into a training set (n = 77) and a test set (n = 19) to derive a QSAR model. TFKs were initially described by 42 CODESSA (Comprehensive Descriptors for Structural and Statistical Analysis) descriptors, but a feature selection process allowed us to consider only five descriptors encoding the structural characteristics of the TFKs and their reactivity. A classical and spline regression analysis, a three-layer perceptron, a radial basis function network and a support vector regression were experienced as statistical tools. The best results were obtained with the support vector regression (r(2) and r(test)(2) = 0.91). The model provides information on the structural features and properties responsible for the high JHE inhibition activity of TFKs.
Collapse
|
21
|
New autocorrelation QTMS-based descriptors for use in QSAM of peptides. JOURNAL OF THE IRANIAN CHEMICAL SOCIETY 2012. [DOI: 10.1007/s13738-012-0070-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
22
|
Tian D, Lin Z, Yin D, Zhang Y, Kong D. Atomic charges of individual reactive chemicals in binary mixtures determine their joint effects: an example of cyanogenic toxicants and aldehydes. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2012; 31:270-278. [PMID: 22105991 DOI: 10.1002/etc.1701] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2011] [Revised: 09/02/2011] [Accepted: 09/27/2011] [Indexed: 05/31/2023]
Abstract
Environmental contaminants are usually encountered as mixtures, and many of these mixtures yield synergistic or antagonistic effects attributable to an intracellular chemical reaction that pose a potential threat on ecological systems. However, how atomic charges of individual chemicals determine their intracellular chemical reactions, and then determine the joint effects for mixtures containing reactive toxicants, is not well understood. To address this issue, the joint effects between cyanogenic toxicants and aldehydes on Photobacterium phosphoreum were observed in the present study. Their toxicological joint effects differed from one another. This difference is inherently related to the two atomic charges of the individual chemicals: the oxygen charge of -CHO (O(aldehyde toxicant)) in aldehyde toxicants and the carbon-atom charge of a carbon chain in the cyanogenic toxicant (C(cyanogenic toxicant)). Based on these two atomic charges, the following QSAR (quantitative structure-activity relationship) model was proposed: When (O(aldehyde toxicant) -C(cyanogenic toxicant) )> -0.125, the joint effect of equitoxic binary mixtures at median inhibition (TU, the sum of toxic units) can be calculated as TU = 1.00 ± 0.20; when (O(aldehyde toxicant) -C(cyanogenic toxicant) ) ≤ -0.125, the joint effect can be calculated using TU = - 27.6 x O (aldehyde toxicant) - 5.22 x C (cyanogenic toxicant) - 6.97 (n = 40, r = 0.887, SE = 0.195, F = 140, p < 0.001, q(2) (Loo) = 0.748; SE is the standard error of the regression, F is the F test statistic). The result provides insight into the relationship between the atomic charges and the joint effects for mixtures containing cyanogenic toxicants and aldehydes. This demonstrates that the essence of the joint effects resulting from intracellular chemical reactions depends on the atomic charges of individual chemicals. The present study provides a possible approach for the development of a QSAR model for mixtures containing reactive toxicants based on the atomic charges.
Collapse
Affiliation(s)
- Dayong Tian
- State Key Laboratory of Pollution Control and Resource Reuse, College of Environmental Science and Engineering, Tongji University, Shanghai, China
| | | | | | | | | |
Collapse
|
23
|
Tong J, Che T, Liu S, Li Y, Wang P, Xu X, Chen Y. SVEEVA Descriptor Application to Peptide QSAR. Arch Pharm (Weinheim) 2011; 344:719-25. [DOI: 10.1002/ardp.201100093] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2011] [Revised: 05/21/2011] [Accepted: 06/10/2011] [Indexed: 11/07/2022]
|
24
|
Tong J, Che T, Li Y, Wang P, Xu X, Chen Y. A descriptor of amino acids: SVRG and its application to peptide quantitative structure-activity relationship. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2011; 22:611-620. [PMID: 21830880 DOI: 10.1080/1062936x.2011.604099] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
In this work, a descriptor, SVRG (principal component scores vector of radial distribution function descriptors and geometrical descriptors), was derived from principal component analysis (PCA) of a matrix of two structural variables of coded amino acids, including radial distribution function index (RDF) and geometrical index. SVRG scales were then applied in three panels of peptide quantitative structure-activity relationships (QSARs) which were modelled by partial least squares regression (PLS). The obtained models with the correlation coefficient (R²(cum)), cross-validation correlation coefficient (Q²(LOO)) were 0.910 and 0.863 for 48 bitter-tasting dipeptides; 0.968 and 0.931 for 21 oxytocin analogues; and 0.992 and 0.954 for 20 thromboplastin inhibitors. Satisfactory results showed that SVRG contained much chemical information relating to bioactivities. The approach may be a useful structural expression methodology for studies on peptide QSAR.
Collapse
Affiliation(s)
- J Tong
- College of Chemistry & Chemical Engineering, Shaanxi University of Science & Technology, Xi'an, PR China.
| | | | | | | | | | | |
Collapse
|
25
|
Toropova AP, Toropov AA, Benfenati E, Gini G, Leszczynska D, Leszczynski J. CORAL: Quantitative structure-activity relationship models for estimating toxicity of organic compounds in rats. J Comput Chem 2011; 32:2727-33. [DOI: 10.1002/jcc.21848] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2011] [Revised: 05/06/2011] [Accepted: 05/09/2011] [Indexed: 11/11/2022]
|
26
|
Harding AP, Popelier PLA. pKa prediction from an ab initio bond length: part 2--phenols. Phys Chem Chem Phys 2011; 13:11264-82. [PMID: 21573301 DOI: 10.1039/c1cp20379g] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The prediction of pK(a) continues to attract much attention with ongoing investigations into new ways to predict pK(a) accurately, where predicted pK(a) values deviate less than 0.50 log units from experiment. We show that a single descriptor, i.e. an ab initio bond length, can predict pK(a). The emphasis was placed on model simplicity and a demonstration that more accurate predictions emerge from single-bond-length models. A data set of 171 phenols was studied. The carbon-oxygen bond length, connecting the OH to the phenyl ring, consistently provided accurate predictions. The pK(a) of meta- and para-substituted phenols is predicted here by a single-bond-length model within 0.50 log units. However, accurate prediction of the pK(a) of ortho-substituted phenols necessitated their splitting into groups called high-correlation subsets in which the pK(a) of the compounds strongly correlated with a single bond-length. The highly compound-specific single-bond-length models produced better predictions than models constructed with more compounds and more bond lengths. Outliers were easily identified using single-bond-length models and in most cases we were able to determine the reason for the outlier discrepancy. Furthermore, the single-bond-length models showed better cross-validation statistics than the PLS models constructed using more than one bond length. For all of the single-bond-length models, RMSEE was less than 0.50. For the majority of the models, RMSEP was less than 0.50. The results support the use of multiple high-correlation subsets and a single bond-length to predict pK(a). Six one-term linear equations are listed as a starting point for the construction of a more comprehensive list covering a larger variety of compound classes.
Collapse
Affiliation(s)
- A P Harding
- Manchester Interdisciplinary Biocentre (MIB), Manchester, Great Britain
| | | |
Collapse
|
27
|
Harding AP, Popelier PLA. pKa prediction from an ab initio bond length: part 3--benzoic acids and anilines. Phys Chem Chem Phys 2011; 13:11283-93. [PMID: 21573302 DOI: 10.1039/c1cp20380k] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The prediction of pK(a) from a single ab initio bond length has been extended to provide equations for benzoic acids and anilines. The HF/6-31G(d) level of theory is used for all geometry optimisations. Similarly to phenols (Part 2 of this series of publications), the meta-/para-substituted benzoic acids can be predicted from a single model constructed from one bond length. This model had an impressive RMSEP of 0.13 pK(a) units. The prediction of ortho-substituted benzoic acids required the identification of high-correlation subsets, where the compounds in the same subset have at least one of the same (e.g. halogens, hydroxy) ortho substituent. Two pK(a) equations are provided for o-halogen benzoic acids and o-hydroxybenzoic acids, where the RMSEP values are 0.19 and 0.15 pK(a) units, respectively. Interestingly, the bond length that provided the best model differed between these two high-correlation subsets. This demonstrates the importance of investigating the most predictive bond length, which is not necessarily the bond involving the acid hydrogen. Three high-correlation subsets were identified for the ortho-substituted anilines. These were o-halogen, o-nitro and o-alkyl-substituted aniline high-correlation subsets, where the RMSEP ranged from 0.23 to 0.44 pK(a) units. The RMSEP for the meta-/para-substituted aniline model was 0.54 pK(a) units. This value exceeded our threshold of 0.50 pK(a) units and was higher than both the m-/p-benzoic acids in this work and the m-/p-phenols (RMSEP = 0.43) of Part 2. Constructing two separate models for the meta- and para- substituted anilines, where RMSEP values of 0.63 and 0.33 pK(a) units were obtained respectively, revealed it was the meta-substituted anilines that caused the large RMSEP value. For unknown reasons the RMSEP value increased with the addition of a further twenty meta-substituted anilines to this model. The C-N bond always produced the best correlations with pK(a) for all the high-correlation subsets. A higher level of theory and an ammonia probe improved the statistics only marginally for the hydroxybenzoic acid high-correlation subsets.
Collapse
Affiliation(s)
- A P Harding
- Manchester Interdisciplinary Biocentre (MIB), Manchester, UK
| | | |
Collapse
|
28
|
Cheng F, Shen J, Yu Y, Li W, Liu G, Lee PW, Tang Y. In silico prediction of Tetrahymena pyriformis toxicity for diverse industrial chemicals with substructure pattern recognition and machine learning methods. CHEMOSPHERE 2011; 82:1636-43. [PMID: 21145574 DOI: 10.1016/j.chemosphere.2010.11.043] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2010] [Revised: 11/08/2010] [Accepted: 11/16/2010] [Indexed: 05/12/2023]
Abstract
There is an increasing need for the rapid safety assessment of chemicals by both industries and regulatory agencies throughout the world. In silico techniques are practical alternatives in the environmental hazard assessment. It is especially true to address the persistence, bioaccumulative and toxicity potentials of organic chemicals. Tetrahymena pyriformis toxicity is often used as a toxic endpoint. In this study, 1571 diverse unique chemicals were collected from the literature and composed of the largest diverse data set for T. pyriformis toxicity. Classification predictive models of T. pyriformis toxicity were developed by substructure pattern recognition and different machine learning methods, including support vector machine (SVM), C4.5 decision tree, k-nearest neighbors and random forest. The results of a 5-fold cross-validation showed that the SVM method performed better than other algorithms. The overall predictive accuracies of the SVM classification model with radial basis functions kernel was 92.2% for the 5-fold cross-validation and 92.6% for the external validation set, respectively. Furthermore, several representative substructure patterns for characterizing T. pyriformis toxicity were also identified via the information gain analysis methods.
Collapse
Affiliation(s)
- Feixiong Cheng
- Department of Pharmaceutical Sciences, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | | | | | | | | | | | | |
Collapse
|
29
|
Roy K, Das RN. QSTR with extended topochemical atom (ETA) indices. 14. QSAR modeling of toxicity of aromatic aldehydes to Tetrahymena pyriformis. JOURNAL OF HAZARDOUS MATERIALS 2010; 183:913-922. [PMID: 20739120 DOI: 10.1016/j.jhazmat.2010.07.116] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2010] [Revised: 07/27/2010] [Accepted: 07/27/2010] [Indexed: 05/29/2023]
Abstract
Aldehydes are a toxic class of chemicals causing severe health hazards. In this background, quantitative structure-toxicity relationship (QSTR) models have been developed in the present study using Extended Topochemical Atom (ETA) indices for a large group of 77 aromatic aldehydes for their acute toxicity against the protozoan ciliate Tetrahymena pyriformis. The ETA models have been compared with those developed using various non-ETA topological indices. Attempt was also made to include the n-octanol/water partition coefficient (logK(o/w)) as an additional descriptor considering the importance of hydrophobicity in toxicity prediction. Thirty different models were developed using different chemometric tools. All the models have been validated using internal validation and external validation techniques. The statistical quality of the ETA models was found to be comparable to that of the non-ETA models. The ETA models have shown the important effects of steric bulk, lipophilicity, presence of electronegative atom containing substituents and functionality of the aldehydic oxygen to the toxicity of the aldehydes. The best ETA model (without using logK(o/w)) shows encouraging statistical quality (Q(int)(2)=0.709,Q(ext)(2)=0.744). It is interesting to note that some of the topological models reported here are better in statistical quality than previously reported models using quantum chemical descriptors.
Collapse
Affiliation(s)
- Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Division of Medicinal and Pharmaceutical Chemistry, Department of Pharmaceutical Technology, Jadavpur University, Kolkata 700 032, India.
| | | |
Collapse
|