1
|
Lephalala M, Vives SS, Bisetty K. Chaotic neural network algorithm with competitive learning integrated with partial Least Square models for the prediction of the toxicity of fragrances in sanitizers and disinfectants. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 942:173754. [PMID: 38844215 DOI: 10.1016/j.scitotenv.2024.173754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 05/18/2024] [Accepted: 06/02/2024] [Indexed: 06/10/2024]
Abstract
This study addresses the need for accurate structural data regarding the toxicity of fragrances in sanitizers and disinfectants. We compare the predictive and descriptive (model stability) potential of multiple linear regression (MLR) and partial least squares (PLS) models optimized through variable selection (VS). A novel hybrid chaotic neural network algorithm with competitive learning (CCLNNA)-PLS modeling strategy can offer specific optimization with satisfactory results, even for a limited dataset. While also exploring the preliminary comparative analysis, the goal is to introduce an adapted novel CCLNNA optimization strategy for VS, inspired by neural networks, along with exploring the influence of the percentage of significant descriptors in the optimization function to enhance the final model's capabilities. We analyzed an available dataset of 24 molecules, incorporating ADMET and PaDEL descriptors as predictor variables, to explore the relationship between the response/target variable (pLC50) and the meticulously optimized set of descriptors. The suitability of the selected PLS models (cross- and external-validated accuracy combined with percentage of significant descriptors at a level equal to or >80 %) underscores the importance of expanding the dataset to amplify the validation protocols, thus enhancing future model reliability and environmental impact.
Collapse
Affiliation(s)
- Matshidiso Lephalala
- Department of Chemistry, Durban University of Technology, P.O. Box 1334, Durban 4000, South Africa
| | - Salvador Sagrado Vives
- Departamento de Química Analítica, Facultad de Farmacia. Universitat de València, E-46100 Burjassot, Valencia, Spain; Instituto Interuniversitario de Investigación de Reconocimiento Molecular y Desarrollo Tecnológico (IDM), Universitat Politècnica de València, Universitat de València, Valencia, Spain
| | - Krishna Bisetty
- Department of Chemistry, Durban University of Technology, P.O. Box 1334, Durban 4000, South Africa.
| |
Collapse
|
2
|
Chen C, Yang B, Li M, Huang S, Huang X. Quantitative structure-activity relationship predicting toxicity of pesticides towards Daphnia magna. ECOTOXICOLOGY (LONDON, ENGLAND) 2024:10.1007/s10646-024-02751-1. [PMID: 38592644 DOI: 10.1007/s10646-024-02751-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 03/31/2024] [Indexed: 04/10/2024]
Abstract
Global pesticide usage reaching 2.7 million metric tons annually, brings a grave threat to non-target organisms, especially aquatic organisms, resulting in serious concerns. Predicting aquatic toxicity of pesticides towards Daphnia magna is significant. In this work, random forest (RF) algorithm, together with ten Dragon molecular descriptors, was successfully utilized to develop a quantitative structure-activity/toxicity relationship (QSAR/QSTR) model for the toxicity pEC50 of 745 pesticides towards Daphnia magna. The optimal QSTR model (RF Model I) based on the RF parameters of ntree = 50, mtry = 3 and nodesize = 5, yielded R2 = 0.877, MAE = 0.570, rms = 0.739 (training set of 596 pEC50), R2 = 0.807, MAE = 0.732, rms = 0.902 (test set of 149 pEC50), and R2 = 0.863, MAE = 0.602, rms = 0.774 (total set of 745 pEC50), which are accurate and satisfactory. The optimal RF model is comparable to other published QSTR models for Daphnia magna, although the optimal RF model possessed a small descriptor subset and dealt with a large dataset of pesticide toxicity pEC50. Thus, the investigation in this work provides a reliable, applicable QSTR model for predicting the toxicity pEC50 of pesticides towards Daphnia magna.
Collapse
Affiliation(s)
- Cong Chen
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Bowen Yang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Mingwang Li
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China
| | - Saijin Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| | - Xianwei Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| |
Collapse
|
3
|
Qiu M, Shao Z, Zhang W, Zheng Y, Yin X, Gai G, Han Z, Zhao J. Water-richness evaluation method and application of clastic rock aquifer in mining seam roof. Sci Rep 2024; 14:6465. [PMID: 38499707 PMCID: PMC10948766 DOI: 10.1038/s41598-024-57033-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/13/2024] [Indexed: 03/20/2024] Open
Abstract
Clastic rock aquifer of the coal seam roof often constitutes the direct water-filling aquifer of the coal seam and its water-richness is closely related to the risk of roof water inrush. Therefore, the evaluation of the water-richness of clastic rock aquifer is the basic work of coal seam roof water disaster prevention. This article took the 4th coal seam in Huafeng mine field as an example. It combined the empirical formula method and generalized regression neural network (GRNN) to calculate the development height of water-conducting fracture zone, determined the vertical spatial range of water-richness evaluation. Depth of the sandstone floor, brittle rock ratio, lithological structure index, fault strength index, and fault intersections and endpoints density were selected as the main controlling factors. A combination weighting method based on the analytic hierarchy process (AHP), rough set theory (RS), and minimum deviation method (MD) was proposed to determine the weight of the main controlling factors. Introduced the theory of unascertained measures and confidence recognition criteria to construct an evaluation model for the water-richness of clastic rock aquifers, the study area was divided into three zones: relatively weak water-richness zones, medium water-richness zones, and relatively strong water-richness zones. By comparing with the water inrush points and the water inflow of workfaces, the evaluation model's water yield zoning was consistent with the actual situation, and the prediction effect was good. This provided a new idea for the evaluation of the water-richness of the clastic rock aquifer on the roof of the mining coal seam.
Collapse
Affiliation(s)
- Mei Qiu
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China.
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China.
| | - Zhendong Shao
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China
| | - Weiqiang Zhang
- Shandong Shengyuan Geological Exploration Co., Ltd, Taian, 271000, China
| | - Yan Zheng
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China
| | - Xinyu Yin
- Jinan Rail Transit Group CO., LTD, Jinan, 250013, China
| | - Guichao Gai
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China
| | - Zhaodi Han
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China
| | - Jianfei Zhao
- College of Earth Sciences and Engineering, Shandong University of Science and Technology, Qingdao, 266590, China
- Key Laboratory of Sedimentary Mineralization and Sedimentary Minerals in Shandong Province, Shandong University of Science and Technology, Qingdao, 266590, China
| |
Collapse
|
4
|
Ghosh V, Bhattacharjee A, Kumar A, Ojha PK. q-RASTR modelling for prediction of diverse toxic chemicals towards T. pyriformis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024; 35:11-30. [PMID: 38193248 DOI: 10.1080/1062936x.2023.2298452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 12/16/2023] [Indexed: 01/10/2024]
Abstract
A series of diverse organic compounds impose serious detrimental effects on the health of living organisms and the environment. Determination of the structural aspects of compounds that impart toxicity and evaluation of the same is crucial before public usage. The present study aims to determine the structural characteristics of compounds for Tetrahymena pyriformis toxicity using the q-RASTR (Quantitative Read Across Structure-Toxicity Relationship) model. It was developed using RASTR and 2-D descriptors for a dataset of 1792 compounds with defined endpoint (pIGC50) against a model organism, T. pyriformis. For the current study, the whole dataset was divided based on activity/property into the training and test sets, and the q-RASTR model was developed employing six descriptors (three latent variables) having r2, Q2F1 and Q2 values of 0.739, 0.767, and 0.735, respectively. The generated model was thoroughly validated using internationally recognized internal and external validation criteria to assess the model's dependability and predictability. It was highlighted that high molecular weight, aromatic hydroxyls, nitrogen, double bonds, and hydrophobicity increase the toxicity of organic compounds. The current study demonstrates the applicability of the RASTR algorithm in QSTR model development for the prediction of toxic chemicals (pIGC50) towards T. pyriformis.
Collapse
Affiliation(s)
- V Ghosh
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - A Bhattacharjee
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - A Kumar
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| | - P K Ojha
- Drug Discovery and Development Laboratory (DDD Lab), Department of Pharmaceutical Technology, Jadavpur University, Kolkata, India
| |
Collapse
|
5
|
Yu X. Global classification models for predicting acute toxicity of chemicals towards Daphnia magna. ENVIRONMENTAL RESEARCH 2023; 238:117239. [PMID: 37778597 DOI: 10.1016/j.envres.2023.117239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 09/10/2023] [Accepted: 09/18/2023] [Indexed: 10/03/2023]
Abstract
Molecular descriptors reflecting structural information on hydrophobicity, reactivity, polarizability, hydrogen bond and charged groups, were used to predict the toxicity (pLC50) of chemicals towards Daphnia magna with global quantitative structure-activity/toxicity relationship (QSAR/QSTR) models. A sufficiently large dataset including 1517 chemical toxicity to Daphnia magna was divided into a training set (758 pLC50) and a test set (759 pLC50). By applying random forest algorithm, two classification models, Class Model A and Class Model B were developed, having prediction accuracy, sensitivity and specificity above 85% for Class 1 (with pLC50 ≤ 4.48) and Class 2 (with pLC50 > 4.48). The Class Model A was based on nine molecular descriptors and RF parameters of nodesize = 1, ntree = 80 and mtry = 2, and yielded accuracy of 92.3% (training set), 85.6% (test set) and 88.9% (total data set). Class Model B was based on ten descriptors and parameters, nodesize = 1, ntree = 90 and mtry = 2, produced accuracy of 88.3% (training set), 86.8% (test set) and 87.5% (total data set). The two classification models were satisfactory compared with other classification model reported in the literature, although classification models in this work dealt with more samples. Thus, the two classification models with a larger applicability domain provided efficient tools for assessing chemical aquatic toxicity towards Daphnia magna.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan, 411104, China.
| |
Collapse
|
6
|
Yu X, He M, Su L. Large Dataset-Based Regression Model of Chemical Toxicity to Vibrio fischeri. ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY 2023:10.1007/s00244-023-01010-4. [PMID: 37407875 DOI: 10.1007/s00244-023-01010-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 06/20/2023] [Indexed: 07/07/2023]
Abstract
For the first time, a global regression quantitative structure-toxicity/activity relationship (QSTR/QSAR) model was developed for the toxicity of a large data set including 1236 chemicals towards Vibrio fischeri, by using random forest (RF) regression algorithm. The optimal RF model with RF parameters of mtry = 3, ntree = 150 and nodesize = 5 was based on 13 molecular descriptors. It can achieve accurate prediction for the toxicity of 99.1% of 1236 chemicals, and yield coefficients of determination R2 of 0.893 for 930 log(Mw/IBC50) in the training set, 0.723 for 306 log(Mw/IBC50) in the test se, and 0.865 for 1236 toxicity log(Mw/IBC50) in the total set. The optimal RF global model proposed in this work is comparable to other published local QSTR models on small datasets of the toxicity to Vibrio fischeri.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis and Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, 411104, Hunan, People's Republic of China.
| | - Minghui He
- School of Environment, and State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130117, Jilin, People's Republic of China
| | - Limin Su
- School of Environment, and State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130117, Jilin, People's Republic of China.
| |
Collapse
|
7
|
Liao M, Wu F, Yu X, Zhao L, Wu H, Zhou J. Random Forest Algorithm-Based Prediction of Solvation Gibbs Energies. J SOLUTION CHEM 2023. [DOI: 10.1007/s10953-023-01247-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
8
|
Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri. Molecules 2023; 28:molecules28062703. [PMID: 36985675 PMCID: PMC10057455 DOI: 10.3390/molecules28062703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 03/12/2023] [Accepted: 03/15/2023] [Indexed: 03/19/2023] Open
Abstract
Vibrio fischeri is widely used as the model species in toxicity and risk assessment. For the first time, a global classification model was proposed in this paper for a two-class problem (Class − 1 with log1/IBC50 ≤ 4.2 and Class + 1 with log1/IBC50 > 4.2, the unit of IBC50: mol/L) by utilizing a large data set of 601 toxicity log1/IBC50 of organic compounds to Vibrio fischeri. Dragon software was used to calculate 4885 molecular descriptors for each compound. Stepwise multiple linear regression (MLR) analysis was used to select the descriptor subset for the models. The ten molecular descriptors used in the classification model reflect the structural information on the Michael-type addition of nucleophiles, molecular branching, molecular size, polarizability, hydrophobic, and so on. Furthermore, these descriptors were interpreted from the point of view of toxicity mechanisms. The optimal support vector machine (SVM) model (C = 253.8 and γ = 0.009) was obtained with the genetic algorithm. The SVM classification model produced a prediction accuracy of 89.1% for the training set (451 log1/IBC50), of 80.0% for the test set (150 log1/IBC50), and of 86.9% for the total data set (601 log1/IBC50), which are higher than that (80.5%, 76%, and 79.4%, respectively) from the binary logistic regression (BLR) model. The global SVM classification model is successful, although it deals with a large data set in relation to the toxicity of organics to Vibrio fischeri.
Collapse
|
9
|
Yu X, Acree Jr. WE. QSPR-based model extrapolation prediction of enthalpy of solvation. J Mol Liq 2023. [DOI: 10.1016/j.molliq.2023.121455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
|
10
|
Jia Q, Wang S, Yu M, Wang Q, Yan F. Two QSAR models for predicting the toxicity of chemicals towards Tetrahymena pyriformis based on topological-norm descriptors and spatial-norm descriptors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:147-161. [PMID: 36749040 DOI: 10.1080/1062936x.2023.2171478] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 01/17/2023] [Indexed: 06/18/2023]
Abstract
Quantitative structure-activity relationship (QSAR) is important for safe, rapid and effective risk assessment of chemicals. In this study, two QSAR models were established with 1230 chemicals to predict toxicity towards Tetrahymena pyriformis using multiple linear regression (MLR) method. The topological(T)-QSAR model was developed by using topological-norm descriptors generated from the topological structure, and the spatial(S)-QSAR model were built with spatial-norm descriptors obtained from the three-dimensional structure of molecules and topological-norm descriptors. The r2training and r2test are 0.8304 and 0.8338 for the T-QSAR model, and 0.8485 and 0.8585 for the S-QSAR model, which means that T-QSAR model and S-QSAR model can be used to predict toxicity quickly and accurately. In addition, we also conducted validation on the developed models. Satisfying validation results and statistical parameters demonstrated that QSAR models based on the topological-norm descriptors and spatial-norm descriptors proposed in this paper could be further utilized to estimate the toxicity of chemicals towards Tetrahymena pyriformis.
Collapse
Affiliation(s)
- Q Jia
- School of Marine and Environmental Science, Tianjin Marine Environmental Protection and Restoration Technology Engineering Center, Tianjin University of Science and Technology, Tianjin, PR China
| | - S Wang
- School of Marine and Environmental Science, Tianjin Marine Environmental Protection and Restoration Technology Engineering Center, Tianjin University of Science and Technology, Tianjin, PR China
| | - M Yu
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology, Tianjin, PR China
| | - Q Wang
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology, Tianjin, PR China
| | - F Yan
- School of Chemical Engineering and Material Science, Tianjin University of Science and Technology, Tianjin, PR China
| |
Collapse
|
11
|
Quantitative relationships between national cultures and the increase in cases of novel coronavirus pneumonia. Sci Rep 2023; 13:1646. [PMID: 36717639 PMCID: PMC9885052 DOI: 10.1038/s41598-023-28980-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 01/27/2023] [Indexed: 02/01/2023] Open
Abstract
Support vector machine (SVM) and genetic algorithm were successfully used to predict the changes in the prevalence rate (ΔPR) measured by the increase of reported cases per million population from the 16th to the 45th day during a nation's lockdown after the COVID-19 outbreak. The national cultural indices [individualism-collectivism (Ind), tightness-looseness (Tight)], and the number of people per square kilometer (Pop_density) were used to develop the SVM model of lnΔPR. The SVM model has R2 of 0.804 for the training set (44 samples) and 0.853 for the test set (11 samples), which were much higher than those (0.416 and 0.593) of the multiple linear regression model. The statistical results indicate that there are nonlinear relationships between lnΔPR and Tight, Ind, and Pop_density. It is feasible to build the model for lnΔPR with SVM algorithm. The results suggested that the risk of COVID-19 epidemic spread will be reduced if a nation implements severe measures to strengthen the tightness of national culture and individuals realize the importance of collectivism.
Collapse
|
12
|
Xiao L, Deng J, Yang L, Huang X, Yu X. Random forest algorithm-based accurate prediction of rat acute oral toxicity. Mol Phys 2022. [DOI: 10.1080/00268976.2022.2140083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Linrong Xiao
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, People’s Republic of China
| | - Jiyong Deng
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, People’s Republic of China
| | - Liping Yang
- Shenzhen Expressway Environment Co., Ltd., Shenzhen, People’s Republic of China
| | - Xianwei Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, People’s Republic of China
| | - Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, People’s Republic of China
| |
Collapse
|
13
|
Diéguez-Santana K, Nachimba-Mayanchi MM, Puris A, Gutiérrez RT, González-Díaz H. Prediction of acute toxicity of pesticides for Americamysis bahia using linear and nonlinear QSTR modelling approaches. ENVIRONMENTAL RESEARCH 2022; 214:113984. [PMID: 35981614 DOI: 10.1016/j.envres.2022.113984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 06/19/2022] [Accepted: 07/22/2022] [Indexed: 06/15/2023]
Abstract
Globally, pesticides are toxic substances with wide applications. However, the widespread use of pesticides has received increasing attention from regulatory agencies due to their various acute and chronic effects on multiple organisms. In this study, Quantitative Structure-Toxicity Relationship (QSTR) models were established using Multiple Linear Regression (MLR) and five Machine Learning (ML) algorithms to predict pesticide toxicity in Americamysis bahia. The most influential descriptors included in the MLR model are RBF, JGI2, nCbH, nRCOOR, nRSR, nPO4 and 'Cl-090', with positive contributions to the dependent variable (negative decimal logarithm of median lethal concentration at 96-h). The Random Forest (RF) regression model was superior amongst the five ML models. We observed higher values of R2 (0.812) and lower values of RMSE (0.595) and MAE (0.462) in the cross-validation training set and external validation set. Similarly, this study had a high level of fitness and was internally robust and externally predictive compared to models presented in similar studies. The results suggest that the developed QSTR models are suitable for reliably predicting the aquatic toxicity of structurally diverse pesticides and can be used for screening, prioritising new pesticides, filling data gaps and overcoming the limitations of in vivo and in vitro tests.
Collapse
Affiliation(s)
- Karel Diéguez-Santana
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, 48940, Leioa, Spain; Universidad Regional Amazónica Ikiam, Tena, Ecuador.
| | | | - Amilkar Puris
- Facultad de Ciencias de la Ingeniería, Universidad Técnica Estatal de Quevedo, Ecuador
| | | | - Humberto González-Díaz
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, 48940, Leioa, Spain; Basque Center for Biophysics CSIC-UPVEH, University of Basque Country UPV/EHU, 48940, Leioa, Spain; IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Biscay, Spain
| |
Collapse
|
14
|
Yu X, Zeng Q. Random forest algorithm-based classification model of pesticide aquatic toxicity to fishes. AQUATIC TOXICOLOGY (AMSTERDAM, NETHERLANDS) 2022; 251:106265. [PMID: 36030712 DOI: 10.1016/j.aquatox.2022.106265] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 08/12/2022] [Accepted: 08/13/2022] [Indexed: 06/15/2023]
Abstract
Aquatic toxicity of pesticides can result in poisoning of many non-target organisms, of which various fishes are the most prominent one. It is a challenge to predict the toxicity (LC50) classes of organic pesticides to various fish species from global QSAR models with a larger applicability domain. In this paper, by applying the random forest (RF) algorithm for a two-class problem, only eight molecular descriptors were used to develop a quantitative structure-activity relationship (QSAR) model for 1106 toxicity data (96 h, LC50) of organic pesticides to various fish species including Oncorhynchus mykiss, Lepomis macrochirus, Pimephales promelas, Brachydanio rerio, Cyprinodon, Cyprinus carpio, etc. By the prediction of the optimal RF Model I (ntree =280, mtry = 3 and nodesize = 5), the training set (885 organic pesticides) has the prediction accuracies of 99.6% for Class 1 (LC50 ≤ 10) and 96.7% for Class 2 (LC50 > 10); the test set (221 organic pesticides) has the accuracies being 90.8% for Class 1 and 91.2% for Class 2. The optimal RF Model I is satisfactory compared with other QSAR model reported in the literature, although its descriptor subset is small.
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan 411104, China.
| | - Qun Zeng
- Department of Neurosurgery, Central Hospital of Xiangtan, Xiangtan, Hunan 411100, China
| |
Collapse
|
15
|
Fang Z, Yu X, Zeng Q. Random forest algorithm-based accurate prediction of chemical toxicity to Tetrahymena pyriformis. Toxicology 2022; 480:153325. [PMID: 36115645 DOI: 10.1016/j.tox.2022.153325] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 09/09/2022] [Accepted: 09/13/2022] [Indexed: 12/01/2022]
Abstract
The random forest (RF) algorithm, together with ten Dragon descriptors, was used to develop a quantitative structure-toxicity/activity relationship (QSTR/QSAR) model for a larger data set of 1792 chemical toxicity pIGC50 towards Tetrahymena pyriformis. The optimal RF (ntree =300 and mtry =3) model yielded root mean square (rms) errors of 0.261 for the training set (1434 chemicals) and 0.348 for the test set (358 chemicals). Compared with other QSTR models reported in the literature, the optimal RF model in this paper is more accurate. The feasibility of applying the RF algorithm to predict chemical toxicity pIGC50 towards Tetrahymena pyriformis has been verified.
Collapse
Affiliation(s)
- Zhengjun Fang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan 411104, China
| | - Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, Hunan 411104, China.
| | - Qun Zeng
- Department of Neurosurgery, Xiangtan Central Hospital, Xiangtan, Hunan 411100, China
| |
Collapse
|
16
|
Ahmadianfar I, Shirvani-Hosseini S, He J, Samadi-Koucheksaraee A, Yaseen ZM. An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction. Sci Rep 2022; 12:4934. [PMID: 35322087 PMCID: PMC8943002 DOI: 10.1038/s41598-022-08875-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 03/14/2022] [Indexed: 11/09/2022] Open
Abstract
Precise prediction of water quality parameters plays a significant role in making an early alert of water pollution and making better decisions for the management of water resources. As one of the influential indicative parameters, electrical conductivity (EC) has a crucial role in calculating the proportion of mineralization. In this study, the integration of an adaptive hybrid of differential evolution and particle swarm optimization (A-DEPSO) with adaptive neuro fuzzy inference system (ANFIS) model is adopted for EC prediction. The A-DEPSO method uses unique mutation and crossover processes to correspondingly boost global and local search mechanisms. It also uses a refreshing operator to prevent the solution from being caught inside the local optimal solutions. This study uses A-DEPSO optimizer for ANFIS training phase to eliminate defects and predict accurately the EC water quality parameter every month at the Maroon River in the southwest of Iran. Accordingly, the recorded dataset originated from the Tange-Takab station from 1980 to 2016 was operated to develop the ANFIS-A-DEPSO model. Besides, the wavelet analysis was jointed to the proposed algorithm in which the original time series of EC was disintegrated into the sub-time series through two mother wavelets to boost the prediction certainty. In the following, the comparison between statistical metrics of the standalone ANFIS, least-square support vector machine (LSSVM), multivariate adaptive regression spline (MARS), generalized regression neural network (GRNN), wavelet-LSSVM (WLSSVM), wavelet-MARS (W-MARS), wavelet-ANFIS (W-ANFIS) and wavelet-GRNN (W-GRNN) models was implemented. As a result, it was apparent that not only was the W-ANFIS-A-DEPSO model able to rise remarkably the EC prediction certainty, but W-ANFIS-A-DEPSO (R = 0.988, RMSE = 53.841, and PI = 0.485) also had the edge over other models with Dmey mother in terms of EC prediction. Moreover, the W-ANFIS-A-DEPSO can improve the RMSE compared to the standalone ANFIS-DEPSO model, accounting for 80%. Hence, this model can create a closer approximation of EC value through W-ANFIS-A-DEPSO model, which is likely to act as a promising procedure to simulate the prediction of EC data.
Collapse
Affiliation(s)
- Iman Ahmadianfar
- Department of Civil Engineering, Behbahan Khatam Alanbia University of Technology, Behbahan, Iran.
| | | | - Jianxun He
- Department of Civil Engineering, University of Calgary, Calgary, AB, Canada
| | | | - Zaher Mundher Yaseen
- Adjunct Research Fellow, USQ's Advanced Data Analytics Research Group, School of Mathematics Physics and Computing, University of Southern Queensland, QLD 4350, Toowoomba, Australia.,New Era and Development in Civil Engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, 64001, Iraq
| |
Collapse
|
17
|
Yu X. Prediction of enantioselectivity in thiol addition to imines catalyzed by chiral phosphoric acids. J PHYS ORG CHEM 2022. [DOI: 10.1002/poc.4338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering Hunan Institute of Engineering Xiangtan China
| |
Collapse
|
18
|
Tinkov OV, Grigorev VY, Grigoreva LD. QSAR analysis of the acute toxicity of avermectins towards Tetrahymena pyriformis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2021; 32:541-571. [PMID: 34157880 DOI: 10.1080/1062936x.2021.1932583] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 05/17/2021] [Indexed: 06/13/2023]
Abstract
Avermectins have been effectively used in medicine, veterinary medicine, and agriculture as antiparasitic agents for many years. However, there are still no reliable data on the main ecotoxicological characteristics of most individual avermectins. Although many QSAR models have been proposed to describe the acute toxicity of organic compounds towards Tetrahymena pyriformis (T. pyriformis), avermectins are outside the applicability domain of these models. The influence of the molecular structures of various organic compounds on the acute toxicity towards T. pyriformis was studied using the OCHEM web platform (https://ochem.eu). A data set of 1792 toxicants was used to create models. The QSAR (Quantitative Structure-Activity Relationship) models were developed using the molecular descriptors Dragon, ISIDA, CDK, PyDescriptor, alvaDesc, and SIRMS and machine learning methods, such as Least Squares Support Vector Machine and Transformer Convolutional Neural Network. The HYBOT descriptors and Random Forest were used for a comparative QSAR investigation. Since the best predictive ability was demonstrated by the Transformer Convolutional Neural Network model, it was used to predict the toxicity of individual avermectins towards T. pyriformis. During a structural interpretation of the developed QSAR model, we determined the significant molecular transformations that increase and decrease the acute toxicity of organic compounds.
Collapse
Affiliation(s)
- O V Tinkov
- Department of Pharmacology and Pharmaceutical Chemistry, Medical Faculty, Shevchenko Transnistria State University, Tiraspol, Moldova
- Department of Computer Science, Military Institute of the Ministry of Defense, Tiraspol, Moldova
| | - V Y Grigorev
- Department of Computer-aided Molecular Design, Institute of Physiologically Active Compounds of the Russian Academy of Science, Chernogolovka, Russia
| | - L D Grigoreva
- Department of Fundamental Physicochemical Engineering, Moscow State University, Moscow, Russia
| |
Collapse
|
19
|
Yu X, Liu J. Prediction of reaction rate constants of hydroxyl radical with chemicals in water. WATER ENVIRONMENT RESEARCH : A RESEARCH PUBLICATION OF THE WATER ENVIRONMENT FEDERATION 2021; 93:934-939. [PMID: 33249688 DOI: 10.1002/wer.1485] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Revised: 10/31/2020] [Accepted: 11/20/2020] [Indexed: 06/12/2023]
Abstract
The rate constants (kOH ) of the reactions between organic micropollutants with hydroxyl radical (•OH) in aqueous systems are an important parameter to evaluate the persistence of organic compounds in the environment. In this paper, a support vector machine (SVM) model based on five descriptors was built to predict the reaction rate constants (log K = (log kOH )/MW ). The quantitative structure-activity relationship (QSAR) model of log K was obtained from a training set (600 compounds) and validated with a test set (395 compounds). The coefficients of determination R2 of the training and test sets are 0.923 and 0.925, respectively. The results suggest that the SVM model developed in this work possesses satisfactory prediction ability. PRACTITIONER POINTS: The rate constants of the reactions of organic micropollutants with •OH in aqueous systems were investigated. SVM model was established for the reaction rate constants (log K = (log kOH )/MW ). Only five molecular descriptors were used to predict 995 log K. A large data set was used for the test set (395 compounds).
Collapse
Affiliation(s)
- Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| | - Jun Liu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| |
Collapse
|
20
|
Tinkov O, Polishchuk P, Matveieva M, Grigorev V, Grigoreva L, Porozov Y. The Influence of Structural Patterns on Acute Aquatic Toxicity of Organic Compounds. Mol Inform 2020; 40:e2000209. [PMID: 33029954 DOI: 10.1002/minf.202000209] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 10/01/2020] [Indexed: 12/28/2022]
Abstract
Investigation of the influence of molecular structure of different organic compounds on acute toxicity towards Fathead minnow, Daphnia magna, and Tetrahymena pyriformis has been carried out using 2D simplex representation of molecular structure and two modelling methods: Random Forest (RF) and Gradient Boosting Machine (GBM). Suitable QSAR (Quantitative Structure - Activity Relationships) models were obtained. The study was focused on QSAR models interpretation. The aim of the study was to develop a set of structural fragments that simultaneously consistently increase toxicity toward Fathead minnow, Daphnia magna, Tetrahymena pyriformis. The interpretation allowed to gain more details about known toxicophores and to propose new fragments. The results obtained made it possible to rank the contributions of molecular fragments to various types of toxicity to aquatic organisms. This information can be used for molecular optimization of chemicals. According to the results of structural interpretation, the most significant common mechanisms of the toxic effect of organic compounds on Fathead minnow, Daphnia magna and Tetrahymena pyriformis are reactions of nucleophilic substitution and inhibition of oxidative phosphorylation in mitochondria. In addition acetylcholinesterase and voltage-gated ion channel of Fathead minnow and Daphnia magna are important targets for toxicants. The on-line version of the OCHEM expert system (https://ochem.eu) were used for a comparative QSAR investigation. The proposed QSAR models comply with the OECD principles and can be used to reliably predict acute toxicity of organic compounds towards Fathead minnow, Daphnia magna and Tetrahymena pyriformis with allowance for applicability domain estimation.
Collapse
Affiliation(s)
- Oleg Tinkov
- Department of Computer Science, Military Institute of the Ministry of Defense, 3300, Gogol str. 2"B", Tiraspol, Transdniestria, Moldova.,Department of Pharmacology and Pharmaceutical Chemistry, Medical Faculty, Transnistrian State University, 3300, October 25 str. 128, Tiraspol, Transdniestria, Moldova
| | - Pavel Polishchuk
- Institute of Molecular and Translational Medicine Faculty of Medicine and Dentistry Palacký University and University Hospital in Olomouc, Hnevotinska 5, 77900, Olomouc, Czech Republic
| | - Mariia Matveieva
- Institute of Molecular and Translational Medicine Faculty of Medicine and Dentistry Palacký University and University Hospital in Olomouc, Hnevotinska 5, 77900, Olomouc, Czech Republic
| | - Veniamin Grigorev
- Institute of Physiologically Active Compounds, Russian Academy of Sciences, 142432, Severniy proezd 1, Chernogolovka, Moscow region, Russia
| | - Ludmila Grigoreva
- Department of Fundamental Physical and Chemical Engineering, Moscow State University, 119991, Leninskiye Gory 1/51, Moscow, Russia
| | - Yuri Porozov
- World-Class Research Center "Digital biodesign and personalized healthcare", I.M. Sechenov First Moscow State Medical University, Moscow, Russia.,Department of Computational Biology, Sirius University of Science and Technology, 354340, Olympic Ave 1, Sochi, Russia
| |
Collapse
|
21
|
Toropov AA, Toropova AP, Benfenati E. 'Ideal correlations' for the predictive toxicity to Tetrahymena pyriformis. Toxicol Mech Methods 2020; 30:605-610. [PMID: 32718259 DOI: 10.1080/15376516.2020.1801928] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
OBJECTIVES Predictive models for toxicity to Tetrahymena pyriformis are an important component of natural sciences. The present study aims to build up a predictive model for the endpoint using the so-called index of ideality of correlation (IIC). Besides, the comparison of the predictive potential of these models with the predictive potential of models suggested in the literature is the task of the present study. METHODS The Monte Carlo technique is a tool to build up the predictive model applied in this study. The molecular structure is represented via a simplified molecular input-line entry system (SMILES). The IIC is a statistical characteristic sensitive to both the correlation coefficient and mean absolute error. Applying of the IIC to build up quantitative structure-activity relationships (QSARs) for the toxicity to Tetrahymena pyriformis improves the predictive potential of those models for random splits into the training set and the validation set. The calculation was carried out with CORAL software (http://www.insilico.eu/coral). RESULTS The statistical quality of the suggested models is incredibly good for the external validation set, but the statistical quality of the models for the training set is modest. This is the paradox of ideal correlation, which is obtained with applying the IIC. CONCLUSIONS The Monte Carlo technique is a convenient and reliable way to build up a predictive model for toxicity to Tetrahymena pyriformis. The IIC is a useful statistical criterion for building up predictive models as well as for the assessment of their statistical quality.
Collapse
Affiliation(s)
- Andrey A Toropov
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
| | - Alla P Toropova
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
| | - Emilio Benfenati
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milano, Italy
| |
Collapse
|
22
|
Liu Q, Deng J, Liu M. Classification models for predicting the antimalarial activity against Plasmodium falciparum. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:313-324. [PMID: 32191533 DOI: 10.1080/1062936x.2020.1740890] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Accepted: 03/07/2020] [Indexed: 06/10/2023]
Abstract
Support vector machine (SVM) and general regression neural network (GRNN) were used to develop classification models for predicting the antimalarial activity against Plasmodium falciparum. Only 15 molecular descriptors were used to build the classification models for the antimalarial activities of 4750 compounds, which were divided into a training set (3887 compounds) and a test set (863 compounds). For the SVM model, its prediction accuracies are 89.5% for the training set and 87.3% for the test set. For the GRNN model, the prediction accuracies for the two sets are 99.7% and 88.9%, respectively. Both SVC and GRNN models have better prediction ability than the classification model based on binary logistic regression (BLR) analysis. Compared with previously published classification models both SVC and GRNN models are satisfactory in predicting antimalarial activities of compounds with in addition of fewer descriptors.
Collapse
Affiliation(s)
- Q Liu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| | - J Deng
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration, College of Materials and Chemical Engineering, Hunan Institute of Engineering, Xiangtan, China
| | - M Liu
- School of Chemistry and Materials Engineering, Huizhou University, Huizhou, PR China
| |
Collapse
|
23
|
Chen X, Dang L, Yang H, Huang X, Yu X. Machine learning-based prediction of toxicity of organic compounds towards fathead minnow. RSC Adv 2020; 10:36174-36180. [PMID: 35517078 PMCID: PMC9056962 DOI: 10.1039/d0ra05906d] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 09/14/2020] [Indexed: 01/19/2023] Open
Abstract
Predicting the acute toxicity of a large dataset of diverse chemicals against fathead minnows (Pimephales promelas) is challenging. In this paper, 963 organic compounds with acute toxicity towards fathead minnows were split into a training set (482 compounds) and a test set (481 compounds) with an approximate ratio of 1 : 1. Only six molecular descriptors were used to establish the quantitative structure–activity/toxicity relationship (QSAR/QSTR) model for 96 hour pLC50 through a support vector machine (SVM) along with genetic algorithm. The optimal SVM model (R2 = 0.756) was verified using both internal (leave-one-out cross-validation) and external validations. The validation results (qint2 = 0.699 and qext2 = 0.744) were satisfactory in predicting acute toxicity in fathead minnows compared with other models reported in the literature, although our SVM model has only six molecular descriptors and a large data set for the test set consisting of 481 compounds. A quantitative structure–toxicity relationship of 963 chemicals against fathead minnow was developed by using support vector machine and genetic algorithm.![]()
Collapse
Affiliation(s)
- Xingmei Chen
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration
- College of Materials and Chemical Engineering
- Hunan Institute of Engineering
- Xiangtan
- China
| | - Limin Dang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration
- College of Materials and Chemical Engineering
- Hunan Institute of Engineering
- Xiangtan
- China
| | - Hai Yang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration
- College of Materials and Chemical Engineering
- Hunan Institute of Engineering
- Xiangtan
- China
| | - Xianwei Huang
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration
- College of Materials and Chemical Engineering
- Hunan Institute of Engineering
- Xiangtan
- China
| | - Xinliang Yu
- Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration
- College of Materials and Chemical Engineering
- Hunan Institute of Engineering
- Xiangtan
- China
| |
Collapse
|