1
|
Karaduman G, Kelleci Çelik F. Towards safer pesticide management: A quantitative structure-activity relationship based hazard prediction model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 916:170173. [PMID: 38266732 DOI: 10.1016/j.scitotenv.2024.170173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Revised: 01/07/2024] [Accepted: 01/13/2024] [Indexed: 01/26/2024]
Abstract
Pesticides are recognized as common environmental contaminants. The potential pesticide hazard to non-target organisms, including various mammal species, is a global concern. The global problem requires a comprehensive risk assessment. To assess the toxic effects of pesticides at the early stage, a toxicological risk analysis is conducted to determine pesticide hazard levels. World Health Organization (WHO) has established five pesticide hazard classes based on lethal dose (LD50) values to perform these assessments. In this paper, we have developed one-vs-all quantitative structure-activity relationship (OvA-QSAR) models using five machine-learning techniques with the selected optimum molecular descriptors. Descriptor selection was conducted based on correlation to evaluate the relevance and significance of individual features in our dataset. Our OvA-QSAR model was built using a dataset obtained from the WHO, covering a wide range of chemical pesticides. These models can predict the hazard category for a pesticide within the five available categories. Notably, our experiments demonstrate the outstanding performance and robustness of the Random Forest (RF) model in addressing the challenge of multi-class classification with the selected descriptors.
Collapse
Affiliation(s)
- Gül Karaduman
- Karamanoğlu Mehmetbey University, Vocational School of Health Services, 70200 Karaman, Turkey; University of Texas at Arlington, Department of Mathematics, Arlington, TX 76019-0408, USA.
| | - Feyza Kelleci Çelik
- Karamanoğlu Mehmetbey University, Vocational School of Health Services, 70200 Karaman, Turkey.
| |
Collapse
|
2
|
Kumar A, Kumar V, Ojha PK, Roy K. Chronic aquatic toxicity assessment of diverse chemicals on Daphnia magna using QSAR and chemical read-across. Regul Toxicol Pharmacol 2024; 148:105572. [PMID: 38325631 DOI: 10.1016/j.yrtph.2024.105572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 01/06/2024] [Accepted: 01/26/2024] [Indexed: 02/09/2024]
Abstract
We have modeled here chronic Daphnia toxicity taking pNOEC (negative logarithm of no observed effect concentration in mM) and pEC50 (negative logarithm of half-maximal effective concentration in mM) as endpoints using QSAR and chemical read-across approaches. The QSAR models were developed by strictly obeying the OECD guidelines and were found to be reliable, predictive, accurate, and robust. From the selected features in the developed models, we have found that an increase in lipophilicity and saturation, the presence of electrophilic or electronegative or heavy atoms, the presence of sulphur, amine, and their related functionality, an increase in mean atomic polarizability, and higher number of (thio-) carbamates (aromatic) groups are responsible for chronic toxicity. Therefore, this information might be useful for the development of environmentally friendly and safer chemicals and data-gap filling as well as reducing the use of identified toxic chemicals which have chronic toxic effects on aquatic ecosystems. Approved classes of drugs from DrugBank databases and diverse groups of chemicals from the Chemical and Product Categories (CPDat) database were also assessed through the developed models.
Collapse
Affiliation(s)
- Ankur Kumar
- Drug Discovery and Development (DDD) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Vinay Kumar
- Drug Theoretics and Cheminformatics (DTC) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Probir Kumar Ojha
- Drug Discovery and Development (DDD) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics (DTC) Laboratory, Department of Pharmaceutical Technology, Jadavpur University, Kolkata, 700032, India.
| |
Collapse
|
3
|
Gao YY, Zhao W, Huang YQ, Kumar V, Zhang X, Hao GF. In silico environmental risk assessment improves efficiency for pesticide safety management. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 908:167878. [PMID: 37858821 DOI: 10.1016/j.scitotenv.2023.167878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 10/09/2023] [Accepted: 10/14/2023] [Indexed: 10/21/2023]
Abstract
Pesticides are indispensable to maintain crop quality and food production worldwide, but their use also poses environmental risks. Pesticide risk assessment involves a series of complex, expensive and time-consuming toxicity tests. To improve the efficiency and accuracy for assessing the environmental impact of pesticides, numerous computational tools have been developed. However, there is a notable deficiency in critical analysis or a systematic summary of environmental risk assessment tools and their applicable contexts. Here, many of the current approaches and tools for assessing environmental risks posed by pesticides are reviewed, and the question of whether these tools are fit for use on complex multicomponent scenarios is discussed. We analyze the adaptations of these tools to aquatic and terrestrial ecosystems, followed by the provision of resources for predicting pesticide concentrations in environmental medias, including air, soil and water. The successful application of computational tools for risk assessment and interpretation of predicted results will also be discussed. This assessment serves as a valuable resource, enabling scientists to utilize suitable models to enhance the robustness of pesticides risk assessments.
Collapse
Affiliation(s)
- Yang-Yang Gao
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Wei Zhao
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Yuan-Qin Huang
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Vinit Kumar
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Xiao Zhang
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Ge-Fei Hao
- National Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang 550025, PR China; National Key Laboratory of Green Pesticide, Central China Normal University, Wuhan 430079, PR China.
| |
Collapse
|
4
|
Schlender T, Viljanen M, van Rijn JN, Mohr F, Peijnenburg WJGM, Hoos HH, Rorije E, Wong A. The Bigger Fish: A Comparison of Meta-Learning QSAR Models on Low-Resourced Aquatic Toxicity Regression Tasks. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:17818-17830. [PMID: 37315216 PMCID: PMC10666535 DOI: 10.1021/acs.est.3c00334] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 05/25/2023] [Accepted: 05/25/2023] [Indexed: 06/16/2023]
Abstract
Toxicological information as needed for risk assessments of chemical compounds is often sparse. Unfortunately, gathering new toxicological information experimentally often involves animal testing. Simulated alternatives, e.g., quantitative structure-activity relationship (QSAR) models, are preferred to infer the toxicity of new compounds. Aquatic toxicity data collections consist of many related tasks─each predicting the toxicity of new compounds on a given species. Since many of these tasks are inherently low-resource, i.e., involve few associated compounds, this is challenging. Meta-learning is a subfield of artificial intelligence that can lead to more accurate models by enabling the utilization of information across tasks. In our work, we benchmark various state-of-the-art meta-learning techniques for building QSAR models, focusing on knowledge sharing between species. Specifically, we employ and compare transformational machine learning, model-agnostic meta-learning, fine-tuning, and multi-task models. Our experiments show that established knowledge-sharing techniques outperform single-task approaches. We recommend the use of multi-task random forest models for aquatic toxicity modeling, which matched or exceeded the performance of other approaches and robustly produced good results in the low-resource settings we studied. This model functions on a species level, predicting toxicity for multiple species across various phyla, with flexible exposure duration and on a large chemical applicability domain.
Collapse
Affiliation(s)
- Thalea Schlender
- Leiden
Institute of Advanced Computer Science, Leiden University, Leiden 2333 CA, The Netherlands
- National
Institute for Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| | - Markus Viljanen
- National
Institute for Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| | - Jan N. van Rijn
- Leiden
Institute of Advanced Computer Science, Leiden University, Leiden 2333 CA, The Netherlands
| | - Felix Mohr
- Universidad
de La Sabana, Chía 250001, Colombia
| | - Willie JGM. Peijnenburg
- National
Institute for Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
- Institute
of Environmental Sciences, Leiden University, Leiden 2333 CC, The Netherlands
| | - Holger H. Hoos
- Leiden
Institute of Advanced Computer Science, Leiden University, Leiden 2333 CA, The Netherlands
- Chair
for AI Methodology, RWTH Aaachen University, Aachen 52056, Germany
- Department
of Computer Science, The University of British
Columbia, Vancouver V6T 1Z4, Canada
| | - Emiel Rorije
- National
Institute for Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| | - Albert Wong
- National
Institute for Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| |
Collapse
|
5
|
Viljanen M, Minnema J, Wassenaar PNH, Rorije E, Peijnenburg W. What is the ecotoxicity of a given chemical for a given aquatic species? Predicting interactions between species and chemicals using recommender system techniques. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:765-788. [PMID: 37670728 DOI: 10.1080/1062936x.2023.2254225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 08/27/2023] [Indexed: 09/07/2023]
Abstract
Ecotoxicological safety assessment of chemicals requires toxicity data on multiple species, despite the general desire of minimizing animal testing. Predictive models, specifically machine learning (ML) methods, are one of the tools capable of solving this apparent contradiction as they allow to generalize toxicity patterns across chemicals and species. However, despite the availability of large public toxicity datasets, the data is highly sparse, complicating model development. The aim of this study is to provide insights into how ML can predict toxicity using a large but sparse dataset. We developed models to predict LC50-values, based on experimental LC50-data covering 2431 organic chemicals and 1506 aquatic species from the ECOTOX-database. Several well-known ML techniques were evaluated and a new ML model was developed, inspired by recommender systems. This new model involves a simple linear model that learns low-rank interactions between species and chemicals using factorization machines. We evaluated the predictive performances of the developed models based on two validation settings: 1) predicting unseen chemical-species pairs, and 2) predicting unseen chemicals. The results of this study show that ML models can accurately predict LC50-values in both validation settings. Moreover, we show that the novel factorization machine approach can match well-tuned, complex, ML approaches.
Collapse
Affiliation(s)
- M Viljanen
- Department of Statistics, Data Science and Modelling, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - J Minnema
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - P N H Wassenaar
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - E Rorije
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - W Peijnenburg
- Center for Safety of Substances and Products, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
- Institute of Environmental Sciences (CML), Leiden University, Leiden, The Netherlands
| |
Collapse
|
6
|
Diéguez-Santana K, Nachimba-Mayanchi MM, Puris A, Gutiérrez RT, González-Díaz H. Prediction of acute toxicity of pesticides for Americamysis bahia using linear and nonlinear QSTR modelling approaches. ENVIRONMENTAL RESEARCH 2022; 214:113984. [PMID: 35981614 DOI: 10.1016/j.envres.2022.113984] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 06/19/2022] [Accepted: 07/22/2022] [Indexed: 06/15/2023]
Abstract
Globally, pesticides are toxic substances with wide applications. However, the widespread use of pesticides has received increasing attention from regulatory agencies due to their various acute and chronic effects on multiple organisms. In this study, Quantitative Structure-Toxicity Relationship (QSTR) models were established using Multiple Linear Regression (MLR) and five Machine Learning (ML) algorithms to predict pesticide toxicity in Americamysis bahia. The most influential descriptors included in the MLR model are RBF, JGI2, nCbH, nRCOOR, nRSR, nPO4 and 'Cl-090', with positive contributions to the dependent variable (negative decimal logarithm of median lethal concentration at 96-h). The Random Forest (RF) regression model was superior amongst the five ML models. We observed higher values of R2 (0.812) and lower values of RMSE (0.595) and MAE (0.462) in the cross-validation training set and external validation set. Similarly, this study had a high level of fitness and was internally robust and externally predictive compared to models presented in similar studies. The results suggest that the developed QSTR models are suitable for reliably predicting the aquatic toxicity of structurally diverse pesticides and can be used for screening, prioritising new pesticides, filling data gaps and overcoming the limitations of in vivo and in vitro tests.
Collapse
Affiliation(s)
- Karel Diéguez-Santana
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, 48940, Leioa, Spain; Universidad Regional Amazónica Ikiam, Tena, Ecuador.
| | | | - Amilkar Puris
- Facultad de Ciencias de la Ingeniería, Universidad Técnica Estatal de Quevedo, Ecuador
| | | | - Humberto González-Díaz
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, 48940, Leioa, Spain; Basque Center for Biophysics CSIC-UPVEH, University of Basque Country UPV/EHU, 48940, Leioa, Spain; IKERBASQUE, Basque Foundation for Science, 48011, Bilbao, Biscay, Spain
| |
Collapse
|
7
|
Chen D, Hao G, Song B. Finding the Missing Property Concepts in Pesticide-Likeness. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:10090-10099. [PMID: 35971945 DOI: 10.1021/acs.jafc.2c02757] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Poor bioavailability of pesticides is one of the major bottlenecks in the development of pesticides. Applications of the concept of pesticide-likeness have been widely accepted as one of the ways to break the bottleneck. At present, the evaluation of pesticide-likeness is mainly based on absorption, distribution, metabolism, excretion, and toxicity (ADME-T) property concepts of pesticides. However, a few significant property concepts of pesticides are ignored in the research of pesticide-likeness. Herein, we summarize the current study of ADME-T and other property concepts and analyze physicochemical properties for pesticides in the last 30 years, such as Fsp3, log P, and chiral centers. On the basis of these analyses, we propose that molecular complexity and residual property concepts of pesticides should be considered in the pesticide-likeness study. We hope that this work can help pesticide researchers and students, who are less knowledgeable in the field, to assess pesticide-likeness.
Collapse
Affiliation(s)
- Dongyu Chen
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang, Guizhou 550025, People's Republic of China
| | - Gefei Hao
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang, Guizhou 550025, People's Republic of China
| | - Baoan Song
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for Research and Development of Fine Chemicals, Guizhou University, Guiyang, Guizhou 550025, People's Republic of China
| |
Collapse
|
8
|
Larras F, Charles S, Chaumot A, Pelosi C, Le Gall M, Mamy L, Beaudouin R. A critical review of effect modeling for ecological risk assessment of plant protection products. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:43448-43500. [PMID: 35391640 DOI: 10.1007/s11356-022-19111-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 02/03/2022] [Indexed: 06/14/2023]
Abstract
A wide diversity of plant protection products (PPP) is used for crop protection leading to the contamination of soil, water, and air, which can have ecotoxicological impacts on living organisms. It is inconceivable to study the effects of each compound on each species from each compartment, experimental studies being time consuming and cost prohibitive, and animal testing having to be avoided. Therefore, numerous models are developed to assess PPP ecotoxicological effects. Our objective was to provide an overview of the modeling approaches enabling the assessment of PPP effects (including biopesticides) on the biota. Six categories of models were inventoried: (Q)SAR, DR and TKTD, population, multi-species, landscape, and mixture models. They were developed for various species (terrestrial and aquatic vertebrates and invertebrates, primary producers, micro-organisms) belonging to diverse environmental compartments, to address different goals (e.g., species sensitivity or PPP bioaccumulation assessment, ecosystem services protection). Among them, mechanistic models are increasingly recognized by EFSA for PPP regulatory risk assessment but, to date, remain not considered in notified guidance documents. The strengths and limits of the reviewed models are discussed together with improvement avenues (multigenerational effects, multiple biotic and abiotic stressors). This review also underlines a lack of model testing by means of field data and of sensitivity and uncertainty analyses. Accurate and robust modeling of PPP effects and other stressors on living organisms, from their application in the field to their functional consequences on the ecosystems at different scales of time and space, would help going toward a more sustainable management of the environment. Graphical Abstract Combination of the keyword lists composing the first bibliographic query. Columns were joined together with the logical operator AND. All keyword lists are available in Supplementary Information at https://doi.org/10.5281/zenodo.5775038 (Larras et al. 2021).
Collapse
Affiliation(s)
- Floriane Larras
- INRAE, Directorate for Collective Scientific Assessment, Foresight and Advanced Studies, Paris, 75338, France
| | - Sandrine Charles
- University of Lyon, University Lyon 1, CNRS UMR 5558, Laboratory of Biometry and Evolutionary Biology, Villeurbanne Cedex, 69622, France
| | - Arnaud Chaumot
- INRAE, UR RiverLy, Ecotoxicology laboratory, Villeurbanne, F-69625, France
| | - Céline Pelosi
- Avignon University, INRAE, UMR EMMAH, Avignon, 84000, France
| | - Morgane Le Gall
- Ifremer, Information Scientifique et Technique, Bibliothèque La Pérouse, Plouzané, 29280, France
| | - Laure Mamy
- Université Paris-Saclay, INRAE, AgroParisTech, UMR ECOSYS, Thiverval-Grignon, 78850, France
| | - Rémy Beaudouin
- Ineris, Experimental Toxicology and Modelling Unit, UMR-I 02 SEBIO, Verneuil en Halatte, 65550, France.
| |
Collapse
|
9
|
Wu J, D'Ambrosi S, Ammann L, Stadnicka-Michalak J, Schirmer K, Baity-Jesi M. Predicting chemical hazard across taxa through machine learning. ENVIRONMENT INTERNATIONAL 2022; 163:107184. [PMID: 35306252 DOI: 10.1016/j.envint.2022.107184] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 02/07/2022] [Accepted: 03/11/2022] [Indexed: 06/14/2023]
Abstract
We applied machine learning methods to predict chemical hazards focusing on fish acute toxicity across taxa. We analyzed the relevance of taxonomy and experimental setup, showing that taking them into account can lead to considerable improvements in the classification performance. We quantified the gain obtained throught the introduction of taxonomic and experimental information, compared to classification based on chemical information alone. We used our approach with standard machine learning models (K-nearest neighbors, random forests and deep neural networks), as well as the recently proposed Read-Across Structure Activity Relationship (RASAR) models, which were very successful in predicting chemical hazards to mammals based on chemical similarity. We were able to obtain accuracies of over 93% on datasets where, due to noise in the data, the maximum achievable accuracy was expected to be below 96%. The best performances were obtained by random forests and RASAR models. We analyzed metrics to compare our results with animal test reproducibility, and despite most of our models "outperform animal test reproducibility" as measured through recently proposed metrics, we showed that the comparison between machine learning performance and animal test reproducibility should be addressed with particular care. While we focused on fish mortality, our approach, provided that the right data is available, is valid for any combination of chemicals, effects and taxa.
Collapse
Affiliation(s)
- Jimeng Wu
- Eawag, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland; Department of Environmental Engineering, ETHZ, Zurich, Switzerland.
| | - Simone D'Ambrosi
- Department of Statistics, Sapienza University of Rome, Piazzale Aldo Moro, 5, 00185 Rome, RM, Italy
| | - Lorenz Ammann
- Swiss Federal Institute for Forest, Snow, and Landscape Research WSL, Zürcherstrasse 111, CH-8903 Birmensdorf, Switzerland
| | | | - Kristin Schirmer
- Eawag, Überlandstrasse 133, CH-8600 Dübendorf, Switzerland; School of Architecture, Civil and Environmental Engineering, EPFL, Lausanne, Switzerland.
| | | |
Collapse
|
10
|
Predicting the membrane permeability of organic fluorescent probes by the deep neural network based lipophilicity descriptor DeepFl-LogP. Sci Rep 2021; 11:6991. [PMID: 33772099 PMCID: PMC7997998 DOI: 10.1038/s41598-021-86460-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 03/16/2021] [Indexed: 01/17/2023] Open
Abstract
Light microscopy has become an indispensable tool for the life sciences, as it enables the rapid acquisition of three-dimensional images from the interior of living cells/tissues. Over the last decades, super-resolution light microscopy techniques have been developed, which allow a resolution up to an order of magnitude higher than that of conventional light microscopy. Those techniques require labelling of cellular structures with fluorescent probes exhibiting specific properties, which are supplied from outside and therefore have to surpass cell membranes. Currently, major efforts are undertaken to develop probes which can surpass cell membranes and exhibit the photophysical properties required for super-resolution imaging. However, the process of probe development is still based on a tedious and time consuming manual screening. An accurate computer based model that enables the prediction of the cell permeability based on their chemical structure would therefore be an invaluable asset for the development of fluorescent probes. Unfortunately, current models, which are based on multiple molecular descriptors, are not well suited for this task as they require high effort in the usage and exhibit moderate accuracy in their prediction. Here, we present a novel fragment based lipophilicity descriptor DeepFL-LogP, which was developed on the basis of a deep neural network. DeepFL-LogP exhibits excellent correlation with the experimental partition coefficient reference data (R2 = 0.892 and MSE = 0.359) of drug-like substances. Further a simple threshold permeability model on the basis of this descriptor allows to categorize the permeability of fluorescent probes with 96% accuracy. This novel descriptor is expected to largely simplify and speed up the development process for novel cell permeable fluorophores.
Collapse
|
11
|
Gajewicz-Skretna A, Kar S, Piotrowska M, Leszczynski J. The kernel-weighted local polynomial regression (KwLPR) approach: an efficient, novel tool for development of QSAR/QSAAR toxicity extrapolation models. J Cheminform 2021; 13:9. [PMID: 33579384 PMCID: PMC7881668 DOI: 10.1186/s13321-021-00484-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Accepted: 01/11/2021] [Indexed: 11/10/2022] Open
Abstract
The ability of accurate predictions of biological response (biological activity/property/toxicity) of a given chemical makes the quantitative structure‐activity/property/toxicity relationship (QSAR/QSPR/QSTR) models unique among the in silico tools. In addition, experimental data of selected species can also be used as an independent variable along with other structural as well as physicochemical variables to predict the response for different species formulating quantitative activity–activity relationship (QAAR)/quantitative structure–activity–activity relationship (QSAAR) approach. Irrespective of the models' type, the developed model's quality, and reliability need to be checked through multiple classical stringent validation metrics. Among the validation metrics, error-based metrics are more significant as the basic idea of a good predictive model is to improve the predictions' quality by lowering the predicted residuals for new query compounds. Following the concept, we have checked the predictive quality of the QSAR and QSAAR models employing kernel-weighted local polynomial regression (KwLPR) approach over the traditional linear and non-linear regression-based approaches tools such as multiple linear regression (MLR) and k nearest neighbors (kNN). Five datasets which were previously modeled using linear and non-linear regression method were considered to implement the KwPLR approach, followed by comparison of their validation metrics outcomes. For all five cases, the KwLPR based models reported better results over the traditional approaches. The present study's focus is not to develop a better or improved QSAR/QSAAR model over the previous ones, but to demonstrate the advantage, prediction power, and reliability of the KwLPR algorithm and establishing it as a novel, powerful cheminformatic tool. To facilitate the use of the KwLPR algorithm for QSAR/QSPR/QSTR/QSAAR modeling, the authors provide an in-house developed KwLPR.RMD script under the open-source R programming language. ![]()
Collapse
Affiliation(s)
- Agnieszka Gajewicz-Skretna
- Laboratory of Environmental Chemometrics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308, Gdansk, Poland.
| | - Supratik Kar
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry, Physics and Atmospheric Sciences, Jackson State University, 1400 J. R. Lynch Street, P. O. Box 17910, Jackson, MS, 39217, USA
| | - Magdalena Piotrowska
- Laboratory of Environmental Chemometrics, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308, Gdansk, Poland
| | - Jerzy Leszczynski
- Interdisciplinary Center for Nanotoxicity, Department of Chemistry, Physics and Atmospheric Sciences, Jackson State University, 1400 J. R. Lynch Street, P. O. Box 17910, Jackson, MS, 39217, USA
| |
Collapse
|
12
|
Yang L, Wang Y, Chang J, Pan Y, Wei R, Li J, Wang H. QSAR modeling the toxicity of pesticides against Americamysis bahia. CHEMOSPHERE 2020; 258:127217. [PMID: 32535437 DOI: 10.1016/j.chemosphere.2020.127217] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 05/24/2020] [Accepted: 05/24/2020] [Indexed: 06/11/2023]
Abstract
The widespread use of pesticides has received increasing attention in regulatory agencies because their extensive overuse and various adverse effects on all living organisms. Organizations such as EPA and ECHA have published laws that pesticides should be fully evaluated before bring them to market. In the present study, we evaluated the pesticides toxicity using the Quantitative Structural-Activity Relationship (QSAR) method. The models for the single class pesticides (herbicides, insecticides and fungicides) as well as the general class pesticides (the combined dataset plus some microbicides, molluscicides, etc.) were developed using the Genetic Algorithm and Multiple Linear Regression method. The internal and external validation results suggested that all the obtained models were stable and predictive. According to the modeling descriptors, the lipophilic descriptors contributed positively while all the electrotopological state descriptors showed a negative contribution, their presences in every model verified the conspicuous influence of molecular lipophilicity and hydrophilicity on the pesticides toxicity. However, the influence of topological structure descriptors was different and varies with the physiochemical information they encode. Finally, the models presented in this paper would help assess the pesticides toxicity against Americamysis bahia, shorten test time, and reduce the cost of pesticides risk assessment.
Collapse
Affiliation(s)
- Lu Yang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China; University of Chinese Academy of Sciences, Yuquan RD 19A, Beijing, 100049, China
| | - Yinghuan Wang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Jing Chang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Yifan Pan
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China; University of Chinese Academy of Sciences, Yuquan RD 19A, Beijing, 100049, China
| | - Ruojin Wei
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China; University of Chinese Academy of Sciences, Yuquan RD 19A, Beijing, 100049, China
| | - Jianzhong Li
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Huili Wang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China.
| |
Collapse
|
13
|
Wang Y, Li X, Xu C, Yang G, Wang D, Wang X, Wang Q. Toxicological interactions of cadmium and four pesticides on early life stage of rare minnow (Gobiocypris rarus). ECOTOXICOLOGY (LONDON, ENGLAND) 2020; 29:1453-1461. [PMID: 32880082 DOI: 10.1007/s10646-020-02269-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 08/10/2020] [Indexed: 06/11/2023]
Abstract
Although chemicals have been traditionally regulated on an individual basis in aquatic ecosystems, they often co-exist as different types of complex mixtures. Laboratory assays were conducted for assessing the responses of rare minnow (Gobiocypris rarus) to individual and mixture chemicals [trace element cadmium (Cd), thiamethoxam, deltamethrin, malathion and prochloraz]. Data obtained from 96 h semi-static toxicity assays implied that deltamethrin elicited the highest toxic effect on the various developmental phases (larval, juvenile and adult phases) of G. rarus with LC50 values ranging from 0.00061 to 0.25 mg a.i. L-1, followed by prochloraz, malathion and Cd with 96-h LC50 values ranging from 0.49 to 1.1, from 7.1 to 26, and from 7.6 to 15 mg a.i. L-1, respectively. Thiamethoxam elicited the lowest toxic effect on the organisms with 96-h LC50 values ranging from 38 to 202 mg a.i. L-1. Larval phase was not always the most sensitive period in the three detected phases to most of chemicals. Chemical combinations containing deltamethrin and malathion displayed synergetic responses to the larvae of G. rarus. Besides, the binary mixtures of Cd-deltamethrin and Cd-prochloraz also exhibited synergetic response to rare minnows. Our results indicate that extra information is necessary to develop practical criteria for selecting chemical combinations that require legislative attention according to their likelihood to exert synergetic responses. Thence, more investigations on mixture toxicities of various chemicals should be taken as a priority for producing synergetic interaction to improve the environmental risk assessment of chemicals.
Collapse
Affiliation(s)
- Yanhua Wang
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China
| | - Xinfang Li
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China
| | - Chao Xu
- College of Environment, Zhejiang University of Technology, Hangzhou, 310014, Zhejiang, China
| | - Guiling Yang
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China
| | - Dou Wang
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China
| | - Xinquan Wang
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China
| | - Qiang Wang
- State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-products/Key Laboratory for Pesticide Residue Detection of Ministry of Agriculture/Laboratory (Hangzhou) for Risk Assessment of Agricultural Products of Ministry of Agriculture, Institute of Quality and Standard for Agro-products, Zhejiang Academy of Agricultural Sciences, Hangzhou, 310021, Zhejiang, China.
| |
Collapse
|
14
|
Tinkov OV, Grigorev VY, Razdolsky AN, Grigoryeva LD, Dearden JC. Effect of the structural factors of organic compounds on the acute toxicity toward Daphnia magna. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2020; 31:615-641. [PMID: 32713201 DOI: 10.1080/1062936x.2020.1791250] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 06/30/2020] [Indexed: 06/11/2023]
Abstract
The acute toxicity of organic compounds towards Daphina magna was subjected to QSAR analysis. The two-dimensional simplex representation of molecular structure (2D SiRMS) and the support vector machine (SVM), gradient boosting (GBM) methods were used to develop QSAR models. Adequate regression QSAR models were developed for incubation of 24 h. Their interpretation allowed us to quantitatively describe and rank the well-known toxicophores, to refine their molecular surroundings, and to distinguish the structural derivatives of the fragments that significantly contribute to the acute toxicity (LC50) of organic compounds towards D. magna. Based on the results of the interpretation of the regression models, a molecular design (modification) of highly toxic compounds was performed in order to reduce their hazard. In addition, acceptable classification QSAR models were developed to reliably predict the following mode of action (MOA): specific and non-specific toxicity of organic compounds towards D. magna. When interpreting these models, we were able to determine the structural fragments and the physicochemical characteristics of molecules that are responsible for the manifestation of one of the modes of action. The on-line version of the OCHEM expert system (https://ochem.eu), HYBOT descriptors, and the random forest and SVM methods were used for a comparative QSAR investigation.
Collapse
Affiliation(s)
- O V Tinkov
- Department of Computer Science, Military Institute of the Ministry of Defense , Tiraspol, Moldova
| | - V Y Grigorev
- Department of Computer-aided Molecular Design, Institute of Physiologically Active Compounds of the Russian Academy of Science , Chernogolovka, Russia
| | - A N Razdolsky
- Department of Computer-aided Molecular Design, Institute of Physiologically Active Compounds of the Russian Academy of Science , Chernogolovka, Russia
| | - L D Grigoryeva
- Department of Fundamental Physicochemical Engineering, Moscow State University , Moscow, Russia
| | - J C Dearden
- School of Pharmacy and Biomolecular Sciences, Liverpool John Moores University , Liverpool, UK
| |
Collapse
|
15
|
Yang L, Wang Y, Hao W, Chang J, Pan Y, Li J, Wang H. Modeling pesticides toxicity to Sheepshead minnow using QSAR. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2020; 193:110352. [PMID: 32120163 DOI: 10.1016/j.ecoenv.2020.110352] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2019] [Revised: 02/14/2020] [Accepted: 02/15/2020] [Indexed: 06/10/2023]
Abstract
Nowadays, the environmental risk caused by the widespread use of pesticides and their ubiquitous residuals has received more and more attention in academia and regulatory agencies. Due to the large number of pesticides used in agriculture and their adverse effects on all living organisms and the numerous end-points, it is necessary to employ the in silico tools to quickly highlight hazardous pesticides. In this study, we have evaluated the toxicity of pesticides against Sheepshead minnow with the Quantitative Structure-Activity Relationship (QSAR) approach. The models for the specific-type (insecticides, herbicides and fungicides) as well as the general-type (combing all the specific-type pesticides and some microbicides, nematicides, etc.) pesticides were developed using the Genetic Algorithm and the Multiple Linear Regression method, subsequently validated with various metrics. The validation results suggested that the obtained models were highly robust, externally predictive and characterized by a broad applicability domain. Considering the modeling descriptors, the toxicity of pesticides would increase with the lipophilicity and decrease with the polarity and hydrophilicity. Most electrotopological state descriptors contribute negatively to the toxicity, while the influence of topological structure descriptors mainly depends on the physiochemical information they encode. The models proposed in this paper would be useful in filling the data gaps, prioritizing and then focusing experiments on more hazardous pesticides.
Collapse
Affiliation(s)
- Lu Yang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China; University of Chinese Academy of Sciences, Yuquan RD 19A, Beijing, 100049, China
| | - Yinghuan Wang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Weiyu Hao
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Jing Chang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Yifan Pan
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China; University of Chinese Academy of Sciences, Yuquan RD 19A, Beijing, 100049, China
| | - Jianzhong Li
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China
| | - Huili Wang
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, 18 Shuangqing Road, Haidian District, Beijing, 100085, PR China.
| |
Collapse
|
16
|
Khan PM, Roy K, Benfenati E. Chemometric modeling of Daphnia magna toxicity of agrochemicals. CHEMOSPHERE 2019; 224:470-479. [PMID: 30831498 DOI: 10.1016/j.chemosphere.2019.02.147] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 02/21/2019] [Accepted: 02/22/2019] [Indexed: 06/09/2023]
Abstract
Over the past few years, the ecotoxicological hazard potential of agrochemicals has received much attention in the industries and regulatory agencies. In the current work, we have developed quantitative structure-activity relationship (QSAR) models for Daphnia magna toxicities of different classes of agrochemicals (fungicides, herbicides, insecticides and microbiocides) individually as well as for the combined set with the application of Organization for Economic Co-operation and Development (OECD) recommended guidelines. The models for the individual data sets as well as for the combined set were generated employing only simple and interpretable two-dimensional descriptors, and subsequently strictly validated using test set compounds. The validated individual models were used to generate consensus models, with the objective to improve the prediction quality and reduced prediction errors. All the individual models of different classes of agrochemicals as well as the global set of agrochemicals showed encouraging statistical quality and prediction ability. The general observations from the derived models suggest that the toxicity increases with lipophilicity and decreases with polarity. The generated models of different classes of agrochemicals and also for the combined set should be applicable for data gap filling for new or untested agrochemical compounds falling within the applicability domain of the developed models.
Collapse
Affiliation(s)
- Pathan Mohsin Khan
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Educational and Research (NIPER), Chunilal Bhawan, 168, Manikata Main Road, 700054, Kolkata, India
| | - Kunal Roy
- Drug Theoretics and Cheminformatics Laboratory, Department of Pharmaceutical Technology, Jadavpur University, 188 Raja S C Mullick Road, 700032, Kolkata, India; Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy.
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto Di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa, 19, 20156, Milano, Italy
| |
Collapse
|
17
|
Liu L, Yang H, Cai Y, Cao Q, Sun L, Wang Z, Li W, Liu G, Lee PW, Tang Y. In silico prediction of chemical aquatic toxicity for marine crustaceans via machine learning. Toxicol Res (Camb) 2019; 8:341-352. [PMID: 31160968 PMCID: PMC6505403 DOI: 10.1039/c8tx00331a] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Accepted: 01/24/2019] [Indexed: 12/30/2022] Open
Abstract
Aquatic toxicity is a crucial endpoint for evaluating chemically adverse effects on ecosystems. Therefore, we developed in silico methods for the prediction of chemical aquatic toxicity in marine environment. At first, a diverse data set including different crustacean species was constructed. We then built local binary models using Mysidae data and global binary models using Mysidae, Palaemonidae, and Penaeidae data. Molecular fingerprints and descriptors were employed to represent chemical structures separately. All the models were built by six machine learning methods. The AUC (area under the receiver operating characteristic curve) values of the better local and global models were around 0.8 and 0.9 for the test sets, respectively. We also identified several chemicals with selective toxicity on different species. The analysis of selective toxicity would promote to design greener chemicals in a specific environment. Finally, to understand and interpret the models, we explored the relationships between chemical aquatic toxicity and the molecular descriptors. Our study would be helpful in gaining further insights into marine organisms, prediction of chemical aquatic toxicity and prioritization of environmental hazard assessment.
Collapse
Affiliation(s)
- Lin Liu
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Yingchun Cai
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Qianqian Cao
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Lixia Sun
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Zhuang Wang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Philip W Lee
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China .
| |
Collapse
|
18
|
Li F, Fan D, Wang H, Yang H, Li W, Tang Y, Liu G. In silico prediction of pesticide aquatic toxicity with chemical category approaches. Toxicol Res (Camb) 2017; 6:831-842. [PMID: 30090546 PMCID: PMC6062408 DOI: 10.1039/c7tx00144d] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Accepted: 07/27/2017] [Indexed: 01/03/2023] Open
Abstract
Aquatic toxicity is an important issue in pesticide development. In this study, using nine molecular fingerprints to describe pesticides, binary and ternary classification models were constructed to predict aquatic toxicity of pesticides via six machine learning methods: Naïve Bayes (NB), Artificial Neural Network (ANN), k-Nearest Neighbor (kNN), Classification Tree (CT), Random Forest (RF) and Support Vector Machine (SVM). For the binary models, local models were obtained with 829 pesticides on rainbow trout (RT) and 151 pesticides on lepomis (LP), and global models were constructed on the basis of 1258 diverse pesticides on RT and LP and 278 on other fish species. After analyzing the local binary models, we found that fish species caused influence in terms of accuracy. Considering the data size and predictive range, the 1258 pesticides were also used to build global ternary models. The best local binary models were Maccs_ANN for RT and Maccs_SVM for LP, which exhibited accuracies of 0.90 and 0.90, respectively. For global binary models, the best model was Graph_SVM with an accuracy of 0.89. Accuracy of the best global ternary model Graph_SVM was 0.81, which was a little lower than that of the best global binary model. In addition, several substructural alerts were identified including nitrobenzene, chloroalkene and nitrile, which could significantly correlate with pesticide aquatic toxicity. This study provides a useful tool for an early evaluation of pesticide aquatic toxicity in environmental risk assessment.
Collapse
Affiliation(s)
- Fuxing Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Defang Fan
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Hao Wang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| |
Collapse
|
19
|
Abbasitabar F, Zare-Shahabadi V. In silico prediction of toxicity of phenols to Tetrahymena pyriformis by using genetic algorithm and decision tree-based modeling approach. CHEMOSPHERE 2017; 172:249-259. [PMID: 28081509 DOI: 10.1016/j.chemosphere.2016.12.095] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2016] [Revised: 11/29/2016] [Accepted: 12/19/2016] [Indexed: 05/27/2023]
Abstract
Risk assessment of chemicals is an important issue in environmental protection; however, there is a huge lack of experimental data for a large number of end-points. The experimental determination of toxicity of chemicals involves high costs and time-consuming process. In silico tools such as quantitative structure-toxicity relationship (QSTR) models, which are constructed on the basis of computational molecular descriptors, can predict missing data for toxic end-points for existing or even not yet synthesized chemicals. Phenol derivatives are known to be aquatic pollutants. With this background, we aimed to develop an accurate and reliable QSTR model for the prediction of toxicity of 206 phenols to Tetrahymena pyriformis. A multiple linear regression (MLR)-based QSTR was obtained using a powerful descriptor selection tool named Memorized_ACO algorithm. Statistical parameters of the model were 0.72 and 0.68 for Rtraining2 and Rtest2, respectively. To develop a high-quality QSTR model, classification and regression tree (CART) was employed. Two approaches were considered: (1) phenols were classified into different modes of action using CART and (2) the phenols in the training set were partitioned to several subsets by a tree in such a manner that in each subset, a high-quality MLR could be developed. For the first approach, the statistical parameters of the resultant QSTR model were improved to 0.83 and 0.75 for Rtraining2 and Rtest2, respectively. Genetic algorithm was employed in the second approach to obtain an optimal tree, and it was shown that the final QSTR model provided excellent prediction accuracy for the training and test sets (Rtraining2 and Rtest2 were 0.91 and 0.93, respectively). The mean absolute error for the test set was computed as 0.1615.
Collapse
Affiliation(s)
- Fatemeh Abbasitabar
- Department of Chemistry, Marvdasht Branch, Islamic Azad University, Marvdasht, Iran.
| | - Vahid Zare-Shahabadi
- Department of Chemistry, Mahshahr Branch, Islamic Azad University, Mahshahr, Iran
| |
Collapse
|
20
|
Basant N, Gupta S, Singh KP. QSAR modeling for predicting reproductive toxicity of chemicals in rats for regulatory purposes. Toxicol Res (Camb) 2016; 5:1029-1038. [PMID: 30090410 PMCID: PMC6062388 DOI: 10.1039/c6tx00083e] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 04/07/2016] [Indexed: 01/08/2023] Open
Abstract
The experimental determination of multi-generation reproductive toxicity of chemicals involves high costs and a large number of animal studies over a long period of time. Computational toxicology offers possibilities to overcome such difficulties. In this study, we have established ensemble machine learning (EML) based quantitative structure-activity relationship models for predicting the reproductive toxicity potential (LOAEL) of structurally diverse chemicals in accordance with the OECD guidelines. Accordingly, decision tree forest (DTF) and decision tree boost (DTB) QSAR models were developed using a novel dataset composed of the toxicity endpoints for 334 chemicals. Relevant structural features of chemicals responsible for toxicity potential were identified and used in QSAR modeling. The generalization and prediction abilities of the constructed QSAR models were evaluated by internal and external validation procedures and by deriving several stringent statistical criteria parameters. In the test set, the two models (DTF and DTB) yielded R2 of 0.856 and 0.945, between the experimental and predicted endpoint toxicity values. The models were also evaluated for predictive use through the most recent criteria based on root mean squared error (RMSE) and mean absolute error (MAE). The values of various statistical validation coefficients derived for the test data were above their respective threshold limits and thus put a high confidence in this analysis. The applicability domains of the constructed QSAR models were defined using the leverage and standardization approaches. The results suggest that the proposed QSAR models can reliably predict the reproductive toxicity potential of diverse chemicals and can be useful tools for screening new chemicals for safety assessment.
Collapse
Affiliation(s)
| | - Shikha Gupta
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| | - Kunwar P Singh
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| |
Collapse
|
21
|
Gupta S, Basant N, Mohan D, Singh KP. Room-temperature and temperature-dependent QSRR modelling for predicting the nitrate radical reaction rate constants of organic chemicals using ensemble learning methods. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2016; 27:539-558. [PMID: 27385532 DOI: 10.1080/1062936x.2016.1199592] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 06/06/2016] [Indexed: 06/06/2023]
Abstract
Experimental determinations of the rate constants of the reaction of NO3 with a large number of organic chemicals are tedious, and time and resource intensive; and the development of computational methods has widely been advocated. In this study, we have developed room-temperature (298 K) and temperature-dependent quantitative structure-reactivity relationship (QSRR) models based on the ensemble learning approaches (decision tree forest (DTF) and decision treeboost (DTB)) for predicting the rate constant of the reaction of NO3 radicals with diverse organic chemicals, under OECD guidelines. Predictive powers of the developed models were established in terms of statistical coefficients. In the test phase, the QSRR models yielded a correlation (r(2)) of >0.94 between experimental and predicted rate constants. The applicability domains of the constructed models were determined. An attempt has been made to provide the mechanistic interpretation of the selected features for QSRR development. The proposed QSRR models outperformed the previous reports, and the temperature-dependent models offered a much wider applicability domain. This is the first report presenting a temperature-dependent QSRR model for predicting the nitrate radical reaction rate constant at different temperatures. The proposed models can be useful tools in predicting the reactivities of chemicals towards NO3 radicals in the atmosphere, hence, their persistence and exposure risk assessment.
Collapse
Affiliation(s)
- S Gupta
- a Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Lucknow , India
| | | | - D Mohan
- c School of Environmental Sciences, Jawaharlal Nehru University , New Delhi , India
| | - K P Singh
- a Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Lucknow , India
| |
Collapse
|
22
|
Basant N, Gupta S, Singh KP. In silico prediction of the developmental toxicity of diverse organic chemicals in rodents for regulatory purposes. Toxicol Res (Camb) 2016; 5:773-787. [PMID: 30090388 PMCID: PMC6061034 DOI: 10.1039/c5tx00493d] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2015] [Accepted: 02/10/2016] [Indexed: 11/21/2022] Open
Abstract
The experimental determination of the developmental toxicity potential (LEL) of chemicals is not only tedious, time and resource intensive, but it also involves unethical tests on animals. In this study, we have established quantitative structure activity relationship (QSAR) models for predicting the developmental toxicity potential of chemicals in rodents following the OECD guidelines. Accordingly, decision tree forest (DTF) and decision tree boost (DTB) based local (L-QSAR), global (G-QSAR) and interspecies quantitative structure activity-activity relationship (ISC QSAAR) models were developed for estimating the LEL (lowest effective level) dose of chemicals for developmental toxicity in rats and rabbits. The structural features of chemicals responsible for developmental toxicity in rodents were extracted and used in QSAR/QSAAR analysis. The external predictive power of the developed models was evaluated through the internal and external validation procedures. In test data, the L-QSAR models (DTF, DTB) yielded R2 values of >0.846 (rat) and >0.906 (rabbit), whereas in G-QSAR, the correlation value was >0.870 between the measured and predicted endpoint values. In ISC QSAAR models, the R2 values in test data were 0.830 (DTF) and 0.927 (DTB), respectively. Values of various statistical validation coefficients derived from the test data (except rm2 in DTF based rat L-QSAR and ISC QSAAR models) were above their respective threshold limits, thus putting a high confidence in this analysis. The prediction quality of the developed QSAR/QSAAR models was also assessed using the mean absolute error (MAE) criteria and found good. The applicability domains of the constructed models were defined using the descriptor range, leverage, and standardization approaches. The results suggest that the developed QSAR/QSAAR models can reliably predict the developmental toxicity potential of structurally diverse chemicals in rodents, generating useful toxicity data for risk assessment in humans.
Collapse
Affiliation(s)
| | - Shikha Gupta
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| | - Kunwar P Singh
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| |
Collapse
|