1
|
Yang X, Sun J, Jin B, Lu Y, Cheng J, Jiang J, Zhao Q, Shuai J. Multi-task aquatic toxicity prediction model based on multi-level features fusion. J Adv Res 2024:S2090-1232(24)00226-1. [PMID: 38844122 DOI: 10.1016/j.jare.2024.06.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 05/21/2024] [Accepted: 06/02/2024] [Indexed: 06/09/2024] Open
Abstract
INTRODUCTION With the escalating menace of organic compounds in environmental pollution imperiling the survival of aquatic organisms, the investigation of organic compound toxicity across diverse aquatic species assumes paramount significance for environmental protection. Understanding how different species respond to these compounds helps assess the potential ecological impact of pollution on aquatic ecosystems as a whole. Compared with traditional experimental methods, deep learning methods have higher accuracy in predicting aquatic toxicity, faster data processing speed and better generalization ability. OBJECTIVES This article presents ATFPGT-multi, an advanced multi-task deep neural network prediction model for organic toxicity. METHODS The model integrates molecular fingerprints and molecule graphs to characterize molecules, enabling the simultaneous prediction of acute toxicity for the same organic compound across four distinct fish species. Furthermore, to validate the advantages of multi-task learning, we independently construct prediction models, named ATFPGT-single, for each fish species. We employ cross-validation in our experiments to assess the performance and generalization ability of ATFPGT-multi. RESULTS The experimental results indicate, first, that ATFPGT-multi outperforms ATFPGT-single on four fish datasets with AUC improvements of 9.8%, 4%, 4.8%, and 8.2%, respectively, demonstrating the superiority of multi-task learning over single-task learning. Furthermore, in comparison with previous algorithms, ATFPGT-multi outperforms comparative methods, emphasizing that our approach exhibits higher accuracy and reliability in predicting aquatic toxicity. Moreover, ATFPGT-multi utilizes attention scores to identify molecular fragments associated with fish toxicity in organic molecules, as demonstrated by two organic molecule examples in the main text, demonstrating the interpretability of ATFPGT-multi. CONCLUSION In summary, ATFPGT-multi provides important support and reference for the further development of aquatic toxicity assessment. All of codes and datasets are freely available online at https://github.com/zhaoqi106/ATFPGT-multi.
Collapse
Affiliation(s)
- Xin Yang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China; Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jianqiang Sun
- School of Information Science and Engineering, Linyi University, Linyi 276000, China
| | - Bingyu Jin
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China
| | - Yuer Lu
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jinyan Cheng
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jiaju Jiang
- College of Life Sciences, Sichuan University, Chengdu 610064, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China.
| | - Jianwei Shuai
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China; Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325001, China.
| |
Collapse
|
2
|
Rosner A, Ballarin L, Barnay-Verdier S, Borisenko I, Drago L, Drobne D, Concetta Eliso M, Harbuzov Z, Grimaldi A, Guy-Haim T, Karahan A, Lynch I, Giulia Lionetto M, Martinez P, Mehennaoui K, Oruc Ozcan E, Pinsino A, Paz G, Rinkevich B, Spagnuolo A, Sugni M, Cambier S. A broad-taxa approach as an important concept in ecotoxicological studies and pollution monitoring. Biol Rev Camb Philos Soc 2024; 99:131-176. [PMID: 37698089 DOI: 10.1111/brv.13015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 08/23/2023] [Accepted: 08/28/2023] [Indexed: 09/13/2023]
Abstract
Aquatic invertebrates play a pivotal role in (eco)toxicological assessments because they offer ethical, cost-effective and repeatable testing options. Additionally, their significance in the food chain and their ability to represent diverse aquatic ecosystems make them valuable subjects for (eco)toxicological studies. To ensure consistency and comparability across studies, international (eco)toxicology guidelines have been used to establish standardised methods and protocols for data collection, analysis and interpretation. However, the current standardised protocols primarily focus on a limited number of aquatic invertebrate species, mainly from Arthropoda, Mollusca and Annelida. These protocols are suitable for basic toxicity screening, effectively assessing the immediate and severe effects of toxic substances on organisms. For more comprehensive and ecologically relevant assessments, particularly those addressing long-term effects and ecosystem-wide impacts, we recommended the use of a broader diversity of species, since the present choice of taxa exacerbates the limited scope of basic ecotoxicological studies. This review provides a comprehensive overview of (eco)toxicological studies, focusing on major aquatic invertebrate taxa and how they are used to assess the impact of chemicals in diverse aquatic environments. The present work supports the use of a broad-taxa approach in basic environmental assessments, as it better represents the natural populations inhabiting various ecosystems. Advances in omics and other biochemical and computational techniques make the broad-taxa approach more feasible, enabling mechanistic studies on non-model organisms. By combining these approaches with in vitro techniques together with the broad-taxa approach, researchers can gain insights into less-explored impacts of pollution, such as changes in population diversity, the development of tolerance and transgenerational inheritance of pollution responses, the impact on organism phenotypic plasticity, biological invasion outcomes, social behaviour changes, metabolome changes, regeneration phenomena, disease susceptibility and tissue pathologies. This review also emphasises the need for harmonised data-reporting standards and minimum annotation checklists to ensure that research results are findable, accessible, interoperable and reusable (FAIR), maximising the use and reusability of data. The ultimate goal is to encourage integrated and holistic problem-focused collaboration between diverse scientific disciplines, international standardisation organisations and decision-making bodies, with a focus on transdisciplinary knowledge co-production for the One-Health approach.
Collapse
Affiliation(s)
- Amalia Rosner
- Israel Oceanographic and Limnological Research, National Institute of Oceanography, PO 2336 Sha'ar Palmer 1, Haifa, 3102201, Israel
| | - Loriano Ballarin
- Department of Biology, University of Padova, via Ugo Bassi 58/B, Padova, I-35121, Italy
| | - Stéphanie Barnay-Verdier
- Sorbonne Université; CNRS, INSERM, Université Côte d'Azur, Institute for Research on Cancer and Aging Nice, 28 avenue Valombrose, Nice, F-06107, France
| | - Ilya Borisenko
- Faculty of Biology, Department of Embryology, Saint Petersburg State University, Universitetskaya embankment 7/9, Saint Petersburg, 199034, Russia
| | - Laura Drago
- Department of Biology, University of Padova, via Ugo Bassi 58/B, Padova, I-35121, Italy
| | - Damjana Drobne
- Department of Biology, Biotechnical Faculty, University of Ljubljana, Večna pot 111, Ljubljana, 1111, Slovenia
| | - Maria Concetta Eliso
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Naples, 80121, Italy
- Department of Chemical, Biological, Pharmaceutical and Environmental Sciences, University of Messina, Messina, Italy
| | - Zoya Harbuzov
- Israel Oceanographic and Limnological Research, National Institute of Oceanography, PO 2336 Sha'ar Palmer 1, Haifa, 3102201, Israel
- Leon H. Charney School of Marine Sciences, Department of Marine Biology, University of Haifa, 199 Aba Koushy Ave., Haifa, 3498838, Israel
| | - Annalisa Grimaldi
- Department of Biotechnology and Life Sciences, University of Insubria, Via J. H. Dunant, Varese, 3-21100, Italy
| | - Tamar Guy-Haim
- Israel Oceanographic and Limnological Research, National Institute of Oceanography, PO 2336 Sha'ar Palmer 1, Haifa, 3102201, Israel
| | - Arzu Karahan
- Middle East Technical University, Institute of Marine Sciences, Erdemli-Mersin, PO 28, 33731, Turkey
| | - Iseult Lynch
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham, B15 2TT, UK
| | - Maria Giulia Lionetto
- Department of Biological and Environmental Sciences and Technologies, University of Salento, via prov. le Lecce -Monteroni, Lecce, I-73100, Italy
- NBFC, National Biodiversity Future Center, Piazza Marina, 61, Palermo, I-90133, Italy
| | - Pedro Martinez
- Department de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Av. Diagonal 643, Barcelona, 08028, Spain
- Institut Català de Recerca i Estudis Avançats (ICREA), Passeig de Lluís Companys, Barcelona, 08010, Spain
| | - Kahina Mehennaoui
- Environmental Research and Innovation (ERIN) Department, Luxembourg Institute of Science and Technology (LIST), 41, rue du Brill, Belvaux, L-4422, Luxembourg
| | - Elif Oruc Ozcan
- Faculty of Arts and Science, Department of Biology, Cukurova University, Balcali, Saricam, Adana, 01330, Turkey
| | - Annalisa Pinsino
- National Research Council, Institute of Translational Pharmacology (IFT), National Research Council (CNR), Via Ugo La Malfa 153, Palermo, 90146, Italy
| | - Guy Paz
- Israel Oceanographic and Limnological Research, National Institute of Oceanography, PO 2336 Sha'ar Palmer 1, Haifa, 3102201, Israel
| | - Baruch Rinkevich
- Israel Oceanographic and Limnological Research, National Institute of Oceanography, PO 2336 Sha'ar Palmer 1, Haifa, 3102201, Israel
| | - Antonietta Spagnuolo
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Naples, 80121, Italy
| | - Michela Sugni
- Department of Environmental Science and Policy, University of Milan, Via Celoria 26, Milan, 20133, Italy
| | - Sébastien Cambier
- Environmental Research and Innovation (ERIN) Department, Luxembourg Institute of Science and Technology (LIST), 41, rue du Brill, Belvaux, L-4422, Luxembourg
| |
Collapse
|
3
|
Zubrod JP, Galic N, Vaugeois M, Dreier DA. Physiological variables in machine learning QSARs allow for both cross-chemical and cross-species predictions. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 263:115250. [PMID: 37487435 DOI: 10.1016/j.ecoenv.2023.115250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 06/23/2023] [Accepted: 07/09/2023] [Indexed: 07/26/2023]
Abstract
A major challenge in ecological risk assessment is estimating chemical-induced effects across taxa without species-specific testing. Where ecotoxicological data may be more challenging to gather, information on species physiology is more available for a broad range of taxa. Physiology is known to drive species sensitivity but understanding about the relative contribution of specific underlying processes is still elusive. Consequently, there remains a need to understand which physiological processes lead to differences in species sensitivity. The objective of our study was to utilize existing knowledge about organismal physiology to both understand and predict differences in species sensitivity. Machine learning models were trained to predict chemical- and species-specific endpoints as a function of both chemical fingerprints/descriptors and physiological properties represented by dynamic energy budget (DEB) parameters. We found that random forest models were able to predict chemical- and species-specific endpoints, and that DEB parameters were relatively important in the models, particularly for invertebrates. Our approach illuminates how physiological properties may drive species sensitivity, which will allow more realistic predictions of effects across species without the need for additional animal testing.
Collapse
Affiliation(s)
| | - Nika Galic
- Syngenta Crop Protection AG, Basel, Switzerland
| | - Maxime Vaugeois
- Syngenta Crop Protection, LLC, Greensboro, NC, United States
| | - David A Dreier
- Syngenta Crop Protection, LLC, Greensboro, NC, United States.
| |
Collapse
|
4
|
Scott-Fordsmand JJ, Amorim MJB. Using Machine Learning to make nanomaterials sustainable. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 859:160303. [PMID: 36410486 DOI: 10.1016/j.scitotenv.2022.160303] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 11/06/2022] [Accepted: 11/15/2022] [Indexed: 06/16/2023]
Abstract
Sustainable development is a key challenge for contemporary human societies; failure to achieve sustainability could threaten human survival. In this review article, we illustrate how Machine Learning (ML) could support more sustainable development, covering the basics of data gathering through each step of the Environmental Risk Assessment (ERA). The literature provides several examples showing how ML can be employed in most steps of a typical ERA.A key observation is that there are currently no clear guidance for using such autonomous technologies in ERAs or which standards/checks are required. Steering thus seems to be the most important task for supporting the use of ML in the ERA of nano- and smart-materials. Resources should be devoted to developing a strategy for implementing ML in ERA with a strong emphasis on data foundations, methodologies, and the related sensitivities/uncertainties. We should recognise historical errors and biases (e.g., in data) to avoid embedding them during ML programming.
Collapse
Affiliation(s)
| | - Mónica J B Amorim
- Department of Biology & CESAM, University of Aveiro, 3810-193 Aveiro, Portugal.
| |
Collapse
|
5
|
de Sá AGC, Long Y, Portelli S, Pires DEV, Ascher DB. toxCSM: comprehensive prediction of small molecule toxicity profiles. Brief Bioinform 2022; 23:6673851. [PMID: 35998885 DOI: 10.1093/bib/bbac337] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 07/17/2022] [Accepted: 07/23/2022] [Indexed: 01/29/2023] Open
Abstract
Drug discovery is a lengthy, costly and high-risk endeavour that is further convoluted by high attrition rates in later development stages. Toxicity has been one of the main causes of failure during clinical trials, increasing drug development time and costs. To facilitate early identification and optimisation of toxicity profiles, several computational tools emerged aiming at improving success rates by timely pre-screening drug candidates. Despite these efforts, there is an increasing demand for platforms capable of assessing both environmental as well as human-based toxicity properties at large scale. Here, we present toxCSM, a comprehensive computational platform for the study and optimisation of toxicity profiles of small molecules. toxCSM leverages on the well-established concepts of graph-based signatures, molecular descriptors and similarity scores to develop 36 models for predicting a range of toxicity properties, which can assist in developing safer drugs and agrochemicals. toxCSM achieved an Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) of up to 0.99 and Pearson's correlation coefficients of up to 0.94 on 10-fold cross-validation, with comparable performance on blind test sets, outperforming all alternative methods. toxCSM is freely available as a user-friendly web server and API at http://biosig.lab.uq.edu.au/toxcsm.
Collapse
Affiliation(s)
- Alex G C de Sá
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland, 4072, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville, Victoria, 3052, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, 3004, Australia.,Baker Department of Cardiometabolic Health, University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Yangyang Long
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville, Victoria, 3052, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, 3004, Australia.,School of Computing and Information Systems, University of Melbourne, Parkville, Victoria, 3052, Australia
| | - Stephanie Portelli
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland, 4072, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville, Victoria, 3052, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, 3004, Australia
| | - Douglas E V Pires
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville, Victoria, 3052, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, 3004, Australia.,School of Computing and Information Systems, University of Melbourne, Parkville, Victoria, 3052, Australia
| | - David B Ascher
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland, 4072, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville, Victoria, 3052, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, 3004, Australia.,Baker Department of Cardiometabolic Health, University of Melbourne, Parkville, Victoria, 3010, Australia
| |
Collapse
|
6
|
Wang X, Li F, Chen J, Teng Y, Ji C, Wu H. Critical features identification for chemical chronic toxicity based on mechanistic forecast models. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 307:119584. [PMID: 35688391 DOI: 10.1016/j.envpol.2022.119584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 05/03/2022] [Accepted: 06/03/2022] [Indexed: 06/15/2023]
Abstract
Facing billions of tons of pollutants entering the ocean each year, aquatic toxicity is becoming a crucial endpoint for evaluating chemical adverse effects on ecosystems. Notably, huge amount of toxic chemicals at environmental relevant doses can cause potential adverse effects. However, chronic aquatic toxicity effects of chemicals are much scarcer, especially at population level. Rotifers are highly sensitive to toxicants even at chronic low-doses and their communities are usually considered as effective indicators for assessing the status of aquatic ecosystems. Therefore, the no observed effect concentration (NOEC) for population abundance of rotifers were selected as endpoints to develop machine learning models for the prediction of chemical aquatic chronic toxicity. In this study, forty-eight binary models were built by eight types of chemical descriptors combined with six machine learning algorithms. The best binary model was 1D & 2D molecular descriptors - random trees model (RT) with high balanced accuracy (BA) (0.83 for training and 0.83 for validation set), and Matthews correlation coefficient (MCC) (0.72 for training set and 0.67 for validation set). Moreover, the optimal model identified the primary factors (SpMAD_Dzp, AMW, MATS2v) and filtered out three high alerting substructures [c1cc(Cl)cc1, CNCO, CCOP(=S)(OCC)O] influencing the chronic aquatic toxicity. These results showed that the compounds with low molecular volume, high polarity and molecular weight could contribute to adverse effects on rotifers, facilitating the deeper understanding of chronic toxicity mechanisms. In addition, forecast models had better performances than the common models embedded into ECOSAR software. This study provided insights into structural features responsible for the toxicity of different groups of chemicals and thereby allowed for the rational design of green and safer alternatives.
Collapse
Affiliation(s)
- Xiaoqing Wang
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China
| | - Fei Li
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China.
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology, Dalian University of Technology, Linggong Road 2, Dalian, 116024, China
| | - Yuefa Teng
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; University of Chinese Academy of Sciences, Beijing, 100049, PR China
| | - Chenglong Ji
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China
| | - Huifeng Wu
- CAS Key Laboratory of Coastal Environmental Processes and Ecological Remediation, Yantai Institute of Coastal Zone Research (YIC), Chinese Academy of Sciences (CAS), Shandong Key Laboratory of Coastal Environmental Processes, YICCAS, Yantai, 264003, PR China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, 266071, PR China
| |
Collapse
|
7
|
Xu M, Yang H, Liu G, Tang Y, Li W. In Silico Prediction of Chemical Aquatic Toxicity by Multiple Machine Learning and Deep Learning Approaches. J Appl Toxicol 2022; 42:1766-1776. [PMID: 35653511 DOI: 10.1002/jat.4354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 05/16/2022] [Accepted: 05/31/2022] [Indexed: 11/08/2022]
Abstract
Fish is one of the model animals used to evaluate the adverse effects of a chemical exposed to the ecosystem. However, its low throughput and relevantly high expense make it impossible to test all new chemicals in manufacture. Hence using in silico models to prioritize compounds to be tested has been widely applied in environmental risk assessment and drug discovery. In this study, we constructed the local predictive models for four fish species, including bluegill sunfish, rainbow trout, fathead minnow, and sheepshead minnow, and the global models with all four fish data. A total of 1874 unique compounds with their labels, i.e. toxic (LC50 < 10 ppm) or nontoxic were collected from ECOTOX and literature. Both conventional machine learning methods and the deep learning architecture, graph convolutional network (GCN), were used to build predictive models. The classification accuracy of the best local model for each fish species was higher than 0.83. For the global models, two strategies including consistency prediction and probability threshold were adopted to improve the predictive capability at the cost of limiting applicability domain. For 63% of compounds in domain, the accuracy was around 0.97. By comparison of the deep learning and machine learning methods, we found that the single-task GCN showed specific advantages in performance and multi-task GCN showed no advantages over the conventional machine learning methods. The data and models are available on GitHub (https://github.com/ChemPredict/ChemicalAquaticToxicity).
Collapse
Affiliation(s)
- Minjie Xu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
8
|
Zhang X, Zhao P, Wang Z, Xu X, Liu G, Tang Y, Li W. In Silico Prediction of CYP2C8 Inhibition with Machine-Learning Methods. Chem Res Toxicol 2021; 34:1850-1859. [PMID: 34255486 DOI: 10.1021/acs.chemrestox.1c00078] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Cytochrome P450 2C8 (CYP2C8) is a major drug-metabolizing enzyme in humans and is responsible for the metabolism of ∼5% drugs in clinical use. Thus, inhibition of CYP2C8, which causes potential adverse drug events, cannot be neglected. The in vitro drug interaction studies guidelines for industry issued by the FDA also point out that it needs to be determined whether investigated drugs are CYP2C8 inhibitors before clinical trials. However, current studies mainly focus on predicting the inhibitors of other major P450 enzymes, and the importance of CYP2C8 inhibition has been overlooked. Therefore, there is a need to develop models for identifying potential CYP2C8 inhibition. In this study, in silico classification models for predicting CYP2C8 inhibition were built by five machine-learning methods combined with nine molecular fingerprints. The performance of the models built was evaluated by test and external validation sets. The best model had AUC values of 0.85 and 0.90 for the test and external validation sets, respectively. The applicability domain was analyzed based on the molecular similarity and exhibited an impact on the improvement of prediction accuracy. Furthermore, several representative privileged substructures such as 1H-benzo[d]imidazole, 1-phenyl-1H-pyrazole, and quinoline were identified by information gain and substructure frequency analysis. Overall, our results would be helpful for the prediction of CYP2C8 inhibition.
Collapse
Affiliation(s)
- Xiaoxiao Zhang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Piaopiao Zhao
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Zhiyuan Wang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Xuan Xu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
| |
Collapse
|
9
|
Singh AK, Bilal M, Iqbal HMN, Raj A. Trends in predictive biodegradation for sustainable mitigation of environmental pollutants: Recent progress and future outlook. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 770:144561. [PMID: 33736422 DOI: 10.1016/j.scitotenv.2020.144561] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 12/13/2020] [Accepted: 12/13/2020] [Indexed: 02/05/2023]
Abstract
The feasibility of in-silico techniques, together with the computational framework, has been applied to predictive bioremediation aiming to clean-up contaminants, toxicity evaluation, and possibilities for the degradation of complex recalcitrant compounds. Emerging contaminants from different industries have posed a significant hazard to the environment and public health. Given current bioremediation strategies, it is often a failure or inadequate for sustainable mitigation of hazardous pollutants. However, clear-cut vital information about biodegradation is quite incomplete from a conventional remediation techniques perspective. Lacking complete information on bio-transformed compounds leads to seeking alternative methods. Only scarce information about the transformed products and toxicity profile is available in the published literature. To fulfill this literature gap, various computational or in-silico technologies have emerged as alternating techniques, which are being recognized as in-silico approaches for bioremediation. Molecular docking, molecular dynamics simulation, and biodegradation pathways predictions are the vital part of predictive biodegradation, including the Quantitative Structure-Activity Relationship (QSAR), Quantitative structure-biodegradation relationship (QSBR) model system. Furthermore, machine learning (ML), artificial neural network (ANN), genetic algorithm (GA) based programs offer simultaneous biodegradation prediction along with toxicity and environmental fate prediction. Herein, we spotlight the feasibility of in-silico remediation approaches for various persistent, recalcitrant contaminants while traditional bioremediation fails to mitigate such pollutants. Such could be addressed by exploiting described model systems and algorithm-based programs. Furthermore, recent advances in QSAR modeling, algorithm, and dedicated biodegradation prediction system have been summarized with unique attributes.
Collapse
Affiliation(s)
- Anil Kumar Singh
- Environmental Microbiology Laboratory, Environmental Toxicology Group, CSIR-Indian Institute of Toxicology Research (CSIR-IITR), Vishvigyan Bhawan, 31, Mahatma Gandhi Marg, Lucknow 226001, Uttar Pradesh, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Muhammad Bilal
- School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huaian 223003, China
| | - Hafiz M N Iqbal
- Tecnologico de Monterrey, School of Engineering and Sciences, Monterrey 64849, Mexico.
| | - Abhay Raj
- Environmental Microbiology Laboratory, Environmental Toxicology Group, CSIR-Indian Institute of Toxicology Research (CSIR-IITR), Vishvigyan Bhawan, 31, Mahatma Gandhi Marg, Lucknow 226001, Uttar Pradesh, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
| |
Collapse
|
10
|
Zhao P, Peng Y, Xu X, Wang Z, Wu Z, Li W, Tang Y, Liu G. In silico prediction of mitochondrial toxicity of chemicals using machine learning methods. J Appl Toxicol 2021; 41:1518-1526. [PMID: 33469990 DOI: 10.1002/jat.4141] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 12/15/2020] [Accepted: 12/30/2020] [Indexed: 12/16/2022]
Abstract
Mitochondria are important organelles in human cells, providing more than 95% of the energy. However, some drugs and environmental chemicals could induce mitochondrial dysfunction, which might cause complex diseases and even worsen the condition of patients with mitochondrial damage. Some drugs have been withdrawn from the market due to their severe mitochondrial toxicity, such as troglitazone. Therefore, there is an urgent need to develop models that could accurately predict the mitochondrial toxicity of chemicals. In this paper, suitable data were obtained from literature and databases first. Then nine types of fingerprints were used to characterize these compounds. Finally, different algorithms were used to build models. Meanwhile, the applicability domain of the prediction models was defined. We have also explored the structural alerts of mitochondrial toxicity, which would be helpful for medicinal chemists to better predict mitochondrial toxicity and further optimize lead compounds.
Collapse
Affiliation(s)
- Piaopiao Zhao
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Yayuan Peng
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Xuan Xu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Zhiyuan Wang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Zengrui Wu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
11
|
Shan X, Wang X, Li CD, Chu Y, Zhang Y, Xiong Y, Wei DQ. Prediction of CYP450 Enzyme–Substrate Selectivity Based on the Network-Based Label Space Division Method. J Chem Inf Model 2019; 59:4577-4586. [DOI: 10.1021/acs.jcim.9b00749] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Affiliation(s)
- Xiaoqi Shan
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xiangeng Wang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Cheng-dong Li
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yanyi Chu
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yufang Zhang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yi Xiong
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, and Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
- Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan
District, Shenzhen, Guangdong 518055, China
| |
Collapse
|