1
|
Meng L, Zhou B, Liu H, Chen Y, Yuan R, Chen Z, Luo S, Chen H. Advancing toxicity studies of per- and poly-fluoroalkyl substances (pfass) through machine learning: Models, mechanisms, and future directions. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 946:174201. [PMID: 38936709 DOI: 10.1016/j.scitotenv.2024.174201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 06/17/2024] [Accepted: 06/20/2024] [Indexed: 06/29/2024]
Abstract
Perfluorinated and perfluoroalkyl substances (PFASs), encompassing a vast array of isomeric chemicals, are recognized as typical emerging contaminants with direct or potential impacts on human health and the ecological environment. With the complex and elusive toxicological profiles of PFASs, machine learning (ML) has been increasingly employed in their toxicity studies due to its proficiency in prediction and data analytics. This integration is poised to become a predominant trend in environmental toxicology, propelled by the swift advancements in computational technology. This review diligently examines the literature to encapsulate the varied objectives of employing ML in the toxicity studies of PFASs: (1) Utilizing ML to establish Quantitative Structure-Activity Relationship (QSAR) models for PFASs with diverse toxicity endpoints, facilitating the targeted toxicity prediction of unidentified PFASs; (2) Investigating and substantiating the Adverse Outcome Pathway (AOP) through the synergy of ML and traditional toxicological methods, with this refining the toxicity assessment framework for PFASs; (3) Dissecting and elucidating the features of established ML models to advance Open Research into the toxicity of PFASs, with a primary focus on determinants and mechanisms. The discourse extends to an in-depth examination of ML studies, segregating findings based on their distinct application trajectories. Given that ML represents a nascent paradigm within PFASs research, this review delineates the collective challenges encountered in the ML-mediated study of PFAS toxicity and proffers strategic guidance for ensuing investigations.
Collapse
Affiliation(s)
- Lingxuan Meng
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Beihai Zhou
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Haijun Liu
- School of Resources and Environment, Anqing Normal University, Anqing, China.
| | - Yuefang Chen
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China.
| | - Rongfang Yuan
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Zhongbing Chen
- Faculty of Environmental Sciences, Czech University of Life Sciences Prague, Kamýcká 129, 16500 Praha-Suchdol, Czech Republic.
| | - Shuai Luo
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Huilun Chen
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China.
| |
Collapse
|
2
|
Sui S, Zhou N, Liu H, Watson P, Yang X. Recognizing high-priority disinfection byproducts based on experimental and predicted endocrine disrupting data: Virtual screening and in vitro study. CHEMOSPHERE 2024; 358:142239. [PMID: 38705414 DOI: 10.1016/j.chemosphere.2024.142239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 04/25/2024] [Accepted: 05/02/2024] [Indexed: 05/07/2024]
Abstract
So far, about 130 disinfection by-products (DBPs) and several DBPs-groups have had their potential endocrine-disrupting effects tested on some endocrine endpoints. However, it is still not clear which specific DBPs, DBPs-groups/subgroups may be the most toxic substances or groups/subgroups for any given endocrine endpoint. In this study, we attempt to address this issue. First, a list of relevant DBPs was updated, and 1187 DBPs belonging to 4 main-groups (aliphatic, aromatic, alicyclic, heterocyclic) and 84 subgroups were described. Then, the high-priority endocrine endpoints, DBPs-groups/subgroups, and specific DBPs were determined from 18 endpoints, 4 main-groups, 84 subgroups, and 1187 specific DBPs by a virtual-screening method. The results demonstrate that most of DBPs could not disturb the endocrine endpoints in question because the proportion of active compounds associated with the endocrine endpoints ranged from 0 (human thyroid receptor beta) to 32% (human transthyretin (hTTR)). All the endpoints with a proportion of active compounds greater than 10% belonged to the thyroid system, highlighting that the potential disrupting effects of DBPs on the thyroid system should be given more attention. The aromatic and alicyclic DBPs may have higher priority than that of aliphatic and heterocyclic DBPs by considering the activity rate and potential for disrupting effects. There were 2 (halophenols and estrogen DBPs), 12, and 24 subgroups that belonged to high, moderate, and low priority classes, respectively. For individual DBPs, there were 23 (2%), 193 (16%), and 971 (82%) DBPs belonging to the high, moderate, and low priority groups, respectively. Lastly, the hTTR binding affinity of 4 DBPs was determined by an in vitro assay and all the tested DBPs exhibited dose-dependent binding potency with hTTR, which was consistent with the predicted result. Thus, more efforts should be performed to reveal the potential endocrine disruption of those high research-priority main-groups, subgroups, and individual DBPs.
Collapse
Affiliation(s)
- Shuxin Sui
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Nan Zhou
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Huihui Liu
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
| | - Peter Watson
- Los Alamos National Laboratory, Los Alamos, 87545, New Mexico, United States
| | - Xianhai Yang
- Jiangsu Key Laboratory of Chemical Pollution Control and Resources Reuse, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China.
| |
Collapse
|
3
|
Zhang H, Yi H, Hao Y, Zhao L, Pan W, Xue Q, Liu X, Fu J, Zhang A. Deciphering exogenous chemical carcinogenicity through interpretable deep learning: A novel approach for evaluating atmospheric pollutant hazards. JOURNAL OF HAZARDOUS MATERIALS 2024; 465:133092. [PMID: 38039812 DOI: 10.1016/j.jhazmat.2023.133092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 11/07/2023] [Accepted: 11/23/2023] [Indexed: 12/03/2023]
Abstract
Cancer remains a significant global health concern, with millions of deaths attributed to it annually. Environmental pollutants play a pivotal role in cancer etiology and contribute to the growing prevalence of this disease. The carcinogenic assessment of these pollutants is crucial for chemical health evaluation and environmental risk assessments. Traditional experimental methods are expensive and time-consuming, prompting the development of alternative approaches such as in silico methods. In this regard, deep learning (DL) has shown potential but lacks optimal performance and interpretability. This study introduces an interpretable DL model called CarcGC for chemical carcinogenicity prediction, utilizing a graph convolutional neural network (GCN) that employs molecular structural graphs as inputs. Compared to existing models, CarcGC demonstrated enhanced performance, with the area under the receiver operating characteristic curve (AUCROC) reaching 0.808 on the test set. Due to air pollution is closely related to the incidence of lung cancers, we applied the CarcGC to predict the potential carcinogenicity of chemicals listed in the United States Environmental Protection Agency's Hazardous Air Pollutants (HAPs) inventory, offering a foundation for environmental carcinogenicity screening. This study highlights the potential of artificially intelligent methods in carcinogenicity prediction and underscores the value of CarcGC interpretability in revealing the structural basis and molecular mechanisms underlying chemical carcinogenicity.
Collapse
Affiliation(s)
- Huazhou Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China
| | - Hang Yi
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China
| | - Yuxing Hao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Lu Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Wenxiao Pan
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Qiao Xue
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Xian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China.
| | - Jianjie Fu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; Institute of Environment and Health, Jianghan University, Wuhan 430056, PR China
| | - Aiqian Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; Institute of Environment and Health, Jianghan University, Wuhan 430056, PR China.
| |
Collapse
|
4
|
Zhao L, Xue Q, Zhang H, Hao Y, Yi H, Liu X, Pan W, Fu J, Zhang A. CatNet: Sequence-based deep learning with cross-attention mechanism for identifying endocrine-disrupting chemicals. JOURNAL OF HAZARDOUS MATERIALS 2024; 465:133055. [PMID: 38016311 DOI: 10.1016/j.jhazmat.2023.133055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/02/2023] [Accepted: 11/20/2023] [Indexed: 11/30/2023]
Abstract
Endocrine-disrupting chemicals (EDCs) pose significant environmental and health risks due to their potential to interfere with nuclear receptors (NRs), key regulators of physiological processes. Despite the evident risks, the majority of existing research narrows its focus on the interaction between compounds and the individual NR target, neglecting a comprehensive assessment across the entire NR family. In response, this study assembled a comprehensive human NR dataset, capturing 49,244 interactions between 35,467 unique compounds and 42 NRs. We introduced a cross-attention network framework, "CatNet", innovatively integrating compound and protein representations through cross-attention mechanisms. The results showed that CatNet model achieved excellent performance with an area under the receiver operating characteristic curve (AUCROC) = 0.916 on the test set, and exhibited reliable generalization on unseen compound-NR pairs. A distinguishing feature of our research is its capacity to expand to novel targets. Beyond its predictive accuracy, CatNet offers a valuable mechanistic perspective on compound-NR interactions through feature visualization. Augmenting the utility of our research, we have also developed a graphical user interface, empowering researchers to predict chemical binding to diverse NRs. Our model enables the prediction of human NR-related EDCs and shows the potential to identify EDCs related to other targets.
Collapse
Affiliation(s)
- Lu Zhao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Qiao Xue
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China.
| | - Huazhou Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Yuxing Hao
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Hang Yi
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China
| | - Xian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Wenxiao Pan
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China
| | - Jianjie Fu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China
| | - Aiqian Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, PR China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, PR China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, PR China; School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, PR China.
| |
Collapse
|
5
|
Cao C, Wang H, Yang JR, Chen Q, Guo YM, Chen JZ. MCPNET: Development of an interpretable deep learning model based on multiple conformations of the compound for predicting developmental toxicity. Comput Biol Med 2024; 171:108037. [PMID: 38377716 DOI: 10.1016/j.compbiomed.2024.108037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Revised: 12/21/2023] [Accepted: 01/26/2024] [Indexed: 02/22/2024]
Abstract
The development of deep learning models for predicting toxicological endpoints has shown great promise, but one of the challenges in the field is the accuracy and interpretability of these models. The bioactive conformation of a compound plays a critical role for it to bind in the target. It is a big issue to figure out the bioactive conformation in deep learning without the co-crystal structure or highly precise molecular simulations. In this study, we developed a deep learning framework of Multi-Conformation Point Network (MCPNET) to construct classification and regression models, respectively, based on electrostatic potential distributions on vdW surfaces around multiple conformations of the compound using a dataset of compounds with developmental toxicity in zebrafish embryo. MCPNET applied 3D multi-conformational surface point cloud to extract the molecular features for model training, which may be critical for capturing the structural diversity of compounds. The models achieved an accuracy of 85 % on the classification task and R2 of 0.66 on the regression task, outperforming traditional machine learning models and other deep learning models. The key feature of our model is its interpretability with the component visualization to identify the factors contributing to the prediction and to understand the compound action mechanism. MCPNET may predict the conformation quietly close to the bioactive conformation of a compound by attention-based multi-conformation pooling mechanism. Our results demonstrated the potential of deep learning based on 3D molecular representations in accurately predicting developmental toxicity. The source code is publicly available at https://github.com/Superlit-CC/MCPNET.
Collapse
Affiliation(s)
- Cheng Cao
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China; Polytechnic Institute, Zhejiang University, 269 Shixiang Rd, Hangzhou, Zhejiang, 310015, China
| | - Hao Wang
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China
| | - Jin-Rong Yang
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China; Polytechnic Institute, Zhejiang University, 269 Shixiang Rd, Hangzhou, Zhejiang, 310015, China
| | - Qiang Chen
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China
| | - Ya-Min Guo
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China
| | - Jian-Zhong Chen
- College of Pharmaceutical Sciences, Zhejiang University, 866 Yuhangtang Rd., Hangzhou, Zhejiang, 310058, China.
| |
Collapse
|
6
|
Han M, Liang J, Jin B, Wang Z, Wu W, Arp HPH. Machine learning coupled with causal inference to identify COVID-19 related chemicals that pose a high concern to drinking water. iScience 2024; 27:109012. [PMID: 38352231 PMCID: PMC10863329 DOI: 10.1016/j.isci.2024.109012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 01/07/2024] [Accepted: 01/22/2024] [Indexed: 02/16/2024] Open
Abstract
Various synthetic substances were utilized in large quantities during the recent coronavirus pandemic, COVID-19. Some of these chemicals could potentially enter drinking water sources. Persistent, mobile, and toxic (PMT) substances have been recognized as a threat to drinking water resources. It has not yet been assessed how many COVID-19 related substances could be considered PMT substances. One reason is the lack of high-quality experimental data for the identification of PMT substances. To solve this problem, we applied a machine learning model to identify the PMT substances among COVID-19 related chemicals. The optimal model achieved an accuracy of 90.6% based on external test data. The model interpretation and causal inference indicated that our approach understood causation between PMT properties and molecular descriptors. Notably, the screening results showed that over 60% of the COVID-19 chemicals considered are candidate PMT substances, which should be prioritized to prevent undue pollution of water resources.
Collapse
Affiliation(s)
- Min Han
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- CAS Center for Excellence in Deep Earth Science, Guangzhou 510640, China
- University of Chinese Academy of Sciences, Beijing 10069, China
- Guangdong Provincial Key Laboratory of Environmental Protection and Resources Utilization, Guangzhou 510640, China
| | - Jun Liang
- School of Software, South China Normal University, Foshan 528225, China
| | - Biao Jin
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- CAS Center for Excellence in Deep Earth Science, Guangzhou 510640, China
- University of Chinese Academy of Sciences, Beijing 10069, China
- Guangdong Provincial Key Laboratory of Environmental Protection and Resources Utilization, Guangzhou 510640, China
| | - Ziwei Wang
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- CAS Center for Excellence in Deep Earth Science, Guangzhou 510640, China
- University of Chinese Academy of Sciences, Beijing 10069, China
| | - Wanlu Wu
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- CAS Center for Excellence in Deep Earth Science, Guangzhou 510640, China
- University of Chinese Academy of Sciences, Beijing 10069, China
| | - Hans Peter H. Arp
- Norwegian Geotechnical Institute (NGI), P.O. Box 3930 Ullevaal Stadion, N-0806 Oslo, Norway
- Norwegian University of Science and Technology (NTNU), NO-7491 Trondheim, Norway
| |
Collapse
|
7
|
Fu X, Jiang J, Wu X, Huang L, Han R, Li K, Liu C, Roy K, Chen J, Mahmoud NTA, Wang Z. Deep learning in water protection of resources, environment, and ecology: achievement and challenges. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:14503-14536. [PMID: 38305966 DOI: 10.1007/s11356-024-31963-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 01/06/2024] [Indexed: 02/03/2024]
Abstract
The breathtaking economic development put a heavy toll on ecology, especially on water pollution. Efficient water resource management has a long-term influence on the sustainable development of the economy and society. Economic development and ecology preservation are tangled together, and the growth of one is not possible without the other. Deep learning (DL) is ubiquitous in autonomous driving, medical imaging, speech recognition, etc. The spectacular success of deep learning comes from its power of richer representation of data. In view of the bright prospects of DL, this review comprehensively focuses on the development of DL applications in water resources management, water environment protection, and water ecology. First, the concept and modeling steps of DL are briefly introduced, including data preparation, algorithm selection, and model evaluation. Finally, the advantages and disadvantages of commonly used algorithms are analyzed according to their structures and mechanisms, and recommendations on the selection of DL algorithms for different studies, as well as prospects for the application and development of DL in water science are proposed. This review provides references for solving a wider range of water-related problems and brings further insights into the intelligent development of water science.
Collapse
Affiliation(s)
- Xiaohua Fu
- Ecological Environment Management and Assessment Center, Central South University of Forestry and Technology, Changsha, 410004, People's Republic of China
| | - Jie Jiang
- Ecological Environment Management and Assessment Center, Central South University of Forestry and Technology, Changsha, 410004, People's Republic of China
- State Environmental Protection Key Laboratory of Water Environmental Simulation and Pollution Control, Ministry of Ecology and Environment, South China Institute of Environmental Sciences, Guangzhou, 510655, People's Republic of China
| | - Xie Wu
- China Railway Water Information Technology Co, LTD, Nanchang, 330000, People's Republic of China
| | - Lei Huang
- School of Environmental Science and Engineering, Guangzhou University, Guangzhou, 510006, People's Republic of China
| | - Rui Han
- China Environment Publishing Group, Beijing, 100062, People's Republic of China
| | - Kun Li
- Freeman Business School, Tulane University, New Orleans, LA, 70118, USA
- Guangzhou Huacai Environmental Protection Technology Co., Ltd, Guangzhou, 511480, People's Republic of China
| | - Chang Liu
- State Environmental Protection Key Laboratory of Water Environmental Simulation and Pollution Control, Ministry of Ecology and Environment, South China Institute of Environmental Sciences, Guangzhou, 510655, People's Republic of China
| | - Kallol Roy
- Institute of Computer Science, University of Tartu, 51009, Tartu, Estonia
| | - Jianyu Chen
- State Environmental Protection Key Laboratory of Water Environmental Simulation and Pollution Control, Ministry of Ecology and Environment, South China Institute of Environmental Sciences, Guangzhou, 510655, People's Republic of China
| | | | - Zhenxing Wang
- State Environmental Protection Key Laboratory of Water Environmental Simulation and Pollution Control, Ministry of Ecology and Environment, South China Institute of Environmental Sciences, Guangzhou, 510655, People's Republic of China.
| |
Collapse
|
8
|
Fan F, Wu G, Yang Y, Liu F, Qian Y, Yu Q, Ren H, Geng J. A Graph Neural Network Model with a Transparent Decision-Making Process Defines the Applicability Domain for Environmental Estrogen Screening. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:18236-18245. [PMID: 37749748 DOI: 10.1021/acs.est.3c04571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
The application of deep learning (DL) models for screening environmental estrogens (EEs) for the sound management of chemicals has garnered significant attention. However, the currently available DL model for screening EEs lacks both a transparent decision-making process and effective applicability domain (AD) characterization, making the reliability of its prediction results uncertain and limiting its practical applications. To address this issue, a graph neural network (GNN) model was developed to screen EEs, achieving accuracy rates of 88.9% and 92.5% on the internal and external test sets, respectively. The decision-making process of the GNN model was explored through the network-like similarity graphs (NSGs) based on the model features (FT). We discovered that the accuracy of the predictions is dependent on the feature distribution of compounds in NSGs. An AD characterization method called ADFT was proposed, which excludes predictions falling outside of the model's prediction range, leading to a 15% improvement in the F1 score of the GNN model. The GNN model with the AD method may serve as an efficient tool for screening EEs, identifying 800 potential EEs in the Inventory of Existing Chemical Substances of China. Additionally, this study offers new insights into comprehending the decision-making process of DL models.
Collapse
Affiliation(s)
- Fan Fan
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
| | - Gang Wu
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
| | - Yining Yang
- School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Fu Liu
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
| | - Yuli Qian
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
| | - Qingmiao Yu
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environment, Ministry of Education, Chongqing University, Chongqing 400044, China
| | - Hongqiang Ren
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
| | - Jinju Geng
- State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China
- Key Laboratory of the Three Gorges Reservoir Region's Eco-Environment, Ministry of Education, Chongqing University, Chongqing 400044, China
| |
Collapse
|
9
|
Han M, Jin B, Liang J, Huang C, Arp HPH. Developing machine learning approaches to identify candidate persistent, mobile and toxic (PMT) and very persistent and very mobile (vPvM) substances based on molecular structure. WATER RESEARCH 2023; 244:120470. [PMID: 37595327 DOI: 10.1016/j.watres.2023.120470] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 08/07/2023] [Accepted: 08/08/2023] [Indexed: 08/20/2023]
Abstract
Determining which substances on the global market could be classified as persistent, mobile and toxic (PMT) substances or very persistent, very mobile (vPvM) substances is essential to prevent or reduce drinking water contamination from them. This study developed machine learning models based on different molecular descriptors (MDs) and defined applicability domains for the screening of PMT/vPvM substances. The models were trained with 3111 substances with expert weight-of-evidence based PMT/vPvM hazard classifications that considered the highest quality data available. The model was based on the hypothesis that PMT/vPvM substances contain similar MDs, representative of chemical structures resistant to degradation, be associated with low sorption (or high-water solubility) and in some cases be associated with known toxic mechanisms. All possible model combinations were tested by integrating different molecular description methods, data balancing strategies and machine learning algorithms. Our model allows one-step prediction of candidate PMT/vPvM substances, and our method was compared with the approach predicting P, M and T separately (i.e. three-step prediction). The results showed that the one-step model achieved a higher accuracy of 92% for PMT/vPvM identification (i.e. positive samples) for an internal test set, and also resulted in a higher accuracy of 90% for an external test set of chemical pollutants detected in Taihu Lake, China. Furthermore, prediction mechanism of the model was interpreted by Shapley additive explanations (SHAP). This work presents an advance of big data in silico screening models for the identification of substances that potentially meet the PMT/vPvM criteria.
Collapse
Affiliation(s)
- Min Han
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou, 510640, China; CAS Center for Excellence in Deep Earth Science, Guangzhou, 510640, China; University of Chinese Academy of Sciences, Beijing, 10069, China
| | - Biao Jin
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou, 510640, China; CAS Center for Excellence in Deep Earth Science, Guangzhou, 510640, China; University of Chinese Academy of Sciences, Beijing, 10069, China.
| | - Jun Liang
- School of Software, South China Normal University, Foshan, 528225, China
| | - Chen Huang
- State Key Laboratory of Organic Geochemistry, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou, 510640, China; CAS Center for Excellence in Deep Earth Science, Guangzhou, 510640, China; University of Chinese Academy of Sciences, Beijing, 10069, China
| | - Hans Peter H Arp
- Norwegian Geotechnical Institute (NGI), P.O. Box 3930 Ullevaal Stadion, Oslo, N-0806, Norway; Norwegian University of Science and Technology (NTNU), Trondheim, NO-7491, Norway
| |
Collapse
|
10
|
Zhang R, Zhu R, Jia M, Pang Y, Zhang B, Bao X, Wang Y. Improvement of a Rapid Method of Detecting Gasoline Detergency Based on the Image Recognition. ACS OMEGA 2023; 8:34134-34145. [PMID: 37744810 PMCID: PMC10515347 DOI: 10.1021/acsomega.3c05350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 08/29/2023] [Indexed: 09/26/2023]
Abstract
The detergency of motor gasoline is closely related to vehicle exhaust emissions and fuel economy. This paper proposed an improved method for the rapid detection of gasoline detergency based on the deposit images of test gasoline on aluminum plates produced by a multichannel gasoline detergency simulation test (MGST). The detection algorithm system was structured to recognize the deposit plate images by computer vision based on the convolutional neural networks (CNNs). Compared with the traditional simulation test, the improved MGST method resulted in significant reductions in fuel consumption, cost, and test time. The performance of three transfer learning models (Inception-ResNet-V2, Inception-V3, and ResNet50-V2) and a customized CNN was evaluated in the detection algorithm system, and their detection accuracies reached 94, 94, 88, and 82%. Inception-RsNet-V2 was selected due to its higher accuracy and better robustness. Based on the model interpretation, it is evident that the model undergoes feature extraction from the sediment deposits on the deposit plate. Subsequently, it employed the acquired deposit features to accurately detect gasoline samples that failed to meet detergency standards. This approach was proved to be effective in enhancing the detection process and ensuring reliable results for gasoline detergency evaluation. It is beneficial to environmental protection regulators for managing market gasoline detergency and urban mobile source pollution. In addition, a deposit plate image database should be established to further improve the detection model performance during the environmental regulation.
Collapse
Affiliation(s)
- Rongshuo Zhang
- School
of Ecology and Environment, Zhengzhou University, Zhengzhou 450001, China
| | - Rencheng Zhu
- School
of Ecology and Environment, Zhengzhou University, Zhengzhou 450001, China
- State
Environmental Protection Key Laboratory of Vehicle Emission Control
and Simulation, Chinese Research Academy
of Environmental Sciences, Beijing 100012, China
| | - Ming Jia
- State
Environmental Protection Key Laboratory of Vehicle Emission Control
and Simulation, Chinese Research Academy
of Environmental Sciences, Beijing 100012, China
| | - Yujie Pang
- School
of Ecology and Environment, Zhengzhou University, Zhengzhou 450001, China
| | - Bowen Zhang
- School
of Ecology and Environment, Zhengzhou University, Zhengzhou 450001, China
| | - Xiaofeng Bao
- State
Environmental Protection Key Laboratory of Vehicle Emission Control
and Simulation, Chinese Research Academy
of Environmental Sciences, Beijing 100012, China
- National
Engineering Laboratory for Mobile Source Emission Control Technology, Tianjin 300399, China
| | - Yunjing Wang
- State
Environmental Protection Key Laboratory of Vehicle Emission Control
and Simulation, Chinese Research Academy
of Environmental Sciences, Beijing 100012, China
| |
Collapse
|
11
|
Liu W, Wang Z, Chen J, Tang W, Wang H. Machine Learning Model for Screening Thyroid Stimulating Hormone Receptor Agonists Based on Updated Datasets and Improved Applicability Domain Metrics. Chem Res Toxicol 2023. [PMID: 37209109 DOI: 10.1021/acs.chemrestox.3c00074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Machine learning (ML) models for screening endocrine-disrupting chemicals (EDCs), such as thyroid stimulating hormone receptor (TSHR) agonists, are essential for sound management of chemicals. Previous models for screening TSHR agonists were built on imbalanced datasets and lacked applicability domain (AD) characterization essential for regulatory application. Herein, an updated TSHR agonist dataset was built, for which the ratio of active to inactive compounds greatly increased to 1:2.6, and chemical spaces of structure-activity landscapes (SALs) were enhanced. Resulting models based on 7 molecular representations and 4 ML algorithms were proven to outperform previous ones. Weighted similarity density (ρs) and weighted inconsistency of activities (IA) were proposed to characterize the SALs, and a state-of-the-art AD characterization methodology ADSAL{ρs, IA} was established. An optimal classifier developed with PubChem fingerprints and the random forest algorithm, coupled with ADSAL{ρs ≥ 0.15, IA ≤ 0.65}, exhibited good performance on the validation set with the area under the receiver operating characteristic curve being 0.984 and balanced accuracy being 0.941 and identified 90 TSHR agonist classes that could not be found previously. The classifier together with the ADSAL{ρs, IA} may serve as efficient tools for screening EDCs, and the AD characterization methodology may be applied to other ML models.
Collapse
Affiliation(s)
- Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhongyu Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Weihao Tang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Haobo Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
12
|
Liu C, Wang P, Huang P, Yang Z, Zhou G. Photo-induced heterogeneous regeneration of Fe(Ⅱ) in Fenton reaction for efficient polycyclic antibiotics removal and in-depth charge transfer mechanism. J Colloid Interface Sci 2023; 638:768-777. [PMID: 36780855 DOI: 10.1016/j.jcis.2023.02.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 01/23/2023] [Accepted: 02/01/2023] [Indexed: 02/09/2023]
Abstract
Fenton reaction is regarded as a potential treatment for antibiotics removal, but challenges remain due to the sluggish reaction kinetics of Fe(III) reduction and incomplete degradation from insufficient active substance. Distinguished from traditional Fe(Ⅱ) regeneration techniques, this work focuses on utilizing the aliovalent redox pairs and built-in electric field to induce photo-excited electrons to cross the material interface and achieve Fe(III) reduction (heterogeneous regeneration). Herein, oxygen-deficient CeO2 particles are anchored on metal-organic frameworks (MIL-88A) and thus constitute the heterojunction with enhanced photoelectric properties, accelerating the directional charge transfer. Consequently, the synthesized MIL-88A/CeO2(OV) composite can degrade 95.76% of oxytetracycline within 60 min in photo-Fenton reaction and maintain a high mineralization rate (75.33%) after 4 cyclic tests. Furthermore, the charge transfer mechanisms of Fe cycle and antibiotics mineralization are both unveiled via experiment results and theorical calculation. This work proposes a new paradigm for constructing self-sufficient photo-Fenton catalytic system for efficient and sustainable removal of polycyclic antibiotics.
Collapse
Affiliation(s)
- Chongchong Liu
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lake of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, People's Republic of China
| | - Peifang Wang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lake of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, People's Republic of China.
| | - Peilin Huang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lake of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, People's Republic of China
| | - Zhimin Yang
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lake of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, People's Republic of China
| | - Gang Zhou
- Key Laboratory of Integrated Regulation and Resource Development on Shallow Lake of Ministry of Education, College of Environment, Hohai University, Nanjing 210098, People's Republic of China.
| |
Collapse
|
13
|
Young TJ, Jubery TZ, Carley CN, Carroll M, Sarkar S, Singh AK, Singh A, Ganapathysubramanian B. "Canopy fingerprints" for characterizing three-dimensional point cloud data of soybean canopies. FRONTIERS IN PLANT SCIENCE 2023; 14:1141153. [PMID: 37063230 PMCID: PMC10090282 DOI: 10.3389/fpls.2023.1141153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 02/28/2023] [Indexed: 06/19/2023]
Abstract
Advances in imaging hardware allow high throughput capture of the detailed three-dimensional (3D) structure of plant canopies. The point cloud data is typically post-processed to extract coarse-scale geometric features (like volume, surface area, height, etc.) for downstream analysis. We extend feature extraction from 3D point cloud data to various additional features, which we denote as 'canopy fingerprints'. This is motivated by the successful application of the fingerprint concept for molecular fingerprints in chemistry applications and acoustic fingerprints in sound engineering applications. We developed an end-to-end pipeline to generate canopy fingerprints of a three-dimensional point cloud of soybean [Glycine max (L.) Merr.] canopies grown in hill plots captured by a terrestrial laser scanner (TLS). The pipeline includes noise removal, registration, and plot extraction, followed by the canopy fingerprint generation. The canopy fingerprints are generated by splitting the data into multiple sub-canopy scale components and extracting sub-canopy scale geometric features. The generated canopy fingerprints are interpretable and can assist in identifying patterns in a database of canopies, querying similar canopies, or identifying canopies with a certain shape. The framework can be extended to other modalities (for instance, hyperspectral point clouds) and tuned to find the most informative fingerprint representation for downstream tasks. These canopy fingerprints can aid in the utilization of canopy traits at previously unutilized scales, and therefore have applications in plant breeding and resilient crop production.
Collapse
Affiliation(s)
- Therin J. Young
- Department of Mechanical Engineering, Iowa State University, Ames, IA, United States
| | | | - Clayton N. Carley
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | - Matthew Carroll
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | - Soumik Sarkar
- Department of Mechanical Engineering, Iowa State University, Ames, IA, United States
- Translational AI Center, Iowa State University, Ames, IA, United States
| | - Asheesh K. Singh
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | - Arti Singh
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | - Baskar Ganapathysubramanian
- Department of Mechanical Engineering, Iowa State University, Ames, IA, United States
- Translational AI Center, Iowa State University, Ames, IA, United States
| |
Collapse
|
14
|
Xing Z, Zhao S, Guo W, Guo X, Wang S, Li M, Wang Y, He H. Analyzing point cloud of coal mining process in much dust environment based on dynamic graph convolution neural network. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:4044-4061. [PMID: 35963970 DOI: 10.1007/s11356-022-22490-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 08/07/2022] [Indexed: 06/15/2023]
Abstract
Environmental perception is an important research direction of coal mine sustainable development. There is much dust in the underground working environment of coal mine. This study is to identify the marker (ball) in the coal mine, which provides a basic to convert the coordinate of large-scale fully mechanized mining face point cloud to the geodetic coordinate. Firstly, in the face of the phenomenon that the uneven distribution of underground point cloud is more serious, this study further has studied on the basis of complete and incomplete geometry point cloud and generated multi-density geometry point cloud for the first time. Secondly, aiming at the problem that the geometric features of underground point cloud are not obvious enough, this study has increased the weight of point cloud normal vector in the training process of network model, so that the network model is more sensitive to different geometric features. Finally, this study has used a variety of advanced deep neural networks to directly analyze point clouds to verify the proposed method. The results show that the method proposed in this study has been combined with the dynamic graph convolution neural network (DGCNN) established earlier, which can more accurately identify the ball in tens of millions of the point clouds of coal mining process. Most importantly, this work is not only of great significance to improve the production efficiency and safety in fully mechanized mining face but also lays a foundation for realizing intelligence in the mining field and avoiding the harm of dust explosion and other accidents to workers.
Collapse
Affiliation(s)
- Zhizhong Xing
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
| | - Shuanfeng Zhao
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China.
| | - Wei Guo
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
| | - Xiaojun Guo
- School of Mechanical & Automotive Engineering, South China University of Technology, Guangzhou, 510641, China
| | - Shenquan Wang
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
| | - Mingyue Li
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
| | - Yuan Wang
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
| | - Haitao He
- College of Mechanical Engineering, Xi'an University of Science and Technology, Xi'an, 710054, China
- Shendong Coal Group Co., Ltd. of National Energy Group, Yulin, 719315, China
| |
Collapse
|
15
|
Jeong J, Kim D, Choi J. Application of ToxCast/Tox21 data for toxicity mechanism-based evaluation and prioritization of environmental chemicals: Perspective and limitations. Toxicol In Vitro 2022; 84:105451. [PMID: 35921976 DOI: 10.1016/j.tiv.2022.105451] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 07/28/2022] [Indexed: 01/28/2023]
Abstract
In response to the need to minimize the use of experimental animals, new approach methodologies (NAMs) using advanced technology have emerged in the 21st century. ToxCast/Tox21 aims to evaluate the adverse effects of chemicals quickly and efficiently using a high-throughput screening and to transform the paradigm of toxicity assessment into mechanism-based toxicity prediction. The ToxCast/Tox21 database, which contains extensive data from over 1400 assays with numerous biological targets and activity data for over 9000 chemicals, can be used for various purposes in the field of chemical prioritization and toxicity prediction. In this study, an overview of the database was explored to aid mechanism-based chemical prioritization and toxicity prediction. Implications for the utilization of the ToxCast/Tox21 database in chemical prioritization and toxicity prediction were derived. The research trends in ToxCast/Tox21 assay data were reviewed in the context of toxicity mechanism identification, chemical priority, environmental monitoring, assay development, and toxicity prediction. Finally, the potential applications and limitations of using ToxCast/Tox21 assay data in chemical risk assessment were discussed. The analysis of the toxicity mechanism-based assays of ToxCast/Tox21 will help in chemical prioritization and regulatory applications without the use of laboratory animals.
Collapse
Affiliation(s)
- Jaeseong Jeong
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Donghyeon Kim
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea
| | - Jinhee Choi
- School of Environmental Engineering, University of Seoul, 163 Seoulsiripdae-ro, Dongdaemun-gu, Seoul 02504, Republic of Korea.
| |
Collapse
|
16
|
Wang H, Wang Z, Chen J, Liu W. Graph Attention Network Model with Defined Applicability Domains for Screening PBT Chemicals. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:6774-6785. [PMID: 35475611 DOI: 10.1021/acs.est.2c00765] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In silico models for screening environmentally persistent, bio-accumulative, and toxic (PBT) substances are necessary for sound management of chemicals. Due to the complex structure-activity landscapes (SALs) on the PBT attributes, previous models for screening PBT chemicals lack either applicability domain (AD) characterizations or interpretability, restricting their applications. Herein, graph attention networks (GATs), a novel neural network architecture, were introduced to construct models for screening PBT chemicals. Results show that the GAT model not only outperformed those in previous studies but also exhibited interpretability since it optimizes attention weight parameters (PAW) that indicate contributions of each atom to the PBT attributes. An AD characterization termed ADFP-AC, which considers both molecular fingerprint (FP) similarities and compounds at activity cliffs (ACs) of SALs, was proposed to describe the ADs, which further assured the performance of the GAT model. Eight previously unidentified classes of compounds were identified as PBT chemicals from the Inventory of Existing Chemical Substances in China. The GAT model together with the ADFP-AC characterization may serve as efficient tools for screening PBT chemicals, and the modeling methodology can be applied to other physicochemical, environmental, behavioral, and toxicological parameters of chemicals that are necessary for their risk assessment and management.
Collapse
Affiliation(s)
- Haobo Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhongyu Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
17
|
Xia D, Chen J, Fu Z, Xu T, Wang Z, Liu W, Xie HB, Peijnenburg WJGM. Potential Application of Machine-Learning-Based Quantum Chemical Methods in Environmental Chemistry. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:2115-2123. [PMID: 35084191 DOI: 10.1021/acs.est.1c05970] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
It is an important topic in environmental sciences to understand the behavior and toxicology of chemical pollutants. Quantum chemical methodologies have served as useful tools for probing behavior and toxicology of chemical pollutants in recent decades. In recent years, machine learning (ML) techniques have brought revolutionary developments to the field of quantum chemistry, which may be beneficial for investigating environmental behavior and toxicology of chemical pollutants. However, the ML-based quantum chemical methods (ML-QCMs) have only scarcely been used in environmental chemical studies so far. To promote applications of the promising methods, this Perspective summarizes recent progress in the ML-QCMs and focuses on their potential applications in environmental chemical studies that could hardly be achieved by the conventional quantum chemical methods. Potential applications and challenges of the ML-QCMs in predicting degradation networks of chemical pollutants, searching global minima for atmospheric nanoclusters, discovering heterogeneous or photochemical transformation pathways of pollutants, as well as predicting environmentally relevant end points with wave functions as descriptors are introduced and discussed.
Collapse
Affiliation(s)
- Deming Xia
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Jingwen Chen
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhiqiang Fu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Tong Xu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Zhongyu Wang
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Wenjia Liu
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Hong-Bin Xie
- Key Laboratory of Industrial Ecology and Environmental Engineering (Ministry of Education), Dalian Key Laboratory on Chemicals Risk Control and Pollution Prevention Technology, School of Environmental Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Willie J G M Peijnenburg
- Institute of Environmental Sciences (CML), Leiden University, Leiden 2300 RA, The Netherlands
- Centre for Safety of Substances and Products, National Institute of Public Health and the Environment (RIVM), Bilthoven 3720 BA, The Netherlands
| |
Collapse
|
18
|
Liu X, Lu D, Zhang A, Liu Q, Jiang G. Data-Driven Machine Learning in Environmental Pollution: Gains and Problems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:2124-2133. [PMID: 35084840 DOI: 10.1021/acs.est.1c06157] [Citation(s) in RCA: 88] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The complexity and dynamics of the environment make it extremely difficult to directly predict and trace the temporal and spatial changes in pollution. In the past decade, the unprecedented accumulation of data, the development of high-performance computing power, and the rise of diverse machine learning (ML) methods provide new opportunities for environmental pollution research. The ML methodology has been used in satellite data processing to obtain ground-level concentrations of atmospheric pollutants, pollution source apportionment, and spatial distribution modeling of water pollutants. However, unlike the active practices of ML in chemical toxicity prediction, advanced algorithms such as deep neural networks in environmental process studies of pollutants are still deficient. In addition, over 40% of the environmental applications of ML go to air pollution, and its application range and acceptance in other aspects of environmental science remain to be increased. The use of ML methods to revolutionize environmental science and its problem-solving scenarios has its own challenges. Several issues should be taken into consideration, such as the tradeoff between model performance and interpretability, prerequisites of the machine learning model, model selection, and data sharing.
Collapse
Affiliation(s)
- Xian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, People's Republic of China
| | - Dawei Lu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, People's Republic of China
| | - Aiqian Zhang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, People's Republic of China
- School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, People's Republic of China
- College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, People's Republic of China
- Institute of Environment and Health, Jianghan University, Wuhan 430056, People's Republic of China
| | - Qian Liu
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, People's Republic of China
- College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100190, People's Republic of China
- Institute of Environment and Health, Jianghan University, Wuhan 430056, People's Republic of China
| | - Guibin Jiang
- State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, People's Republic of China
- School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310012, People's Republic of China
| |
Collapse
|
19
|
Huang R, Ma C, Ma J, Huangfu X, He Q. Machine learning in natural and engineered water systems. WATER RESEARCH 2021; 205:117666. [PMID: 34560616 DOI: 10.1016/j.watres.2021.117666] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 09/01/2021] [Accepted: 09/11/2021] [Indexed: 06/13/2023]
Abstract
Water resources of desired quality and quantity are the foundation for human survival and sustainable development. To better protect the water environment and conserve water resources, efficient water management, purification, and transportation are of critical importance. In recent years, machine learning (ML) has exhibited its practicability, reliability, and high efficiency in numerous applications; furthermore, it has solved conventional and emerging problems in both natural and engineered water systems. For example, ML can predict various water quality indicators in situ and real-time by considering the complex interactions among water-related variables. ML approaches can also solve emerging pollution problems with proven rules or universal mechanisms summarized from the related research. Moreover, by applying image recognition technology to analyze the relationships between image information and physicochemical properties of the research object, ML can effectively identify and characterize specific contaminants. In view of the bright prospects of ML, this review comprehensively summarizes the development of ML applications in natural and engineered water systems. First, the concept and modeling steps of ML are briefly introduced, including data preparation, algorithm selection and model evaluation. In addition, comprehensive applications of ML in recent studies, including predicting water quality, mapping groundwater contaminants, classifying water resources, tracing contaminant sources, and evaluating pollutant toxicity in natural water systems, as well as modeling treatment techniques, assisting characterization analysis, purifying and distributing drinking water, and collecting and treating sewage water in engineered water systems, are summarized. Finally, the advantages and disadvantages of commonly used algorithms are analyzed according to their structures and mechanisms, and recommendations on the selection of ML algorithms for different studies, as well as prospects on the application and development of ML in water science are proposed. This review provides references for solving a wider range of water-related problems and brings further insights into the intelligent development of water science.
Collapse
Affiliation(s)
- Ruixing Huang
- Key Laboratory of Eco-environments in the Three Gorges Reservoir Region, Ministry of Education, College of Environmental and Ecology, Chongqing University, Chongqing 400044, China; State Key Laboratory of Urban Water Resource and Environment, School of Municipal and Environmental Engineering, Harbin Institute of Technology, Harbin 150090, China
| | - Chengxue Ma
- Key Laboratory of Eco-environments in the Three Gorges Reservoir Region, Ministry of Education, College of Environmental and Ecology, Chongqing University, Chongqing 400044, China; State Key Laboratory of Urban Water Resource and Environment, School of Municipal and Environmental Engineering, Harbin Institute of Technology, Harbin 150090, China
| | - Jun Ma
- State Key Laboratory of Urban Water Resource and Environment, School of Municipal and Environmental Engineering, Harbin Institute of Technology, Harbin 150090, China
| | - Xiaoliu Huangfu
- Key Laboratory of Eco-environments in the Three Gorges Reservoir Region, Ministry of Education, College of Environmental and Ecology, Chongqing University, Chongqing 400044, China.
| | - Qiang He
- Key Laboratory of Eco-environments in the Three Gorges Reservoir Region, Ministry of Education, College of Environmental and Ecology, Chongqing University, Chongqing 400044, China
| |
Collapse
|