1
|
Ban MJ, Lee DH, Lee BT, Kang JH. Assessing the environmental determinants of micropollutant contamination in streams using explainable machine learning and network analysis. CHEMOSPHERE 2024; 370:144041. [PMID: 39733950 DOI: 10.1016/j.chemosphere.2024.144041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 12/10/2024] [Accepted: 12/26/2024] [Indexed: 12/31/2024]
Abstract
Even at trace concentrations, micropollutants, including pesticides and pharmaceuticals, pose considerable ecological risks, and the increasing presence of synthetic chemical substances in aquatic systems has emerged as a growing concern. Moreover, limited machine-learning (ML) approaches exist for analyzing environmental data, and the increasing complexity of ML models has made it challenging to understand predictor-outcome relationships. In particular, understanding complex interactions among multiple variables remains challenging. This study applies and integrates explainable ML techniques and network analysis to identify the sources of micropollutants in a large watershed and determine the factors affecting micropollutant levels. We assessed the performance of four ML algorithms-support vector machine, random forest, extreme gradient boosting (XGB), and autoencoder-XGB-in predicting micropollutant levels based on the spatial characteristics of the watershed. We applied the synthetic minority oversampling technique to address the data imbalance. The XGB model demonstrated superior predictive performance, particularly for high concentration levels, achieving an accuracy of 87%-99%. Shapley additive explanations (SHAP) analysis identified temperature and rainfall as significant factors. Moreover, agricultural activities contributed to pesticide pollution, whereas urban activities contributed to pharmaceutical contamination. The network analysis corroborated the SHAP findings and revealed event-specific contamination characteristics. This included distinct discharge pathways during a dry summer event and shared pathways during a wet winter event. This approach enhances an understanding of contamination sources and pathways and subsequently aids in developing control measures and making informed policy decisions to preserve water quality in mixed land-use areas.
Collapse
Affiliation(s)
- Min Jeong Ban
- Department of Civil and Environmental Engineering, Dongguk University-Seoul, Seoul, 04620, South Korea
| | - Dong Hoon Lee
- Department of Civil and Environmental Engineering, Dongguk University-Seoul, Seoul, 04620, South Korea
| | - Byung-Tae Lee
- Central Research Facilities, Gwangju Institute of Science and Technology, Gwangju, South Korea.
| | - Joo-Hyon Kang
- Department of Civil and Environmental Engineering, Dongguk University-Seoul, Seoul, 04620, South Korea.
| |
Collapse
|
2
|
Kang D, Ahn YY, Moon HB, Kim K, Jeon J. Exploring micropollutants in polar environments based on non-target analysis using LC-HRMS. MARINE POLLUTION BULLETIN 2024; 209:117083. [PMID: 39393234 DOI: 10.1016/j.marpolbul.2024.117083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 07/31/2024] [Accepted: 09/30/2024] [Indexed: 10/13/2024]
Abstract
The routine use of chemicals in polar regions contributes to unexpected occurrence of micropollutants, with sewage discharge as a prominent pollution source. The aim of this study was to identify and quantify micropollutants in polar environments near potential point sources using non-target analysis (NTA) with liquid chromatography high-resolution mass spectrometry. Seawater samples were collected from Ny-Ålesund, Svalbard and Marian Cove, King George Island, in 2023. We tentatively identified 32 compounds with NTA, along with 105 homologous series substances. Of these, 18 substances were confirmed, and 13 were quantified using the internal standard method. Most quantified substances in the Ny-Ålesund, including caffeine, naproxen, and polyethylene glycols (PEGs), exhibited concentrations ranged from 0.9 to 770,000 ng/L. In Marian Cove, the analysis predominantly detected acetaminophen, with concentrations ranging from 13 to 35 ng/L. The findings underscore the presence and spatial distribution of emerging micropollutants resulting from wastewater discharge in polar regions.
Collapse
Affiliation(s)
- Daeho Kang
- Department of Environmental Engineering, Changwon National University, Changwon, Gyeongsangnam-do 51140, Republic of Korea
| | - Yong-Yoon Ahn
- Korea Polar Research Institute (KOPRI), Incheon 21990, Republic of Korea
| | - Hyo-Bang Moon
- Department of Marine Sciences and Convergent Technology, College of Science and Convergence Technology, Hanyang University, Ansan 15588, Republic of Korea
| | - Kitae Kim
- Korea Polar Research Institute (KOPRI), Incheon 21990, Republic of Korea; Department of Polar Science, University of Science of Technology (UST), Incheon 21990, Republic of Korea
| | - Junho Jeon
- Department of Environmental Engineering, Changwon National University, Changwon, Gyeongsangnam-do 51140, Republic of Korea; School of Smart and Green Engineering, Changwon National University, Changwon, Gyeongsangnam-do 51140, Republic of Korea.
| |
Collapse
|
3
|
Xu H, Yang X, Hu Y, Wang D, Liang Z, Mu H, Wang Y, Shi L, Gao H, Song D, Cheng Z, Lu Z, Zhao X, Lu J, Wang B, Hu Z. Trusted artificial intelligence for environmental assessments: An explainable high-precision model with multi-source big data. ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY 2024; 22:100479. [PMID: 39286480 PMCID: PMC11402945 DOI: 10.1016/j.ese.2024.100479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 08/19/2024] [Accepted: 08/22/2024] [Indexed: 09/19/2024]
Abstract
Environmental assessments are critical for ensuring the sustainable development of human civilization. The integration of artificial intelligence (AI) in these assessments has shown great promise, yet the "black box" nature of AI models often undermines trust due to the lack of transparency in their decision-making processes, even when these models demonstrate high accuracy. To address this challenge, we evaluated the performance of a transformer model against other AI approaches, utilizing extensive multivariate and spatiotemporal environmental datasets encompassing both natural and anthropogenic indicators. We further explored the application of saliency maps as a novel explainability tool in multi-source AI-driven environmental assessments, enabling the identification of individual indicators' contributions to the model's predictions. We find that the transformer model outperforms others, achieving an accuracy of about 98% and an area under the receiver operating characteristic curve (AUC) of 0.891. Regionally, the environmental assessment values are predominantly classified as level II or III in the central and southwestern study areas, level IV in the northern region, and level V in the western region. Through explainability analysis, we identify that water hardness, total dissolved solids, and arsenic concentrations are the most influential indicators in the model. Our AI-driven environmental assessment model is accurate and explainable, offering actionable insights for targeted environmental management. Furthermore, this study advances the application of AI in environmental science by presenting a robust, explainable model that bridges the gap between machine learning and environmental governance, enhancing both understanding and trust in AI-assisted environmental assessments.
Collapse
Affiliation(s)
- Haoli Xu
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Jianghuai Advance Technology Center, Hefei, 230000, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Xing Yang
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Yihua Hu
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Daqing Wang
- Defense Engineering College, Army Engineering University of PLA, Nanjing, 210007, China
| | - Zhenyu Liang
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Hua Mu
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Yangyang Wang
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Liang Shi
- Jianghuai Advance Technology Center, Hefei, 230000, China
| | - Haoqi Gao
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Daoqing Song
- International Studies College, National University of Defense Technology, Nanjing, 210000, China
| | - Zijian Cheng
- Defense Engineering College, Army Engineering University of PLA, Nanjing, 210007, China
| | - Zhao Lu
- Defense Engineering College, Army Engineering University of PLA, Nanjing, 210007, China
| | - Xiaoning Zhao
- Defense Engineering College, Army Engineering University of PLA, Nanjing, 210007, China
| | - Jun Lu
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Bingwen Wang
- State Key Laboratory of Pulsed Power Laser, College of Electronic Engineering, National University of Defense Technology, Hefei, 230037, China
- Key Laboratory of Electronic Restriction of Anhui Province, Hefei, 230037, China
| | - Zhiyang Hu
- School of Electrical Engineering and Automation, Hefei University of Technology, Hefei, 230009, China
| |
Collapse
|
4
|
Muambo KE, Kim MG, Kim DH, Park S, Oh JE. Pharmaceuticals in raw and treated water from drinking water treatment plants nationwide: Insights into their sources and exposure risk assessment. WATER RESEARCH X 2024; 24:100256. [PMID: 39291270 PMCID: PMC11406100 DOI: 10.1016/j.wroa.2024.100256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2024] [Revised: 08/06/2024] [Accepted: 09/02/2024] [Indexed: 09/19/2024]
Abstract
Due to the large amounts of pharmaceuticals and personal care products (PPCPs) currently being consumed and released into the environment, this study provides a comprehensive analysis of pharmaceutical pollution in both raw and treated water from full-scale drinking water treatment plants nationwide. Our investigation revealed that 30 out of 37 PPCPs were present in raw water with mean concentrations ranging from 0.01-131 ng/L. The raw water sources, surface water (ND - 147 ng/L), subsurface water (ND - 123 ng/L) and reservoir sources (ND - 135 ng/L) exhibited higher mean concentration levels of pharmaceutical residues compared to groundwater sources (ND - 1.89 ng/L). Meanwhile, in treated water, 17 of the 37 analyzed PPCPs were present with carbamazepine, clarithromycin, fluconazole, telmisartan, valsartan, and cotinine being the most common (detection frequency > 40 %), and having mean concentrations of 1.22, 0.12, 3.48, 40.1, 6.36, and 3.73 ng/L, respectively. These findings highlight that, while water treatment processes are effective, there are some persistent compounds that prove challenging to fully eliminate. Using Monte Carlo simulations, risk assessment indicated that most of these compounds are likely to have negligible impact on human health, except for the antihypertensives. Telmisartan was identified as posing the highest ecological risk (RQ > 1), warranting further investigation, and monitoring. The study concludes by prioritizing specific 14 pharmaceuticals, including telmisartan, clarithromycin, lamotrigine, cotinine, lidocaine, tramadol, and others, for future monitoring to safeguard both ecological and human health.
Collapse
Affiliation(s)
- Kimberly Etombi Muambo
- Department of Civil and Environmental Engineering, Pusan National University, Busan 46241, Republic of Korea
| | - Min-Gyeong Kim
- Department of Civil and Environmental Engineering, Pusan National University, Busan 46241, Republic of Korea
| | - Da-Hye Kim
- Institute for Environment and Energy, Pusan National University Busan 46241, Republic of Korea
| | - Sangmin Park
- Department of Environmental Infrastructure Research, National Institute of Environmental Research, Ministry of Environment, Incheon 22689, South Korea
| | - Jeong-Eun Oh
- Department of Civil and Environmental Engineering, Pusan National University, Busan 46241, Republic of Korea
- Institute for Environment and Energy, Pusan National University Busan 46241, Republic of Korea
| |
Collapse
|
5
|
Kang D, Yun D, Cho KH, Baek SS, Jeon J. Profiling emerging micropollutants in urban stormwater runoff using suspect and non-target screening via high-resolution mass spectrometry. CHEMOSPHERE 2024; 352:141402. [PMID: 38346509 DOI: 10.1016/j.chemosphere.2024.141402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 01/31/2024] [Accepted: 02/05/2024] [Indexed: 02/24/2024]
Abstract
Urban surface runoff contains chemicals that can negatively affect water quality. Urban runoff studies have determined the transport dynamics of many legacy pollutants. However, less attention has been paid to determining the first-flush effects (FFE) of emerging micropollutants using suspect and non-target screening (SNTS). Therefore, this study employed suspect and non-target analyses using liquid chromatography-high resolution mass spectrometry to detect emerging pollutants in urban receiving waters during stormwater events. Time-interval sampling was used to determine occurrence trends during stormwater events. Suspect screening tentatively identified 65 substances, then, their occurrence trend was grouped using correlation analysis. Non-target peaks were prioritized through hierarchical cluster analysis, focusing on the first flush-concentrated peaks. This approach revealed 38 substances using in silico identification. Simultaneously, substances identified through homologous series observation were evaluated for their observed trends in individual events using network analysis. The results of SNTS were normalized through internal standards to assess the FFE, and the most of tentatively identified substances showed observed FFE. Our findings suggested that diverse pollutants that could not be covered by target screening alone entered urban water through stormwater runoff during the first flush. This study showcases the applicability of the SNTS in evaluating the FFE of urban pollutants, offering insights for first-flush stormwater monitoring and management.
Collapse
Affiliation(s)
- Daeho Kang
- Department of Environmental Engineering, Changwon National University, Changwon, Gyeongsangnamdo, 51140, South Korea
| | - Daeun Yun
- Civil Urban Earth and Environmental Engineering, Ulsan National Institute of Science and Technology, 50 UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, 44919, South Korea
| | - Kyung Hwa Cho
- School of Civil, Environmental and Architectural Engineering, Korea University, Seoul, 02841, South Korea
| | - Sang-Soo Baek
- Department of Environmental Engineering, Yeungnam University, 280 Daehak-Ro, Gyeongsan-Si, Gyeongbuk, 38541, South Korea
| | - Junho Jeon
- Department of Environmental Engineering, Changwon National University, Changwon, Gyeongsangnamdo, 51140, South Korea; School of Smart and Green Engineering, Changwon National University, Changwon, Gyeongsangnamdo, 51140, South Korea.
| |
Collapse
|
6
|
Yun D, Kang D, Cho KH, Baek SS, Jeon J. Characterization of micropollutants in urban stormwater using high-resolution monitoring and machine learning. WATER RESEARCH 2023; 235:119865. [PMID: 36934536 DOI: 10.1016/j.watres.2023.119865] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 06/18/2023]
Abstract
Urban rainfall events can lead to the runoff of pollutants, including industrial, pesticide, and pharmaceutical chemicals. Transporting micropollutants (MPs) into water systems can harm both human health and aquatic species. Therefore, it is necessary to investigate the dynamics of MPs during rainfall events. However, few studies have examined MPs during rainfall events due to the high analytical expenses and extensive spatiotemporal variability. Few studies have investigated the occurrence patterns of MPs and factors that influence their transport, such as rainfall duration, antecedent dry periods, and variations in streamflow. Moreover, while there have been many analyses of nutrients, suspended solids, and heavy metals during the first flush effect (FFE), studies on the transport of MPs during FFE are insufficient. This study aimed to identify the dynamics of MPs and FFE in an urban catchment, using high-resolution monitoring and machine learning methods. Hierarchical clustering analysis and partial least squares regression (PLSR) were implemented to estimate the similarity between each MP and identify the factors influencing their transport during rainfall events. Eleven dominant MPs comprised 75% of the total MP concentration and had a 100% detection frequency. During rainfall events, pesticides and pharmaceutical MPs showed a higher FFE than industrial MPs. Moreover, the initial 30% of the runoff volume contained 78.0% of pesticide and 50.1% of pharmaceutical substances for events W1 (July 5 to July 6, 2021) and W6 (August 31 to September 1, 2021), respectively. The PLSR model suggested that stormflow (m3/s) and the duration of antecedent dry hours (h) significantly influenced MP dynamics, yielding the variable importance on projection scores greater than 1.0. Hence, our findings indicate that MPs in urban waters should be managed by considering FFE.
Collapse
Affiliation(s)
- Daeun Yun
- School of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology, 50 UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
| | - Daeho Kang
- Department of Environmental Engineering, Changwon National University, Changwondaehak-ro 20, Uichang-gu, Changwon-si, Gyeongsangnam-do 51140, Republic of Korea
| | - Kyung Hwa Cho
- School of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology, 50 UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea; Graduate School of Carbon Neutrality, Ulsan National Institute of Science and Technology, 50 UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
| | - Sang-Soo Baek
- Department of Environmental Engineering, Yeungnam University, 280 Daehak-Ro, Gyeongsan-Si, Gyeongbuk 38541, South Korea.
| | - Junho Jeon
- Department of Environmental Engineering, Changwon National University, Changwondaehak-ro 20, Uichang-gu, Changwon-si, Gyeongsangnam-do 51140, Republic of Korea; School of Smart and Green Engineering, Changwon National University, Changwon, Gyeongsangnamdo 51140, Korea.
| |
Collapse
|
7
|
Alygizakis N, Giannakopoulos T, Τhomaidis NS, Slobodnik J. Detecting the sources of chemicals in the Black Sea using non-target screening and deep learning convolutional neural networks. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 847:157554. [PMID: 35878861 DOI: 10.1016/j.scitotenv.2022.157554] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 07/17/2022] [Accepted: 07/18/2022] [Indexed: 06/15/2023]
Abstract
The Black Sea is an important ecosystem, which is affected by various anthropogenic pressures, such as shipping activities and wastewater inputs from large coastal cities. Significant loads of chemical pollutants are being continuously brought in by major European rivers. This study investigated the spatial distribution of chemicals in the Ukrainian shelf (the northwestern part of the Black Sea) and their main sources. Chemical occurrence data used in the study was generated within the Joint Black Sea Surveys (JBSS), which took place in 2016 and 2017 as a part of the EU/UNDP EMBLAS II project (www.emblasproject.org). During the JBSS, seawater samples were analyzed by a non-target screening workflow using liquid chromatography high-resolution mass spectrometry (LC-HRMS). Open-source algorithms were applied to generate a combined dataset of 30,489 detected chemical signals and their intensities. Out of these, 35 compounds were tentatively identified by the application of a non-target screening identification workflow based on automated matching of their mass spectra against those in available mass spectral libraries. The dataset was used to generate images, representing spatial distribution of each of the signals. These images were then used as an input to a deep learning convolutional neural network classification model. The study resulted in the development of an open-source end-to-end workflow for the estimation of the pollution load by chemicals contributed by the two major inflowing rivers (Danube and Dnieper) and other, so far unidentified, sources. A dedicated dashboard was built to facilitate data visualization per detected signal/compound. The presented model proved to be especially useful at the prioritization of signals of unknown compounds, which is of key importance for the follow up structure elucidation efforts of bulky non-target screening data. The deep learning approach for peak prioritization of unknown chemicals in the environment has been used for the first time.
Collapse
Affiliation(s)
- Nikiforos Alygizakis
- Laboratory of Analytical Chemistry, Department of Chemistry, University of Athens, Panepistimiopolis Zografou, 15771 Athens, Greece; Environmental Institute, Okružná 784/42, 97241 Koš, Slovak Republic.
| | | | - Nikolaos S Τhomaidis
- Laboratory of Analytical Chemistry, Department of Chemistry, University of Athens, Panepistimiopolis Zografou, 15771 Athens, Greece.
| | | |
Collapse
|