1
|
Zhang X, Long T, Deng S, Chen Q, Chen S, Luo M, Yu R, Zhu X. Machine Learning Modeling Based on Microbial Community for Prediction of Natural Attenuation in Groundwater. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:21212-21223. [PMID: 38064381 DOI: 10.1021/acs.est.3c05667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Natural attenuation is widely adopted as a remediation strategy, and the attenuation potential is crucial to evaluate whether remediation goals can be achieved within the specified time. In this work, long-term monitoring of indigenous microbial communities as well as benzene, toluene, ethylbenzene, and xylene (BTEX) and chlorinated aliphatic hydrocarbons (CAHs) in groundwater was conducted at a historic pesticide manufacturing site. A machine learning approach for natural attenuation prediction was developed with random forest classification (RFC) followed by either random forest regression (RFR) or artificial neural networks (ANNs), utilizing microbiological information and contaminant attenuation rates for model training and cross-validation. Results showed that the RFC could accurately predict the feasibility of natural attenuation for both BTEX and CAHs, and it could successfully identify the key genera. The RFR model was sufficient for the BTEX natural attenuation rate prediction but unreliable for CAHs. The ANN model showed better performance in the prediction of the attenuation rates for both BTEX and CAHs. Based on the assessments, a composite modeling method of RFC and ANN was proposed, which could reduce the mean absolute percentage errors. This study reveals that the combined machine learning approach under the synergistic use of field microbial data has promising potential for predicting natural attenuation.
Collapse
Affiliation(s)
- Xiaodong Zhang
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
- Department of Environmental Science and Engineering, School of Energy and Environment, Southeast University, Nanjing 210096, Jiangsu, China
| | - Tao Long
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
| | - Shaopo Deng
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
| | - Qiang Chen
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
| | - Sheng Chen
- Geo-engineering Investigation Institute of Jiangsu Province, Nanjing 210041, Jiangsu, China
| | - Moye Luo
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
- Department of Environmental Science and Engineering, School of Energy and Environment, Southeast University, Nanjing 210096, Jiangsu, China
| | - Ran Yu
- Department of Environmental Science and Engineering, School of Energy and Environment, Southeast University, Nanjing 210096, Jiangsu, China
| | - Xin Zhu
- State Environmental Protection Key Laboratory of Soil Environmental Management and Pollution Control, Nanjing Institute of Environmental Sciences, Ministry of Ecology and Environment of China, Nanjing 210042, Jiangsu, China
| |
Collapse
|
2
|
Haggerty R, Sun J, Yu H, Li Y. Application of machine learning in groundwater quality modeling - A comprehensive review. WATER RESEARCH 2023; 233:119745. [PMID: 36812816 DOI: 10.1016/j.watres.2023.119745] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 11/30/2022] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
Groundwater is a crucial resource across agricultural, civil, and industrial sectors. The prediction of groundwater pollution due to various chemical components is vital for planning, policymaking, and management of groundwater resources. In the last two decades, the application of machine learning (ML) techniques for groundwater quality (GWQ) modeling has grown exponentially. This review assesses all supervised, semi-supervised, unsupervised, and ensemble ML models implemented to predict any groundwater quality parameter, making this the most extensive modern review on this topic. Neural networks are the most used ML model in GWQ modeling. Their usage has declined in recent years, giving rise to more accurate or advanced techniques such as deep learning or unsupervised algorithms. Iran and the United States lead the world in areas modeled, with a wealth of historical data available. Nitrate has been modeled most exhaustively, targeted by nearly half of all studies. Advancements in future work will be made with further implementation of deep learning and explainable artificial intelligence or other cutting-edge techniques, application of these techniques for sparsely studied variables, the modeling of new or unique study areas, and the implementation of ML techniques for groundwater quality management.
Collapse
Affiliation(s)
- Ryan Haggerty
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Jianxin Sun
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Hongfeng Yu
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States; Holland Computing Center, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Yusong Li
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States.
| |
Collapse
|
3
|
Khosravi K, Barzegar R, Miraki S, Adamowski J, Daggupati P, Alizadeh MR, Pham BT, Alami MT. Stochastic Modeling of Groundwater Fluoride Contamination: Introducing Lazy Learners. GROUND WATER 2020; 58:723-734. [PMID: 31736062 DOI: 10.1111/gwat.12963] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 10/28/2019] [Accepted: 11/08/2019] [Indexed: 06/10/2023]
Abstract
While it remains the primary source of safe drinking and irrigation water in northwest Iran's Maku Plain, the region's groundwater is prone to fluoride contamination. Accordingly, modeling techniques to accurately predict groundwater fluoride concentration are required. The current paper advances several novel data mining algorithms including Lazy learners [instance-based K-nearest neighbors (IBK); locally weighted learning (LWL); and KStar], a tree-based algorithm (M5P), and a meta classifier algorithm [regression by discretization (RBD)] to predict groundwater fluoride concentration. Drawing on several groundwater quality variables (e.g., Ca 2 + , Mg 2 + , Na + , K + , HCO 3 - , CO 3 2 - , SO 4 2 - , and Cl - concentrations), measured in each of 143 samples collected between 2004 and 2008, several models predicting groundwater fluoride concentrations were developed. The full dataset was divided into two subsets: 70% for model training (calibration) and 30% for model evaluation (validation). Models were validated using several statistical evaluation criteria and three visual evaluation approaches (i.e., scatter plots, Taylor and Violin diagrams). Although Na+ and Ca2+ showed the greatest positive and negative correlations with fluoride (r = 0.59 and -0.39, respectively), they were insufficient to reliably predict fluoride levels; therefore, other water quality variables, including those weakly correlated with fluoride, should be considered as inputs for fluoride prediction. The IBK model outperformed other models in fluoride contamination prediction, followed by KStar, RBD, M5P, and LWL. The RBD and M5P models were the least accurate in terms of predicting peaks in fluoride concentration values. Results of the current study can be used to support practical and sustainable management of water and groundwater resources.
Collapse
Affiliation(s)
| | - Rahim Barzegar
- Department of Bioresource Engineering, McGill University, 21111 Lakeshore, Ste Anne de Bellevue, Quebec, H9X3V9, Canada
- Faculty of Civil Engineering, University of Tabriz, 29 Bahman Blvd., Tabriz, 5166616471, Iran
| | - Shaghayegh Miraki
- Department of Watershed Management Engineering, Sari Agricultural Science and Natural Resources University, Farah Abad Road, Sari, 4818168984, Iran
| | - Jan Adamowski
- Department of Bioresource Engineering, McGill University, 21111 Lakeshore, Ste Anne de Bellevue, Quebec, H9X3V9, Canada
| | - Prasad Daggupati
- School of Engineering, University of Guelph, Ontario, N1G 2W1, Canada
| | - Mohammad Reza Alizadeh
- Department of Bioresource Engineering, McGill University, 21111 Lakeshore, Ste Anne de Bellevue, Quebec, H9X3V9, Canada
| | | | - Mohammad Taghi Alami
- Faculty of Civil Engineering, University of Tabriz, 29 Bahman Blvd., Tabriz, 5166616471, Iran
| |
Collapse
|
4
|
A Review of the Artificial Neural Network Models for Water Quality Prediction. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10175776] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Water quality prediction plays an important role in environmental monitoring, ecosystem sustainability, and aquaculture. Traditional prediction methods cannot capture the nonlinear and non-stationarity of water quality well. In recent years, the rapid development of artificial neural networks (ANNs) has made them a hotspot in water quality prediction. We have conducted extensive investigation and analysis on ANN-based water quality prediction from three aspects, namely feedforward, recurrent, and hybrid architectures. Based on 151 papers published from 2008 to 2019, 23 types of water quality variables were highlighted. The variables were primarily collected by the sensor, followed by specialist experimental equipment, such as a UV-visible photometer, as there is no mature sensor for measurement at present. Five different output strategies, namely Univariate-Input-Itself-Output, Univariate-Input-Other-Output, Multivariate-Input-Other(multi), Multivariate-Input-Itself-Other-Output, and Multivariate-Input-Itself-Other (multi)-Output, are summarized. From results of the review, it can be concluded that the ANN models are capable of dealing with different modeling problems in rivers, lakes, reservoirs, wastewater treatment plants (WWTPs), groundwater, ponds, and streams. The results of many of the review articles are useful to researchers in prediction and similar fields. Several new architectures presented in the study, such as recurrent and hybrid structures, are able to improve the modeling quality of future development.
Collapse
|
5
|
Bui DT, Khosravi K, Karimi M, Busico G, Khozani ZS, Nguyen H, Mastrocicco M, Tedesco D, Cuoco E, Kazakis N. Enhancing nitrate and strontium concentration prediction in groundwater by using new data mining algorithm. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 715:136836. [PMID: 32007881 DOI: 10.1016/j.scitotenv.2020.136836] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Revised: 01/19/2020] [Accepted: 01/19/2020] [Indexed: 05/14/2023]
Abstract
Groundwater resources constitute the main source of clean fresh water for domestic use and it is essential for food production in the agricultural sector. Groundwater has a vital role for water supply in the Campanian Plain in Italy and hence a future sustainability of the resource is essential for the region. In the current paper novel data mining algorithms including Gaussian Process (GP) were used in a large groundwater quality database to predict nitrate (contaminant) and strontium (potential future increasing) concentrations in groundwater. The results were compared with M5P, random forest (RF) and random tree (RT) algorithms as a benchmark to test the robustness of the modeling process. The dataset includes 246 groundwater quality samples originating from different wells, municipals and agricultural. It was divided for the modeling process into two subgroups by using the 10-fold cross validation technique including 173 samples for model building (training dataset) and 73 samples for model validation (testing dataset). Different water quality variables including T, pH, EC, HCO3-, F-, Cl-, SO42-, Na+, K+, Mg2+, and Ca2+ have been used as an input to the models. At first stage, different input combinations have been constructed based on correlation coefficient and thus the optimal combination was chosen for the modeling phase. Different quantitative criteria alongside with visual comparison approach have been used for evaluating the modeling capability. Results revealed that to obtain reliable results also variables with low correlation should be considered as an input to the models together with those variables showing high correlation coefficients. According to the model evaluation criteria, GP algorithm outperforms all the other models in predicting both nitrate and strontium concentrations followed by RF, M5P and RT, respectively. Result also revealed that model's structure together with the accuracy and structure of the data can have a relevant impact on the model's results.
Collapse
Affiliation(s)
- Dieu Tien Bui
- Geographic Information Science Research Group, Ton Duc Thang University, Ho Chi Minh City, Viet Nam; Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Viet Nam.
| | | | - Mahshid Karimi
- Department of Watershed Management, Sari Agricultural Science and Natural Resources University, Sari, Iran
| | - Gianluigi Busico
- University of Campania "Luigi Vanvitelli", Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, Via Vivaldi 43, 81100, Caserta, Italy
| | - Zohreh Sheikh Khozani
- Department of Civil Engineering, Faculty of Engineering & Built Environment, Universiti Kebangsaan, Malaysia
| | - Hoang Nguyen
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam.
| | - Micol Mastrocicco
- University of Campania "Luigi Vanvitelli", Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, Via Vivaldi 43, 81100, Caserta, Italy
| | - Dario Tedesco
- University of Campania "Luigi Vanvitelli", Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, Via Vivaldi 43, 81100, Caserta, Italy; Istituto Nazionale di Geofisica e Vulcanologia, sezione di Napoli - Osservatorio Vesuvuviano, Via Diocleziano 328 - Napoli, Italy
| | - Emilio Cuoco
- University of Campania "Luigi Vanvitelli", Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, Via Vivaldi 43, 81100, Caserta, Italy
| | - Nerantzis Kazakis
- Aristotle University of Thessaloniki, Department of Geology, Lab. of Engineering Geology & Hydrogeology, 54124 Thessaloniki, Greece.
| |
Collapse
|
6
|
Sakizadeh M, Rahmatinia H. Statistical Learning Methods for Classification and Prediction of Groundwater Quality Using a Small Data Record. INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS 2017. [DOI: 10.4018/ijaeis.2017100103] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The objective of this study was to consider the efficiency of support vector machine (SVM) and artificial neural network (ANN) for the classification and prediction of groundwater quality using a small data record in Malayer, Iran. For this purpose, 14 groundwater quality variables that had been collected from 27 groundwater sampling wells were used. Cluster analysis discriminated the total sampling stations into two groups. The classification was implemented by SVM using polynomial and RBF kernel methods. The respective sensitivity and specificity of this model were 0.89 and 0.80 while that of positive predictive value and negative predictive value were 0.89 and 0.86, respectively. The prediction of water quality index (WQI) was implemented using ANN. Despite the high correlation coefficient between the predicted and observed values of WQI(r = 0.90), the generalization ability of this model was low(r = 0.60) indicating the over-fitting of the model to the training data set.
Collapse
|
7
|
Alslaibi TM, Kishawi Y, Abunada Z. Evaluating impacts of recharging partially treated wastewater on groundwater aquifer in semi-arid region by integration of monitoring program and GIS technique. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2017; 24:13674-13686. [PMID: 28397120 DOI: 10.1007/s11356-017-8789-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 03/08/2017] [Indexed: 06/07/2023]
Abstract
The current study investigates the impact of recharging of partially treated wastewater through an infiltration basin on the groundwater aquifer quality parameters. A monitoring program supported by a geographic information analysis (GIS) tool was used to conduct this study. Groundwater samples from the entire surrounding boreholes located downstream the infiltration basin, in addition to samples from the recharged wastewater coming from the Beit Lahia wastewater treatment (BLWWTP), were monitored and analysed between 2011 and 2014. The analysis was then compared with the available historical data since 2008. Results revealed a groundwater replenishment with the groundwater level increased by 1.0-2.0 m during the study period. It also showed a slight improvement in the groundwater quality parameters, mainly a decrease in TDS, Cl- and NO3- levels by 5.5, 17.1 and 20%, respectively, resulting from the relatively better quality of the recharged wastewater. Nevertheless, the level of boron and ammonium in the groundwater wells showed a significant increase over time by 96 and 100%, respectively. Moreover, the infiltration rate was slowed down in time due to the relatively high level of total suspended solid (TSS) in the infiltrated wastewater.
Collapse
Affiliation(s)
- Tamer M Alslaibi
- Environmental Quality Authority, Water Recourse Department, Gaza Strip, Palestine.
- Civil Engineering Department, Palestine University, Gaza Strip, Palestine.
| | | | - Ziyad Abunada
- Environmental Engineering Department, Antalya International University, Döşemealtı, Turkey
| |
Collapse
|
8
|
Zyoud SH, Fuchs-Hanusch D. Estimates of Arab world research productivity associated with desalination: a bibliometric analysis. ACTA ACUST UNITED AC 2016. [DOI: 10.1179/2051645215y.0000000001] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
|