Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Uddameri V, Silva A, Singaraju S, Mohammadi G, Hernandez E. Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas. Water 2020;12:1023. [DOI: 10.3390/w12041023] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

For:	Uddameri V, Silva A, Singaraju S, Mohammadi G, Hernandez E. Tree-Based Modeling Methods to Predict Nitrate Exceedances in the Ogallala Aquifer in Texas. Water 2020;12:1023. [DOI: 10.3390/w12041023] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Number

Cited by Other Article(s)

Mamun MAA, Islam ARMT, Aktar MN, Uddin MN, Islam MS, Pal SC, Islam A, Bari ABMM, Idris AM, Senapathi V. Predicting groundwater phosphate levels in coastal multi-aquifers: A geostatistical and data-driven approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;953:176024. [PMID: 39241889 DOI: 10.1016/j.scitotenv.2024.176024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 08/19/2024] [Accepted: 09/02/2024] [Indexed: 09/09/2024]

Abstract

The groundwater (GW) resource plays a central role in securing water supply in the coastal region of Bangladesh and therefore the future sustainability of this valuable resource is crucial for the area. However, there is limited research on the driving factors and prediction of phosphate concentration in groundwater. In this work, geostatistical modeling, self-organizing maps (SOM) and data-driven algorithms were combined to determine the driving factors and predict GW phosphate content in coastal multi-aquifers in southern Bangladesh. The SOM analysis identified three distinct spatial patterns: K+Na+pH, Ca2+Mg2+NO₃-, and HCO₃-SO₄2-PO43-F-. Four data-driven algorithms, including CatBoost, Gradient Boosting Machine (GBM), Long Short-Term Memory (LSTM), and Support Vector Regression (SVR) were used to predict phosphate concentration in GW using 380 samples and 15 prediction parameters. Forecasting accuracy was evaluated using RMSE, R2, RAE, CC, and MAE. Phosphate dissolution and saltwater intrusion, along with phosphorus fertilizers, increase PO43- content in GW. Using input parameters selected by multicollinearity and SOM, the CatBoost model showed exceptional performance in both training (RMSE = 0.002, MAE = 0.001, R2 = 0.999, RAE = 0.057, CC = 1.00) and testing (RMSE = 0.001, MAE = 0.002, R2 = 0.989, RAE = 0.057, CC = 0.998). Na+, K+, and Mg2+ significantly influenced prediction accuracy. The uncertainty study revealed a low standard error for the CatBoost model, indicating robustness and consistency. Semi-variogram models confirmed that the most influential attributes showed weak dependence, suggesting that agricultural runoff increases the heterogeneity of PO43- distribution in GW. These findings are crucial for developing conservation and strategic plans for sustainable utilization of coastal GW resources.

Collapse

Tian Y, Liu Q, Ji Y, Dang Q, Sun Y, He X, Liu Y, Su J. Prediction of sulfate concentrations in groundwater in areas with complex hydrogeological conditions based on machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;923:171312. [PMID: 38423319 DOI: 10.1016/j.scitotenv.2024.171312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 02/16/2024] [Accepted: 02/25/2024] [Indexed: 03/02/2024]

Abstract

The persistent and increasing levels of sulfate due to a variety of human activities over the last decades present a widely concerning environmental issue. Understanding the controlling factors of groundwater sulfate and predicting sulfate concentration is critical for governments or managers to provide information on groundwater protection. In this study, the integration of self-organizing map (SOM) approach and machine learning (ML) modeling offers the potential to determine the factors and predict sulfate concentrations in the Huaibei Plain, where groundwater is enriched with sulfate and the areas have complex hydrogeological conditions. The SOM calculation was used to illustrate groundwater hydrochemistry and analyze the correlations among the hydrochemical parameters. Three ML algorithms including random forest (RF), support vector machine (SVM), and back propagation neural network (BPNN) were adopted to predict sulfate levels in groundwater by using 501 groundwater samples and 8 predictor variables. The prediction performance was evaluated through statistical metrics (R2, MSE and MAE). Mine drainage mainly facilitated increase in groundwater SO42- while gypsum dissolution and pyrite oxidation were found another two potential sources. The major water chemistry type was Ca-HCO3. The dominant cation was Na+ while the dominant anion was HCO3-. There was an intuitive correlation between groundwater sulfate and total dissolved solids (TDS), Cl-, and Na+. By using input variables identified by the SOM method, the evaluation results of ML algorithms showed that the R2, MSE and MAE of RF, SVM, BPNN were 0.43-0.70, 0.16-0.49 and 0.25-0.44. Overall, BPNN showed the best prediction performance and had higher R2 values and lower error indices. TDS and Na+ had a high contribution to the prediction accuracy. These findings are crucial for developing groundwater protection and remediation policies, enabling more sustainable management.

Collapse

Prediction and Interpretation of Water Quality Recovery after a Disturbance in a Water Treatment System Using Artificial Intelligence. WATER 2022. [DOI: 10.3390/w14152423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Park J, Lee WH, Kim KT, Park CY, Lee S, Heo TY. Interpretation of ensemble learning to predict water quality using explainable artificial intelligence. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022;832:155070. [PMID: 35398119 DOI: 10.1016/j.scitotenv.2022.155070] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 03/31/2022] [Accepted: 04/02/2022] [Indexed: 06/14/2023]

Abstract

Algal bloom is a significant issue when managing water quality in freshwater; specifically, predicting the concentration of algae is essential to maintaining the safety of the drinking water supply system. The chlorophyll-a (Chl-a) concentration is a commonly used indicator to obtain an estimation of algal concentration. In this study, an XGBoost ensemble machine learning (ML) model was developed from eighteen input variables to predict Chl-a concentration. The composition and pretreatment of input variables to the model are important factors for improving model performance. Explainable artificial intelligence (XAI) is an emerging area of ML modeling that provides a reasonable interpretation of model performance. The effect of input variable selection on model performance was estimated, where the priority of input variable selection was determined using three indices: Shapley value (SHAP), feature importance (FI), and variance inflation factor (VIF). SHAP analysis is an XAI algorithm designed to compute the relative importance of input variables with consistency, providing an interpretable analysis for model prediction. The XGB models simulated with independent variables selected using three indices were evaluated with root mean square error (RMSE), RMSE-observation standard deviation ratio, and Nash-Sutcliffe efficiency. This study shows that the model exhibited the most stable performance when the priority of input variables was determined by SHAP. This implies that on-site monitoring can be designed to collect the selected input variables from the SHAP analysis to reduce the cost of overall water quality analysis. The independent variables were further analyzed using SHAP summary plot, force plot, target plot, and partial dependency plot to provide understandable interpretation on the performance of the XGB model. While XAI is still in the early stages of development, this study successfully demonstrated a good example of XAI application to improve the interpretation of machine learning model performance in predicting water quality.

Collapse

Hamlin QF, Martin SL, Kendall AD, Hyndman DW. Examining Relationships Between Groundwater Nitrate Concentrations in Drinking Water and Landscape Characteristics to Understand Health Risks. GEOHEALTH 2022;6:e2021GH000524. [PMID: 35509496 PMCID: PMC9060635 DOI: 10.1029/2021gh000524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 02/11/2022] [Accepted: 03/31/2022] [Indexed: 06/14/2023]

Alkindi KM, Mukherjee K, Pandey M, Arora A, Janizadeh S, Pham QB, Anh DT, Ahmadi K. Prediction of groundwater nitrate concentration in a semiarid region using hybrid Bayesian artificial intelligence approaches. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022;29:20421-20436. [PMID: 34735705 DOI: 10.1007/s11356-021-17224-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 10/21/2021] [Indexed: 06/13/2023]

Neural Network and Random Forest-Based Analyses of the Performance of Community Drinking Water Arsenic Treatment Plants. WATER 2021. [DOI: 10.3390/w13243507] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Abstract A plethora of technologies has been developed over decades of extensive research on arsenic remediation, although the technical and financial perspective of arsenic removal plants in the field requires critical evaluation. In the present study, focusing on some of the pronounced arsenic-affected areas in West Bengal, India, we assessed the implementation and operation of different arsenic removal technologies using a dataset of 4000 spatio-temporal data collected from an in-depth field survey of 136 arsenic removal plants engaged in the public water supply. Our statistical analysis of this dataset indicates a 120% rise in the average cumulative capacity of the plants during 2014–2021. The majorities of the plants are based on the activated alumina with FeCl3 technology and serve about 49% of the population in the study area. The average cost of water production for the activated alumina with FeCl3 technology was found to be ₹7.56/m3 (USD $1 ≈ INR ₹70), while the lowest was ₹0.39/m3 for granular ferric hydroxide technology. A machine learning-based framework was employed to analyze the impact of water quality and treatment plant parameters on the removal efficiency, capital, and operational cost of the plants. The artificial neural network model exhibited adequate statistical significance, with a high F-value and R2 of 5830.94 and 0.72 for the capital cost model, 136,954, and 0.98 for the operational cost model, respectively. The relative importance of the process variables was identified through random forest models. The models indicated that flow rate, media, and chemicals are the predominant costs, while contaminant loading in influent water and a coagulating agent was important for removal efficiency. The established framework may be instrumental as a decision-making tool for water providers to assess the expected performance and financial involvement for proposed or ongoing arsenic removal plants concerning various design and quality parameters. Collapse

Comparative Analysis of Artificial Intelligence Models for Accurate Estimation of Groundwater Nitrate Concentration. SENSORS 2020;20:s20205763. [PMID: 33053663 PMCID: PMC7599737 DOI: 10.3390/s20205763] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 09/23/2020] [Accepted: 09/28/2020] [Indexed: 11/17/2022]

Abstract

Prediction of the groundwater nitrate concentration is of utmost importance for pollution control and water resource management. This research aims to model the spatial groundwater nitrate concentration in the Marvdasht watershed, Iran, based on several artificial intelligence methods of support vector machine (SVM), Cubist, random forest (RF), and Bayesian artificial neural network (Baysia-ANN) machine learning models. For this purpose, 11 independent variables affecting groundwater nitrate changes include elevation, slope, plan curvature, profile curvature, rainfall, piezometric depth, distance from the river, distance from residential, Sodium (Na), Potassium (K), and topographic wetness index (TWI) in the study area were prepared. Nitrate levels were also measured in 67 wells and used as a dependent variable for modeling. Data were divided into two categories of training (70%) and testing (30%) for modeling. The evaluation criteria coefficient of determination (R²), mean absolute error (MAE), root mean square error (RMSE), and Nash–Sutcliffe efficiency (NSE) were used to evaluate the performance of the models used. The results of modeling the susceptibility of groundwater nitrate concentration showed that the RF (R² = 0.89, RMSE = 4.24, NSE = 0.87) model is better than the other Cubist (R² = 0.87, RMSE = 5.18, NSE = 0.81), SVM (R² = 0.74, RMSE = 6.07, NSE = 0.74), Bayesian-ANN (R² = 0.79, RMSE = 5.91, NSE = 0.75) models. The results of groundwater nitrate concentration zoning in the study area showed that the northern parts of the case study have the highest amount of nitrate, which is higher in these agricultural areas than in other areas. The most important cause of nitrate pollution in these areas is agriculture activities and the use of groundwater to irrigate these crops and the wells close to agricultural areas, which has led to the indiscriminate use of chemical fertilizers by irrigation or rainwater of these fertilizers is washed and penetrates groundwater and pollutes the aquifer.

Collapse

Prediction of Chlorophyll-a Concentrations in the Nakdong River Using Machine Learning Methods. WATER 2020. [DOI: 10.3390/w12061822] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]