1
|
Madrigal-Solís H, Vadillo-Pérez I, Jiménez-Gavilán P, Fonseca-Sánchez A, Quesada-Hernández L, Calderón-Sánchez H, Gómez-Cruz A, Murillo JH, Salazar RP. A multidisciplinary approach using hydrogeochemistry, δ 15N NO3 isotopes, land use, and statistical tools in evaluating nitrate pollution sources and biochemical processes in Costa Rican volcanic aquifers. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 951:174996. [PMID: 39067595 DOI: 10.1016/j.scitotenv.2024.174996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 07/12/2024] [Accepted: 07/21/2024] [Indexed: 07/30/2024]
Abstract
Nitrate pollution threatens the Barva and Colima multi-aquifer system, the primary drinking water source in the Greater Metropolitan Area of Costa Rica. In addressing nitrate contamination dynamics, this study proposes an integrated approach by combining multivariate statistical analyses, hydrochemical parameters, sewage discharge, and regional land-use and land-cover patterns to assess the extent and degree of contamination, dominant biogeochemical processes, and refine the interpretation of nitrate sources previously derived solely from δ15NNO3 information. Over seven years (2015-2022), 714 groundwater samples from 43 sites were analyzed for nitrate and major ions, including two sampling campaigns for dissolved organic and inorganic carbon, nitrite, ammonium, FeTotal, MnTotal, and δ15NNO3 analyses. The findings presented elevated nitrate concentrations in urban and agricultural/urban areas, surpassing the Maximum Concentration Levels on several occasions, and oxidizing conditions favoring mineralization and nitrification processes in unconfined Barva and locally confined Upper Colima/Lower Colima aquifers. Similar nitrate contents and spatial patterns in agricultural and urban zones in the shallow Barva aquifer suggest comparable contributions from nitrogen fertilizers and urban wastewaters despite the gradual increase in urban land cover and the reduction of agricultural areas. Isotopic analyses and dissolved organic carbon (DOC) indicate a shift in nitrate sources from agricultural to urban areas in both Barva and Colima aquifers. Principal Component and Hierarchical Cluster Analyses link land use, nitrate sources, and water quality. Three distinct sample clusters aligned with forest/grassland, agricultural/urban, and urban land use, emphasizing the impact of anthropogenic activities on groundwater quality, even in the deeper Colima aquifers. The study challenges nitrate isotope mixing models, enhancing accuracy in identifying pollution sources and assessing the spatial extent of contamination by incorporating DOC and other hydrochemical parameters. Similar outcomes, with and without the use of nitrate isotopes, reinforce the usefulness of the integrated approach, providing a practical and cost-effective alternative.
Collapse
Affiliation(s)
- Helga Madrigal-Solís
- Programa de Hidrología Ambiental, Escuela de Ciencias Biológicas, Universidad Nacional, 40101, Heredia, Costa Rica.
| | - Iñaki Vadillo-Pérez
- Grupo de Hidrogeología, Departamento de Ecología y Geología, Universidad de Málaga, 29016 Málaga, Spain
| | - Pablo Jiménez-Gavilán
- Grupo de Hidrogeología, Departamento de Ecología y Geología, Universidad de Málaga, 29016 Málaga, Spain
| | - Alicia Fonseca-Sánchez
- Programa de Hidrología Ambiental, Escuela de Ciencias Biológicas, Universidad Nacional, 40101, Heredia, Costa Rica
| | - Luis Quesada-Hernández
- Programa de Hidrología Ambiental, Escuela de Ciencias Biológicas, Universidad Nacional, 40101, Heredia, Costa Rica
| | - Hazel Calderón-Sánchez
- Programa de Hidrología Ambiental, Escuela de Ciencias Biológicas, Universidad Nacional, 40101, Heredia, Costa Rica
| | - Alicia Gómez-Cruz
- Programa de Hidrología Ambiental, Escuela de Ciencias Biológicas, Universidad Nacional, 40101, Heredia, Costa Rica
| | - Jorge Herrera Murillo
- Laboratorio de Análisis Ambiental, Escuela de Ciencias Ambientales, Universidad Nacional, 40101, Heredia, Costa Rica
| | - Roy Pérez Salazar
- Laboratorio de Gestión de Desechos y Aguas Residuales (LAGEDE), Escuela de Química, Universidad Nacional, 40101, Heredia, Costa Rica
| |
Collapse
|
2
|
Torres-Martínez JA, Mahlknecht J, Kumar M, Loge FJ, Kaown D. Advancing groundwater quality predictions: Machine learning challenges and solutions. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 949:174973. [PMID: 39053524 DOI: 10.1016/j.scitotenv.2024.174973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 06/22/2024] [Accepted: 07/20/2024] [Indexed: 07/27/2024]
Abstract
Machine learning (ML) is revolutionizing groundwater quality research by enhancing predictive accuracy and management strategies for contamination. This comprehensive review explores the evolution of ML technologies and their integration into environmental science, assessing 230 papers to understand the advancements and challenges in groundwater quality research. It reveals that a substantial portion of the research neglects critical preprocessing steps, crucial for model accuracy, with 83 % of the studies overlooking this phase. Furthermore, while model optimization is more commonly addressed, being implemented in 65 % of the papers, there is a noticeable gap in model interpretability, with only 15 % of the research providing explanations for model outcomes. Comparative evaluation of ML algorithms and careful selection of evaluation metrics are deemed essential for determining model fitness and reliability. The review underscores the need for interdisciplinary collaboration, methodological rigor, and continuous innovation to advance ML in groundwater management. By addressing these challenges and implementing solutions, the full potential of ML can be harnessed to tackle complex environmental issues and ensure sustainable groundwater management. This comprehensive and critical review paper can serve as a guiding framework to establish minimum standards for developing ML in groundwater quality studies.
Collapse
Affiliation(s)
- Juan Antonio Torres-Martínez
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico
| | - Jürgen Mahlknecht
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico.
| | - Manish Kumar
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico; School of Engineering, University of Petroleum & Energy Studies, Dehradun, Uttarakhand 248007, India
| | - Frank J Loge
- Department of Civil and Environmental Engineering, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| | - Dugin Kaown
- School of Earth and Environmental Sciences, Seoul National University, Seoul 08826, Republic of Korea
| |
Collapse
|
3
|
Li X, Liang G, Wang L, Yang Y, Li Y, Li Z, He B, Wang G. Identifying the spatial pattern and driving factors of nitrate in groundwater using a novel framework of interpretable stacking ensemble learning. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2024; 46:482. [PMID: 39470928 PMCID: PMC11522174 DOI: 10.1007/s10653-024-02201-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 08/27/2024] [Indexed: 11/01/2024]
Abstract
Groundwater nitrate contamination poses a potential threat to human health and environmental safety globally. This study proposes an interpretable stacking ensemble learning (SEL) framework for enhancing and interpreting groundwater nitrate spatial predictions by integrating the two-level heterogeneous SEL model and SHapley Additive exPlanations (SHAP). In the SEL model, five commonly used machine learning models were utilized as base models (gradient boosting decision tree, extreme gradient boosting, random forest, extremely randomized trees, and k-nearest neighbor), whose outputs were taken as input data for the meta-model. When applied to the agricultural intensive area, the Eden Valley in the UK, the SEL model outperformed the individual models in predictive performance and generalization ability. It reveals a mean groundwater nitrate level of 2.22 mg/L-N, with 2.46% of sandstone aquifers exceeding the drinking standard of 11.3 mg/L-N. Alarmingly, 8.74% of areas with high groundwater nitrate remain outside the designated nitrate vulnerable zones. Moreover, SHAP identified that transmissivity, baseflow index, hydraulic conductivity, the percentage of arable land, and the C:N ratio in the soil were the top five key driving factors of groundwater nitrate. With nitrate threatening groundwater globally, this study presents a high-accuracy, interpretable, and flexible modeling framework that enhances our understanding of the mechanisms behind groundwater nitrate contamination. It implies that the interpretable SEL framework has great promise for providing valuable evidence for environmental management, water resource protection, and sustainable development, particularly in the data-scarce area.
Collapse
Affiliation(s)
- Xuan Li
- School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China
- British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK
| | - Guohua Liang
- School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China
| | - Lei Wang
- British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK.
| | - Yuesuo Yang
- Key Laboratory of Groundwater Resources and Environment, Ministry of Education, Jilin University, Changchun, 130021, China
| | - Yuanyin Li
- British Geological Survey, Keyworth, Nottingham, NG12 5GG, UK
- Department of Geography, Durham University, Durham, DH1 3LE, UK
| | - Zhongguo Li
- Liaoning Water Affairs Service Center, Shenyang, 110003, China
| | - Bin He
- School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China
| | - Guoli Wang
- School of Hydraulic Engineering, Dalian University of Technology, Dalian, 116024, China
| |
Collapse
|
4
|
Koch J, Kim H, Tirado-Conde J, Hansen B, Møller I, Thorling L, Troldborg L, Voutchkova D, Højberg AL. Modeling groundwater redox conditions at national scale through integration of sediment color and water chemistry in a machine learning framework. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 947:174533. [PMID: 38972412 DOI: 10.1016/j.scitotenv.2024.174533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 07/01/2024] [Accepted: 07/04/2024] [Indexed: 07/09/2024]
Abstract
Redox conditions play a crucial role in determining the fate of many contaminants in groundwater, impacting ecosystem services vital for both the aquatic environment and human water supply. Geospatial machine learning has previously successfully modelled large-scale redox conditions. This study is the first to consolidate the complementary information provided by sediment color and water chemistry to enhance our understanding of redox conditions in Denmark. In the first step, the depth to the first redox interface is modelled using sediment color from 27,042 boreholes. In the second step, the depth of the first redox interface is compared against water chemistry data at 22,198 wells to classify redox complexity. The absence of nitrate containing water below the first redox interface is referred to as continuous redox conditions. In contrast, discontinuous redox conditions are identified by the presence of nitrate below the first redox interface. Both models are built using 20 covariate maps, encompassing diverse hydrologically relevant information. The first redox interface is modelled with a mean error of 0.0 m and a root-mean-squared error of 8.0 m. The redox complexity model attains an accuracy of 69.8 %. Results indicate a mean depth to the first redox interface of 8.6 m and a standard deviation of 6.5 m. 60 % of Denmark is classified as discontinuous, indicating complex redox conditions, predominantly collocated in clay rich glacial landscapes. Both maps, i.e., first redox interface and redox complexity are largely driven by the water table and hydrogeology. The developed maps contribute to our understanding of subsurface redox processes, supporting national-scale land-use and water management.
Collapse
Affiliation(s)
- Julian Koch
- Geological Survey of Denmark and Greenland, Department of Hydrology, Copenhagen, Denmark.
| | - Hyojin Kim
- Geological Survey of Denmark and Greenland, Department of Geochemistry, Copenhagen, Denmark
| | - Joel Tirado-Conde
- Geological Survey of Denmark and Greenland, Department of Hydrology, Copenhagen, Denmark
| | - Birgitte Hansen
- Geological Survey of Denmark and Greenland, Department of Geochemistry, Copenhagen, Denmark
| | - Ingelise Møller
- Geological Survey of Denmark and Greenland, Department of Near Surface Land and Marine Geology, Århus, Denmark
| | - Lærke Thorling
- Geological Survey of Denmark and Greenland, Department of Geochemistry, Copenhagen, Denmark
| | - Lars Troldborg
- Geological Survey of Denmark and Greenland, Department of Hydrology, Copenhagen, Denmark
| | - Denitza Voutchkova
- Geological Survey of Denmark and Greenland, Department of Geochemistry, Copenhagen, Denmark
| | - Anker Lajer Højberg
- Geological Survey of Denmark and Greenland, Department of Hydrology, Copenhagen, Denmark
| |
Collapse
|
5
|
Jalali R, Tishehzan P, Hashemi H. A machine learning framework for spatio-temporal vulnerability mapping of groundwaters to nitrate in a data scarce region in Lenjanat Plain, Iran. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:42088-42110. [PMID: 38862797 DOI: 10.1007/s11356-024-33920-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 06/03/2024] [Indexed: 06/13/2024]
Abstract
The temporal aspect of groundwater vulnerability to contaminants such as nitrate is often overlooked, assuming vulnerability has a static nature. This study bridges this gap by employing machine learning with Detecting Breakpoints and Estimating Segments in Trend (DBEST) algorithm to reveal the underlying relationship between nitrate, water table, vegetation cover, and precipitation time series, that are related to agricultural activities and groundwater demand in a semi-arid region. The contamination probability of Lenjanat Plain has been mapped by comparing random forest (RF), support vector machine (SVM), and K-nearest-neighbors (KNN) models, fed with 32 input variables (dem-derived factors, physiography, distance and density maps, time series data). Also, imbalanced learning and feature selection techniques were investigated as supplementary methods, adding up to four scenarios. Results showed that the RF model, integrated with forward sequential feature selection (SFS) and SMOTE-Tomek resampling method, outperformed the other models (F1-score: 0.94, MCC: 0.83). The SFS techniques outperformed other feature selection methods in enhancing the accuracy of the models with the cost of computational expenses, and the cost-sensitive function proved more efficient in tackling imbalanced data issues than the other investigated methods. The DBEST method identified significant breakpoints within each time series dataset, revealing a clear association between agricultural practices along the Zayandehrood River and substantial nitrate contamination within the Lenjanat region. Additionally, the groundwater vulnerability maps created using the candid RF model and an ensemble of the best RF, SVM, and KNN models predicted mid to high levels of vulnerability in the central parts and the downhills in the southwest.
Collapse
Affiliation(s)
- Reza Jalali
- Department of Environmental Engineering, Faculty of Water and Environmental Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran.
| | - Parvaneh Tishehzan
- Department of Environmental Engineering, Faculty of Water and Environmental Engineering, Shahid Chamran University of Ahvaz, Ahvaz, Iran
| | - Hossein Hashemi
- Division of Water Resources Engineering & Center for Advanced Middle Eastern Studies, Lund University, Lund, Sweden
| |
Collapse
|
6
|
Hu Y, Liu C, Wollheim WM, Jiao T, Ma M. A hybrid deep learning approach to predict hourly riverine nitrate concentrations using routine monitored data. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 360:121097. [PMID: 38733844 DOI: 10.1016/j.jenvman.2024.121097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/26/2024] [Accepted: 05/04/2024] [Indexed: 05/13/2024]
Abstract
With high-frequency data of nitrate (NO3-N) concentrations in waters becoming increasingly important for understanding of watershed system behaviors and ecosystem managements, the accurate and economic acquisition of high-frequency NO3-N concentration data has become a key point. This study attempted to use coupled deep learning neural networks and routine monitored data to predict hourly NO3-N concentrations in a river. The hourly NO3-N concentration at the outlet of the Oyster River watershed in New Hampshire, USA, was predicted through neural networks with a hybrid model architecture coupling the Convolutional Neural Networks and the Long Short-Term Memory model (CNN-LSTM). The routine monitored data (the river depth, water temperature, air temperature, precipitation, specific conductivity, pH and dissolved oxygen concentrations) for model training were collected from a nested high-frequency monitoring network, while the high-frequency NO3-N concentration data obtained at the outlet were not included as inputs. The whole dataset was separated into training, validation, and testing processes according to the ratio of 5:3:2, respectively. The hybrid CNN-LSTM model with different input lengths (1d, 3d, 7d, 15d, 30d) displayed comparable even better performance than other studies with lower frequencies, showing mean values of the Nash-Sutcliffe Efficiency 0.60-0.83. Models with shorter input lengths demonstrated both the higher modeling accuracy and stability. The water level, water temperature and pH values at monitoring sites were main controlling factors for forecasting performances. This study provided a new insight of using deep learning networks with a coupled architecture and routine monitored data for high-frequency riverine NO3-N concentration forecasting and suggestions about strategies about variable and input length selection during preprocessing of input data.
Collapse
Affiliation(s)
- Yue Hu
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection (Chengdu University of Technology), Chengdu, 610059, China
| | - Chuankun Liu
- Sichuan Academy of Environmental Policy and Planning, Department of Ecology and Environment of Sichuan Province, Chengdu, 610059, China.
| | - Wilfred M Wollheim
- Department of Natural Resources and Environment, University of New Hampshire, Durham, NH, 03824, USA
| | - Tong Jiao
- State Key Laboratory of Geohazard Prevention and Geoenvironment Protection (Chengdu University of Technology), Chengdu, 610059, China
| | - Meng Ma
- China Institute of Water Resources and Hydropower Research, Beijing, 100048, China
| |
Collapse
|
7
|
Serra J, Marques-Dos-Santos C, Marinheiro J, Cruz S, Cameira MR, de Vries W, Dalgaard T, Hutchings NJ, Graversgaard M, Giannini-Kurina F, Lassaletta L, Sanz-Cobeña A, Quemada M, Aguilera E, Medinets S, Einarsson R, Garnier J. Assessing nitrate groundwater hotspots in Europe reveals an inadequate designation of Nitrate Vulnerable Zones. CHEMOSPHERE 2024; 355:141830. [PMID: 38552801 DOI: 10.1016/j.chemosphere.2024.141830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 03/07/2024] [Accepted: 03/26/2024] [Indexed: 04/02/2024]
Abstract
Monitoring networks show that the European Union Nitrates Directive (ND) has had mixed success in reducing nitrate concentrations in groundwater. By combining machine learning and monitored nitrate concentrations (1992-2019), we estimate the total area of nitrate hotspots in Europe to be 401,000 km2, with 47% occurring outside of Nitrate Vulnerable Zones (NVZs). We also found contrasting increasing or decreasing trends, varying per country and time periods. We estimate that only 5% of the 122,000 km2 of hotspots in 2019 will meet nitrate quality standards by 2040 and that these may be offset by the appearance of new hotspots. Our results reveal that the effectiveness of the ND is limited by both time-lags between the implementation of good practices and pollution reduction and an inadequate designation of NVZs. Substantial improvements in the designation and regulation of NVZs are necessary, as well as in the quality of monitoring stations in terms of spatial density and information available concerning sampling depth, if the objectives of EU legislation to protect groundwater are to be achieved.
Collapse
Affiliation(s)
- J Serra
- Forest Research Centre CEF, Associate Laboratory TERRA, Instituto Superior de Agronomía, Universidade de Lisboa, 1349-017, Lisbon, Portugal.
| | - C Marques-Dos-Santos
- Forest Research Centre CEF, Associate Laboratory TERRA, Instituto Superior de Agronomía, Universidade de Lisboa, 1349-017, Lisbon, Portugal
| | - J Marinheiro
- Forest Research Centre CEF, Associate Laboratory TERRA, Instituto Superior de Agronomía, Universidade de Lisboa, 1349-017, Lisbon, Portugal
| | - S Cruz
- Forest Research Centre CEF, Associate Laboratory TERRA, Instituto Superior de Agronomía, Universidade de Lisboa, 1349-017, Lisbon, Portugal
| | - M R Cameira
- LEAF-Linking Landscape, Environment, Agriculture and Food-Research Center, Associated Laboratory TERRA, Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017, Lisbon, Portugal
| | - W de Vries
- Environmental Systems Analysis Group, Wageningen University and Research, Wageningen, the Netherlands
| | - T Dalgaard
- Department of Agroecology, Aarhus University, Blichers Allé 20, DK-8830, Tjele, Denmark
| | - N J Hutchings
- Department of Agroecology, Aarhus University, Blichers Allé 20, DK-8830, Tjele, Denmark
| | - M Graversgaard
- Department of Agroecology, Aarhus University, Blichers Allé 20, DK-8830, Tjele, Denmark
| | - F Giannini-Kurina
- Department of Agroecology, Aarhus University, Blichers Allé 20, DK-8830, Tjele, Denmark
| | - L Lassaletta
- CEIGRAM/ETSIAAB, Universidad Politécnica de Madrid, 28040, Madrid, Spain
| | - A Sanz-Cobeña
- CEIGRAM/ETSIAAB, Universidad Politécnica de Madrid, 28040, Madrid, Spain
| | - M Quemada
- CEIGRAM/ETSIAAB, Universidad Politécnica de Madrid, 28040, Madrid, Spain
| | - E Aguilera
- CEIGRAM/ETSIAAB, Universidad Politécnica de Madrid, 28040, Madrid, Spain
| | - S Medinets
- Odesa National I. I. Mechnikov University, 7 Mayakovskogo lane, 65082, Odesa, Ukraine; UK Centre for Ecology & Hydrology (Edinburgh), Bush Estate, EH26 0QB, Penicuik, UK
| | - R Einarsson
- Department of Energy and Technology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - J Garnier
- SU CNRS EPHE, UMR Metis, 7619, Paris, France
| |
Collapse
|
8
|
Cooke AK, Willkommen S, Broda S. Analysing agricultural plant protection product concentrations in groundwater in Germany: Nationwide database with site and compound insights. ENVIRONMENTAL RESEARCH 2024; 248:118231. [PMID: 38301764 DOI: 10.1016/j.envres.2024.118231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 11/14/2023] [Accepted: 01/15/2024] [Indexed: 02/03/2024]
Abstract
Pesticides from agricultural practices are among the most pressing reasons why groundwater sources do not reach the good chemical status standards as required by the European Water Framework directive. Complementary to previous federal pesticide reports, we analysed groundwater-monitoring data from 13 German Länder assembled in a database consisting of 26.192 groundwater measuring sites sampled between 1973 and 2021 of in total 521 parent compounds and metabolites. This study focuses on agricultural plant protection products. The monitored substance spectrum and site density developed over time and differs between Länder. More than 95 % of all samples lie below the respective (multiple) limits of quantification (LOQ). We thus report the frequency of exceedance above concentration thresholds, which allows to compare measurements temporally and spatially. Pesticide detections were found in all aquifer types, land uses and well screen depths. Most detections of higher concentrations were found in agricultural areas, at shallow screen depth in porous aquifers. Karst aquifers showed also a higher percentage of samples in higher concentration classes. Metabolites with high mobility and persistence were found in higher concentration ranges. Herbicides and metabolites thereof dominate the top 20 of pesticides that most frequently exceed 0.1 μg L-1. The ranking for 2010-2019 includes both authorised and banned compounds and their occurrence is discussed in the context of their mobility, persistence and underlying monitoring density. Yearly exceedance frequencies above 0.05, 0.1 μg L-1 and higher thresholds of metazachlor and its esa-metabolite, and national sales data of the parent compound did not show a temporal correlation in subsequent years. This study stresses the need for the harmonisation of heterogeneous pesticide data. Further, a characterisation of the groundwater data used to analyse pesticide occurrence in selected concentration ranges for relevant site factors and compound properties and provides a pesticide ranking based on exceedance frequencies is provided.
Collapse
Affiliation(s)
- Anne-Karin Cooke
- Federal Institute for Geosciences and Natural Resources, Wilhelmstraße 25-30, 13593, Berlin, Germany.
| | - Sandra Willkommen
- Federal Institute for Geosciences and Natural Resources, Wilhelmstraße 25-30, 13593, Berlin, Germany
| | - Stefan Broda
- Federal Institute for Geosciences and Natural Resources, Wilhelmstraße 25-30, 13593, Berlin, Germany
| |
Collapse
|
9
|
Nam SH, Kwon S, Kim YD. Development of a basin-scale total nitrogen prediction model by integrating clustering and regression methods. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 920:170765. [PMID: 38340839 DOI: 10.1016/j.scitotenv.2024.170765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 01/15/2024] [Accepted: 02/04/2024] [Indexed: 02/12/2024]
Abstract
Nutrient runoff into rivers caused by human activity has led to global eutrophication issues. The Nakdong River in South Korea is currently facing significant challenges related to eutrophication and harmful algal blooms, underscoring the critical importance of managing total nitrogen (T-N) levels. However, traditional methods of indoor analysis, which depend on sampling, are labor-intensive and face limitations in collecting high-frequency data. Despite advancements in sensor allowing for the measurement of various parameters, sensors still cannot directly measure T-N, necessitating surrogate regression methods. Therefore, we conducted T-N predictions using a water quality dataset collected from 2018 to 2022 at 157 observatories within the Nakdong River basin. To account for the water quality characteristics of each location, we employed a clustering technique to divide the basin and compared a Gaussian mixture model with K-means clustering. Moreover, optimal regressor for each cluster was selected by comparing multiple linear regression (MLR), random forest, and XGBoost. The results showed that forming four clusters via K-means clustering was the most suitable approach and MLR was reasonably accurate for all clusters. Subsequently, recursive feature elimination cross-validation was used to identify suitable parameters for T-N prediction, thus leading to the construction of high-accuracy T-N prediction models. Clustering was useful not only for improving the regressors but also for spatially analyzing the water quality characteristics of the Nakdong River. The MLR model can reveal causal relationships and thus is useful for decision-making. The results of this study revealed that the combination of a simple linear regression model and clustering method can be applied to a wide watershed. The clustering-based regression model showed potential for accurately predicting T-N at the basin level and is expected to contribute to nationwide water quality management through future applications in various fields.
Collapse
Affiliation(s)
- Su Han Nam
- Department of Civil and Environmental Engineering, Myongji University, Yongin, South Korea
| | - Siyoon Kwon
- Center for Water and the Environment, Department of Civil, Architectural and Environmental Engineering, The University of Texas at Austin, Austin, TX, USA
| | - Young Do Kim
- Department of Civil and Environmental Engineering, Myongji University, Yongin, South Korea.
| |
Collapse
|
10
|
Liang Y, Zhang X, Gan L, Chen S, Zhao S, Ding J, Kang W, Yang H. Mapping specific groundwater nitrate concentrations from spatial data using machine learning: A case study of chongqing, China. Heliyon 2024; 10:e27867. [PMID: 38524545 PMCID: PMC10958364 DOI: 10.1016/j.heliyon.2024.e27867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 02/10/2024] [Accepted: 03/07/2024] [Indexed: 03/26/2024] Open
Abstract
Groundwater resources is not only important essential water resources but also imperative connectors within the intricate framework of the ecological environment. High nitrate concentrations in groundwater can exerting adverse impacts on human health. It is imperative to accurately delineate the distribution characteristics of groundwater nitrate concentrations. Four different machine learning models (Gradient Boosting Regression (GB), Random Forest Regression (RF), Extreme Gradient Boosting Regression (XG) and Adaptive Boosting Regression (AD)) which combine spatial environmental data and different radius contributing area was developed to predict the distribution of nitrate concentration in groundwater. The models use 595 groundwater samples and included topography, remote sensing, hydrogeological and hydrological, climate, nitrate input, and socio-economic predictor. Gradient Boosting Regression model outperforms the other models (R2 = 0.627, MAE = 0.529, RMSE = 0.705, PICP = 0.924 for test dataset) under 500 m radius contributing area. A high-resolution (1 km) groundwater nitrate concentration distribution map reveal in the majority of the study area, groundwater nitrate concentrations are below 1 mg/L and high nitrate concentration (>10 mg/L) proportion in southeast, northeast and central main urban area karst valley regions is 1.89%, 0.91%, and 0.38% respectively. In study area, hydrogeological conditions, soil parameters, nitrogen input factors, and percentage of arable land are among the most influential explanatory factors. This work, serving as the inaugural application of utilizing effective spatial methods for predicting groundwater nitrate concentrations in Chongqing city, furnish decision-making support for the prevention and control of groundwater pollution, particularly in areas primarily dependent on groundwater for water supply and holds profound significance as a milestone achievement.
Collapse
Affiliation(s)
- Yuanyi Liang
- Observation and Research Station of Ecological Restoration for Chongqing Typical Mining Areas, Ministry of Natural Resources (Chongqing Institute of Geology and Mineral Resources) Chongqing, 401120, China
| | - Xingjun Zhang
- Observation and Research Station of Ecological Restoration for Chongqing Typical Mining Areas, Ministry of Natural Resources (Chongqing Institute of Geology and Mineral Resources) Chongqing, 401120, China
| | - Lin Gan
- Chongqing Institute of Geological Environment Monitoring, Chongqing, 401122, China
| | - Si Chen
- Observation and Research Station of Ecological Restoration for Chongqing Typical Mining Areas, Ministry of Natural Resources (Chongqing Institute of Geology and Mineral Resources) Chongqing, 401120, China
| | - Shandao Zhao
- Chongqing Institute of Geological Environment Monitoring, Chongqing, 401122, China
| | - Jihui Ding
- Chongqing Institute of Geological Environment Monitoring, Chongqing, 401122, China
| | - Wulue Kang
- Observation and Research Station of Ecological Restoration for Chongqing Typical Mining Areas, Ministry of Natural Resources (Chongqing Institute of Geology and Mineral Resources) Chongqing, 401120, China
| | - Han Yang
- Observation and Research Station of Ecological Restoration for Chongqing Typical Mining Areas, Ministry of Natural Resources (Chongqing Institute of Geology and Mineral Resources) Chongqing, 401120, China
| |
Collapse
|
11
|
Chu Y, He B, He J, Zou H, Sun J, Wen D. Revealing the drivers and genesis of NO 3-N pollution classification in shallow groundwater of the Shaying River Basin by explainable machine learning and pathway analysis method. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 918:170742. [PMID: 38336062 DOI: 10.1016/j.scitotenv.2024.170742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 02/04/2024] [Accepted: 02/04/2024] [Indexed: 02/12/2024]
Abstract
Nitrate (NO3-N), as one of the ubiquitous contaminants in groundwater worldwide, has posed a serious threat to public health and the ecological environment. Despite extensive research on its genesis, little is known about the differences in the genesis of NO3-N pollution across different concentrations. Herein, a study of NO3-N pollution concentration classification was conducted using the Shaying River Basin as a typical area, followed by examining the genesis differences across different pollution classifications. Results demonstrated that three classifications (0-9.98 mg/L, 10.14-27.44 mg/L, and 28.34-136.30 mg/L) were effectively identified for NO3-N pollution using Jenks natural breaks method. Random forest exhibited superior performance in describing NO3-N pollution and was thereby affirmed as the optimal explanatory method. With this method coupling SEMs, the genesis of different NO3-N pollution classifications was proven to be significantly different. Specifically, strongly reducing conditions represented by Mn2+, Eh, and NO2-N played a dominant role in causing residual NO3-N at low levels. Manure and sewage (represented by Cl-) leaching into groundwater through precipitation is mainly responsible for NO3-N in the 10-30 mg/L classification, with a cumulative contribution rate exceeding 80 %. NO3-N concentrations >30 mg/L are primarily caused by the anthropogenic loads stemming from manure, sewage, and agricultural fertilization (represented by Cl- and K+) infiltrating under precipitation in vulnerable hydrogeological conditions. Pathway analysis based on standardized effect and significance further confirmed the rationality and reliability of the above results. The findings will provide more accurate information for policymakers in groundwater resource management to implement effective strategies to mitigate NO3-N pollution.
Collapse
Affiliation(s)
- Yanjia Chu
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China
| | - Baonan He
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China.
| | - Jiangtao He
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China.
| | - Hua Zou
- Key Laboratory of Groundwater Conservation of MWR, China University of Geosciences, Beijing 100083, PR China; School of Water Resources and Environment, China University of Geosciences (Beijing), Beijing 100083, PR China
| | - Jichao Sun
- Institute of Hydrogeology and Environmental Geology, Chinese Academy of Geological Sciences, Shijiazhuang 050061, PR China
| | - Dongguang Wen
- Development Research Center of the Ministry of Water Resources, Beijing 100038, PR China
| |
Collapse
|
12
|
Sakizadeh M, Zhang C, Milewski A. Spatial distribution pattern and health risk of groundwater contamination by cadmium, manganese, lead and nitrate in groundwater of an arid area. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2024; 46:80. [PMID: 38367130 DOI: 10.1007/s10653-023-01845-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Accepted: 12/21/2023] [Indexed: 02/19/2024]
Abstract
Combining the results of base models to create a meta-model is one of the ensemble approaches known as stacking. In this study, stacking of five base learners, including eXtreme gradient boosting, random forest, feed-forward neural networks, generalized linear models with Lasso or Elastic Net regularization, and support vector machines, was used to study the spatial variation of Mn, Cd, Pb, and nitrate in Qom-Kahak Aquifers, Iran. The stacking strategy proved to be an effective substitute predictor for existing machine learning approaches due to its high accuracy and stability when compared to individual learners. Contrarily, there was not any best-performing base model for all of the involved parameters. For instance, in the case of cadmium, random forest produced the best results, with adjusted R2 and RMSE of 0.108 and 0.014, as opposed to 0.337 and 0.013 obtained by the stacking method. The Mn and Cd showed a tight link with phosphate by the redundancy analysis (RDA). This demonstrates the effect of phosphate fertilizers on agricultural operations. In order to analyze the causes of groundwater pollution, spatial methodologies can be used with multivariate analytic techniques, such as RDA, to help uncover hidden sources of contamination that would otherwise go undetected. Lead has a larger health risk than nitrate, according to the probabilistic health risk assessment, which found that 34.4% and 6.3% of the simulated values for children and adults, respectively, were higher than HQ = 1. Furthermore, cadmium exposure risk affected 84% of children and 47% of adults in the research area.
Collapse
Affiliation(s)
- Mohamad Sakizadeh
- Department of Environmental Sciences, Shahid Rajaee Teacher Training University, Lavizan, 1678815811, Tehran, Iran.
| | - Chaosheng Zhang
- International Network for Environment and Health (INEH), School of Geography, Archaeology and Irish Studies, University of Galway, Galway, Ireland
| | - Adam Milewski
- Department of Geology, University of Georgia, Athens, USA
| |
Collapse
|
13
|
Mahlknecht J, Torres-Martínez JA, Kumar M, Mora A, Kaown D, Loge FJ. Nitrate prediction in groundwater of data scarce regions: The futuristic fresh-water management outlook. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 905:166863. [PMID: 37690767 DOI: 10.1016/j.scitotenv.2023.166863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 08/28/2023] [Accepted: 09/03/2023] [Indexed: 09/12/2023]
Abstract
Nitrate contamination in groundwater poses a significant threat to water quality and public health, especially in regions with limited data availability. This study addresses this challenge by employing machine learning (ML) techniques to predict nitrate (NO3--N) concentrations in Mexico's groundwater. Four ML algorithms-Extreme Gradient Boosting (XGB), Boosted Regression Trees (BRT), Random Forest (RF), and Support Vector Machines (SVM)-were executed to model NO3--N concentrations across the country. Despite data limitations, the ML models achieved robust predictive performances. XGB and BRT algorithms demonstrated superior accuracy (0.80 and 0.78, respectively). Notably, this was achieved using ∼10 times less information than previous large-scale assessments. The novelty lies in the first-ever implementation of the 'Support Points-based Split Approach' during data pre-processing. The models considered initially 68 covariates and identified 13-19 significant predictors of NO3--N concentration spanning from climate, geomorphology, soil, hydrogeology, and human factors. Rainfall, elevation, and slope emerged as key predictors. A validation incorporated nationwide waste disposal sites, yielding an encouraging correlation. Spatial risk mapping unveiled significant pollution hotspots across Mexico. Regions with elevated NO3--N concentrations (>10 mg/L) were identified, particularly in the north-central and northeast parts of the country, associated with agricultural and industrial activities. Approximately 21 million people, accounting for 10 % of Mexico's population, are potentially exposed to elevated NO3--N levels in groundwater. Moreover, the NO3--N hotspots align with reported NO3--N health implications such as gastric and colorectal cancer. This study not only demonstrates the potential of ML in data-scarce regions but also offers actionable insights for policy and management strategies. Our research underscores the urgency of implementing sustainable agricultural practices and comprehensive domestic waste management measures to mitigate NO3--N contamination. Moreover, it advocates for the establishment of effective policies based on real-time monitoring and collaboration among stakeholders.
Collapse
Affiliation(s)
- Jürgen Mahlknecht
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico
| | - Juan Antonio Torres-Martínez
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico.
| | - Manish Kumar
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico; Sustainability Cluster, School of Advanced Engineering, UPES, Dehradun, Uttarakhand 248007, India
| | - Abrahan Mora
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Puebla, Atlixcáyotl 5718, Puebla de Zaragoza, Puebla 72453, Mexico
| | - Dugin Kaown
- School of Earth and Environmental Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, South Korea
| | - Frank J Loge
- Escuela de Ingeniería y Ciencias, Tecnologico de Monterrey, Campus Monterrey, Eugenio Garza Sada 2501, Monterrey, NL 64849, Mexico; Department of Civil and Environmental Engineering, University of California Davis, One Shields Avenue, Davis, CA 95616, USA
| |
Collapse
|
14
|
Dieser M, Zieseniß S, Mielenz H, Müller K, Greef JM, Stever-Schoo B. Nitrate leaching potential from arable land in Germany: Identifying most relevant factors. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118664. [PMID: 37499418 DOI: 10.1016/j.jenvman.2023.118664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 07/13/2023] [Accepted: 07/19/2023] [Indexed: 07/29/2023]
Abstract
Diffuse nitrogen losses from agriculture in Germany continue to cause regionally increased nitrate concentrations in groundwater. Groundwater quality monitoring cannot be a timely indicator of the effects of mitigation measures being applied in agriculture, due to frequently long transport routes and high residence times of the leachate. Instead, nitrate leaching potential is often determined at field and farm scale by monitoring soil mineral nitrogen contents at 0-90 cm depth in autumn (SMNa), i.e. before the start of the annual leachate period. In this study, we developed an understanding of the controls on the soil mineral nitrogen content at the start of winter. In an on-farm approach, extensive data was collected from 48 farms in five nitrate-sensitive regions in Germany from 2017 to 2020. From this data set, 25 management and site factors were evaluated with regard to their significance for SMNa by means of a random forest model. With the random forest regression, we identified the role of the factors on SMNa with an acceptable model accuracy with R2 = 0.56. The results show that the cultivated crop is the most important factor influencing SMNa. Potatoes, oilseed rape and maize produced the highest SMNas, whereas SMNas were lowest after spring barley, sugar beet and winter barley. Among site factors, soil type and texture as well as precipitation in October were most decisive. The effects of N fertilisation parameters such as rate and timing were masked by these site factors. The results show that the reduction of nitrogen-intensive crops in crop sequences can be a promising measure for the reduction of nitrate loads. On the other hand, our analysis makes clear that soil-related factors controlling nitrogen release and risk of leaching, as well as weather, can significantly mask the effect of cultivation.
Collapse
Affiliation(s)
- Mona Dieser
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany.
| | - Steffen Zieseniß
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany
| | - Henrike Mielenz
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany
| | - Karolin Müller
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany
| | - Jörg-Michael Greef
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany
| | - Burkhard Stever-Schoo
- Julius Kühn Institute (JKI), - Federal Research Centre for Cultivated Plants, Institute for Crop and Soil Science, Braunschweig, Germany
| |
Collapse
|
15
|
Elsayed A, Rixon S, Levison J, Binns A, Goel P. Application of classification machine learning algorithms for characterizing nutrient transport in a clay plain agricultural watershed. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118924. [PMID: 37678017 DOI: 10.1016/j.jenvman.2023.118924] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 08/28/2023] [Accepted: 08/30/2023] [Indexed: 09/09/2023]
Abstract
Excess nutrients in surface water and groundwater can lead to water quality deterioration in available water resources. Thus, the classification of nutrient concentrations in water resources has gained significant attention during recent decades. Machine learning (ML) algorithms are considered an efficient tool to describe nutrient loss from agricultural land to surface water and groundwater. Previous studies have applied regression and classification ML algorithms to predict nutrient concentrations in surface water and/or groundwater, or to categorize an output variable using a limited number of input variables. However, there have been no studies that examined the application of different ML classification algorithms in agricultural settings to classify various output variables using a wide range of input variables. In this study, twenty-four ML classification algorithms were implemented on a dataset from three locations within the Upper Parkhill watershed, an agricultural watershed in southern Ontario, Canada. Nutrient concentrations in surface water were classified using geochemical and physical water parameters of surface water and groundwater (e.g., pH), climate and field conditions as the input variables. The performance of these algorithms was evaluated using four evaluation metrics (e.g., classification accuracy) to identify the optimal algorithm for classifying the output variables. Ensemble bagged trees was found to be the optimal ML algorithm for classifying nitrate concentration in surface water (accuracy of 90.9%), while the weighted KNN was the most appropriate algorithm for categorizing the total phosphorus concentration (accuracy of 87%). The ensemble subspace discriminant algorithm gave the highest overall classification accuracy for the concentration of soluble reactive phosphorus and total dissolved phosphorus in surface water with an accuracy of 79.2% and 77.9%, respectively. This study exemplifies that ML algorithms can be used to signify exceedance of recommended concentrations of nutrients in surface waters in agricultural watersheds. Results are useful for decision makers to develop nutrient management strategies.
Collapse
Affiliation(s)
- Ahmed Elsayed
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada; Irrigation and Hydraulics Department, Faculty of Engineering, Cairo University, 1 Gamaa Street, Giza, 12613, Egypt.
| | - Sarah Rixon
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Jana Levison
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Andrew Binns
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Pradeep Goel
- Ministry of the Environment, Conservation and Parks (MECP), 125 Resources Road, Etobicoke, Ontario, M9P 3V6, Canada
| |
Collapse
|
16
|
Mahboobi H, Shakiba A, Mirbagheri B. Improving groundwater nitrate concentration prediction using local ensemble of machine learning models. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118782. [PMID: 37597371 DOI: 10.1016/j.jenvman.2023.118782] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Revised: 07/16/2023] [Accepted: 08/09/2023] [Indexed: 08/21/2023]
Abstract
Groundwater is one of the most important water resources around the world, which is increasingly exposed to contamination. As nitrate is a common pollutant of groundwater and has negative effects on human health, predicting its concentration is of particular importance. Ensemble machine learning (ML) algorithms have been widely employed for nitrate concentration prediction in groundwater. However, existing ensemble models often overlook spatial variation by combining ML models with conventional methods like averaging. The objective of this study is to enhance the spatial accuracy of groundwater nitrate concentration prediction by integrating the outputs of ML models using a local approach that accounts for spatial variation. Initially, three widely used ML models including random forest regression (RFR), k-nearest neighbor (KNN), and support vector regression (SVR) were employed to predict groundwater nitrate concentration of Qom aquifer in Iran. Subsequently, the output of these models were integrated using geographically weighted regression (GWR) as a local model. The findings demonstrated that the ensemble of ML models using GWR resulted in the highest performance (R2 = 0.75 and RMSE = 9.38 mg/l) compared to an ensemble model using averaging (R2 = 0.68 and RMSE = 10.56 mg/l), as well as individual models such as RFR (R2 = 0.70 and RMSE = 10.16 mg/l), SVR (R2 = 0.59 and RMSE = 11.95 mg/l), and KNN (R2 = 0.57 and RMSE = 12.19 mg/l). The resulting prediction map revealed that groundwater nitrate contamination is predominantly concentrated in urban areas located in the northwestern regions of the study area. The insights gained from this study have practical implications for managers, assisting them in preventing nitrate pollution in groundwater and formulating strategies to improve water quality.
Collapse
Affiliation(s)
- Hojjatollah Mahboobi
- Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Shahid Beheshti University, Tehran, Iran
| | - Alireza Shakiba
- Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Shahid Beheshti University, Tehran, Iran
| | - Babak Mirbagheri
- Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Shahid Beheshti University, Tehran, Iran.
| |
Collapse
|
17
|
Ahn SH, Jeong DH, Kim M, Lee TK, Kim HK. Prediction of groundwater quality index to assess suitability for drinking purpose using averaged neural network and geospatial analysis. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 265:115485. [PMID: 37729698 DOI: 10.1016/j.ecoenv.2023.115485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 08/29/2023] [Accepted: 09/13/2023] [Indexed: 09/22/2023]
Abstract
Groundwater quality management is pivotal for ensuring public health and ecological resilience. However, the conventional water quality indices often face challenges related to parameter selection, geographic coverage, and scalability. The integration of machine learning and spatial analysis represents a promising methodological shift, allowing for high accuracy and adaptive management strategies. The Safe Groundwater Project in Unsupplied Areas (2017-2020) employed a comprehensive Groundwater Quality Index (GQI) to evaluate potable groundwater quality across South Korea, utilizing a large dataset comprising 28 water quality parameters and 3552 wells. This study revealed that over 50 % of the evaluated wells (Total 8326 wells) were inappropriate as sources of drinking water, indicating a pressing need for policy revision. The averaged neural network model achieved a high predictive accuracy of approximately 95 % for GQI grades, outperforming other classification models. The introduction of 2D spatial analysis in conjunction with machine learning algorithms notably increased the predictive accuracy for unevenly distributed groundwater samples. Moreover, this combined approach enabled the intuitive visualization of groundwater vulnerability across various regions, which can inform targeted interventions for effective resource allocation and management. This research represents a methodologically robust, interdisciplinary approach that holds significant implications for a framework for future groundwater quality management and vulnerability assessment.
Collapse
Affiliation(s)
- Seok Hyun Ahn
- Department of Environmental Engineering, Yonsei University, Wonju 26493, South Korea
| | - Do Hwan Jeong
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea
| | - MoonSu Kim
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea
| | - Tae Kwon Lee
- Department of Environmental Engineering, Yonsei University, Wonju 26493, South Korea.
| | - Hyun-Koo Kim
- Soil and Groundwater Division, National Institute of Environmental Research, Incheon 22689, South Korea.
| |
Collapse
|
18
|
Haggerty R, Sun J, Yu H, Li Y. Application of machine learning in groundwater quality modeling - A comprehensive review. WATER RESEARCH 2023; 233:119745. [PMID: 36812816 DOI: 10.1016/j.watres.2023.119745] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 11/30/2022] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
Groundwater is a crucial resource across agricultural, civil, and industrial sectors. The prediction of groundwater pollution due to various chemical components is vital for planning, policymaking, and management of groundwater resources. In the last two decades, the application of machine learning (ML) techniques for groundwater quality (GWQ) modeling has grown exponentially. This review assesses all supervised, semi-supervised, unsupervised, and ensemble ML models implemented to predict any groundwater quality parameter, making this the most extensive modern review on this topic. Neural networks are the most used ML model in GWQ modeling. Their usage has declined in recent years, giving rise to more accurate or advanced techniques such as deep learning or unsupervised algorithms. Iran and the United States lead the world in areas modeled, with a wealth of historical data available. Nitrate has been modeled most exhaustively, targeted by nearly half of all studies. Advancements in future work will be made with further implementation of deep learning and explainable artificial intelligence or other cutting-edge techniques, application of these techniques for sparsely studied variables, the modeling of new or unique study areas, and the implementation of ML techniques for groundwater quality management.
Collapse
Affiliation(s)
- Ryan Haggerty
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Jianxin Sun
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Hongfeng Yu
- School of Computing, University of Nebraska-Lincoln, Lincoln, NE 68588, United States; Holland Computing Center, University of Nebraska-Lincoln, Lincoln, NE 68588, United States
| | - Yusong Li
- Department of Civil and Environmental Engineering, University of Nebraska-Lincoln, Lincoln, NE 68588, United States.
| |
Collapse
|
19
|
Yang H, Wang P, Chen A, Ye Y, Chen Q, Cui R, Zhang D. Prediction of phosphorus concentrations in shallow groundwater in intensive agricultural regions based on machine learning. CHEMOSPHERE 2023; 313:137623. [PMID: 36565764 DOI: 10.1016/j.chemosphere.2022.137623] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/08/2022] [Accepted: 12/20/2022] [Indexed: 06/17/2023]
Abstract
Excessive accumulation of phosphorus in soil profiles has become the main source of phosphorus in groundwater due to the application of phosphorus fertilizers in intensive agricultural regions (IARs). Elevated phosphorus concentrations in groundwater have become a global phenomenon, which places enormous pressure on the safe use of water resources and the safety of the aquatic environment. Currently, the prediction of pollutant concentrations in groundwater mainly focuses on nitrate nitrogen, while research on phosphorus prediction is limited. Taking the IARs approximately 8 plateau lakes in the Yunnan-Guizhou Plateau as an example, 570 shallow groundwater samples and 28 predictor variables were collected and measured, and a machine learning approach was used to predict phosphorus concentrations in groundwater. The performance of three machine learning algorithms and different sets of variables for predicting phosphorus concentrations in shallow groundwater was evaluated. The results showed that after all variables were introduced into the model, the R2, RMSE and MAE of support vector machine (SVM), random forest (RF) and neural network (NN) were 0.52-0.60, 0.101-0.108 and 0.074-0.081, respectively. Among them, the SVM model had the best prediction effect. The clay content and water-soluble phosphorus in soil and soluble organic carbon in groundwater had a high contribution to the prediction accuracy of the model. The prediction accuracy of the model with reduced number of variables showed that when the number of variables was equal to 6, the RF model had R2, RMSE and MAE values of 0.53, 0.108 and 0.074, respectively, and the number of variables increased again; there were small changes in R2, RMSE and MAE. Compared with the SVM and NN models, the RF model can achieve higher accuracy by inputting fewer variables.
Collapse
Affiliation(s)
- Heng Yang
- College of Resource and Environment, Yunnan Agricultural University, Kunming, 650201, China
| | - Panlei Wang
- Agricultural Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, 650201, China
| | - Anqiang Chen
- Agricultural Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming, 650201, China.
| | - Yuanhang Ye
- College of Resource and Environment, Yunnan Agricultural University, Kunming, 650201, China
| | - Qingfei Chen
- College of Resource and Environment, Yunnan Agricultural University, Kunming, 650201, China
| | - Rongyang Cui
- Key Laboratory of Mountain Surface Processes and Ecological Regulation, Chinese Academy of Sciences, Institute of Mountain Hazards and Environment, Chinese Academy of Sciences and Ministry of Water Conservancy, Chengdu, 610041, China; University of Chinese Academy of Science, Beijing, 100049, China
| | - Dan Zhang
- College of Resource and Environment, Yunnan Agricultural University, Kunming, 650201, China.
| |
Collapse
|
20
|
Agbasi JC, Egbueri JC. Intelligent soft computational models integrated for the prediction of potentially toxic elements and groundwater quality indicators: a case study. JOURNAL OF SEDIMENTARY ENVIRONMENTS 2023; 8:57-79. [PMCID: PMC9849108 DOI: 10.1007/s43217-023-00124-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 12/25/2022] [Accepted: 01/04/2023] [Indexed: 10/21/2023]
Abstract
Reports have shown that potentially toxic elements (PTEs) in air, water, and soil systems expose humans to carcinogenic and non-carcinogenic health risks. In southeastern Nigeria, works that have used data-driven algorithms in predicting PTEs in groundwater are scarce. In addition, only a few works have simulated water quality indices using machine learning modelling methods in the region. Therefore, in this study, physicochemical analyses were carried out on groundwater samples in southeastern Nigeria. The laboratory results were used to compute two water quality indices: pollution index of groundwater (PIG) and the water pollution index (WPI), to ascertain groundwater quality. In addition, the physicochemical parameters served as input variables for multiple linear regression (MLR) and artificial neural network (ANN) modelling and prediction of Cr, Fe, Ni, NO3−, Pb, Zn, WPI, and PIG. The results of WPI and PIG computation showed that about 30–35% of the groundwater samples were unsuitable for human consumption, whereas 65–70% of the samples were deemed suitable. The insights from the PIG and WPI model also revealed that lead (Pb) was the most influential PTE that degraded the quality of groundwater resources in the research area. The findings of the MLR and ANN models indicated strong positive prediction accuracies (R 2 = 0.856–1.000) with low modeling errors. The predictive MLR and ANN models of the PIG and WPI generally outperformed those of the PTEs. The models produced in this study predicted the PTEs better compared to previous studies. Thus, this work provides insights into effective water sustainability, management, and protection.
Collapse
Affiliation(s)
- Johnson C. Agbasi
- Department of Geology, Chukwuemeka Odumegwu Ojukwu University, Uli, Nigeria
| | | |
Collapse
|
21
|
Sarkar S, Mukherjee A, Senapati B, Duttagupta S. Predicting Potential Climate Change Impacts on Groundwater Nitrate Pollution and Risk in an Intensely Cultivated Area of South Asia. ACS ENVIRONMENTAL AU 2022; 2:556-576. [PMID: 37101727 PMCID: PMC10125289 DOI: 10.1021/acsenvironau.2c00042] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 08/22/2022] [Accepted: 08/22/2022] [Indexed: 11/28/2022]
Abstract
One of the potential impacts of climate change is enhanced groundwater contamination by geogenic and anthropogenic contaminants. Such impacts should be most evident in areas with high land-use change footprint. Here, we provide a novel documentation of the impact on groundwater nitrate (GWNO3 ) pollution with and without climate change in one of the most intensely groundwater-irrigated areas of South Asia (northwest India) as a consequence of changes in land use and agricultural practices at present and predicted future times. We assessed the probabilistic risk of GWNO3 pollution considering climate changes under two representative concentration pathways (RCPs), i.e., RCP 4.5 and 8.5 for 2030 and 2040, using a machine learning (Random Forest) framework. We also evaluated variations in GWNO3 distribution against a no climate change (NCC) scenario considering 2020 status quo climate conditions. The climate change projections showed that the annual temperatures would rise under both RCPs. The precipitation is predicted to rise by 5% under RCP 8.5 by 2040, while it would decline under RCP 4.5. The predicted scenarios indicate that the areas at high risk of GWNO3 pollution will increase to 49 and 50% in 2030 and 66 and 65% in 2040 under RCP 4.5 and 8.5, respectively. These predictions are higher compared to the NCC condition (43% in 2030 and 60% in 2040). However, the areas at high risk can decrease significantly by 2040 with restricted fertilizer usage, especially under the RCP 8.5 scenario. The risk maps identified the central, south, and southeastern parts of the study area to be at persistent high risk of GWNO3 pollution. The outcomes show that the climate factors may impose a significant influence on the GWNO3 pollution, and if fertilizer inputs and land uses are not managed properly, future climate change scenarios can critically impact the groundwater quality in highly agrarian areas.
Collapse
Affiliation(s)
- Soumyajit Sarkar
- School
of Environmental Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Abhijit Mukherjee
- School
of Environmental Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
- Department
of Geology and Geophysics, Indian Institute
of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Balaji Senapati
- Centre
For Oceans, Rivers, Atmosphere and Land Science (CORAL), Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India
| | - Srimanti Duttagupta
- Graduate
School of Public Health, San Diego State
University, San Diego, California 92182, United States
| |
Collapse
|
22
|
Liu D, Song C, Xin Z, Fang C, Liu Z. Spatial patterns and driving factor analysis of recommended nitrogen application rate for the trade-off between economy and environment for maize in China. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2022; 322:116099. [PMID: 36058069 DOI: 10.1016/j.jenvman.2022.116099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Revised: 08/22/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
Appropriate nitrogen (N) application increases crop yield, while its unreasonable application results in environmental problem. Determining the appropriate N application rate is the key to sustainable development. Here, the denitrification-decomposition (DNDC) model was used to analyze the effects of N fertilizer on maize yields, economic benefits, nitrate leaching, and nitrous oxide emissions in China. The N application rate for the trade-off between economy and environment at the county scale was further determined. The geodetector model was used to identify the main driving factors and their interactions of the recommended N rate in each agricultural zone. The results showed that the recommended N rate was generally high in the northwest but low in the south, consistent with the spatial patterns of yield potential. However, clay soils with clay ratios greater than 34% in southern China and sandy soils with bulk densities greater than 1.5 g cm-3 on the Huanghuaihai Plain experienced high N levels and low yields, and thus soils need to be improved. Potential grain yield was the main driving factor in most zones, yet its effects gradually weakened from north to south. The influence of soil characteristics increased from north to south. It was found that the current average N application rate of farmers in China was 249 kg N/ha, and 86.55% of counties had excessive N applications. Compared to the regional optimal N rate at a regional scale, a differentiated N application strategy at the county scale determined in this study increased maize yield and economic benefit by 10.51% and 10.85%, respectively, and reduced N2O emissions and NO3- leaching by 28.72% and 33.60%, respectively. The current research provides a scientific basis for China to formulate a win-win N management strategy for economy and environment and provides a method reference for other countries.
Collapse
Affiliation(s)
- Dantong Liu
- Faculty of Infrastructure Engineering, Dalian University of Technology, Dalian, 116024, China
| | - Changchun Song
- Faculty of Infrastructure Engineering, Dalian University of Technology, Dalian, 116024, China.
| | - Zhuohang Xin
- Faculty of Infrastructure Engineering, Dalian University of Technology, Dalian, 116024, China
| | - Chong Fang
- Faculty of Infrastructure Engineering, Dalian University of Technology, Dalian, 116024, China
| | - Zhihong Liu
- Faculty of Infrastructure Engineering, Dalian University of Technology, Dalian, 116024, China
| |
Collapse
|
23
|
Nölscher M, Mutz M, Broda S. Multiorder hydrologic Position for Europe - a Set of Features for Machine Learning and Analysis in Hydrology. Sci Data 2022; 9:662. [PMID: 36309509 PMCID: PMC9617849 DOI: 10.1038/s41597-022-01787-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Accepted: 10/14/2022] [Indexed: 11/16/2022] Open
Abstract
The presented dataset EU-MOHP v013.1.1 provides multiscale information on the hydrologic position (MOHP) of a geographic point within its respective river network and catchment as gridded maps. More precisely, it comprises the three measures “divide to stream distance” (DSD) as sum of the distances to the nearest stream and catchment divide, “lateral position” (LP) as a relative measure of the position between the nearest stream and divide and “stream distance” (SD) as the distance to the nearest stream. These three measures are calculated for nine hydrologic orders to reflect different spatial scales from local to continental. Its spatial extent covers major parts of the European Economic Area (EEA39) which also largely coincides with physiographical Europe. Although there are multiple potential use cases, this dataset serves predominantly as valuable static environmental descriptor or predictor variable for hydrogeological and hydrological modelling such as mapping or forecasting tasks using machine learning. The generation of this dataset uses free open source software only and therefore can be transferred to other regions or input datasets. Measurement(s) | divide to stream distance • lateral position • stream distance | Technology Type(s) | remote sensing | Sample Characteristic - Environment | drainage basin • groundwatershed • catchment | Sample Characteristic - Location | Europe |
Collapse
Affiliation(s)
- Maximilian Nölscher
- Federal Institute for Geosciences and Natural Resources (BGR), Berlin, 13593, Germany.
| | | | - Stefan Broda
- Federal Institute for Geosciences and Natural Resources (BGR), Berlin, 13593, Germany
| |
Collapse
|
24
|
Definition of hot-spots to reduce the nitrogen losses from agricultural land to groundwater in Slovakia. EKOLÓGIA (BRATISLAVA) 2022. [DOI: 10.2478/eko-2022-0030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Abstract
Agriculture is a significant contributor to nitrate pollution of groundwater which in many cases serves as a source of drinking water. Therefore, targeted reduction of nitrogen leaching losses is fully justified to address this issue. The aim of the study was to define the areas of utilized agricultural land (UAL) in Slovakia, where a nitrogen surplus needs to be reduced. Using the average values of leachable nitrogen in the period 2015-2018 and the long-term amount of percolated water, the nitrate concentration in leachate was calculated. To ensure that agricultural activities will contribute to the gradual reduction of nitrate concentration in groundwater, the nitrate concentration in leachate of 40 mg L-1 was chosen as the target value. This concentration was exceeded at 11.7% of the UAL area. The average indicative amount of nitrogen in industrial fertilizers that needs to be reduced to achieve a stricter nitrate concentration in the leachate in these hot-spots is 16 kg ha-1 with the proviso that in two districts this value exceeds 30 kg ha-1.
Collapse
|
25
|
Application and Research of Computer Intelligent Technology in Modern Agricultural Machinery Equipment. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:9978167. [PMID: 35983150 PMCID: PMC9381267 DOI: 10.1155/2022/9978167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 06/20/2022] [Accepted: 06/30/2022] [Indexed: 11/18/2022]
Abstract
Every country, including China, is deeply concerned and interested in the topic of agricultural machinery automation. The world's population is growing at an astronomical rate, and as a result, the need of food is also growing at an astronomical rate. Farmers are forced to apply more toxic pesticides since traditional methods are not up to the task of meeting the rising demand. This has a major impact on agricultural practices, and in the long run, the land becomes barren and unproductive. Intelligent technologies such as Internet of Things, wireless communication, and machine learning can help with crop disease and pesticide storage management, as well as water management and irrigation. In this paper, we design and analyze an intelligent system that automatically predicts the agricultural land features for irrigation purpose. Initially, the dataset is collected and preprocessed using normalization. The features are extracted using principal component analysis (PCA). For automatic prediction by the equipment, we propose heterogeneous fuzzy-based artificial neural network (HF-ANN) with genetic quantum spider monkey optimization (GQ-SMO) algorithm. Analyses and comparisons are made between the proposed approach and current methodologies. The findings indicate the effectiveness of the proposed system.
Collapse
|
26
|
Fallatah O, Ahmed M, Gyawali B, Alhawsawi A. Factors controlling groundwater radioactivity in arid environments: An automated machine learning approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 830:154707. [PMID: 35331768 DOI: 10.1016/j.scitotenv.2022.154707] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 03/02/2022] [Accepted: 03/16/2022] [Indexed: 06/14/2023]
Abstract
Groundwater resources in the Kingdom of Saudi Arabia (KSA) have high levels of natural radioactivity. Within the northwestern KSA, gross alpha (α) and gross beta (β) levels exceed national and international drinking-water limits. In this study, we developed and used an automated machine learning (AML) approach to quantify relationships between gross α and gross β activities and different geological, hydrogeological, and geochemical conditions. Two AML model groups (group I for gross α; group II for gross β) were constructed, using water samples collected from 360 irrigation and water supply wells, to define a robust model that explains the spatial variability in gross α and gross β activities, as well as variables that control the gross activities. Each group contained four model families: deep neural network (DNN), gradient boosting machine (GBM), generalized linear model (GLM), and distributed random forest (DRF). Model inputs include chemical compositions as well as geological and hydrogeological conditions. Three performance metrics were used to evaluate the models during training and testing: normalized root mean square error (NRMSE), Pearson's correlation coefficient (r), and Nash-Sutcliff efficiency (NSE) coefficient. Results indicate that (1) the GBM model outperformed (training: NRMSE: 0.37 ± 0.10; r: 0.92 ± 0.05; NSE: 0.85 ± 0.09; testing: NRMSE: 0.71 ± 0.08; r: 0.72 ± 0.08; NSE: 0.49 ± 0.12) the DNN, DRF, and GLM models when modelling gross α activities; (2) gross α activities are controlled by pH, stream density, nitrate, manganese, and vegetation index; (3) the DRF model outperformed (training: NRMSE: 0.41 ± 0.05; r: 0.92 ± 0.02; NSE: 0.83 ± 0.04; testing: NRMSE: 0.67 ± 0.09; r: 0.77 ± 0.07; NSE: 0.54 ± 0.12) the GBM, DNN, and GLM models when modelling gross β activities; (4) input variables that affect the gross β actives are pH, temperature, stream density, lithology, and nitrate; and (5) no single model could be used to model both gross α and gross β activities-instead, a combination of AML models should be used. Our computationally efficient approach provides a framework and insights for using AML techniques in water quality investigations and promotes more and improved use of different geological, hydrogeological, and geochemical datasets by the scientific community and decision makers to develop guidelines for mitigation.
Collapse
Affiliation(s)
- Othman Fallatah
- Department of Nuclear Engineering, Faculty of Engineering, King Abdulaziz University, P.O. Box 80204, Jeddah 21589, Saudi Arabia
| | - Mohamed Ahmed
- Department of Physical and Environmental Sciences, Texas A&M University-Corpus Christi, 6300 Ocean Drive, Corpus Christi, TX 78412, USA.
| | - Bimal Gyawali
- Department of Physical and Environmental Sciences, Texas A&M University-Corpus Christi, 6300 Ocean Drive, Corpus Christi, TX 78412, USA
| | - Abdulsalam Alhawsawi
- Department of Nuclear Engineering, Faculty of Engineering, King Abdulaziz University, P.O. Box 80204, Jeddah 21589, Saudi Arabia
| |
Collapse
|
27
|
Hybrid Machine Learning Models for Soil Saturated Conductivity Prediction. WATER 2022. [DOI: 10.3390/w14111729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The hydraulic conductivity of saturated soil is a crucial parameter in the study of any engineering problem concerning groundwater. Hydraulic conductivity mainly depends on particle size distribution, soil compaction, and properties that influence aggregation and water retention. Generally, finding simple and accurate analytical equations between the hydraulic conductivity of soil and the characteristics on which it depends is a very hard task. Machine learning algorithms can provide excellent tools for tackling highly nonlinear regression problems. Additionally, hybrid models resulting from the combination of multiple machine learning algorithms can further improve the accuracy of predictions. Five different models were built to predict saturated hydraulic conductivity using a dataset extracted from the Soil Water Infiltration Global database. The models were based on different predictors. Seven variants of each model were compared, replacing the implemented algorithm. Three variants were based on individual models, while four variants were based on hybrid models. The employed individual machine learning algorithms were Multilayer Perceptron, Random Forest, and Support Vector Regression. The model based on the largest number of predictors led to the most accurate predictions. In addition, across all models, hybrid variants based on all three algorithms and hybridized variants of Random Forest and Support Vector Regression proved to be the most accurate (R2 values up to 0.829). However, all variants showed a tendency to overestimate conductivity in soils where it is very low.
Collapse
|
28
|
Imputation of Ammonium Nitrogen Concentration in Groundwater Based on a Machine Learning Method. WATER 2022. [DOI: 10.3390/w14101595] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Ammonium is one of the main inorganic pollutants in groundwater, mainly due to agricultural, industrial and domestic pollution. Excessive ammonium can cause human health risks and environmental consequences. Its temporal and spatial distribution is affected by factors such as meteorology, hydrology, hydrogeology and land use type. Thus, a groundwater ammonium analysis based on limited sampling points produces large uncertainties. In this study, organic matter content, groundwater depth, clay thickness, total nitrogen content (TN), cation exchange capacity (CEC), pH and land-use type were selected as potential contributing factors to establish a machine learning model for fitting the ammonium concentration. The Shapley Additive exPlanations (SHAP) method, which explains the machine learning model, was applied to identify the more significant influencing factors. Finally, the machine learning model established according to the more significant influencing factors was used to impute point data in the study area. From the results, the soil organic matter feature was found to have a substantial impact on the concentration of ammonium in the model, followed by soil pH, clay thickness and groundwater depth. The ammonium concentration generally decreased from northwest to southeast. The highest values were concentrated in the northwest and northeast. The lowest values were concentrated in the southeast, southwest and parts of the east and north. The spatial interpolation based on the machine learning imputation model established according to the influencing factors provides a reliable groundwater quality assessment and was not limited by the number and the geographical location of samplings.
Collapse
|
29
|
Machine Learning Algorithms for Modeling and Mapping of Groundwater Pollution Risk: A Study to Reach Water Security and Sustainable Development (Sdg) Goals in a Mediterranean Aquifer System. REMOTE SENSING 2022. [DOI: 10.3390/rs14102379] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Groundwater pollution poses a severe threat and issue to the environment and humanity overall. That is why mitigative strategies are urgently needed. Today, studies mapping groundwater risk pollution assessment are being developed. In this study, five new hybrid/ensemble machine learning (ML) models are developed, named DRASTIC-Random Forest (RF), DRASTIC-Support Vector Machine (SVM), DRASTIC-Multilayer Perceptron (MLP), DRASTIC-RF-SVM, and DRASTIC-RF-MLP, for groundwater pollution assessment in the Saiss basin, in Morocco. The performances of these models are evaluated using the Receiver Operating Characteristic curve (ROC curve), precision, and accuracy. Based on the results of the ROC curve method, it is indicated that the use of hybrid/ensemble machine learning (ML) models improves the performance of the individual machine learning (ML) algorithms. In effect, the AUC value of the original DRASTIC is 0.51. Furthermore, both hybrid/ensemble models, DRASTIC-RF-MLP (AUC = 0.953) and DRASTIC-RF-SVM, (AUC = 0.901) achieve the best accuracy among the other models, followed by DRASTIC-RF (AUC = 0.852), DRASTIC-SVM (AUC = 0.802), and DRASTIC-MLP (AUC = 0.763). The results delineate areas vulnerable to pollution, which require urgent actions and strategies to improve the environmental and social qualities for the local population.
Collapse
|
30
|
Abstract
The scope of the present study is the estimation of the concentration of nitrates (NO3−) in groundwater using artificial neural networks (ANNs) based on easily measurable in situ data. For the purpose of the current study, two feedforward neural networks were developed to determine whether including land use variables would improve the model results. In the first network, easily measurable field data were used, i.e., pH, electrical conductivity, water temperature, air temperature, and aquifer level. This model achieved a fairly good simulation based on the root mean squared error (RMSE in mg/L) and the Nash–Sutcliffe Model Efficiency (NSE) indicators (RMSE = 26.18, NSE = 0.54). In the second model, the percentages of different land uses in a radius of 1000 m from each well was included in an attempt to obtain a better description of nitrate transport in the aquifer system. When these variables were used, the performance of the model increased significantly (RMSE = 15.95, NSE = 0.70). For the development of the models, data from chemical and physical analyses of groundwater samples from wells located in the Kopaidian Plain and the wider area of the Asopos River Basin, both in Greece, were used. The simulation that the models achieved indicates that they are a potentially useful tools for the estimation of groundwater contamination by nitrates and may therefore constitute a basis for the development of groundwater management plans.
Collapse
|
31
|
He S, Li P, Su F, Wang D, Ren X. Identification and apportionment of shallow groundwater nitrate pollution in Weining Plain, northwest China, using hydrochemical indices, nitrate stable isotopes, and the new Bayesian stable isotope mixing model (MixSIAR). ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 298:118852. [PMID: 35033617 DOI: 10.1016/j.envpol.2022.118852] [Citation(s) in RCA: 51] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 05/12/2023]
Abstract
Groundwater nitrate (NO3-) pollution is a worldwide environmental problem. Therefore, identification and partitioning of its potential sources are of great importance for effective control of groundwater quality. The current study was carried out to identify the potential sources of groundwater NO3- pollution and determine their apportionment in different land use/land cover (LULC) types in a traditional agricultural area, Weining Plain, in Northwest China. Multiple hydrochemical indices, as well as dual NO3- isotopes (δ15N-NO3 and δ18O-NO3), were used to investigate the groundwater quality and its influencing factors. LULC patterns of the study area were first determined by interpreting remote sensing image data collected from the Sentinel-2 satellite, then the Bayesian stable isotope mixing model (MixSIAR) was used to estimate proportional contributions of the potential sources to groundwater NO3- concentrations. Groundwater quality in the study area was influenced by both natural and anthropogenic factors, with anthropological impact being more important. The results of LULC revealed that the irrigated land is the dominant LULC type in the plain, covering an area of 576.6 km2 (57.18% of the total surface study area of the plain). On the other hand, the results of the NO3- isotopes suggested that manure and sewage (M&S), as well as soil nitrogen (SN), were the major contributors to groundwater NO3-. Moreover, the results obtained from the MixSIAR model showed that the mean proportional contributions of M&S to groundwater NO3- were 55.5, 43.4, 21.4, and 78.7% in the forest, irrigated, paddy, and urban lands, respectively. While SN showed mean proportional contributions of 29.9, 43.4, 61.5, and 12.7% in the forest, irrigated, paddy, and urban lands, respectively. The current study provides valuable information for local authorities to support sustainable groundwater management in the study region.
Collapse
Affiliation(s)
- Song He
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| | - Peiyue Li
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China.
| | - Fengmei Su
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| | - Dan Wang
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| | - Xiaofei Ren
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| |
Collapse
|
32
|
He S, Wu J, Wang D, He X. Predictive modeling of groundwater nitrate pollution and evaluating its main impact factors using random forest. CHEMOSPHERE 2022; 290:133388. [PMID: 34952022 DOI: 10.1016/j.chemosphere.2021.133388] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 11/21/2021] [Accepted: 12/19/2021] [Indexed: 05/12/2023]
Abstract
Groundwater quality in plains and basins of arid and semi-arid regions with increased agriculture and urbanization development faces severe nitrate pollution, which is affected by both climate and anthropogenic activities. Here, shallow groundwater nitrate concentrations in the Yinchuan Region in central Yinchuan Plain were modeled during 2000, 2005, 2010, and 2015 using random forest. Multiple spatial environment factors were taken as predictor variables. The relative importance of these factors was also calculated using the constructed model. Remote sensing and GIS methods were used to compile various environmental factors to generate training and test sets for training and validation of the random forest model. Mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R2) between the observed and predicted groundwater nitrate concentrations were used to measure the model performance. As indicated by these metrics, the random forest model for groundwater nitrate prediction was performed well. The relative importance of the predictor variables computed by the model indicated groundwater nitrate was mainly affected by the distance to the Yellow River, meteorological elements (precipitation, evaporation, and mean air temperature), and water level elevation. Additionally, urban and arable land were the two land use/land cover types that mainly influenced groundwater nitrate concentration in the Yinchuan Region, of which urban land was more influential than arable land as a result of intense expansion of urban land from 2000 to 2015. Overall, the current study provides an approach to integrate multiple environmental factors for groundwater quality study and is also significant for sustainable groundwater management in the Yinchuan Region.
Collapse
Affiliation(s)
- Song He
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| | - Jianhua Wu
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China.
| | - Dan Wang
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| | - Xiaodong He
- School of Water and Environment, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China; Key Laboratory of Subsurface Hydrology and Ecological Effects in Arid Region of the Ministry of Education, Chang'an University, No. 126 Yanta Road, Xi'an, 710054, Shaanxi, China
| |
Collapse
|
33
|
Alkindi KM, Mukherjee K, Pandey M, Arora A, Janizadeh S, Pham QB, Anh DT, Ahmadi K. Prediction of groundwater nitrate concentration in a semiarid region using hybrid Bayesian artificial intelligence approaches. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:20421-20436. [PMID: 34735705 DOI: 10.1007/s11356-021-17224-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 10/21/2021] [Indexed: 06/13/2023]
Abstract
Nitrate is a major pollutant in groundwater whose main source is municipal wastewater and agricultural activities. In the present study, Bayesian approaches such as Bayesian generalized linear model (BGLM), Bayesian regularized neural network (BRNN), Bayesian additive regression tree (BART), and Bayesian ridge regression (BRR) were used to model groundwater nitrate contamination in a semiarid region Marvdasht watershed, Fars province, Iran. Eleven groundwater (GW) nitrate conditioning factors have been taken as input parameters for predictive modeling. The results showed that the Bayesian models used in this study were all competent to model groundwater nitrate and the BART model with R2 = 0.83 was more efficient than the other models. The result of variable importance showed that potassium (K) has the highest importance in the models followed by rainfall, altitude, groundwater depth, and distance from the residential area. The results of the study can support the decision-making process to control and reduce the sources of nitrate pollution.
Collapse
Affiliation(s)
- Khalifa M Alkindi
- UNESCO Chair on Aflaj Studies, Archaeohydrology, University of Nizwa, Nizwa, Oman
| | - Kaustuv Mukherjee
- Department of Geography, Chandidas Mahavidyalaya, Birbhum, WB, 731215, India
| | - Manish Pandey
- University Center for Research & Development (UCRD), Chandigarh University, Mohali, 140413, Punjab, India
- Department of Civil Engineering, University Institute of Engineering, Chandigarh University, Mohali, 140413, Punjab, India
| | - Aman Arora
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, 10025, Delhi, India
| | - Saeid Janizadeh
- Department of Watershed Management Engineering and Sciences, Faculty in Natural Resources and Marine Science, Tarbiat Modares University, 14115-111, Tehran, Iran
| | - Quoc Bao Pham
- Institute of Applied Technology, Thu Dau Mot University, Binh Duong Province, Vietnam
| | - Duong Tran Anh
- Ho Chi Minh City University of Technology (HUTECH) 475A, Dien Bien Phu, Ward 25, Binh Thanh District, Ho Chi Minh City, Vietnam.
| | - Kourosh Ahmadi
- Department of Forestry, Faculty in Natural Resources and Marine Science, Tarbiat Modares University, 14115-111, Tehran, Iran
| |
Collapse
|
34
|
Singh SK, Taylor RW, Pradhan B, Shirzadi A, Pham BT. Predicting sustainable arsenic mitigation using machine learning techniques. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2022; 232:113271. [PMID: 35121252 DOI: 10.1016/j.ecoenv.2022.113271] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 01/21/2022] [Accepted: 01/28/2022] [Indexed: 06/14/2023]
Abstract
This study evaluates state-of-the-art machine learning models in predicting the most sustainable arsenic mitigation preference. A Gaussian distribution-based Naïve Bayes (NB) classifier scored the highest Area Under the Curve (AUC) of the Receiver Operating Characteristic curve (0.82), followed by Nu Support Vector Classification (0.80), and K-Neighbors (0.79). Ensemble classifiers scored higher than 70% AUC, with Random Forest being the top performer (0.77), and Decision Tree model ranked fourth with an AUC of 0.77. The multilayer perceptron model also achieved high performance (AUC=0.75). Most linear classifiers underperformed, with the Ridge classifier at the top (AUC=0.73) and perceptron at the bottom (AUC=0.57). A Bernoulli distribution-based Naïve Bayes classifier was the poorest model (AUC=0.50). The Gaussian NB was also the most robust ML model with the slightest variation of Kappa score on training (0.58) and test data (0.64). The results suggest that nonlinear or ensemble classifiers could more accurately understand the complex relationships of socio-environmental data and help develop accurate and robust prediction models of sustainable arsenic mitigation. Furthermore, Gaussian NB is the best option when data is scarce.
Collapse
Affiliation(s)
- Sushant K Singh
- Department of Earth and Environmental Studies, Montclair State University, New Jersey, USA; The Center for Artificial Intelligence and Environmental Sustainability (CAIES) Foundation, Patna, Bihar, India.
| | - Robert W Taylor
- Department of Earth and Environmental Studies, Montclair State University, New Jersey, USA.
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), School of Civil and Environmental Engineering, University of Technology Sydney, NSW 2007, Australia; Department of Energy and Mineral Resources Engineering, Sejong University, Choongmu-gwan, 209 Neungdong-ro Gwangjin-gu, Seoul 05006, Republic of Korea; Center of Excellence for Climate Change Research, King Abdulaziz University, P. O. Box 80234, Jeddah 21589, Saudi Arabia; Earth Observation Centre, Institute of Climate Change, Universiti Kebangsaan Malaysia, 43600 UKM, Bangi, Selangor, Malaysia.
| | - Ataollah Shirzadi
- College of Natural Resources, Department of Rangeland and Watershed Management Sciences, University of Kurdistan, Sanandaj, Iran.
| | - Binh Thai Pham
- Department of Geotechnical Engineering, University of Transport Technology, 54 Trieu Khuc, Thanh Xuan, Ha Noi, Viet Nam.
| |
Collapse
|
35
|
Ransom KM, Nolan BT, Stackelberg PE, Belitz K, Fram MS. Machine learning predictions of nitrate in groundwater used for drinking supply in the conterminous United States. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 807:151065. [PMID: 34673076 DOI: 10.1016/j.scitotenv.2021.151065] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 10/14/2021] [Accepted: 10/14/2021] [Indexed: 06/13/2023]
Abstract
Groundwater is an important source of drinking water supplies in the conterminous United State (CONUS), and presence of high nitrate concentrations may limit usability of groundwater in some areas because of the potential negative health effects. Prediction of locations of high nitrate groundwater is needed to focus mitigation and relief efforts. A three-dimensional extreme gradient boosting (XGB) machine learning model was developed to predict the distribution of nitrate. Nitrate was predicted at a 1 km resolution for two drinking water zones, each of variable depth, one for domestic supply and one for public supply. The model used measured nitrate concentrations from 12,082 wells and included predictor variables representing well characteristics, hydrologic conditions, soil type, geology, land use, climate, and nitrogen inputs. Predictor variables derived from empirical or numerical process-based models were also included to integrate information on controlling processes and conditions. The model provided accurate estimates at national and regional scales: the training (R2 of 0.83) and hold-out (R2 of 0.49) data fits compared favorably to previous studies. Predicted nitrate concentrations were less than 1 mg/L across most of the CONUS. Nationally, well depth, soil and climate characteristics, and the absence of developed land use were among the most influential explanatory factors. Only 1% of the area in either water supply zone had predicted nitrate concentrations greater than 10 mg/L; however, about 1.4 M people depend on groundwater for their drinking supplies in those areas. Predicted high concentrations of nitrate were most prevalent in the central CONUS. In areas of predicted high nitrate concentration, applied manure, farm fertilizer, and agricultural land use were influential predictor variables. This work represents the first application of XGB to a three-dimensional national-scale groundwater quality model and provides a significant milestone in the efforts to document nitrate in groundwater across the CONUS.
Collapse
Affiliation(s)
- K M Ransom
- U.S. Geological Survey, California Water Science Center Sacramento, Sacramento, CA, United States.
| | - B T Nolan
- U.S. Geological Survey Headquarters, Reston, VA, United States
| | - P E Stackelberg
- U.S. Geological Survey, Water Mission Area, Troy, NY, United States
| | - K Belitz
- U.S. Geological Survey, Water Mission Area, Carlisle, MA, United States
| | - M S Fram
- U.S. Geological Survey, California Water Science Center Sacramento, Sacramento, CA, United States
| |
Collapse
|
36
|
Rosecrans CZ, Belitz K, Ransom KM, Stackelberg PE, McMahon PB. Predicting regional fluoride concentrations at public and domestic supply depths in basin-fill aquifers of the western United States using a random forest model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 806:150960. [PMID: 34656592 DOI: 10.1016/j.scitotenv.2021.150960] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 10/08/2021] [Accepted: 10/09/2021] [Indexed: 06/13/2023]
Abstract
A random forest regression (RFR) model was applied to over 12,000 wells with measured fluoride (F) concentrations in untreated groundwater to predict F concentrations at depths used for domestic and public supply in basin-fill aquifers of the western United States. The model relied on twenty-two regional-scale environmental and surficial predictor variables selected to represent factors known to control F concentrations in groundwater. The testing model fit R2 and RMSE were 0.52 and 0.78 mg/L. Comparisons of measured to predicted proportions of four F-concentrations categories (<0.7 mg/L, 0.7-2 mg/L, >2 mg/L - 4 mg/L, and > 4 mg/L) indicate that the model performed well at making regional-scale predictions. Differences between measured and predicted proportions indicate underprediction of measured F at values by between 4 and 20 mg/L, representing less than 1% of the regional scale predicted values. These residuals most often map to geographic regions where local-scale processes including evaporative discharge in closed basins or intermittent streams concentrate fluoride in shallow groundwater. Despite this, the RFR model provides spatially continuous F predictions across the basin-fill aquifers where discrete samples are missing. Further, the predictions capture documented areas that exceed the F maximum contaminant level for drinking water of 4 mg/L and areas that are below the oral-health benchmark of 0.7 mg/L. These predictions can be used to estimate fluoride concentrations in unmonitored areas and to aid in identifying geographic areas that may require further investigation at localized scales.
Collapse
|
37
|
Zhang D, Wang P, Cui R, Yang H, Li G, Chen A, Wang H. Electrical conductivity and dissolved oxygen as predictors of nitrate concentrations in shallow groundwater in Erhai Lake region. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 802:149879. [PMID: 34464801 DOI: 10.1016/j.scitotenv.2021.149879] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 08/19/2021] [Accepted: 08/20/2021] [Indexed: 06/13/2023]
Abstract
Elevated nitrogen (N) concentration in shallow groundwater is becoming increasingly problematic, putting water resources under pressure. For more effective management of such a resource, more precise predictors of N level in groundwater using smart monitoring networks are needed. However, external factors such as land use type, rainfall, and N loads from multiple sources (residential and agricultural) make it difficult to accurately predict the spatial and temporal variations of N concentration. In order to identify the key factors affecting spatial and temporal N concentration in shallow groundwater and develop a predictive model, 635 groundwater samples from drinking wells in residential areas and agricultural wells in croplands of a typical agricultural watershed in the Erhai Lake Basin, southwest China, in the period from 2018 to 2020, were collected and analyzed. The results showed that the type of land use and seasonal variations significantly affected the N forms and their concentrations in the shallow groundwater, as the ratios of ON and NO3--N to TN were 30%-39% and 52%-59% for the two land uses and 25%-44% and 46%-66% for seasonal changes. Their variations were reflected by electrical conductivity (EC) and redox environment. EC and dissolved oxygen (DO) had a positive non-linear relationship with the concentrations of total nitrogen (TN) and nitrate (NO3--N). The fitted non-linear quantitative models were established separately to predict TN and NO3--N concentrations in groundwater using easily available indictors (EC and DO). The high accuracy and performance of the models were investigated and approved by rRMSE, MAE, and 1:1 line. These findings can provide technical support for the rapid prediction and evaluation of N pollution in shallow groundwater through easily available indicators.
Collapse
Affiliation(s)
- Dan Zhang
- College of Resource and Environment, Yunnan Agricultural University, Kunming 650201, China
| | - Panlei Wang
- College of Resource and Environment, Yunnan Agricultural University, Kunming 650201, China; Agricultural Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming 650201, China
| | - Rongyang Cui
- Key Laboratory of Mountain Surface Processes and Ecological Regulation, Chinese Academy of Sciences, Institute of Mountain Hazards and Environment, Chinese Academy of Sciences and Ministry of Water Conservancy, Chengdu 610041, China
| | - Heng Yang
- College of Resource and Environment, Yunnan Agricultural University, Kunming 650201, China
| | - Guifang Li
- College of Resource and Environment, Yunnan Agricultural University, Kunming 650201, China
| | - Anqiang Chen
- Agricultural Environment and Resources Institute, Yunnan Academy of Agricultural Sciences, Kunming 650201, China.
| | - Hongyuan Wang
- Key Laboratory of Non-point Source Pollution Control, Ministry of Agriculture and Rural Affairs, Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
38
|
Islam ARMT, Pal SC, Chowdhuri I, Salam R, Islam MS, Rahman MM, Zahid A, Idris AM. Application of novel framework approach for prediction of nitrate concentration susceptibility in coastal multi-aquifers, Bangladesh. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 801:149811. [PMID: 34467937 DOI: 10.1016/j.scitotenv.2021.149811] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/26/2021] [Revised: 07/31/2021] [Accepted: 08/17/2021] [Indexed: 06/13/2023]
Abstract
This study aims to construct a novel framework approach for predicting and mapping nitrate concentration susceptibility in the coastal multi-aquifers of Bangladesh by coupling the K-fold cross-validation method and novel ensemble learning algorithms, including Boosting, Bagging and Random Forest (RF). In total, 286 nitrate sampling sites were employed in the model work. The dataset was demarcated into a 75:25 ratio for model construction (75% 3-fold ≅ 214 sites) and (25% 1-fold ≅ 72 sites) for model validation using the 4-fold cross-validation schemes. A total of 14 groundwater causative factors including salinity, depth, pH, EC, As, HCO3-, F-, Cl-, SO42-, PO42-, Na+, K+, Mg2+, and Ca2+ were adopted for the construction of the proposed models. OneR relative importance model was employed to choose and rank critical factors for spatial nitrate modeling. The results showed that depth, pH and As are the most influential causative factors in the elevated nitrate concentration in groundwater. Based on the model assessment criteria such as receiver operating characteristic (ROC)'s AUC (area under curve), sensitivity, specificity, accuracy, precession, F score, and Kappa coefficient, the Boosting model outperforms others (r = 0.92, AUG ≥ 0.90) in mapping nitrate concentration susceptibility, followed by Bagging and RF models. The results of mapping nitrate concentration also demonstrated that the south-central and western regions had an elevated amount of nitrate content than other regions due to depth variation in the study area. During our sampling campaign, we observed hundreds of fish hatcheries operation, a fish landing center and aquaculture farms which are the reasons for overexploitation and excessive agrochemicals used in the study area. Thus, the dependability of ensemble learning modeling verifies the effectiveness and applicability of the proposed novel approach for decision-makers in groundwater pollution management at the local and regional levels.
Collapse
Affiliation(s)
| | - Subodh Chandra Pal
- Department of Geography, The University of Burdwan, Bardhaman 713104, West Bengal, India.
| | - Indrajit Chowdhuri
- Department of Geography, The University of Burdwan, Bardhaman 713104, West Bengal, India
| | - Roquia Salam
- Department of Disaster Management, Begum Rokeya University, Rangpur 5400, Bangladesh
| | - Md Saiful Islam
- Department of Soil Science, Patuakhali Science and Technology University, Dumki, Patuakhali 8602, Bangladesh
| | - Md Mostafizur Rahman
- Department of Environmental Sciences, Jahangirnagar University, Dhaka 1342, Bangladesh
| | - Anwar Zahid
- Bangladesh Water Development Board (BWDB), Dhaka, Bangladesh
| | - Abubakr M Idris
- Department of Chemistry, College of Science, King Khalid University, Abha 61431, Saudi Arabia; Research Center for Advanced Materials Science (RCAMS), King Khalid University, Abha, Saudi Arabia
| |
Collapse
|
39
|
Assessing Nitrate Contamination Risks in Groundwater: A Machine Learning Approach. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app112110034] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Groundwater is one of the primary sources for the daily water requirements of the masses, but it is subjected to contamination due to the pollutants, such as nitrate, percolating through the soil with water. Especially in built-up areas, groundwater vulnerability and contamination are of major concern, and require appropriate consideration. The present study develops a novel framework for assessing groundwater nitrate contamination risk for the area along the Karakoram Highway, which is a part of the China Pakistan Economic Corridor (CPEC) route in northern Pakistan. A groundwater vulnerability map was prepared using the DRASTIC model. The nitrate concentration data from a previous study were used to formulate the nitrate contamination map. Three machine learning (ML) models, i.e., Support Vector Machine (SVM), Multivariate Discriminant Analysis (MDA), and Boosted Regression Trees (BRT), were used to analyze the probability of groundwater contamination incidence. Furthermore, groundwater contamination probability maps were obtained utilizing the ensemble modeling approach. The models were calibrated and validated through calibration trials, using the area under the receiver operating characteristic curve method (AUC), where a minimum AUC threshold value of 80% was achieved. Results indicated the accuracy of the models to be in the range of 0.82–0.87. The final groundwater contamination risk map highlights that 34% of the area is moderately vulnerable to groundwater contamination, and 13% of the area is exposed to high groundwater contamination risk. The findings of this study can facilitate decision-making regarding the location of future built-up areas properly in order to mitigate the nitrate contamination that can further reduce the associated health risks.
Collapse
|
40
|
Allocca V, Di Napoli M, Coda S, Carotenuto F, Calcaterra D, Di Martire D, De Vita P. A novel methodology for Groundwater Flooding Susceptibility assessment through Machine Learning techniques in a mixed-land use aquifer. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 790:148067. [PMID: 34111794 DOI: 10.1016/j.scitotenv.2021.148067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 05/21/2021] [Accepted: 05/23/2021] [Indexed: 06/12/2023]
Abstract
Many areas around the world are affected by Groundwater Level rising (GWLr). One of the most severe consequences of this phenomenon is Groundwater Flooding (GF), with serious impacts for the human and natural environment. In Europe, GF has recently received specific attention with Directive 2007/60/EC, which requires Member States to map GF hazard and propose measures for risk mitigation. In this paper a methodology has been developed for Groundwater Flooding Susceptibility (GFS) assessment, using for the first time Spatial Distribution Models. These Machine Learning techniques connect occurrence data to predisposing factors (PFs) to estimate their distributions. The implemented methodology employs aquifer type, depth of piezometric level, thickness and hydraulic conductivity of unsaturated zone, drainage density and land-use as PFs, and a GF observations inventory as occurrences. The algorithms adopted to perform the analysis are Generalized Boosting Model, Artificial Neural Network and Maximum Entropy. Ensemble Models are carried out to reduce the uncertainty associated with each algorithm and increase its reliability. GFS is mapped by choosing the ensemble model with the best predictivity performance and dividing occurrence probability values into five classes, from very low to very high susceptibility, using Natural Breaks classification. The methodology has been tested and statistically validated in an area of 14,3 km2 located in the Metropolitan City of Naples (Italy), affected by GWLr since 1990 and GF in buildings and agricultural soils since 2007. The results of modeling show that about 93% of the inventoried points fall in the high and very high GFS classes, and piezometric level depth, thickness of unsaturated zone and drainage density are the most influencing PFs, in accordance with field observations and the triggering mechanism of GF. The outcomes provide a first step in the assessment of GF hazard and a decision support tool to local authorities for GF risk management.
Collapse
Affiliation(s)
- Vincenzo Allocca
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy.
| | - Mariano Di Napoli
- Department of Earth, Environment and Life Sciences, University of Genoa, Corso Europa 26, 16132 Genoa, Italy
| | - Silvio Coda
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy.
| | - Francesco Carotenuto
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Domenico Calcaterra
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Diego Di Martire
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Pantaleone De Vita
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| |
Collapse
|
41
|
Spatial Prediction of Groundwater Potentiality in Large Semi-Arid and Karstic Mountainous Region Using Machine Learning Models. WATER 2021. [DOI: 10.3390/w13162273] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The drinking and irrigation water scarcity is a major global issue, particularly in arid and semi-arid zones. In rural areas, groundwater could be used as an alternative and additional water supply source in order to reduce human suffering in terms of water scarcity. In this context, the purpose of the present study is to facilitate groundwater potentiality mapping via spatial-modelling techniques, individual and ensemble machine-learning models. Random forest (RF), logistic regression (LR), decision tree (DT) and artificial neural networks (ANNs) are the main algorithms used in this study. The preparation of groundwater potentiality maps was assembled into 11 ensembles of models. Overall, about 374 groundwater springs was identified and inventoried in the mountain area. The spring inventory data was randomly divided into training (75%) and testing (25%) datasets. Twenty-four groundwater influencing factors (GIFs) were selected based on a multicollinearity test and the information gain calculation. The results of the groundwater potentiality mapping were validated using statistical measures and the receiver operating characteristic curve (ROC) method. Finally, a ranking of the 15 models was achieved with the prioritization rank method using the compound factor (CF) method. The ensembles of models are the most stable and suitable for groundwater potentiality mapping in mountainous aquifers compared to individual models based on success and prediction rate. The most efficient model using the area under the curve validation method is the RF-LR-DT-ANN ensemble of models. Moreover, the results of the prioritization rank indicate that the best models are the RF-DT and RF-LR-DT ensembles of models.
Collapse
|
42
|
Serra J, Cameira MDR, Cordovil CMDS, Hutchings NJ. Development of a groundwater contamination index based on the agricultural hazard and aquifer vulnerability: Application to Portugal. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 772:145032. [PMID: 33581543 DOI: 10.1016/j.scitotenv.2021.145032] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 12/13/2020] [Accepted: 01/04/2021] [Indexed: 06/12/2023]
Abstract
Reducing nitrate leaching may not result in a significant improvement of groundwater quality. The amount of nitrate reaching groundwater depends not only on the hazard related to agricultural activities but also on-site specific groundwater vulnerability. Using national databases and other compiled datasets, the agricultural hazard was calculated as the ratio of (i) the nitrate leached estimated from the N surplus, and (ii) the water surplus, a proxy of the percolating water below the root zone. By combining the hazard with a multi-parameter groundwater vulnerability, a spatially explicit groundwater contamination risk, developed for mainland Portugal, was computed for 1999 and 2009. Results show an increase from 8,800 to 82,679 ha of the territory rated with a very high contamination risk. The priority areas were successfully screened by the Index, coinciding with the current Vulnerable Zones, although additional hotspots were detected in southern Portugal. Percolation, including both irrigation activity and precipitation, was found to be a key driver for the groundwater contamination risk due to its opposite effects in the hazard and in the vulnerability. Reducing nitrogen leaching may be insufficient to reduce the risk of nitrate contamination if there is a relatively larger reduction in precipitation. This index is particularly useful when applied to contrasting situations of vulnerability and hazard, which require distinct mitigation measures to mitigate groundwater contamination.
Collapse
Affiliation(s)
- João Serra
- Instituto Superior de Agronomia, DCEB, Tapada da Ajuda, 1349-017 Lisbon, Portugal; CEF, Instituto Superior de Agronomia, Universidade de Lisboa, 1349-017 Lisboa, Portugal.
| | - Maria do Rosário Cameira
- Instituto Superior de Agronomia, DCEB, Tapada da Ajuda, 1349-017 Lisbon, Portugal; LEAF- Linking Landscape, Environment, Agriculture and Food, Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017 Lisbon, Portugal
| | - Cláudia M D S Cordovil
- Instituto Superior de Agronomia, DCEB, Tapada da Ajuda, 1349-017 Lisbon, Portugal; CEF, Instituto Superior de Agronomia, Universidade de Lisboa, 1349-017 Lisboa, Portugal
| | - Nicholas J Hutchings
- Department of Agroecology, Aarhus University, Blichers Allé 20, DK-8830 Tjele, Denmark
| |
Collapse
|
43
|
White K, Dickson-Anderson S, Majury A, McDermott K, Hynds P, Brown RS, Schuster-Wallace C. Exploration of E. coli contamination drivers in private drinking water wells: An application of machine learning to a large, multivariable, geo-spatio-temporal dataset. WATER RESEARCH 2021; 197:117089. [PMID: 33836295 DOI: 10.1016/j.watres.2021.117089] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Revised: 02/22/2021] [Accepted: 03/23/2021] [Indexed: 06/12/2023]
Abstract
Groundwater resources are under increasing threats from contamination and overuse, posing direct threats to human and environmental health. The purpose of this study is to better understand drivers of, and relationships between, well and aquifer characteristics, sampling frequencies, and microbiological contamination indicators (specifically E. coli) as a precursor for improving knowledge and tools to assess aquifer vulnerability and well contamination within Ontario, Canada. A dataset with 795, 023 microbiological testing observations over an eight-year period (2010 to 2017) from 253,136 unique wells across Ontario was employed. Variables in this dataset include date and location of test, test results (E. coli concentration), well characteristics (well depth, location), and hydrogeological characteristics (bottom of well stratigraphy, specific capacity). Association rule analysis, univariate and bivariate analyses, regression analyses, and variable discretization techniques were utilized to identify relationships between E. coli concentration and the other variables in the dataset. These relationships can be used to identify drivers of contamination, their relative importance, and therefore potential public health risks associated with the use of private wells in Ontario. Key findings are that: i) bedrock wells completed in sedimentary or igneous rock are more susceptible to contamination events; ii) while shallow wells pose a greater risk to consumers, deep wells are also subject to contamination events and pose a potentially unanticipated risk to health of well users; and, iii) well testing practices are influenced by results of previous tests. Further, while there is a general correlation between months with the greatest testing frequencies and concentrations of E. coli occurring in samples, an offset in this timing is observed in recent years. Testing remains highest in July while peaks in adverse results occur up to three months later. The realization of these trends prompts a need to further explore the bases for such occurrences.
Collapse
Affiliation(s)
- Katie White
- Department of Civil Engineering, McMaster University, 1280 Main St. W, Hamilton, Ontario, L8S 4L8, Canada
| | - Sarah Dickson-Anderson
- Department of Civil Engineering, McMaster University, 1280 Main St. W, Hamilton, Ontario, L8S 4L8, Canada; Department of Geography and Planning and Global Institute for Water Security, University of Saskatchewan, 117 Science Place, Saskatoon, Saskatchewan, S7N 5C8, Canada.
| | - Anna Majury
- Public Health Ontario, 181 Barrie St, Kingston, Ontario, K7L 3K2, Canada; Department of Biology and Molecular Sciences, Department of Public Health Sciences, School of Environmental Studies, Queen's University, 99 University Ave, Kingston, Ontario, K7L 3N6, Canada
| | - Kevin McDermott
- Public Health Ontario, 181 Barrie St, Kingston, Ontario, K7L 3K2, Canada
| | - Paul Hynds
- Environmental Sustainability and Health Institute, Technological University Dublin, Grangegorman Dublin 7, Republic of Ireland
| | - R Stephen Brown
- Department of Chemistry and School of Environmental Studies, Queen's University, 99 University Ave, Kingston, Ontario, K7L 3N6, Canada
| | - Corinne Schuster-Wallace
- Department of Civil Engineering, McMaster University, 1280 Main St. W, Hamilton, Ontario, L8S 4L8, Canada; Department of Geography and Planning and Global Institute for Water Security, University of Saskatchewan, 117 Science Place, Saskatoon, Saskatchewan, S7N 5C8, Canada
| |
Collapse
|
44
|
In Situ Monitoring of Nitrate Content in Leafy Vegetables Using Attenuated Total Reflectance − Fourier-Transform Mid-infrared Spectroscopy Coupled with Machine Learning Algorithm. FOOD ANAL METHOD 2021. [DOI: 10.1007/s12161-021-02048-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
45
|
Stackelberg PE, Belitz K, Brown CJ, Erickson ML, Elliott SM, Kauffman LJ, Ransom KM, Reddy JE. Machine Learning Predictions of pH in the Glacial Aquifer System, Northern USA. GROUND WATER 2021; 59:352-368. [PMID: 33314084 PMCID: PMC8246943 DOI: 10.1111/gwat.13063] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 12/05/2020] [Accepted: 12/08/2020] [Indexed: 05/05/2023]
Abstract
A boosted regression tree model was developed to predict pH conditions in three dimensions throughout the glacial aquifer system of the contiguous United States using pH measurements in samples from 18,386 wells and predictor variables that represent aspects of the hydrogeologic setting. Model results indicate that the carbonate content of soils and aquifer materials strongly controls pH and, when coupled with long flowpaths, results in the most alkaline conditions. Conversely, in areas where glacial sediments are thin and carbonate-poor, pH conditions remain acidic. At depths typical of drinking-water supplies, predicted pH >7.5-which is associated with arsenic mobilization-occurs more frequently than predicted pH <6-which is associated with water corrosivity and the mobilization of other trace elements. A novel aspect of this model was the inclusion of numerically based estimates of groundwater flow characteristics (age and flowpath length) as predictor variables. The sensitivity of pH predictions to these variables was consistent with hydrologic understanding of groundwater flow systems and the geochemical evolution of groundwater quality. The model was not developed to provide precise estimates of pH at any given location. Rather, it can be used to more generally identify areas where contaminants may be mobilized into groundwater and where corrosivity issues may be of concern to prioritize areas for future groundwater monitoring.
Collapse
|
46
|
Lee S, Kaown D, Koh EH, Ko KS, Lee KK. Delineation of groundwater quality locations suitable for target end-use purposes through deep neural network models. JOURNAL OF ENVIRONMENTAL QUALITY 2021; 50:416-428. [PMID: 33576503 DOI: 10.1002/jeq2.20206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 02/01/2021] [Indexed: 06/12/2023]
Abstract
Groundwater is the main source of water for beverages, and its quality varies depending on extraction location; this is particularly the case in regions with complex geology, topography, and multiple forms of land use. Thus, it is important to determine a suitable groundwater extraction location based on intended water use and the related water quality standards. In this study, deep neural network (DNN) models and GIS data relating to groundwater quality were applied to estimate potential maps of Gangwon Province in South Korea, where groundwater is frequently extracted for drinking purposes. These maps specify areas where the groundwater quality is conducive for being used as mineral water and water for brewing coffee (hereafter referred as "coffee water"). Sensitivity analysis identified how inputs were sensitive to model estimation and showed that land-use variables were the most sensitive. The importance of each variable quantified how good or bad its region is for the desired groundwater. The overall features of importance were similar between mineral water and coffee water. However, with differences in hydrogeological units, carbonate rock was a variable of high positive importance for mineral water; metamorphic rock was its equivalent for coffee water. Our results offer a potential map of desired groundwater quality in the absence of a detailed understanding of the underlying hydrochemical processes governing groundwater quality. Additionally, the development of such a potential mapping model can help to determine the appropriate development area of groundwater for their respective purposes.
Collapse
Affiliation(s)
- Sanghoon Lee
- School of Earth and Environmental Sciences, Seoul National Univ., Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Dugin Kaown
- School of Earth and Environmental Sciences, Seoul National Univ., Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Eun-Hee Koh
- School of Earth and Environmental Sciences, Seoul National Univ., Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Kyung-Seok Ko
- Korea Institute of Geoscience and Mineral Resources, Daejeon, 34132, Republic of Korea
| | - Kang-Kun Lee
- School of Earth and Environmental Sciences, Seoul National Univ., Gwanak-gu, Seoul, 08826, Republic of Korea
| |
Collapse
|
47
|
Comparative Analysis of Artificial Intelligence Models for Accurate Estimation of Groundwater Nitrate Concentration. SENSORS 2020; 20:s20205763. [PMID: 33053663 PMCID: PMC7599737 DOI: 10.3390/s20205763] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Revised: 09/23/2020] [Accepted: 09/28/2020] [Indexed: 11/17/2022]
Abstract
Prediction of the groundwater nitrate concentration is of utmost importance for pollution control and water resource management. This research aims to model the spatial groundwater nitrate concentration in the Marvdasht watershed, Iran, based on several artificial intelligence methods of support vector machine (SVM), Cubist, random forest (RF), and Bayesian artificial neural network (Baysia-ANN) machine learning models. For this purpose, 11 independent variables affecting groundwater nitrate changes include elevation, slope, plan curvature, profile curvature, rainfall, piezometric depth, distance from the river, distance from residential, Sodium (Na), Potassium (K), and topographic wetness index (TWI) in the study area were prepared. Nitrate levels were also measured in 67 wells and used as a dependent variable for modeling. Data were divided into two categories of training (70%) and testing (30%) for modeling. The evaluation criteria coefficient of determination (R2), mean absolute error (MAE), root mean square error (RMSE), and Nash–Sutcliffe efficiency (NSE) were used to evaluate the performance of the models used. The results of modeling the susceptibility of groundwater nitrate concentration showed that the RF (R2 = 0.89, RMSE = 4.24, NSE = 0.87) model is better than the other Cubist (R2 = 0.87, RMSE = 5.18, NSE = 0.81), SVM (R2 = 0.74, RMSE = 6.07, NSE = 0.74), Bayesian-ANN (R2 = 0.79, RMSE = 5.91, NSE = 0.75) models. The results of groundwater nitrate concentration zoning in the study area showed that the northern parts of the case study have the highest amount of nitrate, which is higher in these agricultural areas than in other areas. The most important cause of nitrate pollution in these areas is agriculture activities and the use of groundwater to irrigate these crops and the wells close to agricultural areas, which has led to the indiscriminate use of chemical fertilizers by irrigation or rainwater of these fertilizers is washed and penetrates groundwater and pollutes the aquifer.
Collapse
|
48
|
Susceptibility Prediction of Groundwater Hardness Using Ensemble Machine Learning Models. WATER 2020. [DOI: 10.3390/w12102770] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Groundwater resources, unlike surface water, are more vulnerable to disturbances and contaminations, as they take a very long time and significant cost to recover. So, predictive modeling and prevention strategies can empower policymakers for efficient groundwater governance through informed decisions and recommendations. Due to the importance of groundwater quality modeling, the hardness susceptibility mapping using machine learning (ML) models has not been explored. For the first time, the current research aimed to predict groundwater hardness susceptibility using the ML models. The performance of two ensemble models of boosted regression trees (BRT) and random forest (RF) is investigated through the arrangement of a comparative study with multivariate discriminant analysis (MDA). According to the hardness values in 135 groundwater quality monitoring wells, the hard and soft water are determined; then, 11 predictor variables including distance from the sea (DFS), land use, elevation, distance from the river (DFR), depth to groundwater (DTGW), pH, precipitation (PCP), evaporation (E), groundwater level (GWL), curvature, and lithology are used for predicting the groundwater hardness susceptibility map. Results indicated that the variables of DFR, DTGW, elevation, and DFS had a higher contribution to the modeling process. So, the high harness areas are mostly related to low elevations, low DTGW, and proximity to river and sea, which facilitate the percolation conditions for minerals containing calcium or magnesium into groundwater.
Collapse
|
49
|
Optimised neural network model for river-nitrogen prediction utilizing a new training approach. PLoS One 2020; 15:e0239509. [PMID: 32986717 PMCID: PMC7521719 DOI: 10.1371/journal.pone.0239509] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 09/08/2020] [Indexed: 01/18/2023] Open
Abstract
In the past few decades, there has been a rapid growth in the concentration of nitrogenous compounds such as nitrate-nitrogen and ammonia-nitrogen in rivers, primarily due to increasing agricultural and industrial activities. These nitrogenous compounds are mainly responsible for eutrophication when present in river water, and for ‘blue baby syndrome’ when present in drinking water. High concentrations of these compounds in rivers may eventually lead to the closure of treatment plants. This study presents a training and a selection approach to develop an optimum artificial neural network model for predicting monthly average nitrate-N and monthly average ammonia-N. Several studies have predicted these compounds, but most of the proposed procedures do not involve testing various model architectures in order to achieve the optimum predicting model. Additionally, none of the models have been trained for hydrological conditions such as the case of Malaysia. This study presents models trained on the hydrological data from 1981 to 2017 for the Langat River in Selangor, Malaysia. The model architectures used for training are General Regression Neural Network (GRNN), Multilayer Neural Network and Radial Basis Function Neural Network (RBFNN). These models were trained for various combinations of internal parameters, input variables and model architectures. Post-training, the optimum performing model was selected based on the regression and error values and plot of predicted versus observed values. Optimum models provide promising results with a minimum overall regression value of 0.92.
Collapse
|
50
|
Rostami AA, Karimi V, Khatibi R, Pradhan B. An investigation into seasonal variations of groundwater nitrate by spatial modelling strategies at two levels by kriging and co-kriging models. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2020; 270:110843. [PMID: 32721304 DOI: 10.1016/j.jenvman.2020.110843] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 04/21/2020] [Accepted: 05/24/2020] [Indexed: 06/11/2023]
Abstract
Nitrate pollution of groundwater through spatial models is investigated in this paper by using a sample of nitrate values at monitoring wells using the data from four seasons of a year, in which data are sparse. Two spatial modelling strategies are formulated at two levels, in which Strategy 1 comprises: three variations of kriging-based models (ordinary kriging, simple kriging and universal kriging), which are constructed at Level 1 to predict nitrate concentrations; and a Multiple Co-Kriging (MCoK) model is used at Level 2 to enhance the accuracy of the predictions. Strategy 2 is also at two levels but employs Indicator Kriging (IK) at Level 1 as a probabilistic spatial model to predict areas at risk of exceeding two thresholds of 37.5 mg/L and 50 mg/L of nitrate concentration, and Multiple Co-Indicator Kriging (MCoIK) at Level 2 for a better accuracy. The improvements at Level 2 for both strategies are remarkable and hence they are used to gain an insight into inherent problems. The results of a study delineate areas with excessive nitrate concentrations, which are in the vicinity of urban areas and hence reflect poor planning practices since the 1990s. The results further reveal the patterns on sensitivities to seasonal variations driven by aquifer recharge and strong dilution processes in spring times; and on the role of pumpage impacting aquifers giving rise to possible hotspots of nitrate concentrations.
Collapse
Affiliation(s)
- Ali Asghar Rostami
- Department of Water Engineering, University of Tabriz, Tabriz, East Azerbaijan, Iran
| | - Vahid Karimi
- Department of Water Engineering, University of Tabriz, Tabriz, East Azerbaijan, Iran.
| | | | - Biswajeet Pradhan
- The Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, NSW, 2007, Australia; Department of Energy and Mineral Resources Engineering, Choongmu-gwan, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul, 05006, Republic of Korea
| |
Collapse
|