1
|
Kim HI, Kim D, Mahdian M, Salamattalab MM, Bateni SM, Noori R. Incorporation of water quality index models with machine learning-based techniques for real-time assessment of aquatic ecosystems. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 355:124242. [PMID: 38810684 DOI: 10.1016/j.envpol.2024.124242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/12/2024] [Accepted: 05/26/2024] [Indexed: 05/31/2024]
Abstract
Water quality index (WQI) is a well-established tool for assessing the overall quality of fresh inland-waters. However, the effectiveness of real-time assessment of aquatic ecosystems using the WQI is usually impacted by the absence of some water quality parameters in which their accurately in-situ measurements are impossible and face difficulties. Using a rich water quality dataset spanned from 1980 to 2023, we employed four machine learning-based models to estimate the British Colombia WQI (BCWQI) in the Lake Päijänne, Finland, without parameters like chemical oxygen demand (COD) and total phosphorus (TP). Measurement of both COD and TP is time-consuming, needs laboratory equipment and labor costs, and faces sampling-related difficulties. Our results suggest the machine learning-based models successfully estimate the BCWQI in Lake Päijänne when TP and COD are omitted from the dataset. The long-short term memory model is the least sensitive model to exclusion of COD and TP from inputs. This model with the coefficient of determination and root-mean squared error of 0.91 and 0.11, respectively, outperforms the support vector regression, random forest, and neural network models in real-time estimation of the BCWQI in Lake Päijänne. Incorporation of BCWQI with the machine learning-based models could enhance assessment of overall quality of inland-waters with a limited database in a more economical and time-saving way. Our proposed method is an effort to replace the traditional offline water quality assessment tools with a real-time model and improve understanding of decision-makers on the effectiveness of management practices on the changes in lake water quality.
Collapse
Affiliation(s)
- Hyung Il Kim
- DL E&C, Civil Business Division, Donuimun, D Tower, 134 Tongil-ro, Jongno-gu, Seoul, South Korea; Department of Civil and Environmental Engineering, Hongik University, Mapo-gu, Seoul, South Korea
| | - Dongkyun Kim
- Department of Civil and Environmental Engineering, Hongik University, Mapo-gu, Seoul, South Korea.
| | - Mehran Mahdian
- School of Civil Engineering, Iran University of Science and Technology, Narmak, Tehran, 1684613114, Iran
| | | | - Sayed M Bateni
- Department of Civil, Environmental and Construction Engineering, and Water Resources Research Center, University of Hawaii at Manoa, Honolulu, HI, 96822, USA
| | - Roohollah Noori
- Graduate Faculty of Environment, University of Tehran, Tehran, 1417853111, Iran; Faculty of Governance, University of Tehran, Tehran, 1439814151, Iran
| |
Collapse
|
2
|
Moeinzadeh H, Yong KT, Withana A. A critical analysis of parameter choices in water quality assessment. WATER RESEARCH 2024; 258:121777. [PMID: 38781620 DOI: 10.1016/j.watres.2024.121777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 04/25/2024] [Accepted: 05/12/2024] [Indexed: 05/25/2024]
Abstract
The determination of water quality heavily depends on the selection of parameters recorded from water samples for the water quality index (WQI). Data-driven methods, including machine learning models and statistical approaches, are frequently used to refine the parameter set for four main reasons: reducing cost and uncertainty, addressing the eclipsing problem, and enhancing the performance of models predicting the WQI. Despite their widespread use, there is a noticeable gap in comprehensive reviews that systematically examine previous studies in this area. Such reviews are essential to assess the validity of these objectives and to demonstrate the effectiveness of data-driven methods in achieving these goals. This paper sets out with two primary aims: first, to provide a review of the existing literature on methods for selecting parameters. Second, it seeks to delineate and evaluate the four principal motivations for parameter selection identified in the literature. This manuscript categorizes existing studies into two methodological groups for refining parameters: one focuses on preserving information within the dataset, and another ensures consistent prediction using the full set of parameters. It characterizes each group and evaluates how effectively each approach meets the four predefined objectives. The study presents that the minimal WQI approach, common to both categories, is the only approach that has successfully reduced recording costs. Nonetheless, it notes that simply reducing the number of parameters does not guarantee cost savings. Furthermore, the group of studies classified as preserving information within the dataset has demonstrated potential to decrease the eclipsing problem, whereas studies in the consistent prediction group have not been able to mitigate this issue. Additionally, since data-driven approaches still rely on the initial parameters chosen by experts, they do not eliminate the need for expert judgment. The study further points out that the WQI formula is a straightforward and expedient tool for assessing water quality. Consequently, the paper argues that employing machine learning solely to reduce the number of parameters to enhance WQI prediction is not a standalone solution. Rather, this objective should be integrated with a more comprehensive set of research goals. The critical analysis of research objectives and the characterization of previous studies lay the groundwork for future research. This groundwork will enable subsequent studies to evaluate how their proposed methods can effectively achieve these objectives.
Collapse
Affiliation(s)
- Hossein Moeinzadeh
- School of Computer Science, The University of Sydney, Sydney, 2006, New South Wales, Australia.
| | - Ken-Tye Yong
- School of Computer Science, The University of Sydney, Sydney, 2006, New South Wales, Australia; School of Biomedical Engineering, The University of Sydney, Sydney, 2006, New South Wales, Australia; Sydney Nano, The University of Sydney, Sydney, 2006, New South Wales, Australia
| | - Anusha Withana
- School of Computer Science, The University of Sydney, Sydney, 2006, New South Wales, Australia; Sydney Nano, The University of Sydney, Sydney, 2006, New South Wales, Australia
| |
Collapse
|
3
|
Lin Z, Lim JY, Oh JM. Innovative interpretable AI-guided water quality evaluation with risk adversarial analysis in river streams considering spatial-temporal effects. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 350:124015. [PMID: 38657892 DOI: 10.1016/j.envpol.2024.124015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/17/2024] [Accepted: 04/18/2024] [Indexed: 04/26/2024]
Abstract
Water security remains a critical issue given the looming threats of industrial pollution, necessitating comprehensive assessments of water quality to address seasonal fluctuations and influential factors while formulating effective strategies for decision makers. This study introduces a novel approach for evaluating water quality within a complex riverine zone in South Korea: Han River that encompasses five river streams situated at each junction of North and South streams (including Gyeongan Stream) that ultimately leading towards Paldang Lake. By utilizing the monthly water characteristic data from the year 2013-2022 across 14 different locations, the significant seasonal trends and potential influences on water quality are identified. The water quality here is calculated with the proposed method of sub-index water quality index (s-WQI). A combinatorial prediction approach of s-WQI for each location is conducted through a collective of data preprocessing approaches including Hampel filtering and feature selection in prior to the machine learning predictions. In return, light gradient boosting (LGB) is the most accurate predictor by outperforming other prediction algorithms, especially through LGB-Pearson and LGB-Spearman combinations for North and South stream intersections, and LGB-Pearson for Paldang Lake. To further evaluate the robustness of this evaluation and extending the results to a foreseeable scenario, a seasonal based Monte-Carlo Simulation with 10,000 attempts targeting the water characteristic distributions obtained from each location considered are carried out to identify the risk bounds within. The results are further interpreted with SHAP analysis on identifying the contributions of each water characteristics towards the water quality through local and global spectrum. This research yields practical implications, offering tailored strategies for water quality enhancement and early warning systems. The integration of AI-based prediction and feature selection underscores the transformative potential of computational techniques in advancing data-driven water quality assessments, shaping the future of environmental science research.
Collapse
Affiliation(s)
- ZiYu Lin
- Department of Environmental Science and Engineering, Kyung Hee University, Yongin-si, 17104, Gyeonggi, Republic of Korea
| | - Juin Yau Lim
- Korea Biochar Research Center & APRU Sustainable Waste Management Program & Division of Environmental Science and Ecological Engineering, Korea University, Seoul, 02841, Republic of Korea; School of Business Administration, Korea University, Seoul, 02841, Republic of Korea
| | - Jong-Min Oh
- Department of Environmental Science and Engineering, Kyung Hee University, Yongin-si, 17104, Gyeonggi, Republic of Korea.
| |
Collapse
|
4
|
Ortiz-Lopez C, Bouchard C, Rodriguez MJ. Ensemble machine learning using hydrometeorological information to improve modeling of quality parameter of raw water supplying treatment plants. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 362:121378. [PMID: 38838533 DOI: 10.1016/j.jenvman.2024.121378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 05/03/2024] [Accepted: 06/02/2024] [Indexed: 06/07/2024]
Abstract
Source and raw water quality may deteriorate due to rainfall and river flow events that occur in watersheds. The effects on raw water quality are normally detected in drinking water treatment plants (DWTPs) with a time-lag after these events in the watersheds. Early warning systems (EWSs) in DWTPs require models with high accuracy in order to anticipate changes in raw water quality parameters. Ensemble machine learning (EML) techniques have recently been used for water quality modeling to improve accuracy and decrease variance in the outcomes. We used three decision-tree-based EML models (random forest [RF], gradient boosting [GB], and eXtreme Gradient Boosting [XGB]) to predict two critical parameters for DWTPs, raw water Turbidity and UV absorbance (UV254), using rainfall and river flow time series as predictors. When modeling raw water turbidity, the three EML models (rRF-Tu2=0.87, rGB-Tu2=0.80 and rXGB-Tu2=0.81) showed very good performance metrics. For raw water UV254, the three models (rRF-UV2=0.89, rGB-UV2=0.85 and rXGB-UV2=0.88) again showed very good performance metrics. Results from this study suggest that EML approaches could be used in EWSs to anticipate changes in the quality parameters of raw water and enhance decision-making in DWTPs.
Collapse
Affiliation(s)
- Christian Ortiz-Lopez
- Centre de Recherche en Aménagement et Développement (CRAD), Université Laval, 2325 Allée des Bibliothèques, Québec City, QC, G1V 0A6, Canada.
| | - Christian Bouchard
- Centre de Recherche en Aménagement et Développement (CRAD), Université Laval, 2325 Allée des Bibliothèques, Québec City, QC, G1V 0A6, Canada
| | - Manuel J Rodriguez
- École Supérieure d'Aménagement du Territoire et de Développement Régional (ESAD), Université Laval, 2325 Allée des Bibliothèques, Québec City, QC, G1V 0A6, Canada
| |
Collapse
|
5
|
Kushwaha NL, Kudnar NS, Vishwakarma DK, Subeesh A, Jatav MS, Gaddikeri V, Ahmed AA, Abdelaty I. Stacked hybridization to enhance the performance of artificial neural networks (ANN) for prediction of water quality index in the Bagh river basin, India. Heliyon 2024; 10:e31085. [PMID: 38784559 PMCID: PMC11112320 DOI: 10.1016/j.heliyon.2024.e31085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 05/03/2024] [Accepted: 05/09/2024] [Indexed: 05/25/2024] Open
Abstract
Water quality assessment is paramount for environmental monitoring and resource management, particularly in regions experiencing rapid urbanization and industrialization. This study introduces Artificial Neural Networks (ANN) and its hybrid machine learning models, namely ANN-RF (Random Forest), ANN-SVM (Support Vector Machine), ANN-RSS (Random Subspace), ANN-M5P (M5 Pruned), and ANN-AR (Additive Regression) for water quality assessment in the rapidly urbanizing and industrializing Bagh River Basin, India. The Relief algorithm was employed to select the most influential water quality input parameters, including Nitrate (NO3-), Magnesium (Mg2+), Sulphate (SO42-), Calcium (Ca2+), and Potassium (K+). The comparative analysis of developed ANN and its hybrid models was carried out using statistical indicators (i.e., Nash-Sutcliffe Efficiency (NSE), Pearson Correlation Coefficient (PCC), Coefficient of Determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Relative Root Square Error (RRSE), Relative Absolute Error (RAE), and Mean Bias Error (MBE)) and graphical representations (i.e., Taylor diagram). Results indicate that the integration of support vector machine (SVM) with ANN significantly improves performance, yielding impressive statistical indicators: NSE (0.879), R2 (0.904), MAE (22.349), and MBE (12.548). The methodology outlined in this study can serve as a template for enhancing the predictive capabilities of ANN models in various other environmental and ecological applications, contributing to sustainable development and safeguarding natural resources.
Collapse
Affiliation(s)
- Nand Lal Kushwaha
- Department of Soil and Water Engineering, Punjab Agricultural University Ludhiana, Punjab, 141004, India
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Nanabhau S. Kudnar
- Department of Geography, C. J. Patel College Tirora, Gondia, Maharashtra, 441911, India
| | - Dinesh Kumar Vishwakarma
- Department of Irrigation and Drainage Engineering, G.B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, 263145, India
| | - A. Subeesh
- ICAR- Central Institute of Agricultural Engineering, Bhopal, Madhya Pradesh, 462038, India
| | - Malkhan Singh Jatav
- National Institute of Hydrology, North Western Regional Centre, Jodhpur, Rajasthan, 342003, India
| | - Venkatesh Gaddikeri
- Division of Agricultural Engineering, ICAR-Indian Agricultural Research Institute, New Delhi, 110012, India
| | - Ashraf A. Ahmed
- Department of Civil and Environmental Engineering, Brunel University London, Kingston Lane, Uxbridge UB38PH, UK
| | - Ismail Abdelaty
- Water and Water Structures Engineering Department, Faculty of Engineering, Zagazig University, Zagazig, 44519, Egypt
| |
Collapse
|
6
|
Kolli MK, Chinnasamy P. Estimating turbidity concentrations in highly dynamic rivers using Sentinel-2 imagery in Google Earth Engine: Case study of the Godavari River, India. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:33837-33847. [PMID: 38691292 DOI: 10.1007/s11356-024-33344-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 04/11/2024] [Indexed: 05/03/2024]
Abstract
Turbidity is an essential biogeochemical parameter for water quality management because it shapes the physical landscape and regulates ecological systems. It varies spatially and temporally across large water bodies, but monitoring based on point-source field observations remains a difficult task in developing countries due to the need for logistics and costs. In this study, we present a novel semi-analytical approach for estimating turbidity from remote sensing reflectance ( R rs ) in moderate to highly turbid waters in the lower part of the Godavari River (i.e., locations near Rajahmundry). The proposed method includes two sub-algorithms-Normalized Difference Turbidity Index (NDTI) and semi-empirical single-band turbidity ( T s ) algorithm-to retrieve spectral reflectance information corresponding to the study locations for turbidity modeling. Sentinel-2 Multi-Spectral Imager data have been used to quantify the turbidity in the Google Earth Engine (GEE) platform. The correlation analysis was observed between spectral reflectance values and in situ turbidity data using cubic polynomial regression equations. The results indicated that the T s , which uses the only red-edge wavelength, identified turbidity as the most accurate across all locations (highest R2 = 0.91, lowest RMSE = 0.003), followed by NDTI (highest R2 = 0.85, lowest RMSE = 0.05), respectively. The remote sensing data application provides a better way to monitor turbidity at large spatio-temporal scales in attaining the water quality standards of the Godavari River.
Collapse
Affiliation(s)
- Meena Kumari Kolli
- Centre for Technology Alternatives for Rural Areas, Indian Institute of Technology (IITB), Powai, Mumbai, Maharashtra, 400076, India
- Rural Data Research and Analysis Lab (RuDRA), IIT Bombay, Mumbai, India
| | - Pennan Chinnasamy
- Centre for Technology Alternatives for Rural Areas, Indian Institute of Technology (IITB), Powai, Mumbai, Maharashtra, 400076, India.
- Rural Data Research and Analysis Lab (RuDRA), IIT Bombay, Mumbai, India.
- Interdisciplinary Programme in Climate Studies (IDPCS), IIT Bombay, Mumbai, India.
- Centre for Machine Intelligence and Data Science(C‑MInDS), IIT Bombay, Mumbai, India.
- Ashank Desai Centre for Policy Studies, IIT Bombay, Mumbai, India.
- Nebraska Water Center, University of Nebraska, Lincoln, NE, USA.
| |
Collapse
|
7
|
Seggio M, Arcadio F, Radicchi E, Cennamo N, Zeni L, Bossi AM. Toward Nano- and Microplastic Sensors: Identification of Nano- and Microplastic Particles via Artificial Intelligence Combined with a Plasmonic Probe Functionalized with an Estrogen Receptor. ACS OMEGA 2024; 9:18984-18994. [PMID: 38708270 PMCID: PMC11064004 DOI: 10.1021/acsomega.3c09485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/23/2024] [Accepted: 02/28/2024] [Indexed: 05/07/2024]
Abstract
Nano- and microplastic particles are a global and emerging environmental issue that might pose potential threats to human health. The present work exploits artificial intelligence (AI) to identify nano- and microplastics in water by monitoring the interaction of the sample with a sensitive surface. An estrogen receptor (ER) grafted onto a gold surface, realized on a nonexpensive and easy-to-produce plastic optical fiber (POF) platform in order to excite a surface plasmon resonance (SPR) phenomenon, has been developed in order to carry out a "smart" sensitive interface (ER-SPR-POF interface). The ER-SPR-POF interface offers output data useful for exploiting a machine learning-based approach to achieve nano- and microplastic particle sensors. This work developed a proof-of-concept sensor through a training phase carried out by different particles, in terms of materials and size. The experimental results have demonstrated that the proposed "smart" ER-SPR-POF interface combined with AI can be used to identify the kind of particles in terms of the materials (polystyrene; poly(methyl methacrylate)) and size (20 μm; 100 nm) with an accuracy of 90.3%.
Collapse
Affiliation(s)
- Mimimorena Seggio
- Department
of Biotechnology, University of Verona, Strada Le Grazie 15, 37134 Verona, Italy
| | - Francesco Arcadio
- Department
of Engineering, University of Campania Luigi
Vanvitelli, via Roma 29, 81031 Aversa, Italy
| | - Eros Radicchi
- Department
of Biotechnology, University of Verona, Strada Le Grazie 15, 37134 Verona, Italy
| | - Nunzio Cennamo
- Department
of Engineering, University of Campania Luigi
Vanvitelli, via Roma 29, 81031 Aversa, Italy
| | - Luigi Zeni
- Department
of Engineering, University of Campania Luigi
Vanvitelli, via Roma 29, 81031 Aversa, Italy
| | - Alessandra Maria Bossi
- Department
of Biotechnology, University of Verona, Strada Le Grazie 15, 37134 Verona, Italy
| |
Collapse
|
8
|
Apogba JN, Anornu GK, Koon AB, Dekongmen BW, Sunkari ED, Fynn OF, Kpiebaya P. Application of machine learning techniques to predict groundwater quality in the Nabogo Basin, Northern Ghana. Heliyon 2024; 10:e28527. [PMID: 38596013 PMCID: PMC11002067 DOI: 10.1016/j.heliyon.2024.e28527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Revised: 03/20/2024] [Accepted: 03/20/2024] [Indexed: 04/11/2024] Open
Abstract
The main objective of this study was to map the quality of groundwater for domestic use in the Nabogo Basin, a sub-catchment of the White Volta Basin in Ghana, by applying machine learning techniques. The study was conducted by applying the Random Forest (RF) machine learning algorithm to predict groundwater quality, by utilizing factors that influence groundwater occurrence and quality such as Elevation, Topographical Wetness Index (TWI), Slope length (LS), Lithology, Soil type, Normalize Different Vegetation Index (NDVI), Rainfall, Aspect, Slope, Plan Curvature (PLC), Profile Curvature (PRC), Lineament density, Distance to faults, and Drainage density. The groundwater quality of the area was predicted by building a Random Forest model based on computed Arithmetic Water Quality Indices (WQI) (as dependent variable) of existing boreholes, to serve as an indicator of the groundwater quality. The predicted WQI of groundwater in the study area shows that it ranges from 9.51 to 69.99%. This implied that 21.97 %, 74.40 %, and 3.63 % of the study area had respectively the likelihood of excellent. The models were found to perform much better with an RMSE of 23.03 and an R2 value of 0.82. The study conducted highlighted an essential understanding of the groundwater quality in the study area, paving the way for further studies and policy development for groundwater management.
Collapse
Affiliation(s)
- Joseph Nzotiyine Apogba
- Civil Engineering Department-Regional Water and Environmental Sanitation Centre Kumasi, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Geophrey Kwame Anornu
- Civil Engineering Department-Regional Water and Environmental Sanitation Centre Kumasi, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Arthur B. Koon
- Civil Engineering Department-Regional Water and Environmental Sanitation Centre Kumasi, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
- Department of Geology, College of Engineering, University of Liberia, Fendall Campus, Liberia
| | - Benjamin Wullobayi Dekongmen
- Department of Agricultural Engineering, Ho Technical University, Ho, Ghana
- Department of Civil and Environmental Engineering, University of Energy and Natural Resources, Sunyani, Ghana
| | - Emmanuel Daanoba Sunkari
- Department of Geology, Faculty of Science, University of Johannesburg, Auckland Park 2006, Kingsway Campus, P. O. Box 524, Johannesburg, South Africa
- Department of Geological Engineering, Faculty of Geosciences and Environmental Studies, University of Mines and Technology, P.O Box 237, Tarkwa, Ghana
| | - Obed Fiifi Fynn
- Water Research Institute – Council for Scientific and Industrial Research, Ghana
| | - Prosper Kpiebaya
- Department of Agricultural Engineering, School of Engineering, University for Development Studies, P. O. Box TL 1882, Ghana
- Department of Soil Science, Faculty of Agriculture, Food and Consumer Sciences, University for Development Studies, P. O. Box TL 1882, Ghana
| |
Collapse
|
9
|
Gupta S, Gupta SK. Development of AI-based hybrid soft computing models for prediction of critical river water quality indicators. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:27829-27845. [PMID: 38520661 DOI: 10.1007/s11356-024-32984-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 03/15/2024] [Indexed: 03/25/2024]
Abstract
Prediction of river water quality indicators (RWQIs) using artificial intelligence (AI)-based hybrid soft computing modeling techniques could provide essential predictions required for efficient river health planning and management. The study described the development of a novel AI-based relative weighted ensemble (AIRWE) hybrid model for predicting critical RWQIs, i.e., biochemical oxygen demand (BOD) and total coliform (TC). The study involved comprehensive water quality (WQ) monitoring from 30 locations along the Damodar River to establish the baseline data and delineate the WQ. The representative input features showing a strong association with BOD and TC were identified using Spearman's rank-coupled orthogonal linear transformation (SOT). The relative weighted ensemble (RWE) method was applied to determine the relative weights for base learners in the AIRWE model. The statistical analysis of the developed model revealed that it was most efficient and accurate for predicting BOD (R2, 0.97; RMSE, 0.06; MAE, 0.04) and TC (R2, 0.98; RMSE, 0.06; MAE, 0.05) over the traditional techniques. The tstat (BOD 0.02 and TC 0.47) was lesser than tcrit (1.672), confirming its unbiased predictions. The SOT technique removed the data noise and multicollinearity, whereas RWE curtailed the individual model's limitations and predicted more reliable results. The model resulted 97% accuracy with high precision (96%) in classifying the river water quality for various end uses. The study describes a novel approach for researchers, scientists, and decision-makers for modeling and predicting various environmental attributes.
Collapse
Affiliation(s)
- Suyog Gupta
- Indian Institute of Technology (Indian School of Mines), Dhanbad, 826004, Jharkhand, India
- Harcourt Butler Technical University, Kanpur, 208002, Uttar Pradesh, India
| | - Sunil Kumar Gupta
- Indian Institute of Technology (Indian School of Mines), Dhanbad, 826004, Jharkhand, India.
| |
Collapse
|
10
|
Essamlali I, Nhaila H, El Khaili M. Advances in machine learning and IoT for water quality monitoring: A comprehensive review. Heliyon 2024; 10:e27920. [PMID: 38533055 PMCID: PMC10963334 DOI: 10.1016/j.heliyon.2024.e27920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 02/22/2024] [Accepted: 03/08/2024] [Indexed: 03/28/2024] Open
Abstract
Water holds great significance as a vital resource in our everyday lives, highlighting the important to continuously monitor its quality to ensure its usability. The advent of the. The Internet of Things (IoT) has brought about a revolutionary shift by enabling real-time data collection from diverse sources, thereby facilitating efficient monitoring of water quality (WQ). By employing Machine learning (ML) techniques, this gathered data can be analyzed to make accurate predictions regarding water quality. These predictive insights play a crucial role in decision-making processes aimed at safeguarding water quality, such as identifying areas in need of immediate attention and implementing preventive measures to avert contamination. This paper aims to provide a comprehensive review of the current state of the art in water quality monitoring, with a specific focus on the employment of IoT wireless technologies and ML techniques. The study examines the utilization of a range of IoT wireless technologies, including Low-Power Wide Area Networks (LpWAN), Wi-Fi, Zigbee, Radio Frequency Identification (RFID), cellular networks, and Bluetooth, in the context of monitoring water quality. Furthermore, it explores the application of both supervised and unsupervised ML algorithms for analyzing and interpreting the collected data. In addition to discussing the current state of the art, this survey also addresses the challenges and open research questions involved in integrating IoT wireless technologies and ML for water quality monitoring (WQM).
Collapse
Affiliation(s)
- Ismail Essamlali
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| | - Hasna Nhaila
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| | - Mohamed El Khaili
- Electrical Engineering and Intelligent Systems Laboratory, ENSET Mohammedia, Hassan 2nd University of Casablanca, Mail Box 159, Morocco
| |
Collapse
|
11
|
Sang C, Tan L, Cai Q, Ye L. Long-term (2003-2021) evolution trend of water quality in the Three Gorges Reservoir: An evaluation based on an enhanced water quality index. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 915:169819. [PMID: 38190913 DOI: 10.1016/j.scitotenv.2023.169819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 12/11/2023] [Accepted: 12/29/2023] [Indexed: 01/10/2024]
Abstract
The degradation of water quality induced by the construction of large-scale hydraulic projects is one of the primary public concerns; however, it is rarely addressed with long-term field observation data. Here, we reported the long-term (2003-2021) trends, seasonal patterns, and overall condition of water quality of the Three Gorges Reservoir (TGR) with an enhanced water quality index (WQI). Specifically, to emphasize the importance of the biological role in water quality assessment, chlorophyll-a (Chla) was incorporated into WQI, and then a novel workflow using machine learning approach based on Random Forest (RF) model was constructed to develop a minimal water quality index (WQImin). The enhanced WQI indicated an overall "good" water quality condition, exhibiting a gradually improving trend subsequent to the reservoir impoundment in 2003. Meanwhile, the assessment revealed that the water quality has discernible seasonal patterns, characterized by poorer conditions in the spring and summer seasons. Furthermore, the RF model identified Chla, dissolved oxygen (DO), ammonium nitrogen (NH4-N), water temperature (WT), pH, and total nitrogen (TN) as key parameters for the WQImin, with Chla emerging as the most important factor in determining WQImin in our study. Moreover, weighted WQImin models exhibited improved performance in estimating WQI. Our study emphasizes the importance of biological parameters in water quality assessment, and introduces a systematic workflow to facilitate the development of WQImin for accurate and cost-efficient water quality assessment. Furthermore, our study makes a substantial contribution to the advancement of knowledge regarding long-term trends and seasonal patterns in water quality of large reservoirs, which provides a foundational basis for guiding water quality management practices for reservoirs worldwide.
Collapse
Affiliation(s)
- Chong Sang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China; University of Chinese Academy of Sciences, Beijing, China
| | - Lu Tan
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Qinghua Cai
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
| | - Lin Ye
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China.
| |
Collapse
|
12
|
Das CR, Das S. Coastal groundwater quality prediction using objective-weighted WQI and machine learning approach. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:19439-19457. [PMID: 38355860 DOI: 10.1007/s11356-024-32415-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 02/07/2024] [Indexed: 02/16/2024]
Abstract
The water quality index (WQI) is a globally accepted guideline to indicate the water quality standard of any groundwater resource. Water levels in existing groundwater sources are declining in several coastal zones. Therefore, for monitoring water quality and improving water management, the prediction and identification of groundwater status by an effective technique with higher accuracy is urgently needed. Therefore, this research aims to find an effective model for WQI prediction by comparing entropy and critic weight-based WQI (ENW-WQI and CRITIC-WQI) with multi-layer perceptron artificial neural network (MLP-ANN) technique and also to identify contaminated zones using GIS. Initially, 1000 water sampling datasets with concentrations of several water quality parameters of different coastal blocks of eastern India during 2018 to 2022 are considered for the estimation of ENW-WQI and CRITIC-WQI. It shows 65% and 67% of the samples are excellent to good for drinking. ENW-WQI and CRITIC-WQI-based MLP-ANN models have been established considering different data portioning and hidden neuron numbers. Input variables and appropriate dataset partitioning with hidden neurons for models obtained from correlation and trial-error analysis. Spatial distribution maps are also produced for calculated WQIs using inverse distance weighted interpolation approaches. Three fitting models are obtained: ENW-WQI-MLP-ANN, CRITIC-WQI-MLP-ANN-I and CRITIC-WQI-MLP-ANN-II. CRITIC-WQI-MLP-ANN-II model (data ratio 85:15, network structure 6-12-1, R2 = 0.986, NSE = 0.98, and error rate 0.49%) provides the best accuracy in WQI prediction. The GIS-based WQI maps record several areas related to drinking water quality. The results of this research can help in planning the provision of safe drinking water in the future.
Collapse
Affiliation(s)
- Chinmoy Ranjan Das
- School of Water Resources Engineering, Jadavpur University, Kolkata, India
- Civil Engineering Department, Global Institute of Science & Technology, Purba Medinipur 721657, Haldia, West Bengal, India
| | - Subhasish Das
- School of Water Resources Engineering, Jadavpur University, Kolkata, India.
| |
Collapse
|
13
|
Mohseni U, Pande CB, Chandra Pal S, Alshehri F. Prediction of weighted arithmetic water quality index for urban water quality using ensemble machine learning model. CHEMOSPHERE 2024; 352:141393. [PMID: 38325619 DOI: 10.1016/j.chemosphere.2024.141393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/29/2024] [Accepted: 02/04/2024] [Indexed: 02/09/2024]
Abstract
Urban water quality index (WQI) is an important factor for assessment quality of groundwater in the urban and rural area. In this research, the Weighted Arithmetic Water Quality Index (WA-WQI) was estimated for understanding the groundwater quality. Four machine learning (ML) models were developed including artificial neural network (ANN), support vector machine (SVM), random forest (RF), and extreme gradient boosting (XG-Boost) in addition to multiple linear regression (MLR) for WA-WQI prediction at the Ujjain city of Madhya Pradesh in India. Groundwater quality samples were collected from 54 wards under the urban area, the main eight different physiochemical parameters were selected for WA-WQI prediction. The different input parameters data were analysed and calculated for the relationships of their ability to predict the results of WA-WQI. The ML models performance were calculated using three statistical metrics such as determination coefficient (R2), mean absolute error (MAE), and root mean square error (RMSE). In this research shown the XG-Boost model is better results other than other ML models. The best modelling results over the training phase revealed R2 = 0.969, RMSE = 2.169, MAE = 2.013 and over the testing phase R2 = 0.987, RMSE = 3.273, MAE = 2.727). All the ML models results were validated using receiver operating characteristic (ROC) curve for the best models selection. The results of best model area under curve (AUC) was 0.9048. Hence, XG-Boost model was given the accurate prediction of WA-WQI in the urban area. Based on the graphical presentation evaluation, XG-Boost model showed similar results of superiority. The obtained modelling results emphasis the utility of computer aid models for better planning and essential information for decision-makers, and water experts. The implement agency can adopt the procedures of water quality to decrease pollution and safe and healthy water provide to entire Ujjain city.
Collapse
Affiliation(s)
- Usman Mohseni
- Civil Engineering Department, Indian Institute of Technology Roorkee, Roorkee, 247667, Uttarakhand, India
| | - Chaitanya B Pande
- Indian Institute of Tropical Meteorology, Pune, 411008, Maharashtra, India; New Era and Development in Civil Engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq; Abdullah Alrushaid Chair for Earth Science Remote Sensing Research, Geology and Geophysics Department, College of Science, King Saud University, Riyadh, 11451, Saudi Arabia.
| | - Subodh Chandra Pal
- Department of Geography, The University of Burdwan, Purba Bardhaman, West Bengal, India
| | - Fahad Alshehri
- Abdullah Alrushaid Chair for Earth Science Remote Sensing Research, Geology and Geophysics Department, College of Science, King Saud University, Riyadh, 11451, Saudi Arabia
| |
Collapse
|
14
|
Singh RB, Patra KC, Pradhan B, Samantra A. HDTO-DeepAR: A novel hybrid approach to forecast surface water quality indicators. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 352:120091. [PMID: 38228048 DOI: 10.1016/j.jenvman.2024.120091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/23/2023] [Accepted: 01/08/2024] [Indexed: 01/18/2024]
Abstract
Water is a vital resource supporting a broad spectrum of ecosystems and human activities. The quality of river water has declined in recent years due to the discharge of hazardous materials and toxins. Deep learning and machine learning have gained significant attention for analysing time-series data. However, these methods often suffer from high complexity and significant forecasting errors, primarily due to non-linear datasets and hyperparameter settings. To address these challenges, we have developed an innovative HDTO-DeepAR approach for predicting water quality indicators. This proposed approach is compared with standalone algorithms, including DeepAR, BiLSTM, GRU and XGBoost, using performance metrics such as MAE, MSE, MAPE, and NSE. The NSE of the hybrid approach ranges between 0.8 to 0.96. Given the value's proximity to 1, the model appears to be efficient. The PICP values (ranging from 95% to 98%) indicate that the model is highly reliable in forecasting water quality indicators. Experimental results reveal a close resemblance between the model's predictions and actual values, providing valuable insights for predicting future trends. The comparative study shows that the suggested model surpasses all existing, well-known models.
Collapse
Affiliation(s)
- Rosysmita Bikram Singh
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Kanhu Charan Patra
- Department of Civil Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information System, School of Civil and Environmental Engineering, University of Technology Sydney, Sydney, Australia; Institute of Climate Change, Universiti Kebangsaan Malaysia, Bangi, Malaysia.
| | - Avinash Samantra
- Department of Computer Science & Engineering, National Institute of Technology, Rourkela, 769008, Odisha, India.
| |
Collapse
|
15
|
Elzain HE, Abdalla O, A Ahmed H, Kacimov A, Al-Maktoumi A, Al-Higgi K, Abdallah M, Yassin MA, Senapathi V. An innovative approach for predicting groundwater TDS using optimized ensemble machine learning algorithms at two levels of modeling strategy. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 351:119896. [PMID: 38171121 DOI: 10.1016/j.jenvman.2023.119896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 12/12/2023] [Accepted: 12/19/2023] [Indexed: 01/05/2024]
Abstract
Groundwater salinization in coastal aquifers is a major socioeconomic challenge in Oman and many other regions worldwide due to several anthropogenic activities and natural drivers. Therefore, assessing the salinization of groundwater resources is crucial to ensure the protection of water resources and sustainable management. The aim of this study is to apply a novel approach using predictive optimized ensemble trees-based (ETB) machine learning models, namely Catboost regression (CBR), Extra trees regression (ETR), and Bagging regression (BA), at two levels of modeling strategy for predicting groundwater TDS as an indicator for seawater intrusion in a coastal aquifer, Oman. At level 1, ETR and CBR models were used as base models or inputs for BA in level 2. The results show that the models at level 1 (i.e., ETR and CBR) yielded satisfactory results using a limited number of inputs (Cl, K, and Sr) from a few sets of 40 groundwater wells. The BA model at level 2 improved the overall performance of the modeling by extracting more information from ETR and CBR models at level 1 models. At level 2, the BA model achieved a significant improvement in accuracy (MSE = 0.0002, RSR = 0.062, R2 = 0.995 and NSE = 0.996) compared to each individual model of ETR (MSE = 0.0007, RSR = 0.245, R2 = 0.98 and NSE = 0.94), and CBR (MSE = 0.0035, RSR = 0.258, R2 = 0.933 and NSE = 0.934) at level 1 models in the testing dataset. BA model at level 2 outperformed all models regarding predictive accuracy, best generalization of new data, and matching the locations of the polluted and unpolluted wells. Our approach predicts groundwater TDS with high accuracy and thus provides early warnings of water quality deterioration along coastal aquifers which will improve water resources sustainability.
Collapse
Affiliation(s)
- Hussam Eldin Elzain
- Water Research Center, Sultan Qaboos University, P.O. 50, Al Khoudh 123, Oman.
| | - Osman Abdalla
- Department of Earth Sciences, College of Science, Sultan Qaboos University, P.O. 36, Al Khoudh 123, Oman.
| | - Hamdi A Ahmed
- Department of Industrial and Data Engineering, Pukyong National University, Busan, 48513, South Korea.
| | - Anvar Kacimov
- Department of Soils, Water and Agricultural Engineering, Sultan Qaboos University, P.O. 34, Al Khoudh 123, Oman.
| | - Ali Al-Maktoumi
- Water Research Center, Sultan Qaboos University, P.O. 50, Al Khoudh 123, Oman; Department of Soils, Water and Agricultural Engineering, Sultan Qaboos University, P.O. 34, Al Khoudh 123, Oman.
| | - Khalifa Al-Higgi
- Department of Earth Sciences, College of Science, Sultan Qaboos University, P.O. 36, Al Khoudh 123, Oman.
| | - Mohammed Abdallah
- College of Hydrology and Water Resources, Hohai University, Nanjing, Jiangsu, 210024, China.
| | - Mohamed A Yassin
- Interdisciplinary Research Centre for Membranes and Water Security, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia.
| | | |
Collapse
|
16
|
Chen J, Li H, Felix M, Chen Y, Zheng K. >Water quality prediction of artificial intelligence model: a case of Huaihe River Basin, China. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:14610-14640. [PMID: 38273086 DOI: 10.1007/s11356-024-32061-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 01/15/2024] [Indexed: 01/27/2024]
Abstract
Accurate prediction of water quality contributes to the intelligent management of water resources. Water quality indices have time series characteristics and nonlinearity, but the existing models only focus on the forward time series when long short-term memory (LSTM) is introduced and do not consider the parallel computation on the model. Owing to this, a new neural network called LSTM-multihead attention (LMA) was constructed to predict water quality, using long short-term memory to process time series data and multihead attention for parallel computing and extracting feature information. Additionally, water quality indices have the issues of multiple data types and complex data correlations, as well as missing data and abnormal data problems in water quality data. In order to solve these problems, this study proposes a water quality prediction model called GRA-LMA-based linear interpolation, gray relational analysis and LMA. Two experiments are carried out to verify the predictive performance of the GRA-LMA with the water quality data of the Huaihe River Basin as a case study sample. The first experiment focuses on data processing, including the processing of missing data and abnormal data of water quality data, and the correlation analysis of water quality indices. Linear interpolation is adapted to process the missing data, while a combination of boxplot and histogram is adopted to analyze and eliminate the abnormal data, which is then repaired the abnormal data with linear interpolation. The gray relational analysis is adopted to calculate the correlation between different water quality indices, and water quality indices with high correlation are retained to determine the input variables of the water quality prediction model. The data processing results demonstrate that repairs can be made using linear interpolation without altering the pattern of data change and the model by using the gray relational analysis to reduce the quantity of data it needs as input. In the second experiment, the predictive capacity of GRA-LMA and existing models such as backpropagation neural network (BP), recurrent neural network (RNN), long short-term memory (LSTM), and gate recurrent unit (GRU) was evaluated and compared using different numerical and graphical performance evaluation metrics. Comparative experimental results show that the mean square error of pH, dissolved oxygen, chemical oxygen demand, ammonia nitrogen, electrical conductivity, turbidity, total phosphorus, and total nitrogen of GRA-LMA is reduced to 0.05890, 0.40196, 0.32454, 0.04368, 14.71003, 8.13252, 0.01558, and 0.14345. The results indicate that GRA-LMA has superior adaptability for predicting various water quality indices and can significantly lower the induced prediction error.
Collapse
Affiliation(s)
- Jing Chen
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3BX, UK
| | - Haiyang Li
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China.
| | - Manirankunda Felix
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
| | - Yudi Chen
- Faculty of Science and Engineering, University of Manchester, Oxford RD, Manchester, M139PL, UK
| | - Keqiang Zheng
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
| |
Collapse
|
17
|
Talukdar S, Shahfahad, Bera S, Naikoo MW, Ramana GV, Mallik S, Kumar PA, Rahman A. Optimisation and interpretation of machine and deep learning models for improved water quality management in Lake Loktak. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 351:119866. [PMID: 38147770 DOI: 10.1016/j.jenvman.2023.119866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/28/2023] [Accepted: 12/13/2023] [Indexed: 12/28/2023]
Abstract
Loktak Lake, one of the largest freshwater lakes in Manipur, India, is critical for the eco-hydrology and economy of the region, but faces deteriorating water quality due to urbanisation, anthropogenic activities, and domestic sewage. Addressing the urgent need for effective pollution management, this study aims to assess the lake's water quality status using the water quality index (WQI) and develop advanced machine learning (ML) tools for WQI assessment and ML model interpretation to improve pollution management decision making. The WQI was assessed using entropy-based weighting arithmetic and three ML models - Gradient Boosting Machine (GBM), Random Forest (RF) and Deep Neural Network (DNN) - were optimised using a grid search algorithm in the H2O Application Programming Interface (API). These models were validated by various metrics and interpreted globally and locally via Partial Dependency Plot (PDP), Accumulated Local Effect (ALE) and SHapley Additive exPlanations (SHAP). The results show a WQI range of 72.38-100, with 52.7% of samples categorised as very poor. The RF model outperformed GBM and DNN and showed the highest accuracy and generalisation ability, which is reflected in the superior R2 values (0.97 in training, 0.9 in test) and the lower root mean square error (RMSE). RF's minimal margin of error and reliable feature interpretation contrasted with DNN's larger margin of error and inconsistency, which affected its usefulness for decision making. Turbidity was found to be a critical predictive feature in all models, significantly influencing WQI, with other variables such as pH and temperature also playing an important role. SHAP dependency plots illustrated the direct relationship between key water quality parameters such as turbidity and WQI predictions. The novelty of this study lies in its comprehensive approach to the evaluation and interpretation of ML models for WQI estimation, which provides a nuanced understanding of water quality dynamics in Loktak Lake. By identifying the most effective ML models and key predictive functions, this study provides invaluable insights for water quality management and paves the way for targeted strategies to monitor and improve water quality in this vital freshwater ecosystem.
Collapse
Affiliation(s)
- Swapan Talukdar
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, 110025, India.
| | - Shahfahad
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, 110025, India.
| | - Somnath Bera
- Department of Geography, Central University of South Bihar, Gaya, Bihar, 823001, India.
| | - Mohd Waseem Naikoo
- Department of Geography & Disaster Management, University of Kashmir, Srinagar, Jammu & Kashmir, 190006, India.
| | - G V Ramana
- Department of Civil Engineering, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, 110016, India.
| | - Santanu Mallik
- Department of Civil Engineering, National Institution of Technology, Agaratala, Tripura, 799046, India.
| | - Potsangbam Albino Kumar
- Department of Civil Engineering, National Institution of Technology, Imphal, Manipur, 795004, India.
| | - Atiqur Rahman
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi, 110025, India.
| |
Collapse
|
18
|
Uddin MG, Nash S, Rahman A, Dabrowski T, Olbert AI. Data-driven modelling for assessing trophic status in marine ecosystems using machine learning approaches. ENVIRONMENTAL RESEARCH 2024; 242:117755. [PMID: 38008200 DOI: 10.1016/j.envres.2023.117755] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 10/05/2023] [Accepted: 11/20/2023] [Indexed: 11/28/2023]
Abstract
Assessing eutrophication in coastal and transitional waters is of utmost importance, yet existing Trophic Status Index (TSI) models face challenges like multicollinearity, data redundancy, inappropriate aggregation methods, and complex classification schemes. To tackle these issues, we developed a novel tool that harnesses machine learning (ML) and artificial intelligence (AI), enhancing the reliability and accuracy of trophic status assessments. Our research introduces an improved data-driven methodology specifically tailored for transitional and coastal (TrC) waters, with a focus on Cork Harbour, Ireland, as a case study. Our innovative approach, named the Assessment Trophic Status Index (ATSI) model, comprises three main components: the selection of pertinent water quality indicators, the computation of ATSI scores, and the implementation of a new classification scheme. To optimize input data and minimize redundancy, we employed ML techniques, including advanced deep learning methods. Specifically, we developed a CHL prediction model utilizing ten algorithms, among which XGBoost demonstrated exceptional performance, showcasing minimal errors during both training (RMSE = 0.0, MSE = 0.0, MAE = 0.01) and testing (RMSE = 0.0, MSE = 0.0, MAE = 0.01) phases. Utilizing a novel linear rescaling interpolation function, we calculated ATSI scores and evaluated the model's sensitivity and efficiency across diverse application domains, employing metrics such as R2, the Nash-Sutcliffe efficiency (NSE), and the model efficiency factor (MEF). The results consistently revealed heightened sensitivity and efficiency across all application domains. Additionally, we introduced a brand new classification scheme for ranking the trophic status of transitional and coastal waters. To assess spatial sensitivity, we applied the ATSI model to four distinct waterbodies in Ireland, comparing trophic assessment outcomes with the Assessment of Trophic Status of Estuaries and Bays in Ireland (ATSEBI) System. Remarkably, significant disparities between the ATSI and ATSEBI System were evident in all domains, except for Mulroy Bay. Overall, our research significantly enhances the accuracy of trophic status assessments in marine ecosystems. The ATSI model, combined with cutting-edge ML techniques and our new classification scheme, represents a promising avenue for evaluating and monitoring trophic conditions in TrC waters. The study also demonstrated the effectiveness of ATSI in assessing trophic status across various waterbodies, including lakes, rivers, and more. These findings make substantial contributions to the field of marine ecosystem management and conservation.
Collapse
Affiliation(s)
- Md Galal Uddin
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland.
| | - Stephen Nash
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | | | - Agnieszka I Olbert
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland
| |
Collapse
|
19
|
Özsezer G, Mermer G. Prediction of drinking water quality with machine learning models: A public health nursing approach. Public Health Nurs 2024; 41:175-191. [PMID: 37997522 DOI: 10.1111/phn.13264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 11/09/2023] [Accepted: 11/11/2023] [Indexed: 11/25/2023]
Abstract
OBJECTIVE The aim of this study is to use machine learning models to predict drinking water quality from a public health nursing approach. DESIGN Machine learning study. SAMPLE "Water Quality Dataset" was used in the study. The dataset contains physical and chemical measurements of water quality for 2400 different water bodies. The process consists of four stages: Data processing with Synthetic Minority Oversampling Technique, hyperparameter tuning with 10-fold cross-validation, modeling and comparative analysis. 80% of the dataset is allocated as training data and 20% as test data. ML models logistic regression, K-nearest neighbor, support vector machine, random forest, XGBoost, AdaBoost Classifier, Decision Tree algorithms were used for water quality prediction. Accuracy, precision, recall, F1 score and AUC performance metrics of ML models were compared. To evaluate the performance of the models, 10-fold cross-validation was used and a comparative analysis was performed. The p-values of the models were also compared. RESULTS N this study, where drinking water quality was predicted with seven different ML algorithms, it can be said that XGBoost and Random Forest are the best classification models in all performance metrics. There is a significant difference in all ML algorithms according to the p-value. The H0 hypothesis is accepted for these algorithms. According to the H0 hypothesis, there is no difference between actual values and predicted values. CONCLUSION In conclusion, the use of ML models in the prediction of drinking water quality can help nurses greatly improve access to clean water, a human right, be more knowledgeable about water quality, and protect the health of individuals.
Collapse
Affiliation(s)
- Gözde Özsezer
- Çanakkale Onsekiz Mart University Faculty of Health Sciences Department of Public Health Nursing, Çanakkale, Turkey
- Ege University Health Sciences Institute, İzmir, Turkey
| | - Gülengül Mermer
- Ege University Faculty of Nursing Department of Public Health Nursing, İzmir, Turkey
| |
Collapse
|
20
|
Zamani MG, Nikoo MR, Jahanshahi S, Barzegar R, Meydani A. Forecasting water quality variable using deep learning and weighted averaging ensemble models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:124316-124340. [PMID: 37996598 DOI: 10.1007/s11356-023-30774-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 10/27/2023] [Indexed: 11/25/2023]
Abstract
Water quality variables, including chlorophyll-a (Chl-a), play a pivotal role in comprehending and evaluating the condition of aquatic ecosystems. Chl-a, a pigment present in diverse aquatic organisms, notably algae and cyanobacteria, serves as a valuable indicator of water quality. Thus, the objectives of this study encompass: (1) the assessment of the predictive capabilities of four deep learning (DL) models - namely, recurrent neural network (RNN), long short-term memory (LSTM), gated recurrence unit (GRU), and temporal convolutional network (TCN) - in forecasting Chl-a concentrations; (2) the incorporation of these DL models into ensemble models (EMs) employing genetic algorithm (GA) and non-dominated sorting genetic algorithm (NSGA-II) to harness the strengths of each standalone model; and (3) the evaluation of the efficacy of the developed EMs. Utilizing data collected at 15-min intervals from Small Prespa Lake (SPL) in Greece, the models employed hourly Chl-a concentration lag times, extending up to 6 h, as models' inputs to forecast Chla (t+1). The proposed models underwent training on 70% of the dataset and were subsequently validated on the remaining 30%. Among the standalone DL models, the GRU model exhibited superior performance in Chl-a forecasting, surpassing the RNN, LSTM, and TCN models by 8%, 2%, and 2%, respectively. Furthermore, the integration of DL models through single-objective GA and multi-objective NSGA-II optimization algorithms yielded hybrid models adept at effectively forecasting both low and high Chl-a concentrations. The ensemble model based on NSGA-II outperformed standalone DL models as well as the GA-based model across a range of evaluation indices. For instance, considering the R-squared metric, the study's findings demonstrated that the EM-NSGA-II stands out with exceptional effectiveness compared to DL and EM-GA models, showcasing improvements of 14% (RNN), 8% (LSTM), 6% (GRU), 8% (TCN), and 3% (EM-GA) during the testing phase.
Collapse
Affiliation(s)
- Mohammad G Zamani
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran
| | - Mohammad Reza Nikoo
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Sina Jahanshahi
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, University of Tehran, Tehran, Iran
| | - Rahim Barzegar
- Groundwater Research Group (GRES), Research Institute on Mines and Environment (RIME), Université du Québec en Abitibi-Témiscamingue (UQAT), Amos, Québec, Canada
| | - Amirreza Meydani
- Department of Geography and Spatial Sciences, University of Delaware, Newark, DE, USA
| |
Collapse
|
21
|
Uddin MG, Jackson A, Nash S, Rahman A, Olbert AI. Comparison between the WFD approaches and newly developed water quality model for monitoring transitional and coastal water quality in Northern Ireland. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 901:165960. [PMID: 37541496 DOI: 10.1016/j.scitotenv.2023.165960] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 07/04/2023] [Accepted: 07/30/2023] [Indexed: 08/06/2023]
Abstract
This study aims to evaluate existing approaches for monitoring and assessing water quality in waterbodies in the North of Ireland using newly developed methodologies. The results reveal significant differences between the new technique and the existing "one-out, all-out" approach in rating water quality. The new approach found the water quality status to be "good," "fair," and "marginal," whereas the existing "one-out, all-out" technique classified water quality as "good," and "moderate," respectively. The new technique outperformed existing approaches in rating the water quality of different waterbody types, with high R2 = 1, NSE = 0.99, and MEF = 0 values. Furthermore, the final assessment of water quality using the new methodologies had the lowest uncertainty (<1 %), whereas the efficiency measures (NSE and MEF) indicate that the new approaches are bias-free to assess water quality at any geographic scale. The results of this study reveal that the newly proposed methodologies are effective in assessing the water quality states of transitional and coastal waterbodies in the North of Ireland. The study also highlighted the limitations of existing approaches and the importance of updating water resource management systems for better protection of these waterbodies. The findings have significant implications for water resource management and planning in the North of Ireland and other similar regions.
Collapse
Affiliation(s)
- Md Galal Uddin
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco HydroInformatics Research Group (EHIRG), School of Engineering, College of Science and Engineering, University of Galway, Ireland.
| | - Aoife Jackson
- College of Science and Engineering, Natural Sciences, University of Galway, Ireland
| | - Stephen Nash
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | - Agnieszka I Olbert
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco HydroInformatics Research Group (EHIRG), School of Engineering, College of Science and Engineering, University of Galway, Ireland
| |
Collapse
|
22
|
Uddin MG, Diganta MTM, Sajib AM, Rahman A, Nash S, Dabrowski T, Ahmadian R, Hartnett M, Olbert AI. Assessing the impact of COVID-19 lockdown on surface water quality in Ireland using advanced Irish water quality index (IEWQI) model. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 336:122456. [PMID: 37673321 DOI: 10.1016/j.envpol.2023.122456] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 07/23/2023] [Accepted: 08/23/2023] [Indexed: 09/08/2023]
Abstract
The COVID-19 pandemic has significantly impacted various aspects of life, including environmental conditions. Surface water quality (WQ) is one area affected by lockdowns imposed to control the virus's spread. Numerous recent studies have revealed the considerable impact of COVID-19 lockdowns on surface WQ. In response, this research aimed to assess the impact of COVID-19 lockdowns on surface water quality in Ireland using an advanced WQ model. To achieve this goal, six years of water quality monitoring data from 2017 to 2022 were collected for nine water quality indicators in Cork Harbour, Ireland, before, during, and after the lockdowns. These indicators include pH, water temperature (TEMP), salinity (SAL), biological oxygen demand (BOD5), dissolved oxygen (DOX), transparency (TRAN), and three nutrient enrichment indicators-dissolved inorganic nitrogen (DIN), molybdate reactive phosphorus (MRP), and total oxidized nitrogen (TON). The results showed that the lockdown had a significant impact on various WQ indicators, particularly pH, TEMP, TON, and BOD5. Over the study period, most indicators were within the permissible limit except for MRP, with the exception of during COVID-19. During the pandemic, TON and DIN decreased, while water transparency significantly improved. In contrast, after COVID-19, WQ at 7% of monitoring sites significantly deteriorated. Overall, WQ in Cork Harbour was categorized as "good," "fair," and "marginal" classes over the study period. Compared to temporal variation, WQ improved at 17% of monitoring sites during the lockdown period in Cork Harbour. However, no significant trend in WQ was observed. Furthermore, the study analyzed the advanced model's performance in assessing the impact of COVID-19 on WQ. The results indicate that the advanced WQ model could be an effective tool for monitoring and evaluating lockdowns' impact on surface water quality. The model can provide valuable information for decision-making and planning to protect aquatic ecosystems.
Collapse
Affiliation(s)
- Md Galal Uddin
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland.
| | - Mir Talas Mahammad Diganta
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland
| | - Abdul Majed Sajib
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | - Stephen Nash
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| | | | - Reza Ahmadian
- School of Engineering, Cardiff University, The Parade, Cardiff, CF24 3AQ, UK
| | - Michael Hartnett
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland
| | - Agnieszka I Olbert
- School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland
| |
Collapse
|
23
|
Cojbasic S, Dmitrasinovic S, Kostic M, Turk Sekulic M, Radonic J, Dodig A, Stojkovic M. Application of machine learning in river water quality management: a review. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 88:2297-2308. [PMID: 37966184 PMCID: wst_2023_331 DOI: 10.2166/wst.2023.331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2023]
Abstract
Machine learning (ML), a branch of artificial intelligence (AI), has been increasingly used in environmental engineering due to the ability to analyze complex nonlinear problems (such as ones connected with water quality management) through a data-driven approach. This study provides an overview of different ML algorithms applied for monitoring and predicting river water quality. Different parameters could be monitored or predicted, such as dissolved oxygen (DO), biological and chemical oxygen demand (BOD and COD), turbidity levels, the concentration of different ions (such as Mg2+ and Ca2+), heavy metal or other pollutant's concentration, pH, temperature, and many more. Although many algorithms have been investigated for the prediction of river water quality, there are several which are most commonly used in engineering practice. These models mostly include so-called supervised learning algorithms, such as artificial neural network (ANN), support vector machine (SVM), random forest (RF), decision tree (DT), and deep learning (DL). To further enhance prediction power, novel hybrid algorithms, could be used. However, the quality of prediction is not only dependent on the applied algorithm but also on the availability of previously mentioned water quality parameters, their selection, and the combination of input data used to train the ML model.
Collapse
Affiliation(s)
- Sanja Cojbasic
- Faculty of Technical Sciences, Department of Environmental Engineering and Occupational Safety and Health, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia E-mail:
| | - Sonja Dmitrasinovic
- Faculty of Technical Sciences, Department of Environmental Engineering and Occupational Safety and Health, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
| | - Marija Kostic
- Faculty of Technical Sciences, Department of Environmental Engineering and Occupational Safety and Health, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
| | - Maja Turk Sekulic
- Faculty of Technical Sciences, Department of Environmental Engineering and Occupational Safety and Health, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
| | - Jelena Radonic
- Faculty of Technical Sciences, Department of Environmental Engineering and Occupational Safety and Health, University of Novi Sad, Trg Dositeja Obradovica 6, 21000 Novi Sad, Serbia
| | - Ana Dodig
- Institute for Artificial Intelligence R&D of Serbia, Fruskogorska 1, Novi Sad, Serbia
| | - Milan Stojkovic
- Institute for Artificial Intelligence R&D of Serbia, Fruskogorska 1, Novi Sad, Serbia
| |
Collapse
|
24
|
Pathakamuri PC, Villuri VGK, Pasupuleti S, Banerjee A, Venkatesh AS. A holistic approach for understanding the status of water quality and causes of its deterioration in a drought-prone agricultural area of Southeastern India. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:116765-116780. [PMID: 36114973 DOI: 10.1007/s11356-022-22906-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 09/03/2022] [Indexed: 06/15/2023]
Abstract
This study investigates the groundwater quality in the Kadiri Basin, Ananthapuramu district of Andhra Pradesh, India. Groundwater samples from 77 locations were collected and tested for the concentration of various physicochemical parameters. The collected data were assimilated in the form of a groundwater quality index to estimate groundwater quality (drinking and irrigation) using an information entropy-based weight determination approach (EWQI). The water quality maps obtained from the study area suggest a definite trend in groundwater contamination of the study area. Furthermore, the influence of different physicochemical parameters on groundwater quality was determined using machine learning techniques. Learning and prediction accuracies of four different techniques, namely artificial neural network (ANN), deep learning (DL), random forest (RF), and gradient boosting machine (GBM), were investigated. The performance of the ANN model (MEA = 11.23, RSME = 21.22, MAPE = 7.48, and R2 = 0.91) was found to be highly effective for the present dataset. The ANN model was then used to understand the relative influence of physicochemical parameters on groundwater quality. It was observed that the deterioration in groundwater quality in the study area was primarily due to the excess concentration of turbidity and iron values. The relatively higher concentration of sulfate and nitrate had caused a significant impact on the groundwater quality. The study has wider implications for modeling in similar drought-prone agricultural areas elsewhere for assessing the groundwater quality.
Collapse
Affiliation(s)
- Prabhakara Chowdary Pathakamuri
- Department of Mining Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India
| | - Vasanta Govind Kumar Villuri
- Department of Mining Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India
| | - Srinivas Pasupuleti
- Department of Civil Engineering, Indian Institute of Technology (Indian School of Mines), Dhanbad, Dhanbad, 826004, Jharkhand, India.
| | - Ashes Banerjee
- Department of Civil Engineering, Alliance University, Bangalore, 562106, Karnataka, India
| | - Akella Satya Venkatesh
- Department of Applied Geology, Indian Institute of Technology (Indian School of Mines)), Dhanbad, 826004, Jharkhand, India
| |
Collapse
|
25
|
Naskar AK, Akhter J, Gazi M, Mondal M, Deb A. Impact of meteorological parameters on soil radon at Kolkata, India: investigation using machine learning techniques. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:105374-105386. [PMID: 37710069 DOI: 10.1007/s11356-023-29769-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 09/04/2023] [Indexed: 09/16/2023]
Abstract
The daily soil radon activity has been measured continuously over a year with BARASOL BMC2 probe at a measuring site of Jadavpur University Campus in Kolkata, India. The dependency of soil radon activity with different atmospheric parameters such as soil temperature, soil pressure, humidity, air temperature, and rainfall has been also analyzed. The whole study period is divided in four seasons as proposed by the Indian Meteorological Department (IMD). Minimum soil radon level has been observed during the winter season (December-February). On the other hand, higher soil radon level has been observed both for summer and monsoon. Except soil pressure, all other variables have shown positive correlation with soil radon activity. Among five variables, soil temperature has been the most significant variable in terms of correlation with soil radon level whereas maximum humidity has been the least significant correlated variable. It has been observed that considerable reduction of soil radon level occurred after four heavy rainfall events during the study period. The combined effect of these multi-parameters on soil radon gas has been evaluated using machine learning methods like principal component regression (PCR), support vector regression (SVR), random forest regression (RF), and gradient boosting machine (GBM). In terms of performances, RF and GBM have performed much better than SVR and PCR. More robust and consistent results have been obtained for GBM during both training and testing periods.
Collapse
Affiliation(s)
- Arindam Kumar Naskar
- Nuclear and Particle Physics Research Centre, Department of Physics, Jadavpur University, Kolkata, 700032, West Bengal, India
- School of Studies in Environmental Radiation and Archaeological Sciences, Jadavpur University, Kolkata, 700032, West Bengal, India
- Department of Physics, Bangabasi Evening College, Kolkata, 700009, West Bengal, India
| | - Javed Akhter
- Department of Atmospheric Sciences, University of Calcutta, 51/2 Hazra Road, Kolkata, 700019, India
| | - Mahasin Gazi
- Nuclear and Particle Physics Research Centre, Department of Physics, Jadavpur University, Kolkata, 700032, West Bengal, India
- School of Studies in Environmental Radiation and Archaeological Sciences, Jadavpur University, Kolkata, 700032, West Bengal, India
- Apollo Multispeciality Hospitals, 58 Canal Circular Road, Kolkata, 700054, India
| | - Mitali Mondal
- Nuclear and Particle Physics Research Centre, Department of Physics, Jadavpur University, Kolkata, 700032, West Bengal, India
- School of Studies in Environmental Radiation and Archaeological Sciences, Jadavpur University, Kolkata, 700032, West Bengal, India
| | - Argha Deb
- Nuclear and Particle Physics Research Centre, Department of Physics, Jadavpur University, Kolkata, 700032, West Bengal, India.
- School of Studies in Environmental Radiation and Archaeological Sciences, Jadavpur University, Kolkata, 700032, West Bengal, India.
| |
Collapse
|
26
|
Han H, Kim B, Kim K, Kim D, Kim HS. Machine learning approach for the estimation of missing precipitation data: a case study of South Korea. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 88:556-571. [PMID: 37578874 PMCID: wst_2023_237 DOI: 10.2166/wst.2023.237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
Precipitation is one of the driving forces in water cycles, and it is vital for understanding the water cycle, such as surface runoff, soil moisture, and evapotranspiration. However, missing precipitation data at the observatory becomes an obstacle to improving the accuracy and efficiency of hydrological analysis. To address this issue, we developed a machine learning algorithm-based precipitation data recovery tool to detect and predict missing precipitation data at observatories. This study investigated 30 weather stations in South Korea, evaluating the applicability of machine learning algorithms (artificial neural network and random forest) for precipitation data recovery using environmental variables, such as air pressure, temperature, humidity, and wind speed. The proposed model showed a high performance in detecting the missing precipitation occurrence with an accuracy of 80%. In addition, the prediction results from the models showed predictive ability with a correlation coefficient ranging from 0.5 to 0.7 and R2 values of 0.53. Although both algorithms performed similarly in estimating precipitation, ANN performed slightly better. Based on the results of this study, we expect that the machine learning algorithms can contribute to improving hydrological modeling performance by recovering missing precipitation data at observation stations.
Collapse
Affiliation(s)
- Heechan Han
- Department of Civil Engineering, Chosun University, Gwangju, South Korea E-mail:
| | - Boran Kim
- Department of Civil and Environmental Engineering, Colorado State University, Fort Collins, CO, USA
| | - Kyunghun Kim
- Department of Civil Engineering, Inha University, Incheon, South Korea
| | - Donghyun Kim
- Institute of Water Resource System, Inha University, Incheon, South Korea
| | - Hung Soo Kim
- Department of Civil Engineering, Inha University, Incheon, South Korea
| |
Collapse
|
27
|
Lee H, Park S, V-Minh Nguyen H, Shin HS. Proposal for a new customization process for a data-based water quality index using a random forest approach. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 323:121222. [PMID: 36754201 DOI: 10.1016/j.envpol.2023.121222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/20/2023] [Accepted: 02/04/2023] [Indexed: 06/18/2023]
Abstract
As the water quality index (WQI) represents water quality, it is crucial to customize the WQI for a specific purpose. In this study, to better represent water quality data using WQI, a random forest (RF) approach was used to derive the parameter weight and calculate the WQI according to the watershed and its use. Eight parameters (water temperature, dissolved oxygen, pH, electrical conductivity, suspended solids, total nitrogen, total phosphorus, and total organic carbon) were evaluated using a total of 220,103 data points collected from 900 monitoring sites throughout South Korea between 2011 and 2020. The estimation of parameter weights, key elements in developing the WQI model, was performed through the variable importance estimation method that can be derived from the RF model. The parameter weights were derived based on various spatiotemporal datasets, and it was confirmed that the spatiotemporal differences in weights according to data characteristics represented the regional and seasonal water quality characteristics. Consequently, a customized WQI representing water quality characteristics could be calculated using data-based weights, and it is expected that a data-based customized WQI could be developed to better match the previous WQI to the purpose and target source.
Collapse
Affiliation(s)
- Hansaem Lee
- Department of Mechanical and Manufacturing Engineering, University of Calgary, Calgary, Alberta, T2N 1N4, Canada
| | - Seonyoung Park
- Department of Applied Artificial Intelligence, Seoul National University of Science & Technology, Seoul, 01811, South Korea
| | - Hang V-Minh Nguyen
- Department of Environmental Engineering, Seoul National University of Science & Technology, Seoul, 01811, South Korea
| | - Hyun-Sang Shin
- Department of Environmental Engineering, Seoul National University of Science & Technology, Seoul, 01811, South Korea.
| |
Collapse
|
28
|
Dharshini S. Deep learning approach for prediction and classification of potable water. ANAL SCI 2023:10.1007/s44211-023-00328-2. [PMID: 37029332 DOI: 10.1007/s44211-023-00328-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 03/19/2023] [Indexed: 04/09/2023]
Abstract
Potable water, commonly known as drinking water, refers to water that is safe to drink and does not endanger human health. It must adhere to strict quality standards set by health organizations, be devoid of dangerous pollutants and chemicals, and meet certain requirements for safety. The health of the public and the ecosystem are directly affected by water quality. Various pollutants have posed dangers to water quality in recent years. A more efficient and affordable approach is required due to the grave effects of low water quality. In this proposed research work, deep learning algorithms are developed to predict the water quality index (WQI) and water quality classifications (WQC), which are vital parameters that can be utilized to know the status of the water. To predict the WQI, a deep learning algorithm called long short-term memory (LSTM) is used. Further, WQC is performed using a deep learning algorithm called a convolutional neural network (CNN). The proposed system considers seven water quality parameters, namely, dissolved oxygen (DO), pH, conductivity, biological oxygen demand (BOD), nitrate, fecal coliform, and total coliform. The experimental results showed that the LSTM can predict water quality with superior robustness and predict WQI with the highest accuracy of 97%. Similarly, the CNN model classifies the WQC as potable or impotable with superior accuracy and a reduced error rate of 0.02.
Collapse
Affiliation(s)
- Shri Dharshini
- Department of Information Technology, Mepco Schlenk Engineering College, Sivakasi, India
| |
Collapse
|
29
|
Zheng H, Liu Y, Wan W, Zhao J, Xie G. Large-scale prediction of stream water quality using an interpretable deep learning approach. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 331:117309. [PMID: 36657204 DOI: 10.1016/j.jenvman.2023.117309] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 01/13/2023] [Accepted: 01/14/2023] [Indexed: 06/17/2023]
Abstract
Deep learning methods, which have strong capabilities for mapping highly nonlinear relationships with acceptable calculation speed, have been increasingly applied for water quality prediction in recent studies. However, it is argued that the practicality of deep learning methods is limited due to the lack of physical mechanics to explain the prediction results of water quality changes. A knowledge gap exists in rationalizing the deep learning results for water quality predictions. To address this gap, an interpretable deep learning framework was established to predict the spatiotemporal variations of water quality parameters in a large spatial region. Mereological, land-use, and socioeconomic variables were adopted to predict the daily variations of stream water quality parameters across 138 sub-catchments in a total of over 575,250 km2 in southern China. The coefficients of determination of chemical oxygen demand (COD), total phosphorus (TP), and total nitrogen (TN) predictions were over 0.80, suggesting a satisfactory prediction performance. The model performance in terms of prediction accuracy could be improved by involving land-use and socioeconomic predictors in addition to hydrological variables. The SHapley Additive exPlanations method used in this study was demonstrated to be effective for interpreting the prediction results by identifying the significant variables and reasoning their influencing directions on the variation of each water quality parameter. The air temperature, proportion of forest area, grain production, population density, and proportion of urban area in each sub-catchment as well as the accumulated rainfall within the previous 3 days were identified as the most significant variables affecting the variations of dissolved oxygen, COD, ammoniacal nitrogen(NH3-N), TN, TP, and turbidity in the stream water in the case area, respectively.
Collapse
Affiliation(s)
- Hang Zheng
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Yueyi Liu
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Wenhua Wan
- School of Environment and Civil Engineering, Dongguan University of Technology, Dongguan, 523808, China
| | - Jianshi Zhao
- Department of Hydraulic Engineering, Tsinghua University, Beijing, 100084, China
| | - Guanti Xie
- Dongguan Shigu Sewage Treatment Co., Ltd., Dongguan, 523808, China
| |
Collapse
|
30
|
Dimple, Singh PK, Rajput J, Kumar D, Gaddikeri V, Elbeltagi A. Combination of discretization regression with data-driven algorithms for modeling irrigation water quality indices. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.102093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2023]
|
31
|
Bieroza M, Acharya S, Benisch J, ter Borg RN, Hallberg L, Negri C, Pruitt A, Pucher M, Saavedra F, Staniszewska K, van’t Veen SGM, Vincent A, Winter C, Basu NB, Jarvie HP, Kirchner JW. Advances in Catchment Science, Hydrochemistry, and Aquatic Ecology Enabled by High-Frequency Water Quality Measurements. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:4701-4719. [PMID: 36912874 PMCID: PMC10061935 DOI: 10.1021/acs.est.2c07798] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 03/03/2023] [Accepted: 03/03/2023] [Indexed: 06/18/2023]
Abstract
High-frequency water quality measurements in streams and rivers have expanded in scope and sophistication during the last two decades. Existing technology allows in situ automated measurements of water quality constituents, including both solutes and particulates, at unprecedented frequencies from seconds to subdaily sampling intervals. This detailed chemical information can be combined with measurements of hydrological and biogeochemical processes, bringing new insights into the sources, transport pathways, and transformation processes of solutes and particulates in complex catchments and along the aquatic continuum. Here, we summarize established and emerging high-frequency water quality technologies, outline key high-frequency hydrochemical data sets, and review scientific advances in key focus areas enabled by the rapid development of high-frequency water quality measurements in streams and rivers. Finally, we discuss future directions and challenges for using high-frequency water quality measurements to bridge scientific and management gaps by promoting a holistic understanding of freshwater systems and catchment status, health, and function.
Collapse
Affiliation(s)
- Magdalena Bieroza
- Department
of Soil and Environment, SLU, Box 7014, Uppsala 750
07 Sweden
| | - Suman Acharya
- Department
of Environment and Genetics, School of Agriculture, Biomedicine and
Environment, La Trobe University, Albury/Wodonga Campus, Victoria 3690, Australia
| | - Jakob Benisch
- Institute
for Urban Water Management, TU Dresden, Bergstrasse 66, Dresden 01068, Germany
| | | | - Lukas Hallberg
- Department
of Soil and Environment, SLU, Box 7014, Uppsala 750
07 Sweden
| | - Camilla Negri
- Environment
Research Centre, Teagasc, Johnstown Castle, Wexford Y35 Y521, Ireland
- The
James
Hutton Institute, Craigiebuckler, Aberdeen AB15 8QH, United Kingdom
- School
of
Archaeology, Geography and Environmental Science, University of Reading, Whiteknights, Reading RG6 6AB, United Kingdom
| | - Abagael Pruitt
- Department
of Biological Sciences, University of Notre
Dame, Notre
Dame, Indiana 46556, United States
| | - Matthias Pucher
- Institute
of Hydrobiology and Aquatic Ecosystem Management, Vienna University of Natural Resources and Life Sciences, Gregor Mendel Straße 33, Vienna 1180, Austria
| | - Felipe Saavedra
- Department
for Catchment Hydrology, Helmholtz Centre
for Environmental Research - UFZ, Theodor-Lieser-Straße 4, Halle (Saale) 06120, Germany
| | - Kasia Staniszewska
- Department
of Earth and Atmospheric Sciences, University
of Alberta, Edmonton, Alberta T6G 2E3, Canada
| | - Sofie G. M. van’t Veen
- Department
of Ecoscience, Aarhus University, Aarhus 8000, Denmark
- Envidan
A/S, Silkeborg 8600, Denmark
| | - Anna Vincent
- Department
of Biological Sciences, University of Notre
Dame, Notre
Dame, Indiana 46556, United States
| | - Carolin Winter
- Environmental
Hydrological Systems, University of Freiburg, Friedrichstraße 39, Freiburg 79098, Germany
- Department
of Hydrogeology, Helmholtz Centre for Environmental
Research - UFZ, Permoserstr.
15, Leipzig 04318, Germany
| | - Nandita B. Basu
- Department
of Civil and Environmental Engineering and Department of Earth and
Environmental Sciences, and Water Institute, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - Helen P. Jarvie
- Water Institute
and Department of Geography and Environmental Management, University of Waterloo, Waterloo, Ontario N2L 3G1, Canada
| | - James W. Kirchner
- Department
of Environmental System Sciences, ETH Zurich, Zurich CH-8092, Switzerland
- Swiss
Federal Research Institute WSL, Birmensdorf CH-8903, Switzerland
| |
Collapse
|
32
|
Al-Janabi S, Al-Barmani Z. Intelligent multi-level analytics of soft computing approach to predict water quality index (IM12CP-WQI). Soft comput 2023. [DOI: 10.1007/s00500-023-07953-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
|
33
|
Simeoni C, Furlan E, Pham HV, Critto A, de Juan S, Trégarot E, Cornet CC, Meesters E, Fonseca C, Botelho AZ, Krause T, N'Guetta A, Cordova FE, Failler P, Marcomini A. Evaluating the combined effect of climate and anthropogenic stressors on marine coastal ecosystems: Insights from a systematic review of cumulative impact assessment approaches. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 861:160687. [PMID: 36473660 DOI: 10.1016/j.scitotenv.2022.160687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 11/29/2022] [Accepted: 11/30/2022] [Indexed: 06/17/2023]
Abstract
Cumulative impacts increasingly threaten marine and coastal ecosystems. To address this issue, the research community has invested efforts on designing and testing different methodological approaches and tools that apply cumulative impact appraisal schemes for a sound evaluation of the complex interactions and dynamics among multiple pressures affecting marine and coastal ecosystems. Through an iterative scientometric and systematic literature review, this paper provides the state of the art of cumulative impact assessment approaches and applications. It gives a specific attention to cutting-edge approaches that explore and model inter-relations among climatic and anthropogenic pressures, vulnerability and resilience of marine and coastal ecosystems to these pressures, and the resulting changes in ecosystem services flow. Despite recent advances in computer sciences and the rising availability of big data for environmental monitoring and management, this literature review evidenced that the implementation of advanced complex system methods for cumulative risk assessment remains limited. Moreover, experts have only recently started integrating ecosystem services flow into cumulative impact appraisal frameworks, but more as a general assessment endpoint within the overall evaluation process (e.g. changes in the bundle of ecosystem services against cumulative impacts). The review also highlights a lack of integrated approaches and complex tools able to frame, explain, and model spatio-temporal dynamics of marine and coastal ecosystems' response to multiple pressures, as required under relevant EU legislation (e.g., Water Framework and Marine Strategy Framework Directives). Progress in understanding cumulative impacts, exploiting the functionalities of more sophisticated machine learning-based approaches (e.g., big data integration), will support decision-makers in the achievement of environmental and sustainability objectives.
Collapse
Affiliation(s)
- Christian Simeoni
- Department of Environmental Sciences, Informatics and Statistics, University Ca' Foscari Venice, I-30170 Venice, Italy; Centro Euro-Mediterraneo sui Cambiamenti Climatici and Università Ca' Foscari Venezia, CMCC@Ca'Foscari - Edificio Porta dell'Innovazione, 2nd floor - Via della Libertà, 12 - 30175 Venice, Italy
| | - Elisa Furlan
- Department of Environmental Sciences, Informatics and Statistics, University Ca' Foscari Venice, I-30170 Venice, Italy; Centro Euro-Mediterraneo sui Cambiamenti Climatici and Università Ca' Foscari Venezia, CMCC@Ca'Foscari - Edificio Porta dell'Innovazione, 2nd floor - Via della Libertà, 12 - 30175 Venice, Italy
| | - Hung Vuong Pham
- Department of Environmental Sciences, Informatics and Statistics, University Ca' Foscari Venice, I-30170 Venice, Italy; Centro Euro-Mediterraneo sui Cambiamenti Climatici and Università Ca' Foscari Venezia, CMCC@Ca'Foscari - Edificio Porta dell'Innovazione, 2nd floor - Via della Libertà, 12 - 30175 Venice, Italy
| | - Andrea Critto
- Department of Environmental Sciences, Informatics and Statistics, University Ca' Foscari Venice, I-30170 Venice, Italy; Centro Euro-Mediterraneo sui Cambiamenti Climatici and Università Ca' Foscari Venezia, CMCC@Ca'Foscari - Edificio Porta dell'Innovazione, 2nd floor - Via della Libertà, 12 - 30175 Venice, Italy.
| | - Silvia de Juan
- Instituto Mediterraneo de Estudios Avanzados, IMEDEA (CSIC-UIB), Miquel Marques 21, Esporles, Islas Baleares, Spain
| | - Ewan Trégarot
- Centre for Blue Governance, Portsmouth Business School, University of Portsmouth, Richmond Building, Portland Street, Portsmouth PO1 3DE, UK
| | - Cindy C Cornet
- Centre for Blue Governance, Portsmouth Business School, University of Portsmouth, Richmond Building, Portland Street, Portsmouth PO1 3DE, UK
| | - Erik Meesters
- Wageningen Marine Research, Wageningen University and Research, 1781, AG, Den Helder, the Netherlands; Aquatic Ecology and Water Quality Management, Wageningen University and Research, 6700, AA, Wageningen, the Netherlands
| | - Catarina Fonseca
- cE3c - Centre for Ecology, Evolution and Environmental Changes, Azorean Biodiversity Group, CHANGE - Global Change and Sustainability Institute, Faculty of Sciences and Technology, University of the Azores, Rua da Mãe de Deus, 9500-321, Ponta Delgada, Portugal; CICS.NOVA - Interdisciplinary Centre of Social Sciences, Faculty of Social Sciences and Humanities (FCSH/NOVA), Avenida de Berna 26-C, Lisboa 1069-061, Portugal
| | - Andrea Zita Botelho
- Faculty of Sciences and Technology, University of the Azores, Ponta Delgada, Portugal; CIBIO (CIBIO - Research Centre in Biodiversity and Genetic Resources, InBio Associate Laboratory, Ponta Delgada, Portugal
| | - Torsten Krause
- Lund University Centre for Sustainability Studies, P.O. Box 170, 221-00 Lund, Sweden
| | - Alicia N'Guetta
- Lund University Centre for Sustainability Studies, P.O. Box 170, 221-00 Lund, Sweden
| | | | - Pierre Failler
- Centre for Blue Governance, Portsmouth Business School, University of Portsmouth, Richmond Building, Portland Street, Portsmouth PO1 3DE, UK
| | - Antonio Marcomini
- Department of Environmental Sciences, Informatics and Statistics, University Ca' Foscari Venice, I-30170 Venice, Italy; Centro Euro-Mediterraneo sui Cambiamenti Climatici and Università Ca' Foscari Venezia, CMCC@Ca'Foscari - Edificio Porta dell'Innovazione, 2nd floor - Via della Libertà, 12 - 30175 Venice, Italy
| |
Collapse
|
34
|
Uddin MG, Nash S, Rahman A, Olbert AI. A novel approach for estimating and predicting uncertainty in water quality index model using machine learning approaches. WATER RESEARCH 2023; 229:119422. [PMID: 36459893 DOI: 10.1016/j.watres.2022.119422] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 11/20/2022] [Accepted: 11/23/2022] [Indexed: 06/17/2023]
Abstract
With the significant increase in WQI applications worldwide and lack of specific application guidelines, accuracy and reliability of WQI models is a major issue. It has been reported that WQI models produce significant uncertainties during the various stages of their application including: (i) water quality indicator selection, (ii) sub-index (SI) calculation, (iii) water quality indicator weighting and (iv) aggregation of sub-indices to calculate the overall index. This research provides a robust statistically sound methodology for assessment of WQI model uncertainties. Eight WQI models are considered. The Monte Carlo simulation (MCS) technique was applied to estimate model uncertainty, while the Gaussian Process Regression (GPR) algorithm was utilised to predict uncertainties in the WQI models at each sampling site. The sub-index functions were found to contribute to considerable uncertainty and hence affect the model reliability - they contributed 12.86% and 10.27% of uncertainty for summer and winter applications, respectively. Therefore, the selection of sub-index function needs to be made with care. A low uncertainty of less than 1% was produced by the water quality indicator selection and weighting processes. Significant statistical differences were found between various aggregation functions. The weighted quadratic mean (WQM) function was found to provide a plausible assessment of water quality of coastal waters at reduced uncertainty levels. The findings of this study also suggest that the unweighted root means squared (RMS) aggregation function could be potentially also used for assessment of coastal water quality. Findings from this research could inform a range of stakeholders including decision-makers, researchers, and agencies responsible for water quality monitoring, assessment and management.
Collapse
Affiliation(s)
- Md Galal Uddin
- Civil Engineering, School of Engineering, College of Science and Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland.
| | - Stephen Nash
- Civil Engineering, School of Engineering, College of Science and Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | - Agnieszka I Olbert
- Civil Engineering, School of Engineering, College of Science and Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland
| |
Collapse
|
35
|
Dong W, Zhang Y, Zhang L, Ma W, Luo L. What will the water quality of the Yangtze River be in the future? THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 857:159714. [PMID: 36302434 DOI: 10.1016/j.scitotenv.2022.159714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/11/2022] [Accepted: 10/21/2022] [Indexed: 06/16/2023]
Abstract
The long-term prediction of water quality is important for water pollution control planning and water resource management, but it has received little attention. In this study, the water quality trend in the Yangtze River is found to stabilize at most monitoring stations under environmental protection activities. Based on the physical mechanism and stochastic theory, a novel river water quality prediction model combining pollution source decomposition (including local point, local nonpoint and upstream sources) and time series decomposition (including trend, seasonal and residential components) is developed. The observed water quality data from 76 monitoring stations in the Yangtze River, including permanganate index (CODMn) and total phosphorus (TP), are used to drive this model to make long-term water quality predictions. The results show that this model has an acceptable accuracy. In the future, the concentration of CODMn will meet the water quality targets at most stations in the Yangtze River, but the concentration of TP will not be able to meet the water quality target at 28.5 % of the stations. Furthermore, the prediction value of CODMn is 62.2 % lower than the target on average. However, the prediction value of TP is only 24.4 % lower than the target on average, and it will exceed the water target by >50 % at some stations. This model has the potential to be widely used for long-term water quality prediction in the future.
Collapse
Affiliation(s)
- Wenxun Dong
- State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan 430072, China
| | - Yanjun Zhang
- State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan 430072, China.
| | - Liping Zhang
- State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan 430072, China
| | - Wei Ma
- Department of Water Ecology and Environment, China Institute of Water Resources and Hydropower Research, Beijing 100038, China
| | - Lan Luo
- State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan 430072, China
| |
Collapse
|
36
|
Lap BQ, Phan TTH, Nguyen HD, Quang LX, Hang PT, Phi NQ, Hoang VT, Linh PG, Thanh Hang BT. Predicting Water Quality Index (WQI) by feature selection and machine learning: A case study of An Kim Hai irrigation system. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.101991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
37
|
Monfared SH, Walsh C, Curtis T, Jarvis A, Darmian MD, Khodabandeh F. New coefficient for water quality modelling in meandering rivers: Fatigue factor. ECOL INFORM 2023. [DOI: 10.1016/j.ecoinf.2023.101999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
38
|
Taşan M, Taşan S, Demir Y. Estimation and uncertainty analysis of groundwater quality parameters in a coastal aquifer under seawater intrusion: a comparative study of deep learning and classic machine learning methods. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:2866-2890. [PMID: 35941499 DOI: 10.1007/s11356-022-22375-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 07/30/2022] [Indexed: 06/15/2023]
Abstract
Excessive withdrawal of groundwater for agricultural irrigation can cause seawater intrusion into coastal aquifers. Such a case will in turn results in deterioration of irrigation water quality. Determination of irrigation water quality with traditional methods is a time-consuming and costly process. However, machine learning algorithms can be useful tools for modeling and estimating groundwater quality used for irrigation water purposes. In this study, TDS, PS, SAR, and Cl parameters of groundwater were estimated with models based on EC and pH variables. For this purpose, prediction performances of two different deep learning methods (convolutional neural network (CNN) and deep neural network (DNN)) and two different classical machine learning (Random Forest (RF) and extreme gradient boosting (XGBoost)) methods were compared. In addition, predictive uncertainty of the models was determined by quantile regression (QR) analysis. Performance criteria and results of uncertainty analysis revealed that CNN (in testing phase, NSE = 0.95 for TDS, NSE = 0.96 for PS, NSE = 0.67 for SAR and NSE = 0.93 for CI) and DNN (in testing phase, NSE = 0.91 for TDS, NSE = 0.91 for PS, NSE = 0.57 for SAR and NSE = 0.94 for Cl) models had quite a close performance in estimation of TDS, PS, SAR, and Cl parameters and higher than the other two classical machine learning methods. As a result, the CNN model can be considered the best performing model in estimating all quality parameters due to the highest NSE and lowest RMSE values. In addition, the Taylor diagram showed that the values estimated using the CNN model had the highest correlation with the measured data. It was determined that the model with the lowest uncertainty based on the PICP statistics was DNN, followed by the CNN model. However, the CNN model has predicted outliers more accurately. Present findings proved that deep learning models could offer efficient tools for predicting irrigation water quality parameters.
Collapse
Affiliation(s)
- Mehmet Taşan
- Department of Soil and Water Resources, Black Sea Agricultural Research Institute, 55300, Samsun, Turkey.
| | - Sevda Taşan
- Faculty of Agriculture, Department of Agricultural Structures and Irrigation, Ondokuz Mayis University, 55139, Samsun, Turkey
| | - Yusuf Demir
- Faculty of Agriculture, Department of Agricultural Structures and Irrigation, Ondokuz Mayis University, 55139, Samsun, Turkey
| |
Collapse
|
39
|
Uddin MG, Nash S, Mahammad Diganta MT, Rahman A, Olbert AI. Robust machine learning algorithms for predicting coastal water quality index. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2022; 321:115923. [PMID: 35988401 DOI: 10.1016/j.jenvman.2022.115923] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 07/06/2022] [Accepted: 07/30/2022] [Indexed: 06/15/2023]
Abstract
Coastal water quality assessment is an essential task to keep "good water quality" status for living organisms in coastal ecosystems. The Water quality index (WQI) is a widely used tool to assess water quality but this technique has received much criticism due to the model's reliability and inconsistence. The present study used a recently developed improved WQI model for calculating coastal WQIs in Cork Harbour. The aim of the research is to determine the most reliable and robust machine learning (ML) algorithm(s) to anticipate WQIs at each monitoring point instead of repeatedly employing SI and weight values in order to reduce model uncertainty. In this study, we compared eight commonly used algorithms, including Random Forest (RF), Decision Tree (DT), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGB), Extra Tree (ExT), Support Vector Machine (SVM), Linear Regression (LR), and Gaussian Naïve Bayes (GNB). For the purposes of developing the prediction models, the dataset was divided into two groups: training (70%) and testing (30%), whereas the models were validated using the 10-fold cross-validation method. In order to evaluate the models' performance, the RMSE, MSE, MAE, R2, and PREI metrics were used in this study. The tree-based DT (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R2 = 1.0 and PERI = 0.0) and the ExT (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R2 = 1.0 and PERI = 0.0) and ensemble tree-based XGB (RMSE = 0.0, MSE = 0.0, MAE = 0.0, R2 = 1.0 and PERI = +0.16 to -0.17) and RF (RMSE = 2.0, MSE = 3.80, MAE = 1.10, R2 = 0.98, PERI = +3.52 to -25.38) models outperformed other models. The results of model performance and PREI indicate that the DT, ExT, and GXB models could be effective, robust and significantly reduce model uncertainty in predicting WQIs. The findings of this study are also useful for reducing model uncertainty and optimizing the WQM-WQI model architecture for predicting WQI values.
Collapse
Affiliation(s)
- Md Galal Uddin
- School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland.
| | - Stephen Nash
- School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland
| | - Mir Talas Mahammad Diganta
- School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland
| | - Azizur Rahman
- School of Computing, Mathematics and Engineering, Charles Sturt University, Wagga Wagga, Australia; The Gulbali Institute of Agriculture, Water and Environment, Charles Sturt University, Wagga Wagga, Australia
| | - Agnieszka I Olbert
- School of Engineering, National University of Ireland Galway, Ireland; Ryan Institute, National University of Ireland Galway, Ireland; MaREI Research Centre, National University of Ireland Galway, Ireland
| |
Collapse
|
40
|
Mohd Zebaral Hoque J, Ab. Aziz NA, Alelyani S, Mohana M, Hosain M. Improving Water Quality Index Prediction Using Regression Learning Models. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:13702. [PMID: 36294286 PMCID: PMC9602497 DOI: 10.3390/ijerph192013702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 10/14/2022] [Accepted: 10/17/2022] [Indexed: 06/16/2023]
Abstract
Rivers are the main sources of freshwater supply for the world population. However, many economic activities contribute to river water pollution. River water quality can be monitored using various parameters, such as the pH level, dissolved oxygen, total suspended solids, and the chemical properties. Analyzing the trend and pattern of these parameters enables the prediction of the water quality so that proactive measures can be made by relevant authorities to prevent water pollution and predict the effectiveness of water restoration measures. Machine learning regression algorithms can be applied for this purpose. Here, eight machine learning regression techniques, including decision tree regression, linear regression, ridge, Lasso, support vector regression, random forest regression, extra tree regression, and the artificial neural network, are applied for the purpose of water quality index prediction. Historical data from Indian rivers are adopted for this study. The data refer to six water parameters. Twelve other features are then derived from the original six parameters. The performances of the models using different algorithms and sets of features are compared. The derived water quality rating scale features are identified to contribute toward the development of better regression models, while the linear regression and ridge offer the best performance. The best mean square error achieved is 0 and the correlation coefficient is 1.
Collapse
Affiliation(s)
| | - Nor Azlina Ab. Aziz
- Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia
| | - Salem Alelyani
- Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia
- College of Computer Science, King Khalid University, Abha 61421, Saudi Arabia
| | - Mohamed Mohana
- Center for Artificial Intelligence (CAI), King Khalid University, Abha 61421, Saudi Arabia
| | - Maruf Hosain
- Faculty of Engineering & Technology, Multimedia University, Melaka 75450, Malaysia
| |
Collapse
|
41
|
Virro H, Kmoch A, Vainu M, Uuemaa E. Random forest-based modeling of stream nutrients at national level in a data-scarce region. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 840:156613. [PMID: 35700783 DOI: 10.1016/j.scitotenv.2022.156613] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 05/12/2022] [Accepted: 06/07/2022] [Indexed: 06/15/2023]
Abstract
Nutrient runoff from agricultural production is one of the main causes of water quality deterioration in river systems and coastal waters. Water quality modeling can be used for gaining insight into water quality issues in order to implement effective mitigation efforts. Process-based nutrient models are very complex, requiring a lot of input parameters and computationally expensive calibration. Recently, ML approaches have shown to achieve an accuracy comparable to the process-based models and even outperform them when describing nonlinear relationships. We used observations from 242 Estonian catchments, amounting to 469 yearly TN and 470 TP measurements covering the period 2016-2020 to train random forest (RF) models for predicting annual N and P concentrations. We used a total of 82 predictor variables, including land cover, soil, climate and topography parameters and applied a feature selection strategy to reduce the number of dependent features in the models. The SHAP method was used for deriving the most relevant predictors. The performance of our models is comparable to previous process-based models used in the Baltic region with the TN and TP model having an R2 score of 0.83 and 0.52, respectively. However, as input data used in our models is easier to obtain, the models offer superior applicability in areas, where data availability is insufficient for process-based approaches. Therefore, the models enable to give a robust estimation for nutrient losses at national level and allows to capture the spatial variability of the nutrient runoff which in turn enables to provide decision-making support for regional water management plans.
Collapse
Affiliation(s)
- Holger Virro
- Department of Geography, Institute of Ecology and Earth Sciences, University of Tartu, Vanemuise 46, Tartu 51003, Estonia.
| | - Alexander Kmoch
- Department of Geography, Institute of Ecology and Earth Sciences, University of Tartu, Vanemuise 46, Tartu 51003, Estonia
| | - Marko Vainu
- Institute of Ecology, Tallinn University, Uus-Sadama 5, Tallinn 10120, Estonia
| | - Evelyn Uuemaa
- Department of Geography, Institute of Ecology and Earth Sciences, University of Tartu, Vanemuise 46, Tartu 51003, Estonia
| |
Collapse
|
42
|
Kim S, Alizamir M, Seo Y, Heddam S, Chung IM, Kim YO, Kisi O, Singh VP. Estimating the incubated river water quality indicator based on machine learning and deep learning paradigms: BOD5 Prediction. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:12744-12773. [PMID: 36654020 DOI: 10.3934/mbe.2022595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
As an indicator measured by incubating organic material from water samples in rivers, the most typical characteristic of water quality items is biochemical oxygen demand (BOD5) concentration, which is a stream pollutant with an extreme circumstance of organic loading and controlling aquatic behavior in the eco-environment. Leading monitoring approaches including machine leaning and deep learning have been evolved for a correct, trustworthy, and low-cost prediction of BOD5 concentration. The addressed research investigated the efficiency of three standalone models including machine learning (extreme learning machine (ELM) and support vector regression (SVR)) and deep learning (deep echo state network (Deep ESN)). In addition, the novel double-stage synthesis models (wavelet-extreme learning machine (Wavelet-ELM), wavelet-support vector regression (Wavelet-SVR), and wavelet-deep echo state network (Wavelet-Deep ESN)) were developed by integrating wavelet transformation (WT) with the different standalone models. Five input associations were supplied for evaluating standalone and double-stage synthesis models by determining diverse water quantity and quality items. The proposed models were assessed using the coefficient of determination (R2), Nash-Sutcliffe (NS) efficiency, and root mean square error (RMSE). The significance of addressed research can be found from the overall outcomes that the predictive accuracy of double-stage synthesis models were not always superior to that of standalone models. Overall results showed that the SVR with 3th distribution (NS = 0.915) and the Wavelet-SVR with 4th distribution (NS = 0.915) demonstrated more correct outcomes for predicting BOD5 concentration compared to alternative models at Hwangji station, and the Wavelet-SVR with 4th distribution (NS = 0.917) was judged to be the most superior model at Toilchun station. In most cases for predicting BOD5 concentration, the novel double-stage synthesis models can be utilized for efficient and organized data administration and regulation of water pollutants on both stations, South Korea.
Collapse
Affiliation(s)
- Sungwon Kim
- Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju, 36040, Republic of Korea
| | - Meysam Alizamir
- Department of Civil Engineering, Hamedan Branch, Islamic Azad University, Hamedan, Iran
| | - Youngmin Seo
- Department of Constructional and Environmental Engineering, Kyungpook National University, Sangju, 37224, Republic of Korea
| | - Salim Heddam
- Faculty of Science, Agronomy Department, Hydraulics Division, Laboratory of Research in Biodiversity Interaction Ecosystem and Biotechnology, University 20 Août 1955, Route El Hadaik, BP 26, Skikda, Algeria
| | - Il-Moon Chung
- Department of Hydro Science and Engineering Research, Korea Institute of Civil Engineering and Building Technology, Goyang-si 10223, Republic of Korea
| | - Young-Oh Kim
- Department of Civil Engineering, Seoul National University, Seoul, Republic of Korea
| | - Ozgur Kisi
- Department of Civil Engineering, University of Applied Sciences, 23562, Lübeck, Germany
| | - Vijay P Singh
- Department of Biological and Agricultural Engineering & Zachry Department of Civil Engineering, Texas A & M University, College Station, Texas, 77843-2117, USA
| |
Collapse
|
43
|
Groundwater Quality: The Application of Artificial Intelligence. JOURNAL OF ENVIRONMENTAL AND PUBLIC HEALTH 2022; 2022:8425798. [PMID: 36060879 PMCID: PMC9433268 DOI: 10.1155/2022/8425798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 07/31/2022] [Accepted: 08/04/2022] [Indexed: 11/17/2022]
Abstract
Humans and all other living things depend on having access to clean water, as it is an indispensable essential resource. Therefore, the development of a model that can predict water quality conditions in the future will have substantial societal and economic value. This can be accomplished by using a model that can predict future water quality circumstances. In this study, we employed a sophisticated artificial neural network (ANN) model. This study intends to develop a hybrid model of single exponential smoothing (SES) with bidirectional long short-term memory (BiLSTM) and an adaptive neurofuzzy inference system (ANFIS) to predict water quality (WQ) in different groundwater in the Al-Baha region of Saudi Arabia. Single exponential smoothing (SES) was employed as a preprocessing method to adjust the weight of the dataset, and the output from SES was processed using the BiLSTM and ANFIS models for predicting water quality. The data were randomly divided into two phases, training (70%) and testing (30%). Efficiency statistics were used to evaluate the SES-BiLSTM and SES-ANFIS models' prediction abilities. The results showed that while both the SES-BiLSTM and SES-ANFIS models performed well in predicting the water quality index (WQI), the SES-BiLSTM model performed best with accuracy (R = 99.95% and RMSE = 0.00910) at the testing phase, where the performance of the SES-ANFIS model was R = 99.95% and RMSE = 2.2941 × 100-07. The findings support the idea that the SES-BilSTM and SES-ANFIS models can be used to predict the WQI with high accuracy, which will help to enhance WQ. The results demonstrated that the SES-BiLSTM and SES-ANFIS models' forecasts are accurate and that both seasons' performances are consistent. Similar investigations of groundwater quality prediction for drinking purposes should benefit from the proposed SES-BiLSTM and SES-ANFIS models. Consequently, the results demonstrate that the proposed SES-BiLSTM and SES-ANFIS models are useful tools for predicting whether the groundwater in Al-Baha city is suitable for drinking and irrigation purposes.
Collapse
|
44
|
Geospatial Artificial Intelligence (GeoAI) in the Integrated Hydrological and Fluvial Systems Modeling: Review of Current Applications and Trends. WATER 2022. [DOI: 10.3390/w14142211] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
This paper reviews the current GeoAI and machine learning applications in hydrological and hydraulic modeling, hydrological optimization problems, water quality modeling, and fluvial geomorphic and morphodynamic mapping. GeoAI effectively harnesses the vast amount of spatial and non-spatial data collected with the new automatic technologies. The fast development of GeoAI provides multiple methods and techniques, although it also makes comparisons between different methods challenging. Overall, selecting a particular GeoAI method depends on the application’s objective, data availability, and user expertise. GeoAI has shown advantages in non-linear modeling, computational efficiency, integration of multiple data sources, high accurate prediction capability, and the unraveling of new hydrological patterns and processes. A major drawback in most GeoAI models is the adequate model setting and low physical interpretability, explainability, and model generalization. The most recent research on hydrological GeoAI has focused on integrating the physical-based models’ principles with the GeoAI methods and on the progress towards autonomous prediction and forecasting systems.
Collapse
|
45
|
Sakaa B, Elbeltagi A, Boudibi S, Chaffaï H, Islam ARMT, Kulimushi LC, Choudhari P, Hani A, Brouziyne Y, Wong YJ. Water quality index modeling using random forest and improved SMO algorithm for support vector machine in Saf-Saf river basin. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:48491-48508. [PMID: 35192167 DOI: 10.1007/s11356-022-18644-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 01/08/2022] [Indexed: 06/14/2023]
Abstract
The water quality index is one of the prominent general indicators to assess and classify surface water quality, which plays a critical role in river water resources practices. This research constructs a hybrid artificial intelligence model namely sequential minimal optimization-support vector machine (SMO-SVM) along with random forest (RF) as a benchmark model for predicting water quality values at the Wadi Saf-Saf river basin in Algeria. The fifteen input water quality datasets such as biochemical oxygen demand (BOD), oxygen saturation (OS), the potential for hydrogen (pH), chemical oxygen demand (COD), chloride (Cl-), dissolved oxygen (DO), electrical conductivity (EC), total dissolved solids (TDS), nitrate-nitrogen (NO3-N), nitrite-nitrogen (NO2-N), phosphate (PO43-), ammonium (NH4+), temperature (T), turbidity (NTU), and suspended solids (SS) were employed for constructing the predictive models. Different input data combinations are evaluated in terms of predictive performance, using a set of statistical metrics and graphical representation. Results show that less than 40% of samples were observed to be poor quality water during the dry season in downstream northeastern part of the basin. The findings also show that the RF model mostly generates more precise water quality index predictions than the SMO-SVM model for both training and testing stages. Although thirteen input parameters attain the optimal predictive performance (R2 testing = 0.82, RMSE testing = 5.17), a couple of five input parameters, e.g., only pH, EC, TDS, T, and saturation, gives the second optimal predictive precision (R2 test = 0.81, RMSE testing = 5.55). The sensitivity analysis results indicate a greater sensitivity by the all input variables chosen except NO2- of the predictive outcomes to the earlier influencing water quality parameters. Overall, the RF model reveals an improvement on earlier tools for predicting water quality index, according to predictive performance and reducing in the number of input variables.
Collapse
Affiliation(s)
- Bachir Sakaa
- Scientific and Technical Research Center on Arid Regions (CRSTRA), BP 1682 RP, 07000, Biskra, Algeria
- Faculty of Earth Sciences, Laboratory of Water Resource and Sustainable Development (REED), Badji Mokhtar University, BP 12, 23000, Annaba, Algeria
| | - Ahmed Elbeltagi
- Agricultural Engineering Dept, Faculty of Agriculture, Mansoura University, Mansoura, 35516, Egypt
| | - Samir Boudibi
- Scientific and Technical Research Center on Arid Regions (CRSTRA), BP 1682 RP, 07000, Biskra, Algeria.
| | - Hicham Chaffaï
- Faculty of Earth Sciences, Laboratory of Water Resource and Sustainable Development (REED), Badji Mokhtar University, BP 12, 23000, Annaba, Algeria
| | | | - Luc Cimusa Kulimushi
- Department of Environmental Studies, University of Lay Adventists of Kigali, P.O. Box: 6392, Kigali, Rwanda
| | | | - Azzedine Hani
- Faculty of Earth Sciences, Laboratory of Water Resource and Sustainable Development (REED), Badji Mokhtar University, BP 12, 23000, Annaba, Algeria
| | - Youssef Brouziyne
- International Water Research Institute, Mohammed VI Polytechnic University (UM6P), 43150, Benguerir, Morocco
| | - Yong Jie Wong
- Research Center for Environmental Quality Management, Graduate School of Engineering, Kyoto University, 1-2 Yumihama, Otsu, Shiga, 520-0811, Japan
| |
Collapse
|
46
|
Yang H, Liu S. Water quality prediction in sea cucumber farming based on a GRU neural network optimized by an improved whale optimization algorithm. PeerJ Comput Sci 2022; 8:e1000. [PMID: 35721411 PMCID: PMC9202628 DOI: 10.7717/peerj-cs.1000] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 05/13/2022] [Indexed: 06/15/2023]
Abstract
Sea cucumber farming is an important part of China's aquaculture industry, and sea cucumbers have higher requirements for aquaculture water quality. This article proposes a sea cucumber aquaculture water quality prediction model that uses an improved whale optimization algorithm to optimize the gated recurrent unit neural network(IWOA-GRU), which provides a reference for the water quality control in the sea cucumber growth environment. This model first applies variational mode decomposition (VMD) and the wavelet threshold joint denoising method to remove mixed noise in water quality time series. Then, by optimizing the convergence factor, the convergence speed and global optimization ability of the whale optimization algorithm are strengthened. Finally, the improved whale optimization algorithm is used to construct a GRU prediction model based on optimal network weights and thresholds to predict sea cucumber farming water quality. The model was trained and tested using three water quality indices (dissolved oxygen, temperature and salinity) of sea cucumber culture waters in Shandong Peninsula, China, and compared with prediction models such as support vector regression (SVR), random forest (RF), convolutional neural network (CNN), recurrent neural network (RNN), and long short-term memory neural network (LSTM). Experimental results show that the prediction accuracy and generalization performance of this model are better than those of the other compared models.
Collapse
Affiliation(s)
- Huanhai Yang
- School of Computer Science and Technology, Shandong Technology and Business University, Yantai, Shandong, China
| | - Shue Liu
- Binzhou Medical University, Yantai, Shandong, China
| |
Collapse
|
47
|
Hybrid Machine Learning Models for Soil Saturated Conductivity Prediction. WATER 2022. [DOI: 10.3390/w14111729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The hydraulic conductivity of saturated soil is a crucial parameter in the study of any engineering problem concerning groundwater. Hydraulic conductivity mainly depends on particle size distribution, soil compaction, and properties that influence aggregation and water retention. Generally, finding simple and accurate analytical equations between the hydraulic conductivity of soil and the characteristics on which it depends is a very hard task. Machine learning algorithms can provide excellent tools for tackling highly nonlinear regression problems. Additionally, hybrid models resulting from the combination of multiple machine learning algorithms can further improve the accuracy of predictions. Five different models were built to predict saturated hydraulic conductivity using a dataset extracted from the Soil Water Infiltration Global database. The models were based on different predictors. Seven variants of each model were compared, replacing the implemented algorithm. Three variants were based on individual models, while four variants were based on hybrid models. The employed individual machine learning algorithms were Multilayer Perceptron, Random Forest, and Support Vector Regression. The model based on the largest number of predictors led to the most accurate predictions. In addition, across all models, hybrid variants based on all three algorithms and hybridized variants of Random Forest and Support Vector Regression proved to be the most accurate (R2 values up to 0.829). However, all variants showed a tendency to overestimate conductivity in soils where it is very low.
Collapse
|
48
|
Using Machine Learning Models for Predicting the Water Quality Index in the La Buong River, Vietnam. WATER 2022. [DOI: 10.3390/w14101552] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
For effective management of water quantity and quality, it is absolutely essential to estimate the pollution level of the existing surface water. This case study aims to evaluate the performance of twelve machine learning (ML) models, including five boosting-based algorithms (adaptive boosting, gradient boosting, histogram-based gradient boosting, light gradient boosting, and extreme gradient boosting), three decision tree-based algorithms (decision tree, extra trees, and random forest), and four ANN-based algorithms (multilayer perceptron, radial basis function, deep feed-forward neural network, and convolutional neural network), in estimating the surface water quality of the La Buong River in Vietnam. Water quality data at four monitoring stations alongside the La Buong River for the period 2010–2017 were utilized to calculate the water quality index (WQI). Prediction performance of the ML models was evaluated by using two efficiency statistics (i.e., R2 and RMSE). The results indicated that all twelve ML models have good performance in predicting the WQI but that extreme gradient boosting (XGBoost) has the best performance with the highest accuracy (R2 = 0.989 and RMSE = 0.107). The findings strengthen the argument that ML models, especially XGBoost, may be employed for WQI prediction with a high level of accuracy, which will further improve water quality management.
Collapse
|
49
|
Nafsin N, Li J. Prediction of 5-day biochemical oxygen demand in the Buriganga River of Bangladesh using novel hybrid machine learning algorithms. WATER ENVIRONMENT RESEARCH : A RESEARCH PUBLICATION OF THE WATER ENVIRONMENT FEDERATION 2022; 94:e10718. [PMID: 35502725 DOI: 10.1002/wer.10718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 04/05/2022] [Accepted: 04/08/2022] [Indexed: 06/14/2023]
Abstract
Biochemical oxygen demand (BOD) is one of the most important variables indicating stream pollution with a severe condition of organic loading and maintaining aquatic life in ecosystems. Advanced monitoring techniques such as machine learning (ML) methods have been developed for an accurate, reliable, and cost-effective prediction of BOD. This study investigated the effectiveness of four stand-alone ML algorithms, namely, artificial neural network (ANN), support vector machine (SVM), random forest (RF), and gradient boosting machine (GBM), and six novel hybrid algorithms, namely, RF-SVM, ANN-SVM, GBM-SVM, RF-ANN, GBM-ANN, and RF-GBM, in predicting BOD of the Buriganga river system of Bangladesh. The feature importance analysis of RF algorithm indicated that chemical oxygen demand (COD), total dissolved solids (TDS), conductivity, total solids (TS), suspended solids (SS), and turbidity are the most influential parameters for predicting BOD5 . The significance of this study is the application of the novel hybrid models that resulted in higher prediction success; RF-SVM with the highest R2 value (0.908). The employed novel hybrid ML models can be particularly useful for efficient and systematic data management, water pollution control, and prevention in developing countries such as Bangladesh. PRACTITIONER POINTS: Investigated the efficiency of four stand-alone and six novel hybrid ML models for predicting BOD in a river of Bangladesh. The significance of this study is the application of the six novel hybrid models that resulted in higher prediction success. The best three prediction models were RF-SVM, ANN-SVM, and GBM-SVM with a prediction success of 91%, 89.6%, and 88.8% respectively. ML models indicated COD, conductivity, TDS, TS, SS, and turbidity as the most influential variables for predicting BOD. The novel hybrid models can be useful for developing countries for efficient systematic data management, pollution control, and prevention strategies.
Collapse
Affiliation(s)
- Nabila Nafsin
- Department of Civil and Environmental Engineering, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| | - Jin Li
- Department of Civil and Environmental Engineering, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| |
Collapse
|
50
|
Khosravi K, Safari MJS, Sheikh Khozani Z, Crookston B, Golkarian A. Stacking ensemble-based hybrid algorithms for discharge computation in sharp-crested labyrinth weirs. Soft comput 2022. [DOI: 10.1007/s00500-022-07073-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|