1
|
Zamani MG, Nikoo MR, Al-Rawas G, Nazari R, Rastad D, Gandomi AH. Hybrid WT-CNN-GRU-based model for the estimation of reservoir water quality variables considering spatio-temporal features. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 358:120756. [PMID: 38599080 DOI: 10.1016/j.jenvman.2024.120756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 03/09/2024] [Accepted: 03/22/2024] [Indexed: 04/12/2024]
Abstract
Water quality indicators (WQIs), such as chlorophyll-a (Chl-a) and dissolved oxygen (DO), are crucial for understanding and assessing the health of aquatic ecosystems. Precise prediction of these indicators is fundamental for the efficient administration of rivers, lakes, and reservoirs. This research utilized two unique DL algorithms-namely, convolutional neural network (CNNs) and gated recurrent units (GRUs)-alongside their amalgamation, CNN-GRU, to precisely gauge the concentration of these indicators within a reservoir. Moreover, to optimize the outcomes of the developed hybrid model, we considered the impact of a decomposition technique, specifically the wavelet transform (WT). In addition to these efforts, we created two distinct machine learning (ML) algorithms-namely, random forest (RF) and support vector regression (SVR)-to demonstrate the superior performance of deep learning algorithms over individual ML ones. We initially gathered WQIs from diverse locations and varying depths within the reservoir using an AAQ-RINKO device in the study area to achieve this. It is important to highlight that, despite utilizing diverse data-driven models in water quality estimation, a significant gap persists in the existing literature regarding implementing a comprehensive hybrid algorithm. This algorithm integrates the wavelet transform, convolutional neural network (CNN), and gated recurrent unit (GRU) methodologies to estimate WQIs accurately within a spatiotemporal framework. Subsequently, the effectiveness of the models that were developed was assessed utilizing various statistical metrics, encompassing the correlation coefficient (r), root mean square error (RMSE), mean absolute error (MAE), and Nash-Sutcliffe efficiency (NSE) throughout both the training and testing phases. The findings demonstrated that the WT-CNN-GRU model exhibited better performance in comparison with the other algorithms by 13% (SVR), 13% (RF), 9% (CNN), and 8% (GRU) when R-squared and DO were considered as evaluation indices and WQIs, respectively.
Collapse
Affiliation(s)
- Mohammad G Zamani
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Mohammad Reza Nikoo
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Ghazi Al-Rawas
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Rouzbeh Nazari
- Department of Civil, Construction, and Environmental Engineering, The University of Alabama, Alabama, USA.
| | - Dana Rastad
- Department of Civil and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran.
| | - Amir H Gandomi
- Department of Engineering and I.T., University of Technology Sydney, Ultimo, NSW, 2007, Australia; University Research and Innovation Center (EKIK), Óbuda University, 1034, Budapest, Hungary.
| |
Collapse
|
2
|
Chen J, Li H, Felix M, Chen Y, Zheng K. >Water quality prediction of artificial intelligence model: a case of Huaihe River Basin, China. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:14610-14640. [PMID: 38273086 DOI: 10.1007/s11356-024-32061-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 01/15/2024] [Indexed: 01/27/2024]
Abstract
Accurate prediction of water quality contributes to the intelligent management of water resources. Water quality indices have time series characteristics and nonlinearity, but the existing models only focus on the forward time series when long short-term memory (LSTM) is introduced and do not consider the parallel computation on the model. Owing to this, a new neural network called LSTM-multihead attention (LMA) was constructed to predict water quality, using long short-term memory to process time series data and multihead attention for parallel computing and extracting feature information. Additionally, water quality indices have the issues of multiple data types and complex data correlations, as well as missing data and abnormal data problems in water quality data. In order to solve these problems, this study proposes a water quality prediction model called GRA-LMA-based linear interpolation, gray relational analysis and LMA. Two experiments are carried out to verify the predictive performance of the GRA-LMA with the water quality data of the Huaihe River Basin as a case study sample. The first experiment focuses on data processing, including the processing of missing data and abnormal data of water quality data, and the correlation analysis of water quality indices. Linear interpolation is adapted to process the missing data, while a combination of boxplot and histogram is adopted to analyze and eliminate the abnormal data, which is then repaired the abnormal data with linear interpolation. The gray relational analysis is adopted to calculate the correlation between different water quality indices, and water quality indices with high correlation are retained to determine the input variables of the water quality prediction model. The data processing results demonstrate that repairs can be made using linear interpolation without altering the pattern of data change and the model by using the gray relational analysis to reduce the quantity of data it needs as input. In the second experiment, the predictive capacity of GRA-LMA and existing models such as backpropagation neural network (BP), recurrent neural network (RNN), long short-term memory (LSTM), and gate recurrent unit (GRU) was evaluated and compared using different numerical and graphical performance evaluation metrics. Comparative experimental results show that the mean square error of pH, dissolved oxygen, chemical oxygen demand, ammonia nitrogen, electrical conductivity, turbidity, total phosphorus, and total nitrogen of GRA-LMA is reduced to 0.05890, 0.40196, 0.32454, 0.04368, 14.71003, 8.13252, 0.01558, and 0.14345. The results indicate that GRA-LMA has superior adaptability for predicting various water quality indices and can significantly lower the induced prediction error.
Collapse
Affiliation(s)
- Jing Chen
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, L69 3BX, UK
| | - Haiyang Li
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China.
| | - Manirankunda Felix
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
| | - Yudi Chen
- Faculty of Science and Engineering, University of Manchester, Oxford RD, Manchester, M139PL, UK
| | - Keqiang Zheng
- School of Electrical and Information Engineering, Anhui University of Science and Technology, No. 168, Taifeng Road, Huainan, 232001, China
| |
Collapse
|
3
|
Zamani MG, Nikoo MR, Jahanshahi S, Barzegar R, Meydani A. Forecasting water quality variable using deep learning and weighted averaging ensemble models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:124316-124340. [PMID: 37996598 DOI: 10.1007/s11356-023-30774-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 10/27/2023] [Indexed: 11/25/2023]
Abstract
Water quality variables, including chlorophyll-a (Chl-a), play a pivotal role in comprehending and evaluating the condition of aquatic ecosystems. Chl-a, a pigment present in diverse aquatic organisms, notably algae and cyanobacteria, serves as a valuable indicator of water quality. Thus, the objectives of this study encompass: (1) the assessment of the predictive capabilities of four deep learning (DL) models - namely, recurrent neural network (RNN), long short-term memory (LSTM), gated recurrence unit (GRU), and temporal convolutional network (TCN) - in forecasting Chl-a concentrations; (2) the incorporation of these DL models into ensemble models (EMs) employing genetic algorithm (GA) and non-dominated sorting genetic algorithm (NSGA-II) to harness the strengths of each standalone model; and (3) the evaluation of the efficacy of the developed EMs. Utilizing data collected at 15-min intervals from Small Prespa Lake (SPL) in Greece, the models employed hourly Chl-a concentration lag times, extending up to 6 h, as models' inputs to forecast Chla (t+1). The proposed models underwent training on 70% of the dataset and were subsequently validated on the remaining 30%. Among the standalone DL models, the GRU model exhibited superior performance in Chl-a forecasting, surpassing the RNN, LSTM, and TCN models by 8%, 2%, and 2%, respectively. Furthermore, the integration of DL models through single-objective GA and multi-objective NSGA-II optimization algorithms yielded hybrid models adept at effectively forecasting both low and high Chl-a concentrations. The ensemble model based on NSGA-II outperformed standalone DL models as well as the GA-based model across a range of evaluation indices. For instance, considering the R-squared metric, the study's findings demonstrated that the EM-NSGA-II stands out with exceptional effectiveness compared to DL and EM-GA models, showcasing improvements of 14% (RNN), 8% (LSTM), 6% (GRU), 8% (TCN), and 3% (EM-GA) during the testing phase.
Collapse
Affiliation(s)
- Mohammad G Zamani
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, Amirkabir University of Technology, Tehran, Iran
| | - Mohammad Reza Nikoo
- Department of Civil and Architectural Engineering, Sultan Qaboos University, Muscat, Oman.
| | - Sina Jahanshahi
- Department of Water Resources Engineering, Faculty of Civil, Water and Environmental Engineering, University of Tehran, Tehran, Iran
| | - Rahim Barzegar
- Groundwater Research Group (GRES), Research Institute on Mines and Environment (RIME), Université du Québec en Abitibi-Témiscamingue (UQAT), Amos, Québec, Canada
| | - Amirreza Meydani
- Department of Geography and Spatial Sciences, University of Delaware, Newark, DE, USA
| |
Collapse
|
4
|
Fang Y, Liu H. A spatiotemporal dissolved oxygen prediction model based on graph attention networks suitable for missing data. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023:10.1007/s11356-023-28030-w. [PMID: 37335513 DOI: 10.1007/s11356-023-28030-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 05/28/2023] [Indexed: 06/21/2023]
Abstract
The accurate prediction of dissolved oxygen concentration is crucial for the effective prevention and control of water pollution. A spatiotemporal prediction model for dissolved oxygen content that is suitable for missing data is proposed in this study. The model utilizes a module based on neural controlled differential equations (NCDEs) to handle missing data and graph attention networks (GATs) to capture the spatiotemporal relationship of dissolved oxygen content. To enhance the performance of model, it is optimized from three aspects: an iterative optimization method based on the k-nearest neighbor graph is proposed to enhance the quality of graph; Shapley additive explanations model (SHAP) is used to select the main features into model, enabling it to handle multiple features; and a fusion graph attention mechanism is introduced to improve the robustness of model to noise. The model is evaluated using data from water quality monitoring sites in Hunan Province, China, from January 14, 2021, to June 16, 2022. The proposed model outperforms other models in long-term prediction (step = 18), with MAE of 0.194, NSE of 0.914, RAE of 0.219, and IA of 0.977. The results demonstrate that constructing appropriate spatial dependencies can enhance the accuracy of dissolved oxygen prediction models, and the NCDE module confers robustness to missing data in the model.
Collapse
Affiliation(s)
- Yamin Fang
- Institute of Artificial Intelligence and Robotics (IAIR), Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic and Transportation Engineering, Central South University, Changsha, Hunan, 410075, China
| | - Hui Liu
- Institute of Artificial Intelligence and Robotics (IAIR), Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic and Transportation Engineering, Central South University, Changsha, Hunan, 410075, China.
| |
Collapse
|
5
|
Liu W, Liu T, Liu Z, Luo H, Pei H. A novel deep learning ensemble model based on two-stage feature selection and intelligent optimization for water quality prediction. ENVIRONMENTAL RESEARCH 2023; 224:115560. [PMID: 36842699 DOI: 10.1016/j.envres.2023.115560] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/19/2023] [Accepted: 02/22/2023] [Indexed: 06/18/2023]
Abstract
Accurate prediction of effluent total nitrogen (E-TN) can assist in feed-forward control of wastewater treatment plants (WWTPs) to ensure effluent compliance with standards while reducing energy consumption. However, multivariate time series prediction of E-TN is a challenge due to the complex nonlinearity of WWTPs. This paper proposes a novel prediction framework that combines a two-stage feature selection model, the Golden Jackal Optimization (GJO) algorithm, and a hybrid deep learning model, CNN-LSTM-TCN (CLT), aiming to effectively capture the nonlinear relationships of multivariate time series in WWTPs. Specifically, convolutional neural network (CNN), long short-term memory (LSTM), and temporal convolutional network (TCN) combined to build a hybrid deep learning model CNN-LSTM-TCN (CLT). A two-stage feature selection method is utilized to determine the optimal feature subset to reduce the complexity and improve the accuracy of the prediction model, and then, the feature subset is input into the CLT. The hyperparameters of the CLT are optimized using GJO to further improve the prediction performance. Experiments indicate that the two-stage feature selection model learns the optimal feature subset to predict best, and the GJO-CLT achieves the best performance for different backtracking windows and prediction steps. These results demonstrate that the prediction system excels in the task of multivariate water quality time series prediction of WWTPs.
Collapse
Affiliation(s)
- Wenli Liu
- Dept. of Construction Management, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| | - Tianxiang Liu
- Dept. of Construction Management, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| | - Zihan Liu
- Dept. of Construction Management, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| | - Hanbin Luo
- Dept. of Construction Management, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| | - Hanmin Pei
- Dept. of Construction Management, School of Civil and Hydraulic Engineering, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|