1
|
Afan HA, Melini Wan Mohtar WH, Khaleel F, Kamel AH, Mansoor SS, Alsultani R, Ahmed AN, Sherif M, El-Shafie A. Data-driven water quality prediction for wastewater treatment plants. Heliyon 2024; 10:e36940. [PMID: 39309819 PMCID: PMC11415851 DOI: 10.1016/j.heliyon.2024.e36940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 08/16/2024] [Accepted: 08/25/2024] [Indexed: 09/25/2024] Open
Abstract
Monitoring and managing wastewater treatment plants (WWTPs) is crucial for environmental protection. The presection of the quality of treated water is essential for energy efficient operation. The current research presents a comprehensive comparison of machine learning models for water quality parameter prediction in WWTPs. Four machine learning models presented in MLP, GFFR, MLP-PCA, and RBF were employed in this study. The primary notion of this study is to apply the proposed models using two distinct modeling scenarios. The first scenario represents a straightforward approach by utilizing the inputs and outputs of WWTPs; meanwhile, the second scenario involves using multi-step modeling techniques, which incorporate intermediate outputs induced by primary and secondary settlers. The study also investigates the potential of the adopted models to handle high dimensional data as a result of the multi-step modeling since more data points and outputs are progressively integrated at each step. The results show that the GFFR model outperforms the other models across both scenarios, specifically in the second scenario in predicting conductivity (COND) by providing higher correlation accuracy (R = 0.893) and lower prediction deviations (NRMSE = 0.091 and NMAE = 0.071). However, all models across both scenarios struggle to predict the other water quality parameters, generating significantly lower prediction correlations and higher prediction deviations. Nonetheless, the innovative multi-step technique in scenario two has significantly boosted the prediction capacity of all models, with improvement ranging from 0.2 % to 157 % and an average of 60 %. The implementation of AI models has proven its ability to accomplish high accuracy for WQ parameter prediction, highlighting the impact of leveraging intermediate process data.
Collapse
Affiliation(s)
| | - Wan Hanna Melini Wan Mohtar
- Department of Civil Engineering, Faculty of Engineering and Built Environment, Universiti Kebangsaan Malaysia, 43600, UKM Bangi, Selangor, Malaysia
- Environmental Management Center, Institute of Climate Change, Universiti Kebangsaan Malaysia, 43600, UKM Bangi, Selangor, Malaysia
| | - Faidhalrahman Khaleel
- Ministry of Electricity, The State Company of Electricity Production GCEP Middle Region, Baghdad, Iraq
- Department of Civil Engineering, Atatürk University, 25240, Erzurum, Turkey
| | - Ammar Hatem Kamel
- Upper Euphrates Basin Developing Center, University of Anbar, Iraq
- Dams and Water Resources Department, College of Engineering, University of Anbar, Iraq
| | | | - Riyadh Alsultani
- Building and Construction Techniques Engineering Department, College of Engineering and Engineering Techniques, Al-Mustaqbal University, 51001, Babylon, Iraq
| | - Ali Najah Ahmed
- Research Centre For Human-Machine Collaboration (HUMAC), School of Engineering and Technology, Sunway University, No. 5, Jalan Universiti, Bandar Sunway, 47500, Selangor Darul Ehsan, Malaysia
- Department of Engineering, School of Engineering and Technology, Sunway University, No. 5, Jalan Universiti, Bandar Sunway, 47500, Selangor Darul Ehsan, Malaysia
| | - Mohsen Sherif
- National Water and Energy Center, United Arab Emirate University, P.O. Box 15551, Al Ain, United Arab Emirates
- Civil and Environmental Eng. Dept., College of Engineering, United Arab Emirates University, Al Ain, 15551, United Arab Emirates
| | - Ahmed El-Shafie
- National Water and Energy Center, United Arab Emirate University, P.O. Box 15551, Al Ain, United Arab Emirates
| |
Collapse
|
2
|
Li D, Yang C, Li Y, Chen Y, Huang D, Liu Y. Learning a neural network-based soft sensor with double-errors parallel optimization towards effluent variable prediction in wastewater treatment plants. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 366:121907. [PMID: 39047433 DOI: 10.1016/j.jenvman.2024.121907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 06/14/2024] [Accepted: 07/14/2024] [Indexed: 07/27/2024]
Abstract
With the development of machine learning and artificial intelligence (ML/AI) models, data-driven soft sensors, especially the neural network-based, have widespread utilization for the prediction of key water quality indicators in wastewater treatment plants (WWTPs). However, recent research indicates that the prediction performance and computational efficiency are greatly compromised due to the time-varying, nonlinear and high-dimensional nature of the wastewater treatment process. This paper proposes a neural network-based soft sensor with double-errors parallel optimization to achieve more accurate prediction for effluent variables timely. Firstly, relying on the Activity Based Classification (ABC) principle, an ensemble variable selection method that combines Pearson correlation coefficient (PCC) and mutual information (MI) is introduced to select the optimal process variables as auxiliary variables, thereby reducing the data dimensionality and simplifying the model complexity. Subsequently, a double-errors parallel optimization methodology with minimizing both point prediction error and distribution error simultaneously is proposed, aiming to enhancing the training efficiency and the fitting quality of neural networks. Finally, the effectiveness is quantitatively assessed in two datasets collected from the Benchmark Simulation Model no. 1 (BMS1) and an actual oxidation ditch WWTP. The experimental results illustrate that the proposed soft sensor achieves precise effluent variable prediction, with RMSE, MAE and R2 values being 0.0606, 0.0486, 0.99930, and 0.06939, 0.05381, 0.98040, respectively. Consequently, this soft sensor can expedite the convergence speed in the neural network training process and enhance the prediction performance, thereby contributing to the effective optimization management of WWTPs.
Collapse
Affiliation(s)
- Dong Li
- The School of Automation, Central South University, Changsha, 410083, China.
| | - Chunhua Yang
- The School of Automation, Central South University, Changsha, 410083, China
| | - Yonggang Li
- The School of Automation, Central South University, Changsha, 410083, China
| | - Yan Chen
- The Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, School of Automation Science & Engineering, South China University of Technology, Wushan Road, Guang Zhou, 510640, China; The School of Data Science and Information Engineering, Guizhou Minzu University, Guiyang, 550025, China
| | - Daoping Huang
- The Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, School of Automation Science & Engineering, South China University of Technology, Wushan Road, Guang Zhou, 510640, China
| | - Yiqi Liu
- The Key Laboratory of Autonomous Systems and Networked Control, Ministry of Education, School of Automation Science & Engineering, South China University of Technology, Wushan Road, Guang Zhou, 510640, China.
| |
Collapse
|
3
|
Li D, Yang C, Li Y. A multi-subsystem collaborative Bi-LSTM-based adaptive soft sensor for global prediction of ammonia-nitrogen concentration in wastewater treatment processes. WATER RESEARCH 2024; 254:121347. [PMID: 38422697 DOI: 10.1016/j.watres.2024.121347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/03/2024] [Accepted: 02/19/2024] [Indexed: 03/02/2024]
Abstract
Ammonia-nitrogen concentration is a key water quality indicator, which reflects changes in pollutant components during wastewater treatment processes. The timely and accurate detection results contribute to optimizing control and operational management of wastewater treatment plants (WWTPs), but current detection methods only focus on the effluent location. This paper proposes a multi-subsystem collaborative Bi-LSTM-based adaptive soft sensor to achieve the global prediction of ammonia-nitrogen concentration. Firstly, the wastewater treatment process is divided into several independent subsystems depending on the reaction mechanism, and the variable selection is performed using mutual information. Subsequently, the bidirectional long short-term memory network (Bi-LSTM) is employed to construct a model for predicting ammonia-nitrogen concentration within each subsystem, and the outputs between neighboring subsystems are incorporated as a set of new variables added into the training dataset to strengthen their connection. Finally, to address performance degradation caused by environmental factors, a probability density function (PDF)-based dynamic moving window method is proposed to enhance the robustness. The effectiveness and superiority of the proposed soft sensor are validated in the Benchmark Simulation Model no. 1 (BSM1). The experimental results demonstrate that the proposed soft sensor can accurately predict the global ammonia-nitrogen concentration in the face of different weather conditions including sunny, rainy, and stormy days. This study contributes to the stable operation of WWTPs with higher treatment efficiency and lower economic costs.
Collapse
Affiliation(s)
- Dong Li
- The School of Automation, Central South University, Changsha 410 083, China
| | - Chunhua Yang
- The School of Automation, Central South University, Changsha 410 083, China.
| | - Yonggang Li
- The School of Automation, Central South University, Changsha 410 083, China
| |
Collapse
|
4
|
Sheikh Khozani Z, Ehteram M, Mohtar WHMW, Achite M, Chau KW. Convolutional neural network-multi-kernel radial basis function neural network-salp swarm algorithm: a new machine learning model for predicting effluent quality parameters. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:99362-99379. [PMID: 37610542 DOI: 10.1007/s11356-023-29406-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 08/16/2023] [Indexed: 08/24/2023]
Abstract
A wastewater treatment plant (WWTP) is an essential part of the urban water cycle, which reduces concentration of pollutants in the river. For monitoring and control of WWTPs, researchers develop different models and systems. This study introduces a new deep learning model for predicting effluent quality parameters (EQPs) of a WWTP. A method that couples a convolutional neural network (CNN) with a novel version of radial basis function neural network (RBFNN) is proposed to simultaneously predict and estimate uncertainty of data. The multi-kernel RBFNN (MKRBFNN) uses two activation functions to improve the efficiency of the RBFNN model. The salp swarm algorithm is utilized to set the MKRBFNN and CNN parameters. The main advantage of the CNN-MKRBFNN-salp swarm algorithm (SSA) is to automatically extract features from data points. In this study, influent parameters (if) are used as inputs. Biological oxygen demand (BODif), chemical oxygen demand (CODif), total suspended solids (TSSif), volatile suspended solids (VSSif), and sediment (SEDef) are used to predict EQPs, including CODef, BODef, and TSSef. At the testing level, the Nash-Sutcliffe efficiencies of CNN-MKRBFNN-SSA are 0.98, 0.97, and 0.98 for predicting CODef, BODef, and TSSef. Results indicate that the CNN-MKRBFNN-SSA is a robust model for simulating complex phenomena.
Collapse
Affiliation(s)
- Zohreh Sheikh Khozani
- Paleoclimate Dynamics Group, Alfred Wegener Institute, Helmholtz Center for Polar and Marine Research, 27570, Bremerhaven, Germany
| | - Mohammad Ehteram
- Department of Water Engineering, Semnan University, Semnan, Iran.
| | - Wan Hanna Melini Wan Mohtar
- Department of Civil Engineering, Faculty of Engineering & Built Environment, Universiti Kebangsaan Malaysia, 43600 UKM, Bangi, Selangor, Malaysia
| | - Mohammed Achite
- Water and Environment Laboratory, Hassiba Benbouali, University of Chlef, B.P. 78COuled Fares, 02180, Chlef, Algeria
| | - Kwok-Wing Chau
- Department of Civil and Environmental Engineering, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China
| |
Collapse
|
5
|
Gorgan-Mohammadi F, Rajaee T, Zounemat-Kermani M. Investigating machine learning models in predicting lake water quality parameters as a 3-year moving average. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:63839-63863. [PMID: 37059948 DOI: 10.1007/s11356-023-26830-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 04/03/2023] [Indexed: 04/16/2023]
Abstract
Lake water quality plays a vital role in the lake ecosystem, including biotic (for living creatures, such as plants, animals, and micro-organisms) and abiotic interactions. In this research, various types of machine learning (ML) methodologies, such as classification and regression tree (CART), chi-squared automatic interaction detector (CHAID), C5 tree, quick, unbiased, and efficient statistical tree (QUEST), along with multilayer perceptron (MLP) neural network, and radial basis function (RBF) neural network, are employed to predict the concentration of water quality parameters (P, EC, TDS, pH, DO, NH3, SO4, and θ). Lake Erie is situated at the international border of the USA and Canada. The C5 tree and QUEST tree are used to classify data and predict the number of groups, while the other methods are used to predict the concentration of water quality parameters in the form of a 3-year moving average. The greater matching between the observed and predicted data of dissolved oxygen (NSE = 0.978, bias = 0.126) shows that the CART decision tree has higher accuracy in correctly detecting the concentration of this parameter. The C5 tree could identify 33 groups correctly out of 36 total groups, which shows better accuracy for the C5 tree in classifying the data for this parameter.
Collapse
Affiliation(s)
| | - Taher Rajaee
- Department of Civil Engineering, University of Qom, Qom, Iran
| | | |
Collapse
|
6
|
|
7
|
DEIDS: a novel intrusion detection system for industrial control systems. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-06965-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
8
|
Self-organizing radial basis function neural network using accelerated second-order learning algorithm. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.10.065] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|