1
|
Roohi AM, Nazif S, Ramazi P. Tackling data challenges in forecasting effluent characteristics of wastewater treatment plants. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 354:120324. [PMID: 38364537 DOI: 10.1016/j.jenvman.2024.120324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 01/21/2024] [Accepted: 02/08/2024] [Indexed: 02/18/2024]
Abstract
In wastewater treatment plants (WWTPs), the stochastic nature of influent wastewater and operational and weather conditions cause fluctuations in effluent quality. Data-driven models can forecast effluent quality a few hours ahead as a response to the influent characteristics, providing enough time to adjust system operations and avoid undesired consequences. However, existing data for training models are often incomplete and contain missing values. On the other hand, collecting additional data by installing new sensors is costly. The trade-off between using existing incomplete data and collecting costly new data results in three data challenges faced when developing data-driven WWTP effluent forecasters. These challenges are to determine important variables to be measured, the minimum number of required data instances, and the maximum percentage of tolerable missing values that do not impede the development of an accurate model. As these issues are not discussed in previous studies, in this research, for the first time, a comprehensive analysis is done to provide answers to these challenges. Another issue that arises in all data-driven modeling is how to select an appropriate forecasting model. This paper addresses these issues by first testing nine machine learning models on data collected from three wastewater treatment plants located in Iran, Australia, and Spain. The most accurate forecaster, Bayesian network, was then used to address the articulated challenges. Key variables in forecasting effluent characteristics were flow rate, total suspended solids, electrical conductivity, phosphorus compounds, wastewater temperature, and air temperature. A minimum of 250 samples was needed during the model training to achieve a great reduction in the forecasting error. Moreover, a steep increase in the error was observed should the portion of missing values exceed 10%. The results assist plant managers in estimating the necessary data collection effort to obtain an accurate forecaster, contributing to the quality of the effluent.
Collapse
Affiliation(s)
- Ali Mohammad Roohi
- School of Civil Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Sara Nazif
- School of Civil Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Pouria Ramazi
- Department of Mathematics and Statistics, Brock University, St. Catharines, ON, L2S 3A1, Canada
| |
Collapse
|
2
|
Khurshid A, Pani AK. Machine learning approaches for data-driven process monitoring of biological wastewater treatment plant: A review of research works on benchmark simulation model No. 1(BSM1). ENVIRONMENTAL MONITORING AND ASSESSMENT 2023; 195:916. [PMID: 37402850 DOI: 10.1007/s10661-023-11463-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 06/05/2023] [Indexed: 07/06/2023]
Abstract
In the past decade, machine learning techniques have seen wide industrial applications for design of data-based process monitoring systems with an aim to improve industrial productivity. An efficient process monitoring system for wastewater treatment process (WWTP) ensures increased efficiency and effluents meeting stringent emission norms. Benchmark simulation model No. 1 (BSM1) provides a simulation platform to researchers for developing efficient data-based process monitoring, quality monitoring, and process control systems for WWTPs. The present article presents a review of all research works reporting applications of various machine learning techniques for sensor and process fault detection of BSM1. The review focuses on process monitoring of biological wastewater treatment process, which uses a series of aerobic and anaerobic reactions followed by secondary settling process. Detailed information on various parameters monitored, different machine learning techniques explored, and results obtained by different researchers are presented in tabular and graphical format. In the review, it was observed that principal component analysis (PCA) and its variants account for the maximum number of research works for process monitoring in WWTPs and there are very few applications of recently developed deep learning techniques. Following the review and analysis, various future scopes of research (such as techniques yet to be explored or improvement of results for a particular fault) are also presented. These information will assist prospective researchers working on BSM1 to take forward the research.
Collapse
Affiliation(s)
- Amir Khurshid
- Department of Chemical Engineering, Birla Institute of Technology and Science, Pilani, Rajasthan, India, 333031
| | - Ajaya Kumar Pani
- Department of Chemical Engineering, Birla Institute of Technology and Science, Pilani, Rajasthan, India, 333031.
| |
Collapse
|
3
|
Review of Latest Advances in Nature-Inspired Algorithms for Optimization of Activated Sludge Processes. Processes (Basel) 2022. [DOI: 10.3390/pr11010077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
The activated sludge process (ASP) is the most widely used biological wastewater treatment system. Advances in research have led to the adoption of Artificial Intelligence (AI), in particular, Nature-Inspired Algorithm (NIA) techniques such as Genetic Algorithms (GAs) and Particle Swarm Optimization (PSO) to optimize treatment systems. This has aided in reducing the complexity and computational time of ASP modelling. This paper covers the latest NIAs used in ASP and discusses the advantages and limitations of each algorithm compared to more traditional algorithms that have been utilized over the last few decades. Algorithms were assessed based on whether they looked at real/ideal treatment plant (WWTP) data (and efficiency) and whether they outperformed the traditional algorithms in optimizing the ASP. While conventional algorithms such as Genetic Algorithms (GAs), Particle Swarm Optimization (PSO), and Ant Colony Optimization (ACO) were found to be successfully employed in optimization techniques, newer algorithms such as Whale Optimization Algorithm (WOA), Bat Algorithm (BA), and Intensive Weed Optimization Algorithm (IWO) achieved similar results in the optimization of the ASP, while also having certain unique advantages.
Collapse
|
4
|
Causal Analysis and Prevention Measures for Extreme Heavy Rainstorms in Zhengzhou to Protect Human Health. Behav Sci (Basel) 2022; 12:bs12060176. [PMID: 35735386 PMCID: PMC9219977 DOI: 10.3390/bs12060176] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 05/25/2022] [Accepted: 05/31/2022] [Indexed: 12/10/2022] Open
Abstract
This study focused on the extreme heavy rainstorm that occurred in Zhengzhou in July 2021; approximately 380 people were killed or missing as a result of this storm. To investigate the evolution behaviors of this rainstorm and take corresponding prevention measures, several methods and models were adopted, including cloud modeling, preliminary hazard analysis (PHA), fault tree analysis (FTA), bow-tie modeling, and chaos theory. The main reasons for this rainstorm can be divided into the following three aspects: force majeure, such as terrain and extreme weather conditions, issues with city construction, and insufficient emergency rescue. The secondary disasters caused by this rainstorm mainly include urban water logging, river flooding, and mountain torrents and landslides. The main causes of the subway line-5 accident that occurred can be described as follows: the location of the stabling yard was low, the relevant rules and regulations of the subway were not ideal, insufficient attention was given to the early warning information, and the emergency response mechanism was not ideal. Rainstorms result from the cross-coupling of faults in humans, objects, the environment, and management subsystems, and the evolution process shows an obvious butterfly effect. To prevent disasters caused by rainstorms, the following suggestions should be adopted: vigorously improve the risk awareness and emergency response capabilities of leading cadres, improve the overall level of urban disaster prevention and mitigation, reinforce the existing reservoirs in the city, strengthen the construction of sponge cities, and improve the capacity of urban disaster emergency rescue.
Collapse
|
5
|
Data-Driven Drift Detection in Real Process Tanks: Bridging the Gap between Academia and Practice. WATER 2022. [DOI: 10.3390/w14060926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Sensor drift in Wastewater Treatment Plants (WWTPs) reduces the efficiency of the plants and needs to be handled. Several studies have investigated anomaly detection and fault detection in WWTPs. However, these solutions often remain as academic projects. In this study, the gap between academia and practice is investigated by applying suggested algorithms on real WWTP data. The results show that it is difficult to detect drift in the data to a sufficient level due to missing and imprecise logs, ad hoc changes in control settings, low data quality and the equality in the patterns of some fault types and optimal operation. The challenges related to data quality raise the question of whether the data-driven approach for drift detection is the best solution, as this requires a high-quality data set. Several recommendations are suggested for utilities that wish to bridge the gap between academia and practice regarding drift detection. These include storing data and select data parameters at resolutions which positively contribute to this purpose. Furthermore, the data should be accompanied by sufficient logging of factors affecting the patterns of the data, such as changes in control settings.
Collapse
|
6
|
Xu Q, Xu K. Importance-Based Key Basic Event Identification and Evolution Mechanism Investigation of Hydraulic Support Failure to Protect Employee Health. SENSORS 2021; 21:s21217240. [PMID: 34770546 PMCID: PMC8587061 DOI: 10.3390/s21217240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 10/20/2021] [Accepted: 10/28/2021] [Indexed: 11/16/2022]
Abstract
Background: Although hydraulic support can help enterprises in their production activities, it can also cause fatal accidents. Methods: This study established a composite risk-assessment method for hydraulic support failure in the mining industry. The key basic event of hydraulic support failure was identified based on fault tree analysis and gray relational analysis, and the evolution mechanism of hydraulic support failure was investigated based on chaos theory, a synthetic theory model, and cause-and-effect-layer-of-protection analysis (LOPA). Results: After the basic events of hydraulic support failure are identified based on fault tree analysis, structure importance (SI), probability importance (PI), critical importance (CI), and Fussell–Vesely importance (FVI) can be calculated. In this study, we proposed the Fussell–Vesely–Xu importance (FVXI) to reflect the comprehensive impact of basic event occurrence and nonoccurrence on the occurrence probability of the top event. Gray relational analysis was introduced to determine the integrated importance (II) of basic events and identify the key basic events. According to chaos theory, hydraulic support failure is the result of cross-coupling and infinite amplification of faults in the employee, object, environment, and management subsystems, and the evolutionary process has an obvious butterfly effect and inherent randomness. With the help of the synthetic theory model, we investigated the social and organizational factors that may lead to hydraulic support failure. The key basic event, jack leakage, was analyzed in depth based on cause-and-effect-LOPA, and corresponding independent protection layers (IPLs) were identified to prevent jack leakage. Implications: The implications of these findings with respect to hydraulic support failure can be regarded as the foundation for accident prevention in practice.
Collapse
Affiliation(s)
- Qingwei Xu
- College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China
- Correspondence:
| | - Kaili Xu
- School of Resources and Civil Engineering, Northeastern University, Shenyang 110819, China;
| |
Collapse
|
7
|
Improved CBSO: A distributed fuzzy-based adaptive synthetic oversampling algorithm for imbalanced judicial data. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.04.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
8
|
AFCM-LSMA: New intelligent model based on Lévy slime mould algorithm and adaptive fuzzy C-means for identification of COVID-19 infection from chest X-ray images. ADVANCED ENGINEERING INFORMATICS 2021; 49. [PMCID: PMC8126092 DOI: 10.1016/j.aei.2021.101317] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Problem A worldwide challenge is to provide medical resources required for COVID-19 detection. They must be effective tools for fast detection and diagnose of the virus using a large number of tests; besides, they should be low-cost developments. While a chest X-ray scan is a powerful candidate tool, if several tests are carried out, the images produced by the devices must be interpreted accurately and rapidly. COVID-19 induces longitudinal pulmonary parenchymal ground-glass and consolidates pulmonary opacity, in some cases with rounded morphology and peripheral lung distribution, which is very difficult to predict in an early stage. Aim In this paper, we aim to develop a robust model to extract high-level features of COVID-19 from chest X-ray (CXR) images to help in rapid diagnosis. In specific, this paper proposes an optimization model for COVID-19 diagnosis based on adaptive Fuzzy C-means (AFCM) and improved Slime Mould Algorithm (SMA) based on Lévy distribution, namely AFCM-LSMA. Methods The SMA optimizer is proposed to adapt weights in oscillation mode and to mimic the process of generating positive and negative feedback from the propagation wave to shape the optimum path for food connectivity. Lévy motion is used as a permutation to perform a local search and to adapt SMA optimizer (LSMA) by generating several solutions that are apart from current candidates. Furthermore, it permits the optimizer to escape from local minima, examine large search areas and reach optimal solutions in fewer iterations with high convergence speed. The FCM algorithm is used to segment pulmonary regions from CXR images and is adapted to reduce time and amount of computations using histogram of the image intensities during the clustering process. Results The performance of the proposed AFCM-LSMA has been validated on CXR images and compared with different conventional machine learning and deep learning techniques, meta-heuristics methods, and different chaotic maps. The accuracies achieved by the proposed model are around (ACC = 0.96, RMSE = 0.23, Prec. = 0.98, F1_score = 0.98, MCC = 0.79, and Kappa = 0.79). Conclusion The experimental findings indicate that the proposed new method outperforms all other methods, which will be beneficial to the clinical practitioner for the early identification of infected COVID-19 patients.
Collapse
|
9
|
Ali R, Muhammad S, Takahashi RHC. Decision making viva genetic algorithm for the utilization of leftovers. INT J INTELL SYST 2021. [DOI: 10.1002/int.22359] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Raiz Ali
- Department of Mathematics Abdul Wali Khan University Mardan Khyber Pakhtunkhwa Pakistan
| | - Shakoor Muhammad
- Department of Mathematics Abdul Wali Khan University Mardan Khyber Pakhtunkhwa Pakistan
| | | |
Collapse
|
10
|
Pham QB, Sammen SS, Abba SI, Mohammadi B, Shahid S, Abdulkadir RA. A new hybrid model based on relevance vector machine with flower pollination algorithm for phycocyanin pigment concentration estimation. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2021; 28:10.1007/s11356-021-12792-2. [PMID: 33625698 DOI: 10.1007/s11356-021-12792-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 01/31/2021] [Indexed: 06/12/2023]
Abstract
Precise monitoring of cyanobacteria concentration in water resources is a daunting task. The development of reliable tools to monitor this contamination is an important research topic in water resources management. Indirect methods such as chlorophyll-a determination, cell counting, and toxin measurement of the cyanobacteria are tedious, cumbersome, and often lead to inaccurate results. The quantity of phycocyanin (PC) pigment is considered more appropriate for cyanobacteria monitoring. Traditional approaches for PC estimation are time-consuming, expensive, and require high expertise. Recently, some studies have proposed the application of artificial intelligence (AI) techniques to predict the amount of PC concentration. Nonetheless, most of these researches are limited to standalone modeling schemas such as artificial neural network (ANN), multilayer perceptron (MLP), and support vector machine (SVM). The independent schema provides imprecise results when faced with highly nonlinear systems and data uncertainties resulting from environmental disturbances. To alleviate the limitations of the existing models, this study proposes the first application of a hybrid AI model that integrates the potentials of relevance vector machine (RVM) and flower pollination algorithm (RVM-FPA) to predict the PC concentration in water resources. The performance of the hybrid model is compared with the standalone RVM model. The prediction performance of the proposed models was evaluated at two stations (stations 508 and 478) using different statistical and graphical performance evaluation methods. The results showed that the hybrid models exhibited higher performance at both stations compared to the standalone RVM model. The proposed hybrid RVM-FPA can therefore serve as a reliable predictive tool for PC concentration in water resources.
Collapse
Affiliation(s)
- Quoc Bao Pham
- Institute of Research and Development, Duy Tan University, Danang, 550000, Vietnam
- Faculty of Environmental and Chemical Engineering, Duy Tan University, Danang, 550000, Vietnam
| | - Saad Sh Sammen
- Department of Civil Engineering, College of Engineering, University of Diyala, Baqubah, Diyala Governorate, Iraq.
| | - Sani Isa Abba
- Faculty of Engineering, Department of Civil Engineering, Baze University, Abuja, Nigeria
| | - Babak Mohammadi
- Department of Physical Geography and Ecosystem Science, Lund University, Sölvegatan 12, SE-223 62, Lund, Sweden
| | - Shamsuddin Shahid
- Faculty of Engineering, School of Civil Engineering, Universiti Teknologi Malaysia (UTM), 81310, Skudai, Johor, Malaysia
| | - Rabiu Aliyu Abdulkadir
- Department of Electrical and Electronic, Kano University of Science and Technology, Wudil, Nigeria
- Department of Computer Science, Kano University of Science and Technology, Wudil, Nigeria
| |
Collapse
|
11
|
Combining Heuristics with Simulation and Fuzzy Logic to Solve a Flexible-Size Location Routing Problem under Uncertainty. ALGORITHMS 2021. [DOI: 10.3390/a14020045] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The location routing problem integrates both a facility location and a vehicle routing problem. Each of these problems are NP-hard in nature, which justifies the use of heuristic-based algorithms when dealing with large-scale instances that need to be solved in reasonable computing times. This paper discusses a realistic variant of the problem that considers facilities of different sizes and two types of uncertainty conditions. In particular, we assume that some customers’ demands are stochastic, while others follow a fuzzy pattern. An iterated local search metaheuristic is integrated with simulation and fuzzy logic to solve the aforementioned problem, and a series of computational experiments are run to illustrate the potential of the proposed algorithm.
Collapse
|
12
|
|
13
|
Anter AM, Bhattacharyya S, Zhang Z. Multi-stage fuzzy swarm intelligence for automatic hepatic lesion segmentation from CT scans. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106677] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|