1
|
Nagaraju TV, Sri Bala G, Bonthu S, Mantena S. Modelling biochemical oxygen demand in a large inland aquaculture zone of India: Implications and insights. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 906:167386. [PMID: 37769733 DOI: 10.1016/j.scitotenv.2023.167386] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/10/2023] [Accepted: 09/24/2023] [Indexed: 10/03/2023]
Abstract
Water quality surveillance is tough, and a specific timely management is necessary for the inland aquaculture ponds and ecology as well. Real time quality monitoring involves the study of numerous parameters includes physical (turbidity, temperature, and specific conductivity), chemical (pH, calcium, manganese, chlorides, iron, biochemical oxygen demand), and biological (bacteria and algae). It is also crucial to recognize the inter-dependence among the parameters. Alternatively, these relationships can be predicted with statistical and numerical modelling. Organic strength parameter 5-day biochemical oxygen demand (BOD) is a significant parameter to evaluate since its impact is very high on the quality of water, aquatic life, and other biological concerns. This study focuses on the prediction of BOD using six traditional and four boosting algorithms considering ten input physicochemical attributes. The attributes were fine-tuned for highly precise predictions by removing extreme values from the data set using data outlier treatment. The prediction results are compared using performance metrics such as coefficient of determination (R2), root mean square error (RMSE), mean square error (MSE), and mean absolute error (MAE). The findings revealed that boosting algorithms outperform the results of traditional models with the highest prediction accuracy. Among the boosting algorithms, eXtreme Gradient Boosting algorithm (XGBM) is found highly appropriate for the inland aquaculture waters with R2 = 0.95, RMSE = 0.31, MSE = 0.09, MAE = 0.1. Finally, this study provides a systematic evaluation of the BOD in the aquaculture waters and has a significant contribution to water management and eco-balance.
Collapse
Affiliation(s)
- T Vamsi Nagaraju
- Department of Civil Engineering, SRKR Engineering College, India; Centre for Clean and Sustainable Environment, SRKR Engineering College, India.
| | - G Sri Bala
- Department of Civil Engineering, SRKR Engineering College, India; Centre for Clean and Sustainable Environment, SRKR Engineering College, India
| | - Sridevi Bonthu
- Department of Computer Science and Engineering, Vishnu Institute of Technology, India
| | - Sireesha Mantena
- Department of Geo-Engineering, College of Engineering, Andhra University, India
| |
Collapse
|
2
|
Mekaoussi H, Heddam S, Bouslimanni N, Kim S, Zounemat-Kermani M. Predicting biochemical oxygen demand in wastewater treatment plant using advance extreme learning machine optimized by Bat algorithm. Heliyon 2023; 9:e21351. [PMID: 37954260 PMCID: PMC10637896 DOI: 10.1016/j.heliyon.2023.e21351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 10/08/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
Wastewater quality modelling plays a vital role in planning and management of wastewater treatment plants (WWTP). This paper develops a new hybrid machine learning model based on extreme learning machine (ELM) optimized by Bat algorithm (ELM-Bat) for modelling five day effluent biochemical oxygen demand (BOD5). Specifically, this hybrid model combines the Bat algorithm for model parameters optimization and the standalone ELM. The proposed model was developed using historical measured effluents wastewater quality variables, i.e., the chemical oxygen demand (COD), temperature, pH, total suspended solid (TSS), specific conductance (SC) and the wastewater flow (Q). The performances of the hybrid ELM-Bat were compared with those of the multilayer perceptron neural network (MLPNN), the random forest regression (RFR), the Gaussian process regression (GPR), the random vector functional link network (RVFL), and the multiple linear regression (MLR) models. By comparing several input variables combination, the improvement achieved in the accuracy of prediction through the hybrid ELM-Bat was quantified. All models were first calibrated using training dataset and later tested using validation and based on four performances metrics namely, root mean square error (RMSE), mean absolute error (MAE), the correlation coefficient (R), and the Nash-Sutcliffe model efficiency (NSE). In all, it is concluded that the ELM-Bat is the most accurate model when all the six input were included as input variables, and it outperforms all other benchmark models in terms of predictive accuracy, exhibiting RMSE, MAE, R and NSE values of approximately, 0.885, 0.781, 2.621, and 1.989, respectively.
Collapse
Affiliation(s)
- Hayat Mekaoussi
- Institute of veterinary and agronomic sciences, Agronomy Department, Hydraulics Division, University Batna 1-Hadj Lakhdar- Allées 19 mai, Route de Biskra Batna, 05000 Algeria
- Laboratory of Research in Biodiversity Interaction Ecosystem and Biotechnology (LRIBEB) University 20 Août 1955 Skikda, Algeria
| | - Salim Heddam
- Laboratory of Research in Biodiversity Interaction Ecosystem and Biotechnology (LRIBEB), Faculty of Science, Agronomy Department, University 20 Août 1955-Skikda, Route El Hadaik, BP 26, Skikda, Algeria
| | - Nouri Bouslimanni
- Institute of veterinary and agronomic sciences, Agronomy Department, Chemical Division, University Batna 1-Hadj Lakhdar- Allées 19 mai, Route de Biskra Batna, 05000 Algeria
| | - Sungwon Kim
- Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju 36040, Republic of Korea
| | | |
Collapse
|
3
|
Elsayed A, Rixon S, Levison J, Binns A, Goel P. Application of classification machine learning algorithms for characterizing nutrient transport in a clay plain agricultural watershed. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2023; 345:118924. [PMID: 37678017 DOI: 10.1016/j.jenvman.2023.118924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Revised: 08/28/2023] [Accepted: 08/30/2023] [Indexed: 09/09/2023]
Abstract
Excess nutrients in surface water and groundwater can lead to water quality deterioration in available water resources. Thus, the classification of nutrient concentrations in water resources has gained significant attention during recent decades. Machine learning (ML) algorithms are considered an efficient tool to describe nutrient loss from agricultural land to surface water and groundwater. Previous studies have applied regression and classification ML algorithms to predict nutrient concentrations in surface water and/or groundwater, or to categorize an output variable using a limited number of input variables. However, there have been no studies that examined the application of different ML classification algorithms in agricultural settings to classify various output variables using a wide range of input variables. In this study, twenty-four ML classification algorithms were implemented on a dataset from three locations within the Upper Parkhill watershed, an agricultural watershed in southern Ontario, Canada. Nutrient concentrations in surface water were classified using geochemical and physical water parameters of surface water and groundwater (e.g., pH), climate and field conditions as the input variables. The performance of these algorithms was evaluated using four evaluation metrics (e.g., classification accuracy) to identify the optimal algorithm for classifying the output variables. Ensemble bagged trees was found to be the optimal ML algorithm for classifying nitrate concentration in surface water (accuracy of 90.9%), while the weighted KNN was the most appropriate algorithm for categorizing the total phosphorus concentration (accuracy of 87%). The ensemble subspace discriminant algorithm gave the highest overall classification accuracy for the concentration of soluble reactive phosphorus and total dissolved phosphorus in surface water with an accuracy of 79.2% and 77.9%, respectively. This study exemplifies that ML algorithms can be used to signify exceedance of recommended concentrations of nutrients in surface waters in agricultural watersheds. Results are useful for decision makers to develop nutrient management strategies.
Collapse
Affiliation(s)
- Ahmed Elsayed
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada; Irrigation and Hydraulics Department, Faculty of Engineering, Cairo University, 1 Gamaa Street, Giza, 12613, Egypt.
| | - Sarah Rixon
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Jana Levison
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Andrew Binns
- School of Engineering, Morwick G360 Groundwater Research Institute, University of Guelph, 50 Stone Road East, Guelph, Ontario, N1G 2W1, Canada
| | - Pradeep Goel
- Ministry of the Environment, Conservation and Parks (MECP), 125 Resources Road, Etobicoke, Ontario, M9P 3V6, Canada
| |
Collapse
|
4
|
Cotroneo S, Kang M, Clark ID, Bataille CP. Applying Machine Learning to investigate metal isotope variations at the watershed scale: A case study with lithium isotopes across the Yukon River Basin. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 896:165165. [PMID: 37394077 DOI: 10.1016/j.scitotenv.2023.165165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 06/14/2023] [Accepted: 06/25/2023] [Indexed: 07/04/2023]
Abstract
Constraining the multiple climatic, lithological, topographic, and geochemical variables controlling isotope variations in large rivers is often challenging with standard statistical methods. Machine learning (ML) is an efficient method for analyzing multidimensional datasets, resolving correlated processes, and exploring relationships between variables simultaneously. We tested four ML algorithms to elucidate the controls of riverine δ7Li variations across the Yukon River Basin (YRB). We compiled (n = 102) and analyzed new samples (n = 21), producing a dataset of 123 river water samples collected across the basin during the summer including δ7Li and extracted environmental, climatological, and geological characteristics of the drainage area for each sample from open-access geospatial databases. The ML models were trained, tuned, and tested under multiple scenarios to avoid issues such as overfitting. Random Forests (RF) performed best at predicting δ7Li across the basin, with the median model explaining 62 % of the variance. The most important variables controlling δ7Li across the basin are elevation, lithology, and past glacial coverage, which ultimately influence weathering congruence. Riverine δ7Li has a negative dependence on elevation. This reflects congruent weathering in kinetically-limited mountain zones with short residence times. The consistent ranking of lithology, specifically igneous and metamorphic rock cover, as a top feature controlling riverine δ7Li modeled by the RFs is unexpected. Further study is required to validate this finding. Rivers draining areas that were extensively covered during the last glacial maximum tend to have lower δ7Li due to immature weathering profiles resulting in short residence times, less secondary mineral formation and therefore more congruent weathering. We demonstrate that ML provides a fast, simple, visualizable, and interpretable approach for disentangling key controls of isotope variations in river water. We assert that ML should become a routine tool, and present a framework for applying ML to analyze spatial metal isotope data at the catchment scale.
Collapse
Affiliation(s)
- Sarina Cotroneo
- 25 Templeton Street, Department of Earth and Environmental Sciences, University of Ottawa, Ottawa, ON K1N 6N5, Canada.
| | - Myunghak Kang
- 25 Templeton Street, Department of Earth and Environmental Sciences, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Ian D Clark
- 25 Templeton Street, Department of Earth and Environmental Sciences, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| | - Clément P Bataille
- 25 Templeton Street, Department of Earth and Environmental Sciences, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
5
|
Rao KS, Tirth V, Almujibah H, Alshahri AH, Hariprasad V, Senthilkumar N. Optimization of water reuse and modelling by saline composition with nanoparticles based on machine learning architectures. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023; 87:2793-2805. [PMID: 37318924 PMCID: wst_2023_161 DOI: 10.2166/wst.2023.161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Water is a necessary resource that enables the existence of all life forms, including humans. Freshwater usage has become increasingly necessary in recent years. Facilities for treating seawater are less dependable and effective. Deep learning methods have the ability to improve salt particle analysis in saltwater's accuracy and efficiency, which will enhance the performance of water treatment plants. This research proposes a novel technique in optimization of water reuse with nanoparticle analysis based on machine learning architecture. Here, the optimization of water reuse is carried out based on nanoparticle solar cell for saline water treatment and the saline composition has been analyzed using a gradient discriminant random field. Experimental analysis is carried out in terms of specificity, computational cost, kappa coefficient, training accuracy, and mean average precision for various tunnelling electron microscope (TEM) image datasets. The bright-field TEM (BF-TEM) dataset attained a specificity of 75%, kappa coefficient of 44%, training accuracy of 81%, and mean average precision of 61%, whereas the annular dark-field scanning TEM (ADF-STEM) dataset produced specificity of 79%, kappa coefficient of 49%, training accuracy of 85%, and mean average precision of 66% as compared with the existing artificial neural network (ANN) approach.
Collapse
Affiliation(s)
- Koppula Srinivas Rao
- Department of Computer Science and Engineering, MLR Institute of Technology, Hyderabad, Telangana, India
| | - Vineet Tirth
- Mechanical Engineering Department, College of Engineering, King Khalid University, Abha, Asir 61421, Saudi Arabia; Research Center for Advanced Materials Science (RCAMS), King Khalid University, Guraiger, P.O. Box 9004, Abha, Asir 61413, Saudi Arabia
| | - Hamad Almujibah
- Department of Civil Engineering, College of Engineering, Taif University, P.O. Box 11099, Taif City 21974, Saudi Arabia
| | - Abdullah H Alshahri
- Department of Civil Engineering, College of Engineering, Taif University, P.O. Box 11099, Taif City 21974, Saudi Arabia
| | - V Hariprasad
- Department of Aerospace Engineering, Jain (Deemed-to-be) University, Jain Global Campus, Jakkasandra Post, Kanakapura 562112, India
| | - N Senthilkumar
- Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Thandalam, Chennai 602105, India E-mail:
| |
Collapse
|
6
|
Riazi M, Khosravi K, Shahedi K, Ahmad S, Jun C, Bateni SM, Kazakis N. Enhancing flood susceptibility modeling using multi-temporal SAR images, CHIRPS data, and hybrid machine learning algorithms. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 871:162066. [PMID: 36773901 DOI: 10.1016/j.scitotenv.2023.162066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Revised: 02/01/2023] [Accepted: 02/02/2023] [Indexed: 06/18/2023]
Abstract
Flood susceptibility maps are useful tool for planners and emergency management professionals in the early warning and mitigation stages of floods. In this study, Sentinel-1 dB radar images, which provide Synthetic-Aperture Radar (SAR) data were used to delineate flooded and non-flooded locations. 12 input parameters, including elevation, lithology, drainage density, rainfall, Normalized Difference Vegetation Index (NDVI), curvature, ground slope, Stream Power Index (SPI), Topographic Wetness Index (TWI), soil, Land Use Land Cover (LULC), and distance from the river, were selected for model development. The importance of each input parameter on flood occurrences was assessed via the Mutual Information (MI) technique. Several machine learning models, including Radial Basis Function (RBF), and three hybrid models of Bagging (BA-RBF), Random Committee (RC-RBF), and Random Subspace (RSS-RBF), were developed to delineate flood susceptibility areas at Goorganrood watershed, Iran. The performance of each model was evaluated using several error indicators, including correlation coefficient (r), Nash Sutcliffe Efficiency (NSE), Mean Absolute Error (MSE), Root Mean Square Error (RMSE), and the Area Under the Receiver Operating Characteristic (ROC) Curve (AUC). The results showed that the hybrid techniques enhanced the modeling performance of the standalone model, and generally, all hybrid models are more accurate than the standalone model. Although all developed models have performed well, RC-RBF outperforms all of them (AUC = 0.997), followed by BA-RBF (AUC = 0.996), RSS-RBF (AUC = 0.992), and RBF (AUC = 0.975). Generally, about 12 % of the study area has high and very high susceptibility to future flood occurrences.
Collapse
Affiliation(s)
- Mostafa Riazi
- Department of Civil Engineering, Islamic Azad University of Khomeinishahr, Khomeinishahr, Iran
| | - Khabat Khosravi
- Department of Earth and Environment, Florida International University, Miami, USA
| | - Kaka Shahedi
- Department of Watershed Management, Sari Agricultural Science and Natural Resources University, Sari, Iran
| | - Sajjad Ahmad
- Department of Civil and Environmental Engineering and Construction, University of Nevada, Las Vegas, USA
| | - Changhyun Jun
- Department of Civil and Environmental Engineering, College of Engineering, Chung-Ang University, Seoul, Republic of Korea.
| | - Sayed M Bateni
- Department of Civil and Environmental Engineering and Water Resources Research Center, University of Hawaii at Manoa, Honolulu, HI, USA
| | - Nerantzis Kazakis
- Department of Geology, Lab. of Engineering Geology & Hydrogeology, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece.
| |
Collapse
|
7
|
Madaki Z, Abacioglu N, Usman AG, Taner N, Sehirli AO, Abba SI. Novel Hybridized Computational Paradigms Integrated with Five Stand-Alone Algorithms for Clinical Prediction of HCV Status among Patients: A Data-Driven Technique. LIFE (BASEL, SWITZERLAND) 2022; 13:life13010079. [PMID: 36676028 PMCID: PMC9866913 DOI: 10.3390/life13010079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 12/19/2022] [Accepted: 12/21/2022] [Indexed: 12/29/2022]
Abstract
The emergence of health informatics opens new opportunities and doors for different disease diagnoses. The current work proposed the implementation of five different stand-alone techniques coupled with four different novel hybridized paradigms for the clinical prediction of hepatitis C status among patients, using both sociodemographic and clinical input variables. Both the visualized and quantitative performances of the stand-alone algorithms present the capability of the Gaussian process regression (GPR), Generalized neural network (GRNN), and Interactive linear regression (ILR) over the Support Vector Regression (SVR) and Adaptive neuro-fuzzy inference system (ANFIS) models. Hence, due to the lower performance of the stand-alone algorithms at a certain point, four different novel hybrid data intelligent algorithms were proposed, including: interactive linear regression-Gaussian process regression (ILR-GPR), interactive linear regression-generalized neural network (ILR-GRNN), interactive linear regression-Support Vector Regression (ILR-SVR), and interactive linear regression-adaptive neuro-fuzzy inference system (ILR-ANFIS), to boost the prediction accuracy of the stand-alone techniques in the clinical prediction of hepatitis C among patients. Based on the quantitative prediction skills presented by the novel hybridized paradigms, the proposed techniques were able to enhance the performance efficiency of the single paradigms up to 44% and 45% in the calibration and validation phases, respectively.
Collapse
Affiliation(s)
- Zachariah Madaki
- Department of Pharmacology, Faculty of Pharmacy, Near East University, North Cyprus, Mersin-10, 99138 Nicosia, Türkiye
| | - Nurettin Abacioglu
- Department of Pharmacology, Faculty of Pharmacy, Near East University, North Cyprus, Mersin-10, 99138 Nicosia, Türkiye
| | - A. G. Usman
- Operational Research Centre in Healthcare, Near East University, North Cyprus, Mersin-10, 99138 Nicosia, Türkiye
- Department of Analytical Chemistry, Faculty of Pharmacy, Near East University, North Cyprus, Mersin-10, 99138 Nicosia, Türkiye
- Correspondence: (A.G.U.); (S.I.A.)
| | - Neda Taner
- Department of Clinical Pharmacy, Faculty of Pharmacy, Istanbul Medipol University, 34810 Istanbul, Türkiye
| | - Ahmet. O. Sehirli
- Department of Pharmacology, Faculty of Dentistry, Nicosia, Near East University, North Cyprus, Mersin-10, 99138 Nicosia, Türkiye
| | - S. I. Abba
- Interdisciplinary Research Centre for Membrane and Water Security, Faculty of Petroleum and Minerals, King Fahd University, Dhahran 31261, Saudi Arabia
- Correspondence: (A.G.U.); (S.I.A.)
| |
Collapse
|
8
|
Moradi E, Darabi H, Heydari E, Karimi M, Kløve B. Vegetation vulnerability to hydrometeorological stresses in water-scarce areas using machine learning and remote sensing techniques. ECOL INFORM 2022. [DOI: 10.1016/j.ecoinf.2022.101838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
9
|
A Review of Hybrid Soft Computing and Data Pre-Processing Techniques to Forecast Freshwater Quality’s Parameters: Current Trends and Future Directions. ENVIRONMENTS 2022. [DOI: 10.3390/environments9070085] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Water quality has a significant influence on human health. As a result, water quality parameter modelling is one of the most challenging problems in the water sector. Therefore, the major factor in choosing an appropriate prediction model is accuracy. This research aims to analyse hybrid techniques and pre-processing data methods in freshwater quality modelling and forecasting. Hybrid approaches have generally been seen as a potential way of improving the accuracy of water quality modelling and forecasting compared with individual models. Consequently, recent studies have focused on using hybrid models to enhance forecasting accuracy. The modelling of dissolved oxygen is receiving more attention. From a review of relevant articles, it is clear that hybrid techniques are viable and precise methods for water quality prediction. Additionally, this paper presents future research directions to help researchers predict freshwater quality variables.
Collapse
|
10
|
Computational assessment of groundwater salinity distribution within coastal multi-aquifers of Bangladesh. Sci Rep 2022; 12:11165. [PMID: 35778436 PMCID: PMC9249886 DOI: 10.1038/s41598-022-15104-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 06/17/2022] [Indexed: 11/30/2022] Open
Abstract
The rising salinity trend in the country’s coastal groundwater has reached an alarming rate due to unplanned use of groundwater in agriculture and seawater seeping into the underground due to sea-level rise caused by global warming. Therefore, assessing salinity is crucial for the status of safe groundwater in coastal aquifers. In this research, a rigorous hybrid neurocomputing approach comprised of an Adaptive Neuro-Fuzzy Inference System (ANFIS) hybridized with a new meta-heuristic optimization algorithm, namely Aquila optimization (AO) and the Boruta-Random forest feature selection (FS) was developed for estimating the salinity of multi-aquifers in coastal regions of Bangladesh. In this regard, 539 data samples, including ten water quality indices, were collected to provide the predictive model. Moreover, the individual ANFIS, Slime Mould Algorithm (SMA), and Ant Colony Optimization for Continuous Domains (ACOR) coupled with ANFIS (i.e., ANFIS-SMA and ANFIS-ACOR) and LASSO regression (Lasso-Reg) schemes were examined to compare with the primary model. Several goodness-of-fit indices, such as correlation coefficient (R), the root mean squared error (RMSE), and Kling-Gupta efficiency (KGE) were used to validate the robustness of the predictive models. Here, the Boruta-Random Forest (B-RF), as a new robust tree-based FS, was adopted to identify the most significant candidate inputs and effective input combinations to reduce the computational cost and time of the modeling. The outcomes of four selected input combinations ascertained that the ANFIS-OA regarding the best accuracy in terms of (R = 0.9450, RMSE = 1.1253 ppm, and KGE = 0.9146) outperformed the ANFIS-SMA (R = 0.9406, RMSE = 1.1534 ppm, and KGE = 0.8793), ANFIS-ACOR (R = 0.9402, RMSE = 1.1388 ppm, and KGE = 0.8653), Lasso-Reg (R = 0.9358), and ANFIS (R = 0.9306) models. Besides, the first candidate input combination (C1) by three inputs, including Cl− (mg/l), Mg2+ (mg/l), Na+ (mg/l), yielded the best accuracy among all alternatives, implying the role importance of (B-RF) feature selection. Finally, the spatial salinity distribution assessment in the study area ascertained the high predictability potential of the ANFIS-OA hybrid with B-RF feature selection compared to other paradigms. The most important novelty of this research is using a robust framework comprised of the non-linear data filtering technique and a new hybrid neuro-computing approach, which can be considered as a reliable tool to assess water salinity in coastal aquifers.
Collapse
|
11
|
Alnuwaiser MA, Javed MF, Khan MI, Ahmed MW, Galal AM. Support vector regression and ANN approach for predicting the ground water quality. J INDIAN CHEM SOC 2022. [DOI: 10.1016/j.jics.2022.100538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
12
|
Application of Machine Learning and Process-Based Models for Rainfall-Runoff Simulation in DuPage River Basin, Illinois. HYDROLOGY 2022. [DOI: 10.3390/hydrology9070117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Rainfall-runoff simulation is vital for planning and controlling flood control events. Hydrology modeling using Hydrological Engineering Center—Hydrologic Modeling System (HEC-HMS) is accepted globally for event-based or continuous simulation of the rainfall-runoff operation. Similarly, machine learning is a fast-growing discipline that offers numerous alternatives suitable for hydrology research’s high demands and limitations. Conventional and process-based models such as HEC-HMS are typically created at specific spatiotemporal scales and do not easily fit the diversified and complex input parameters. Therefore, in this research, the effectiveness of Random Forest, a machine learning model, was compared with HEC-HMS for the rainfall-runoff process. Furthermore, we also performed a hydraulic simulation in Hydrological Engineering Center—Geospatial River Analysis System (HEC-RAS) using the input discharge obtained from the Random Forest model. The reliability of the Random Forest model and the HEC-HMS model was evaluated using different statistical indexes. The coefficient of determination (R2), standard deviation ratio (RSR), and normalized root mean square error (NRMSE) were 0.94, 0.23, and 0.17 for the training data and 0.72, 0.56, and 0.26 for the testing data, respectively, for the Random Forest model. Similarly, the R2, RSR, and NRMSE were 0.99, 0.16, and 0.06 for the calibration period and 0.96, 0.35, and 0.10 for the validation period, respectively, for the HEC-HMS model. The Random Forest model slightly underestimated peak discharge values, whereas the HEC-HMS model slightly overestimated the peak discharge value. Statistical index values illustrated the good performance of the Random Forest and HEC-HMS models, which revealed the suitability of both models for hydrology analysis. In addition, the flood depth generated by HEC-RAS using the Random Forest predicted discharge underestimated the flood depth during the peak flooding event. This result proves that HEC-HMS could compensate Random Forest for the peak discharge and flood depth during extreme events. In conclusion, the integrated machine learning and physical-based model can provide more confidence in rainfall-runoff and flood depth prediction.
Collapse
|
13
|
A Framework for Comparing Multi-Objective Optimization Approaches for a Stormwater Drainage Pumping System to Reduce Energy Consumption and Maintenance Costs. WATER 2022. [DOI: 10.3390/w14081248] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Reducing energy consumption and maintenance costs of a pumping system is seen as an important but difficult multi-objective optimization problem. Many evolutionary algorithms, such as particle swarm optimization (PSO), multi-objective particle swarm optimization (MOPSO), and non-dominated sorting genetic algorithm II (NSGA-II) have been used. However, a lack of comparison between these approaches poses a challenge to the selection of optimization approach for stormwater drainage pumping stations. In this paper, a new framework for comparing multi-objective approaches is proposed. Two kinds of evolutionary approaches, single-objective optimization and multi-objective optimization, are considered. Three approaches representing these two types are selected for comparison, including PSO with linear weighted sum method (PSO-LWSM), MOPSO with technique for order preference by similarity to an ideal solution (MOPSO-TOPSIS), and NSGA-II with TOPSIS (NSGA-II-TOPSIS). Four optimization objectives based on the number of pump startups/shutoffs, working hours, energy consumption, and drainage capacity are considered, of which the first two are new ones quantified in terms of operational economy in this paper. Two comparison methods—TOPSIS and operational economy and drainage capacity (E&C)—are used. The framework is demonstrated and tested by a case in China. The average values of the TOPSIS comprehensive evaluation index of the three approaches are 0.021, 0.154, and 0.375, respectively, and for E&C are 0.785, 0.813, and 0.839, respectively. The results show that the PSO-LWSM has better optimization results. The results validate the efficiency of the framework. The proposed framework will help to find a better optimization approach for pumping systems to reduce energy consumption and maintenance costs.
Collapse
|
14
|
Salinity Forecasting on Raw Water for Water Supply in the Chao Phraya River. WATER 2022. [DOI: 10.3390/w14050741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Frequent saltwater intrusions in the Chao Phraya River have had an impact on water supply to the residents of Bangkok and nearby areas. Although relocation of the raw water station is a long-term solution, it requires a large amount of time and investment. At present, knowing in advance when an intrusion occurs will support the waterworks authority in their operations. Here, we propose a method to forecast the salinity at the raw water pumping station from 24 h up to 120 h in advance. Each of the predictor variables has a physical impact on salinity. We explore a number of model candidates based on two common fitting methods: multiple linear regression and the artificial neural network. During model development, we found that the model behaved differently when the water level was high than when the water level was low (water level is measured at a point 164 km upstream of the raw water pumping station); therefore, we propose a novel multilevel model approach that combines different sub-models, each of which is suitable for a particular water level. The models have been trained and selected through cross-validation, and tested on real data. According to the test results, the salinity can be forecasted with an RMSE of 0.054 g L\({^{-1}}\) at a forecast period of 24 h and up to 0.107 g L\({^{-1}}\) at a forecast period of 120 h.
Collapse
|
15
|
Thorslund J, Bierkens MFP, Oude Essink GHP, Sutanudjaja EH, van Vliet MTH. Common irrigation drivers of freshwater salinisation in river basins worldwide. Nat Commun 2021; 12:4232. [PMID: 34244500 PMCID: PMC8270903 DOI: 10.1038/s41467-021-24281-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 06/08/2021] [Indexed: 02/06/2023] Open
Abstract
Freshwater salinisation is a growing problem, yet cross-regional assessments of freshwater salinity status and the impact of agricultural and other sectoral uses are lacking. Here, we assess inland freshwater salinity patterns and evaluate its interactions with irrigation water use, across seven regional river basins (401 river sub-basins) around the world, using long-term (1980-2010) salinity observations. While a limited number of sub-basins show persistent salinity problems, many sub-basins temporarily exceeded safe irrigation water-use thresholds and 57% experience increasing salinisation trends. We further investigate the role of agricultural activities as drivers of salinisation and find common contributions of irrigation-specific activities (irrigation water withdrawals, return flows and irrigated area) in sub-basins of high salinity levels and increasing salinisation trends, compared to regions without salinity issues. Our results stress the need for considering these irrigation-specific drivers when developing management strategies and as a key human component in water quality modelling and assessment.
Collapse
Affiliation(s)
- Josefin Thorslund
- grid.10548.380000 0004 1936 9377Department of Physical Geography and the Bolin Centre for Climate Research, Stockholm University, Stockholm, Sweden ,grid.5477.10000000120346234Department of Physical Geography, Utrecht University, Utrecht, The Netherlands
| | - Marc F. P. Bierkens
- grid.5477.10000000120346234Department of Physical Geography, Utrecht University, Utrecht, The Netherlands ,grid.6385.80000 0000 9294 0542Unit Subsurface and Groundwater Systems, Deltares, The Netherlands
| | - Gualbert H. P. Oude Essink
- grid.5477.10000000120346234Department of Physical Geography, Utrecht University, Utrecht, The Netherlands ,grid.6385.80000 0000 9294 0542Unit Subsurface and Groundwater Systems, Deltares, The Netherlands
| | - Edwin H. Sutanudjaja
- grid.5477.10000000120346234Department of Physical Geography, Utrecht University, Utrecht, The Netherlands
| | - Michelle T. H. van Vliet
- grid.5477.10000000120346234Department of Physical Geography, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
16
|
Kim S, Maleki N, Rezaie-Balf M, Singh VP, Alizamir M, Kim NW, Lee JT, Kisi O. Assessment of the total organic carbon employing the different nature-inspired approaches in the Nakdong River, South Korea. ENVIRONMENTAL MONITORING AND ASSESSMENT 2021; 193:445. [PMID: 34173069 DOI: 10.1007/s10661-021-08907-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Accepted: 01/26/2021] [Indexed: 06/13/2023]
Abstract
Total organic carbon (TOC) has vital significance for measuring water quality in river streamflow. The detection of TOC can be considered as an important evaluation because of issues on human health and environmental indicators. This research utilized the novel hybrid models to improve the predictive accuracy of TOC at Andong and Changnyeong stations in the Nakdong River, South Korea. A data pre-processing approach (i.e., complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN)) and evolutionary optimization algorithm (i.e., crow search algorithm (CSA)) were implemented for enhancing the accuracy and robustness of standalone models (i.e., multivariate adaptive regression spline (MARS) and M5Tree). Various water quality indicators (i.e., TOC, potential of Hydrogen (pH), electrical conductivity (EC), dissolved oxygen (DO), water temperature (WT), chemical oxygen demand (COD), and suspended solids (SS)) were utilized for developing the standalone and hybrid models based on three input combinations (i.e., categories 1~3). The developed models were evaluated utilizing the correlation coefficient (CC), root-mean-square error (RMSE), and Nash-Sutcliffe efficiency (NSE). The CEEMDAN-MARS-CSA based on category 2 (C-M-CSA2) model (CC = 0.762, RMSE = 0.570 mg/L, and NSE = 0.520) was the most accurate for predicting TOC at Andong station, whereas the CEEMDAN-MARS-CSA based on category 3 (C-M-CSA3) model (CC = 0.900, RMSE = 0.675 mg/L, and NSE = 0.680) was the best at Changnyeong station.
Collapse
Affiliation(s)
- Sungwon Kim
- Department of Railroad Construction and Safety Engineering, Dongyang University, Yeongju, 36040, South Korea.
| | - Niloofar Maleki
- Department of Civil Engineering, Pardisan University, Freidoonkenar, Iran
| | - Mohammad Rezaie-Balf
- Department of Civil Engineering, Graduate University of Advanced Technology-Kerman, P.O. Box 76315-116, Kerman, Iran
| | - Vijay P Singh
- Department of Biological and Agricultural Engineering & Zachry Department of Civil Engineering, Texas A&M University, College Station, TX, 77843-2117, USA
- National Water Center, UAE University, Al Ain, UAE
| | - Meysam Alizamir
- Department of Civil Engineering, Hamedan Branch, Islamic Azad University, Hamedan, Iran
| | - Nam Won Kim
- Department of Land, Water and Environment Research, Korea Institute of Civil Engineering and Building Technology, Goyang-si, 10223, South Korea
| | - Jong-Tak Lee
- Department of Water Supply and Wastewater, Hando Engineering & Architecture, Daegu, 42140, South Korea
| | - Ozgur Kisi
- Department of Civil Engineering, School of Technology, Ilia State University, Tbilisi, Georgia
- Institute of Research and Development, Duy Tan University, Da Nang, 550000, Vietnam
| |
Collapse
|
17
|
Modelling Freshwater Eutrophication with Limited Limnological Data Using Artificial Neural Networks. WATER 2021. [DOI: 10.3390/w13111590] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Artificial Neural Networks (ANNs) have wide applications in aquatic ecology and specifically in modelling water quality and biotic responses to environmental predictors. However, data scarcity is a common problem that raises the need to optimize modelling approaches to overcome data limitations. With this paper, we investigate the optimal k-fold cross validation in building an ANN using a small water-quality data set. The ANN was created to model the chlorophyll-a levels of a shallow eutrophic lake (Mikri Prespa) located in N. Greece. The typical water quality parameters serving as the ANN’s inputs are pH, dissolved oxygen, water temperature, phosphorus, nitrogen, electric conductivity, and Secchi disk depth. The available data set was small, containing only 89 data samples. For that reason, k-fold cross validation was used for training the ANN. To find the optimal k value for the k-fold cross validation, several values of k were tested (ranging from 3 to 30). Additionally, the leave-one-out (LOO) cross validation, which is an extreme case of the k-fold cross validation, was also applied. The ANN’s performance indices showed a clear trend to be improved as the k number was increased, while the best results were calculated for the LOO cross validation as expected. The computational times were calculated for each k value, where it was found the computational time is relatively low when applying the more expensive LOO cross validation; therefore, the LOO is recommended. Finally, a sensitivity analysis was examined using the ANN to investigate the interactions of the input parameters with the Chlorophyll-a, and hence examining the potential use of the ANN as a water management tool for nutrient control.
Collapse
|
18
|
Yoosefzadeh-Najafabadi M, Tulpan D, Eskandari M. Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits. PLoS One 2021; 16:e0250665. [PMID: 33930039 PMCID: PMC8087002 DOI: 10.1371/journal.pone.0250665] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Accepted: 04/12/2021] [Indexed: 12/02/2022] Open
Abstract
Improving genetic yield potential in major food grade crops such as soybean (Glycine max L.) is the most sustainable way to address the growing global food demand and its security concerns. Yield is a complex trait and reliant on various related variables called yield components. In this study, the five most important yield component traits in soybean were measured using a panel of 250 genotypes grown in four environments. These traits were the number of nodes per plant (NP), number of non-reproductive nodes per plant (NRNP), number of reproductive nodes per plant (RNP), number of pods per plant (PP), and the ratio of number of pods to number of nodes per plant (P/N). These data were used for predicting the total soybean seed yield using the Multilayer Perceptron (MLP), Radial Basis Function (RBF), and Random Forest (RF), machine learning (ML) algorithms, individually and collectively through an ensemble method based on bagging strategy (E-B). The RBF algorithm with highest Coefficient of Determination (R2) value of 0.81 and the lowest Mean Absolute Errors (MAE) and Root Mean Square Error (RMSE) values of 148.61 kg.ha-1, and 185.31 kg.ha-1, respectively, was the most accurate algorithm and, therefore, selected as the metaClassifier for the E-B algorithm. Using the E-B algorithm, we were able to increase the prediction accuracy by improving the values of R2, MAE, and RMSE by 0.1, 0.24 kg.ha-1, and 0.96 kg.ha-1, respectively. Furthermore, for the first time in this study, we allied the E-B with the genetic algorithm (GA) to model the optimum values of yield components in an ideotype genotype in which the yield is maximized. The results revealed a better understanding of the relationships between soybean yield and its components, which can be used for selecting parental lines and designing promising crosses for developing cultivars with improved genetic yield potential.
Collapse
Affiliation(s)
| | - Dan Tulpan
- Department of Animal Biosciences, University of Guelph, Guelph, Ontario, Canada
| | - Milad Eskandari
- Department of Plant Agriculture, University of Guelph, Guelph, Ontario, Canada
| |
Collapse
|
19
|
Abstract
Water is an essential resource that facilitates the existence of human life forms. In recent years, the demand for the consumption of freshwater has substantially increased. Seawater contains a high concentration of salt particles and salinity, making it unfit for consumption and domestic use. Water treatment plants used to treat seawater are less efficient and reliable. Deep learning systems can prove to be efficient and highly accurate in analyzing salt particles in seawater with higher efficiency that can improve the performance of water treatment plants. Therefore, this work classified different concentrations of salt particles in water using convolutional neural networks with the implementation of transfer learning. Salt salinity concentration images were captured using a designed Raspberry Pi based model and these images were further used for training purposes. Moreover, a data augmentation technique was also employed for the state-of-the-art results. Finally, a deep learning neural network was used to classify saline particles of varied concentration range images. The experimental results show that the proposed approach exhibited superior outcomes by achieving an overall accuracy of 90% and f-score of 87% in classifying salt particles. The proposed model was also evaluated using other evaluation metrics such as precision, recall, and specificity, and showed robust results.
Collapse
|
20
|
Modelling and Prediction of Water Quality by Using Artificial Intelligence. SUSTAINABILITY 2021. [DOI: 10.3390/su13084259] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Artificial intelligence methods can remarkably reduce costs for water supply and sanitation systems and help ensure compliance with the quality of drinking and wastewater treatment. Therefore, modelling and predicting water quality to control water pollution has been widely researched. The novelty of the proposed system is presented to develop an efficient operation of monitoring drinking water to ensure a sustainable and friendly green environment. In this work, the adaptive neuro-fuzzy inference system (ANFIS) algorithm was developed to predict the water quality index (WQI). Feed-forward neural network (FFNN) and K-nearest neighbors were applied to classify water quality. The dataset has eight significant parameters, but seven parameters were considered to show significant values. The proposed methodology was developed based on these statistical parameters. Prediction results demonstrated that the ANFIS model was superior for the prediction of WQI values. Nevertheless, the FFNN algorithm achieved the highest accuracy (100%) for water quality classification (WQC). Furthermore, the ANFIS model accurately predicted WQI, and the FFNN model showed superior robustness in classifying the WQC. In addition, the ANFIS model showed accuracy during the testing phase, with a regression coefficient of 96.17% for predicting WQI, and the FFNN model achieved the highest accuracy (100%) for WQC. This proposed method, using advanced artificial intelligence, can aid in water treatment and management.
Collapse
|
21
|
Bayesian Model Averaging: A Unique Model Enhancing Forecasting Accuracy for Daily Streamflow Based on Different Antecedent Time Series. SUSTAINABILITY 2020. [DOI: 10.3390/su12229720] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Streamflow forecasting is a vital task for hydrology and water resources engineering, and the different artificial intelligence (AI) approaches have been employed for this purposes until now. Additionally, the forecasting accuracy and uncertainty estimation are the meaningful assignments that need to be recognized. The addressed research investigates the potential of novel ensemble approach, Bayesian model averaging (BMA), in streamflow forecasting using daily time series data from two stations (i.e., Hongcheon and Jucheon), South Korea. Six categories (i.e., M1–M6) of input combination using different antecedent times were employed for streamflow forecasting. The outcomes of BMA model were compared with those of multivariate adaptive regression spline (MARS), M5 model tree (M5Tree), and Kernel extreme learning machines (KELM) models considering four assessment indexes, root mean square error (RMSE), Nash-Sutcliffe efficiency (NSE), correlation coefficient (R), and mean absolute error (MAE). The results revealed the superior accuracy of BMA model over three machine learning models in daily streamflow forecasting. Considering RMSE values among the best models during testing phase, the best BMA model (i.e., BMA2) enhanced the forecasting accuracy of MARS1, M5Tree4, and KELM3 models by 5.2%, 5.8%, and 3.4% in Hongcheon station. Additionally, the best BMA model (i.e., BMA1) improved the forecasting accuracy of MARS1, M5Tree1, and KELM1 models by 6.7%, 9.5%, and 3.7% in Jucheon station. In addition, the best BMA models in both stations allowed the uncertainty estimation, and produced higher uncertainty of peak flows compared to that of low flows. As one of the most robust and effective tools, therefore, the BMA model can be successfully employed for streamflow forecasting with different antecedent times.
Collapse
|
22
|
Glacier Mapping Based on Random Forest Algorithm: A Case Study over the Eastern Pamir. WATER 2020. [DOI: 10.3390/w12113231] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Debris-covered glaciers are common features on the eastern Pamir and serve as important indicators of climate change promptly. However, mapping of debris-covered glaciers in alpine regions is still challenging due to many factors including the spectral similarity between debris and the adjacent bedrock, shadows cast from mountains and clouds, and seasonal snow cover. Considering that few studies have added movement velocity features when extracting glacier boundaries, we innovatively developed an automatic algorithm consisting of rule-based image segmentation and Random Forest to extract information about debris-covered glaciers with Landsat-8 OLI/TIRS data for spectral, texture and temperature features, multi-digital elevation models (DEMs) for elevation and topographic features, and the Inter-mission Time Series of Land Ice Velocity and Elevation (ITS_LIVE) for movement velocity features, and accuracy evaluation was performed to determine the optimal feature combination extraction of debris-covered glaciers. The study found that the overall accuracy of extracting debris-covered glaciers using combined movement velocity features is 97.60%, and the Kappa coefficient is 0.9624, which is better than the extraction results using other schemes. The high classification accuracy obtained using our method overcomes most of the above-mentioned challenges and can detect debris-covered glaciers, illustrating that this method can be executed efficiently, which will further help water resources management.
Collapse
|