1
|
Saroughi M, Mirzania E, Achite M, Katipoğlu OM, Al-Ansari N, Vishwakarma DK, Chung IM, Alreshidi MA, Yadav KK. Evaluate effect of 126 pre-processing methods on various artificial intelligence models accuracy versus normal mode to predict groundwater level (case study: Hamedan-Bahar Plain, Iran). Heliyon 2024; 10:e29006. [PMID: 38601575 PMCID: PMC11004570 DOI: 10.1016/j.heliyon.2024.e29006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 03/25/2024] [Accepted: 03/28/2024] [Indexed: 04/12/2024] Open
Abstract
The estimation of groundwater levels is crucial and an important step in ensuring sustainable management of water resources. In this paper, selected piezometers of the Hamedan-Bahar plain located in west of Iran. The main objective of this study is to compare effect of various pre-processing methods on input data for different artificial intelligence (AI) models to predict groundwater levels (GWLs). The observed GWL, evaporation, precipitation, and temperature were used as input variables in the AI algorithms. Firstly, 126 method of data pre-processing was done by python programming which are classified into three classes: 1- statistical methods, 2- wavelet transform methods and 3- decomposition methods; later, various pre-processed data used by four types of widely used AI models with different kernels, which includes: Support Vector Machine (SVR), Artificial Neural Network (ANN), Long-Short Term memory (LSTM), and Pelican Optimization Algorithm (POA) - Artificial Neural Network (POA-ANN) are classified into three classes: 1- machine learning (SVR and ANN), 2- deep learning (LSTM) and 3- hybrid-ML (POA-ANN) models, to predict groundwater levels (GWLs). Akaike Information Criterion (AIC) were used to evaluate and validate the predictive accuracy of algorithms. According to the results, based on summation (train and test phases) of AIC value of 1778 models, average of AIC values for ML, DL, hybrid-ML classes, was decreased to -25.3%, -29.6% and -57.8%, respectively. Therefore, the results showed that all data pre-processing methods do not lead to improvement of prediction accuracy, and they should be selected very carefully by trial and error. In conclusion, wavelet-ANN model with daubechies 13 and 25 neurons (db13_ANN_25) is the best model to predict GWL that has -204.9 value for AIC which has grown by 5.23% (-194.7) compared to the state without any pre-processing method (ANN_Relu_25).
Collapse
Affiliation(s)
- Mohsen Saroughi
- Department of Irrigation and Reclamation Engineering, Faculty of Agricultural Engineering and Technology, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Ehsan Mirzania
- Department of Water Engineering, University of Tabriz, Tabriz, Iran
| | - Mohammed Achite
- Faculty of Nature and Life Sciences, Laboratory of Water and Environment, Hassiba Benbouali University of Chlef, Chlef, 02180, Algeria
| | - Okan Mert Katipoğlu
- Department of Civil Engineering, Erzincan Binali Yıldırım University, Erzincan, Turkey
| | - Nadhir Al-Ansari
- Department of Civil, Environmental, and Natural Resources Engineering, Lulea University of Technology, 97187, Lulea, Sweden
| | - Dinesh Kumar Vishwakarma
- Department of Irrigation and Drainage Engineering, Govind Ballabh Pant University of Agriculture and Technology, Pantnagar, Udham Singh Nagar, Uttarakhand, 263145, India
| | - Il-Moon Chung
- Department of Water Resources and River Research, Korea Institute of Civil Engineering and Building Technology, Goyang-si, 10223, Republic of Korea
| | | | - Krishna Kumar Yadav
- Faculty of Science and Technology, Madhyanchal Professional University, Ratibad, Bhopal, 462044, India
- Environmental and Atmospheric Sciences Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq
| |
Collapse
|
2
|
Boo KBW, El-Shafie A, Othman F, Khan MMH, Birima AH, Ahmed AN. Groundwater level forecasting with machine learning models: A review. WATER RESEARCH 2024; 252:121249. [PMID: 38330715 DOI: 10.1016/j.watres.2024.121249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 01/05/2024] [Accepted: 01/31/2024] [Indexed: 02/10/2024]
Abstract
Groundwater, the world's most abundant source of freshwater, is rapidly depleting in many regions due to a variety of factors. Accurate forecasting of groundwater level (GWL) is essential for effective management of this vital resource, but it remains a complex and challenging task. In recent years, there has been a notable increase in the use of machine learning (ML) techniques to model GWL, with many studies reporting exceptional results. In this paper, we present a comprehensive review of 142 relevant articles indexed by the Web of Science from 2017 to 2023, focusing on key ML models, including artificial neural networks (ANN), adaptive neuro-fuzzy inference systems (ANFIS), support vector regression (SVR), evolutionary computing (EC), deep learning (DL), ensemble learning (EN), and hybrid-modeling (HM). We also discussed key modeling concepts such as dataset size, data splitting, input variable selection, forecasting time-step, performance metrics (PM), study zones, and aquifers, highlighting best practices for optimal GWL forecasting with ML. This review provides valuable insights and recommendations for researchers and water management agencies working in the field of groundwater management and hydrology.
Collapse
Affiliation(s)
- Kenneth Beng Wee Boo
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia.
| | - Ahmed El-Shafie
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia; National Water and Energy Center, United Arab Emirates University, P.O. Box 15551, Al Ain, United Arab Emirates.
| | - Faridah Othman
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia.
| | - Md Munir Hayet Khan
- Faculty of Engineering & Quantity Surveying, INTI International University (INTI-IU), Persiaran Perdana BBN, Putra Nilai, 71800 Nilai, Negeri Sembilan, Malaysia.
| | - Ahmed H Birima
- Department of Civil Engineering, College of Engineering, Qassim University, Unaizah, Saudi Arabia.
| | - Ali Najah Ahmed
- School of Engineering and Technology, Sunway University, Bandar Sunway, Petaling Jaya, 47500, Malaysia; Institute of Energy Infrastructure (IEI) , Universiti Tenaga Nasional (UNITEN), 43000, Selangor, Malaysia.
| |
Collapse
|
3
|
Boo KBW, El-Shafie A, Othman F, Sherif M, Ahmed AN. Groundwater level forecasting using ensemble coactive neuro-fuzzy inference system. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 912:168760. [PMID: 38013106 DOI: 10.1016/j.scitotenv.2023.168760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 11/12/2023] [Accepted: 11/19/2023] [Indexed: 11/29/2023]
Abstract
A modeling framework utilizing the coactive neuro-fuzzy inference system (CANFIS) has been developed for multi-lead time groundwater level (GWL) forecasting in four different wells located in Texas and Florida, USA. Various model input combinations, including GWL, precipitation, temperature, and surface water level variables, have been derived based on proposed correlation analysis using singular spectrum analysis (SSA) remainders. The models have been trained on data subsets of varying lengths to identify the optimal training data duration. Additionally, we have introduced the bagging ensemble learning method to enhance the performance of the CANFIS model. As part of a comprehensive model evaluation process, the best-performing CANFIS model for each forecasting scenario has undergone uncertainty analysis using bootstrap sampling. Our results reveal that the CANFIS model performs satisfactorily for daily forecasting but leaves room for improvement in monthly forecasting, particularly for two-month and three-month ahead forecasts. Moreover, we have identified several optimal input combinations, highlighting the significance of the temperature variable in monthly forecasting. Furthermore, our findings indicate that additional training data does not necessarily lead to improved performance. The ensemble CANFIS model has demonstrated significant performance enhancement, particularly for monthly forecasting. Finally, the CANFIS model uncertainty analysis has shown satisfactory results for daily forecasting scenarios, while monthly forecasting models exhibit higher uncertainties, particularly during periods with distinctly different GWL fluctuation patterns.
Collapse
Affiliation(s)
- Kenneth Beng Wee Boo
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia.
| | - Ahmed El-Shafie
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia; National Water and Energy Center, United Arab Emirates University, P.O. Box 15551, Al Ain, United Arab Emirates.
| | - Faridah Othman
- Department of Civil Engineering, Faculty of Engineering, Universiti Malaya (UM), 50603 Kuala Lumpur, Malaysia.
| | - Mohsen Sherif
- Civil and Environmental Engineering Department, College of Engineering, United Arab Emirates University, 15551 Al Ain, United Arab Emirates.
| | - Ali Najah Ahmed
- School of Engineering and Technology, Sunway University, Bandar Sunway, Petaling Jaya, 47500, Malaysia
| |
Collapse
|
4
|
Yuan Y, Zhang D, Cui J, Zeng T, Zhang G, Zhou W, Wang J, Chen F, Guo J, Chen Z, Guo H. Land subsidence prediction in Zhengzhou's main urban area using the GTWR and LSTM models combined with the Attention Mechanism. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 907:167482. [PMID: 37839477 DOI: 10.1016/j.scitotenv.2023.167482] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 09/25/2023] [Accepted: 09/28/2023] [Indexed: 10/17/2023]
Abstract
In recent years, due to urbanization and human activities, groundwater overexploitation has become increasingly severe, resulting in some degrees of land subsidence and, consequently, causing a series of geological disasters and other environmental issues. Therefore, large-scale and high-precision land subsidence prediction is of great importance for the prevention and control of geological disasters. However, the existing prediction models and methods ignore the effects of the spatiotemporal non-stationary relationships between the influencing factors and the accumulated land subsidence, causing the poor accuracy of the predicted land subsidence results. In this context, a Geographically and Temporally Weighted Regression combined with the Long Short-Term Memory (LSTM)-multivariable and Attention Mechanism (AM) (GTWR-LSTMm-AM) was proposed to more accurately predict the deformation of time series land subsidence in this study. The small baseline subset-interferometric synthetic aperture radar (SBAS-InSAR) was used to reveal the temporal deformation information of Zhengzhou's main urban area, then the GTWR model was used to assess the spatiotemporal non-stationarity relationships between the accumulated land subsidence and its influencing factors monthly groundwater stability level, monthly precipitation and Normalized Difference Vegetation Index (NDVI) data, and to determine the corresponding weight matrix. In addition, we introduced an LSTM model with AM to extract key information from the time-series land subsidence data and adjusted the dynamic weights of the three selected influencing factors to predict the land subsidence in Zhengzhou's main urban area. The prediction accuracy R2 of the GTWR-LSTMm-AM model reaches 0.972, which is higher than 0.929 of the LSTMm model. The prediction accuracy RMSE is less than 3 mm and reaches 2.403 mm. In addition, we determined the importance of the impact factor on the subsidence results by randomly interrupting the impact factor time series, disclosuring that the monthly groundwater level contributed the most to the land subsidence in Zhengzhou's main urban area.
Collapse
Affiliation(s)
- Yonghao Yuan
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
| | - Dujuan Zhang
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China
| | - Jian Cui
- Henan Institute of Geological Survey, Zhengzhou 450001, China; National Engineering Laboratory Geological Remote Sensing Center for Remote Sensing Satellite Application, Zhengzhou 450001, China; Engineering Technology Innovation Center for Multi-factor Urban Geological Data of Zhongyuan City Cluster, Ministry of Natural Resources, Zhengzhou 450001, China
| | - Tao Zeng
- Henan Institute of Geological Survey, Zhengzhou 450001, China; National Engineering Laboratory Geological Remote Sensing Center for Remote Sensing Satellite Application, Zhengzhou 450001, China; Engineering Technology Innovation Center for Multi-factor Urban Geological Data of Zhongyuan City Cluster, Ministry of Natural Resources, Zhengzhou 450001, China
| | - Gubin Zhang
- Henan Institute of Geological Survey, Zhengzhou 450001, China; National Engineering Laboratory Geological Remote Sensing Center for Remote Sensing Satellite Application, Zhengzhou 450001, China; Engineering Technology Innovation Center for Multi-factor Urban Geological Data of Zhongyuan City Cluster, Ministry of Natural Resources, Zhengzhou 450001, China
| | - Wenge Zhou
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
| | - Jinyang Wang
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
| | - Feng Chen
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
| | - Jiahui Guo
- School of Geoscience and Technology, Zhengzhou University, Zhengzhou 450001, China
| | - Zugang Chen
- Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China.
| | - Hengliang Guo
- National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou 450001, China.
| |
Collapse
|
5
|
Niu X, Lu C, Zhang Y, Zhang Y, Wu C, Saidy E, Liu B, Shu L. Hysteresis response of groundwater depth on the influencing factors using an explainable learning model framework with Shapley values. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 904:166662. [PMID: 37657541 DOI: 10.1016/j.scitotenv.2023.166662] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 08/26/2023] [Accepted: 08/26/2023] [Indexed: 09/03/2023]
Abstract
Machine learning has been widely used for groundwater prediction. However, the hysteresis response of groundwater depth (GD) to input features has not been fully investigated. This study uses an interpretation method to reveal the interplay between climate, human activity, and GD while considering the response of groundwater to multiple factors. Six factors [precipitation (P), wind speed (WS), temperature (T), population (POP), gross domestic product (GDP), and effective irrigated area (EIA)] were selected to analyze the hysteresis response of GD in terms of the lag correlation coefficient and lag time. The correlation between climatic variables and GD was weaker than that of anthropogenic variables. The lag time between variables and different types of GD was less than four months at most sites, except for EIA and WS in deep groundwater. The SVM model achieved satisfactory performance in 89 % of the sites. If there were sharp changes in GD during the testing period or significant variations in its seasonal patterns at different times, the SVM model performed poorly. The model was interpreted using the Shapley additive explanation method. The impact of POP and GDP on deep groundwater in irrigated areas was higher than that of shallow groundwater. In urban areas with intensive human activities, anthropogenic variables were the main factors affecting shallow groundwater while the impact of climate was gradually increasing in the suburbs. The influence of precipitation on shallow groundwater was decreased after water transfer from the South-to-North Water Diversion project. Furthermore, this study proposed a multifactor-driven conceptual model that can provide recommendations for analyzing groundwater dynamics in similar areas.
Collapse
Affiliation(s)
- Xinyi Niu
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China
| | - Chengpeng Lu
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China; State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing 210098, Jiangsu, China.
| | - Ying Zhang
- Hydraulic Engineering Planning Bureau of Jiangsu Province, Nanjing 210029, Jiangsu, China
| | - Yong Zhang
- Department of Geological Sciences, University of Alabama, Tuscaloosa, AL 35487, USA
| | - Chengcheng Wu
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China
| | - Ebrima Saidy
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China
| | - Bo Liu
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China
| | - Longcang Shu
- College of Hydrology and Water Resources, Hohai University, Nanjing 210098, Jiangsu, China; State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, Nanjing 210098, Jiangsu, China
| |
Collapse
|
6
|
Gholizadeh H, Zhang Y, Frame J, Gu X, Green CT. Long short-term memory models to quantify long-term evolution of streamflow discharge and groundwater depth in Alabama. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 901:165884. [PMID: 37517717 DOI: 10.1016/j.scitotenv.2023.165884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 05/22/2023] [Accepted: 07/27/2023] [Indexed: 08/01/2023]
Abstract
Long short-term memory (LSTM) models have been shown to be efficient for rainfall-runoff modeling, and to a lesser extent, for groundwater depth forecasting. In this study, LSTMs were applied to quantify the spatiotemporal evolution of surface and subsurface hydrographs in Alabama in the Southeastern United States, where water sustainability has not been fully quantified across spatiotemporal scales. First, the surface water LSTM model with extensive dynamic (precipitation and other weather variables) and static (basin characteristics) inputs predicted the main characteristics of streamflow for six years at 19 gauged basins in Alabama. The model tended to underestimate extremely high streamflow but adding drainage density as an input feature slightly improved the predictions of extreme events. Second, to predict the groundwater depth evolution, a groundwater LSTM (GW-LSTM) model was proposed and applied using static inputs capturing the aquifers' hydrogeological properties and dynamic inputs of meteorological information. Three precipitation scenarios were also explored to evaluate the groundwater hydrograph evolution in the next two decades. The GW-LSTM model predicted the general trend of daily groundwater depth fluctuations (at 21 wells distributed across Alabama from 1990 to 2021) including most extremely high groundwater levels, and recovered groundwater depth for locations withheld from model training and validation. This study, therefore, extended the application of LSTMs in quantifying the spatiotemporal evolution of surface water and groundwater, two manifestations of a single integrated resource.
Collapse
Affiliation(s)
- Hossein Gholizadeh
- Department of Geological Sciences, University of Alabama, Tuscaloosa, AL 35487, USA
| | - Yong Zhang
- Department of Geological Sciences, University of Alabama, Tuscaloosa, AL 35487, USA.
| | | | - Xiufen Gu
- School of Mathematics and Information Science, Yantai University, Yantai, Shandong 264005, China
| | - Christopher T Green
- U.S. Geological Survey, Water Resources Mission Area, Moffett Field, CA 94035, USA
| |
Collapse
|
7
|
Yang S, Lian H, Xu B, Thanh HV, Chen W, Yin H, Dai Z. Application of robust deep learning models to predict mine water inflow: Implication for groundwater environment management. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 871:162056. [PMID: 36758705 DOI: 10.1016/j.scitotenv.2023.162056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 01/20/2023] [Accepted: 02/02/2023] [Indexed: 06/18/2023]
Abstract
Traditional mine water inflow prediction is characterized by a high degree of uncertainty in model parameters and complex mechanisms involved in the water inflow process. Data-driven models play a key role in predicting inflow mechanisms without considering physical changes. However, the existing models are limited by nonlinearity and non-stationarity. Thus, the principal objective of this study was to propose two robust models, the DIFF-TCN model and the DIFF-LSTM model, for predicting the average water inflow per day. The models consist of three methods, namely Difference Method (DIFF), Temporal Convolutional Neural Network (TCN), and Long Short-Term Memory Neural Network (LSTM). When applied to the Tingnan Coal Mine, Shanxi Province, China, the DIFF-TCN performs better in predicting the average daily water inflow, the model has a MAE of 5.88 m3/h, RMSE of 6.85 m3/h and R2 of 0.96 in the test stage of the water inflow event. Comparison with the other deep learning models (with similar complex structures) and traditional time series model shows the superiority of our proposed DIFF-TCN model. The SHAP value is used to explain the contribution of each model input to the predicted values, and it indicates that the historical time of water inflow data are the most important input, and the advance distance and the groundwater level data also contribute to the model predictions, but groundwater level data for some periods in the past may have a detrimental effect on the model. The findings of this study can provide better understanding about potential of robust deep learning models for smart hydrological forecasting, and they can also provide technical guidance for mining safety production and protection of water resources and water environment around the mining area.
Collapse
Affiliation(s)
- Songlin Yang
- College of Civil Engineering, Jilin University, Changchun, China
| | - Huiqing Lian
- Hebei State Key Laboratory of Mine Disaster Prevention, North China Institute of Science and Technology, Beijing Yanjiao 101601, China
| | - Bin Xu
- Hebei State Key Laboratory of Mine Disaster Prevention, North China Institute of Science and Technology, Beijing Yanjiao 101601, China
| | - Hung Vo Thanh
- Laboratory for Computational Mechanics, Institute for Computational Science and Artificial Intelligence, Van Lang University, Ho Chi Minh City, Viet Nam; Faculty of Mechanical - Electrical and Computer Engineering, School of Technology, Van Lang University, Ho Chi Minh City, Viet Nam
| | - Wei Chen
- College of Civil Engineering, Jilin University, Changchun, China
| | - Huichao Yin
- School of Information Engineering, Institute of Disaster Prevention, Langfang 065201, China.
| | - Zhenxue Dai
- College of Civil Engineering, Jilin University, Changchun, China; Key Laboratory of Groundwater Resources and Environment, Ministry of Education, Jilin University, Changchun, China.
| |
Collapse
|
8
|
Chidepudi SKR, Massei N, Jardani A, Henriot A, Allier D, Baulon L. A wavelet-assisted deep learning approach for simulating groundwater levels affected by low-frequency variability. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 865:161035. [PMID: 36587693 DOI: 10.1016/j.scitotenv.2022.161035] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 11/18/2022] [Accepted: 12/15/2022] [Indexed: 06/17/2023]
Abstract
Groundwater level (GWL) simulations allow the generation of reconstructions for exploring the past temporal variability of groundwater resources or provide the means for generating projections under climate change on decadal scales. In this context, analyzing GWLs affected by low-frequency variations is crucial. In this study, we assess the capabilities of three deep learning (DL) models (long short-term memory (LSTM), gated recurrent unit (GRU), and bidirectional LSTM (BiLSTM)) in simulating three types of GWLs affected by varying low-frequency behavior: inertial (dominated by low-frequency), annual (dominated by annual cyclicity) and mixed (in which both annual and low-frequency variations have high amplitude). We also tested if maximal overlap discrete wavelet transform pre-processing (MODWT) of input variables helps to better identify the frequency content most relevant for the models (MODWT-DL models). Only external variables (i.e., precipitation, air temperature as raw data, and effective precipitation (EP)) were used as input. Results indicate that for inertial-type GWLs, MODWT-DL models with raw data were notably more accurate than standalone models. However, DL models performed well for annual-type GWLs, while using EP as input, with MODWT-DL models exhibiting only minor improvements. Using raw data as input improved MODWT-DL models compared to standalone models; nevertheless, all models using EP performed better for annual-type GWLs. For mixed-type GWLs, while using EP as input, MODWT-DL models performed well, with substantial improvements over standalone models. Using raw data as input, improvement of MODWT-DL models is marginal compared to that of standalone models; nevertheless, they perform better than standalone models with EP. The Shapley Additive exPlanations (SHAP) approach used to interpret models highlighted that they preferentially learned from low-frequency in precipitation data to achieve the best simulations for inertial and mixed GWLs. This study showed that MODWT-based input pre-processing is highly suitable to better simulate low-frequency varying GWLs.
Collapse
Affiliation(s)
- Sivarama Krishna Reddy Chidepudi
- Univ Rouen Normandie, UNICAEN, CNRS, M2C UMR 6143, F-76000 Rouen, France; BRGM, 3 av. C. Guillemin, 45060 Orleans Cedex 02, France.
| | - Nicolas Massei
- Univ Rouen Normandie, UNICAEN, CNRS, M2C UMR 6143, F-76000 Rouen, France
| | - Abderrahim Jardani
- Univ Rouen Normandie, UNICAEN, CNRS, M2C UMR 6143, F-76000 Rouen, France
| | - Abel Henriot
- BRGM, 3 av. C. Guillemin, 45060 Orleans Cedex 02, France
| | | | - Lisa Baulon
- Univ Rouen Normandie, UNICAEN, CNRS, M2C UMR 6143, F-76000 Rouen, France; BRGM, 3 av. C. Guillemin, 45060 Orleans Cedex 02, France
| |
Collapse
|
9
|
Abba SI, Benaafi M, Usman AG, Ozsahin DU, Tawabini B, Aljundi IH. Mapping of groundwater salinization and modelling using meta-heuristic algorithms for the coastal aquifer of eastern Saudi Arabia. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 858:159697. [PMID: 36334664 DOI: 10.1016/j.scitotenv.2022.159697] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 10/20/2022] [Accepted: 10/20/2022] [Indexed: 06/16/2023]
Abstract
The growing increase in groundwater (GW) salinization in the coastal aquifers has reached an alarming socio-economic menace in Saudi Arabia and various places globally due to several natural and anthropogenic activities. Hence, evaluating the GW salinization is paramount to safeguarding the water resources planning and management. This study presents three different scenarios viz.: real field investigation, experimental laboratory analysis (using ion chromatography (IC) and inductively coupled plasma mass spectrometry (ICP-MS), etc.), and artificial intelligence (AI) based metaheuristic optimization (MO) algorithms in Saudi Arabia. The main purpose of this study is to validate the obtained experimental-based analysis using hybrid MO techniques comprising of adaptive neuro-fuzzy inference system (ANFIS) hybridized with genetic algorithm (GA), particle swarm optimization (PSO), and biogeography-based optimization (BBO) for identification of GW salinization in the coastal region of eastern Saudi Arabia. Additionally, ArcGIS 10.3 software generates the prediction map based on ANFIS-GA, ANFIS-PSO, and ANFIS-BBO. Feature selection was assessed using the PSO algorithm, and four indices evaluated the estimated models, namely, root mean square error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), and standard deviation (SD). The simulated results are based on three variable input combinations, which showed that the ANFIS-PSO (MAE = 0.00439) algorithm had the highest accuracy (99 %), followed by the ANFIS-GA (MAE = 0.00767) and ANFIS-BBO (MAE = 0.0132) algorithms. Besides, Ca2+, Na+, Mg2+, and Cl- were the most influential parameters. The accuracy also demonstrated the potential reliability of MO algorithms based on spatial distribution mapping. The employed approach proved to be merit and reliable tool for water resources decision-makers in the coastal aquifer of Saudi Arabia. This approach is believed to improve water scarcity as one of the essential targets for Goal 6 of Sustainable Development Vision 2030 and the Kingdom in general.
Collapse
Affiliation(s)
- S I Abba
- Interdisciplinary Research Center for Membranes and Water Security, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia
| | - Mohammed Benaafi
- Interdisciplinary Research Center for Membranes and Water Security, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia.
| | - A G Usman
- Near East University, Operational Research Center in Healthcare, Nicosia 99138, TRNC Mersin 10, Turkey; Department of Analytical Chemistry, Faculty of Pharmacy, Near East University, TRNC, Mersin 10, 99138 Nicosia, Turkey
| | - Dilber Uzun Ozsahin
- Sharjah University, College of Health Sciences, Department of Medical Diagnostic Imaging, United Arab Emirates; Near East University, Operational Research Center in Healthcare, Nicosia 99138, TRNC Mersin 10, Turkey
| | - Bassam Tawabini
- Interdisciplinary Research Center for Membranes and Water Security, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia; College of Petroleum Engineering and Geosciences, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia
| | - Isam H Aljundi
- Interdisciplinary Research Center for Membranes and Water Security, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia; Department of Chemical Engineering, King Fahd University of Petroleum & Minerals, Dhahran 31261, Saudi Arabia
| |
Collapse
|
10
|
Samani S, Vadiati M, Nejatijahromi Z, Etebari B, Kisi O. Groundwater level response identification by hybrid wavelet-machine learning conjunction models using meteorological data. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:22863-22884. [PMID: 36308648 DOI: 10.1007/s11356-022-23686-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 10/13/2022] [Indexed: 06/16/2023]
Abstract
Due to its heterogeneous and complex nature, groundwater modeling needs great effort to quantify the aquifer, a crucial tool for policymakers and hydrogeologists to understand the variations in groundwater levels (GWL). This study proposed a set of supervised machine learning (ML) models to delineate the GWL changes in the Zarand-Saveh complex aquifer in Iran using 15-year (2005-2020) monthly dataset. The wavelet transform (WT) procedure was also used to improve the GWL prediction ability of ML models for 3-month horizons using input datasets of precipitation, evapotranspiration, temperature, and GWL. The four well-accepted standalone ML methods, i.e., artificial neural network (ANN), adaptive neuro-fuzzy inference system (ANFIS), group method of data handling (GMDH), and least square support vector machine (LSSVM), were implemented and compared with the hybrid wavelet conjunction models. The methods were compared based on root mean square error (RMSE), mean absolute error (MAE), correlation coefficient (R), and Nash-Sutcliffe efficiency (NSE). Comparison outcomes showed that the hybrid wavelet-ML considerably improved the standalone model results. The wavelet transform-least square support vector machine (WT-LSSVM) model was superior to other standalone and hybrid wavelet-ML methods to predict GWL. The best GWL predictions were acquired from the WT-LSSVM model with input scenario 5 involving all influential variables, and this model produced RMSE, MAE, R, and NSE as 0.05, 0.04, 0.99, and 0.99 for 1 month ahead of GWL prediction, while the corresponding values were obtained as 0.18, 0.14, 0.95, and 0.90 for 3 months ahead of GWL prediction, respectively.
Collapse
Affiliation(s)
- Saeideh Samani
- Department of Water Resources Study and Research, Water Research Institute (WRI), Tehran Province, District 4, Bahar Blvd, Tehran, Iran
| | - Meysam Vadiati
- Global Affairs, Hubert H. Humphrey Fellowship Program, University of California, 10 College Park, Davis, CA, 95616, USA.
| | - Zohre Nejatijahromi
- Department of Minerals and Hydrogeology, Faculty of Earth Sciences, Shahid Beheshti University, Evin Ave, Tehran, Iran
| | - Behrooz Etebari
- CalNRA/Dept. of Water Resources/ Sustainable Groundwater Management Office, 715 P Street, Sacramento, CA, USA
| | - Ozgur Kisi
- Department of Civil Engineering, Technical University of Lübeck, 23562, Lübeck, Germany
- Department of Civil Engineering, Ilia State University, 0162, Tbilisi, Georgia
| |
Collapse
|
11
|
Wang Y, Han Q, Zhan D, Li Q. A Data-Driven OBE Magnetic Interference Compensation Method. SENSORS (BASEL, SWITZERLAND) 2022; 22:7732. [PMID: 36298084 PMCID: PMC9607135 DOI: 10.3390/s22207732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 09/30/2022] [Accepted: 10/07/2022] [Indexed: 06/16/2023]
Abstract
Aeromagnetic compensation is a technology used to reduce aircraft magnetic interference, which plays an important role in aeromagnetic surveys. In addition to maneuvering interferences, the onboard electronic (OBE) interference has been proven to be a significant part of aircraft interference, which must be reduced before further interpretation of aeromagnetic data. In the past, most researchers have focused on establishing linear models to compensate for OBE magnetic interference. However, such methods can only work using accurate reference sensors. In this paper, we propose a data-driven OBE interference compensation method, which can reduce OBE interference without relying on any other reference sensor. This network-based method can integrally detect and repair the OBE magnetic interference. The proposed method builds a prediction model by combining wavelet decomposition with a long short-term memory (LSTM) network to detect and predict OBE interference, and then estimates the local variation of the magnetic field to remove the drift of the interference. In our tests, we construct 10 semi-real datasets to quantitatively evaluate the performance of the proposed method. The F1 score of the proposed method for OBE interference detection is over 0.79, and the RMSE of the compensated signal is less than 0.009 nT. Moreover, we also test our method on real signals, and the results show that our method can detect all interference and significantly reduce the standard deviation of the magnetic field.
Collapse
Affiliation(s)
| | - Qi Han
- Correspondence: ; Tel.: +86-1393-662-2926
| | | | | |
Collapse
|
12
|
Yu JW, Kim JS, Li X, Jong YC, Kim KH, Ryang GI. Water quality forecasting based on data decomposition, fuzzy clustering and deep learning neural network. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 303:119136. [PMID: 35283198 DOI: 10.1016/j.envpol.2022.119136] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 02/12/2022] [Accepted: 03/09/2022] [Indexed: 06/14/2023]
Abstract
Water quality forecasting can provide useful information for public health protection and support water resources management. In order to forecast water quality more accurately, this paper proposes a novel hybrid model by combining data decomposition, fuzzy C-means clustering and bidirectional gated recurrent unit. Firstly, the original water quality data is decomposed into several subseries by empirical wavelet transform, and then, the decomposed subseries are recombined by fuzzy C-means clustering. Next, for each clustered series, bidirectional gated recurrent unit is applied to develop prediction model. Finally, the forecast result is obtained by the summation of the predictions for the subseries. The proposed forecast model is evaluated by the water quality data of Poyang Lake, China. Results show that the proposed forecast model provides highly accurate forecast result for all of the six water quality data: the average of MAPE of the forecast results for the six water quality datasets is 4.59% for 7 day ahead prediction. Furthermore, our model shows better forecast performance than the other models. Particularly, compared with the single BiGRU model, MAPE decreased by 32.86% in average. Results demonstrate that the proposed forecast model can be used effectively for water quality forecasting.
Collapse
Affiliation(s)
- Jin-Won Yu
- School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin, 300384, China; University of Science, Pyongyang, 999091, Democratic People's Republic of Korea
| | - Ju-Song Kim
- School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin, 300384, China; University of Science, Pyongyang, 999091, Democratic People's Republic of Korea
| | - Xia Li
- School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin, 300384, China.
| | - Yun-Chol Jong
- University of Science, Pyongyang, 999091, Democratic People's Republic of Korea
| | - Kwang-Hun Kim
- University of Science, Pyongyang, 999091, Democratic People's Republic of Korea
| | - Gwang-Il Ryang
- University of Science, Pyongyang, 999091, Democratic People's Republic of Korea
| |
Collapse
|
13
|
Jiang Z, Yang S, Liu Z, Xu Y, Shen T, Qi S, Pang Q, Xu J, Liu F, Xu T. Can ensemble machine learning be used to predict the groundwater level dynamics of farmland under future climate: a 10-year study on Huaibei Plain. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:44653-44667. [PMID: 35133582 DOI: 10.1007/s11356-022-18809-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 01/19/2022] [Indexed: 06/14/2023]
Abstract
Accurate and simple prediction of farmland groundwater level (GWL) is an important aspect of agricultural water management. A farmland GWL prediction model, GWPRE, was developed that integrates four machine learning (ML) models (support vector machine regression, random forest, multiple perceptions, and the stacking ensemble model) with weather forecasts. Based on the GWL and meteorological data of five monitoring wells (N1, N2, N3, N4, and N5) in Huaibei plain from 2010 to 2020, the feasibility of predicting GWL by meteorological factors and ML algorithm was tested. In addition, the stacking ensemble model and future meteorological data after Bayesian model averaging were introduced for the first time to predict GWL under future climate conditions. The results showed that GWL showed an increasing trend in the past decade, but it will decrease in the future. The performance of the stacking ensemble model was better than that of any single ML model, with RMSE reduced by 4.26 ~ 96.97% and the running time reduced by 49.25 ~ 99.40%. GWL was most sensitive to rainfall, and the sensitivity index ranged from 0.2547 to 0.4039. The fluctuation range of GWL of N1, N2, and N3 was 1.5 ~ 2.5 m in the next decade. Due to the possible high rainfall, the GWL decreased in 2024 under RCP 2.6 and 2026 under RCP 8.5. It is worth noting that although the stacking ensemble model can improve the accuracy, it is not always the best among ML models in terms of portability. Nevertheless, the stacking ensemble model was recommended for GWL prediction under climate change.
Collapse
Affiliation(s)
- Zewei Jiang
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China
| | - Shihong Yang
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China.
- State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, Hohai University, 1 Xikang Road, Nanjing, 210098, People's Republic of China.
- Cooperative Innovation Center for Water Safety & Hydro Science, Hohai University, Nanjing, 210098, People's Republic of China.
| | - Zhenyang Liu
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China
| | - Yi Xu
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China
| | - Tao Shen
- Anhui and Huaihe River Institute of Hydraulic Research (Anhui Province Key Laboratory of Water Conservancy and Water Resources), Bengbu, 233000, People's Republic of China
| | - Suting Qi
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China
| | - Qingqing Pang
- Ministry of Ecology and Environment, Nanjing Institute of Environmental Sciences, Nanjing, 210042, People's Republic of China
| | - Junzeng Xu
- College of Agricultural Science and Engineering, Hohai University, Nanjing, 210098, People's Republic of China
| | - Fangping Liu
- Jiangxi Irrigation Experiment Central Station, 309 Yinhe Road, Nanchang, 330201, Jiangxi, People's Republic of China
| | - Tao Xu
- Jiangxi Irrigation Experiment Central Station, 309 Yinhe Road, Nanchang, 330201, Jiangxi, People's Republic of China
| |
Collapse
|
14
|
A Hybrid Triple Collocation-Deep Learning Approach for Improving Soil Moisture Estimation from Satellite and Model-Based Data. REMOTE SENSING 2022. [DOI: 10.3390/rs14071744] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Satellite retrieval and land surface models have become the mainstream methods for monitoring soil moisture (SM) over large regions; however, the uncertainty and coarse spatial resolution of these products limit their applications at the regional and local scales. We proposed a hybrid approach combining the triple collocation (TC) and the long short-term memory (LSTM) network, which was designed to generate a high-quality SM dataset from satellite and modeled data. We applied the proposed approach to merge SM data from Soil Moisture Active Passive (SMAP), Global Land Data Assimilation System-Noah (GLDAS-Noah), and the land component of the fifth generation of European Reanalysis (ERA5-Land), and we then downscaled the merged SM data from 0.36° to 0.01° resolution based on the relationship between the SM data and auxiliary environmental variables (elevation, land surface temperature, vegetation index, surface albedo, and soil texture). The merged and downscaled SM results were validated against in situ observations. The results showed that: (1) the TC-based validation results were consistent with the in situ-based validation, indicating that the TC method was reasonable for the comparison and evaluation of satellite and modeled SM data. (2) TC-based merging was superior to simple arithmetic average merging when the parent products had large differences. (3) Downscaled SM of the TC-based merged product had better performance than that of the parent products in terms of ubRMSE and bias values, implying that the fusion of satellite and model-based SM data would result in better downscaling accuracy. (4) Downscaled SM of TC-based merged data not only improved the representation of the SM spatial variability but also had satisfactory accuracy with a median of R (0.7244), ubRMSE (0.0459 m3/m3), and bias (−0.0126 m3/m3). The proposed approach was effective for generating a SM dataset with fine resolution and reliable accuracy for wide hydrometeorological applications.
Collapse
|