1
|
Hybrid Machine Learning Approach for Gully Erosion Mapping Susceptibility at a Watershed Scale. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2022. [DOI: 10.3390/ijgi11070401] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Gully erosion is a serious threat to the state of ecosystems all around the world. As a result, safeguarding the soil for our own benefit and from our own actions is a must for guaranteeing the long-term viability of a variety of ecosystem services. As a result, developing gully erosion susceptibility maps (GESM) is both suggested and necessary. In this study, we compared the effectiveness of three hybrid machine learning (ML) algorithms with the bivariate statistical index frequency ratio (FR), named random forest-frequency ratio (RF-FR), support vector machine-frequency ratio (SVM-FR), and naïve Bayes-frequency ratio (NB-FR), in mapping gully erosion in the GHISS watershed in the northern part of Morocco. The models were implemented based on the inventory mapping of a total number of 178 gully erosion points randomly divided into 2 groups (70% of points were used for training the models and 30% of points were used for the validation process), and 12 conditioning variables (i.e., elevation, slope, aspect, plane curvature, topographic moisture index (TWI), stream power index (SPI), precipitation, distance to road, distance to stream, drainage density, land use, and lithology). Using the equal interval reclassification method, the spatial distribution of gully erosion was categorized into five different classes, including very high, high, moderate, low, and very low. Our results showed that the very high susceptibility classes derived using RF-FR, SVM-FR, and NB-FR models covered 25.98%, 22.62%, and 27.10% of the total area, respectively. The area under the receiver (AUC) operating characteristic curve, precision, and accuracy were employed to evaluate the performance of these models. Based on the receiver operating characteristic (ROC), the results showed that the RF-FR achieved the best performance (AUC = 0.91), followed by SVM-FR (AUC = 0.87), and then NB-FR (AUC = 0.82), respectively. Our contribution, in line with the Sustainable Development Goals (SDGs), plays a crucial role for understanding and identifying the issue of “where and why” gully erosion occurs, and hence it can serve as a first pathway to reducing gully erosion in this particular area.
Collapse
|
2
|
Allocca V, Di Napoli M, Coda S, Carotenuto F, Calcaterra D, Di Martire D, De Vita P. A novel methodology for Groundwater Flooding Susceptibility assessment through Machine Learning techniques in a mixed-land use aquifer. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 790:148067. [PMID: 34111794 DOI: 10.1016/j.scitotenv.2021.148067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 05/21/2021] [Accepted: 05/23/2021] [Indexed: 06/12/2023]
Abstract
Many areas around the world are affected by Groundwater Level rising (GWLr). One of the most severe consequences of this phenomenon is Groundwater Flooding (GF), with serious impacts for the human and natural environment. In Europe, GF has recently received specific attention with Directive 2007/60/EC, which requires Member States to map GF hazard and propose measures for risk mitigation. In this paper a methodology has been developed for Groundwater Flooding Susceptibility (GFS) assessment, using for the first time Spatial Distribution Models. These Machine Learning techniques connect occurrence data to predisposing factors (PFs) to estimate their distributions. The implemented methodology employs aquifer type, depth of piezometric level, thickness and hydraulic conductivity of unsaturated zone, drainage density and land-use as PFs, and a GF observations inventory as occurrences. The algorithms adopted to perform the analysis are Generalized Boosting Model, Artificial Neural Network and Maximum Entropy. Ensemble Models are carried out to reduce the uncertainty associated with each algorithm and increase its reliability. GFS is mapped by choosing the ensemble model with the best predictivity performance and dividing occurrence probability values into five classes, from very low to very high susceptibility, using Natural Breaks classification. The methodology has been tested and statistically validated in an area of 14,3 km2 located in the Metropolitan City of Naples (Italy), affected by GWLr since 1990 and GF in buildings and agricultural soils since 2007. The results of modeling show that about 93% of the inventoried points fall in the high and very high GFS classes, and piezometric level depth, thickness of unsaturated zone and drainage density are the most influencing PFs, in accordance with field observations and the triggering mechanism of GF. The outcomes provide a first step in the assessment of GF hazard and a decision support tool to local authorities for GF risk management.
Collapse
Affiliation(s)
- Vincenzo Allocca
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy.
| | - Mariano Di Napoli
- Department of Earth, Environment and Life Sciences, University of Genoa, Corso Europa 26, 16132 Genoa, Italy
| | - Silvio Coda
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy.
| | - Francesco Carotenuto
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Domenico Calcaterra
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Diego Di Martire
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| | - Pantaleone De Vita
- Department of Earth, Environmental and Resources Sciences, University of Naples Federico II, Complesso Universitario Monte S. Angelo, Via Cinthia 21, Edificio 10, 80126 Naples, Italy
| |
Collapse
|
3
|
Saha A, Pal SC, Arabameri A, Chowdhuri I, Rezaie F, Chakrabortty R, Roy P, Shit M. Optimization modelling to establish false measures implemented with ex-situ plant species to control gully erosion in a monsoon-dominated region with novel in-situ measurements. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2021; 287:112284. [PMID: 33711662 DOI: 10.1016/j.jenvman.2021.112284] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 02/11/2021] [Accepted: 02/25/2021] [Indexed: 06/12/2023]
Abstract
Water dominated gullies formation and associated land degradation are the foremost challenges among the planners for sustainability and optimization of land resources. This type of hazardous phenomenon is utmost vulnerable due to huge loss of surface soil in the sub-tropical developing countries like India. The present study has been carried out in rugged badland topography of Garhbeta-I Community Development (C.D.) Block in eastern India for assessing the gully erosion susceptibility (GES) mapping and optimization of land use planning. The GES mapping is the first and foremost steps towards minimization this adverse affect and attaining sustainable development. In this study we also describe the importance of plantation and alternation of ex-situ tree species with in-situ species for minimizes the erosional activity. To meet our research goal here we used two prediction based machine learning algorithm (MLA) namely random forest (RF) and boosted regression tree (BRT) and one optimization model of Ecogeography based optimization (EBO). The research study also carried out by using a total of 199, in which 139 (70%) and 60 (30%) gully head-cut points were used for training and validation purposes respectively and treated as dependent factors, and twenty gully erosion conditioning factors as independent variables. These models are validated through receiver operating characteristics-area under the curve (ROC-AUC), accuracy (ACC), precision (PRE) and Kappa coefficient index analysis. The validation result showed that EBO model with the highest values of AUC-0.954, ACC-0.85, PRE-0.877 and Kappa-0.646 is the most accurate model for GES followed by BRT and RF. The outcome results should help for the sustainable development of this rugged badland topography.
Collapse
Affiliation(s)
- Asish Saha
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Subodh Chandra Pal
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India.
| | - Alireza Arabameri
- Department of Geomorphology, Tarbiat Modares University, Tehran, 14117 -13116, Iran
| | - Indrajit Chowdhuri
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Fatemeh Rezaie
- Geoscience Platform Research Division, Korea Institute of Geoscience and Mineral Resources (KIGAM), 124, Gwahak-ro Yuseong-gu, Daejeon, 34132, Republic of Korea; Korea University of Science and Technology, 217 Gajeong-roYuseong-gu, Daejeon, 34113, Republic of Korea
| | - Rabin Chakrabortty
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Paramita Roy
- Department of Geography, The University of Burdwan, Bardhaman, West Bengal, 713104, India
| | - Manisa Shit
- Department of Geography, Raiganj University, Raiganj, Uttar Dinajpur, West Bengal, 733134, India
| |
Collapse
|
4
|
Credal decision tree based novel ensemble models for spatial assessment of gully erosion and sustainable management. Sci Rep 2021; 11:3147. [PMID: 33542340 PMCID: PMC7862281 DOI: 10.1038/s41598-021-82527-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 01/21/2021] [Indexed: 01/30/2023] Open
Abstract
We introduce novel hybrid ensemble models in gully erosion susceptibility mapping (GESM) through a case study in the Bastam sedimentary plain of Northern Iran. Four new ensemble models including credal decision tree-bagging (CDT-BA), credal decision tree-dagging (CDT-DA), credal decision tree-rotation forest (CDT-RF), and credal decision tree-alternative decision tree (CDT-ADTree) are employed for mapping the gully erosion susceptibility (GES) with the help of 14 predictor factors and 293 gully locations. The relative significance of GECFs in modelling GES is assessed by random forest algorithm. Two cut-off-independent (area under success rate curve and area under predictor rate curve) and six cut-off-dependent metrics (accuracy, sensitivity, specificity, F-score, odd ratio and Cohen Kappa) were utilized based on both calibration as well as testing dataset. Drainage density, distance to road, rainfall and NDVI were found to be the most influencing predictor variables for GESM. The CDT-RF (AUSRC = 0.942, AUPRC = 0.945, accuracy = 0.869, specificity = 0.875, sensitivity = 0.864, RMSE = 0.488, F-score = 0.869 and Cohen's Kappa = 0.305) was found to be the most robust model which showcased outstanding predictive accuracy in mapping GES. Our study shows that the GESM can be utilized for conserving soil resources and for controlling future gully erosion.
Collapse
|
5
|
Arora A, Arabameri A, Pandey M, Siddiqui MA, Shukla UK, Bui DT, Mishra VN, Bhardwaj A. Optimization of state-of-the-art fuzzy-metaheuristic ANFIS-based machine learning models for flood susceptibility prediction mapping in the Middle Ganga Plain, India. THE SCIENCE OF THE TOTAL ENVIRONMENT 2021; 750:141565. [PMID: 32882492 DOI: 10.1016/j.scitotenv.2020.141565] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Revised: 07/31/2020] [Accepted: 08/06/2020] [Indexed: 05/22/2023]
Abstract
This study is an attempt to quantitatively test and compare novel advanced-machine learning algorithms in terms of their performance in achieving the goal of predicting flood susceptible areas in a low altitudinal range, sub-tropical floodplain environmental setting, like that prevailing in the Middle Ganga Plain (MGP), India. This part of the Ganga floodplain region, which under the influence of undergoing active tectonic regime related subsidence, is the hotbed of annual flood disaster. This makes the region one of the best natural laboratories to test the flood susceptibility models for establishing a universalization of such models in low relief highly flood prone areas. Based on highly sophisticated flood inventory archived for this region, and 12 flood conditioning factors viz. annual rainfall, soil type, stream density, distance from stream, distance from road, Topographic Wetness Index (TWI), altitude, slope aspect, slope, curvature, land use/land cover, and geomorphology, an advanced novel hybrid model Adaptive Neuro Fuzzy Inference System (ANFIS), and three metaheuristic models-based ensembles with ANFIS namely ANFIS-GA (Genetic Algorithm), ANFIS-DE (Differential Evolution), and ANFIS-PSO (Particle Swarm Optimization), have been applied for zonation of the flood susceptible areas. The flood inventory dataset, prepared by collected flood samples, were apportioned into 70:30 classes to prepare training and validation datasets. One independent validation method, the Area-Under Receiver Operating Characteristic (AUROC) Curve, and other 11 cut-off-dependent model evaluation metrices have helped to conclude that the ANIFS-GA has outperformed other three models with highest success rate AUC = 0.922 and prediction rate AUC = 0.924. The accuracy was also found to be highest for ANFIS-GA during training (0.886) & validation (0.883). Better performance of ANIFS-GA than the individual models as well as some ensemble models suggests and warrants further study in this topoclimatic environment using other classes of susceptibility models. This will further help establishing a benchmark model with capability of highest accuracy and sensitivity performance in the similar topographic and climatic setting taking assumption of the quality of input parameters as constant.
Collapse
Affiliation(s)
- Aman Arora
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi 110025, India.
| | - Alireza Arabameri
- Department of Geomorphology, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran 9821, Iran
| | - Manish Pandey
- University Center for Research & Development (UCRD), Chandigarh University, Mohali 140413, Punjab, India; Department of Civil Engineering, Chandigarh University, Mohali 140413, Punjab, India.
| | - Masood A Siddiqui
- Department of Geography, Faculty of Natural Sciences, Jamia Millia Islamia, New Delhi 110025, India
| | - U K Shukla
- Center for Advanced Study in Geology, Institute of Science, Banaras Hindu University, Varanasi 221005, India
| | - Dieu Tien Bui
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam
| | - Varun Narayan Mishra
- Centre for Climate Change and Water Research, Suresh Gyan Vihar University, Jaipur 302017, Rajasthan, India
| | - Anshuman Bhardwaj
- School of Geosciences, University of Aberdeen, Meston Building, King's College, Aberdeen AB24 3UE, UK; Division of Space Technology, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden
| |
Collapse
|
6
|
Implementation of Artificial Intelligence Based Ensemble Models for Gully Erosion Susceptibility Assessment. REMOTE SENSING 2020. [DOI: 10.3390/rs12213620] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The Rarh Bengal region in West Bengal, particularly the eastern fringe area of the Chotanagpur plateau, is highly prone to water-induced gully erosion. In this study, we analyzed the spatial patterns of a potential gully erosion in the Gandheswari watershed. This area is highly affected by monsoon rainfall and ongoing land-use changes. This combination causes intensive gully erosion and land degradation. Therefore, we developed gully erosion susceptibility maps (GESMs) using the machine learning (ML) algorithms boosted regression tree (BRT), Bayesian additive regression tree (BART), support vector regression (SVR), and the ensemble of the SVR-Bee algorithm. The gully erosion inventory maps are based on a total of 178 gully head-cutting points, taken as the dependent factor, and gully erosion conditioning factors, which serve as the independent factors. We validated the ML model results using the area under the curve (AUC), accuracy (ACC), true skill statistic (TSS), and Kappa coefficient index. The AUC result of the BRT, BART, SVR, and SVR-Bee models are 0.895, 0.902, 0.927, and 0.960, respectively, which show very good GESM accuracies. The ensemble model provides more accurate prediction results than any single ML model used in this study.
Collapse
|
7
|
Modeling Spatial Flood using Novel Ensemble Artificial Intelligence Approaches in Northern Iran. REMOTE SENSING 2020. [DOI: 10.3390/rs12203423] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The uncertainty of flash flood makes them highly difficult to predict through conventional models. The physical hydrologic models of flash flood prediction of any large area is very difficult to compute as it requires lot of data and time. Therefore remote sensing data based models (from statistical to machine learning) have become highly popular due to open data access and lesser prediction times. There is a continuous effort to improve the prediction accuracy of these models through introducing new methods. This study is focused on flash flood modeling through novel hybrid machine learning models, which can improve the prediction accuracy. The hybrid machine learning ensemble approaches that combine the three meta-classifiers (Real AdaBoost, Random Subspace, and MultiBoosting) with J48 (a tree-based algorithm that can be used to evaluate the behavior of the attribute vector for any defined number of instances) were used in the Gorganroud River Basin of Iran to assess flood susceptibility (FS). A total of 426 flood positions as dependent variables and a total of 14 flood conditioning factors (FCFs) as independent variables were used to model the FS. Several threshold-dependent and independent statistical tests were applied to verify the performance and predictive capability of these machine learning models, such as the receiver operating characteristic (ROC) curve of the success rate curve (SRC) and prediction rate curve (PRC), efficiency (E), root-mean square-error (RMSE), and true skill statistics (TSS). The valuation of the FCFs was done using AdaBoost, frequency ratio (FR), and Boosted Regression Tree (BRT) models. In the flooding of the study area, altitude, land use/land cover (LU/LC), distance to stream, normalized differential vegetation index (NDVI), and rainfall played important roles. The Random Subspace J48 (RSJ48) ensemble method with an area under the curve (AUC) of 0.931 (SRC), 0.951 (PRC), E of 0.89, sensitivity of 0.87, and TSS of 0.78, has become the most effective ensemble in predicting the FS. The FR technique also showed good performance and reliability for all models. Map removal sensitivity analysis (MRSA) revealed that the FS maps have the highest sensitivity to elevation. Based on the findings of the validation methods, the FS maps prepared using the machine learning ensemble techniques have high robustness and can be used to advise flood management initiatives in flood-prone areas.
Collapse
|
8
|
Novel Ensemble of Multivariate Adaptive Regression Spline with Spatial Logistic Regression and Boosted Regression Tree for Gully Erosion Susceptibility. REMOTE SENSING 2020. [DOI: 10.3390/rs12203284] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The extreme form of land degradation through different forms of erosion is one of the major problems in sub-tropical monsoon dominated region. The formation and development of gullies is the dominant form or active process of erosion in this region. So, identification of erosion prone regions is necessary for escaping this type of situation and maintaining the correspondence between different spheres of the environment. The major goal of this study is to evaluate the gully erosion susceptibility in the rugged topography of the Hinglo River Basin of eastern India, which ultimately contributes to sustainable land management practices. Due to the nature of data instability, the weakness of the classifier andthe ability to handle data, the accuracy of a single method is not very high. Thus, in this study, a novel resampling algorithm was considered to increase the robustness of the classifier and its accuracy. Gully erosion susceptibility maps have been prepared using boosted regression trees (BRT), multivariate adaptive regression spline (MARS) and spatial logistic regression (SLR) with proposed resampling techniques. The re-sampling algorithm was able to increase the efficiency of all predicted models by improving the nature of the classifier. Each variable in the gully inventory map was randomly allocated with 5-fold cross validation, 10-fold cross validation, bootstrap and optimism bootstrap, while each consisted of 30% of the database. The ensemble model was tested using 70% and validated with the other 30% using the K-fold cross validation (CV) method to evaluate the influence of the random selection of training and validation database. Here, all resampling methods are associated with higher accuracy, but SLR bootstrap optimism is more optimal than any other methods according to its robust nature. The AUC values of BRT optimism bootstrap, MARS optimism bootstrap and SLR optimism bootstrap are 87.40%, 90.40% and 90.60%, respectively. According to the SLR optimism bootstrap, the 107,771 km2 (27.51%) area of this region is associated with a very high to high susceptible to gully erosion. This potential developmental area of the gully was found primarily in the Hinglo River Basin, where lateral exposure was mainly observed with scarce vegetation. The outcome of this work can help policy-makers to implement remedial measures to minimize the damage caused by erosion of the gully.
Collapse
|
9
|
Novel Machine Learning Approaches for Modelling the Gully Erosion Susceptibility. REMOTE SENSING 2020. [DOI: 10.3390/rs12172833] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The extreme form of land degradation caused by the formation of gullies is a major challenge for the sustainability of land resources. This problem is more vulnerable in the arid and semi-arid environment and associated damage to agriculture and allied economic activities. Appropriate modeling of such erosion is therefore needed with optimum accuracy for estimating vulnerable regions and taking appropriate initiatives. The Golestan Dam has faced an acute problem of gully erosion over the last decade and has adversely affected society. Here, the artificial neural network (ANN), general linear model (GLM), maximum entropy (MaxEnt), and support vector machine (SVM) machine learning algorithm with 90/10, 80/20, 70/30, 60/40, and 50/50 random partitioning of training and validation samples was selected purposively for estimating the gully erosion susceptibility. The main objective of this work was to predict the susceptible zone with the maximum possible accuracy. For this purpose, random partitioning approaches were implemented. For this purpose, 20 gully erosion conditioning factors were considered for predicting the susceptible areas by considering the multi-collinearity test. The variance inflation factor (VIF) and tolerance (TOL) limit were considered for multi-collinearity assessment for reducing the error of the models and increase the efficiency of the outcome. The ANN with 50/50 random partitioning of the sample is the most optimal model in this analysis. The area under curve (AUC) values of receiver operating characteristics (ROC) in ANN (50/50) for the training and validation data are 0.918 and 0.868, respectively. The importance of the causative factors was estimated with the help of the Jackknife test, which reveals that the most important factor is the topography position index (TPI). Apart from this, the prioritization of all predicted models was estimated taking into account the training and validation data set, which should help future researchers to select models from this perspective. This type of outcome should help planners and local stakeholders to implement appropriate land and water conservation measures.
Collapse
|
10
|
GIS-Based Machine Learning Algorithms for Gully Erosion Susceptibility Mapping in a Semi-Arid Region of Iran. REMOTE SENSING 2020. [DOI: 10.3390/rs12152478] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
In the present study, gully erosion susceptibility was evaluated for the area of the Robat Turk Watershed in Iran. The assessment of gully erosion susceptibility was performed using four state-of-the-art data mining techniques: random forest (RF), credal decision trees (CDTree), kernel logistic regression (KLR), and best-first decision tree (BFTree). To the best of our knowledge, the KLR and CDTree algorithms have been rarely applied to gully erosion modeling. In the first step, from the 242 gully erosion locations that were identified, 70% (170 gullies) were selected as the training dataset, and the other 30% (72 gullies) were considered for the result validation process. In the next step, twelve gully erosion conditioning factors, including topographic, geomorphological, environmental, and hydrologic factors, were selected to estimate gully erosion susceptibility. The area under the ROC curve (AUC) was used to estimate the performance of the models. The results revealed that the RF model had the best performance (AUC = 0.893), followed by the KLR (AUC = 0.825), the CDTree (AUC = 0.808), and the BFTree (AUC = 0.789) models. Overall, the RF model performed significantly better than the others, which may support the application of this method to a transferable susceptibility model in other areas. Therefore, we suggest using the RF, KLR, and CDT models for gully erosion susceptibility mapping in other prone areas to assess their reproducibility.
Collapse
|
11
|
Novel Ensemble Approaches of Machine Learning Techniques in Modeling the Gully Erosion Susceptibility. REMOTE SENSING 2020. [DOI: 10.3390/rs12111890] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Gully erosion has become one of the major environmental issues, due to the severity of its impact in many parts of the world. Gully erosion directly and indirectly affects agriculture and infrastructural development. The Golestan Dam basin, where soil erosion and degradation are very severe problems, was selected as the study area. This research maps gully erosion susceptibility (GES) by integrating four models: maximum entropy (MaxEnt), artificial neural network (ANN), support vector machine (SVM), and general linear model (GLM). Of 1042 gully locations, 729 (70%) and 313 (30%) gully locations were used for modeling and validation purposes, respectively. Fourteen effective gully erosion conditioning factors (GECFs) were selected for spatial gully erosion modeling. Tolerance and variance inflation factors (VIFs) were used to examine the collinearity among the GECFs. The random forest (RF) model was used to assess factors’ effectiveness and significance in gully erosion modeling. An ensemble of techniques can provide more accurate results than can single, standalone models. Therefore, we compared two-, three-, and four-model ensembles (ANN-SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, ANN-SVM-GLM, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM, MaxEnt-ANN-SVM and GLM-ANN-SVM-MaxEnt) for GES modeling. The susceptibility zones of the GESMs were classified as very-low, low, medium, high, and very-high using Jenks’ natural break classification method (NBM). Subsequently, the receiver operating characteristics (ROC) curve and the seed cell area index (SCAI) methods measured the reliability of the models. The success rate curve (SRC) and predication rate curve (PRC) and their area under the curve (AUC) values were obtained from the GES maps. The results show that the ANN model combined with two and three models are more accurate than the other combinations, but the ANN-SVM model had the highest accuracy. The rank of the others from best to worst accuracy is GLM, MaxEnt, SVM, GLM-ANN, GLM-MaxEnt, GLM-SVM, MaxEnt-ANN, MaxEnt-SVM, GLM-ANN-SVM-MaxEnt, GLM-MaxEnt-ANN, GLM-MaxEnt-SVM and MaxEnt-ANN-SVM. The resulting gully erosion susceptibility models (GESMs) are efficient and powerful and could be used to improve soil and water conservation and management.
Collapse
|
12
|
GIS-Based Gully Erosion Susceptibility Mapping: A Comparison of Computational Ensemble Data Mining Models. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10062039] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Gully erosion destroys agricultural and domestic grazing land in many countries, especially those with arid and semi-arid climates and easily eroded rocks and soils. It also generates large amounts of sediment that can adversely impact downstream river channels. The main objective of this research is to accurately detect and predict areas prone to gully erosion. In this paper, we couple hybrid models of a commonly used base classifier (reduced pruning error tree, REPTree) with AdaBoost (AB), bagging (Bag), and random subspace (RS) algorithms to create gully erosion susceptibility maps for a sub-basin of the Shoor River watershed in northwestern Iran. We compare the performance of these models in terms of their ability to predict gully erosion and discuss their potential use in other arid and semi-arid areas. Our database comprises 242 gully erosion locations, which we randomly divided into training and testing sets with a ratio of 70/30. Based on expert knowledge and analysis of aerial photographs and satellite images, we selected 12 conditioning factors for gully erosion. We used multi-collinearity statistical techniques in the modeling process, and checked model performance using statistical indexes including precision, recall, F-measure, Matthew correlation coefficient (MCC), receiver operatic characteristic curve (ROC), precision–recall graph (PRC), Kappa, root mean square error (RMSE), relative absolute error (PRSE), mean absolute error (MAE), and relative absolute error (RAE). Results show that rainfall, elevation, and river density are the most important factors for gully erosion susceptibility mapping in the study area. All three hybrid models that we tested significantly enhanced and improved the predictive power of REPTree (AUC=0.800), but the RS-REPTree (AUC= 0.860) ensemble model outperformed the Bag-REPTree (AUC= 0.841) and the AB-REPTree (AUC= 0.805) models. We suggest that decision makers, planners, and environmental engineers employ the RS-REPTree hybrid model to better manage gully erosion-prone areas in Iran.
Collapse
|
13
|
Saha S, Roy J, Arabameri A, Blaschke T, Tien Bui D. Machine Learning-Based Gully Erosion Susceptibility Mapping: A Case Study of Eastern India. SENSORS (BASEL, SWITZERLAND) 2020; 20:E1313. [PMID: 32121238 PMCID: PMC7085763 DOI: 10.3390/s20051313] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 02/25/2020] [Accepted: 02/26/2020] [Indexed: 12/02/2022]
Abstract
Gully erosion is a form of natural disaster and one of the land loss mechanisms causing severe problems worldwide. This study aims to delineate the areas with the most severe gully erosion susceptibility (GES) using the machine learning techniques Random Forest (RF), Gradient Boosted Regression Tree (GBRT), Naïve Bayes Tree (NBT), and Tree Ensemble (TE). The gully inventory map (GIM) consists of 120 gullies. Of the 120 gullies, 84 gullies (70%) were used for training and 36 gullies (30%) were used to validate the models. Fourteen gully conditioning factors (GCFs) were used for GES modeling and the relationships between the GCFs and gully erosion was assessed using the weight-of-evidence (WofE) model. The GES maps were prepared using RF, GBRT, NBT, and TE and were validated using area under the receiver operating characteristic(AUROC) curve, the seed cell area index (SCAI) and five statistical measures including precision (PPV), false discovery rate (FDR), accuracy, mean absolute error (MAE), and root mean squared error (RMSE). Nearly 7% of the basin has high to very high susceptibility for gully erosion. Validation results proved the excellent ability of these models to predict the GES. Of the analyzed models, the RF (AUROC = 0.96, PPV = 1.00, FDR = 0.00, accuracy = 0.87, MAE = 0.11, RMSE = 0.19 for validation dataset) is accurate enough for modeling and better suited for GES modeling than the other models. Therefore, the RF model can be used to model the GES areas not only in this river basin but also in other areas with the same geo-environmental conditions.
Collapse
Affiliation(s)
- Sunil Saha
- Department of Geography, University of Gour Banga, Malda, West Bengal 732103, India;
| | - Jagabandhu Roy
- Research Scholar, Dept. of Geography, University of Gour Banga, Malda, West Bengal 732103, India;
| | - Alireza Arabameri
- Department of Geomorphology, Tarbiat Modares University, Tehran 14117-13116, Iran
| | - Thomas Blaschke
- Department of Geoinformatics—Z_GIS, University of Salzburg, 5020 Salzburg, Austria;
| | - Dieu Tien Bui
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
| |
Collapse
|
14
|
Novel Ensemble of MCDM-Artificial Intelligence Techniques for Groundwater-Potential Mapping in Arid and Semi-Arid Regions (Iran). REMOTE SENSING 2020. [DOI: 10.3390/rs12030490] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
The aim of this research is to introduce a novel ensemble approach using Vise Kriterijumska Optimizacija I Kompromisno Resenje (VIKOR), frequency ratio (FR), and random forest (RF) models for groundwater-potential mapping (GWPM) in Bastam watershed, Iran. This region suffers from freshwater shortages and the identification of new groundwater sites is a critical need. Remote sensing and geographic information system (GIS) were used to reduce time and financial costs of rapid assessment of groundwater resources. Seventeen physiographical, hydrological, and geological groundwater conditioning factors (GWCFs) were derived from a spatial geo-database. Groundwater data were gathered in field surveys and well-yield data were acquired from the Iranian Department of Water Resources Management for 89 locations with high yield potential values ≥ 11 m3 h−1. These data were mapped in a GIS. From these locations, 62 (70%) were randomly selected to be used for model training, and the remaining 27 (30%) were used for validation of the model. The relative weights of the GWCFs were determined with an RF model. For GWPM, 220 randomly selected points in the study area and their final weights were determined with the VIKOR model. A groundwater potential map was created by interpolating the values at these points using Kriging in GIS. Finally, the area under receiver operating characteristic (AUROC) curve was plotted for the groundwater potential map. The success rate curve (SRC) was computed for the training dataset, and the prediction rate curve (PRC) was calculated for the validation dataset. Results of RF analysis show that land use and land cover, lithology, and elevation are the most significant determinants of groundwater occurrence. The validation results show that the ensemble model had excellent prediction performance (PRC = 0.934) and goodness-of-fit (SRC = 0.925) and reasonably high classification accuracy. The results of this study could aid management of groundwater resources and assist planners and decision makers in groundwater-investment planning to achieve sustainability.
Collapse
|
15
|
Landslide Susceptibility Evaluation and Management Using Different Machine Learning Methods in The Gallicash River Watershed, Iran. REMOTE SENSING 2020. [DOI: 10.3390/rs12030475] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This analysis aims to generate landslide susceptibility maps (LSMs) using various machine learning methods, namely random forest (RF), alternative decision tree (ADTree) and Fisher’s Linear Discriminant Function (FLDA). The results of the FLDA, RF and ADTree models were compared with regard to their applicability for creating an LSM of the Gallicash river watershed in the northern part of Iran close to the Caspian Sea. A landslide inventory map was created using GPS points obtained in a field analysis, high-resolution satellite images, topographic maps and historical records. A total of 249 landslide sites have been identified to date and were used in this study to model and validate the LSMs of the study region. Of the 249 landslide locations, 70% were used as training data and 30% for the validation of the resulting LSMs. Sixteen factors related to topographical, hydrological, soil type, geological and environmental conditions were used and a multi-collinearity test of the landslide conditioning factors (LCFs) was performed. Using the natural break method (NBM) in a geographic information system (GIS), the LSMs generated by the RF, FLDA, and ADTree models were categorized into five classes, namely very low, low, medium, high and very high landslide susceptibility (LS) zones. The very high susceptibility zones cover 15.37% (ADTree), 16.10% (FLDA) and 11.36% (RF) of the total catchment area. The results of the different models (FLDA, RF, and ADTree) were explained and compared using the area under receiver operating characteristics (AUROC) curve, seed cell area index (SCAI), efficiency and true skill statistic (TSS). The accuracy of models was calculated considering both the training and validation data. The results revealed that the AUROC success rates are 0.89 (ADTree), 0.92 (FLDA) and 0.97 (RF) and predication rates are 0.82 (ADTree), 0.79 (FLDA) and 0.98 (RF), which justifies the approach and indicates a reasonably good landslide prediction. The results of the SCAI, efficiency and TSS methods showed that all models have an excellent modeling capability. In a comparison of the models, the RF model outperforms the boosted regression tree (BRT) and ADTree models. The results of the landslide susceptibility modeling could be useful for land-use planning and decision-makers, for managing and controlling the current and future landslides, as well as for the protection of society and the ecosystem.
Collapse
|
16
|
Arabameri A, Blaschke T, Pradhan B, Pourghasemi HR, Tiefenbacher JP, Bui DT. Evaluation of Recent Advanced Soft Computing Techniques for Gully Erosion Susceptibility Mapping: A Comparative Study. SENSORS (BASEL, SWITZERLAND) 2020; 20:E335. [PMID: 31936038 PMCID: PMC7014250 DOI: 10.3390/s20020335] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Revised: 12/22/2019] [Accepted: 12/31/2019] [Indexed: 11/16/2022]
Abstract
Gully erosion is a problem; therefore, it must be predicted using highly accurate predictive models to avoid losses caused by gully development and to guarantee sustainable development. This research investigates the predictive performance of seven multiple-criteria decision-making (MCDM), statistical, and machine learning (ML)-based models and their ensembles for gully erosion susceptibility mapping (GESM). A case study of the Dasjard River watershed, Iran uses a database of 306 gully head cuts and 15 conditioning factors. The database was divided 70:30 to train and verify the models. Their performance was assessed with the area under prediction rate curve (AUPRC), the area under success rate curve (AUSRC), accuracy, and kappa. Results show that slope is key to gully formation. The maximum entropy (ME) ML model has the best performance (AUSRC = 0.947, AUPRC = 0.948, accuracy = 0.849 and kappa = 0.699). The second best is the random forest (RF) model (AUSRC = 0.965, AUPRC = 0.932, accuracy = 0.812 and kappa = 0.624). By contrast, the TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) model was the least effective (AUSRC = 0.871, AUPRC = 0.867, accuracy = 0.758 and kappa = 0.516). RF increased the performance of statistical index (SI) and frequency ratio (FR) statistical models. Furthermore, the combination of a generalized linear model (GLM), and functional data analysis (FDA) improved their performances. The results demonstrate that a combination of geographic information systems (GIS) with remote sensing (RS)-based ML models can successfully map gully erosion susceptibility, particularly in low-income and developing regions. This method can aid the analyses and decisions of natural resources managers and local planners to reduce damages by focusing attention and resources on areas prone to the worst and most damaging gully erosion.
Collapse
Affiliation(s)
- Alireza Arabameri
- Department of Geomorphology, Tarbiat Modares University, Tehran 36581-17994, Iran;
| | - Thomas Blaschke
- Department of Geoinformatics—Z_GIS, University of Salzburg, 5020 Salzburg, Austria;
| | - Biswajeet Pradhan
- Centre for Advanced Modelling and Geospatial Information Systems (CAMGIS), Faculty of Engineering and IT, University of Technology Sydney, Ultimo, NSW 2007, Australia;
- Department of Energy and Mineral Resources Engineering, Choongmu-gwan, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea
| | - Hamid Reza Pourghasemi
- Department of Natural Resources and Environmental Engineering, College of Agriculture, Shiraz University, Shiraz 71441-65186, Iran
| | | | - Dieu Tien Bui
- Institute of Research and Development, Duy Tan University, Da Nang 550000, Vietnam
| |
Collapse
|