Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sahin EK. Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Appl Sci 2020;2. [DOI: 10.1007/s42452-020-3060-1] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

For:	Sahin EK. Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Appl Sci 2020;2. [DOI: 10.1007/s42452-020-3060-1] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Number

Cited by Other Article(s)

Ahmed S, Hiraga Y, Kazama S. Land subsidence in Bangkok vicinity: Causes and long-term trend analysis using InSAR and machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;946:174285. [PMID: 38942307 DOI: 10.1016/j.scitotenv.2024.174285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 06/13/2024] [Accepted: 06/23/2024] [Indexed: 06/30/2024]

Yang C, Huang W, Lin Y, Cao S, Wang H, Sun Y, Fang T, Wang M, Kong D. Stretchable MXene/Carbon Nanotube Bilayer Strain Sensors with Tunable Sensitivity and Working Ranges. ACS APPLIED MATERIALS & INTERFACES 2024;16:30274-30283. [PMID: 38822785 DOI: 10.1021/acsami.4c04770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2024]

Affiliation(s)

Cheng Yang College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Weixi Huang College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Yong Lin College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Shitai Cao College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Hao Wang College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Yuping Sun College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Ting Fang College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Menglu Wang College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China
Desheng Kong College of Engineering and Applied Sciences, and Jiangsu Key Laboratory of Artificial Functional Materials, Nanjing University, Nanjing 210023, China State Key Laboratory of Analytical Chemistry for Life Science, Nanjing 210023, China

Collapse

Park D, Park EA, Jeong B, Lee W. A comparative analysis of deep learning-based location-adaptive threshold method software against other commercially available software. Int J Cardiovasc Imaging 2024;40:1269-1281. [PMID: 38634943 PMCID: PMC11213768 DOI: 10.1007/s10554-024-03099-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024]

Nguyen HD, Nguyen QH, Dang DK, Van CP, Truong QH, Pham SD, Bui QT, Petrisor AI. A novel flood risk management approach based on future climate and land use change scenarios. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;921:171204. [PMID: 38401735 DOI: 10.1016/j.scitotenv.2024.171204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Revised: 02/20/2024] [Accepted: 02/21/2024] [Indexed: 02/26/2024]

Abstract

Climate change and increasing urbanization are two primary factors responsible for the increased risk of serious flooding around the world. The prediction and monitoring of the effects of land use/land cover (LULC) and climate change on flood risk are critical steps in the development of appropriate strategies to reduce potential damage. This study aimed to develop a new approach by combining machine learning (namely the XGBoost, CatBoost, LightGBM, and ExtraTree models) and hydraulic modeling to predict the effects of climate change and LULC change on land that is at risk of flooding. For the years 2005, 2020, 2035, and 2050, machine learning was used to model and predict flood susceptibility under different scenarios of LULC, while hydraulic modeling was used to model and predict flood depth and flood velocity, based on the RCP 8.5 climate change scenario. The two elements were used to build a flood risk assessment, integrating socioeconomic data such as LULC, population density, poverty rate, number of women, number of schools, and cultivated area. Flood risk was then computed, using the analytical hierarchy process, by combining flood hazard, exposure, and vulnerability. The results showed that the area at high and very high flood risk increased rapidly, as did the areas of high/very high exposure, and high/very high vulnerability. They also showed how flood risk had increased rapidly from 2005 to 2020 and would continue to do so in 2035 and 2050, due to the dynamics of climate change and LULC change, population growth, the number of women, and the number of schools - particularly in the flood zone. The results highlight the relationships between flood risk and environmental and socio-economic changes and suggest that flood risk management strategies should also be integrated in future analyses. The map built in this study shows past and future flood risk, providing insights into the spatial distribution of urban area in flood zones and can be used to facilitate the development of priority measures, flood mitigation being most important.

Collapse

Oh SW, Byun SS, Kim JK, Jeong CW, Kwak C, Hwang EC, Kang SH, Chung J, Kim YJ, Ha YS, Hong SH. Machine learning models for predicting the onset of chronic kidney disease after surgery in patients with renal cell carcinoma. BMC Med Inform Decis Mak 2024;24:85. [PMID: 38519947 PMCID: PMC10960396 DOI: 10.1186/s12911-024-02473-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 03/03/2024] [Indexed: 03/25/2024] Open

Tran TTK, Janizadeh S, Bateni SM, Jun C, Kim D, Trauernicht C, Rezaie F, Giambelluca TW, Panahi M. Improving the prediction of wildfire susceptibility on Hawai'i Island, Hawai'i, using explainable hybrid machine learning models. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;351:119724. [PMID: 38061099 DOI: 10.1016/j.jenvman.2023.119724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 11/13/2023] [Accepted: 11/25/2023] [Indexed: 01/14/2024]

Ghosh S, Pal S. Anthropogenic impacts on urban blue space and its reciprocal effect on human and socio-ecological health. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024;351:119727. [PMID: 38070422 DOI: 10.1016/j.jenvman.2023.119727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 11/10/2023] [Accepted: 11/25/2023] [Indexed: 01/14/2024]

Yang Y, Madanian S, Parry D. Enhancing Health Equity by Predicting Missed Appointments in Health Care: Machine Learning Study. JMIR Med Inform 2024;12:e48273. [PMID: 38214974 PMCID: PMC10818230 DOI: 10.2196/48273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 11/07/2023] [Accepted: 12/04/2023] [Indexed: 01/13/2024] Open

Abstract

BACKGROUND

The phenomenon of patients missing booked appointments without canceling them-known as Did Not Show (DNS), Did Not Attend (DNA), or Failed To Attend (FTA)-has a detrimental effect on patients' health and results in massive health care resource wastage.

OBJECTIVE

Our objective was to develop machine learning (ML) models and evaluate their performance in predicting the likelihood of DNS for hospital outpatient appointments at the MidCentral District Health Board (MDHB) in New Zealand.

METHODS

We sourced 5 years of MDHB outpatient records (a total of 1,080,566 outpatient visits) to build the ML prediction models. We developed 3 ML models using logistic regression, random forest, and Extreme Gradient Boosting (XGBoost). Subsequently, 10-fold cross-validation and hyperparameter tuning were deployed to minimize model bias and boost the algorithms' prediction strength. All models were evaluated against accuracy, sensitivity, specificity, and area under the receiver operating characteristic (AUROC) curve metrics.

RESULTS

Based on 5 years of MDHB data, the best prediction classifier was XGBoost, with an area under the curve (AUC) of 0.92, sensitivity of 0.83, and specificity of 0.85. The patients' DNS history, age, ethnicity, and appointment lead time significantly contributed to DNS prediction. An ML system trained on a large data set can produce useful levels of DNS prediction.

CONCLUSIONS

This research is one of the very first published studies that use ML technologies to assist with DNS management in New Zealand. It is a proof of concept and could be used to benchmark DNS predictions for the MDHB and other district health boards. We encourage conducting additional qualitative research to investigate the root cause of DNS issues and potential solutions. Addressing DNS using better strategies potentially can result in better utilization of health care resources and improve health equity.

Collapse

Thirunavukkarasu MK, Veerappapillai S, Karuppasamy R. Sequential virtual screening collaborated with machine-learning strategies for the discovery of precise medicine against non-small cell lung cancer. J Biomol Struct Dyn 2024;42:615-628. [PMID: 36995235 DOI: 10.1080/07391102.2023.2194994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 03/17/2023] [Indexed: 03/31/2023]

Azad MS, Khan SS, Hossain R, Rahman R, Momen S. Predictive modeling of consumer purchase behavior on social media: Integrating theory of planned behavior and machine learning for actionable insights. PLoS One 2023;18:e0296336. [PMID: 38150431 PMCID: PMC10752534 DOI: 10.1371/journal.pone.0296336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Accepted: 12/08/2023] [Indexed: 12/29/2023] Open

Abstract

In recent times, it has been observed that social media exerts a favorable influence on consumer purchasing behavior. Many organizations are adopting the utilization of social media platforms as a means to promote products and services. Hence, it is crucial for enterprises to understand the consumer buying behavior in order to thrive. This article presents a novel approach that combines the theory of planned behavior (TPB) with machine learning techniques to develop accurate predictive models for consumer purchase behavior. This study examines three distinct factors of the theory of planned behavior (attitude, social norm, and perceived behavioral control) that provide insights into the primary determinants influencing online purchasing behavior. A total of eight machine learning algorithms, namely K-nearest neighbor, Decision Tree, Random Forest, Logistic Regression, Naive Bayes, Support Vector Machine, AdaBoost, and Gradient Boosting, were utilized in order to forecast consumer purchasing behavior. Empirical findings indicate that gradient boosting demonstrates superior performance in predicting customer buying behavior, with an accuracy rate of 0.91 and a macro F1 score of 0.91. This holds true when all factors, namely attitude (ATTD), social norm (SN), and perceived behavioral control (PBC), are included in the analysis. Furthermore, we incorporated Explainable AI (XAI), specifically LIME (Local Interpretable Model-Agnostic Explanations), to elucidate how the best machine learning model (i.e. gradient boosting) makes its prediction. The findings indicate that LIME has demonstrated a high level of confidence in accurately predicting the influence of low and high behavior. The outcome presented in this article has several implications. For instance, this article presents a novel way to combine the theory of planned behavior with machine learning techniques in order to predict consumer purchase behavior. This integration allows for a comprehensive analysis of factors influencing online purchasing decisions. Also, the incorporation of Explainable AI enhances the transparency and interpretability of the model. This feature is valuable for organizations seeking insights into factors driving predictions and the reasons behind certain outcomes. Moreover, these observations have the potential to offer valuable insights for businesses in customizing their marketing strategies to align with these influential factors.

Collapse

Zhao W, Ma J, Liu Q, Dou L, Qu Y, Shi H, Sun Y, Chen H, Tian Y, Wu F. Accurate Prediction of Soil Heavy Metal Pollution Using an Improved Machine Learning Method: A Case Study in the Pearl River Delta, China. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023;57:17751-17761. [PMID: 36821784 DOI: 10.1021/acs.est.2c07561] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]

Pall R, Gauthier Y, Auer S, Mowaswes W. Predicting drug shortages using pharmacy data and machine learning. Health Care Manag Sci 2023;26:395-411. [PMID: 36913071 PMCID: PMC10009839 DOI: 10.1007/s10729-022-09627-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 12/19/2022] [Indexed: 03/14/2023]

Zhu A, Chiba S, Shimizu Y, Kunitake K, Okuno Y, Aoki Y, Yokota T. Ensemble-Learning and Feature Selection Techniques for Enhanced Antisense Oligonucleotide Efficacy Prediction in Exon Skipping. Pharmaceutics 2023;15:1808. [PMID: 37513994 PMCID: PMC10384346 DOI: 10.3390/pharmaceutics15071808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 06/13/2023] [Accepted: 06/15/2023] [Indexed: 07/30/2023] Open

Nguyen HD, Van CP, Nguyen TG, Dang DK, Pham TTN, Nguyen QH, Bui QT. Soil salinity prediction using hybrid machine learning and remote sensing in Ben Tre province on Vietnam's Mekong River Delta. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023:10.1007/s11356-023-27516-x. [PMID: 37204580 DOI: 10.1007/s11356-023-27516-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 05/04/2023] [Indexed: 05/20/2023]

Abstract

Soil salinization is considered one of the disasters that have significant effects on agricultural activities in many parts of the world, particularly in the context of climate change and sea level rise. This problem has become increasingly essential and severe in the Mekong River Delta of Vietnam. Therefore, soil salinity monitoring and assessment are critical to building appropriate strategies to develop agricultural activities. This study aims to develop a low-cost method based on machine learning and remote sensing to map soil salinity in Ben Tre province, which is located in Vietnam's Mekong River Delta. This objective was achieved by using six machine learning algorithms, including Xgboost (XGR), sparrow search algorithm (SSA), bird swarm algorithm (BSA), moth search algorithm (MSA), Harris hawk optimization (HHO), grasshopper optimization algorithm (GOA), particle swarm optimization algorithm (PSO), and 43 factors extracted from remote sensing images. Various indices were used, namely, root mean square error (RMSE), mean absolute error (MAE), and the coefficient of determination (R²) to estimate the efficiency of the prediction models. The results show that six optimization algorithms successfully improved XGR model performance with an R² value of more than 0.98. Among the proposed models, the XGR-HHO model was better than the other models with a value of R² of 0.99 and a value of RMSE of 0.051, by XGR-GOA (R² = 0.931, RMSE = 0.055), XGR-MSA (R² = 0.928, RMSE = 0.06), XGR-BSA (R² = 0.926, RMSE = 0.062), XGR-SSA (R² = 0.917, 0.07), XGR-PSO (R² = 0.916, RMSE = 0.08), XGR (R² = 0.867, RMSE = 0.1), CatBoost (R² = 0.78, RMSE = 0.12), and RF (R² = 0.75, RMSE = 0.19), respectively. These proposed models have surpassed the reference models (CatBoost and random forest). The results indicated that the soils in the eastern areas of Ben Tre province are more saline than in the western areas. The results of this study highlighted the effectiveness of using hybrid machine learning and remote sensing in soil salinity monitoring. The finding of this study provides essential tools to support farmers and policymakers in selecting appropriate crop types in the context of climate change to ensure food security.

Collapse

Rajput J, Singh M, Lal K, Khanna M, Sarangi A, Mukherjee J, Singh S. Assessment of data intelligence algorithms in modeling daily reference evapotranspiration under input data limitation scenarios in semi-arid climatic condition. WATER SCIENCE AND TECHNOLOGY : A JOURNAL OF THE INTERNATIONAL ASSOCIATION ON WATER POLLUTION RESEARCH 2023;87:2504-2528. [PMID: 37257106 PMCID: wst_2023_137 DOI: 10.2166/wst.2023.137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Al-Masnay YA, Al-Areeq NM, Ullah K, Al-Aizari AR, Rahman M, Wang C, Zhang J, Liu X. Estimate earth fissure hazard based on machine learning in the Qa' Jahran Basin, Yemen. Sci Rep 2022;12:21936. [PMID: 36536056 PMCID: PMC9763334 DOI: 10.1038/s41598-022-26526-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2022] Open

Affiliation(s)

Yousef A. Al-Masnay grid.27446.330000 0004 1789 9163Institute of Natural Disaster Research, School of Environment, Northeast Normal University, Changchun, 130024 People’s Republic of China ,7grid.216417.70000 0001 0379 7164Department of Surveying and Remote Sensing, School of Geosciences and Info-Physics, Central South University, Changsha, 410083 China
Nabil M. Al-Areeq grid.444928.70000 0000 9908 6529Department of Geology and Environment, Thamar University, Thamar, Yemen
Kashif Ullah grid.503241.10000 0004 1760 9015Institute of Geophysics and Geomatics, China University of Geosciences, Wuhan, People’s Republic of China
Ali R. Al-Aizari grid.33763.320000 0004 1761 2484Institute of Surface-Earth System Science, School of Earth System Science, Tianjin University, Tianjin, 300072 China
Mahfuzur Rahman grid.443015.70000 0001 2222 8047Department of Civil Engineering, International University of Business Agriculture and Technology (IUBAT), Dhaka, 1230 Bangladesh
Changcheng Wang grid.216417.70000 0001 0379 7164Department of Surveying and Remote Sensing, School of Geosciences and Info-Physics, Central South University, Changsha, 410083 China
Jiquan Zhang grid.27446.330000 0004 1789 9163Institute of Natural Disaster Research, School of Environment, Northeast Normal University, Changchun, 130024 People’s Republic of China ,2grid.27446.330000 0004 1789 9163Key Laboratory for Vegetation Ecology, Ministry of Education, Changchun, 130024 People’s Republic of China ,3grid.27446.330000 0004 1789 9163State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130024 People’s Republic of China
Xingpeng Liu grid.27446.330000 0004 1789 9163Institute of Natural Disaster Research, School of Environment, Northeast Normal University, Changchun, 130024 People’s Republic of China ,2grid.27446.330000 0004 1789 9163Key Laboratory for Vegetation Ecology, Ministry of Education, Changchun, 130024 People’s Republic of China ,3grid.27446.330000 0004 1789 9163State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, Northeast Normal University, Changchun, 130024 People’s Republic of China

Collapse

Karaman MO, Çabuk SN, Pekkan E. Utilization of frequency ratio method for the production of landslide susceptibility maps: Karaburun Peninsula case, Turkey. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022;29:91285-91305. [PMID: 35882738 DOI: 10.1007/s11356-022-21931-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 07/05/2022] [Indexed: 06/15/2023]

Pal S, Paul S, Debanshi S. Identifying sensitivity of factor cluster based gully erosion susceptibility models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022;29:90964-90983. [PMID: 35881291 DOI: 10.1007/s11356-022-22063-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 07/13/2022] [Indexed: 06/15/2023]

Hybrid machine learning approach for landslide prediction, Uttarakhand, India. Sci Rep 2022;12:20101. [PMID: 36418362 PMCID: PMC9684430 DOI: 10.1038/s41598-022-22814-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Accepted: 10/19/2022] [Indexed: 11/24/2022] Open

Abstract

Natural disasters always have a damaging effect on our way of life. Landslides cause serious damage to both human and natural resources around the world. In this paper, the prediction accuracy of five hybrid models for landslide occurrence in the Uttarkashi, Uttarakhand (India) was evaluated and compared. In this approach, the Rough Set theory coupled with five different models namely Bayesian Network (HBNRS), Backpropagation Neural Network (HBPNNRS), Bagging (HBRS), XGBoost (HXGBRS), and Random Forest (HRFRS) were taken into account. The database for the models development was prepared using fifteen conditioning factors that had 373 landslide and 181 non-landslide locations that were then randomly divided into training and testing locations with a ratio of 75%:25%. The appropriateness and predictability of these conditioning factors were assessed using the multi-collinearity test and the least absolute shrinkage and selection operator approach. The accuracy, sensitivity, specificity, precision, and F-Measures, and the area under the curve (AUC)-receiver operating characteristics curve, were used to evaluate and compare the performance of the individual and hybrid created models. The findings indicate that the constructed hybrid model HXGBRS (AUC = 0.937, Precision = 0.946, F1-score = 0.926 and Accuracy = 89.92%) is the most accurate model for predicting landslides when compared to other models (HBPNNRS, HBNRS, HBRS, and HRFRS). Importantly, when the fusion is performed with the rough set method, the prediction capability of each model is improved. Simultaneously, the HXGBRS model proposed shows superior stability and can effectively avoid overfitting. After the core modules were developed, the user-friendly platform was designed as an integrated GIS environment using dynamic maps for effective landslide prediction in large prone areas. Users can predict the probability of landslide occurrence for selected region by changing the values of a conditioning factors. The created approach could be beneficial for predicting the impact of landslides on slopes and tracking landslides along national routes.

Collapse

Predictive Modeling for the Diagnosis of Gestational Diabetes Mellitus Using Epidemiological Data in the United Arab Emirates. INFORMATION 2022. [DOI: 10.3390/info13100485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Kim HM, Byun SS, Kim JK, Jeong CW, Kwak C, Hwang EC, Kang SH, Chung J, Kim YJ, Ha YS, Hong SH. Machine learning-based prediction model for late recurrence after surgery in patients with renal cell carcinoma. BMC Med Inform Decis Mak 2022;22:241. [PMID: 36100881 PMCID: PMC9472380 DOI: 10.1186/s12911-022-01964-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 07/21/2022] [Indexed: 11/24/2022] Open

Cardiovascular Disease Detection using Ensemble Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:5267498. [PMID: 36017452 PMCID: PMC9398727 DOI: 10.1155/2022/5267498] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 06/23/2022] [Accepted: 06/28/2022] [Indexed: 11/18/2022]

Gopukumar D, Ghoshal A, Zhao H. A Machine Learning Approach for Predicting Readmission Charges Billed by Hospitals. JMIR Med Inform 2022;10:e37578. [PMID: 35896038 PMCID: PMC9472041 DOI: 10.2196/37578] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 05/02/2022] [Accepted: 07/26/2022] [Indexed: 11/29/2022] Open

Abstract

Background

The Centers for Medicare and Medicaid Services projects that health care costs will continue to grow over the next few years. Rising readmission costs contribute significantly to increasing health care costs. Multiple areas of health care, including readmissions, have benefited from the application of various machine learning algorithms in several ways.

Objective

We aimed to identify suitable models for predicting readmission charges billed by hospitals. Our literature review revealed that this application of machine learning is underexplored. We used various predictive methods, ranging from glass-box models (such as regularization techniques) to black-box models (such as deep learning–based models).

Methods

We defined readmissions as readmission with the same major diagnostic category (RSDC) and all-cause readmission category (RADC). For these readmission categories, 576,701 and 1,091,580 individuals, respectively, were identified from the Nationwide Readmission Database of the Healthcare Cost and Utilization Project by the Agency for Healthcare Research and Quality for 2013. Linear regression, lasso regression, elastic net, ridge regression, eXtreme gradient boosting (XGBoost), and a deep learning model based on multilayer perceptron (MLP) were the 6 machine learning algorithms we tested for RSDC and RADC through 10-fold cross-validation.

Results

Our preliminary analysis using a data-driven approach revealed that within RADC, the subsequent readmission charge billed per patient was higher than the previous charge for 541,090 individuals, and this number was 319,233 for RSDC. The top 3 major diagnostic categories (MDCs) for such instances were the same for RADC and RSDC. The average readmission charge billed was higher than the previous charge for 21 of the MDCs in the case of RSDC, whereas it was only for 13 of the MDCs in RADC. We recommend XGBoost and the deep learning model based on MLP for predicting readmission charges. The following performance metrics were obtained for XGBoost: (1) RADC (mean absolute percentage error [MAPE]=3.121%; root mean squared error [RMSE]=0.414; mean absolute error [MAE]=0.317; root relative squared error [RRSE]=0.410; relative absolute error [RAE]=0.399; normalized RMSE [NRMSE]=0.040; mean absolute deviation [MAD]=0.031) and (2) RSDC (MAPE=3.171%; RMSE=0.421; MAE=0.321; RRSE=0.407; RAE=0.393; NRMSE=0.041; MAD=0.031). The performance obtained for MLP-based deep neural networks are as follows: (1) RADC (MAPE=3.103%; RMSE=0.413; MAE=0.316; RRSE=0.410; RAE=0.397; NRMSE=0.040; MAD=0.031) and (2) RSDC (MAPE=3.202%; RMSE=0.427; MAE=0.326; RRSE=0.413; RAE=0.399; NRMSE=0.041; MAD=0.032). Repeated measures ANOVA revealed that the mean RMSE differed significantly across models with P<.001. Post hoc tests using the Bonferroni correction method indicated that the mean RMSE of the deep learning/XGBoost models was statistically significantly (P<.001) lower than that of all other models, namely linear regression/elastic net/lasso/ridge regression.

Conclusions

Models built using XGBoost and MLP are suitable for predicting readmission charges billed by hospitals. The MDCs allow models to accurately predict hospital readmission charges.

Collapse

Highway Proneness Appraisal to Landslides along Taiping to Ipoh Segment Malaysia, Using MCDM and GIS Techniques. SUSTAINABILITY 2022. [DOI: 10.3390/su14159096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]

Landslide Susceptibility Mapping of Landslides with Artificial Neural Networks: Multi-Approach Analysis of aBackpropagation Algorithm Applying the Neuralnet Package in Cuenca, Ecuador. REMOTE SENSING 2022. [DOI: 10.3390/rs14143495] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Minea G, Ciobotaru N, Ioana-Toroimac G, Mititelu-Ionuș O, Neculau G, Gyasi-Agyei Y, Rodrigo-Comino J. Designing grazing susceptibility to land degradation index (GSLDI) in hilly areas. Sci Rep 2022;12:9393. [PMID: 35729181 PMCID: PMC9213453 DOI: 10.1038/s41598-022-13596-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 05/25/2022] [Indexed: 11/09/2022] Open

Abe D, Inaji M, Hase T, Takahashi S, Sakai R, Ayabe F, Tanaka Y, Otomo Y, Maehara T. A Prehospital Triage System to Detect Traumatic Intracranial Hemorrhage Using Machine Learning Algorithms. JAMA Netw Open 2022;5:e2216393. [PMID: 35687335 PMCID: PMC9187955 DOI: 10.1001/jamanetworkopen.2022.16393] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

Abstract

IMPORTANCE

An adequate system for triaging patients with head trauma in prehospital settings and choosing optimal medical institutions is essential for improving the prognosis of these patients. To our knowledge, there has been no established way to stratify these patients based on their head trauma severity that can be used by ambulance crews at an injury site.

OBJECTIVES

To develop a prehospital triage system to stratify patients with head trauma according to trauma severity by using several machine learning techniques and to evaluate the predictive accuracy of these techniques.

DESIGN, SETTING, AND PARTICIPANTS

This single-center retrospective cohort study was conducted by reviewing the electronic medical records of consecutive patients who were transported to Tokyo Medical and Dental University Hospital in Japan from April 1, 2018, to March 31, 2021. Patients younger than 16 years with cardiopulmonary arrest on arrival or with a significant amount of missing data were excluded.

MAIN OUTCOMES AND MEASURES

Machine learning-based prediction models to detect the presence of traumatic intracranial hemorrhage were constructed. The predictive accuracy of the models was evaluated with the area under the receiver operating curve (ROC-AUC), area under the precision recall curve (PR-AUC), sensitivity, specificity, and other representative statistics.

RESULTS

A total of 2123 patients (1527 male patients [71.9%]; mean [SD] age, 57.6 [19.8] years) with head trauma were enrolled in this study. Traumatic intracranial hemorrhage was detected in 258 patients (12.2%). Among several machine learning algorithms, extreme gradient boosting (XGBoost) achieved the mean (SD) highest ROC-AUC (0.78 [0.02]) and PR-AUC (0.46 [0.01]) in cross-validation studies. In the testing set, the ROC-AUC was 0.80, the sensitivity was 74.0% (95% CI, 59.7%-85.4%), and the specificity was 74.9% (95% CI, 70.2%-79.3%). The prediction model using the National Institute for Health and Care Excellence (NICE) guidelines, which was calculated after consultation with physicians, had a sensitivity of 72.0% (95% CI, 57.5%-83.8%) and a specificity of 73.3% (95% CI, 68.7%-77.7%). The McNemar test revealed no statistically significant differences between the XGBoost algorithm and the NICE guidelines for sensitivity or specificity (P = .80 and P = .55, respectively).

CONCLUSIONS AND RELEVANCE

In this cohort study, the prediction model achieved a comparatively accurate performance in detecting traumatic intracranial hemorrhage using only the simple pretransportation information from the patient. Further validation with a prospective multicenter data set is needed.

Collapse

A Comparative Assessment of Machine Learning Models for Landslide Susceptibility Mapping in the Rugged Terrain of Northern Pakistan. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12052280] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Abstract This study investigated the performances of different techniques, including random forest (RF), support vector machine (SVM), maximum entropy (maxENT), gradient-boosting machine (GBM), and logistic regression (LR), for landslide susceptibility mapping (LSM) in the rugged terrain of northern Pakistan. Initially, a landslide inventory of 200 samples was produced along with an additional 200 samples indicating nonlandslide areas and divided into training (70%) and validation (30%) groups using a stratified loop-based random sampling approach. Then, a geospatial database of 12 possible landslide influencing factors (LIFs) was generated, including elevation, slope, aspect, topographic wetness index (TWI), topographic position index (TPI), distance to drainage, distance to fault, distance to road, normalized difference vegetation index (NDVI), rainfall, land cover/land use (LCLU), and a geological map of the study area. None of the LIFs were redundant for the modeling, as indicated by the multicollinearity test (tolerance > 0.1) and information gain ratio (IGR > 0). We extended the evaluation measures of each algorithm from area-under-the-curve (AUC) analysis to the calculation of performance overall (POA) with the help of precision, recall, F1 score, accuracy (ACC), and Matthew’s correlation coefficient (MCC). The results showed that the SVM was the most promising model (AUC = 0.969, POA = 2669) for the LSM, followed by RF (AUC = 0.967, POA = 2656), GBM (AUC = 0.967, POA = 2623), maxENT (AUC = 0.872, POA = 1761), and LR (AUC = 0.836, POA = 1299). It is important to note that the SVM, RF, and GBM were the top performers, with almost similar accuracy. Thus, each of these could be equally effective for LSM and can be used for risk reduction and mitigation measures in the rugged terrain of Pakistan and other regions with similar topography. Collapse

Wang B, Han X, Zhao Z, Wang N, Zhao P, Li M, Zhang Y, Zhao T, Chen Y, Ren Z, Hong Y. EEG-Driven Prediction Model of Oxcarbazepine Treatment Outcomes in Patients With Newly-Diagnosed Focal Epilepsy. Front Med (Lausanne) 2022;8:781937. [PMID: 35047529 PMCID: PMC8761908 DOI: 10.3389/fmed.2021.781937] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 12/06/2021] [Indexed: 11/27/2022] Open

Abstract

Objective: Antiseizure medicine (ASM) is the first choice for patients with epilepsy. The choice of ASM is determined by the type of epilepsy or epileptic syndrome, which may not be suitable for certain patients. This initial choice of a particular drug affects the long-term prognosis of patients, so it is critical to select the appropriate ASMs based on the individual characteristics of a patient at the early stage of the disease. The purpose of this study is to develop a personalized prediction model to predict the probability of achieving seizure control in patients with focal epilepsy, which will help in providing a more precise initial medication to patients.

Methods: Based on response to oxcarbazepine (OXC), enrolled patients were divided into two groups: seizure-free (52 patients), not seizure-free (NSF) (22 patients). We created models to predict patients' response to OXC monotherapy by combining Electroencephalogram (EEG) complexities and 15 clinical features. The prediction models were gradient boosting decision tree-Kolmogorov complexity (GBDT-KC) and gradient boosting decision tree-Lempel-Ziv complexity (GBDT-LZC). We also constructed two additional prediction models, support vector machine-Kolmogorov complexity (SVM-KC) and SVM-LZC, and these two models were compared with the GBDT models. The performance of the models was evaluated by calculating the accuracy, precision, recall, F1-score, sensitivity, specificity, and area under the curve (AUC) of these models.

Results: The mean accuracy, precision, recall, F1-score, sensitivity, specificity, AUC of GBDT-LZC model after five-fold cross-validation were 81%, 84%, 91%, 87%, 91%, 64%, 81%, respectively. The average accuracy, precision, recall, F1-score, sensitivity, specificity, AUC of GBDT-KC model with five-fold cross-validation were 82%, 84%, 92%, 88%, 83%, 92%, 83%, respectively. We used the rank of absolute weights to separately calculate the features that have the most significant impact on the classification of the two models.

Conclusion: (1) The GBDT-KC model has the potential to be used in the clinic to predict seizure-free with OXC monotherapy. (2). Electroencephalogram complexity, especially Kolmogorov complexity (KC) may be a potential biomarker in predicting the treatment efficacy of OXC in newly diagnosed patients with focal epilepsy.

Collapse

Predictive Performances of Ensemble Machine Learning Algorithms in Landslide Susceptibility Mapping Using Random Forest, Extreme Gradient Boosting (XGBoost) and Natural Gradient Boosting (NGBoost). ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2022. [DOI: 10.1007/s13369-022-06560-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Pal S, Paul S. Linking hydrological security and landscape insecurity in the moribund deltaic wetland of India using tree-based hybrid ensemble method in python. ECOL INFORM 2021. [DOI: 10.1016/j.ecoinf.2021.101422] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Yang H, Huang K, Zhang K, Weng Q, Zhang H, Wang F. Predicting Heavy Metal Adsorption on Soil with Machine Learning and Mapping Global Distribution of Soil Adsorption Capacities. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2021;55:14316-14328. [PMID: 34617744 DOI: 10.1021/acs.est.1c02479] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Gully Erosion Susceptibility Mapping in Highly Complex Terrain Using Machine Learning Models. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2021. [DOI: 10.3390/ijgi10100680] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

An emerging machine learning strategy for the assisted‐design of high-performance supercapacitor materials by mining the relationship between capacitance and structural features of porous carbon. J Electroanal Chem (Lausanne) 2021. [DOI: 10.1016/j.jelechem.2021.115684] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]

The accuracy versus interpretability trade-off in fraud detection model. DATA & POLICY 2021. [DOI: 10.1017/dap.2021.3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

The Predictive Capability of a Novel Ensemble Tree-Based Algorithm for Assessing Groundwater Potential. SUSTAINABILITY 2021. [DOI: 10.3390/su13052459] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Qiu W, Lv Z, Hong Y, Jia J, Xiao X. BOW-GBDT: A GBDT Classifier Combining With Artificial Neural Network for Identifying GPCR-Drug Interaction Based on Wordbook Learning From Sequences. Front Cell Dev Biol 2021;8:623858. [PMID: 33598456 PMCID: PMC7882597 DOI: 10.3389/fcell.2020.623858] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 12/15/2020] [Indexed: 12/28/2022] Open

Abstract

Background: As a class of membrane protein receptors, G protein-coupled receptors (GPCRs) are very important for cells to complete normal life function and have been proven to be a major drug target for widespread clinical application. Hence, it is of great significance to find GPCR targets that interact with drugs in the process of drug development. However, identifying the interaction of the GPCR–drug pairs by experimental methods is very expensive and time-consuming on a large scale. As more and more database about GPCR–drug pairs are opened, it is viable to develop machine learning models to accurately predict whether there is an interaction existing in a GPCR–drug pair.

Methods: In this paper, the proposed model aims to improve the accuracy of predicting the interactions of GPCR–drug pairs. For GPCRs, the work extracts protein sequence features based on a novel bag-of-words (BOW) model improved with weighted Silhouette Coefficient and has been confirmed that it can extract more pattern information and limit the dimension of feature. For drug molecules, discrete wavelet transform (DWT) is used to extract features from the original molecular fingerprints. Subsequently, the above-mentioned two types of features are contacted, and SMOTE algorithm is selected to balance the training dataset. Then, artificial neural network is used to extract features further. Finally, a gradient boosting decision tree (GBDT) model is trained with the selected features. In this paper, the proposed model is named as BOW-GBDT.

Results: D92M and Check390 are selected for testing BOW-GBDT. D92M is used for a cross-validation dataset which contains 635 interactive GPCR–drug pairs and 1,225 non-interactive pairs. Check390 is used for an independent test dataset which consists of 130 interactive GPCR–drug pairs and 260 non-interactive GPCR–drug pairs, and each element in Check390 cannot be found in D92M. According to the results, the proposed model has a better performance in generation ability compared with the existing machine learning models.

Conclusion: The proposed predictor improves the accuracy of the interactions of GPCR–drug pairs. In order to facilitate more researchers to use the BOW-GBDT, the predictor has been settled into a brand-new server, which is available at http://www.jci-bioinfo.cn/bowgbdt.

Collapse

Risk Assessment of Resources Exposed to Rainfall Induced Landslide with the Development of GIS and RS Based Ensemble Metaheuristic Machine Learning Algorithms. SUSTAINABILITY 2021. [DOI: 10.3390/su13020457] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Abstract Disastrous natural hazards, such as landslides, floods, and forest fires cause a serious threat to natural resources, assets and human lives. Consequently, landslide risk assessment has become requisite for managing the resources in future. This study was designed to develop four ensemble metaheuristic machine learning algorithms, such as grey wolf optimized based artificial neural network (GW-ANN), grey wolf optimized based random forest (GW-RF), particle swarm optimization optimized based ANN (PSO-ANN), and PSO optimized based RF for modeling rainfall-induced landslide susceptibility (LS) in Aqabat Al-Sulbat, Asir region, Saudi Arabia, which observes landslide frequently. To obtain very high precision and robust prediction from machine learning algorithms, the grey wolf and PSO optimization algorithms were integrated to develop new ensemble machine learning techniques. Subsequently, LS maps produced by training dataset were validated using the receiver operating characteristics (ROC) curve based on the testing dataset. Based on the area under curve (AUC) value of ROC curve, the best method for LS modeling was selected. We developed ROC curve-based sensitivity analysis to investigate the influence of the parameters for LS modeling. The Gumble extreme value distribution was employed to estimate the rainfall at 2, 5, 10, 20, 50, and 100 year return periods. Then, the landslide hazard maps were prepared at different return periods by integrating the best LS model and estimated rainfall at different return periods. The theory of danger pixels was employed to prepare a final risk assessment of the resources, which have been exposed to the landslide. The results showed that 27–42 and 6–15 km2 were predicted as the very high and high LS zones using four ensemble metaheuristic machine learning algorithms. Based on the area under curve (AUC) of ROC, GR-ANN (AUC-0.905) appeared as the best model for LS modeling. The areas under high and very high landslide hazard were gradually increased over the progression of time (26 km2 at the 2 year return period and 40 km2 at the 100 year return period for the high landslide hazard zone, and 6 km2 at the 2 year return period and 20 km2 at the 100 year return period for the very high landslide hazard zone). Similarly, the areas of danger pixel also increased gradually from the 2 to 100 year return periods (37 km2 to 62 km2). Various natural resources, such as scrubland, built up, and sparse vegetation, were identified under risk zone due to landslide hazards. In addition, these resources would be exposed extensively to landslides over the advancement of return periods. Therefore, the outcome of the present study will help planners and scientists to propose high precision management plans for protecting natural resources, which have been exposed to landslides. Collapse

Mapping Landslide Susceptibility Using Machine Learning Algorithms and GIS: A Case Study in Shexian County, Anhui Province, China. Symmetry (Basel) 2020. [DOI: 10.3390/sym12121954] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

Abstract In this study, Logistics Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting Machine (GBM), and Multilayer Perceptron (MLP) machine learning algorithms are combined with GIS techniques to map landslide susceptibility in Shexian County, China. By using satellite images and various topographic and geological maps, 16 landslide susceptibility factor maps of Shexian County were initially constructed. In total, 502 landslide and random safety points were then using the “Extract Multivalues To Points” tool in ArcGIS, parameters for the 16 factors were extracted and imported into models for the five algorithms, of which 70% of samples were used for training and 30% of samples were used for verification, which makes sense for date symmetry. The Shexian grid was converted into 260130 vector points and imported into the five models, and the natural breakpoint method was used to divide the grid into four levels: low, moderate, high, and very high. Finally, by using column results gained using Area Under Curve (AUC) analysis and a grid chart, susceptibility results for mapping landslide prediction in Shexian County was compared using the five methods. Results indicate that the ratio of landslide points of high or very high levels from LR, SVM, RF, GBM, and MLP was 1.52, 1.77, 1.95, 1.83, and 1.64, and the ratio of very high landslide points to grade area was 1.92, 2.20, 2.98, 2.62, and 2.14, respectively. The success rate of training samples for the five methods was 0.781, 0.824, 0.853, 0.828, and 0.811, and prediction accuracy was 0.772, 0.803, 0.821, 0.815, and 0.803, respectively; the order of accuracy of the five algorithms was RF > SVM > MLP > GBM > LR. Our results indicate that the five machine learning algorithms have good effect on landslide susceptibility evaluation in Shexian area, with Random Forest having the best effect. Collapse