1
|
Razavi-Termeh SV, Sadeghi-Niaraki A, Sorooshian A, Abuhmed T, Choi SM. Spatial mapping of land susceptibility to dust emissions using optimization of attentive Interpretable Tabular Learning (TabNet) model. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 358:120682. [PMID: 38670008 DOI: 10.1016/j.jenvman.2024.120682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 03/13/2024] [Accepted: 03/14/2024] [Indexed: 04/28/2024]
Abstract
Dust pollution poses significant risks to human health, air quality, and food safety, necessitating the identification of dust occurrence and the development of dust susceptibility maps (DSMs) to mitigate its effects. This research aims to detect dust occurrence using satellite images and prepare a DSM for Bushehr province, Iran, by enhancing the attentive interpretable tabular learning (TabNet) model through three swarm-based metaheuristic algorithms: particle swarm optimization (PSO), grey wolf optimizer (GWO), and hunger games search (HGS). A spatial database incorporating dust occurrence areas was created using Moderate Resolution Imaging Spectroradiometer (MODIS) images from 2002 to 2022, including 15 influential criteria related to climate, soil, topography, and land cover. Four models were employed for modeling and DSM generation: TabNet, TabNet-PSO, TabNet-GWO, and TabNet-HGS. Evaluation of the modeling results using performance metrics indicated that the TabNet-HGS model outperformed the other models in both training (mean absolute error (MAE) = 0.055, root-mean-square error (RMSE) = 0.1, coefficient of determination (R2) = 0.959), and testing (MAE = 0.063, RMSE = 0.114, R2 = 0.947) data. Following TabNet-HGS, the TabNet-PSO, TabNet-GWO, and TabNet models demonstrated progressively lower accuracy. The validation of the DSM was performed by assessing receiver operating characteristic (ROC) curves, revealing that the TabNet-HGS, TabNet-PSO, TabNet-GWO, and TabNet models exhibited the highest modeling accuracy, with corresponding area under the curve (AUC) values of 0.994, 0.986, 0.98, and 0.832, respectively. These results highlight the enhanced accuracy of dust susceptibility modeling achieved by integrating swarm-based metaheuristic algorithms with the TabNet model. The dust susceptibility map provides valuable insights into the sources, pathways, and impacts of dust particles on the environment and human health in the study area.
Collapse
Affiliation(s)
- Seyed Vahid Razavi-Termeh
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| | - Abolghasem Sadeghi-Niaraki
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| | - Armin Sorooshian
- Department of Chemical and Environmental Engineering, University of Arizona, Tucson, AZ, USA.
| | - Tamer Abuhmed
- College of Computing and Informatics, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
| | - Soo-Mi Choi
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| |
Collapse
|
2
|
Zhang L, Yu R, Chen K, Zhang Y, Li Q, Chen Y. Enhancing deep vein thrombosis prediction in patients with coronavirus disease 2019 using improved machine learning model. Comput Biol Med 2024; 173:108294. [PMID: 38537565 DOI: 10.1016/j.compbiomed.2024.108294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 02/21/2024] [Accepted: 03/12/2024] [Indexed: 04/17/2024]
Abstract
BACKGROUND Deep vein thrombosis (DVT) is a significant complication in coronavirus disease 2019 patients, arising from coagulation issues in the deep venous system. Among 424 scheduled patients, 202 developed DVT (47.64%). DVT increases hospitalization risk, and complications, and impacts prognosis. Accurate prognostication and timely intervention are crucial to prevent DVT progression and improve patient outcomes. METHODS This study introduces an effective DVT prediction model, named bSES-AC-RUN-FKNN, which integrates fuzzy k-nearest neighbor (FKNN) with enhanced Runge-Kutta optimizer (RUN). Recognizing the insufficient effectiveness of RUN in local search capability and its convergence accuracy, spherical evolutionary search (SES) and differential evolution-inspired knowledge adaptive crossover (AC) are incorporated, termed SES-AC-RUN, to enhance its optimization capability. RESULTS Based on the benchmark set by CEC 2017 and comparative analyses with several peers, it is evident that SES-AC-RUN significantly enhances search performance compared to traditional RUN, even standing comparably against leading championship algorithms. The proposed bSES-AC-RUN-FKNN model was applied to predict a dataset comprising 424 cases of DVT patients, totaling 7208 records. Remarkably, the model demonstrates outstanding accuracy, reaching 91.02%, alongside commendable sensitivity at 91.07%. CONCLUSIONS The bSES-AC-RUN-FKNN emerges as a robust and efficient predictive tool, significantly enhancing the accuracy of DVT prediction. This model can be used to manage the risk of thrombosis in the care of COVID-19 patients. Nursing staff can combine the model's predictions with clinical judgment to formulate comprehensive treatment approaches.
Collapse
Affiliation(s)
- Lufang Zhang
- The First Clinical College, Wenzhou Medical University, Wenzhou, 325000, China.
| | - Renyue Yu
- Cardiac Care Unit, Sir RUN RUN Shaw Hospital, Hangzhou, 310000, China.
| | - Keya Chen
- The First Clinical College, Wenzhou Medical University, Wenzhou, 325000, China.
| | - Ying Zhang
- Wenzhou Medical University School of Nursing, 325000, Wenzhou, 325000, China; Cixi Biomedical Research Institute, Wenzhou Medical University, Cixi, 315300, China.
| | - Qiang Li
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.
| | - Yu Chen
- Nursing Department, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
3
|
Xing J, Li C, Wu P, Cai X, Ouyang J. Optimized fuzzy K-nearest neighbor approach for accurate lung cancer prediction based on radial endobronchial ultrasonography. Comput Biol Med 2024; 171:108038. [PMID: 38442552 DOI: 10.1016/j.compbiomed.2024.108038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 01/02/2024] [Accepted: 01/26/2024] [Indexed: 03/07/2024]
Abstract
Radial endobronchial ultrasonography (R-EBUS) has been a surge in the development of new ultrasonography for the diagnosis of pulmonary diseases beyond the central airway. However, it faces challenges in accurately pinpointing the location of abnormal lesions. Therefore, this study proposes an improved machine learning model aimed at distinguishing between malignant lung disease (MLD) from benign lung disease (BLD) through R-EBUS features. An enhanced manta ray foraging optimization based on elite perturbation search and cyclic mutation strategy (ECMRFO) is introduced at first. Experimental validation on 29 test functions from CEC 2017 demonstrates that ECMRFO exhibits superior optimization capabilities and robustness compared to other competing algorithms. Subsequently, it was combined with fuzzy k-nearest neighbor for the classification prediction of BLD and MLD. Experimental results indicate that the proposed modal achieves a remarkable prediction accuracy of up to 99.38%. Additionally, parameters such as R-EBUS1 Circle-dense sign, R-EBUS2 Hemi-dense sign, R-EBUS5 Onionskin sign and CCT5 mediastinum lymph node are identified as having significant clinical diagnostic value.
Collapse
Affiliation(s)
- Jie Xing
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou, 325035, China.
| | - Chengye Li
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| | - Peiliang Wu
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| | - Xueding Cai
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| | - Jinsheng Ouyang
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
4
|
Lin Y, Heidari AA, Wang S, Chen H, Zhang Y. An Enhanced Hunger Games Search Optimization with Application to Constrained Engineering Optimization Problems. Biomimetics (Basel) 2023; 8:441. [PMID: 37754192 PMCID: PMC10526405 DOI: 10.3390/biomimetics8050441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/07/2023] [Accepted: 09/12/2023] [Indexed: 09/28/2023] Open
Abstract
The Hunger Games Search (HGS) is an innovative optimizer that operates without relying on gradients and utilizes a population-based approach. It draws inspiration from the collaborative foraging activities observed in social animals in their natural habitats. However, despite its notable strengths, HGS is subject to limitations, including inadequate diversity, premature convergence, and susceptibility to local optima. To overcome these challenges, this study introduces two adjusted strategies to enhance the original HGS algorithm. The first adaptive strategy combines the Logarithmic Spiral (LS) technique with Opposition-based Learning (OBL), resulting in the LS-OBL approach. This strategy plays a pivotal role in reducing the search space and maintaining population diversity within HGS, effectively augmenting the algorithm's exploration capabilities. The second adaptive strategy, the dynamic Rosenbrock Method (RM), contributes to HGS by adjusting the search direction and step size. This adjustment enables HGS to escape from suboptimal solutions and enhances its convergence accuracy. Combined, these two strategies form the improved algorithm proposed in this study, referred to as RLHGS. To assess the efficacy of the introduced strategies, specific experiments are designed to evaluate the impact of LS-OBL and RM on enhancing HGS performance. The experimental results unequivocally demonstrate that integrating these two strategies significantly enhances the capabilities of HGS. Furthermore, RLHGS is compared against eight state-of-the-art algorithms using 23 well-established benchmark functions and the CEC2020 test suite. The experimental results consistently indicate that RLHGS outperforms the other algorithms, securing the top rank in both test suites. This compelling evidence substantiates the superior functionality and performance of RLHGS compared to its counterparts. Moreover, RLHGS is applied to address four constrained real-world engineering optimization problems. The final results underscore the effectiveness of RLHGS in tackling such problems, further supporting its value as an efficient optimization method.
Collapse
Affiliation(s)
- Yaoyao Lin
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China; (Y.L.); (A.A.H.)
| | - Ali Asghar Heidari
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China; (Y.L.); (A.A.H.)
| | - Shuihua Wang
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK;
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou 325035, China; (Y.L.); (A.A.H.)
| | - Yudong Zhang
- School of Computing and Mathematical Sciences, University of Leicester, Leicester LE1 7RH, UK;
| |
Collapse
|
5
|
Hamed A, Mohamed MF. A feature selection framework for anxiety disorder analysis using a novel multiview harris hawk optimization algorithm. Artif Intell Med 2023; 143:102605. [PMID: 37673574 DOI: 10.1016/j.artmed.2023.102605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 05/30/2023] [Accepted: 06/05/2023] [Indexed: 09/08/2023]
Abstract
Machine learning (ML) has demonstrated its ability to exploit important relationships within data collection, which can be used in the diagnosis, treatment, and prediction of outcomes in a variety of clinical contexts. Anxiety mental disorder analysis is one of the pending difficulties that ML can help with. A thorough study is demanded to gain a better understanding of this illness. Since the anxiety data is generally multidimensional, which complicates processing and as a result of technology improvements, medical data from several perspectives, known as multiview data (MVD), is being collected. Each view has its own data type and feature values, so there is a lot of diversity. This work introduces a novel preprocessing feature selection (FS) approach, multiview harris hawk optimization (MHHO), which has the potential to reduce the dimensionality of anxiety data, hence reducing analytical effort. The uniqueness of MHHO originates from combining a multiview linking methodology with the power of the harris hawk optimization (HHO) method. The HHO is used to identify the lowest optimal MVD feature subset, while multiview linking is utilized to find a promising fitness function to direct the HHO FS while accounting for all data views' heterogeneity. The complexity of MHHO is O(THL2), where T is the number of iterations, H is the number of involved harris hawks, and L is the number of objects. Using two publicly available anxiety MVDs, MHHO is validated against ten recent rivals in its category. The experimental findings show that MHHO has a considerable advantage in terms of convergence speed (converging in less than ten iterations), subset size (removing 75% of the views; reducing feature size by 66%), and classification accuracy (approaching 100%). Furthermore, statistical analyses reveal that MHHO is statistically different from its competitors, bolstering its applicability. Finally, feature importance is evaluated, shedding light on the most anxiety-inducing characteristics. The likelihood of developing additional disorders (such as depression or stress) is also investigated.
Collapse
Affiliation(s)
- Ahmed Hamed
- Department of Computer Science, Faculty of Computers and Information, Damanhour University, 22511, Damanhour, Egypt.
| | - Marwa F Mohamed
- Department of Computer Science, Faculty of Computers and Informatics, Suez Canal University, 41522, Ismailia, Egypt
| |
Collapse
|
6
|
Chen Z, Xinxian L, Guo R, Zhang L, Dhahbi S, Bourouis S, Liu L, Wang X. Dispersed differential hunger games search for high dimensional gene data feature selection. Comput Biol Med 2023; 163:107197. [PMID: 37390761 DOI: 10.1016/j.compbiomed.2023.107197] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 06/08/2023] [Accepted: 06/19/2023] [Indexed: 07/02/2023]
Abstract
The realms of modern medicine and biology have provided substantial data sets of genetic roots that exhibit a high dimensionality. Clinical practice and associated processes are primarily dependent on data-driven decision-making. However, the high dimensionality of the data in these domains increases the complexity and size of processing. It can be challenging to determine representative genes while reducing the data's dimensionality. A successful gene selection will serve to mitigate the computing costs and refine the accuracy of the classification by eliminating superfluous or duplicative features. To address this concern, this research suggests a wrapper gene selection approach based on the HGS, combined with a dispersed foraging strategy and a differential evolution strategy, to form a new algorithm named DDHGS. Introducing the DDHGS algorithm to the global optimization field and its binary derivative bDDHGS to the feature selection problem is anticipated to refine the existing search balance between explorative and exploitative cores. We assess and confirm the efficacy of our proposed method, DDHGS, by comparing it with DE and HGS combined with a single strategy, seven classic algorithms, and ten advanced algorithms on the IEEE CEC 2017 test suite. Furthermore, to further evaluate DDHGS' performance, we compare it with several CEC winners and DE-based techniques of great efficiency on 23 popular optimization functions and the IEEE CEC 2014 benchmark test suite. The experimentation asserted that the bDDHGS approach was able to surpass bHGS and a variety of existing methods when applied to fourteen feature selection datasets from the UCI repository. The metrics measured--classification accuracy, the number of selected features, fitness scores, and execution time--all showed marked improvements with the use of bDDHGS. Considering all results, it can be concluded that bDDHGS is an optimal optimizer and an effective feature selection tool in the wrapper mode.
Collapse
Affiliation(s)
- Zhiqing Chen
- School of Intelligent Manufacturing, Wenzhou Polytechnic, Wenzhou, 325035, China.
| | - Li Xinxian
- Wenzhou Vocational College of Science and Technology, Wenzhou, 325006, China.
| | - Ran Guo
- Cyberspace Institute Advanced Technology, Guangzhou University, Guangzhou, 510006, China.
| | - Lejun Zhang
- Cyberspace Institute Advanced Technology, Guangzhou University, Guangzhou, 510006, China; College of Information Engineering, Yangzhou University, Yangzhou, 225127, China; Research and Development Center for E-Learning, Ministry of Education, Beijing, 100039, China.
| | - Sami Dhahbi
- Department of Computer Science, College of Science and Art at Mahayil, King Khalid University, Muhayil, Aseer, 62529, Saudi Arabia.
| | - Sami Bourouis
- Department of Information Technology, College of Computers and Information Technology, Taif University, P.O.Box 11099, Taif, 21944, Saudi Arabia.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Xianchuan Wang
- Information Technology Center, Wenzhou Medical University, Wenzhou, 325035, China.
| |
Collapse
|
7
|
Zhu J, Liu J, Chen Y, Xue X, Sun S. Binary Restructuring Particle Swarm Optimization and Its Application. Biomimetics (Basel) 2023; 8:266. [PMID: 37366861 DOI: 10.3390/biomimetics8020266] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 06/14/2023] [Accepted: 06/15/2023] [Indexed: 06/28/2023] Open
Abstract
Restructuring Particle Swarm Optimization (RPSO) algorithm has been developed as an intelligent approach based on the linear system theory of particle swarm optimization (PSO). It streamlines the flow of the PSO algorithm, specifically targeting continuous optimization problems. In order to adapt RPSO for solving discrete optimization problems, this paper proposes the binary Restructuring Particle Swarm Optimization (BRPSO) algorithm. Unlike other binary metaheuristic algorithms, BRPSO does not utilize the transfer function. The particle updating process in BRPSO relies solely on comparison results between values derived from the position updating formula and a random number. Additionally, a novel perturbation term is incorporated into the position updating formula of BRPSO. Notably, BRPSO requires fewer parameters and exhibits high exploration capability during the early stages. To evaluate the efficacy of BRPSO, comprehensive experiments are conducted by comparing it against four peer algorithms in the context of feature selection problems. The experimental results highlight the competitive nature of BRPSO in terms of both classification accuracy and the number of selected features.
Collapse
Affiliation(s)
- Jian Zhu
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China
| | - Jianhua Liu
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China
| | - Yuxiang Chen
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China
| | - Xingsi Xue
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China
| | - Shuihua Sun
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou 350118, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fuzhou 350118, China
| |
Collapse
|
8
|
Chen Z, Xuan P, Heidari AA, Liu L, Wu C, Chen H, Escorcia-Gutierrez J, Mansour RF. An artificial bee bare-bone hunger games search for global optimization and high-dimensional feature selection. iScience 2023; 26:106679. [PMID: 37216098 PMCID: PMC10193239 DOI: 10.1016/j.isci.2023.106679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/01/2023] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The domains of contemporary medicine and biology have generated substantial high-dimensional genetic data. Identifying representative genes and decreasing the dimensionality of the data can be challenging. The goal of gene selection is to minimize computing costs and enhance classification precision. Therefore, this article designs a new wrapper gene selection algorithm named artificial bee bare-bone hunger games search (ABHGS), which is the hunger games search (HGS) integrated with an artificial bee strategy and a Gaussian bare-bone structure to address this issue. To evaluate and validate the performance of our proposed method, ABHGS is compared to HGS and a single strategy embedded in HGS, six classic algorithms, and ten advanced algorithms on the CEC 2017 functions. The experimental results demonstrate that the bABHGS outperforms the original HGS. Compared to peers, it increases classification accuracy and decreases the number of selected features, indicating its actual engineering utility in spatial search and feature selection.
Collapse
Affiliation(s)
- Zhiqing Chen
- School of Intelligent Manufacturing, Wenzhou Polytechnic, Wenzhou 325035, China
| | - Ping Xuan
- Department of Computer Science, School of Engineering, Shantou University, Shantou 515063, China
| | - Ali Asghar Heidari
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Chengwen Wu
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Huiling Chen
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - José Escorcia-Gutierrez
- Department of Computational Science and Electronics, Universidad de la Costa, CUC, Barranquilla 080002, Colombia
| | - Romany F. Mansour
- Department of Mathematics, Faculty of Science, New Valley University, El-Kharga 72511, Egypt
| |
Collapse
|
9
|
Ma BJ, Pereira JLJ, Oliva D, Liu S, Kuo YH. Manta ray foraging optimizer-based image segmentation with a two-strategy enhancement. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2022.110247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
10
|
Houssein EH, Hosney ME, Mohamed WM, Ali AA, Younis EMG. Fuzzy-based hunger games search algorithm for global optimization and feature selection using medical data. Neural Comput Appl 2022; 35:5251-5275. [PMID: 36340595 PMCID: PMC9628476 DOI: 10.1007/s00521-022-07916-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022]
Abstract
Feature selection (FS) is one of the basic data preprocessing steps in data mining and machine learning. It is used to reduce feature size and increase model generalization. In addition to minimizing feature dimensionality, it also enhances classification accuracy and reduces model complexity, which are essential in several applications. Traditional methods for feature selection often fail in the optimal global solution due to the large search space. Many hybrid techniques have been proposed depending on merging several search strategies which have been used individually as a solution to the FS problem. This study proposes a modified hunger games search algorithm (mHGS), for solving optimization and FS problems. The main advantages of the proposed mHGS are to resolve the following drawbacks that have been raised in the original HGS; (1) avoiding the local search, (2) solving the problem of premature convergence, and (3) balancing between the exploitation and exploration phases. The mHGS has been evaluated by using the IEEE Congress on Evolutionary Computation 2020 (CEC'20) for optimization test and ten medical and chemical datasets. The data have dimensions up to 20000 features or more. The results of the proposed algorithm have been compared to a variety of well-known optimization methods, including improved multi-operator differential evolution algorithm (IMODE), gravitational search algorithm, grey wolf optimization, Harris Hawks optimization, whale optimization algorithm, slime mould algorithm and hunger search games search. The experimental results suggest that the proposed mHGS can generate effective search results without increasing the computational cost and improving the convergence speed. It has also improved the SVM classification performance.
Collapse
Affiliation(s)
- Essam H. Houssein
- Faculty of Computers and Information, Minia University, Minia, Egypt
| | - Mosa E. Hosney
- Faculty of Computers and Information, Luxor University, Luxor, Egypt
| | - Waleed M. Mohamed
- Faculty of Computers and Information, Minia University, Minia, Egypt
| | - Abdelmgeid A. Ali
- Faculty of Computers and Information, Minia University, Minia, Egypt
| | - Eman M. G. Younis
- Faculty of Computers and Information, Minia University, Minia, Egypt
| |
Collapse
|
11
|
Akinola OA, Agushaka JO, Ezugwu AE. Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems. PLoS One 2022; 17:e0274850. [PMID: 36201524 PMCID: PMC9536540 DOI: 10.1371/journal.pone.0274850] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/06/2022] [Indexed: 11/13/2022] Open
Abstract
Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model's accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO.
Collapse
Affiliation(s)
- Olatunji A. Akinola
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
| | - Jeffrey O. Agushaka
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
- Department of Computer Science, Federal University of Lafia, Lafia, Nasarawa State, Nigeria
| | - Absalom E. Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
| |
Collapse
|
12
|
Feature Selection Based on Adaptive Particle Swarm Optimization with Leadership Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1825341. [PMID: 36072739 PMCID: PMC9441366 DOI: 10.1155/2022/1825341] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 08/07/2022] [Accepted: 08/09/2022] [Indexed: 12/02/2022]
Abstract
With the rapid development of the Internet of Things (IoT), the curse of dimensionality becomes increasingly common. Feature selection (FS) is to eliminate irrelevant and redundant features in the datasets. Particle swarm optimization (PSO) is an efficient metaheuristic algorithm that has been successfully applied to obtain the optimal feature subset with essential information in an acceptable time. However, it is easy to fall into the local optima when dealing with high-dimensional datasets due to constant parameter values and insufficient population diversity. In the paper, an FS method is proposed by utilizing adaptive PSO with leadership learning (APSOLL). An adaptive updating strategy for parameters is used to replace the constant parameters, and the leadership learning strategy is utilized to provide valid population diversity. Experimental results on 10 UCI datasets show that APSOLL has better exploration and exploitation capabilities through comparison with PSO, grey wolf optimizer (GWO), Harris hawks optimization (HHO), flower pollination algorithm (FPA), salp swarm algorithm (SSA), linear PSO (LPSO), and hybrid PSO and differential evolution (HPSO-DE). Moreover, less than 8% of features in the original datasets are selected on average, and the feature subsets are more effective in most cases compared to those generated by 6 traditional FS methods (analysis of variance (ANOVA), Chi-Squared (CHI2), Pearson, Spearman, Kendall, and Mutual Information (MI)).
Collapse
|
13
|
Loss Aversion Order Strategy in Emergency Procurement during the COVID-19 Pandemic. SUSTAINABILITY 2022. [DOI: 10.3390/su14159119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The COVID-19 pandemic has had a serious impact on firms’ sourcing strategies. Since COVID-19 disrupted the supply chain, firms have had to make emergency purchases from other suppliers. In addition, emergency ordering is one of the most effective strategies to achieve sustainable operations because such a strategy can save inventory costs. We aim to address a retailer’s emergency procurement strategies during the COVID-19 pandemic. We use prospect theory and the newsvendor model to uncover the retailer’s inventory decisions. In our study, we find that retailers have the choice to order items before the selling period at the normal purchase price, and, if available, they can order them before the end of the selling period at the urgent purchase price. We perform a comparison of the optimal ordering policy and margins in this case with the conventional and loss aversion models. The influence of emergency procurement on the optimal order policy and margins is investigated as well. This paper contributes in theory that we innovatively capture the uncertainty of emergency sourcing, which is a feature that has never been considered in current research.
Collapse
|