1
|
Sun Z, Ai Z, Wang Z, Wang J, Gu X, Wang D, Lu H, Chen Y. Considering multi-scale built environment in modeling severity of traffic violations by elderly drivers: An interpretable machine learning framework. ACCIDENT; ANALYSIS AND PREVENTION 2024; 207:107740. [PMID: 39142041 DOI: 10.1016/j.aap.2024.107740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 07/14/2024] [Accepted: 08/04/2024] [Indexed: 08/16/2024]
Abstract
The causes of traffic violations by elderly drivers are different from those of other age groups. To reduce serious traffic violations that are more likely to cause serious traffic crashes, this study divided the severity of traffic violations into three levels (i.e., slight, ordinary, severe) based on point deduction, and explore the patterns of serious traffic violations (i.e., ordinary, severe) using multi-source data. This paper designed an interpretable machine learning framework, in which four popular machine learning models were enhanced and compared. Specifically, adaptive synthetic sampling method was applied to overcome the effects of imbalanced data and improve the prediction accuracy of minority classes (i.e., ordinary, severe); multi-objective feature selection based on NSGA-II was used to remove the redundant factors to increase the computational efficiency and make the patterns discovered by the explainer more effective; Bayesian hyperparameter optimization aimed to obtain more effective hyperparameters combination with fewer iterations and boost the model adaptability. Results show that the proposed interpretable machine learning framework can significantly improve and distinguish the performance of four popular machine learning models and two post-hoc interpretation methods. It is found that six of the top ten important factors belong to multi-scale built environment attributes. By comparing the results of feature contribution and interaction effects, some findings can be summarized: ordinary and severe traffic violations have some identical influencing factors and interactive effects; have the same influencing factors or the same combinations of influencing factors, but the values of the factors are different; have some unique influencing factors and unique combinations of influencing factors.
Collapse
Affiliation(s)
- Zhiyuan Sun
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100124, China
| | - Zhoumeng Ai
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100124, China
| | - Zehao Wang
- Department of Civil and Environmental Engineering, University of North Carolina at Charlotte, EPIC Building, Room 3366, 9201 University City Boulevard, Charlotte, NC 28223-0001, United States.
| | - Jianyu Wang
- Beijing Key Laboratory of General Aviation Technology, Beijing University of Civil Engineering and Architecture, Beijing 102616, China
| | - Xin Gu
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100124, China
| | - Duo Wang
- Department of Mechanical and Traffic Engineering, Ordos Institute of Technology, Ordos 017010, China
| | - Huapu Lu
- Institute of Transportation Engineering, Tsinghua University, Beijing 100084, China
| | - Yanyan Chen
- Beijing Key Laboratory of Traffic Engineering, Beijing University of Technology, Beijing 100124, China
| |
Collapse
|
2
|
Peng Y, Liu Y, Liu Y, Wang J. Comprehensive data optimization and risk prediction framework: machine learning methods for inflammatory bowel disease prediction based on the human gut microbiome data. Front Microbiol 2024; 15:1483084. [PMID: 39411443 PMCID: PMC11474110 DOI: 10.3389/fmicb.2024.1483084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Accepted: 09/16/2024] [Indexed: 10/19/2024] Open
Abstract
Over the past decade, the prevalence of inflammatory bowel disease (IBD) has significantly increased, making early detection crucial for improving patient survival rates. Medical research suggests that changes in the human gut microbiome are closely linked to IBD onset, playing a critical role in its prediction. However, the current gut microbiome data often exhibit missing values and high dimensionality, posing challenges to the accuracy of predictive algorithms. To address these issues, we proposed the comprehensive data optimization and risk prediction framework (CDORPF), an ensemble learning framework designed to predict IBD risk based on the human gut microbiome, aiding early diagnosis. The framework comprised two main components: data optimization and risk prediction. The data optimization module first employed triple optimization imputation (TOI) to impute missing data while preserving the biological characteristics of the microbiome. It then utilized importance-weighted variational autoencoder (IWVAE) to reduce redundant information from the high-dimensional microbiome data. This process resulted in a complete, low-dimensional representation of the data, laying the foundation for improved algorithm efficiency and accuracy. In the risk prediction module, the optimized data was classified using a random forest (RF) model, and hyperparameters were globally optimized using improved aquila optimizer (IAO), which incorporated multiple strategies. Experimental results on IBD-related gut microbiome datasets showed that the proposed framework achieved classification accuracy, recall, and F1 scores exceeding 0.9, outperforming comparison models and serving as a valuable tool for predicting IBD onset risk.
Collapse
Affiliation(s)
- Yan Peng
- School of Management, Capital Normal University, Beijing, China
| | - Yue Liu
- School of Management, Capital Normal University, Beijing, China
| | - Yifei Liu
- School of Mathematical Sciences, Capital Normal University, Beijing, China
| | - Jie Wang
- School of Management, Capital Normal University, Beijing, China
| |
Collapse
|
3
|
Xu X, Zhao S, Gong L, Zuo S. A novel contact optimization algorithm for endomicroscopic surface scanning. Int J Comput Assist Radiol Surg 2024; 19:2031-2041. [PMID: 38970745 DOI: 10.1007/s11548-024-03223-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 06/24/2024] [Indexed: 07/08/2024]
Abstract
PURPOSE Probe-based confocal laser endomicroscopy (pCLE) offers real-time, cell-level imaging and holds promise for early cancer diagnosis. However, a large area surface scanning for image acquisition is needed to overcome the limitation of field-of-view. Obtaining high-quality images during scanning requires maintaining a stable contact distance between the tissue and probe. This work presents a novel contact optimization algorithm to acquire high-quality pCLE images. METHODS The contact optimization algorithm, based on swarm intelligence of whale optimization algorithm, is designed to optimize the probe position, according to the quality of the image acquired by probe. An accurate image quality assessment of total co-occurrence entropy is introduced to evaluate the pCLE image quality. The algorithm aims to maintain a consistent probe-tissue contact, resulting in high-quality images acquisition. RESULTS Scanning experiments on sponge, ex vivo swine skin tissue and stomach tissue demonstrate the effectiveness of the contact optimization algorithm. Scanning results of the sponge with three different trajectories (spiral trajectory, circle trajectory, and raster trajectory) reveal high-quality mosaics with clear details in every part of the image and no blurred sections. CONCLUSION The contact optimization algorithm successfully identifies the optimal distance between probe and tissue, improving the quality of pCLE images. Experimental results confirm the high potential of this method in endomicroscopic surface scanning.
Collapse
Affiliation(s)
- Xingfeng Xu
- Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China
| | - Shengzhe Zhao
- Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China
| | - Lun Gong
- Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China
| | - Siyang Zuo
- Key Laboratory of Mechanism Theory and Equipment Design of Ministry of Education, Tianjin University, Tianjin, 300072, China.
| |
Collapse
|
4
|
Zamani J, Jafadideh AT. Predicting the Conversion from Mild Cognitive Impairment to Alzheimer's Disease Using Graph Frequency Bands and Functional Connectivity-Based Features. RESEARCH SQUARE 2024:rs.3.rs-4549428. [PMID: 38947050 PMCID: PMC11213162 DOI: 10.21203/rs.3.rs-4549428/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Accurate prediction of the progression from mild cognitive impairment (MCI) to Alzheimer's disease (AD) is crucial for disease management. Machine learning techniques have demonstrated success in classifying AD and MCI cases, particularly with the use of resting-state functional magnetic resonance imaging (rs-fMRI) data.This study utilized three years of rs-fMRI data from the ADNI, involving 142 patients with stable MCI (sMCI) and 136 with progressive MCI (pMCI). Graph signal processing was applied to filter rs-fMRI data into low, middle, and high frequency bands. Connectivity-based features were derived from both filtered and unfiltered data, resulting in a comprehensive set of 100 features, including global graph metrics, minimum spanning tree (MST) metrics, triadic interaction metrics, hub tendency metrics, and the number of links. Feature selection was enhanced using particle swarm optimization (PSO) and simulated annealing (SA). A support vector machine (SVM) with a radial basis function (RBF) kernel and a 10-fold cross-validation setup were employed for classification. The proposed approach demonstrated superior performance, achieving optimal accuracy with minimal feature utilization. When PSO selected five features, SVM exhibited accuracy, specificity, and sensitivity rates of 77%, 70%, and 83%, respectively. The identified features were as follows: (Mean of clustering coefficient, Mean of strength)/Radius/(Mean Eccentricity, and Modularity) from low/middle/high frequency bands of graph. The study highlights the efficacy of the proposed framework in identifying individuals at risk of AD development using a parsimonious feature set. This approach holds promise for advancing the precision of MCI to AD progression prediction, aiding in early diagnosis and intervention strategies.
Collapse
Affiliation(s)
- Jafar Zamani
- Department of Psychiatry and Behavioral Sciences, Stanford University, California, USA
| | | |
Collapse
|
5
|
Guo H, Wang J, Sun J, Mao X. Multi-objective green vehicle scheduling problem considering time window and emission factors in ship block transportation. Sci Rep 2024; 14:10796. [PMID: 38734739 PMCID: PMC11088692 DOI: 10.1038/s41598-024-61578-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 05/07/2024] [Indexed: 05/13/2024] Open
Abstract
Logistics distribution is one of the main sources of carbon dioxide emissions at present, and there are also such distribution problems in the shipbuilding process. With the increasing attention paid to environmental problems, how to effectively reduce the energy consumption of block transportation and improve the utilization rate of resources in the factory is the key problem that China's shipbuilding industry needs to solve at present. This article considers the time windows for block transportation tasks, as well as the self-loading constraints of different types of flat cars, and establishes an optimization model that minimizes the empty transport time and energy consumption of the flat cars as the optimization objective. Then, an Improved Genetic Whale Optimization Algorithm is designed, which combines the cross and mutation ideas of genetic algorithms and proposes a whale individual position updating mechanism under a mixed strategy. Furthermore, the performance and computational efficiency of the algorithm are verified through comparative analysis with other classical optimization algorithms on standard test examples. Finally, the shipyard's block transportation example proves that the energy-saving ship block transportation scheduling method can effectively improve the efficiency of shipbuilding enterprise's block transportation and reduce the energy consumption in the block transportation process. It proves the engineering practicality of the green dispatching method proposed in this paper, which can further provide a decision-making method for shipyard managers.
Collapse
Affiliation(s)
- Hui Guo
- School of Naval Architecture & Ocean Engineering, Jiangsu University of Science and Technology, Room 311, Science and Technology Building, Zhenjiang, Jiangsu Province, People's Republic of China
| | - Jucheng Wang
- School of Naval Architecture & Ocean Engineering, Jiangsu University of Science and Technology, Room 311, Science and Technology Building, Zhenjiang, Jiangsu Province, People's Republic of China
- Jiangsu Modern Shipbuilding Technology Co., Ltd, Zhenjiang, 212100, China
| | - Jing Sun
- Jiangsu Modern Shipbuilding Technology Co., Ltd, Zhenjiang, 212100, China
| | - Xuezhang Mao
- School of Naval Architecture & Ocean Engineering, Jiangsu University of Science and Technology, Room 311, Science and Technology Building, Zhenjiang, Jiangsu Province, People's Republic of China.
| |
Collapse
|
6
|
Leiva D, Ramos-Tapia B, Crawford B, Soto R, Cisternas-Caneo F. A Novel Approach to Combinatorial Problems: Binary Growth Optimizer Algorithm. Biomimetics (Basel) 2024; 9:283. [PMID: 38786493 PMCID: PMC11117713 DOI: 10.3390/biomimetics9050283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 04/30/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024] Open
Abstract
The set-covering problem aims to find the smallest possible set of subsets that cover all the elements of a larger set. The difficulty of solving the set-covering problem increases as the number of elements and sets grows, making it a complex problem for which traditional integer programming solutions may become inefficient in real-life instances. Given this complexity, various metaheuristics have been successfully applied to solve the set-covering problem and related issues. This study introduces, implements, and analyzes a novel metaheuristic inspired by the well-established Growth Optimizer algorithm. Drawing insights from human behavioral patterns, this approach has shown promise in optimizing complex problems in continuous domains, where experimental results demonstrate the effectiveness and competitiveness of the metaheuristic compared to other strategies. The Growth Optimizer algorithm is modified and adapted to the realm of binary optimization for solving the set-covering problem, resulting in the creation of the Binary Growth Optimizer algorithm. Upon the implementation and analysis of its outcomes, the findings illustrate its capability to achieve competitive and efficient solutions in terms of resolution time and result quality.
Collapse
Affiliation(s)
| | | | - Broderick Crawford
- Escuela de Ingeniería Informática, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2241, Valparaíso 2362807, Chile; (D.L.); (B.R.-T.); (R.S.); (F.C.-C.)
| | | | | |
Collapse
|
7
|
Li M, Cao R, Zhao Y, Li Y, Deng S. Population characteristic exploitation-based multi-orientation multi-objective gene selection for microarray data classification. Comput Biol Med 2024; 170:108089. [PMID: 38330824 DOI: 10.1016/j.compbiomed.2024.108089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Revised: 01/23/2024] [Accepted: 01/27/2024] [Indexed: 02/10/2024]
Abstract
Gene selection is a process of selecting discriminative genes from microarray data that helps to diagnose and classify cancer samples effectively. Swarm intelligence evolution-based gene selection algorithms can never circumvent the problem that the population is prone to local optima in the process of gene selection. To tackle this challenge, previous research has focused primarily on two aspects: mitigating premature convergence to local optima and escaping from local optima. In contrast to these strategies, this paper introduces a novel perspective by adopting reverse thinking, where the issue of local optima is seen as an opportunity rather than an obstacle. Building on this foundation, we propose MOMOGS-PCE, a novel gene selection approach that effectively exploits the advantageous characteristics of populations trapped in local optima to uncover global optimal solutions. Specifically, MOMOGS-PCE employs a novel population initialization strategy, which involves the initialization of multiple populations that explore diverse orientations to foster distinct population characteristics. The subsequent step involved the utilization of an enhanced NSGA-II algorithm to amplify the advantageous characteristics exhibited by the population. Finally, a novel exchange strategy is proposed to facilitate the transfer of characteristics between populations that have reached near maturity in evolution, thereby promoting further population evolution and enhancing the search for more optimal gene subsets. The experimental results demonstrated that MOMOGS-PCE exhibited significant advantages in comprehensive indicators compared with six competitive multi-objective gene selection algorithms. It is confirmed that the "reverse-thinking" approach not only avoids local optima but also leverages it to uncover superior gene subsets for cancer diagnosis.
Collapse
Affiliation(s)
- Min Li
- School of Information Engineering, Nanchang Institute of Technology, No. 289 Tianxiang Road, Nanchang, Jiangxi, PR China.
| | - Rutun Cao
- School of Information Engineering, Nanchang Institute of Technology, No. 289 Tianxiang Road, Nanchang, Jiangxi, PR China
| | - Yangfan Zhao
- School of Information Engineering, Nanchang Institute of Technology, No. 289 Tianxiang Road, Nanchang, Jiangxi, PR China
| | - Yulong Li
- School of Information Engineering, Nanchang Institute of Technology, No. 289 Tianxiang Road, Nanchang, Jiangxi, PR China
| | - Shaobo Deng
- School of Information Engineering, Nanchang Institute of Technology, No. 289 Tianxiang Road, Nanchang, Jiangxi, PR China
| |
Collapse
|
8
|
Thirugnanasambandam K, Murugan J, Ramalingam R, Rashid M, Raghav RS, Kim TH, Sampedro GA, Abisado M. Optimizing multimodal feature selection using binary reinforced cuckoo search algorithm for improved classification performance. PeerJ Comput Sci 2024; 10:e1816. [PMID: 38435570 PMCID: PMC10909206 DOI: 10.7717/peerj-cs.1816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 12/19/2023] [Indexed: 03/05/2024]
Abstract
Background Feature selection is a vital process in data mining and machine learning approaches by determining which characteristics, out of the available features, are most appropriate for categorization or knowledge representation. However, the challenging task is finding a chosen subset of elements from a given set of features to represent or extract knowledge from raw data. The number of features selected should be appropriately limited and substantial to prevent results from deviating from accuracy. When it comes to the computational time cost, feature selection is crucial. A feature selection model is put out in this study to address the feature selection issue concerning multimodal. Methods In this work, a novel optimization algorithm inspired by cuckoo birds' behavior is the Binary Reinforced Cuckoo Search Algorithm (BRCSA). In addition, we applied the proposed BRCSA-based classification approach for multimodal feature selection. The proposed method aims to select the most relevant features from multiple modalities to improve the model's classification performance. The BRCSA algorithm is used to optimize the feature selection process, and a binary encoding scheme is employed to represent the selected features. Results The experiments are conducted on several benchmark datasets, and the results are compared with other state-of-the-art feature selection methods to evaluate the effectiveness of the proposed method. The experimental results demonstrate that the proposed BRCSA-based approach outperforms other methods in terms of classification accuracy, indicating its potential applicability in real-world applications. In specific on accuracy of classification (average), the proposed algorithm outperforms the existing methods such as DGUFS with 32%, MBOICO with 24%, MBOLF with 29%, WOASAT 22%, BGSA with 28%, HGSA 39%, FS-BGSK 37%, FS-pBGSK 42%, and BSSA 40%.
Collapse
Affiliation(s)
- Kalaipriyan Thirugnanasambandam
- Centre for Smart Grid Technologies, School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - Jayalakshmi Murugan
- Department of Computer Science and Engineering, Kalasalingam Academy of Research and Education, Krishnankoil, India
| | - Rajakumar Ramalingam
- Centre for Automation, School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, India
| | - Mamoon Rashid
- Department of Computer Engineering, Faculty of Science and Technology, Vishwakarma University, Pune, India
| | - R. S. Raghav
- School of Computing, SASTRA Deemed University, Villupuram, India
| | - Tai-hoon Kim
- School of Electrical and Computer Engineering, Chonnam National University, Daehak-7, Republic of Korea
| | - Gabriel Avelino Sampedro
- Faculty of Information and Communication Studies, University of the Philippines Open University, Los Baños, Philippines
- Center for Computational Imaging and Visual Innovations, De La Salle University, Malate, Philippines
| | - Mideth Abisado
- College of Computing and Information Technologies, National University, Manila, Philippines
| |
Collapse
|
9
|
Liu G, Guo Z, Liu W, Jiang F, Fu E. A feature selection method based on the Golden Jackal-Grey Wolf Hybrid Optimization Algorithm. PLoS One 2024; 19:e0295579. [PMID: 38165924 PMCID: PMC10760777 DOI: 10.1371/journal.pone.0295579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 11/20/2023] [Indexed: 01/04/2024] Open
Abstract
This paper proposes a feature selection method based on a hybrid optimization algorithm that combines the Golden Jackal Optimization (GJO) and Grey Wolf Optimizer (GWO). The primary objective of this method is to create an effective data dimensionality reduction technique for eliminating redundant, irrelevant, and noisy features within high-dimensional datasets. Drawing inspiration from the Chinese idiom "Chai Lang Hu Bao," hybrid algorithm mechanisms, and cooperative behaviors observed in natural animal populations, we amalgamate the GWO algorithm, the Lagrange interpolation method, and the GJO algorithm to propose the multi-strategy fusion GJO-GWO algorithm. In Case 1, the GJO-GWO algorithm addressed eight complex benchmark functions. In Case 2, GJO-GWO was utilized to tackle ten feature selection problems. Experimental results consistently demonstrate that under identical experimental conditions, whether solving complex benchmark functions or addressing feature selection problems, GJO-GWO exhibits smaller means, lower standard deviations, higher classification accuracy, and reduced execution times. These findings affirm the superior optimization performance, classification accuracy, and stability of the GJO-GWO algorithm.
Collapse
Affiliation(s)
- Guangwei Liu
- College of Mining, Liaoning Technical University, Fuxin, Liaoning, China
| | - Zhiqing Guo
- College of Mining, Liaoning Technical University, Fuxin, Liaoning, China
| | - Wei Liu
- College of Science, Liaoning Technical University, Fuxin, Liaoning, China
| | - Feng Jiang
- College of Science, Liaoning Technical University, Fuxin, Liaoning, China
| | - Ensan Fu
- College of Mining, Liaoning Technical University, Fuxin, Liaoning, China
| |
Collapse
|
10
|
Betshrine Rachel R, Khanna Nehemiah H, Singh VK, Manoharan RMV. Diagnosis of Covid-19 from CT slices using Whale Optimization Algorithm, Support Vector Machine and Multi-Layer Perceptron. JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY 2024; 32:253-269. [PMID: 38189732 DOI: 10.3233/xst-230196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
BACKGROUND The coronavirus disease 2019 is a serious and highly contagious disease caused by infection with a newly discovered virus, named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). OBJECTIVE A Computer Aided Diagnosis (CAD) system to assist physicians to diagnose Covid-19 from chest Computed Tomography (CT) slices is modelled and experimented. METHODS The lung tissues are segmented using Otsu's thresholding method. The Covid-19 lesions have been annotated as the Regions of Interest (ROIs), which is followed by texture and shape extraction. The obtained features are stored as feature vectors and split into 80:20 train and test sets. To choose the optimal features, Whale Optimization Algorithm (WOA) with Support Vector Machine (SVM) classifier's accuracy is employed. A Multi-Layer Perceptron (MLP) classifier is trained to perform classification with the selected features. RESULTS Comparative experimentations of the proposed system with existing eight benchmark Machine Learning classifiers using real-time dataset demonstrates that the proposed system with 88.94% accuracy outperforms the benchmark classifier's results. Statistical analysis namely, Friedman test, Mann Whitney U test and Kendall's Rank Correlation Coefficient Test has been performed which indicates that the proposed method has a significant impact on the novel dataset considered. CONCLUSION The MLP classifier's accuracy without feature selection yielded 80.40%, whereas with feature selection using WOA, it yielded 88.94%.
Collapse
Affiliation(s)
- R Betshrine Rachel
- Ramanujan Computing Centre, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu, India
| | - H Khanna Nehemiah
- Ramanujan Computing Centre, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu, India
| | - Vaibhav Kumar Singh
- Alumna, Department of Information Science and Technology, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu, India
| | - Rebecca Mercy Victoria Manoharan
- Alumna, Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu, India
| |
Collapse
|
11
|
Singh LK, Khanna M, Garg H, Singh R. Efficient feature selection based novel clinical decision support system for glaucoma prediction from retinal fundus images. Med Eng Phys 2024; 123:104077. [PMID: 38365344 DOI: 10.1016/j.medengphy.2023.104077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 10/29/2023] [Accepted: 12/03/2023] [Indexed: 02/18/2024]
Abstract
The process of feature selection (FS) is vital aspect of machine learning (ML) model's performance enhancement where the objective is the selection of the most influential subset of features. This paper suggests the Gravitational search optimization algorithm (GSOA) technique for metaheuristic-based FS. Glaucoma disease is selected as the subject of investigation as this disease is spreading worldwide at a very fast pace; 111 million instances of glaucoma are expected by 2040, up from 64 million in 2015. It causes widespread vision impairment. Optic nerve fibres can be degraded and cannot be replaced later in this disease. As a starting point, the retinal fundus images of glaucoma infected persons and healthy persons are used, and 36 features were retrieved from these images of public benchmark datasets and private dataset. Six ML models are trained for classification on the basis of the GSOA's returned subset of features. The suggested FS technique enhances classification performance with selection of most influential features. The eight statistical performance evaluating parameters along with execution time are calculated. The training and testing have been performed using a split approach (70:30), 5-fold cross validation (CV), as well as 10-fold CV. The suggested approach achieved 95.36 % accuracy. Due to its auspicious performance, doctors might use the suggested method to receive a second opinion, which would also help overburdened skilled medical practitioners and save patients from vision loss.
Collapse
Affiliation(s)
- Law Kumar Singh
- Department of Computer Engineering & Applications, GLA University, Mathura, India.
| | - Munish Khanna
- School of Computing Science and Engineering, Galgotias University, Gautam Buddh Nagar, Greater Noida, Uttar Pradesh 226001, India
| | - Hitendra Garg
- Department of Computer Engineering & Applications, GLA University, Mathura, India
| | - Rekha Singh
- Department of Physics, Uttar Pradesh Rajarshi Tandon Open University, Prayagraj, Uttar Pradesh, India
| |
Collapse
|
12
|
Habib Z, Mughal MA, Khan MA, Shabaz M. WiFOG: Integrating deep learning and hybrid feature selection for accurate freezing of gait detection. ALEXANDRIA ENGINEERING JOURNAL 2024; 86:481-493. [DOI: 10.1016/j.aej.2023.11.075] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
|
13
|
Tian K, Ren Y, Chang Y, Chen Z, Yang X. Influence of respondents' Differentiation of subjective response on water knowledge stock test scale: Evaluation based on two-parameter-multidimensional IRT model. ENVIRONMENTAL RESEARCH 2023; 238:117181. [PMID: 37742755 DOI: 10.1016/j.envres.2023.117181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 09/07/2023] [Accepted: 09/20/2023] [Indexed: 09/26/2023]
Abstract
Insufficient awareness of water issues is a crucial bottleneck restricting the sustainable utilization of water resources. To accurately measure citizens' water knowledge stock and overcome the differences between scales and respondents' characteristic levels on test results, the research focuses on developing and evaluating water knowledge stock test scales. The mechanism for identifying indicators is designed based on the grounded theory, and as a result, the water knowledge stock test indicator system is derived. The data was collected by the form of survey questionnaire developed with the test indicator system. A two-parameter multidimensional item response theoretical model is constructed based on item parameter estimation, data model fitting, and item information function. The survey data and optimization model are used to optimize the water knowledge stock test scale and verify the fitting degree with the characteristics of the respondents. The test information function and standard error function indicate that the scale is most informative for individuals with characteristic levels ranging from -2 to 3, resulting in a highly reliable test effect. The research has established a measurement indicator system, methodology, and presented results that serve as a foundation for measuring the stock of water knowledge.
Collapse
Affiliation(s)
- Kang Tian
- College of Information and Management Science, Henan Agricultural University, No. 218, Ping'an Avenue, Zhengzhou, 450046, PR China; Citizen's Water Literacy Research Center, North China University of Water Resources and Electric Power, No.136, Jinshui East Road, Zhengzhou, 450046, PR China.
| | - Yunlong Ren
- School of Engineering, University of Manchester, Oxford Road, Manchester. M13 9PL, UK
| | - Yuanbo Chang
- Trade Union Committee, Henan University of Economics and Law, No.180, Jinshui East Road, Zhengzhou, 450046, PR China
| | - Zhen Chen
- College of Information and Management Science, Henan Agricultural University, No. 218, Ping'an Avenue, Zhengzhou, 450046, PR China
| | - Xue Yang
- School of Management and Economics, North China University of Water Resources and Electric Power, No.136, Jinshui East Road, Zhengzhou, 450046, PR China.
| |
Collapse
|
14
|
Yogarajan G, Alsubaie N, Rajasekaran G, Revathi T, Alqahtani MS, Abbas M, Alshahrani MM, Soufiene BO. EEG-based epileptic seizure detection using binary dragonfly algorithm and deep neural network. Sci Rep 2023; 13:17710. [PMID: 37853025 PMCID: PMC10584945 DOI: 10.1038/s41598-023-44318-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Accepted: 10/06/2023] [Indexed: 10/20/2023] Open
Abstract
Electroencephalogram (EEG) is one of the most common methods used for seizure detection as it records the electrical activity of the brain. Symmetry and asymmetry of EEG signals can be used as indicators of epileptic seizures. Normally, EEG signals are symmetrical in nature, with similar patterns on both sides of the brain. However, during a seizure, there may be a sudden increase in the electrical activity in one hemisphere of the brain, causing asymmetry in the EEG signal. In patients with epilepsy, interictal EEG may show asymmetric spikes or sharp waves, indicating the presence of epileptic activity. Therefore, the detection of symmetry/asymmetry in EEG signals can be used as a useful tool in the diagnosis and management of epilepsy. However, it should be noted that EEG findings should always be interpreted in conjunction with the patient's clinical history and other diagnostic tests. In this paper, we propose an EEG-based improved automatic seizure detection system using a Deep neural network (DNN) and Binary dragonfly algorithm (BDFA). The DNN model learns the characteristics of the EEG signals through nine different statistical and Hjorth parameters extracted from various levels of decomposed signals obtained by using the Stationary Wavelet Transform. Next, the extracted features were reduced using the BDFA which helps to train DNN faster and improve its performance. The results show that the extracted features help to differentiate the normal, interictal, and ictal signals effectively with 100% accuracy, sensitivity, specificity, and F1 score with a 13% selected feature subset when compared to the existing approaches.
Collapse
Affiliation(s)
- G Yogarajan
- Department of Information Technology, Mepco Schlenk Engineering College, Sivakasi, Tamil Nadu, 626005, India
| | - Najah Alsubaie
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University (PNU), P.O. Box 84428, 11671, Riyadh, Saudi Arabia
| | - G Rajasekaran
- Department of Information Technology, Mepco Schlenk Engineering College, Sivakasi, Tamil Nadu, 626005, India
| | - T Revathi
- Department of Information Technology, Mepco Schlenk Engineering College, Sivakasi, Tamil Nadu, 626005, India
| | - Mohammed S Alqahtani
- Radiological Sciences Department, College of Applied Medical Sciences, King Khalid University, 61421, Abha, Saudi Arabia
- BioImaging Unit, Space Research Centre, University of Leicester, Michael Atiyah Building, Leicester, LE1 7RH, UK
| | - Mohamed Abbas
- Electrical Engineering Department, College of Engineering, King Khalid University, 61421, Abha, Saudi Arabia
| | | | - Ben Othman Soufiene
- PRINCE Laboratory Research, ISITcom, Hammam Sousse, University of Sousse, Sousse, Tunisia.
| |
Collapse
|
15
|
Hu T, Yang H, Ni W. A framework for intracranial aneurysm detection and rupture analysis on DSA. J Clin Neurosci 2023; 115:101-107. [PMID: 37542820 DOI: 10.1016/j.jocn.2023.07.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2023] [Revised: 07/10/2023] [Accepted: 07/27/2023] [Indexed: 08/07/2023]
Abstract
BACKGROUND Intracranial aneurysm is a severe cerebrovascular disease that can result in subarachnoid hemorrhage (SAH), leading to high incidence and mortality rates. Computer-aided detection of aneurysms can assist doctors in enhancing diagnostic accuracy. The analysis of aneurysm imaging holds considerable predictive value for aneurysm rupture. This paper presents a method for the detection of aneurysms and analysis of ruptures using digital subtraction angiography (DSA). METHODS A total of 263 aneurysms were analyzed, with 125 being ruptured and 138 being unruptured. Firstly, a filter based on the eigenvalues of the Hessian matrix was proposed for aneurysm detection. The filter's detection parameters can be automatically obtained through Bayesian optimization. Aneurysms were detected based on their structure and the response of the filter. Secondly, considering the variations in blood flow and morphology among aneurysms in DSA, intensity, texture, and blood perfusion features were extracted from the ruptured aneurysms and unruptured aneurysms. Subsequently, a sparse representation (SR) method was utilized to classify unruptured and ruptured aneurysms. RESULTS The experimental results for aneurysm detection showed that the F1-score was 94.1%. In the classification of ruptured and unruptured aneurysms, the accuracy, sensitivity, specificity, and area under curve (AUC) were 96.1%, 94.4%, 97.5%, and 0.982, respectively. CONCLUSION This paper presents a scheme combining an aneurysm detection filter and machine learning, offering a reliable solution for the diagnosis and prediction of aneurysm rupture.
Collapse
Affiliation(s)
- Tao Hu
- Department of Electronic Engineering, Fudan University, Shanghai, China.
| | - Heng Yang
- Department of Neurosurgery, Huashan Hospital, Fudan University, Shanghai, China
| | - Wei Ni
- Department of Neurosurgery, Huashan Hospital, Fudan University, Shanghai, China
| |
Collapse
|
16
|
Chen Z, Xinxian L, Guo R, Zhang L, Dhahbi S, Bourouis S, Liu L, Wang X. Dispersed differential hunger games search for high dimensional gene data feature selection. Comput Biol Med 2023; 163:107197. [PMID: 37390761 DOI: 10.1016/j.compbiomed.2023.107197] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 06/08/2023] [Accepted: 06/19/2023] [Indexed: 07/02/2023]
Abstract
The realms of modern medicine and biology have provided substantial data sets of genetic roots that exhibit a high dimensionality. Clinical practice and associated processes are primarily dependent on data-driven decision-making. However, the high dimensionality of the data in these domains increases the complexity and size of processing. It can be challenging to determine representative genes while reducing the data's dimensionality. A successful gene selection will serve to mitigate the computing costs and refine the accuracy of the classification by eliminating superfluous or duplicative features. To address this concern, this research suggests a wrapper gene selection approach based on the HGS, combined with a dispersed foraging strategy and a differential evolution strategy, to form a new algorithm named DDHGS. Introducing the DDHGS algorithm to the global optimization field and its binary derivative bDDHGS to the feature selection problem is anticipated to refine the existing search balance between explorative and exploitative cores. We assess and confirm the efficacy of our proposed method, DDHGS, by comparing it with DE and HGS combined with a single strategy, seven classic algorithms, and ten advanced algorithms on the IEEE CEC 2017 test suite. Furthermore, to further evaluate DDHGS' performance, we compare it with several CEC winners and DE-based techniques of great efficiency on 23 popular optimization functions and the IEEE CEC 2014 benchmark test suite. The experimentation asserted that the bDDHGS approach was able to surpass bHGS and a variety of existing methods when applied to fourteen feature selection datasets from the UCI repository. The metrics measured--classification accuracy, the number of selected features, fitness scores, and execution time--all showed marked improvements with the use of bDDHGS. Considering all results, it can be concluded that bDDHGS is an optimal optimizer and an effective feature selection tool in the wrapper mode.
Collapse
Affiliation(s)
- Zhiqing Chen
- School of Intelligent Manufacturing, Wenzhou Polytechnic, Wenzhou, 325035, China.
| | - Li Xinxian
- Wenzhou Vocational College of Science and Technology, Wenzhou, 325006, China.
| | - Ran Guo
- Cyberspace Institute Advanced Technology, Guangzhou University, Guangzhou, 510006, China.
| | - Lejun Zhang
- Cyberspace Institute Advanced Technology, Guangzhou University, Guangzhou, 510006, China; College of Information Engineering, Yangzhou University, Yangzhou, 225127, China; Research and Development Center for E-Learning, Ministry of Education, Beijing, 100039, China.
| | - Sami Dhahbi
- Department of Computer Science, College of Science and Art at Mahayil, King Khalid University, Muhayil, Aseer, 62529, Saudi Arabia.
| | - Sami Bourouis
- Department of Information Technology, College of Computers and Information Technology, Taif University, P.O.Box 11099, Taif, 21944, Saudi Arabia.
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Xianchuan Wang
- Information Technology Center, Wenzhou Medical University, Wenzhou, 325035, China.
| |
Collapse
|
17
|
Lemus-Romani J, Crawford B, Cisternas-Caneo F, Soto R, Becerra-Rozas M. Binarization of Metaheuristics: Is the Transfer Function Really Important? Biomimetics (Basel) 2023; 8:400. [PMID: 37754151 PMCID: PMC10526273 DOI: 10.3390/biomimetics8050400] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 08/24/2023] [Accepted: 08/25/2023] [Indexed: 09/28/2023] Open
Abstract
In this work, an approach is proposed to solve binary combinatorial problems using continuous metaheuristics. It focuses on the importance of binarization in the optimization process, as it can have a significant impact on the performance of the algorithm. Different binarization schemes are presented and a set of actions, which combine different transfer functions and binarization rules, under a selector based on reinforcement learning is proposed. The experimental results show that the binarization rules have a greater impact than transfer functions on the performance of the algorithms and that some sets of actions are statistically better than others. In particular, it was found that sets that incorporate the elite or elite roulette binarization rule are the best. Furthermore, exploration and exploitation were analyzed through percentage graphs and a statistical test was performed to determine the best set of actions. Overall, this work provides a practical approach for the selection of binarization schemes in binary combinatorial problems and offers guidance for future research in this field.
Collapse
Affiliation(s)
- José Lemus-Romani
- Escuela de Construcción Civil, Pontificia Universidad Católica de Chile, Avenida Vicuña Mackenna 4860, Macul, Santiago 7820436, Chile
| | - Broderick Crawford
- Escuela de Ingeniería Informática, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2241, Valparaíso 2362807, Chile; (F.C.-C.); (R.S.); (M.B.-R.)
| | - Felipe Cisternas-Caneo
- Escuela de Ingeniería Informática, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2241, Valparaíso 2362807, Chile; (F.C.-C.); (R.S.); (M.B.-R.)
| | - Ricardo Soto
- Escuela de Ingeniería Informática, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2241, Valparaíso 2362807, Chile; (F.C.-C.); (R.S.); (M.B.-R.)
| | - Marcelo Becerra-Rozas
- Escuela de Ingeniería Informática, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2241, Valparaíso 2362807, Chile; (F.C.-C.); (R.S.); (M.B.-R.)
| |
Collapse
|
18
|
Alweshah M, Aldabbas Y, Abu-Salih B, Oqeil S, Hasan HS, Alkhalaileh S, Kassaymeh S. Hybrid black widow optimization with iterated greedy algorithm for gene selection problems. Heliyon 2023; 9:e20133. [PMID: 37809602 PMCID: PMC10559925 DOI: 10.1016/j.heliyon.2023.e20133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 09/03/2023] [Accepted: 09/12/2023] [Indexed: 10/10/2023] Open
Abstract
Gene Selection (GS) is a strategy method targeted at reducing redundancy, limited expressiveness, and low informativeness in gene expression datasets obtained by DNA Microarray technology. These datasets contain a plethora of diverse and high-dimensional samples and genes, with a significant discrepancy in the number of samples and genes present. The complexities of GS are especially noticeable in the context of microarray expression data analysis, owing to the inherent data imbalance. The main goal of this study is to offer a simplified and computationally effective approach to dealing with the conundrum of attribute selection in microarray gene expression data. We use the Black Widow Optimization algorithm (BWO) in the context of GS to achieve this, using two unique methodologies: the unaltered BWO variation and the hybridized BWO variant combined with the Iterated Greedy algorithm (BWO-IG). By improving the local search capabilities of BWO, this hybridization attempts to promote more efficient gene selection. A series of tests was carried out using nine benchmark datasets that were obtained from the gene expression data repository in the pursuit of empirical validation. The results of these tests conclusively show that the BWO-IG technique performs better than the traditional BWO algorithm. Notably, the hybridized BWO-IG technique excels in the efficiency of local searches, making it easier to identify relevant genes and producing findings with higher levels of reliability in terms of accuracy and the degree of gene pruning. Additionally, a comparison analysis is done against five modern wrapper Feature Selection (FS) methodologies, namely BIMFOHHO, BMFO, BHHO, BCS, and BBA, in order to put the suggested BWO-IG method's effectiveness into context. The comparison that follows highlights BWO-IG's obvious superiority in reducing the number of selected genes while also obtaining remarkably high classification accuracy. The key findings were an average classification accuracy of 94.426, average fitness values of 0.061, and an average number of selected genes of 2933.767.
Collapse
Affiliation(s)
- Mohammed Alweshah
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Yasmeen Aldabbas
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Bilal Abu-Salih
- Department of Computer Science, King Abdullah II School of Information Technology, The University of Jordan, Amman, Jordan
| | - Saleh Oqeil
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Hazem S. Hasan
- Department of Plant Production and Protection, Faculty of Agricultural Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Saleh Alkhalaileh
- Prince Abdullah Bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Al-Salt, Jordan
| | - Sofian Kassaymeh
- Software Engineering Department, Faculty of Information Technology, Aqaba University of Technology, Aqaba, Jordan
| |
Collapse
|
19
|
Sheta A, Thaher T, Surani SR, Turabieh H, Braik M, Too J, Abu-El-Rub N, Mafarjah M, Chantar H, Subramanian S. Diagnosis of Obstructive Sleep Apnea Using Feature Selection, Classification Methods, and Data Grouping Based Age, Sex, and Race. Diagnostics (Basel) 2023; 13:2417. [PMID: 37510161 PMCID: PMC10377846 DOI: 10.3390/diagnostics13142417] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 07/13/2023] [Accepted: 07/15/2023] [Indexed: 07/30/2023] Open
Abstract
Obstructive sleep apnea (OSA) is a prevalent sleep disorder that affects approximately 3-7% of males and 2-5% of females. In the United States alone, 50-70 million adults suffer from various sleep disorders. OSA is characterized by recurrent episodes of breathing cessation during sleep, thereby leading to adverse effects such as daytime sleepiness, cognitive impairment, and reduced concentration. It also contributes to an increased risk of cardiovascular conditions and adversely impacts patient overall quality of life. As a result, numerous researchers have focused on developing automated detection models to identify OSA and address these limitations effectively and accurately. This study explored the potential benefits of utilizing machine learning methods based on demographic information for diagnosing the OSA syndrome. We gathered a comprehensive dataset from the Torr Sleep Center in Corpus Christi, Texas, USA. The dataset comprises 31 features, including demographic characteristics such as race, age, sex, BMI, Epworth score, M. Friedman tongue position, snoring, and more. We devised a novel process encompassing pre-processing, data grouping, feature selection, and machine learning classification methods to achieve the research objectives. The classification methods employed in this study encompass decision tree (DT), naive Bayes (NB), k-nearest neighbor (kNN), support vector machine (SVM), linear discriminant analysis (LDA), logistic regression (LR), and subspace discriminant (Ensemble) classifiers. Through rigorous experimentation, the results indicated the superior performance of the optimized kNN and SVM classifiers for accurately classifying sleep apnea. Moreover, significant enhancements in model accuracy were observed when utilizing the selected demographic variables and employing data grouping techniques. For instance, the accuracy percentage demonstrated an approximate improvement of 4.5%, 5%, and 10% with the feature selection approach when applied to the grouped data of Caucasians, females, and individuals aged 50 or below, respectively. Furthermore, a comparison with prior studies confirmed that effective data grouping and proper feature selection yielded superior performance in OSA detection when combined with an appropriate classification method. Overall, the findings of this research highlight the importance of leveraging demographic information, employing proper feature selection techniques, and utilizing optimized classification models for accurate and efficient OSA diagnosis.
Collapse
Affiliation(s)
- Alaa Sheta
- Computer Science Department, Southern Connecticut State University, New Haven, CT 06514, USA
| | - Thaer Thaher
- Department of Computer Systems Engineering, Arab American University, Jenin P.O. Box 240, Palestine
| | - Salim R Surani
- Department of Pulmonary, Critical Care & Sleep Medicine, Texas A&M University, College Station, TX 77843, USA
| | - Hamza Turabieh
- Health Management and Informatics Department, School of Medicine, University of Missouri, Columbia, MO 65212, USA
| | - Malik Braik
- Department of Computer Science, Al-Balqa Applied University, Salt 19117, Jordan
| | - Jingwei Too
- Faculty of Electrical Engineering, Universiti Teknikal Malaysia Melaka, Hang Tuah Jaya, Durian Tunggal 76100, Melaka, Malaysia
| | - Noor Abu-El-Rub
- Center of Medical Informatics and Enterprise Analytics, University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Majdi Mafarjah
- Department of Computer Science, Birzeit University, Birzeit P.O. Box 14, Palestine
| | - Hamouda Chantar
- Faculty of Information Technology, Sebha University, Sebha 18758, Libya
| | - Shyam Subramanian
- Pulmonary, Critical Care & Sleep Medicine, Sutter Health, Tracy, CA 95376, USA
| |
Collapse
|
20
|
Seyyedabbasi A. Binary Sand Cat Swarm Optimization Algorithm for Wrapper Feature Selection on Biological Data. Biomimetics (Basel) 2023; 8:310. [PMID: 37504198 PMCID: PMC10807367 DOI: 10.3390/biomimetics8030310] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 06/02/2023] [Accepted: 06/04/2023] [Indexed: 07/29/2023] Open
Abstract
In large datasets, irrelevant, redundant, and noisy attributes are often present. These attributes can have a negative impact on the classification model accuracy. Therefore, feature selection is an effective pre-processing step intended to enhance the classification performance by choosing a small number of relevant or significant features. It is important to note that due to the NP-hard characteristics of feature selection, the search agent can become trapped in the local optima, which is extremely costly in terms of time and complexity. To solve these problems, an efficient and effective global search method is needed. Sand cat swarm optimization (SCSO) is a newly introduced metaheuristic algorithm that solves global optimization algorithms. Nevertheless, the SCSO algorithm is recommended for continuous problems. bSCSO is a binary version of the SCSO algorithm proposed here for the analysis and solution of discrete problems such as wrapper feature selection in biological data. It was evaluated on ten well-known biological datasets to determine the effectiveness of the bSCSO algorithm. Moreover, the proposed algorithm was compared to four recent binary optimization algorithms to determine which algorithm had better efficiency. A number of findings demonstrated the superiority of the proposed approach both in terms of high prediction accuracy and small feature sizes.
Collapse
Affiliation(s)
- Amir Seyyedabbasi
- Software Engineering Department, Faculty of Engineering and Natural Science, Istinye University, 34396 Istanbul, Turkey
| |
Collapse
|
21
|
AbdelAty AM, Yousri D, Chelloug S, Alduailij M, Abd Elaziz M. Fractional order adaptive hunter-prey optimizer for feature selection. ALEXANDRIA ENGINEERING JOURNAL 2023; 75:531-547. [DOI: 10.1016/j.aej.2023.05.092] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
|
22
|
Braik M, Awadallah MA, Al-Betar M, Hammouri AI, Alzubi OA. Cognitively Enhanced Versions of Capuchin Search Algorithm for Feature Selection in Medical Diagnosis: a COVID-19 Case Study. Cognit Comput 2023:1-38. [PMID: 37362196 PMCID: PMC10241154 DOI: 10.1007/s12559-023-10149-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 04/28/2023] [Indexed: 06/28/2023]
Abstract
Feature selection (FS) is a crucial area of cognitive computation that demands further studies. It has recently received a lot of attention from researchers working in machine learning and data mining. It is broadly employed in many different applications. Many enhanced strategies have been created for FS methods in cognitive computation to boost the performance of the methods. The goal of this paper is to present three adaptive versions of the capuchin search algorithm (CSA) that each features a better search ability than the parent CSA. These versions are used to select optimal feature subset based on a binary version of each adapted one and the k-Nearest Neighbor (k-NN) classifier. These versions were matured by applying several strategies, including automated control of inertia weight, acceleration coefficients, and other computational factors, to ameliorate search potency and convergence speed of CSA. In the velocity model of CSA, some growth computational functions, known as exponential, power, and S-shaped functions, were adopted to evolve three versions of CSA, referred to as exponential CSA (ECSA), power CSA (PCSA), and S-shaped CSA (SCSA), respectively. The results of the proposed FS methods on 24 benchmark datasets with different dimensions from various repositories were compared with other k-NN based FS methods from the literature. The results revealed that the proposed methods significantly outperformed the performance of CSA and other well-established FS methods in several relevant criteria. In particular, among the 24 datasets considered, the proposed binary ECSA, which yielded the best overall results among all other proposed versions, is able to excel the others in 18 datasets in terms of classification accuracy, 13 datasets in terms of specificity, 10 datasets in terms of sensitivity, and 14 datasets in terms of fitness values. Simply put, the results on 15, 9, and 5 datasets out of the 24 datasets studied showed that the performance levels of the binary ECSA, PCSA, and SCSA are over 90% in respect of specificity, sensitivity, and accuracy measures, respectively. The thorough results via different comparisons divulge the efficiency of the proposed methods in widening the classification accuracy compared to other methods, ensuring the ability of the proposed methods in exploring the feature space and selecting the most useful features for classification studies.
Collapse
Affiliation(s)
- Malik Braik
- Department of Computer Science, Al-Balqa Applied University, Salt, Jordan
| | - Mohammed A. Awadallah
- Department of Computer Science, Al-Aqsa University, P.O. Box 4051, Gaza, Palestine
- Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman, United Arab Emirates
| | - Mohammed Azmi Al-Betar
- Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman, United Arab Emirates
- Department of Information Technology, Al-Huson University College, Al-Balqa Applied University, Al-Huson, Irbid, Jordan
| | | | - Omar A. Alzubi
- Department of Computer Science, Al-Balqa Applied University, Salt, Jordan
| |
Collapse
|
23
|
Izci D, Ekinci S, Hussien AG. Effective PID controller design using a novel hybrid algorithm for high order systems. PLoS One 2023; 18:e0286060. [PMID: 37235627 DOI: 10.1371/journal.pone.0286060] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 05/08/2023] [Indexed: 05/28/2023] Open
Abstract
This paper discusses the merging of two optimization algorithms, atom search optimization and particle swarm optimization, to create a hybrid algorithm called hybrid atom search particle swarm optimization (h-ASPSO). Atom search optimization is an algorithm inspired by the movement of atoms in nature, which employs interaction forces and neighbor interaction to guide each atom in the population. On the other hand, particle swarm optimization is a swarm intelligence algorithm that uses a population of particles to search for the optimal solution through a social learning process. The proposed algorithm aims to reach exploration-exploitation balance to improve search efficiency. The efficacy of h-ASPSO has been demonstrated in improving the time-domain performance of two high-order real-world engineering problems: the design of a proportional-integral-derivative controller for an automatic voltage regulator and a doubly fed induction generator-based wind turbine systems. The results show that h-ASPSO outperformed the original atom search optimization in terms of convergence speed and quality of solution and can provide more promising results for different high-order engineering systems without significantly increasing the computational cost. The promise of the proposed method is further demonstrated using other available competitive methods that are utilized for the automatic voltage regulator and a doubly fed induction generator-based wind turbine systems.
Collapse
Affiliation(s)
- Davut Izci
- Department of Computer Engineering, Batman University, Batman, Turkey
- MEU Research Unit, Middle East University, Amman, Jordan
| | - Serdar Ekinci
- Department of Computer Engineering, Batman University, Batman, Turkey
| | - Abdelazim G Hussien
- Department of Computer and Information Science, Linköping University, Linköping, Sweden
- Faculty of Science, Fayoum University, Fayoum, Egypt
| |
Collapse
|
24
|
Chen Z, Xuan P, Heidari AA, Liu L, Wu C, Chen H, Escorcia-Gutierrez J, Mansour RF. An artificial bee bare-bone hunger games search for global optimization and high-dimensional feature selection. iScience 2023; 26:106679. [PMID: 37216098 PMCID: PMC10193239 DOI: 10.1016/j.isci.2023.106679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/01/2023] [Accepted: 04/12/2023] [Indexed: 05/24/2023] Open
Abstract
The domains of contemporary medicine and biology have generated substantial high-dimensional genetic data. Identifying representative genes and decreasing the dimensionality of the data can be challenging. The goal of gene selection is to minimize computing costs and enhance classification precision. Therefore, this article designs a new wrapper gene selection algorithm named artificial bee bare-bone hunger games search (ABHGS), which is the hunger games search (HGS) integrated with an artificial bee strategy and a Gaussian bare-bone structure to address this issue. To evaluate and validate the performance of our proposed method, ABHGS is compared to HGS and a single strategy embedded in HGS, six classic algorithms, and ten advanced algorithms on the CEC 2017 functions. The experimental results demonstrate that the bABHGS outperforms the original HGS. Compared to peers, it increases classification accuracy and decreases the number of selected features, indicating its actual engineering utility in spatial search and feature selection.
Collapse
Affiliation(s)
- Zhiqing Chen
- School of Intelligent Manufacturing, Wenzhou Polytechnic, Wenzhou 325035, China
| | - Ping Xuan
- Department of Computer Science, School of Engineering, Shantou University, Shantou 515063, China
| | - Ali Asghar Heidari
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Lei Liu
- College of Computer Science, Sichuan University, Chengdu, Sichuan 610065, China
| | - Chengwen Wu
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - Huiling Chen
- Key Laboratory of Intelligent Informatics for Safety & Emergency of Zhejiang Province, Wenzhou University, Wenzhou 325035, China
| | - José Escorcia-Gutierrez
- Department of Computational Science and Electronics, Universidad de la Costa, CUC, Barranquilla 080002, Colombia
| | - Romany F. Mansour
- Department of Mathematics, Faculty of Science, New Valley University, El-Kharga 72511, Egypt
| |
Collapse
|
25
|
Shekhawat SS, Sharma H, Kumar S. Memetic Spider Monkey Optimization for Spam Review Detection Problem. BIG DATA 2023; 11:137-149. [PMID: 34152859 DOI: 10.1089/big.2020.0188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Spider monkey optimization (SMO) algorithm imitates the spider monkey's fission-fusion social behavior. It is evident through literature that the SMO is a competitive swarm-based algorithm that is used to solve difficult real-life problems. The SMO's search process is a little bit biased by the random component that drives it with high explorative searching steps. A hybridized SMO with a memetic search to improve the local search ability of SMO is proposed here. The newly developed strategy is titled Memetic SMO (MeSMO). Further, the proposed MeSMO-based clustering approach is applied to solve a big data problem, namely, the spam review detection problem. A customer usually makes decisions to purchase something or make an image of someone based on online reviews. Therefore, there is a good chance that the individuals or companies may write spam reviews to upgrade or degrade the stature or value of a trader/product/company. Therefore, an efficient spam detection algorithm, MeSMO, is proposed and tested over four complex spam datasets. The reported results of MeSMO are compared with the outcomes obtained from the six state-of-art strategies. A comparative analysis of the results proved that MeSMO is a good technique to solve the spam review detection problem and improved precision by 3.68%.
Collapse
Affiliation(s)
- Sayar Singh Shekhawat
- Department of Computer Science and Engineering, Rajasthan Technical University, Kota, India
| | - Harish Sharma
- Department of Computer Science and Engineering, Rajasthan Technical University, Kota, India
| | - Sandeep Kumar
- Department of Computer Science and Engineering, CHRIST (Deemed to be University), Bangalore, India
| |
Collapse
|
26
|
Hashem HA, Abdulazeem Y, Labib LM, Elhosseini MA, Shehata M. An Integrated Machine Learning-Based Brain Computer Interface to Classify Diverse Limb Motor Tasks: Explainable Model. SENSORS (BASEL, SWITZERLAND) 2023; 23:3171. [PMID: 36991884 PMCID: PMC10053613 DOI: 10.3390/s23063171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Revised: 02/27/2023] [Accepted: 03/13/2023] [Indexed: 06/19/2023]
Abstract
Terminal neurological conditions can affect millions of people worldwide and hinder them from doing their daily tasks and movements normally. Brain computer interface (BCI) is the best hope for many individuals with motor deficiencies. It will help many patients interact with the outside world and handle their daily tasks without assistance. Therefore, machine learning-based BCI systems have emerged as non-invasive techniques for reading out signals from the brain and interpreting them into commands to help those people to perform diverse limb motor tasks. This paper proposes an innovative and improved machine learning-based BCI system that analyzes EEG signals obtained from motor imagery to distinguish among various limb motor tasks based on BCI competition III dataset IVa. The proposed framework pipeline for EEG signal processing performs the following major steps. The first step uses a meta-heuristic optimization technique, called the whale optimization algorithm (WOA), to select the optimal features for discriminating between neural activity patterns. The pipeline then uses machine learning models such as LDA, k-NN, DT, RF, and LR to analyze the chosen features to enhance the precision of EEG signal analysis. The proposed BCI system, which merges the WOA as a feature selection method and the optimized k-NN classification model, demonstrated an overall accuracy of 98.6%, outperforming other machine learning models and previous techniques on the BCI competition III dataset IVa. Additionally, the EEG feature contribution in the ML classification model is reported using Explainable AI (XAI) tools, which provide insights into the individual contributions of the features in the predictions made by the model. By incorporating XAI techniques, the results of this study offer greater transparency and understanding of the relationship between the EEG features and the model's predictions. The proposed method shows potential levels for better use in controlling diverse limb motor tasks to help people with limb impairments and support them while enhancing their quality of life.
Collapse
Affiliation(s)
- Hend A. Hashem
- Computers and Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura 35516, Egypt
- Nile Higher Institute of Engineering and Technology, Mansoura University, Mansoura 35516, Egypt
| | - Yousry Abdulazeem
- Computer Engineering Department, MISR Higher Institute for Engineering and Technology, Mansoura University, Mansoura 35516, Egypt
| | - Labib M. Labib
- Computers and Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura 35516, Egypt
| | - Mostafa A. Elhosseini
- Computers and Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura 35516, Egypt
- College of Computer Science and Engineering, Taibah University, Yanbu 46421, Saudi Arabia
| | - Mohamed Shehata
- Computers and Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura 35516, Egypt
- Computer Science and Engineering Department, Speed School of Engineering, University of Louisville, Louisville, KY 40292, USA
| |
Collapse
|
27
|
An enhanced multi-operator differential evolution algorithm for tackling knapsack optimization problem. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08358-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2023]
|
28
|
Kottath R, Singh P, Bhowmick A. Swarm-based hybrid optimization algorithms: an exhaustive analysis and its applications to electricity load and price forecasting. Soft comput 2023; 27:1-32. [PMID: 37362291 PMCID: PMC10008129 DOI: 10.1007/s00500-023-07928-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/11/2023] [Indexed: 03/13/2023]
Abstract
In this work, we intend to propose multiple hybrid algorithms with the idea of giving a choice to the particles of a swarm to update their position for the next generation. To implement this concept, Cuckoo Search Algorithm (CSA), Grey Wolf Optimization (GWO), Harris Hawks Optimization (HHO), and Whale Optimization Algorithm (WOA) have been utilized. Exhaustive possible combinations of these algorithms are developed and benchmarked against the base algorithms. These hybrid algorithms have been validated on twenty-four well-known unimodal and multimodal benchmarks functions, and detailed analysis with varying dimensions and population size is discussed for the same. Further, the efficacy of these algorithms has been tested on short-term electricity load and price forecasting applications. For this purpose, the algorithms have been combined with Artificial Neural Networks (ANNs) to evaluate their performance on the ISO New Pool England dataset. The results demonstrate that hybrid optimization algorithms perform superior to their base algorithms in most test cases. Furthermore, the results show that the performance of CSA-GWO is significantly better than other algorithms.
Collapse
Affiliation(s)
- Rahul Kottath
- Digital Tower, Bentley Systems India Private Limited, Pune, India
- School of Electrical and Electronics Engineering, VIT Bhopal University, Bhopal, MP 466114 India
| | - Priyanka Singh
- Department of Computer Science and Engineering, SRM University-AP, Amaravati, AP 522502 India
| | - Anirban Bhowmick
- School of Electrical and Electronics Engineering, VIT Bhopal University, Bhopal, MP 466114 India
| |
Collapse
|
29
|
Marjit S, Bhattacharyya T, Chatterjee B, Sarkar R. Simulated annealing aided genetic algorithm for gene selection from microarray data. Comput Biol Med 2023; 158:106854. [PMID: 37023541 DOI: 10.1016/j.compbiomed.2023.106854] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 02/26/2023] [Accepted: 03/30/2023] [Indexed: 04/03/2023]
Abstract
In recent times, microarray gene expression datasets have gained significant popularity due to their usefulness to identify different types of cancer directly through bio-markers. These datasets possess a high gene-to-sample ratio and high dimensionality, with only a few genes functioning as bio-markers. Consequently, a significant amount of data is redundant, and it is essential to filter out important genes carefully. In this paper, we propose the Simulated Annealing aided Genetic Algorithm (SAGA), a meta-heuristic approach to identify informative genes from high-dimensional datasets. SAGA utilizes a two-way mutation-based Simulated Annealing (SA) as well as Genetic Algorithm (GA) to ensure a good trade-off between exploitation and exploration of the search space, respectively. The naive version of GA often gets stuck in a local optimum and depends on the initial population, leading to premature convergence. To address this, we have blended a clustering-based population generation with SA to distribute the initial population of GA over the entire feature space. To further enhance the performance, we reduce the initial search space by a score-based filter approach called the Mutually Informed Correlation Coefficient (MICC). The proposed method is evaluated on 6 microarray and 6 omics datasets. Comparison of SAGA with contemporary algorithms has shown that SAGA performs much better than its peers. Our code is available at https://github.com/shyammarjit/SAGA.
Collapse
|
30
|
An Optimization-Linked Intelligent Security Algorithm for Smart Healthcare Organizations. Healthcare (Basel) 2023; 11:healthcare11040580. [PMID: 36833114 PMCID: PMC9956199 DOI: 10.3390/healthcare11040580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 01/24/2023] [Accepted: 02/10/2023] [Indexed: 02/17/2023] Open
Abstract
IoT-enabled healthcare apps are providing significant value to society by offering cost-effective patient monitoring solutions in IoT-enabled buildings. However, with a large number of users and sensitive personal information readily available in today's fast-paced, internet, and cloud-based environment, the security of these healthcare systems must be a top priority. The idea of safely storing a patient's health data in an electronic format raises issues regarding patient data privacy and security. Furthermore, with traditional classifiers, processing large amounts of data is a difficult challenge. Several computational intelligence approaches are useful for effectively categorizing massive quantities of data for this goal. For many of these reasons, a novel healthcare monitoring system that tracks disease processes and forecasts diseases based on the available data obtained from patients in distant communities is proposed in this study. The proposed framework consists of three major stages, namely data collection, secured storage, and disease detection. The data are collected using IoT sensor devices. After that, the homomorphic encryption (HE) model is used for secured data storage. Finally, the disease detection framework is designed with the help of Centered Convolutional Restricted Boltzmann Machines-based whale optimization (CCRBM-WO) algorithm. The experiment is conducted on a Python-based cloud tool. The proposed system outperforms current e-healthcare solutions, according to the findings of the experiments. The accuracy, precision, F1-measure, and recall of our suggested technique are 96.87%, 97.45%, 97.78%, and 98.57%, respectively, according to the proposed method.
Collapse
|
31
|
Mafarja M, Thaher T, Al-Betar MA, Too J, Awadallah MA, Abu Doush I, Turabieh H. Classification framework for faulty-software using enhanced exploratory whale optimizer-based feature selection scheme and random forest ensemble learning. APPL INTELL 2023; 53:1-43. [PMID: 36785593 PMCID: PMC9909674 DOI: 10.1007/s10489-022-04427-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/23/2022] [Indexed: 02/11/2023]
Abstract
Software Fault Prediction (SFP) is an important process to detect the faulty components of the software to detect faulty classes or faulty modules early in the software development life cycle. In this paper, a machine learning framework is proposed for SFP. Initially, pre-processing and re-sampling techniques are applied to make the SFP datasets ready to be used by ML techniques. Thereafter seven classifiers are compared, namely K-Nearest Neighbors (KNN), Naive Bayes (NB), Linear Discriminant Analysis (LDA), Linear Regression (LR), Decision Tree (DT), Support Vector Machine (SVM), and Random Forest (RF). The RF classifier outperforms all other classifiers in terms of eliminating irrelevant/redundant features. The performance of RF is improved further using a dimensionality reduction method called binary whale optimization algorithm (BWOA) to eliminate the irrelevant/redundant features. Finally, the performance of BWOA is enhanced by hybridizing the exploration strategies of the grey wolf optimizer (GWO) and harris hawks optimization (HHO) algorithms. The proposed method is called SBEWOA. The SFP datasets utilized are selected from the PROMISE repository using sixteen datasets for software projects with different sizes and complexity. The comparative evaluation against nine well-established feature selection methods proves that the proposed SBEWOA is able to significantly produce competitively superior results for several instances of the evaluated dataset. The algorithms' performance is compared in terms of accuracy, the number of features, and fitness function. This is also proved by the 2-tailed P-values of the Wilcoxon signed ranks statistical test used. In conclusion, the proposed method is an efficient alternative ML method for SFP that can be used for similar problems in the software engineering domain.
Collapse
Affiliation(s)
- Majdi Mafarja
- Department of Computer Science, Birzeit University, Birzeit, Palestine
| | - Thaer Thaher
- Department of Computer Systems Engineering, Arab American University, Jenin, Palestine
- Information Technology Engineering, Al-Quds University, Abu Dies, Jerusalem, Palestine
| | - Mohammed Azmi Al-Betar
- Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman, United Arab EmiratesDeepSinghML2017, Irbid, Jordan
| | - Jingwei Too
- Faculty of Electrical Engineering, Universiti Teknikal Malaysia Melaka, Hang Tuah Jaya, 76100 Durian Tunggal Melaka, Malaysia
| | - Mohammed A. Awadallah
- Department of Computer Science, Al-Aqsa University, P.O. Box 4051, Gaza, Palestine
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, United Arab Emirates
| | - Iyad Abu Doush
- Department of Computing, College of Engineering and Applied Sciences, American University of Kuwait, Salmiya, Kuwait
- Computer Science Department, Yarmouk University, Irbid, Jordan
| | - Hamza Turabieh
- Department of Health Management and Informatics, University of Missouri, Columbia, 5 Hospital Drive, Columbia, MO 65212 USA
| |
Collapse
|
32
|
An improved feature selection approach using global best guided Gaussian artificial bee colony for EMG classification. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
33
|
An enhanced vortex search algorithm based on fluid particle density transfer for global and engineering optimization. Appl Soft Comput 2023. [DOI: 10.1016/j.asoc.2023.110024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
34
|
Pramanik P, Mukhopadhyay S, Mirjalili S, Sarkar R. Deep feature selection using local search embedded social ski-driver optimization algorithm for breast cancer detection in mammograms. Neural Comput Appl 2023; 35:5479-5499. [PMID: 36373132 PMCID: PMC9638217 DOI: 10.1007/s00521-022-07895-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 09/25/2022] [Indexed: 11/06/2022]
Abstract
Breast cancer has become a common malignancy in women. However, early detection and identification of this disease can save many lives. As computer-aided detection helps radiologists in detecting abnormalities efficiently, researchers across the world are striving to develop reliable models to deal with. One of the common approaches to identifying breast cancer is through breast mammograms. However, the identification of malignant breasts from mass lesions is a challenging research problem. In the current work, we propose a method for the classification of breast mass using mammograms which consists of two main stages. At first, we extract deep features from the input mammograms using the well-known VGG16 model while incorporating an attention mechanism into this model. Next, we apply a meta-heuristic called Social Ski-Driver (SSD) algorithm embedded with Adaptive Beta Hill Climbing based local search to obtain an optimal features subset. The optimal features subset is fed to the K-nearest neighbors (KNN) classifier for the classification. The proposed model is demonstrated to be very useful for identifying and differentiating malignant and healthy breasts successfully. For experimentation, we evaluate our model on the digital database for screening mammography (DDSM) database and achieve 96.07% accuracy using only 25% of features extracted by the attention-aided VGG16 model. The Python code of our research work is publicly available at: https://github.com/Ppayel/BreastLocalSearchSSD.
Collapse
Affiliation(s)
- Payel Pramanik
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| | | | - Seyedali Mirjalili
- Centre for Artificial Intelligence Research and Optimisation, Torrens University Australia, Brisbane, QLD 4006 Australia ,Yonsei Frontier Lab, Yonsei University, Seoul, South Korea ,University Research and Innovation Center, Óbuda University, Budapest, 1034 Hungary
| | - Ram Sarkar
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, India
| |
Collapse
|
35
|
Li Y, Wang H, Fan J, Geng Y. A novel Q-learning algorithm based on improved whale optimization algorithm for path planning. PLoS One 2022; 17:e0279438. [PMID: 36574399 PMCID: PMC9794100 DOI: 10.1371/journal.pone.0279438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 12/06/2022] [Indexed: 12/29/2022] Open
Abstract
Q-learning is a classical reinforcement learning algorithm and one of the most important methods of mobile robot path planning without a prior environmental model. Nevertheless, Q-learning is too simple when initializing Q-table and wastes too much time in the exploration process, causing a slow convergence speed. This paper proposes a new Q-learning algorithm called the Paired Whale Optimization Q-learning Algorithm (PWOQLA) which includes four improvements. Firstly, to accelerate the convergence speed of Q-learning, a whale optimization algorithm is used to initialize the values of a Q-table. Before the exploration process, a Q-table which contains previous experience is learned to improve algorithm efficiency. Secondly, to improve the local exploitation capability of the whale optimization algorithm, a paired whale optimization algorithm is proposed in combination with a pairing strategy to speed up the search for prey. Thirdly, to improve the exploration efficiency of Q-learning and reduce the number of useless explorations, a new selective exploration strategy is introduced which considers the relationship between current position and target position. Fourthly, in order to balance the exploration and exploitation capabilities of Q-learning so that it focuses on exploration in the early stage and on exploitation in the later stage, a nonlinear function is designed which changes the value of ε in ε-greedy Q-learning dynamically based on the number of iterations. Comparing the performance of PWOQLA with other path planning algorithms, experimental results demonstrate that PWOQLA achieves a higher level of accuracy and a faster convergence speed than existing counterparts in mobile robot path planning. The code will be released at https://github.com/wanghanyu0526/improveQL.git.
Collapse
Affiliation(s)
- Ying Li
- College of Computer Science and Technology, Jilin University, Changchun, People’s Republic of China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, People’s Republic of China
| | - Hanyu Wang
- College of Computer Science and Technology, Jilin University, Changchun, People’s Republic of China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, People’s Republic of China
- * E-mail:
| | - Jiahao Fan
- College of Computer Science, Sichuan University, Chengdu, People’s Republic of China
| | - Yanyu Geng
- College of Computer Science and Technology, Jilin University, Changchun, People’s Republic of China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, People’s Republic of China
| |
Collapse
|
36
|
Zhao X, Li F, Chen B, Li X, Lub S. Modeling the hardness properties of high-performance concrete via developed RBFNN coupling matheuristic algorithms. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-222805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Examining the properties of High-Performance Concrete (HPC) has been a big challenge due to the highly heterogeneous relationships and coherence among several constituents. The employment of silica fume and fly ash as eco-friendly components in mixtures benefits the concrete to improve its physical features. Although machine learning approaches are utilized broadly in many studies solitarily to estimate the mechanical features of concrete, causing to reduce accuracy and lift the cost and complexities of computational networks. Consequently, current research aims to develop a Radial Basis Function Neural Network (RBFNN) integrating with optimization algorithms in order to precisely model the mechanical characteristics of HPC mixtures including compressive strength (CS) and slump (SL). Feeding the dataset of HPC samples to hybrid models will result to reproduce the given CS and SL factors simultaneously. The results of the models showed that the maximum rate of correlation between estimated values and measured ones was obtained at 98.3% while the minimum rate of RMSE was calculated at 3.684 mm (and MPa) in the testing phase. Employing such soft-oriented approaches has been benefiting us to reduce costs and increase the result accuracy.
Collapse
Affiliation(s)
- Xiaofang Zhao
- Lanzhou Resources Environment Voc-Tech University Lan Zhou, China
| | - Faming Li
- Lanzhou Resources Environment Voc-Tech University Lan Zhou, China
| | - Biao Chen
- Lanzhou Resources Environment Voc-Tech University Lan Zhou, China
| | - Xiaofei Li
- Lanzhou Resources Environment Voc-Tech University Lan Zhou, China
| | | |
Collapse
|
37
|
Hu G, Zhong J, Wang X, Wei G. Multi-strategy assisted chaotic coot-inspired optimization algorithm for medical feature selection: A cervical cancer behavior risk study. Comput Biol Med 2022; 151:106239. [PMID: 36335810 DOI: 10.1016/j.compbiomed.2022.106239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 10/18/2022] [Accepted: 10/22/2022] [Indexed: 12/27/2022]
Abstract
Real-world optimization problems require some advanced metaheuristic algorithms, which functionally sustain a variety of solutions and technically explore the tracking space to find the global optimal solution or optimizer. One such algorithm is the newly developed COOT algorithm that is used to solve complex optimization problems. However, like other swarm intelligence algorithms, the COOT algorithm also faces the issues of low diversity, slow iteration speed, and stagnation in local optimization. In order to ameliorate these deficiencies, an improved population-initialized COOT algorithm named COBHCOOT is developed by integrating chaos map, opposition-based learning strategy and hunting strategy, which are used to accelerate the global convergence speed and boost the exploration efficiency and solution quality of the algorithm. To validate the dominance of the proposed COBHCOOT, it is compared with the original COOT algorithm and the well-known natural heuristic optimization algorithm on the recognized CEC2017 and CEC2019 benchmark suites, respectively. For the 29 CEC2017 problems, COBHCOOT performed the best in 15 (51.72%, 30-Dim), 14 (48.28%, 50-Dim) and 11 (37.93%, 100-Dim) respectively, and for the 10 CEC2019 benchmark functions, COBHCOOT performed the best in 7 of them. Furthermore, the practicability and potential of COBHCOOT are also highlighted by solving two engineering optimization problems and four truss structure optimization problems. Eventually, to examine the validity and performance of COBHCOOT for medical feature selection, eight medical datasets are used as benchmarks to compare with other superior methods in terms of average accuracy and number of features. Particularly, COBHCOOT is applied to the feature selection of cervical cancer behavior risk dataset. The findings testified that COBHCOOT achieves better accuracy with a minimal number of features compared with the comparison methods.
Collapse
Affiliation(s)
- Gang Hu
- Department of Applied Mathematics, Xi'an University of Technology, Xi'an, 710054, PR China; School of Computer Science and Engineering,, Xi'an University of Technology, Xi'an, 710048, PR China.
| | - Jingyu Zhong
- Department of Applied Mathematics, Xi'an University of Technology, Xi'an, 710054, PR China
| | - Xupeng Wang
- School of Art and Design, Xi'an University of Technology, Xi'an, 710054, China
| | - Guo Wei
- University of North Carolina at Pembroke, Pembroke, NC, 28372, USA
| |
Collapse
|
38
|
Zhu Y, Li W, Li T. A hybrid Artificial Immune optimization for high-dimensional feature selection. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
39
|
Wang Z, Gao S, Zhang Y, Guo L. Symmetric uncertainty-incorporated probabilistic sequence-based ant colony optimization for feature selection in classification. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
40
|
Zhu Y, Li T, Lan X. Feature selection optimized by the artificial immune algorithm based on genome shuffling and conditional lethal mutation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03971-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
41
|
A novel whale optimization algorithm optimized XGBoost regression for estimating bearing capacity of concrete piles. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07896-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
42
|
An Efficient Hybrid Feature Selection Method Using the Artificial Immune Algorithm for High-Dimensional Data. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1452301. [PMID: 36275946 PMCID: PMC9584659 DOI: 10.1155/2022/1452301] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 07/31/2022] [Accepted: 08/29/2022] [Indexed: 12/02/2022]
Abstract
Feature selection provides the optimal subset of features for data mining models. However, current feature selection methods for high-dimensional data also require a better balance between feature subset quality and computational cost. In this paper, an efficient hybrid feature selection method (HFIA) based on artificial immune algorithm optimization is proposed to solve the feature selection problem of high-dimensional data. The algorithm combines filter algorithms and improves clone selection algorithms to explore the feature space of high-dimensional data. According to the target requirements of feature selection, combined with biological research results, this method introduces the lethal mutation mechanism and the Cauchy operator to improve the search performance of the algorithm. Moreover, the adaptive adjustment factor is introduced in the mutation and update phases of the algorithm. The effective combination of these mechanisms enables the algorithm to obtain a better search ability and lower computational costs. Experimental comparisons with 19 state-of-the-art feature selection methods are conducted on 25 high-dimensional benchmark datasets. The results show that the feature reduction rate for all datasets is above 99%, and the performance improvement for the classifier is between 5% and 48.33%. Compared with the five classical filtering feature selection methods, the computational cost of HFIA is lower than the two of them, and it is far better than these five algorithms in terms of the feature reduction rate and classification accuracy improvement. Compared with the 14 hybrid feature selection methods reported in the latest literature, the average winning rates in terms of classification accuracy, feature reduction rate, and computational cost are 85.83%, 88.33%, and 96.67%, respectively.
Collapse
|
43
|
Abed-alguni BH, Alawad NA, Al-Betar MA, Paul D. Opposition-based sine cosine optimizer utilizing refraction learning and variable neighborhood search for feature selection. APPL INTELL 2022; 53:13224-13260. [PMID: 36247211 PMCID: PMC9547101 DOI: 10.1007/s10489-022-04201-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/21/2022] [Indexed: 12/03/2022]
Abstract
This paper proposes new improved binary versions of the Sine Cosine Algorithm (SCA) for the Feature Selection (FS) problem. FS is an essential machine learning and data mining task of choosing a subset of highly discriminating features from noisy, irrelevant, high-dimensional, and redundant features to best represent a dataset. SCA is a recent metaheuristic algorithm established to emulate a model based on sine and cosine trigonometric functions. It was initially proposed to tackle problems in the continuous domain. The SCA has been modified to Binary SCA (BSCA) to deal with the binary domain of the FS problem. To improve the performance of BSCA, three accumulative improved variations are proposed (i.e., IBSCA1, IBSCA2, and IBSCA3) where the last version has the best performance. IBSCA1 employs Opposition Based Learning (OBL) to help ensure a diverse population of candidate solutions. IBSCA2 improves IBSCA1 by adding Variable Neighborhood Search (VNS) and Laplace distribution to support several mutation methods. IBSCA3 improves IBSCA2 by optimizing the best candidate solution using Refraction Learning (RL), a novel OBL approach based on light refraction. For performance evaluation, 19 real-wold datasets, including a COVID-19 dataset, were selected with different numbers of features, classes, and instances. Three performance measurements have been used to test the IBSCA versions: classification accuracy, number of features, and fitness values. Furthermore, the performance of the last variation of IBSCA3 is compared against 28 existing popular algorithms. Interestingly, IBCSA3 outperformed almost all comparative methods in terms of classification accuracy and fitness values. At the same time, it was ranked 15 out of 19 in terms of number of features. The overall simulation and statistical results indicate that IBSCA3 performs better than the other algorithms.
Collapse
Affiliation(s)
| | | | - Mohammed Azmi Al-Betar
- Artificial Intelligence Research Center (AIRC), College of Engineering and Information Technology, Ajman University, Ajman, United Arab Emirates
| | - David Paul
- School of Science and Technology, University of New England, Armidale, Australia
| |
Collapse
|
44
|
Akinola OA, Agushaka JO, Ezugwu AE. Binary dwarf mongoose optimizer for solving high-dimensional feature selection problems. PLoS One 2022; 17:e0274850. [PMID: 36201524 PMCID: PMC9536540 DOI: 10.1371/journal.pone.0274850] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/06/2022] [Indexed: 11/13/2022] Open
Abstract
Selecting appropriate feature subsets is a vital task in machine learning. Its main goal is to remove noisy, irrelevant, and redundant feature subsets that could negatively impact the learning model's accuracy and improve classification performance without information loss. Therefore, more advanced optimization methods have been employed to locate the optimal subset of features. This paper presents a binary version of the dwarf mongoose optimization called the BDMO algorithm to solve the high-dimensional feature selection problem. The effectiveness of this approach was validated using 18 high-dimensional datasets from the Arizona State University feature selection repository and compared the efficacy of the BDMO with other well-known feature selection techniques in the literature. The results show that the BDMO outperforms other methods producing the least average fitness value in 14 out of 18 datasets which means that it achieved 77.77% on the overall best fitness values. The result also shows BDMO demonstrating stability by returning the least standard deviation (SD) value in 13 of 18 datasets (72.22%). Furthermore, the study achieved higher validation accuracy in 15 of the 18 datasets (83.33%) over other methods. The proposed approach also yielded the highest validation accuracy attainable in the COIL20 and Leukemia datasets which vividly portray the superiority of the BDMO.
Collapse
Affiliation(s)
- Olatunji A. Akinola
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
| | - Jeffrey O. Agushaka
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
- Department of Computer Science, Federal University of Lafia, Lafia, Nasarawa State, Nigeria
| | - Absalom E. Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, KwaZulu-Natal, South Africa
| |
Collapse
|
45
|
Pipeline damage identification based on an optimized back-propagation neural network improved by whale optimization algorithm. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04188-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
46
|
An Efficient High-dimensional Feature Selection Approach Driven By Enhanced Multi-strategy Grey Wolf Optimizer for Biological Data Classification. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07836-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/10/2022]
|
47
|
Obadina OO, Thaha MA, Mohamed Z, Shaheed MH. Grey-box modelling and fuzzy logic control of a Leader-Follower robot manipulator system: A hybrid Grey Wolf-Whale Optimisation approach. ISA TRANSACTIONS 2022; 129:572-593. [PMID: 35277266 DOI: 10.1016/j.isatra.2022.02.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 01/04/2022] [Accepted: 02/15/2022] [Indexed: 06/14/2023]
Abstract
This study presents the development of a grey-box modelling approach and fuzzy logic control for real time trajectory control of an experimental four degree-of-freedom Leader-Follower Robot (LFR) manipulator system using a hybrid optimisation algorithm, known as Grey Wolf Optimiser (GWO) - Whale Optimisation Algorithm (WOA). The approach has advantages in achieving an accurate model of the LFR manipulator system, and together with a better trajectory tracking performance. In the first instance, the white box model is formed by modelling the dynamics of the follower manipulator using the Euler-Lagrange formulation. This white-box model is then improved upon by re-tuning the model's parameters using GWO-WOA and experimental data from the real LFR manipulator system, thus forming the grey-box model. A minimum improvement of 73.9% is achieved by the grey-box model in comparison to the white-box model. In the latter part of this investigation, the developed grey-box model is used for the design, tuning and real-time implementation of a fuzzy PD+I controller on the experimental LFR manipulator system. A 78% improvement in the total mean squared error is realised after tuning the membership functions of the fuzzy logic controller using GWO-WOA. Experimental results show that the approach significantly improves the trajectory tracking performance of the LFR manipulator system in terms of mean squared error, steady state error and time delay.
Collapse
Affiliation(s)
- Ololade O Obadina
- School of Engineering and Materials Science, Queen Mary University of London, UK
| | - Mohamed A Thaha
- Blizard Institute, Barts and The London School of Medicine & Dentistry, Queen Mary University of London, UK; Department of Colorectal Surgery, Royal London Hospital, Barts Health NHS Trust, Whitechapel, London, UK
| | - Zaharuddin Mohamed
- School of Electrical Engineering, Universiti Teknologi Malaysia, Malaysia
| | - M Hasan Shaheed
- School of Engineering and Materials Science, Queen Mary University of London, UK.
| |
Collapse
|
48
|
R. S, S. A. A, Y. V. RR, Sadiq AS. Traffic flow forecasting using natural selection based hybrid Bald Eagle Search-Grey Wolf optimization algorithm. PLoS One 2022; 17:e0275104. [PMID: 36162064 PMCID: PMC9512416 DOI: 10.1371/journal.pone.0275104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 09/11/2022] [Indexed: 11/18/2022] Open
Abstract
In a fast-moving world, transportation consumes most of the time and resources. Traffic prediction has become a thrust application for machine learning algorithms to overcome the hurdles faced by congestion. Its accuracy determines the selection and existence of machine learning algorithms. The accuracy of such an algorithm is improved better by the proper tuning of the parameters. Support Vector Regression (SVR) is a well-known prediction mechanism. This paper exploits the Hybrid Grey Wolf Optimization-Bald Eagle Search (GWO-BES) algorithm for tuning SVR parameters, wherein the GWO selection methods are of natural selection. SVR-GWO-BES with natural selection has error performance increases by 48% in Mean Absolute Percentage Error and Root Mean Square Error, with the help of Caltrans Performance Measurement System (PeMS) open-source data and Chennai city traffic data for traffic forecasting. It is also shown that the increasing population of search agents increases the performance.
Collapse
Affiliation(s)
- Sivakumar R.
- School of Electronics Engineering, Vellore Institute of Technology, Vellore, India
| | - Angayarkanni S. A.
- Department of Networking and Communication, School of Computing, SRM Institute of Science and Technology, Chengalpattu, India
| | - Ramana Rao Y. V.
- Department of Electronics and Communication, College of Engineering Guindy, Anna University, Chennai, India
| | - Ali Safaa Sadiq
- Department of Computer Science, Nottingham Trent University, Clifton Lane, NG11 8NS, Nottingham, UK
| |
Collapse
|
49
|
Yi Z, Yangkun Z, Hongda Y, Hong W. Application of an improved Discrete Salp Swarm Algorithm to the wireless rechargeable sensor network problem. Front Bioeng Biotechnol 2022; 10:923798. [PMID: 36204468 PMCID: PMC9531118 DOI: 10.3389/fbioe.2022.923798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 08/08/2022] [Indexed: 11/13/2022] Open
Abstract
This paper presents an improved Discrete Salp Swarm Algorithm based on the Ant Colony System (DSSACS). Firstly, we use the Ant Colony System (ACS) to optimize the initialization of the salp colony and discretize the algorithm, then use the crossover operator and mutation operator to simulate the foraging behavior of the followers in the salp colony. We tested DSSACS with several algorithms on the TSP dataset. For TSP files of different sizes, the error of DSSACS is generally between 0.78% and 2.95%, while other algorithms are generally higher than 2.03%, or even 6.43%. The experiments show that our algorithm has a faster convergence speed, better positive feedback mechanism, and higher accuracy. We also apply the new algorithm for the Wireless rechargeable sensor network (WRSN) problem. For the selection of the optimal path, the path selected by DSSACS is always about 20% shorter than the path selected by ACS. Results show that DSSACS has obvious advantages over other algorithms in MCV’s multi-path planning and saves more time and economic cost than other swarm intelligence algorithms in the wireless rechargeable sensor network.
Collapse
Affiliation(s)
- Zhang Yi
- College of Electrical and Computer Science, Jilin Jianzhu University, Changchun, China
| | - Zhou Yangkun
- College of Electrical and Computer Science, Jilin Jianzhu University, Changchun, China
| | - Yu Hongda
- College of Electrical and Computer Science, Jilin Jianzhu University, Changchun, China
| | - Wang Hong
- Information Center of the Ministry of Natural Resources, Beijing, China
- *Correspondence: Wang Hong,
| |
Collapse
|
50
|
Hybrid binary COOT algorithm with simulated annealing for feature selection in high-dimensional microarray data. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07780-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|