1
|
Yang G, Li W, Xie W, Wang L, Yu K. An improved binary particle swarm optimization algorithm for clinical cancer biomarker identification in microarray data. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 244:107987. [PMID: 38157825 DOI: 10.1016/j.cmpb.2023.107987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/04/2023] [Accepted: 12/16/2023] [Indexed: 01/03/2024]
Abstract
BACKGROUND AND OBJECTIVE The limited number of samples and high-dimensional features in microarray data make selecting a small number of features for disease diagnosis a challenging problem. Traditional feature selection methods based on evolutionary algorithms are difficult to search for the optimal set of features in a limited time when dealing with the high-dimensional feature selection problem. New solutions are proposed to solve the above problems. METHODS In this paper, we propose a hybrid feature selection method (C-IFBPFE) for biomarker identification in microarray data, which combines clustering and improved binary particle swarm optimization while incorporating an embedded feature elimination strategy. Firstly, an adaptive redundant feature judgment method based on correlation clustering is proposed for feature screening to reduce the search space in the subsequent stage. Secondly, we propose an improved flipping probability-based binary particle swarm optimization (IFBPSO), better applicable to the binary particle swarm optimization problem. Finally, we also design a new feature elimination (FE) strategy embedded in the binary particle swarm optimization algorithm. This strategy gradually removes poorer features during iterations to reduce the number of features and improve accuracy. RESULTS We compared C-IFBPFE with other published hybrid feature selection methods on eight public datasets and analyzed the impact of each improvement. The proposed method outperforms other current state-of-the-art feature selection methods in terms of accuracy, number of features, sensitivity, and specificity. The ablation study of this method validates the efficacy of each component, especially the proposed feature elimination strategy significantly improves the performance of the algorithm. CONCLUSIONS The hybrid feature selection method proposed in this paper helps address the issue of high-dimensional microarray data with few samples. It can select a small subset of features and achieve high classification accuracy on microarray datasets. Additionally, independent validation of the selected features shows that those chosen by C-IFBPFE have strong correlations with disease phenotypes and can identify important biomarkers from data related to biomedical problems.
Collapse
Affiliation(s)
- Guicheng Yang
- College of Computer Science and Engineering, Northeastern University, Shenyang, 110000, Liaoning, China.
| | - Wei Li
- Key Laboratory of Intelligent Computing in Medical Image (MIIC), Northeastern University, Ministry of Education, Shenyang, 110000, Liaoning, China; National Frontiers Science Center for Industrial Intelligence and Systems Optimization, Shenyang, 110819, Liaoning, China.
| | - Weidong Xie
- College of Computer Science and Engineering, Northeastern University, Shenyang, 110000, Liaoning, China.
| | - Linjie Wang
- College of Computer Science and Engineering, Northeastern University, Shenyang, 110000, Liaoning, China.
| | - Kun Yu
- College of Medicine and Bioinformation Engineering, Northeastern University, Shenyang, 110819, Liaoning, China.
| |
Collapse
|
2
|
Rai R, Dhal KG. Recent Developments in Equilibrium Optimizer Algorithm: Its Variants and Applications. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING : STATE OF THE ART REVIEWS 2023; 30:1-54. [PMID: 37359743 PMCID: PMC10096115 DOI: 10.1007/s11831-023-09923-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 03/26/2023] [Indexed: 06/28/2023]
Abstract
There have been many algorithms created and introduced in the literature inspired by various events observable in nature, such as evolutionary phenomena, the actions of social creatures or agents, broad principles based on physical processes, the nature of chemical reactions, human behavior, superiority, and intelligence, intelligent behavior of plants, numerical techniques and mathematics programming procedure and its orientation. Nature-inspired metaheuristic algorithms have dominated the scientific literature and have become a widely used computing paradigm over the past two decades. Equilibrium Optimizer, popularly known as EO, is a population-based, nature-inspired meta-heuristics that belongs to the class of Physics based optimization algorithms, enthused by dynamic source and sink models with a physics foundation that are used to make educated guesses about equilibrium states. EO has achieved massive recognition, and there are quite a few changes made to existing EOs. This article gives a thorough review of EO and its variations. We started with 175 research articles published by several major publishers. Additionally, we discuss the strengths and weaknesses of the algorithms to help researchers find the variant that best suits their needs. The core optimization problems from numerous application areas using EO are also covered in the study, including image classification, scheduling problems, and many others. Lastly, this work recommends a few potential areas for EO research in the future.
Collapse
Affiliation(s)
- Rebika Rai
- Department of Computer Applications, Sikkim University, Sikkim, India
| | - Krishna Gopal Dhal
- Department of Computer Science and Application, Midnapore College (Autonomous), Paschim Medinipur, Midnapore, West Bengal India
| |
Collapse
|
3
|
Zhong C, Li G, Meng Z, Li H, He W. A self-adaptive quantum equilibrium optimizer with artificial bee colony for feature selection. Comput Biol Med 2023; 153:106520. [PMID: 36608463 DOI: 10.1016/j.compbiomed.2022.106520] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 11/28/2022] [Accepted: 12/31/2022] [Indexed: 01/03/2023]
Abstract
Feature selection (FS) is a popular data pre-processing technique in machine learning to extract the optimal features to maintain or increase the classification accuracy of the dataset, which is a combinatorial optimization problem, requiring a powerful optimizer to obtain the optimum subset. The equilibrium optimizer (EO) is a recent physical-based metaheuristic algorithm with good performance for various optimization problems, but it may encounter premature or the local convergence in feature selection. This work presents a self-adaptive quantum EO with artificial bee colony for feature selection, named SQEOABC. In the proposed algorithm, the quantum theory and the self-adaptive mechanism are employed into the updating rule of EO to enhance convergence, and the updating mechanism from the artificial bee colony is also incorporated into EO to achieve appropriate FS solutions. In the experiments, 25 benchmark datasets from the UCI repository are investigated to verify SQEOABC, which is compared with several state-of-the-art metaheuristic algorithms and the variants of EO. The statistical results of fitness values and accuracy demonstrate that SQEOABC has better performance than the compared algorithms and the variants of EO. Finally, a real-world FS problem from COVID-19 illustrates the effectiveness and superiority of SQEOABC.
Collapse
Affiliation(s)
- Changting Zhong
- Department of Engineering Mechanics, State Key Laboratory of Structural Analyses for Industrial Equipment, Dalian University of Technology, Dalian, 116024, China; School of Civil Engineering and Architecture, Hainan University, Haikou 570228, China.
| | - Gang Li
- Department of Engineering Mechanics, State Key Laboratory of Structural Analyses for Industrial Equipment, Dalian University of Technology, Dalian, 116024, China; Ningbo Institute of Dalian University of Technology, Ningbo, 315000, China.
| | - Zeng Meng
- School of Civil Engineering, Hefei University of Technology, Hefei, 230009, China.
| | - Haijiang Li
- BIM for Smart Engineering Centre, Cardiff School of Engineering, Cardiff University, Queen's Buildings, Cardiff, CF24 3AA, Whales, UK.
| | - Wanxin He
- Department of Engineering Mechanics, State Key Laboratory of Structural Analyses for Industrial Equipment, Dalian University of Technology, Dalian, 116024, China.
| |
Collapse
|
4
|
Xiao P, Ma K, Ye X, Wang G, Duan Z, Huang Y, Luo Z, Hu X, Chi W, Yuan J. Classification of Vogt-Koyanagi-Harada disease using feature selection and classification based on wide-field swept-source optical coherence tomography angiography. Front Bioeng Biotechnol 2023; 11:1086347. [PMID: 37200845 PMCID: PMC10185775 DOI: 10.3389/fbioe.2023.1086347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 04/20/2023] [Indexed: 05/20/2023] Open
Abstract
Background: Vogt-Koyanagi-Harada (VKH) disease is a common and easily blinded uveitis entity, with choroid being the main involved site. Classification of VKH disease and its different stages is crucial because they differ in clinical manifestations and therapeutic interventions. Wide-field swept-source optical coherence tomography angiography (WSS-OCTA) provides the advantages of non-invasiveness, large-field-of-view, high resolution, and ease of measuring and calculating choroid, offering the potential feasibility of simplified VKH classification assessment based on WSS-OCTA. Methods: 15 healthy controls (HC), 13 acute-phase and 17 convalescent-phase VKH patients were included, undertaken WSS-OCTA examination with a scanning field of 15 × 9 mm2. 20 WSS-OCTA parameters were then extracted from WSS-OCTA images. To classify HC and VKH patients in acute and convalescent phases, two 2-class VKH datasets (HC and VKH) and two 3-class VKH datasets (HC, acute-phase VKH, and convalescent-phase VKH) were established by the WSS-OCTA parameters alone or in combination with best-corrected visual acuity (logMAR BCVA) and intraocular pressure (IOP), respectively. A new feature selection and classification method that combines an equilibrium optimizer and a support vector machine (called SVM-EO) was adopted to select classification-sensitive parameters among the massive datasets and to achieve outstanding classification performance. The interpretability of the VKH classification models was demonstrated based on SHapley Additive exPlanations (SHAP). Results: Based on pure WSS-OCTA parameters, we achieved classification accuracies of 91.61% ± 12.17% and 86.69% ± 8.30% for 2- and 3-class VKH classification tasks. By combining the WSS-OCTA parameters and logMAR BCVA, we achieved better classification performance of 98.82% ± 2.63% and 96.16% ± 5.88%, respectively. Through SHAP analysis, we found that logMAR BCVA and vascular perfusion density (VPD) calculated from the whole field of view region in the choriocapillaris (whole FOV CC-VPD) were the most important features for VKH classification in our models. Conclusion: We achieved excellent VKH classification performance based on a non-invasive WSS-OCTA examination, which provides the possibility for future clinical VKH classification with high sensitivity and specificity.
Collapse
Affiliation(s)
- Peng Xiao
- *Correspondence: Peng Xiao, ; Jin Yuan,
| | | | | | | | | | | | | | | | | | - Jin Yuan
- *Correspondence: Peng Xiao, ; Jin Yuan,
| |
Collapse
|
5
|
Houssein EH, Hassan MH, Mahdy MA, Kamel S. Development and application of equilibrium optimizer for optimal power flow calculation of power system. APPL INTELL 2023; 53:7232-7253. [PMID: 35875200 PMCID: PMC9289660 DOI: 10.1007/s10489-022-03796-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/22/2022] [Indexed: 01/13/2023]
Abstract
This paper proposes an enhanced version of Equilibrium Optimizer (EO) called (EEO) for solving global optimization and the optimal power flow (OPF) problems. The proposed EEO algorithm includes a new performance reinforcement strategy with the Lévy Flight mechanism. The algorithm addresses the shortcomings of the original Equilibrium Optimizer (EO) and aims to provide better solutions (than those provided by EO) to global optimization problems, especially OPF problems. The proposed EEO efficiency was confirmed by comparing its results on the ten functions of the CEC'20 test suite, to those of other algorithms, including high-performance algorithms, i.e., CMA-ES, IMODE, AGSK and LSHADE_cnEpSin. Moreover, the statistical significance of these results was validated by the Wilcoxon's rank-sum test. After that, the proposed EEO was applied to solve the the OPF problem. The OPF is formulated as a nonlinear optimization problem with conflicting objectives and subjected to both equality and inequality constraints. The performance of this technique is deliberated and evaluated on the standard IEEE 30-bus test system for different objectives. The obtained results of the proposed EEO algorithm is compared to the original EO algorithm and those obtained using other techniques mentioned in the literature. These Simulation results revealed that the proposed algorithm provides better optimized solutions than 20 published methods and results as well as the original EO algorithm. The EEO superiority was demonstrated through six different cases, that involved the minimization of different objectives: fuel cost, fuel cost with valve-point loading effect, emission, total active power losses, voltage deviation, and voltage instability. Also, the comparison results indicate that EEO algorithm can provide a robust, high-quality feasible solutions for different OPF problems.
Collapse
Affiliation(s)
- Essam H. Houssein
- Faculty of Computers and Information, Minia University, 61519 Minia, Egypt
| | | | - Mohamed A. Mahdy
- Faculty of Computers and Artificial Intelligence, Beni-Suef University, Beni-Suef, Egypt
| | - Salah Kamel
- Electrical Engineering Department, Faculty of Engineering, Aswan University, 81542 Aswan, Egypt
| |
Collapse
|
6
|
Chamlal H, Ouaderhman T, Aaboub F. A graph based preordonnances theoretic supervised feature selection in high dimensional data. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
7
|
Chamlal H, Ouaderhman T, Rebbah FE. A hybrid feature selection approach for Microarray datasets using graph theoretic-based method. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.10.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
8
|
Akinola OA, Ezugwu AE, Oyelade ON, Agushaka JO. A hybrid binary dwarf mongoose optimization algorithm with simulated annealing for feature selection on high dimensional multi-class datasets. Sci Rep 2022; 12:14945. [PMID: 36056062 PMCID: PMC9440036 DOI: 10.1038/s41598-022-18993-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Accepted: 08/23/2022] [Indexed: 11/08/2022] Open
Abstract
The dwarf mongoose optimization (DMO) algorithm developed in 2022 was applied to solve continuous mechanical engineering design problems with a considerable balance of the exploration and exploitation phases as a metaheuristic approach. Still, the DMO is restricted in its exploitation phase, somewhat hindering the algorithm's optimal performance. In this paper, we proposed a new hybrid method called the BDMSAO, which combines the binary variants of the DMO (or BDMO) and simulated annealing (SA) algorithm. In the modelling and implementation of the hybrid BDMSAO algorithm, the BDMO is employed and used as the global search method and the simulated annealing (SA) as the local search component to enhance the limited exploitative mechanism of the BDMO. The new hybrid algorithm was evaluated using eighteen (18) UCI machine learning datasets of low and medium dimensions. The BDMSAO was also tested using three high-dimensional medical datasets to assess its robustness. The results showed the efficacy of the BDMSAO in solving challenging feature selection problems on varying datasets dimensions and its outperformance over ten other methods in the study. Specifically, the BDMSAO achieved an overall result of 61.11% in producing the highest classification accuracy possible and getting 100% accuracy on 9 of 18 datasets. It also yielded the maximum accuracy obtainable on the three high-dimensional datasets utilized while achieving competitive performance regarding the number of features selected.
Collapse
Affiliation(s)
- Olatunji A Akinola
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa
| | - Absalom E Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa.
| | - Olaide N Oyelade
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa
| | - Jeffrey O Agushaka
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa
| |
Collapse
|
9
|
Akinola OO, Ezugwu AE, Agushaka JO, Zitar RA, Abualigah L. Multiclass feature selection with metaheuristic optimization algorithms: a review. Neural Comput Appl 2022; 34:19751-19790. [PMID: 36060097 PMCID: PMC9424068 DOI: 10.1007/s00521-022-07705-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 08/02/2022] [Indexed: 11/24/2022]
Abstract
Selecting relevant feature subsets is vital in machine learning, and multiclass feature selection is harder to perform since most classifications are binary. The feature selection problem aims at reducing the feature set dimension while maintaining the performance model accuracy. Datasets can be classified using various methods. Nevertheless, metaheuristic algorithms attract substantial attention to solving different problems in optimization. For this reason, this paper presents a systematic survey of literature for solving multiclass feature selection problems utilizing metaheuristic algorithms that can assist classifiers selects optima or near optima features faster and more accurately. Metaheuristic algorithms have also been presented in four primary behavior-based categories, i.e., evolutionary-based, swarm-intelligence-based, physics-based, and human-based, even though some literature works presented more categorization. Further, lists of metaheuristic algorithms were introduced in the categories mentioned. In finding the solution to issues related to multiclass feature selection, only articles on metaheuristic algorithms used for multiclass feature selection problems from the year 2000 to 2022 were reviewed about their different categories and detailed descriptions. We considered some application areas for some of the metaheuristic algorithms applied for multiclass feature selection with their variations. Popular multiclass classifiers for feature selection were also examined. Moreover, we also presented the challenges of metaheuristic algorithms for feature selection, and we identified gaps for further research studies.
Collapse
Affiliation(s)
- Olatunji O. Akinola
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Absalom E. Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Jeffrey O. Agushaka
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201 KwaZulu-Natal South Africa
| | - Raed Abu Zitar
- Sorbonne Center of Artificial Intelligence, Sorbonne University-Abu Dhabi, 38044 Abu Dhabi, United Arab Emirates
| | - Laith Abualigah
- Hourani Center for Applied Scientific Research, Al-Ahliyya Amman University, Amman, 19328 Jordan
- Faculty of Inforsmation Technology, Middle East University, Amman, 11831 Jordan
| |
Collapse
|
10
|
EBBA: An Enhanced Binary Bat Algorithm Integrated with Chaos Theory and Lévy Flight for Feature Selection. FUTURE INTERNET 2022. [DOI: 10.3390/fi14060178] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Feature selection can efficiently improve classification accuracy and reduce the dimension of datasets. However, feature selection is a challenging and complex task that requires a high-performance optimization algorithm. In this paper, we propose an enhanced binary bat algorithm (EBBA) which is originated from the conventional binary bat algorithm (BBA) as the learning algorithm in a wrapper-based feature selection model. First, we model the feature selection problem and then transfer it as a fitness function. Then, we propose an EBBA for solving the feature selection problem. In EBBA, we introduce the Lévy flight-based global search method, population diversity boosting method and chaos-based loudness method to improve the BA and make it more applicable to feature selection problems. Finally, the simulations are conducted to evaluate the proposed EBBA and the simulation results demonstrate that the proposed EBBA outmatches other comparison benchmarks. Moreover, we also illustrate the effectiveness of the proposed improved factors by tests.
Collapse
|