1
|
Liu J, Wei J, Heidari AA, Kuang F, Zhang S, Gui W, Chen H, Pan Z. Chaotic simulated annealing multi-verse optimization enhanced kernel extreme learning machine for medical diagnosis. Comput Biol Med 2022; 144:105356. [PMID: 35299042 DOI: 10.1016/j.compbiomed.2022.105356] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 02/25/2022] [Accepted: 02/25/2022] [Indexed: 01/09/2023]
Abstract
Classification models such as Multi-Verse Optimization (MVO) play a vital role in disease diagnosis. To improve the efficiency and accuracy of MVO, in this paper, the defects of MVO are mitigated and the improved MVO is combined with kernel extreme learning machine (KELM) for effective disease diagnosis. Although MVO obtains some relatively good results on some problems of interest, it suffers from slow convergence speed and local optima entrapment for some many-sided basins, especially multi-modal problems with high dimensions. To solve these shortcomings, in this study, a new chaotic simulated annealing overhaul of MVO (CSAMVO) is proposed. Based on MVO, two approaches are adopted to offer a relatively stable and efficient convergence speed. Specifically, a chaotic intensification mechanism (CIP) is applied to the optimal universe evaluation stage to increase the depth of the universe search. After obtaining relatively satisfactory results, the simulated annealing algorithm (SA) is employed to reinforce the capability of MVO to avoid local optima. To evaluate its performance, the proposed CSAMVO approach was compared with a wide range of classical algorithms on thirty-nine benchmark functions. The results show that the improved MVO outperforms the other algorithms in terms of solution quality and convergence speed. Furthermore, based on CSAMVO, a hybrid KELM model termed CSAMVO-KELM is established for disease diagnosis. To evaluate its effectiveness, the new hybrid system was compared with a multitude of competitive classifiers on two disease diagnosis problems. The results demonstrate that the proposed CSAMVO-assisted classifier can find solutions with better learning potential and higher predictive performance.
Collapse
Affiliation(s)
- Jiacong Liu
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Jiahui Wei
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Fangjun Kuang
- School of Information Engineering, Wenzhou Business College, Wenzhou, 325035, China.
| | - Siyang Zhang
- School of Information Engineering, Wenzhou Business College, Wenzhou, 325035, China.
| | - Wenyong Gui
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Zhifang Pan
- The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, China.
| |
Collapse
|
2
|
Soltanpour S, Wu QMJ. Weighted Extreme Sparse Classifier and Local Derivative Pattern for 3D Face Recognition. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 28:3020-3033. [PMID: 30668471 DOI: 10.1109/tip.2019.2893524] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
A novel weighted hybrid classifier and a high-order, local normal derivative pattern descriptor is proposed for 3D face recognition. The Local derivative pattern (LDP) captures detailed information, based on the local derivative variation in different directions. The LDP is computed on three normal maps in x, y, and z directions and on different scales. The surface normal captures the orientation of a surface at each point of 3D data. More informative local shape information is extracted using the surface normal, as compared to depth. The nth-order LDP on the surface normal is proposed to encode more detailed features from the (n-1)th-order's local derivative direction variations. An extreme learning machine (ELM) based autoencoder, using a multilayer network structure, is employed to select more discriminant features and provide a faster training speed. A weighted hybrid framework is proposed to handle facial challenges using a combination of the ELM and the sparse representation classifier (SRC). The advantage of speed for the ELM and accuracy for the SRC in a weighted scheme is used to enhance the performance of the recognition system. Experimental results regarding four famous 3D face databases illustrate the generalization and effectiveness of the proposed method in terms of both computational cost and recognition accuracy.
Collapse
|
3
|
|
4
|
Gong T. Improved immune computation for high-precision face recognition. Soft comput 2016. [DOI: 10.1007/s00500-016-2463-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
5
|
|
6
|
Liu Y, Xie H, Wang L, Tan K. Hyperspectral band selection based on a variable precision neighborhood rough set. APPLIED OPTICS 2016; 55:462-472. [PMID: 26835918 DOI: 10.1364/ao.55.000462] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Band selection is a well-known approach for reducing dimensionality in hyperspectral images. We propose a band-selection method based on the variable precision neighborhood rough set theory to select informative bands from hyperspectral images. A decision-making information system was established by hyperspectral data derived from soybean samples between 400 and 1000 nm wavelengths. The dependency was used to evaluate band significance. The optimal band subset was selected by a forward greedy search algorithm. After adjusting appropriate threshold values, stable optimized results were obtained. To assess the effectiveness of the proposed band-selection technique, two classification models were constructed. The experimental results showed that admitting inclusion errors could improve classification performance, including band selection and generalization ability.
Collapse
|
7
|
|
8
|
|
9
|
Chyzhyk D, Savio A, Graña M. Computer aided diagnosis of schizophrenia on resting state fMRI data by ensembles of ELM. Neural Netw 2015; 68:23-33. [PMID: 25965771 DOI: 10.1016/j.neunet.2015.04.002] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Revised: 02/24/2015] [Accepted: 04/08/2015] [Indexed: 10/23/2022]
Abstract
Resting state functional Magnetic Resonance Imaging (rs-fMRI) is increasingly used for the identification of image biomarkers of brain diseases or psychiatric conditions such as schizophrenia. This paper deals with the application of ensembles of Extreme Learning Machines (ELM) to build Computer Aided Diagnosis systems on the basis of features extracted from the activity measures computed over rs-fMRI data. The power of ELM to provide quick but near optimal solutions to the training of Single Layer Feedforward Networks (SLFN) allows extensive exploration of discriminative power of feature spaces in affordable time with off-the-shelf computational resources. Exploration is performed in this paper by an evolutionary search approach that has found functional activity map features allowing to achieve quite successful classification experiments, providing biologically plausible voxel-site localizations.
Collapse
Affiliation(s)
- Darya Chyzhyk
- Computational Intelligence Group (GIC), University of the Basque Country, UPV/EHU, Spain; CISE Department, University of Florida, Gainesville, USA.
| | - Alexandre Savio
- Computational Intelligence Group (GIC), University of the Basque Country, UPV/EHU, Spain; ENGINE Centre, Wrocław University of Technology, Wybrzeże Wyspiańskiego 27, 50-370, Wrocław, Poland
| | - Manuel Graña
- Computational Intelligence Group (GIC), University of the Basque Country, UPV/EHU, Spain; ENGINE Centre, Wrocław University of Technology, Wybrzeże Wyspiańskiego 27, 50-370, Wrocław, Poland
| |
Collapse
|
10
|
Ayerdi B, Marqués I, Graña M. Spatially regularized semisupervised Ensembles of Extreme Learning Machines for hyperspectral image segmentation. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.01.068] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
11
|
Lin S, Liu X, Fang J, Xu Z. Is extreme learning machine feasible? A theoretical assessment (part II). IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:21-34. [PMID: 25069128 DOI: 10.1109/tnnls.2014.2336665] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
An extreme learning machine (ELM) can be regarded as a two-stage feed-forward neural network (FNN) learning system that randomly assigns the connections with and within hidden neurons in the first stage and tunes the connections with output neurons in the second stage. Therefore, ELM training is essentially a linear learning problem, which significantly reduces the computational burden. Numerous applications show that such a computation burden reduction does not degrade the generalization capability. It has, however, been open that whether this is true in theory. The aim of this paper is to study the theoretical feasibility of ELM by analyzing the pros and cons of ELM. In the previous part of this topic, we pointed out that via appropriately selected activation functions, ELM does not degrade the generalization capability in the sense of expectation. In this paper, we launch the study in a different direction and show that the randomness of ELM also leads to certain negative consequences. On one hand, we find that the randomness causes an additional uncertainty problem of ELM, both in approximation and learning. On the other hand, we theoretically justify that there also exist activation functions such that the corresponding ELM degrades the generalization capability. In particular, we prove that the generalization capability of ELM with Gaussian kernel is essentially worse than that of FNN with Gaussian kernel. To facilitate the use of ELM, we also provide a remedy to such a degradation. We find that the well-developed coefficient regularization technique can essentially improve the generalization capability. The obtained results reveal the essential characteristic of ELM in a certain sense and give theoretical guidance concerning how to use ELM.
Collapse
|
12
|
Liu X, Lin S, Fang J, Xu Z. Is extreme learning machine feasible? A theoretical assessment (part I). IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015; 26:7-20. [PMID: 25069126 DOI: 10.1109/tnnls.2014.2335212] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
An extreme learning machine (ELM) is a feedforward neural network (FNN) like learning system whose connections with output neurons are adjustable, while the connections with and within hidden neurons are randomly fixed. Numerous applications have demonstrated the feasibility and high efficiency of ELM-like systems. It has, however, been open if this is true for any general applications. In this two-part paper, we conduct a comprehensive feasibility analysis of ELM. In Part I, we provide an answer to the question by theoretically justifying the following: 1) for some suitable activation functions, such as polynomials, Nadaraya-Watson and sigmoid functions, the ELM-like systems can attain the theoretical generalization bound of the FNNs with all connections adjusted, i.e., they do not degrade the generalization capability of the FNNs even when the connections with and within hidden neurons are randomly fixed; 2) the number of hidden neurons needed for an ELM-like system to achieve the theoretical bound can be estimated; and 3) whenever the activation function is taken as polynomial, the deduced hidden layer output matrix is of full column-rank, therefore the generalized inverse technique can be efficiently applied to yield the solution of an ELM-like system, and, furthermore, for the nonpolynomial case, the Tikhonov regularization can be applied to guarantee the weak regularity while not sacrificing the generalization capability. In Part II, however, we reveal a different aspect of the feasibility of ELM: there also exists some activation functions, which makes the corresponding ELM degrade the generalization capability. The obtained results underlie the feasibility and efficiency of ELM-like systems, and yield various generalizations and improvements of the systems as well.
Collapse
|
13
|
Hybrid extreme rotation forest. Neural Netw 2014; 52:33-42. [DOI: 10.1016/j.neunet.2014.01.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2013] [Revised: 12/16/2013] [Accepted: 01/03/2014] [Indexed: 11/21/2022]
|
14
|
|
15
|
Zhang S, He B, Nian R, Wang J, Han B, Lendasse A, Yuan G. Fast Image Recognition Based on Independent Component Analysis and Extreme Learning Machine. Cognit Comput 2014. [DOI: 10.1007/s12559-014-9245-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
16
|
Marqués I, Graña M. Fusion of lattice independent and linear features improving face identification. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.06.045] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
17
|
Karczmarek P, Pedrycz W, Reformat M, Akhoundi E. A study in facial regions saliency: a fuzzy measure approach. Soft comput 2013. [DOI: 10.1007/s00500-013-1064-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Extreme Learning Machines for Feature Selection and Classification of Cocaine Dependent Patients on Structural MRI Data. Neural Process Lett 2013. [DOI: 10.1007/s11063-013-9277-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|