1
|
A multiple criteria ensemble pruning method for binary classification based on D-S theory of evidence. INT J MACH LEARN CYB 2023. [DOI: 10.1007/s13042-022-01690-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
|
2
|
You GR, Shiue YR, Su CT, Huang QL. Enhancing ensemble diversity based on multiscale dilated convolution in image classification. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.064] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
3
|
Aguilar E, Nagarajan B, Radeva P. Uncertainty-aware selecting for an ensemble of deep food recognition models. Comput Biol Med 2022; 146:105645. [PMID: 35751183 DOI: 10.1016/j.compbiomed.2022.105645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 05/04/2022] [Accepted: 05/14/2022] [Indexed: 11/26/2022]
Abstract
Deep learning is a machine learning technique that has revolutionized the research community due to its impressive results on various real-life problems. Recently, ensembles of Convolutional Neural Networks (CNN) have proven to achieve high robustness and accuracy in numerous computer vision challenges. As expected, the more models we add to the ensemble, the better performance we can obtain, but, in contrast, more computer resources are needed. Hence, the importance of deciding how many models to use and which models to select from a pool of trained models is huge. From the latter, a common strategy in deep learning is to select the models randomly or according to the results on the validation set. However, in this way models are chosen based on individual performance ignoring how they are expected to work together. Alternatively, to ensure a better complement between models, an exhaustive search can be used by evaluating the performance of several ensemble models based on different numbers and combinations of trained models. Nevertheless, this may result in being high computationally expensive. Considering that epistemic uncertainty analysis has recently been successfully employed to understand model learning, we aim to analyze whether an uncertainty-aware epistemic method can help us decide which groups of CNN models may work best. The method was validated on several food datasets and with different CNN architectures. In most cases, our proposal outperforms the results by a statistically significant range with respect to the baseline techniques and is much less computationally expensive compared to the brute-force search.
Collapse
Affiliation(s)
- Eduardo Aguilar
- Department of Computing and Systems Engineering, Catholic University of the North, Avenida Angamos 0610, Antofagasta, 1270709, Antofagasta, Chile.
| | - Bhalaji Nagarajan
- Department of Mathematics and Computer Science, University of Barcelona, Gran Via de les Corts Catalanes 585, Barcelona, 08007, Barcelona, Spain
| | - Petia Radeva
- Department of Mathematics and Computer Science, University of Barcelona, Gran Via de les Corts Catalanes 585, Barcelona, 08007, Barcelona, Spain
| |
Collapse
|
4
|
Zhou X, Liu H, Pourpanah F, Zeng T, Wang X. A survey on epistemic (model) uncertainty in supervised learning: Recent advances and applications. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.10.119] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
5
|
Qasem A, Sheikh Abdullah SNH, Sahran S, Albashish D, Goudarzi S, Arasaratnam S. An improved ensemble pruning for mammogram classification using modified Bees algorithm. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-06995-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
6
|
Shiue YR, You GR, Su CT, Chen H. Balancing accuracy and diversity in ensemble learning using a two-phase artificial bee colony approach. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107212] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
7
|
Development of ensemble learning classification with density peak decomposition-based evolutionary multi-objective optimization. INT J MACH LEARN CYB 2021. [DOI: 10.1007/s13042-020-01271-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
8
|
Bania RK, Halder A. R-HEFS: Rough set based heterogeneous ensemble feature selection method for medical data classification. Artif Intell Med 2021; 114:102049. [PMID: 33875164 DOI: 10.1016/j.artmed.2021.102049] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 02/11/2021] [Accepted: 02/21/2021] [Indexed: 11/28/2022]
Abstract
Feature selection is one of the trustworthy processes of dimensionality reduction technique to select a subset of relevant and non-redundant features from large datasets. Ensemble feature selection (EFS) approach is a recent technique aiming at accumulating diversity in the subset of selected features. It improves the performance of learning algorithms and obtains more stable and robust results. In this paper, a novel rough set theory (RST) based heterogeneous EFS method (R-HEFS) is proposed for selecting the less redundant and highly relevant features during the aggregation of diverse feature subsets by applying the feature-class, feature-feature rough dependency and feature-significance measures. In R-HEFS five state-of-the-art RST based filter methods are used as a base feature selectors. Experiments are carried out on 10 benchmark medical datasets collected from the UCI repository. For the imputation of the missing values and discretization of the continuous features, k nearest neighbor (kNN) imputation method and RST based discretization techniques are applied. The effectiveness of the proposed R-HEFS method is evaluated and analyzed by using four benchmark classifiers viz., Naïve Bayes (NB), random forest (RF), support vector machine (SVM), and AdaBoost. The proposed R-HEFS method turns out to be effective by removing the non-relevant and redundant features during the process of aggregation of base feature selectors and it assists to increase the classification accuracy. Out of 10 different medical datasets, on 7 datasets, R-HEFS has achieved better average classification accuracy. So, the overall results strongly suggest that the proposed R-HEFS method can reduce the dimension of large medical datasets and may help the physicians or medical experts to diagnose (classify) different diseases with lesser computational complexities.
Collapse
Affiliation(s)
- Rubul Kumar Bania
- Department of Computer Application, North-Eastern Hill University, Tura Campus, Tura 794002, Meghalaya, India.
| | - Anindya Halder
- Department of Computer Application, North-Eastern Hill University, Tura Campus, Tura 794002, Meghalaya, India.
| |
Collapse
|
9
|
|
10
|
Gao J, Liu K, Wang B, Wang D, Zhang X. Improving deep forest by ensemble pruning based on feature vectorization and quantum walks. Soft comput 2021. [DOI: 10.1007/s00500-020-05274-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
11
|
|
12
|
Ni Z, Xia P, Zhu X, Ding Y, Ni L. A novel ensemble pruning approach based on information exchange glowworm swarm optimization and complementarity measure. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2020. [DOI: 10.3233/jifs-189149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Ensemble pruning has been widely used for enhancing classification ability employing a smaller number of classifiers. Ensemble pruning extracts a part of classifiers with good overall performance to form the final ensemble. Diversity and accuracy of classifiers are of vital importance for a successful ensemble. It is hard for the members in one ensemble to achieve both good diversity and high accuracy, simultaneously, because there is a tradeoff between them. Existing works usually search for the tradeoff in terms of diversity measures, or find it utilizing heuristic algorithms, which cannot gain the exact solution without exhaustive search. To address the above issue, a novel ensemble pruning method based on information exchange glowworm swarm optimization and complementarity measure, abbreviated EPIECM, is proposed using the combination of information exchange glowworm swarm optimization (IEGSO) and complementarity measure (COM). Firstly, multiple generated classifiers are utilized to construct a pool of learners who perform diversely. Secondly, COM is employed to pre-prune the classifiers with poor comprehensive performance, and the pre-pruned ensemble is formed utilizing the retaining classifiers. Finally, the optimal subset of classifiers is combined from the remaining constituents after pre-pruning with IEGSO. Empirical results on 27 UCI datasets indicate that EPIECM significantly outperforms other techniques.
Collapse
Affiliation(s)
- Zhiwei Ni
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Pingfan Xia
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Xuhui Zhu
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Yufei Ding
- School of Computer Science, University of California Santa Barbara, California, USA
| | - Liping Ni
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| |
Collapse
|
13
|
Fletcher S, Verma B, Zhang M. A non-specialized ensemble classifier using multi-objective optimization. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.05.029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
|
14
|
Ensemble pruning of ELM via migratory binary glowworm swarm optimization and margin distance minimization. Neural Process Lett 2020. [DOI: 10.1007/s11063-020-10336-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
15
|
Silva RA, Britto Jr ADS, Enembreck F, Sabourin R, Oliveira LES. CSBF: A static ensemble fusion method based on the centrality score of complex networks. Comput Intell 2020. [DOI: 10.1111/coin.12249] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Ronan Assumpção Silva
- Postgraduate Program in Informatics (PPGIA)Pontifical Catholic University of Parana (PUCPR) Parana Brazil
- Department of InformaticsFederal Institute of Parana (IFPR) Parana Brazil
| | - Alceu de Souza Britto Jr
- Postgraduate Program in Informatics (PPGIA)Pontifical Catholic University of Parana (PUCPR) Parana Brazil
- Department of InformaticsState University of Ponta Grossa (UEPG) Parana Brazil
| | - Fabricio Enembreck
- Postgraduate Program in Informatics (PPGIA)Pontifical Catholic University of Parana (PUCPR) Parana Brazil
| | - Robert Sabourin
- Laboratoire d'Imagerie, de Vision et d'Intelligence ArtificielleÉcole de Technologie Supérieure (ÉTS) Montreal Canada
| | | |
Collapse
|
16
|
Lu B, Fu L, Nie B, Peng Z, Liu H. A Novel Framework with High Diagnostic Sensitivity for Lung Cancer Detection by Electronic Nose. SENSORS 2019; 19:s19235333. [PMID: 31817006 PMCID: PMC6928832 DOI: 10.3390/s19235333] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 11/27/2019] [Accepted: 11/29/2019] [Indexed: 12/11/2022]
Abstract
The electronic nose (e-nose) system is a newly developing detection technology for its advantages of non-invasiveness, simple operation, and low cost. However, lung cancer screening through e-nose requires effective pattern recognition frameworks. Existing frameworks rely heavily on hand-crafted features and have relatively low diagnostic sensitivity. To handle these problems, gated recurrent unit based autoencoder (GRU-AE) is adopted to automatically extract features from temporal and high-dimensional e-nose data. Moreover, we propose a novel margin and sensitivity based ordering ensemble pruning (MSEP) model for effective classification. The proposed heuristic model aims to reduce missed diagnosis rate of lung cancer patients while maintaining a high rate of overall identification. In the experiments, five state-of-the-art classification models and two popular dimensionality reduction methods were involved for comparison to demonstrate the validity of the proposed GRU-AE-MSEP framework, through 214 collected breath samples measured by e-nose. Experimental results indicated that the proposed intelligent framework achieved high sensitivity of 94.22%, accuracy of 93.55%, and specificity of 92.80%, thereby providing a new practical means for wide disease screening by e-nose in medical scenarios.
Collapse
Affiliation(s)
- Binchun Lu
- Chongqing University-University of Cincinnati Joint Co-op Institute, Chongqing University, Chongqing 400030, China; (B.L.); (L.F.)
| | - Lidan Fu
- Chongqing University-University of Cincinnati Joint Co-op Institute, Chongqing University, Chongqing 400030, China; (B.L.); (L.F.)
| | - Bo Nie
- Key Laboratory of Biotechnology Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400030, China;
| | - Zhiyun Peng
- State Key Laboratory of Power Transmission Equipment & System Security and New Technology, Chongqing University, Chongqing 400030, China;
| | - Hongying Liu
- Key Laboratory of Biotechnology Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400030, China;
- Correspondence:
| |
Collapse
|
17
|
Feng W, Dauphin G, Huang W, Quan Y, Liao W. New margin-based subsampling iterative technique in modified random forests for classification. Knowl Based Syst 2019. [DOI: 10.1016/j.knosys.2019.07.016] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
|
19
|
Mikhail JW, Fossaceca JM, Iammartino R. A Semi-Boosted Nested Model With Sensitivity-Based Weighted Binarization for Multi-Domain Network Intrusion Detection. ACM T INTEL SYST TEC 2019. [DOI: 10.1145/3313778] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Effective network intrusion detection techniques are required to thwart evolving cybersecurity threats. Historically, traditional enterprise networks have been researched extensively in this regard. However, the cyber threat landscape has grown to include wireless networks. In this article, the authors present a novel model that can be trained on completely different feature sets and applied to two distinct intrusion detection applications: traditional enterprise networks and 802.11 wireless networks. This is the first method that demonstrates superior performance in both aforementioned applications. The model is based on a one-versus-all binary framework comprising multiple nested sub-ensembles. To provide good generalization ability, each sub-ensemble contains a collection of sub-learners, and only a portion of the sub-learners implement boosting. A class weight based on the sensitivity metric (true-positive rate), learned from the training data only, is assigned to the sub-ensembles of each class. The use of pruning to remove sub-learners that do not contribute to or have an adverse effect on overall system performance is investigated as well. The results demonstrate that the proposed system can achieve exceptional performance in applications to both traditional enterprise intrusion detection and 802.11 wireless intrusion detection.
Collapse
|
20
|
Zhu X, Ni Z, Ni L, Jin F, Cheng M, Li J. Spread binary artificial fish swarm algorithm combined with double-fault measure for ensemble pruning. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-169993] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Xuhui Zhu
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Zhiwei Ni
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Liping Ni
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Feifei Jin
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | | | - Jingming Li
- School of Management Science and Engineering, Anhui University of Finance and Economics, Bengbu, China
| |
Collapse
|
21
|
An ensemble approach for supporting the respiratory isolation of presumed tuberculosis inpatients. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.11.074] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
22
|
Stacking ensemble with parsimonious base models to improve generalization capability in the characterization of steel bolted components. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2018.06.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
23
|
Zhu X, Ni Z, Zhang G, Jin F, Cheng M, Li J. Combining weak-link co-evolution binary artificial fish swarm algorithm and complementarity measure for ensemble pruning. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-169685] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Xuhui Zhu
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Zhiwei Ni
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Gongrang Zhang
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | - Feifei Jin
- School of Management, Hefei University of Technology, Hefei, China
- Key Laboratory of Process Optimization and Intelligent Decision-making, Ministry of Education, Hefei, China
| | | | - Jingming Li
- School of Management Science and Engineering, Anhui University of Finance and Economics, Bengbu, China
| |
Collapse
|
24
|
Landslide Susceptibility Assessment at Mila Basin (Algeria): A Comparative Assessment of Prediction Capability of Advanced Machine Learning Methods. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION 2018. [DOI: 10.3390/ijgi7070268] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
25
|
Li D, Wen G, Hou Z, Huan E, Hu Y, Li H. RTCRelief-F: an effective clustering and ordering-based ensemble pruning algorithm for facial expression recognition. Knowl Inf Syst 2018. [DOI: 10.1007/s10115-018-1176-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
26
|
Golzadeh M, Hadavandi E, Chelgani SC. A new Ensemble based multi-agent system for prediction problems: Case study of modeling coal free swelling index. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2017.12.013] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
27
|
|