1
|
Fu X, Chen Y, Yan J, Chen Y, Xu F. BGRF: A broad granular random forest algorithm. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-223960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
The random forest is a combined classification method belonging to ensemble learning. The random forest is also an important machine learning algorithm. The random forest is universally applicable to most data sets. However, the random forest is difficult to deal with uncertain data, resulting in poor classification results. To overcome these shortcomings, a broad granular random forest algorithm is proposed by studying the theory of granular computing and the idea of breadth. First, we granulate the breadth of the relationship between the features of the data sets samples and then form a broad granular vector. In addition, the operation rules of the granular vector are defined, and the granular decision tree model is proposed. Finally, the multiple granular decision tree voting method is adopted to obtain the result of the granular random forest. Some experiments are carried out on several UCI data sets, and the results show that the classification performance of the broad granular random forest algorithm is better than that of the traditional random forest algorithm.
Collapse
Affiliation(s)
- Xingyu Fu
- School of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Yingyue Chen
- School of Economics and Management, Xiamen University of Technology, Xiamen, China
| | - Jingru Yan
- School of Economics and Management, Xiamen University of Technology, Xiamen, China
| | - Yumin Chen
- School of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Feng Xu
- Beijing Srit Software Technology Co., Ltd., Beijing, China
| |
Collapse
|
2
|
Qu K, Xu J, Han Z, Xu S. Maximum relevance minimum redundancy-based feature selection using rough mutual information in adaptive neighborhood rough sets. APPL INTELL 2023. [DOI: 10.1007/s10489-022-04398-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
3
|
Lin S, Zhang K, Guan D, He L, Chen Y. An intrusion detection method based on granular autoencoders. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-223649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Intrusion detection systems have become one of the important tools for network security due to the frequent attacks brought about by the explosive growth of network traffic. Autoencoder is an unsupervised learning model with a neural network structure. It has a powerful feature learning capability and is effective in intrusion detection. However, its network construction suffers from overfitting and gradient disappearance problems. Traditional granular computing methods have advantages in solving such problems, but the process is relatively complex, the granularity dimension is high, and the computational cost is large, which is not suitable for application in intrusion detection systems. To address these problems, we propose a novel autoencoder: Granular AutoEncoders (GAE). The granulation reference set is constructed by random sampling. The granulation of training samples is based on single-feature similarity in a reference set to form granules. The granulation of multiple features results in granular vectors. Some operations of granules are defined. Furthermore, we propose some granular measures, including granular norms and granular loss functions. The GAE is further applied to the field of intrusion detection by designing an anomaly detection algorithm based on the GAE. The algorithm determines whether the network flows are anomalous by comparing the difference between an input granular vector and its output granular vector that is reconstructed by the GAE. Finally, some experiments are conducted using an intrusion detection dataset, comparing multiple metrics in terms of precision, recall, and F1-Score. The experimental results validate the correctness and effectiveness of the intrusion detection method based on GAE. And contrast experiments show that the proposed method has stronger ability for detecting anomalies than the correlation algorithms.
Collapse
Affiliation(s)
- Sihong Lin
- Xiamen Kuaikuai Network Technology Co., Ltd., Xiamen, China
| | - Kunbin Zhang
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Dun Guan
- Xiamen Kuaikuai Network Technology Co., Ltd., Xiamen, China
| | - Linjie He
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Yumin Chen
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| |
Collapse
|
4
|
He L, Chen Y, Wu K. Fuzzy granular deep convolutional network with residual structures. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
5
|
Chen Y, Zhang X, Zhuang Y, Yao B, Lin B. Granular neural networks with a reference frame. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
6
|
Zhang D, Zhu P. Variable radius neighborhood rough sets and attribute reduction. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.08.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
7
|
Attribute reduction algorithm of neighborhood rough set based on supervised granulation and its application. Soft comput 2022. [DOI: 10.1007/s00500-022-07454-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
8
|
Statistical-mean double-quantitative K-nearest neighbor classification learning based on neighborhood distance measurement. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109018] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
9
|
Xia D, Wang G, Yang J, Zhang Q, Li S. Local knowledge distance for rough approximation measure in multi-granularity spaces. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
10
|
Jiang H, Chen Y, Kong L, Cai G, Jiang H. An LVQ clustering algorithm based on neighborhood granules. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-220092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Learning Vector Quantization (LVQ) is a clustering method with supervised information, simple structures, and powerful functions. LVQ assumes that the data samples are labeled, and the learning process uses labels to assist clustering. However, the LVQ is sensitive to initial values, resulting in a poor clustering effect. To overcome these shortcomings, a granular LVQ clustering algorithm is proposed by adopting the neighborhood granulation technology and the LVQ. Firstly, the neighborhood granulation is carried out on some features of a sample of the data set, then a neighborhood granular vector is formed. Furthermore, the size and operations of neighborhood granular vectors are defined, and the relative and absolute granular distances between granular vectors are proposed. Finally, these granular distances are proved to be metrics, and a granular LVQ clustering algorithm is designed. Some experiments are tested on several UCI data sets, and the results show that the granular LVQ clustering is better than the traditional LVQ clustering under suitable neighborhood parameters and distance measurement.
Collapse
Affiliation(s)
- Hailiang Jiang
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Yumin Chen
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Liru Kong
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Guoqiang Cai
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China
| | - Hongbo Jiang
- College of Economics and Management, Xiamen University of Technology, Xiamen, China
| |
Collapse
|
11
|
|
12
|
|
13
|
Dai M, Feng X, Yu H, Guo W. A migratory behavior and emotional preference clustering algorithm based on learning vector quantization and gaussian mixture model. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03325-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
14
|
Yang J, Luo T, Zeng L, Jin X. The cost-sensitive approximation of neighborhood rough sets and granular layer selection. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-212234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Neighborhood rough sets (NRS) are the extended model of the classical rough sets. The NRS describe the target concept by upper and lower neighborhood approximation boundaries. However, the method of approximately describing the uncertain target concept with existed neighborhood information granules is not given. To solve this problem, the cost-sensitive approximation model of the NRS is proposed in this paper, and its related properties are analyzed. To obtain the optimal approximation granular layer, the cost-sensitive progressive mechanism is proposed by considering user requirements. The case study shows that the reasonable granular layer and its approximation can be obtained under certain constraints, which is suitable for cost-sensitive application scenarios. The experimental results show that the advantage of the proposed approximation model, moreover, the decision cost of the NRS approximation model will monotonically decrease with granularity being finer.
Collapse
Affiliation(s)
- Jie Yang
- School of Physics and Electronic Science, Zunyi Normal University, Zunyi, China
- National Pilot School of Software, Yunnan University, Kunming, China
| | - Tian Luo
- School of Physics and Electronic Science, Zunyi Normal University, Zunyi, China
| | - Lijuan Zeng
- School of Physics and Electronic Science, Zunyi Normal University, Zunyi, China
| | - Xin Jin
- National Pilot School of Software, Yunnan University, Kunming, China
| |
Collapse
|
15
|
|
16
|
Classification-level and Class-level Complement Information Measures Based on Neighborhood Decision Systems. Cognit Comput 2021. [DOI: 10.1007/s12559-021-09921-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
17
|
Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System. ENTROPY 2021; 23:e23060704. [PMID: 34199499 PMCID: PMC8230021 DOI: 10.3390/e23060704] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 05/30/2021] [Accepted: 05/31/2021] [Indexed: 11/17/2022]
Abstract
Feature selection is one of the core contents of rough set theory and application. Since the reduction ability and classification performance of many feature selection algorithms based on rough set theory and its extensions are not ideal, this paper proposes a feature selection algorithm that combines the information theory view and algebraic view in the neighborhood decision system. First, the neighborhood relationship in the neighborhood rough set model is used to retain the classification information of continuous data, to study some uncertainty measures of neighborhood information entropy. Second, to fully reflect the decision ability and classification performance of the neighborhood system, the neighborhood credibility and neighborhood coverage are defined and introduced into the neighborhood joint entropy. Third, a feature selection algorithm based on neighborhood joint entropy is designed, which improves the disadvantage that most feature selection algorithms only consider information theory definition or algebraic definition. Finally, experiments and statistical analyses on nine data sets prove that the algorithm can effectively select the optimal feature subset, and the selection result can maintain or improve the classification performance of the data set.
Collapse
|
18
|
Han SE. Topological properties of locally finite covering rough sets and K-topological rough set structures. Soft comput 2021. [DOI: 10.1007/s00500-021-05693-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
19
|
Zhang X, Gou H, Lv Z, Miao D. Double-quantitative distance measurement and classification learning based on the tri-level granular structure of neighborhood system. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.106799] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
20
|
|
21
|
|
22
|
Neighborhood multi-granulation rough sets-based attribute reduction using Lebesgue and entropy measures in incomplete neighborhood decision systems. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2019.105373] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
23
|
A survey on granular computing and its uncertainty measure from the perspective of rough set theory. GRANULAR COMPUTING 2019. [DOI: 10.1007/s41066-019-00204-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|