1
|
Pessimistic Multigranulation Rough Set of Intuitionistic Fuzzy Sets Based on Soft Relations. MATHEMATICS 2022. [DOI: 10.3390/math10050685] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Qian presented multigranulation rough set (MGRS) models based on Pawlak’s rough set (RS) model. There are two types of MGRS models, named optimistic MGRS and pessimistic MGRS. Recently, Shabir et al. presented an optimistic multigranulation intuitionistic fuzzy rough set (OMGIFRS) based on soft binary relations. This paper explores the pessimistic multigranulation intuitionistic fuzzy rough set (PMGIFRS) based on soft relations combined with a soft set (SS) over two universes. The resulting two sets are lower approximations and upper approximations with respect to the aftersets and foresets. Some basic properties of this established model are studied. Similarly, the MGRS of an IFS based on multiple soft relations is presented and some algebraic properties are discussed. Finally, an example is presented that illustrates the importance of the proposed decision-making algorithm.
Collapse
|
2
|
Zamir A, Khan HU, Mehmood W, Iqbal T, Akram AU. A feature-centric spam email detection model using diverse supervised machine learning algorithms. ELECTRONIC LIBRARY 2020. [DOI: 10.1108/el-07-2019-0181] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
This research study proposes a feature-centric spam email detection model (FSEDM) based on content, sentiment, semantic, user and spam-lexicon features set. The purpose of this study is to exploit the role of sentiment features along with other proposed features to evaluate the classification accuracy of machine learning algorithms for spam email detection.
Design/methodology/approach
Existing studies primarily exploits content-based feature engineering approach; however, a limited number of features is considered. In this regard, this research study proposed a feature-centric framework (FSEDM) based on existing and novel features of email data set, which are extracted after pre-processing. Afterwards, diverse supervised learning techniques are applied on the proposed features in conjunction with feature selection techniques such as information gain, gain ratio and Relief-F to rank most prominent features and classify the emails into spam or ham (not spam).
Findings
Analysis and experimental results indicated that the proposed model with sentiment analysis is competitive approach for spam email detection. Using the proposed model, deep neural network applied with sentiment features outperformed other classifiers in terms of classification accuracy up to 97.2%.
Originality/value
This research is novel in this regard that no previous research focuses on sentiment analysis in conjunction with other email features for detection of spam emails.
Collapse
|
3
|
Saidani N, Adi K, Allili MS. A semantic-based classification approach for an enhanced spam detection. Comput Secur 2020. [DOI: 10.1016/j.cose.2020.101716] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
4
|
Feng X, Hao X, Xin R, Gao X, Liu M, Li F, Wang Y, Shi R, Zhao S, Zhou F. Detecting Methylomic Biomarkers of Pediatric Autism in the Peripheral Blood Leukocytes. Interdiscip Sci 2019; 11:237-246. [DOI: 10.1007/s12539-019-00328-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2018] [Revised: 03/25/2019] [Accepted: 03/28/2019] [Indexed: 12/12/2022]
|
5
|
Méndez JR, Cotos-Yañez TR, Ruano-Ordás D. A new semantic-based feature selection method for spam filtering. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2018.12.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
6
|
Ruano-Ordás D, Fdez-Riverola F, R. Méndez J. Using evolutionary computation for discovering spam patterns from e-mail samples. Inf Process Manag 2018. [DOI: 10.1016/j.ipm.2017.12.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
7
|
Ruano-Ordás D, Fdez-Riverola F, Méndez JR. Concept drift in e-mail datasets: An empirical study with practical implications. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2017.10.049] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
8
|
|
9
|
Al-Badarneh A, Al-Shawakfa E, Bani-Ismail B, Al-Rababah K, Shatnawi S. The impact of indexing approaches on Arabic text classification. J Inf Sci 2016. [DOI: 10.1177/0165551515625030] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
This paper investigates the impact of using different indexing approaches (full-word, stem, and root) when classifying Arabic text. In this study, the naïve Bayes classifier is used to construct the multinomial classification models and is evaluated using stratified k-fold cross-validation ( k ranges from 2 to 10). It is also uses a corpus that consists of 1000 normalized Arabic documents. The results of one experiment in this study show that significant accuracy improvements have occurred when the full-word form is used in most k-folds. Further experiments show that the classifier has achieved the highest accuracy in the eight-fold by using 7/8–1/8 train–test ratio, despite the indexing approach being used. The overall results of this study show that the classifier has achieved the maximum micro-average accuracy 99.36%, either by using the full-word form or the stem form. This proves that the stem is a better choice to use when classifying Arabic text, because it makes the corpus dataset smaller and this will enhance both the processing time and storage utilization, and achieve the highest level of accuracy.
Collapse
|
10
|
Classification of Complex Urban Fringe Land Cover Using Evidential Reasoning Based on Fuzzy Rough Set: A Case Study of Wuhan City. REMOTE SENSING 2016. [DOI: 10.3390/rs8040304] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
11
|
Alsmadi I, Alhami I. Clustering and classification of email contents. JOURNAL OF KING SAUD UNIVERSITY - COMPUTER AND INFORMATION SCIENCES 2015. [DOI: 10.1016/j.jksuci.2014.03.014] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|