1
|
Ostonov A, Moshkov M. Comparative Analysis of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes. ENTROPY (BASEL, SWITZERLAND) 2024; 26:519. [PMID: 38920528 PMCID: PMC11202716 DOI: 10.3390/e26060519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 06/14/2024] [Accepted: 06/15/2024] [Indexed: 06/27/2024]
Abstract
In this paper, we consider classes of decision tables with many-valued decisions closed under operations of the removal of columns, the changing of decisions, the permutation of columns, and the duplication of columns. We study relationships among three parameters of these tables: the complexity of a decision table (if we consider the depth of the decision trees, then the complexity of a decision table is the number of columns in it), the minimum complexity of a deterministic decision tree, and the minimum complexity of a nondeterministic decision tree. We consider the rough classification of functions characterizing relationships and enumerate all possible seven types of relationships.
Collapse
Affiliation(s)
- Azimkhon Ostonov
- Computer, Electrical and Mathematical Sciences & Engineering Division and Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia;
| | | |
Collapse
|
2
|
Musa SY, Asaad BA. A progressive approach to multi-criteria group decision-making: N-bipolar hypersoft topology perspective. PLoS One 2024; 19:e0304016. [PMID: 38771766 PMCID: PMC11108228 DOI: 10.1371/journal.pone.0304016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 05/03/2024] [Indexed: 05/23/2024] Open
Abstract
This paper investigates N-bipolar hypersoft topology (N-BHST), a novel extension of both the well-established N-hypersoft topology (N-HST) and hypersoft topology (HST). Deviating significantly from its precursor, the N-bipolar hypersoft (N-BHS) set, N-BHST introduces a multi-opinion approach to decision-making, augmenting robustness and adaptability. This innovative framework addresses identified limitations in N-bipolar soft topology (N-BST), especially in managing multi-argument approximate functions. The study analyzes various operators (closure, interior, exterior, and boundary) within the N-BHST framework, elucidating their interrelationships. Additionally, an examination is carried out on the enhancement of multi-criteria group decision-making (MCGDM) using N-BHST, setting it apart from existing models. A numerical example is presented to illustrate its application in real-world decision scenarios.
Collapse
Affiliation(s)
- Sagvan Y. Musa
- Department of Mathematics, College of Education, University of Zakho, Zakho, Iraq
| | - Baravan A. Asaad
- Department of Computer Science, College of Science, Cihan University-Duhok, Duhok, Iraq
- Department of Mathematics, College of Science, University of Zakho, Zakho, Iraq
| |
Collapse
|
3
|
Mohd G, Bhat IM, Kakroo I, Balachandran A, Tabasum R, Majid K, Wani MF, Manna U, Ghodake G, Lone S. Azolla Pinnata: Sustainable Floating Oil Cleaner of Water Bodies. ACS OMEGA 2024; 9:12725-12733. [PMID: 38524463 PMCID: PMC10955581 DOI: 10.1021/acsomega.3c08417] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 02/05/2024] [Accepted: 02/12/2024] [Indexed: 03/26/2024]
Abstract
Various plant-based materials effectively absorb oil contaminants at the water/air interface. These materials showcase unparalleled efficiency in purging oil contaminants, encompassing rivers, lakes, and boundless oceans, positioning them as integral components of environmental restoration endeavors. In addition, they are biodegradable, readily available, and eco-friendly, thus making them a preferable choice over traditional oil cleaning materials. This study explores the phenomenal properties of the floating Azolla fern (Azolla pinnata), focusing on its unique hierarchical leaf surface design at both the microscale and nanoscale levels. These intricate structures endow the fern with exceptional characteristics, including superhydrophobicity, high water adhesion, and remarkable oil or organic solvent absorption capabilities. Azolla's leaf surface exhibits a rare combination of dual wettability, where hydrophilic spots on a superhydrophobic base enable the pinning of water droplets, even when positioned upside-down. This extraordinary property, known as the parahydrophobic state, is rare in floating plants, akin to the renowned Salvinia molesta, setting Azolla apart as a natural wonder. Submerged in water, Azolla leaves excel at absorbing light oils at the air-water interface, demonstrating a notable ability to extract high-density organic solvents. Moreover, Azolla's rapid growth, doubling in the area every 4-5 days, especially in flowing waters, positions it as a sustainable alternative to traditional synthetic oil-cleaning materials with long-term environmental repercussions. This scientific lead could pave the way for more environmentally friendly approaches to mitigate the negative impacts of oil spills and promote a cleaner water ecosystem.
Collapse
Affiliation(s)
- Ghulam Mohd
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Irfan Majeed Bhat
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Insha Kakroo
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Akshay Balachandran
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Ruheena Tabasum
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Kowsar Majid
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Mohammad Farooq Wani
- Department
of Mechanical Engineering, NIT Srinagar,
NIT, Jammu & Kashmir 190006, Srinagar, India
| | - Uttam Manna
- Department
of Chemistry, Indian Institute of Technology
(IIT), Kamrup, Guwahati 781039, Assam, India
| | - Gajanan Ghodake
- Department
of Biological Science and Environmental Science, College of Life Science
and Biotechnology, Dongguk University, Seoul, Ilsongdong-gu, Goyang-si 10326, Gyeonggi-do, Republic of Korea
| | - Saifullah Lone
- Department
of Chemistry, National Institute of Technology
(NIT), Jammu
& Kashmir 190006, Srinagar, India
- iDREAM
(Interdisciplinary Division for Renewable Energy & Advanced Materials, Laboratory for Bioinspired Research on Advanced Interface
and Nanomaterials (BRAINS), NIT, Jammu & Kashmir 190006, Srinagar, India
| |
Collapse
|
4
|
Mishra M, Acharjya DP. A hybridized red deer and rough set clinical information retrieval system for hepatitis B diagnosis. Sci Rep 2024; 14:3815. [PMID: 38360918 PMCID: PMC10869783 DOI: 10.1038/s41598-024-53170-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open
Abstract
Healthcare is a big concern in the current booming population. Many approaches for improving health are imposed, such as early disease identification, treatment, and prevention. Therefore, knowledge acquisition is highly essential at different stages of decision-making. Inferring knowledge from the information system, which necessitates multiple steps for extracting useful information, is one technique to address this problem. Handling uncertainty throughout data analysis is also another challenging task. Computer intelligence is a step forward to this end while selecting characteristics, classification, clustering, and developing clinical information retrieval systems. According to recent studies, swarm optimization is a useful technique for discovering key features while resolving real-world issues. However, it is ineffective in managing uncertainty. Conversely, a rough set helps a decision system generate decision rules. This produces decision rules without any additional information. In order to assess real-world information systems while managing uncertainties, a hybrid strategy that combines a rough set and red deer algorithm is presented in this research. In the red deer optimization algorithm, the suggested method selects the optimal characteristics in terms of the degree of dependence on the rough set. In order to determine the decision rules, further a rough set is used. The efficiency of the suggested model is also contrasted with that of the decision tree algorithm and the conventional rough set. An empirical study on hepatitis disease illustrates the viability of the proposed research as compared to the decision tree and crisp rough set. The proposed hybridization of rough set and red deer algorithm achieves an accuracy of 91.7% accuracy. The acquired accuracy for the decision tree, and rough set methods is 82.9%, and 88.9%, respectively. It suggests that the proposed research is viable.
Collapse
Affiliation(s)
- Madhusmita Mishra
- Vellore Institute of Technology, School of Computer Science and Engineering, Vellore, 632014, India
| | - D P Acharjya
- Vellore Institute of Technology, School of Computer Science and Engineering, Vellore, 632014, India.
| |
Collapse
|
5
|
Ostonov A, Moshkov M. On Complexity of Deterministic and Nondeterministic Decision Trees for Conventional Decision Tables from Closed Classes. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1411. [PMID: 37895532 PMCID: PMC10606725 DOI: 10.3390/e25101411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/05/2023] [Accepted: 09/10/2023] [Indexed: 10/29/2023]
Abstract
In this paper, we consider classes of conventional decision tables closed relative to the removal of attributes (columns) and changing decisions assigned to rows. For tables from an arbitrary closed class, we study the dependence of the minimum complexity of deterministic and nondeterministic decision trees on the complexity of the set of attributes attached to columns. We also study the dependence of the minimum complexity of deterministic decision trees on the minimum complexity of nondeterministic decision trees. Note that a nondeterministic decision tree can be interpreted as a set of true decision rules that covers all rows of the table.
Collapse
Affiliation(s)
- Azimkhon Ostonov
- Computer, Electrical and Mathematical Sciences & Engineering Division and Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia;
| | | |
Collapse
|
6
|
Jin L, Liu Q, Geng Y. Ontology-Based Semantic Modeling of Coal Mine Roof Caving Accidents. Processes (Basel) 2023. [DOI: 10.3390/pr11041058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023] Open
Abstract
The frequency of roof-caving accidents ranks first among all coal mine accidents. However, the scattered knowledge system in this field and the lack of standardization exacerbate the difficulty of analyzing roof fall accidents. This study proposes an ontology-based semantic modeling method for roof fall accidents to share and reuse roof fall knowledge for intelligent decision-making. The crucial concepts of roof fall accidents and the correlations between concepts are summarized by analyzing the roof fall knowledge, providing a standard framework to represent the prior knowledge in this field. Besides, the ontology modeling tool Protégé is used to construct the ontology. As for ontology-based deep information mining and semantic reasoning, semantic rules based on expert experience and data fusion technology are proposed to evaluate mines’ potential risks comprehensively. In addition, the roof-falling rules are formalized based on the Jena syntax to make the ontology uniformly expressed in the computer. The Jena reasoning engine is utilized to mine potential tacit knowledge and preventive measures or solutions. The proposed method is demonstrated using roof fall cases, which confirms its validity and practicability. Results indicate that this method can realize the storage, management, and sharing of roof fall accident knowledge. Furthermore, it can provide accurate and comprehensive experience knowledge for the roof fall knowledge requester.
Collapse
Affiliation(s)
- Lingzi Jin
- Lu’an Chemical Group Co., Ltd., Changzhi 046204, China
- Center of Shanxi Engineering Research for Coal Mine Intelligent Equipment, Taiyuan University of Technology, Taiyuan 030024, China
- Beijing Tianma Intelligent Control Technology Co., Ltd., Beijing 101399, China
| | - Qian Liu
- School of Safety and Emergency Management Engineering, Taiyuan University of Technology, Taiyuan 030024, China
| | - Yide Geng
- Center of Shanxi Engineering Research for Coal Mine Intelligent Equipment, Taiyuan University of Technology, Taiyuan 030024, China
| |
Collapse
|
7
|
Khedgaonkar RS, Singh KR. Designing face resemblance technique using near set theory under varying facial features. MULTIMEDIA TOOLS AND APPLICATIONS 2023; 82:1-22. [PMID: 37362639 PMCID: PMC9986670 DOI: 10.1007/s11042-023-14927-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 01/10/2023] [Accepted: 02/22/2023] [Indexed: 06/28/2023]
Abstract
Near sets (also called Descriptively Near Sets) classify nonempty sets of objects based on object feature values. The Near Set Theory provides a framework for measuring the similarity of objects based on features that describe them in much the same way humans perceive the similarity of objects. This paper presents a novel approach for face recognition using Near Set Theory that takes into account variations in facial features due to varying facial expressions, and facial plastic surgery. In the proposed work, we demonstrate two-fold usage of Near set theory; firstly, Near Set Theory as a feature selector to select the plastic surgery facial features with the help of tolerance classes, and secondly, Near Set Theory as a recognizer that uses selected prominent intrinsic facial features which are automatically extracted through the deep learning model. Extensive experimentation was performed on various facial datasets such as YALE, PSD, and ASPS. Experimentation demonstrates 93% of accuracy on the YALE face dataset, 98% of accuracy on the PSD dataset, and 98% of accuracy on the ASPS dataset. A detailed comparative analysis of the proposed work of facial resemblance with other state-of-the-art algorithms is presented in this paper. The experimentation results effectively classify face resemblance using Near Set Theory, which has outperformed several state-of-the-art classification approaches.
Collapse
Affiliation(s)
| | - Kavita R. Singh
- Computer Technology, Yeshwantrao Chavan College of Engineering, Nagpur, India
| |
Collapse
|
8
|
Kumari N, Acharjya DP. A hybrid rough set shuffled frog leaping knowledge inference system for diagnosis of lung cancer disease. Comput Biol Med 2023; 155:106662. [PMID: 36805223 DOI: 10.1016/j.compbiomed.2023.106662] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 01/13/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023]
Abstract
Abundant medical data are generated in the digital world every second. However, gathering helpful information from these data is difficult. Gathering useful information from the dataset is very advantageous and demanding. Besides, such data also contain many extraneous features that do not influence the foreboding accuracy while diagnosing a disease. The data must eliminate these extraneous features to get a better diagnosis. Ultimately, the minimized information system will lead to a better diagnosis. In this paper, we have introduced an incremental rough set shuffled frog leaping algorithm for knowledge inference. The proposed algorithm helps find minimum features from an information system while handling complex databases with uncertainty and incompleteness. The proposed rough set shuffled frog leaping knowledge inference model works in two phases. In the initial phase, the incremental rough set shuffled frog leaping algorithm is used to get the most relevant features. Identifying the relevant features is carried out using a fitness function, which uses the rough degree of dependency. The use of the fitness function identifies the much information with the minimum number of features. The purpose of feature selection is to identify a feature subset from an original set of features without reducing the predictive accuracy and to scale back the computation overhead in the data processing. In the second phase, a rough set is utilized for knowledge discovery in perception with rule generation. The selection of decision rules is carried out based on the accuracy of the decision rule and a predefined threshold value. An empirical analysis of the lung disease information system and a comparative study is conducted. Experimental outcomes exhibit that hybrid techniques express the feasibility of the proposed model while achieving better classification accuracy.
Collapse
Affiliation(s)
- Nancy Kumari
- School of Computer Science and Engineering, VIT, Vellore 632014, India
| | - D P Acharjya
- School of Computer Science and Engineering, VIT, Vellore 632014, India.
| |
Collapse
|
9
|
Ramanna S, Ashrafi N, Loster E, Debroni K, Turner S. Rough-set based learning: Assessing patterns and predictability of anxiety, depression, and sleep scores associated with the use of cannabinoid-based medicine during COVID-19. Front Artif Intell 2023; 6:981953. [PMID: 36872936 PMCID: PMC9975391 DOI: 10.3389/frai.2023.981953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 01/27/2023] [Indexed: 02/17/2023] Open
Abstract
Recently, research is emerging highlighting the potential of cannabinoids' beneficial effects related to anxiety, mood, and sleep disorders as well as pointing to an increased use of cannabinoid-based medicines since COVID-19 was declared a pandemic. The objective of this research is 3 fold: i) to evaluate the relationship of the clinical delivery of cannabinoid-based medicine for anxiety, depression and sleep scores by utilizing machine learning specifically rough set methods; ii) to discover patterns based on patient features such as specific cannabinoid recommendations, diagnosis information, decreasing/increasing levels of clinical assessment tools (CAT) scores over a period of time; and iii) to predict whether new patients could potentially experience either an increase or decrease in CAT scores. The dataset for this study was derived from patient visits to Ekosi Health Centres, Canada over a 2 year period including the COVID timeline. Extensive pre-processing and feature engineering was performed. A class feature indicative of their progress or lack thereof due to the treatment received was introduced. Six Rough/Fuzzy-Rough classifiers as well as Random Forest and RIPPER classifiers were trained on the patient dataset using a 10-fold stratified CV method. The highest overall accuracy, sensitivity and specificity measures of over 99% was obtained using the rule-based rough-set learning model. In this study, we have identified rough-set based machine learning model with high accuracy that could be utilized for future studies regarding cannabinoids and precision medicine.
Collapse
Affiliation(s)
- Sheela Ramanna
- Department of Applied Computer Science, University of Winnipeg, Winnipeg, MB, Canada
| | - Negin Ashrafi
- Department of Applied Computer Science, University of Winnipeg, Winnipeg, MB, Canada
| | - Evan Loster
- Ekosi Health Centre Corporation, Winnipeg, MB, Canada
| | - Karen Debroni
- Ekosi Health Centre Corporation, Winnipeg, MB, Canada
| | | |
Collapse
|
10
|
Mariyam F, Mehfuz S, Sadiq M. RAGOSRA: Rough attributed goal oriented software requirements analysis method. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2023. [DOI: 10.3233/jifs-221300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
Goal oriented software requirements analysis method is used for the analysis of elicited functional goals (FGs) and non-functional goals (NFGs) of a system in which goals are decomposed and refined into sub-goals until requirements from the sub-goals are identified. Based on the critical analysis, we found that most of the attention of goal-oriented methods is on the crisp and fuzzy logic during the analysis of the software goals or requirements. In these methods’ prior information about the type of membership function is required; and the selection of membership function depends on the subjective justification. As a result, it lacks objectivity and may affect the ranking values of the goals or requirements during the analysis. Therefore, this paper presents a rough attributed goal-oriented software requirements analysis (RAGOSRA) method in which rough preference matrix has been used to capture the opinions of different stakeholders. The result of the RAGOSRA method is compared by considering the following criteria, i.e., goal types, goal links, types of data used in the analysis, stakeholder perceptions and time complexity with some fuzzy based methods. Based on the time complexity analysis, it is found that RAGOSRA method requires only 4 operations for the selection of goals for the dataset having NFGs and FGs of an institute examination system. On the other hand, FAGOSRA method, fuzzy TOPSIS method, and fuzzy AHP method requires 36, 200, and 240 operations respectively.
Collapse
Affiliation(s)
- Farhana Mariyam
- Department of Electrical Engineering, Faculty of Engineering and Technology, Jamia Millia Islamia (A Central University), New Delhi, India
| | - Shabana Mehfuz
- Department of Electrical Engineering, Faculty of Engineering and Technology, Jamia Millia Islamia (A Central University), New Delhi, India
| | - Mohd. Sadiq
- Software Engineering Laboratory, Computer Engineering Section, UPFET, Jamia Millia Islamia (A Central University), New Delhi, India
| |
Collapse
|
11
|
Żabiński K, Zielosko B. Improved EAV-Based Algorithm for Decision Rules Construction. ENTROPY (BASEL, SWITZERLAND) 2023; 25:91. [PMID: 36673232 PMCID: PMC9858280 DOI: 10.3390/e25010091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 12/21/2022] [Accepted: 12/28/2022] [Indexed: 06/17/2023]
Abstract
In this article, we present a modification of the algorithm based on EAV (entity-attribute-value) model, for induction of decision rules, utilizing novel approach for attribute ranking. The selection of attributes used as premises of decision rules, is an important stage of the process of rules induction. In the presented approach, this task is realized using ranking of attributes based on standard deviation of attributes' values per decision classes, which is considered as a distinguishability level. The presented approach allows to work not only with numerical values of attributes but also with categorical ones. For this purpose, an additional step of data transformation into a matrix format has been proposed. It allows to transform data table into a binary one with proper equivalents of categorical values of attributes and ensures independence of the influence of the attribute selection function from the data type of variables. The motivation for the proposed method is the development of an algorithm which allows to construct rules close to optimal ones in terms of length, while maintaining enough good classification quality. The experiments presented in the paper have been performed on data sets from UCI ML Repository, comparing results of the proposed approach with three selected greedy heuristics for induction of decision rules, taking into consideration classification accuracy and length and support of constructed rules. The obtained results show that for the most part of datasests, the average length of rules obtained for 80% of best attributes from the ranking is very close to values obtained for the whole set of attributes. In case of classification accuracy, for 50% of considered datasets, results obtained for 80% of best attributes from the ranking are higher or the same as results obtained for the whole set of attributes.
Collapse
|
12
|
Stepaniuk J, Skowron A. Three-way approximation of decision granules based on the rough set approach. Int J Approx Reason 2023. [DOI: 10.1016/j.ijar.2023.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
13
|
Rule acquisition in generalized multi-scale information systems with multi-scale decisions. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2022.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
14
|
Selvi S, Chandrasekaran M. Detection of Drug Abuse Using Rough Set and Neural Network-Based Elevated Mathematical Predictive Modelling. Neural Process Lett 2022. [DOI: 10.1007/s11063-022-11086-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
15
|
Feature selection for label distribution learning using dual-similarity based neighborhood fuzzy entropy. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.10.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
16
|
Moshkov M, Zielosko B, Tetteh ET. Selected Data Mining Tools for Data Analysis in Distributed Environment. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1401. [PMID: 37420421 DOI: 10.3390/e24101401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 08/31/2022] [Accepted: 09/14/2022] [Indexed: 07/09/2023]
Abstract
In this paper, we deal with distributed data represented either as a finite set T of decision tables with equal sets of attributes or a finite set I of information systems with equal sets of attributes. In the former case, we discuss a way to the study decision trees common to all tables from the set T: building a decision table in which the set of decision trees coincides with the set of decision trees common to all tables from T. We show when we can build such a decision table and how to build it in a polynomial time. If we have such a table, we can apply various decision tree learning algorithms to it. We extend the considered approach to the study of test (reducts) and decision rules common to all tables from T. In the latter case, we discuss a way to study the association rules common to all information systems from the set I: building a joint information system for which the set of true association rules that are realizable for a given row ρ and have a given attribute a on the right-hand side coincides with the set of association rules that are true for all information systems from I, have the attribute a on the right-hand side, and are realizable for the row ρ. We then show how to build a joint information system in a polynomial time. When we build such an information system, we can apply various association rule learning algorithms to it.
Collapse
Affiliation(s)
- Mikhail Moshkov
- Computer, Electrical and Mathematical Sciences and Engineering Division and Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| | - Beata Zielosko
- Institute of Computer Science, Faculty of Science and Technology, University of Silesia in Katowice, Bȩdzińska 39, 41-200 Sosnowiec, Poland
| | - Evans Teiko Tetteh
- Doctoral School, University of Silesia in Katowice, Bankowa 14, 40-007 Katowice, Poland
| |
Collapse
|
17
|
Machine learning in corporate credit rating assessment using the expanded audit report. Mach Learn 2022. [DOI: 10.1007/s10994-022-06226-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
18
|
Ding W, Qin T, Shen X, Ju H, Wang H, Huang J, Li M. Parallel incremental efficient attribute reduction algorithm based on attribute tree. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.08.044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
19
|
Wang G, Mao H. Approximation operators via TD-matroids on two sets. Soft comput 2022; 26:9785-9804. [PMID: 35966347 PMCID: PMC9361929 DOI: 10.1007/s00500-022-07367-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/29/2022] [Indexed: 11/30/2022]
Abstract
Rough set theory is an extension of set theory with two additional unary set-theoretic operators known as approximation in order to extract information and knowledge. It needs the basic, or say definable, knowledge to approximate the undefinable knowledge in a knowledge space using the pair of approximation operators. Many existed approximation operators are expressed with unary form. How to mine the knowledge which is asked by binary form with rough set has received less research attention, though there are strong needs to reveal the answer for this challenging problem. There exist many information with matroid constraints since matroid provides a platform for combinatorial algorithms especially greedy algorithm. Hence, it is necessary to consider a matroidal structure on two sets no matter the two sets are the same or not. In this paper, we investigate the construction of approximation operators expressed by binary form with matroid theory, and the constructions of matroidal structure aided by a pair of approximation operators expressed by binary form.First, we provide a kind of matroidal structure—TD-matroid defined on two sets as a generalization of Whitney classical matroid. Second, we introduce this new matroidal construction to rough set and construct a pair of approximation operators expressed with binary form. Third, using the existed pair of approximation operators expressed with binary form, we build up two concrete TD-matroids. Fourth, for TD-matroid and the approximation operators expressed by binary form on two sets, we seek out their properties with aspect of posets, respectively. Through the paper, we use some biological examples to explain and test the correct of obtained results. In summary, this paper provides a new approach to research rough set theory and matroid theory on two sets, and to study on their applications each other.
Collapse
Affiliation(s)
- Gang Wang
- College of Life Science, Hebei University, Baoding, 071002 China
| | - Hua Mao
- Department of Mathematics, Hebei University, Baoding, 071002 China
| |
Collapse
|
20
|
Zhang X, Chen X, Xu W, Ding W. Dynamic information fusion in multi-source incomplete interval-valued information system with variation of information sources and attributes. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.06.054] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
21
|
Uddin J, Ghazali R, H. Abawajy J, Shah H, Husaini NA, Zeb A. Rough set based information theoretic approach for clustering uncertain categorical data. PLoS One 2022; 17:e0265190. [PMID: 35559954 PMCID: PMC9106167 DOI: 10.1371/journal.pone.0265190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 02/27/2022] [Indexed: 12/02/2022] Open
Abstract
Motivation Many real applications such as businesses and health generate large categorical datasets with uncertainty. A fundamental task is to efficiently discover hidden and non-trivial patterns from such large uncertain categorical datasets. Since the exact value of an attribute is often unknown in uncertain categorical datasets, conventional clustering analysis algorithms do not provide a suitable means for dealing with categorical data, uncertainty, and stability. Problem statement The ability of decision making in the presence of vagueness and uncertainty in data can be handled using Rough Set Theory. Though, recent categorical clustering techniques based on Rough Set Theory help but they suffer from low accuracy, high computational complexity, and generalizability especially on data sets where they sometimes fail or hardly select their best clustering attribute. Objectives The main objective of this research is to propose a new information theoretic based Rough Purity Approach (RPA). Another objective of this work is to handle the problems of traditional Rough Set Theory based categorical clustering techniques. Hence, the ultimate goal is to cluster uncertain categorical datasets efficiently in terms of the performance, generalizability and computational complexity. Methods The RPA takes into consideration information-theoretic attribute purity of the categorical-valued information systems. Several extensive experiments are conducted to evaluate the efficiency of RPA using a real Supplier Base Management (SBM) and six benchmark UCI datasets. The proposed RPA is also compared with several recent categorical data clustering techniques. Results The experimental results show that RPA outperforms the baseline algorithms. The significant percentage improvement with respect to time (66.70%), iterations (83.13%), purity (10.53%), entropy (14%), and accuracy (12.15%) as well as Rough Accuracy of clusters show that RPA is suitable for practical usage. Conclusion We conclude that as compared to other techniques, the attribute purity of categorical-valued information systems can better cluster the data. Hence, RPA technique can be recommended for large scale clustering in multiple domains and its performance can be enhanced for further research.
Collapse
Affiliation(s)
- Jamal Uddin
- Qurtuba University of Science & IT, Peshawar, Pakistan
- * E-mail:
| | - Rozaida Ghazali
- Universiti Tun Hussien Onn Malaysia, Batu Pahat, Johor, Malaysia
| | | | | | | | - Asim Zeb
- Abbottabad University of Science & Technology, Abbottabad, Pakistan
| |
Collapse
|
22
|
Wang P, Zhao Z, Wang Z, Li Z. Fuzzy set-valued information systems and their homomorphisms based on data compression. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-213186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A fuzzy set-valued information system (FSVIS) is a special information system (IS) where the value of an object under each attribute or each attribute value is a fuzzy set. Homomorphism is a powerful mathematical tool to deal with FSVISs, which can be used to study relationships among them. Based on data compression, we obtain some characterizations about FSVISs and their homomorphisms. First, some homomorphisms between FSVISs are introduced. After that, attribute reduction based on tolerance relation in a FSVIS is studied. Eventually, we get invariant characterizations of FSVISs based on some special homomorphisms under data compression.
Collapse
Affiliation(s)
- Pei Wang
- School of Mathematics and Statistics, Yulin Normal University, Yulin, Guangxi, P.R. China
| | - Zhengwei Zhao
- School of Mathematics and Physics, Guangxi University for Nationalities, Nanning, Guangxi, P.R. China
| | - Zhihong Wang
- School of Information Science and Technology, Southwest Jiaotong University, Chengdu, Sichuan, P.R. China
| | - Zhaowen Li
- School of Mathematics and Statistics, Yulin Normal University, Yulin, Guangxi, P.R. China
| |
Collapse
|
23
|
|
24
|
Evaluation Method of Highway Plant Slope Based on Rough Set Theory and Analytic Hierarchy Process: A Case Study in Taihang Mountain, Hebei, China. MATHEMATICS 2022. [DOI: 10.3390/math10081264] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The material foundation of soil and water conservation is built on the integrity of the highway plant slope. The proportional relevance of the components that affect slope quality was evaluated based on an environmental assessment and the actual characteristics of the highway slope. A system of four major indexes and twelve secondary indexes comprising plant traits, geometric factors, hydrological conditions, and vegetation conditions was developed to assess the stability of roadway plant slopes. The rough set theory approach and the analytic hierarchy process were used to solve the weights of the slope evaluation indexes. Based on a rough set and an analytic hierarchy process, an evaluation model is proposed. The model eliminates the inconsistency and uncertainty in the evaluated factors that are used to calculate the slope. The study was conducted in China. The highway plant slope of the Taihang Mountain highway in the Hebei province was evaluated using the assessment model after dividing the highway plant slope stability into four grades. According to the evaluation results, the model can be used as a reference highway plant slope stability study and provide technical help to prevent and lower slope safety accidents. The evaluation model can predict the slope quality of highway plants, demonstrating the efficacy and reliability of the evaluation methodology and approach.
Collapse
|
25
|
Rough-Set-Theory-Based Classification with Optimized k-Means Discretization. TECHNOLOGIES 2022. [DOI: 10.3390/technologies10020051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The discretization of continuous attributes in a dataset is an essential step before the Rough-Set-Theory (RST)-based classification process is applied. There are many methods for discretization, but not many of them have linked the RST instruments from the beginning of the discretization process. The objective of this research is to propose a method to improve the accuracy and reliability of the RST-based classifier model by involving RST instruments at the beginning of the discretization process. In the proposed method, a k-means-based discretization method optimized with a genetic algorithm (GA) was introduced. Four datasets taken from UCI were selected to test the performance of the proposed method. The evaluation of the proposed discretization technique for RST-based classification is performed by comparing it to other discretization methods, i.e., equal-frequency and entropy-based. The performance comparison among these methods is measured by the number of bins and rules generated and by its accuracy, precision, and recall. A Friedman test continued with post hoc analysis is also applied to measure the significance of the difference in performance. The experimental results indicate that, in general, the performance of the proposed discretization method is significantly better than the other compared methods.
Collapse
|
26
|
A Distributed Attribute Reduction Algorithm for High-Dimensional Data under the Spark Framework. INT J COMPUT INT SYS 2022. [DOI: 10.1007/s44196-022-00076-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
AbstractAttribute reduction is an important issue in rough set theory. However, the rough set theory-based attribute reduction algorithms need to be improved to deal with high-dimensional data. A distributed version of the attribute reduction algorithm is necessary to enable it to effectively handle big data. The partition of attribute space is an important research direction. In this paper, a distributed attribution reduction algorithm based on cosine similarity (DARCS) for high-dimensional data pre-processing under the Spark framework is proposed. First, to avoid the repeated calculation of similar attributes, the algorithm gathers similar attributes based on similarity measure to form multiple clusters. And then one attribute is selected randomly as a representative from each cluster to form a candidate attribute subset to participate in the subsequent reduction operation. At the same time, to improve computing efficiency, an improved method is introduced to calculate the attribute dependency in the divided sub-attribute space. Experiments on eight datasets show that, on the premise of avoiding critical information loss, the reduction ability and computing efficiency of DARCS have been improved by 0.32 to 39.61% and 31.32 to 93.79% respectively compared to the distributed version of attribute reduction algorithm based on a random partitioning of the attributes space.
Collapse
|
27
|
Attaullah, Ashraf S, Rehman N, Khan A, Naeem M, Park C. A wind power plant site selection algorithm based on q-rung orthopair hesitant fuzzy rough Einstein aggregation information. Sci Rep 2022; 12:5443. [PMID: 35361827 PMCID: PMC8971469 DOI: 10.1038/s41598-022-09323-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 03/17/2022] [Indexed: 11/24/2022] Open
Abstract
Wind power is often recognized as one of the best clean energy solutions due to its widespread availability, low environmental impact, and great cost-effectiveness. The successful design of optimal wind power sites to create power is one of the most vital concerns in the exploitation of wind farms. Wind energy site selection is determined by the rules and standards of environmentally sustainable development, leading to a low, renewable energy source that is cost effective and contributes to global advancement. The major contribution of this research is a comprehensive analysis of information for the multi-attribute decision-making (MADM) approach and evaluation of ideal site selection for wind power plants employing q-rung orthopair hesitant fuzzy rough Einstein aggregation operators. A MADM technique is then developed using q-rung orthopair hesitant fuzzy rough aggregation operators. For further validation of the potential of the suggested method, a real case study on wind power plant site has been given. A comparison analysis based on the unique extended TOPSIS approach is presented to illustrate the offered method’s capability. The results show that this method has a larger space for presenting information, is more flexible in its use, and produces more consistent evaluation results. This research is a comprehensive collection of information that should be considered when choosing the optimum site for wind projects.
Collapse
Affiliation(s)
- Attaullah
- Department of Mathematics, Abdul Wali Khan University, Mardan, KPK, 23200, Pakistan
| | - Shahzaib Ashraf
- Department of Mathematics, Khawaja Farid University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
| | - Noor Rehman
- Department of Mathematics and Statistics, Bacha Khan University, Charsadda, KPK, Pakistan
| | - Asghar Khan
- Department of Mathematics, Abdul Wali Khan University, Mardan, KPK, 23200, Pakistan
| | - Muhammad Naeem
- Deanship of Combined First Year, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Choonkil Park
- Research Institute for Natural Sciences, Hanyang University, Seoul, Korea.
| |
Collapse
|
28
|
Yang X, Wang X, Kang J. Multi‐granularity decision rough set attribute reduction algorithm under quantum particle swarm optimization. IET CYBER-SYSTEMS AND ROBOTICS 2022. [DOI: 10.1049/csy2.12041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Xuxu Yang
- School of Electrical and Control Engineering Shaanxi University of Science and Technology Xi'an China
| | - Xueen Wang
- School of Electrical and Control Engineering Shaanxi University of Science and Technology Xi'an China
| | - Jie Kang
- School of Electrical and Control Engineering Shaanxi University of Science and Technology Xi'an China
| |
Collapse
|
29
|
Han Z, Huang Q, Zhang J, Huang C, Wang H, Huang X. GA-GWNN: Detecting anomalies of online learners by granular computing and graph wavelet convolutional neural network. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03337-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
30
|
Multigranulation double-quantitative decision-theoretic rough sets based on logical operations. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-021-01476-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
31
|
Abstract
In multi-strategy games, the increase in the number of strategies makes it difficult to make a solution. To maintain the competition advantage and obtain maximal profits, one side of the game hopes to predict the opponent’s behavior. Building a model to predict an opponent’s behavior is helpful. In this paper, we propose a rough set-game theory model (RS-GT) considering uncertain information and the opponent’s decision rules. The uncertainty of strategies is obtained based on the rough set method, and an accurate solution is obtained based on game theory from the rough set-game theory model. The players obtain their competitors’ decision rules to predict the opponents’ behavior by mining the information from repeated games in the past. The players determine their strategy to obtain maximum profits by predicting the opponent’s actions, i.e., adopting a first-mover or second-mover strategy to build a favorable situation. The result suggests that the rough set-game theory model helps enterprises avoid unnecessary losses and allows them to obtain greater profits.
Collapse
|
32
|
Moshkov M. On the Depth of Decision Trees with Hypotheses. ENTROPY 2022; 24:e24010116. [PMID: 35052142 PMCID: PMC8774416 DOI: 10.3390/e24010116] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 01/06/2022] [Accepted: 01/10/2022] [Indexed: 02/01/2023]
Abstract
In this paper, based on the results of rough set theory, test theory, and exact learning, we investigate decision trees over infinite sets of binary attributes represented as infinite binary information systems. We define the notion of a problem over an information system and study three functions of the Shannon type, which characterize the dependence in the worst case of the minimum depth of a decision tree solving a problem on the number of attributes in the problem description. The considered three functions correspond to (i) decision trees using attributes, (ii) decision trees using hypotheses (an analog of equivalence queries from exact learning), and (iii) decision trees using both attributes and hypotheses. The first function has two possible types of behavior: logarithmic and linear (this result follows from more general results published by the author earlier). The second and the third functions have three possible types of behavior: constant, logarithmic, and linear (these results were published by the author earlier without proofs that are given in the present paper). Based on the obtained results, we divided the set of all infinite binary information systems into four complexity classes. In each class, the type of behavior for each of the considered three functions does not change.
Collapse
Affiliation(s)
- Mikhail Moshkov
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
| |
Collapse
|
33
|
Ammar E, Al-Asfar A. A study of uncertain multi-objective nonlinear programming problems for rough intervals. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-202586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In real conditions, the parameters of multi-objective nonlinear programming (MONLP) problem models can’t be determined exactly. Hence in this paper, we concerned with studying the uncertainty of MONLP problems. We propose algorithms to solve rough and fully-rough-interval multi-objective nonlinear programming (RIMONLP and FRIMONLP) problems, to determine optimal rough solutions value and rough decision variables, where all coefficients and decision variables in the objective functions and constraints are rough intervals (RIs). For the RIMONLP and FRIMONLP problems solving methodology are presented using the weighting method and slice-sum method with Kuhn-Tucker conditions, We will structure two nonlinear programming (NLP) problems. In the first one of this NLP problem, all of its variables and coefficients are the lower approximation (LAI) it’s RIs. The second NLP problems are upper approximation intervals (UAI) of RIs. Subsequently, both NLP problems are sliced into two crisp nonlinear problems. NLP is utilized because numerous real systems are inherently nonlinear. Also, rough intervals are so important for dealing with uncertainty and inaccurate data in decision-making (DM) problems. The suggested algorithms enable us to the optimal solutions in the largest range of possible solution. Finally, Illustrative examples of the results are given.
Collapse
Affiliation(s)
- E. Ammar
- Department of Mathematics, Faculty of Science, Tanta University, Egypt
| | - A. Al-Asfar
- Department of Mathematics, Faculty of Science, Tanta University, Egypt
| |
Collapse
|
34
|
|
35
|
Chen Z, Liu K, Yang X, Fujita H. Random sampling accelerator for attribute reduction. Int J Approx Reason 2022. [DOI: 10.1016/j.ijar.2021.09.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
36
|
Wang X, Liu Y, Zhou R. Multi-granularity belief interval-valued soft set. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-210565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A new model named multi-granularity belief interval-valued soft set is introduced in this paper. Some basic properties about it are presented and illustrated. The improved concepts of the soft belief value and soft belief degree are proposed, which provided an easier and better compared horizontally and vertically method among the different objects and different parameters. An algorithm for decision-making problems on multi-granularity belief interval-valued soft set is put forward and its validity is proved by the application of an example. Moreover, the newly proposed algorithm is compared with existing method to indicate its extensive application.
Collapse
Affiliation(s)
- Xiaomin Wang
- School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, Hebei, P.R. China
| | - Yang Liu
- School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, Hebei, P.R. China
| | - Rui Zhou
- School of Mathematics and Statistics, Northeastern University at Qinhuangdao, Qinhuangdao, Hebei, P.R. China
| |
Collapse
|
37
|
Azad M, Chikalov I, Hussain S, Moshkov M, Zielosko B. Decision Rules Derived from Optimal Decision Trees with Hypotheses. ENTROPY 2021; 23:e23121641. [PMID: 34945947 PMCID: PMC8700404 DOI: 10.3390/e23121641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 10/14/2021] [Accepted: 12/02/2021] [Indexed: 11/24/2022]
Abstract
Conventional decision trees use queries each of which is based on one attribute. In this study, we also examine decision trees that handle additional queries based on hypotheses. This kind of query is similar to the equivalence queries considered in exact learning. Earlier, we designed dynamic programming algorithms for the computation of the minimum depth and the minimum number of internal nodes in decision trees that have hypotheses. Modification of these algorithms considered in the present paper permits us to build decision trees with hypotheses that are optimal relative to the depth or relative to the number of the internal nodes. We compare the length and coverage of decision rules extracted from optimal decision trees with hypotheses and decision rules extracted from optimal conventional decision trees to choose the ones that are preferable as a tool for the representation of information. To this end, we conduct computer experiments on various decision tables from the UCI Machine Learning Repository. In addition, we also consider decision tables for randomly generated Boolean functions. The collected results show that the decision rules derived from decision trees with hypotheses in many cases are better than the rules extracted from conventional decision trees.
Collapse
Affiliation(s)
- Mohammad Azad
- Department of Computer Science, College of Computer and Information Sciences, Jouf University, Sakaka 72441, Saudi Arabia;
| | - Igor Chikalov
- Intel Corporation, 5000 W Chandler Blvd, Chandler, AZ 85226, USA;
| | - Shahid Hussain
- Department of Computer Science, School of Mathematics and Computer Science, Institute of Business Administration, University Road, Karachi 75270, Pakistan;
| | - Mikhail Moshkov
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
- Correspondence:
| | - Beata Zielosko
- Institute of Computer Science, Faculty of Science and Technology, University of Silesia in Katowice, Będzińska 39, 41-200 Sosnowiec, Poland;
| |
Collapse
|
38
|
|
39
|
|
40
|
Guo J, Zhang J, Zhang Y, Xu P, Li L, Xie Z, Li Q. An improved density-based approach to risk assessment on railway investment. DATA TECHNOLOGIES AND APPLICATIONS 2021. [DOI: 10.1108/dta-11-2020-0291] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
PurposeDensity-based spatial clustering of applications with noise (DBSCAN) is the most commonly used density-based clustering algorithm, while it cannot be directly applied to the railway investment risk assessment. To overcome the shortcomings of calculation method and parameter limits of DBSCAN, this paper proposes a new algorithm called Improved Multiple Density-based Spatial clustering of Applications with Noise (IM-DBSCAN) based on the DBSCAN and rough set theory.Design/methodology/approachFirst, the authors develop an improved affinity propagation (AP) algorithm, which is then combined with the DBSCAN (hereinafter referred to as AP-DBSCAN for short) to improve the parameter setting and efficiency of the DBSCAN. Second, the IM-DBSCAN algorithm, which consists of the AP-DBSCAN and a modified rough set, is designed to investigate the railway investment risk. Finally, the IM-DBSCAN algorithm is tested on the China–Laos railway's investment risk assessment, and its performance is compared with other related algorithms.FindingsThe IM-DBSCAN algorithm is implemented on China–Laos railway's investment risk assessment and compares with other related algorithms. The clustering results validate that the AP-DBSCAN algorithm is feasible and efficient in terms of clustering accuracy and operating time. In addition, the experimental results also indicate that the IM-DBSCAN algorithm can be used as an effective method for the prospective risk assessment in railway investment.Originality/valueThis study proposes IM-DBSCAN algorithm that consists of the AP-DBSCAN and a modified rough set to study the railway investment risk. Different from the existing clustering algorithms, AP-DBSCAN put forward the density calculation method to simplify the process of optimizing DBSCAN parameters. Instead of using Euclidean distance approach, the cutoff distance method is introduced to improve the similarity measure for optimizing the parameters. The developed AP-DBSCAN is used to classify the China–Laos railway's investment risk indicators more accurately. Combined with a modified rough set, the IM-DBSCAN algorithm is proposed to analyze the railway investment risk assessment. The contributions of this study can be summarized as follows: (1) Based on AP, DBSCAN, an integrated methodology AP-DBSCAN, which considers improving the parameter setting and efficiency, is proposed to classify railway risk indicators. (2) As AP-DBSCAN is a risk classification model rather than a risk calculation model, an IM-DBSCAN algorithm that consists of the AP-DBSCAN and a modified rough set is proposed to assess the railway investment risk. (3) Taking the China–Laos railway as a real-life case study, the effectiveness and superiority of the proposed IM-DBSCAN algorithm are verified through a set of experiments compared with other state-of-the-art algorithms.
Collapse
|
41
|
|
42
|
Using Rough Set Theory to Find Minimal Log with Rule Generation. Symmetry (Basel) 2021. [DOI: 10.3390/sym13101906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Data pre-processing is a major difficulty in the knowledge discovery process, especially feature selection on a large amount of data. In literature, various approaches have been suggested to overcome this difficulty. Unlike most approaches, Rough Set Theory (RST) can discover data de-pendency and reduce the attributes without the need for further information. In RST, the discernibility matrix is the mathematical foundation for computing such reducts. Although it proved its efficiency in feature selection, unfortunately it is computationally expensive on high dimensional data. Algorithm complexity is related to the search of the minimal subset of attributes, which requires computing an exponential number of possible subsets. To overcome this limitation, many RST enhancements have been proposed. Contrary to recent methods, this paper implements RST concepts in an iterated manner using R language. First, the dataset was partitioned into a smaller number of subsets and each subset processed independently to generate its own minimal attribute set. Within the iterations, only minimal elements in the discernibility matrix were considered. Finally, the iterated outputs were compared, and those common among all reducts formed the minimal one (Core attributes). A comparison with another novel proposed algorithm using three benchmark datasets was performed. The proposed approach showed its efficiency in calculating the same minimal attribute sets with less execution time.
Collapse
|
43
|
Interval modelling in optimization of k‐NN classifiers for large number of attributes in data sets on an example of DNA microarrays. INT J INTELL SYST 2021. [DOI: 10.1002/int.22679] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
44
|
Huang Z, Li J. Multi-scale covering rough sets with applications to data classification. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107736] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
45
|
Liu X, Dai J, Chen J, Zhang C. A fuzzy α-similarity relation-based attribute reduction approach in incomplete interval-valued information systems. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107593] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
46
|
Xie X, Gu X, Li Y, Ji Z. K-size partial reduct: Positive region optimization for attribute reduction. Knowl Based Syst 2021. [DOI: 10.1016/j.knosys.2021.107253] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
|
47
|
Gul R, Shabir M. (α, β)-Multi-granulation bipolar fuzzified rough sets and their applications to multi criteria group decision making. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-210717] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Pawlak’s rough set theory based on single granulation has been extended to multi-granulation rough set structure in recent years. Multi-granulation rough set theory has become a flouring research direction in rough set theory. In this paper, we propose the notion of (α, β)-multi-granulation bipolar fuzzified rough set ((α, β)-MGBFRSs). For this purpose, a collection of bipolar fuzzy tolerance relations has been used. In the framework of multi-granulation, we proposed two types of (α, β)-multi-granulation bipolar fuzzified rough sets model. One is called the optimistic (α, β)-multi-granulation bipolar fuzzified rough sets ((α, β) o-MGBFRSs) and the other is called the pessimistic (α, β)-multi-granulation bipolar fuzzified rough sets ((α, β) p-MGBFRSs). Subsequently, a number of important structural properties and results of proposed models are investigated in detail. The relationships among the (α, β)-MGBFRSs, (α, β) o-MGBFRSs and (α, β) p-MGBFRSs are also established. In order to illustrate our proposed models, some examples are considered, which are helpful for applying this theory in practical issues. Moreover, several important measures associated with (α, β)-multi-granulation bipolar fuzzified rough set like the measure of accuracy, the measure of precision, and accuracy of approximation are presented. Finally, we construct a new approach to multi-criteria group decision-making method based on (α, β)-MGBFRSs, and the validity of this technique is illustrated by a practical application. Compared with the existing results, we also expound its advantages.
Collapse
Affiliation(s)
- Rizwan Gul
- Department of Mathematics, Quaid-i-Azam University, Islamabad, Pakistan
| | - Muhammad Shabir
- Department of Mathematics, Quaid-i-Azam University, Islamabad, Pakistan
| |
Collapse
|
48
|
Kusunoki Y, Błaszczyński J, Inuiguchi M, Słowiński R. Empirical risk minimization for dominance-based rough set approaches. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.02.043] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
49
|
Szczuko P, Kurowski A, Odya P, Czyżewski A, Kostek B, Graff B, Narkiewicz K. Mining Knowledge of Respiratory Rate Quantification and Abnormal Pattern Prediction. Cognit Comput 2021; 14:2120-2140. [PMID: 34276830 PMCID: PMC8272620 DOI: 10.1007/s12559-021-09908-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Accepted: 06/23/2021] [Indexed: 12/02/2022]
Abstract
The described application of granular computing is motivated because cardiovascular disease (CVD) remains a major killer globally. There is increasing evidence that abnormal respiratory patterns might contribute to the development and progression of CVD. Consequently, a method that would support a physician in respiratory pattern evaluation should be developed. Group decision-making, tri-way reasoning, and rough set–based analysis were applied to granular computing. Signal attributes and anthropomorphic parameters were explored to develop prediction models to determine the percentage contribution of periodic-like, intermediate, and normal breathing patterns in the analyzed signals. The proposed methodology was validated employing k-nearest neighbor (k-NN) and UMAP (uniform manifold approximation and projection). The presented approach applied to respiratory pattern evaluation shows that median accuracies in a considerable number of cases exceeded 0.75. Overall, parameters related to signal analysis are indicated as more important than anthropomorphic features. It was also found that obesity characterized by a high WHR (waist-to-hip ratio) and male sex were predisposing factors for the occurrence of periodic-like or intermediate patterns of respiration. It may be among the essential findings derived from this study. Based on classification measures, it may be observed that a physician may use such a methodology as a respiratory pattern evaluation-aided method.
Collapse
Affiliation(s)
- Piotr Szczuko
- Multimedia System Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland
| | - Adam Kurowski
- Multimedia System Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland.,Audio Acoustics Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland
| | - Piotr Odya
- Multimedia System Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland
| | - Andrzej Czyżewski
- Multimedia System Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland
| | - Bożena Kostek
- Audio Acoustics Department, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, 80-233 Gdańsk, Poland
| | - Beata Graff
- Department of Hypertension and Diabetology, Medical University of Gdansk, 80-210 Gdańsk, Poland
| | - Krzysztof Narkiewicz
- Department of Hypertension and Diabetology, Medical University of Gdansk, 80-210 Gdańsk, Poland
| |
Collapse
|
50
|
Aszalós L. Decompose Boolean Matrices with Correlation Clustering. ENTROPY 2021; 23:e23070852. [PMID: 34356393 PMCID: PMC8305536 DOI: 10.3390/e23070852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 06/14/2021] [Accepted: 06/28/2021] [Indexed: 11/22/2022]
Abstract
One of the tasks of data science is the decomposition of large matrices in order to understand their structures. A special case of this is when we decompose relations, i.e., logical matrices. In this paper, we present a method based on the similarity of rows and columns, which uses correlation clustering to cluster the rows and columns of the matrix, facilitating the visualization of the relation by rearranging the rows and columns. In this article, we compare our method with Gunther Schmidt’s problems and solutions. Our method produces the original solutions by selecting its parameters from a small set. However, with other parameters, it provides solutions with even lower entropy.
Collapse
Affiliation(s)
- László Aszalós
- Faculty of Informatics, University of Debrecen, Egyetem tér 1, 4032 Debrecen, Hungary
| |
Collapse
|