1
|
Mittal H, Pandey AC, Saraswat M, Kumar S, Pal R, Modwel G. A comprehensive survey of image segmentation: clustering methods, performance parameters, and benchmark datasets. MULTIMEDIA TOOLS AND APPLICATIONS 2021; 81:35001-35026. [PMID: 33584121 PMCID: PMC7870780 DOI: 10.1007/s11042-021-10594-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 01/07/2021] [Accepted: 01/21/2021] [Indexed: 06/12/2023]
Abstract
Image segmentation is an essential phase of computer vision in which useful information is extracted from an image that can range from finding objects while moving across a room to detect abnormalities in a medical image. As image pixels are generally unlabelled, the commonly used approach for the same is clustering. This paper reviews various existing clustering based image segmentation methods. Two main clustering methods have been surveyed, namely hierarchical and partitional based clustering methods. As partitional clustering is computationally better, further study is done in the perspective of methods belonging to this class. Further, literature bifurcates the partitional based clustering methods into three categories, namely K-means based methods, histogram-based methods, and meta-heuristic based methods. The survey of various performance parameters for the quantitative evaluation of segmentation results is also included. Further, the publicly available benchmark datasets for image-segmentation are briefed.
Collapse
Affiliation(s)
- Himanshu Mittal
- Jaypee Institute of Information Technology, Noida, Uttar Pradesh India
| | | | - Mukesh Saraswat
- Jaypee Institute of Information Technology, Noida, Uttar Pradesh India
| | - Sumit Kumar
- Amity University, Noida, Uttar Pradesh India
| | - Raju Pal
- Jaypee Institute of Information Technology, Noida, Uttar Pradesh India
| | - Garv Modwel
- Valeo India Private Limited, Chennai, Tamil Nadu India
| |
Collapse
|
2
|
Abstract
This paper presents a comprehensive survey of the meta-heuristic optimization algorithms on the text clustering applications and highlights its main procedures. These Artificial Intelligence (AI) algorithms are recognized as promising swarm intelligence methods due to their successful ability to solve machine learning problems, especially text clustering problems. This paper reviews all of the relevant literature on meta-heuristic-based text clustering applications, including many variants, such as basic, modified, hybridized, and multi-objective methods. As well, the main procedures of text clustering and critical discussions are given. Hence, this review reports its advantages and disadvantages and recommends potential future research paths. The main keywords that have been considered in this paper are text, clustering, meta-heuristic, optimization, and algorithm.
Collapse
|
3
|
Yuanyuan Zhang A, Minyu Feng B, Feng Chen C, Jürgen Kurths D. Adaptive clustering based on element-wised distance for distributed estimation over multi-task networks. CHAOS (WOODBURY, N.Y.) 2020; 30:053116. [PMID: 32491899 DOI: 10.1063/1.5141493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Accepted: 04/02/2020] [Indexed: 06/11/2023]
Abstract
In multitask networks, neighboring agents that belong to different clusters pursue different goals, and therefore arbitrary cooperation will lead to a degradation in estimation performance. In this paper, an adaptive clustering method is proposed for distributed estimation that enables agents to distinguish between subneighbors that belong to the same cluster and those that belong to a different cluster. This creates an appropriate degree of cooperation to improve parameter estimation accuracy, especially for the case where the prior information of a cluster is unknown. In contrast to the static and quantitative threshold that is imposed in traditional clustering methods, we devise a method for real-time clustering hypothesis detection, which is constructed through the use of a reliable adaptive clustering threshold as reference and the averaged element-wise distance between tasks as real-time clustering detection statistic. Meanwhile, we relax the clustering conditions to maintain maximum cooperation without sacrificing accuracy. Simulations are presented to compare the proposed algorithm and some traditional clustering strategies in both stationary and nonstationary environments. The effects of task difference on performance are also obtained to demonstrate the superiority of our proposed clustering strategy in terms of accuracy, robustness, and suitability.
Collapse
Affiliation(s)
- A Yuanyuan Zhang
- College of Artificial Intelligence, Southwest University, Chongqing 400715, China
| | - B Minyu Feng
- College of Artificial Intelligence, Southwest University, Chongqing 400715, China
| | - C Feng Chen
- College of Artificial Intelligence, Southwest University, Chongqing 400715, China
| | - D Jürgen Kurths
- Potsdam Institute for Climate Impact Research, 14473 Potsdam, Germany
| |
Collapse
|
4
|
Chander S, Vijaya P, Dhyani P. ADOFL: Multi-Kernel-Based Adaptive Directive Operative Fractional Lion Optimisation Algorithm for Data Clustering. JOURNAL OF INTELLIGENT SYSTEMS 2018. [DOI: 10.1515/jisys-2016-0175] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
The progress of databases in fields such as medical, business, education, marketing, etc., is colossal because of the developments in information technology. Knowledge discovery from such concealed bulk databases is a tedious task. For this, data mining is one of the promising solutions and clustering is one of its applications. The clustering process groups the data objects related to each other in a similar cluster and diverse objects in another cluster. The literature presents many clustering algorithms for data clustering. Optimisation-based clustering algorithm is one of the recently developed algorithms for the clustering process to discover the optimal cluster based on the objective function. In our previous method, direct operative fractional lion optimisation algorithm was proposed for data clustering. In this paper, we designed a new clustering algorithm called adaptive decisive operative fractional lion (ADOFL) optimisation algorithm based on multi-kernel function. Moreover, a new fitness function called multi-kernel WL index is proposed for the selection of the best centroid point for clustering. The experimentation of the proposed ADOFL algorithm is carried out over two benchmarked datasets, Iris and Wine. The performance of the proposed ADOFL algorithm is validated over existing clustering algorithms such as particle swarm clustering (PSC) algorithm, modified PSC algorithm, lion algorithm, fractional lion algorithm, and DOFL. The result shows that the maximum clustering accuracy of 79.51 is obtained by the proposed method in data clustering.
Collapse
|
5
|
Pakrashi A, Chaudhuri BB. A Kalman filtering induced heuristic optimization based partitional data clustering. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.07.057] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
6
|
|
7
|
Yang C, Ji J, Liu J, Liu J, Yin B. Structural learning of Bayesian networks by bacterial foraging optimization. Int J Approx Reason 2016. [DOI: 10.1016/j.ijar.2015.11.003] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
8
|
Cui Z, Yuan G, Sheng Z, Liu W, Wang X, Duan X. A Modified BFGS Formula Using a Trust Region Model for Nonsmooth Convex Minimizations. PLoS One 2015; 10:e0140606. [PMID: 26501775 PMCID: PMC4621044 DOI: 10.1371/journal.pone.0140606] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 09/27/2015] [Indexed: 11/25/2022] Open
Abstract
This paper proposes a modified BFGS formula using a trust region model for solving nonsmooth convex minimizations by using the Moreau-Yosida regularization (smoothing) approach and a new secant equation with a BFGS update formula. Our algorithm uses the function value information and gradient value information to compute the Hessian. The Hessian matrix is updated by the BFGS formula rather than using second-order information of the function, thus decreasing the workload and time involved in the computation. Under suitable conditions, the algorithm converges globally to an optimal solution. Numerical results show that this algorithm can successfully solve nonsmooth unconstrained convex problems.
Collapse
Affiliation(s)
- Zengru Cui
- Guangxi Colleges and Universities Key Laboratory of Mathematics and Its Applications, College of Mathematics and Information Science, Guangxi University, Nanning, Guangxi 530004, China
| | - Gonglin Yuan
- Guangxi Colleges and Universities Key Laboratory of Mathematics and Its Applications, College of Mathematics and Information Science, Guangxi University, Nanning, Guangxi 530004, China
- School of Computer and Software, Nanjing University of Information Science & Technology, Nanjing 210044, China
- * E-mail:
| | - Zhou Sheng
- Guangxi Colleges and Universities Key Laboratory of Mathematics and Its Applications, College of Mathematics and Information Science, Guangxi University, Nanning, Guangxi 530004, China
| | - Wenjie Liu
- School of Computer and Software, Nanjing University of Information Science & Technology, Nanjing 210044, China
- Jiangsu Engineering Center of Network Monitoring, Nanjing University of Information Science & Technology, Nanjing 210044, China
| | - Xiaoliang Wang
- Guangxi Colleges and Universities Key Laboratory of Mathematics and Its Applications, College of Mathematics and Information Science, Guangxi University, Nanning, Guangxi 530004, China
| | - Xiabin Duan
- Guangxi Colleges and Universities Key Laboratory of Mathematics and Its Applications, College of Mathematics and Information Science, Guangxi University, Nanning, Guangxi 530004, China
| |
Collapse
|
9
|
Forsati R, Keikha A, Shamsfard M. An improved bee colony optimization algorithm with an application to document clustering. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.02.048] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
10
|
Ji J, Pang W, Zheng Y, Wang Z, Ma Z. A novel artificial bee colony based clustering algorithm for categorical data. PLoS One 2015; 10:e0127125. [PMID: 25993469 PMCID: PMC4439097 DOI: 10.1371/journal.pone.0127125] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 04/11/2015] [Indexed: 11/19/2022] Open
Abstract
Data with categorical attributes are ubiquitous in the real world. However, existing partitional clustering algorithms for categorical data are prone to fall into local optima. To address this issue, in this paper we propose a novel clustering algorithm, ABC-K-Modes (Artificial Bee Colony clustering based on K-Modes), based on the traditional k-modes clustering algorithm and the artificial bee colony approach. In our approach, we first introduce a one-step k-modes procedure, and then integrate this procedure with the artificial bee colony approach to deal with categorical data. In the search process performed by scout bees, we adopt the multi-source search inspired by the idea of batch processing to accelerate the convergence of ABC-K-Modes. The performance of ABC-K-Modes is evaluated by a series of experiments in comparison with that of the other popular algorithms for categorical data.
Collapse
Affiliation(s)
- Jinchao Ji
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, China
- Key Lab of Intelligent Information Processing of Jilin Universities, Northeast Normal University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Wei Pang
- School of Natural and Computing Sciences, University of Aberdeen, Aberdeen, United Kingdom
| | - Yanlin Zheng
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, China
- Key Lab of Intelligent Information Processing of Jilin Universities, Northeast Normal University, Changchun, China
| | - Zhe Wang
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
- College of Computer Science and Technology, Jilin University, Changchun, China
| | - Zhiqiang Ma
- School of Computer Science and Information Technology, Northeast Normal University, Changchun, China
- Key Lab of Intelligent Information Processing of Jilin Universities, Northeast Normal University, Changchun, China
| |
Collapse
|
11
|
Bermejo E, Cordón O, Damas S, Santamaría J. A comparative study on the application of advanced bacterial foraging models to image registration. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2014.10.018] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
12
|
Abstract
As an important boundary-based clustering algorithm, support vector clustering (SVC) can benefit many real applications owing to its capability of handling arbitrary cluster shapes, especially those directly or indirectly related to pattern exploration and description. As the application deepens, the importance of performance (i.e. criterions of accuracy and efficiency) of SVC increases. To identify gaps in the current methods and propose novel research directions for SVC, we present a survey of the literature in this area. Our approach is to classify the most recent advances into either theory or application. For theoretical contributions, advances related to parameter selection and optimization, dual-problem solutions, and cluster labeling are introduced. We also simultaneously summarize the advantages and drawbacks of each study. With respect to applications, we clearly describe eight groups of schemes based on SVC, either as individual or hybrid methods. Finally, we identify the gaps in SVC research and suggest several future research issues and trends.
Collapse
Affiliation(s)
- Huina Li
- School of Information Engineering, Xuchang University, Xuchang 461000, P. R. China
| | - Yuan Ping
- School of Information Engineering, Xuchang University, Xuchang 461000, P. R. China
| |
Collapse
|
13
|
A Clustering Approach for the l-Diversity Model in Privacy Preserving Data Mining Using Fractional Calculus-Bacterial Foraging Optimization Algorithm. ACTA ACUST UNITED AC 2014. [DOI: 10.1155/2014/396529] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In privacy preserving data mining, the l-diversity and k-anonymity models are the most widely used for preserving the sensitive private information of an individual. Out of these two, l-diversity model gives better privacy and lesser information loss as compared to the k-anonymity model. In addition, we observe that numerous clustering algorithms have been proposed in data mining, namely, k-means, PSO, ACO, and BFO. Amongst them, the BFO algorithm is more stable and faster as compared to all others except k-means. However, BFO algorithm suffers from poor convergence behavior as compared to other optimization algorithms. We also observed that the current literature lacks any approaches that apply BFO with l-diversity model to realize privacy preservation in data mining. Motivated by this observation, we propose here an approach that uses fractional calculus (FC) in the chemotaxis step of the BFO algorithm. The FC is used to boost the computational performance of the algorithm. We also evaluate our proposed FC-BFO and BFO algorithms empirically, focusing on information loss and execution time as vital metrics. The experimental evaluation shows that our proposed FC-BFO algorithm derives an optimal cluster as compared to the original BFO algorithm and existing clustering algorithms.
Collapse
|
14
|
Chiu DY, Pan YC. Topic knowledge map and knowledge structure constructions with genetic algorithm, information retrieval, and multi-dimension scaling method. Knowl Based Syst 2014. [DOI: 10.1016/j.knosys.2014.03.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
15
|
Foraging Agent Swarm Optimization with Applications in Data Clustering. LECTURE NOTES IN COMPUTER SCIENCE 2014. [DOI: 10.1007/978-3-319-09952-1_21] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
16
|
Zhang Y, Wen J, Wang X, Jiang Z. Semi-supervised hybrid clustering by integrating Gaussian mixture model and distance metric learning. J Intell Inf Syst 2013. [DOI: 10.1007/s10844-013-0264-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|