Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Jegatha Deborah L, Baskaran R, Kannan A. A Survey on Internal Validity Measure for Cluster Validation. ACTA ACUST UNITED AC 2010. [DOI: 10.5121/ijcses.2010.1207] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Number

Cited by Other Article(s)

Hector I, Panjanathan R. Predictive maintenance in Industry 4.0: a survey of planning models and machine learning techniques. PeerJ Comput Sci 2024;10:e2016. [PMID: 38855197 PMCID: PMC11157603 DOI: 10.7717/peerj-cs.2016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 04/02/2024] [Indexed: 06/11/2024]

Abstract

Equipment downtime resulting from maintenance in various sectors around the globe has become a major concern. The effectiveness of conventional reactive maintenance methods in addressing interruptions and enhancing operational efficiency has become inadequate. Therefore, acknowledging the constraints associated with reactive maintenance and the growing need for proactive approaches to proactively detect possible breakdowns is necessary. The need for optimisation of asset management and reduction of costly downtime emerges from the demand for industries. The work highlights the use of Internet of Things (IoT)-enabled Predictive Maintenance (PdM) as a revolutionary strategy across many sectors. This article presents a picture of a future in which the use of IoT technology and sophisticated analytics will enable the prediction and proactive mitigation of probable equipment failures. This literature study has great importance as it thoroughly explores the complex steps and techniques necessary for the development and implementation of efficient PdM solutions. The study offers useful insights into the optimisation of maintenance methods and the enhancement of operational efficiency by analysing current information and approaches. The article outlines essential stages in the application of PdM, encompassing underlying design factors, data preparation, feature selection, and decision modelling. Additionally, the study discusses a range of ML models and methodologies for monitoring conditions. In order to enhance maintenance plans, it is necessary to prioritise ongoing study and improvement in the field of PdM. The potential for boosting PdM skills and guaranteeing the competitiveness of companies in the global economy is significant through the incorporation of IoT, Artificial Intelligence (AI), and advanced analytics.

Collapse

Forbes O, Santos-Fernandez E, Wu PPY, Xie HB, Schwenn PE, Lagopoulos J, Mills L, Sacks DD, Hermens DF, Mengersen K. clusterBMA: Bayesian model averaging for clustering. PLoS One 2023;18:e0288000. [PMID: 37603575 PMCID: PMC10441802 DOI: 10.1371/journal.pone.0288000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 06/16/2023] [Indexed: 08/23/2023] Open

Abstract

Various methods have been developed to combine inference across multiple sets of results for unsupervised clustering, within the ensemble clustering literature. The approach of reporting results from one 'best' model out of several candidate clustering models generally ignores the uncertainty that arises from model selection, and results in inferences that are sensitive to the particular model and parameters chosen. Bayesian model averaging (BMA) is a popular approach for combining results across multiple models that offers some attractive benefits in this setting, including probabilistic interpretation of the combined cluster structure and quantification of model-based uncertainty. In this work we introduce clusterBMA, a method that enables weighted model averaging across results from multiple unsupervised clustering algorithms. We use clustering internal validation criteria to develop an approximation of the posterior model probability, used for weighting the results from each model. From a combined posterior similarity matrix representing a weighted average of the clustering solutions across models, we apply symmetric simplex matrix factorisation to calculate final probabilistic cluster allocations. In addition to outperforming other ensemble clustering methods on simulated data, clusterBMA offers unique features including probabilistic allocation to averaged clusters, combining allocation probabilities from 'hard' and 'soft' clustering algorithms, and measuring model-based uncertainty in averaged cluster allocation. This method is implemented in an accompanying R package of the same name. We use simulated datasets to explore the ability of the proposed technique to identify robust integrated clusters with varying levels of separation between subgroups, and with varying numbers of clusters between models. Benchmarking accuracy against four other ensemble methods previously demonstrated to be highly effective in the literature, clusterBMA matches or exceeds the performance of competing approaches under various conditions of dimensionality and cluster separation. clusterBMA substantially outperformed other ensemble methods for high dimensional simulated data with low cluster separation, with 1.16 to 7.12 times better performance as measured by the Adjusted Rand Index. We also explore the performance of this approach through a case study that aims to identify probabilistic clusters of individuals based on electroencephalography (EEG) data. In applied settings for clustering individuals based on health data, the features of probabilistic allocation and measurement of model-based uncertainty in averaged clusters are useful for clinical relevance and statistical communication.

Collapse

Rodríguez Vega G, Zaldívar Colado U, Zaldívar Colado XP, Rodríguez Vega DA, de la Vega Bustillos EJ. Comparison of univariate and multivariate anthropometric accommodation of the northwest Mexico population. ERGONOMICS 2021;64:1018-1034. [PMID: 33683180 DOI: 10.1080/00140139.2021.1892832] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 02/15/2021] [Indexed: 06/12/2023]

Abstract

ABSTRCTErgonomic workstation design is crucial to prevent work-related musculoskeletal disorders. Many researchers have proposed multivariate analysis for human accommodation. However, no multivariate anthropometric analysis exists for the Mexican population. This study compares common multivariate human accommodation approaches (e.g. principal component and archetypal analyses) and clustering techniques (e.g. k-means and Ward's algorithm) with the classical percentile-based univariate accommodation approach, using the Chi-squared goodness-of-fit test and the McNemar's test. The theoretical accommodation percentage obtained by multivariate approaches was higher than those obtained by the percentile univariate approach considering the central 98% data. k-means and archetypal analysis obtained similar and the highest accommodation values, followed by Ward's algorithm and principal component analysis. The study findings can be deployed to assess the design of workstations in Mexico, such as electronic components assembly and crew designs, and the effects of different anthropometric measurements in human accommodation. Practitioner summary: Products and workplaces design are commonly based on the classical univariate approach, using the extreme percentiles. In this study, multivariate approaches were tested on dimensions for sitting workstations, and results showed a bigger accommodation level in comparison to the univariate 1%-99% approaches. Abbreviations: RHM: representative human model; DHM: digital human model; PCA: principal component analysis; AA: archetypal analysis (AA); PCs: principal components; FA: factor analysis; RSS: residual sum of squares; SSE: sum of squared estimated errors; WA: Ward's algorithm; DBI: Davies-Bouldin index; CHI: Calinski-Harabaz index; SI: silhouette index; SH: sitting height; EHS: eye height, sitting; AHS: acromial height, sitting; PH: popliteal height; KHS: knee height, sitting; BPL: buttock-popliteal length; BKL: buttock-knee length; FGR: functional grip reach; AD: anthropometric dimension; E: expected; A: achieved.

Collapse

Key moment extraction for designing an agglomerative clustering algorithm-based video summarization framework. Neural Comput Appl 2021. [DOI: 10.1007/s00521-021-06132-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

College Students' Perception of Current and Projected 30-Year Cardiovascular Disease Risk Using Cluster Analysis with Internal Validation. J Community Health 2018;44:500-506. [PMID: 30554296 DOI: 10.1007/s10900-018-00609-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

A new clustering validity index for arbitrary shape of clusters. Pattern Recognit Lett 2018. [DOI: 10.1016/j.patrec.2018.08.005] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Integrating cluster validity indices based on data envelopment analysis. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2017.11.052] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abualhaj B, Weng G, Ong M, Attarwala AA, Molina F, Büsing K, Glatting G. Comparison of five cluster validity indices performance in brain [¹⁸ F]FET-PET image segmentation using k-means. Med Phys 2017;44:209-220. [PMID: 28102943 DOI: 10.1002/mp.12025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 11/15/2016] [Accepted: 11/16/2016] [Indexed: 11/11/2022] Open

Abstract

PURPOSE

Dynamic [¹⁸ F]fluoro-ethyl-L-tyrosine positron emission tomography ([¹⁸ F]FET-PET) is used to identify tumor lesions for radiotherapy treatment planning, to differentiate glioma recurrence from radiation necrosis and to classify gliomas grading. To segment different regions in the brain k-means cluster analysis can be used. The main disadvantage of k-means is that the number of clusters must be pre-defined. In this study, we therefore compared different cluster validity indices for automated and reproducible determination of the optimal number of clusters based on the dynamic PET data.

METHODS

The k-means algorithm was applied to dynamic [¹⁸ F]FET-PET images of 8 patients. Akaike information criterion (AIC), WB, I, modified Dunn's and Silhouette indices were compared on their ability to determine the optimal number of clusters based on requirements for an adequate cluster validity index. To check the reproducibility of k-means, the coefficients of variation CVs of the objective function values OFVs (sum of squared Euclidean distances within each cluster) were calculated using 100 random centroid initialization replications RCI₁₀₀ for 2 to 50 clusters. k-means was performed independently on three neighboring slices containing tumor for each patient to investigate the stability of the optimal number of clusters within them. To check the independence of the validity indices on the number of voxels, cluster analysis was applied after duplication of a slice selected from each patient. CVs of index values were calculated at the optimal number of clusters using RCI₁₀₀ to investigate the reproducibility of the validity indices. To check if the indices have a single extremum, visual inspection was performed on the replication with minimum OFV from RCI₁₀₀ .

RESULTS

The maximum CV of OFVs was 2.7 × 10^-2 from all patients. The optimal number of clusters given by modified Dunn's and Silhouette indices was 2 or 3 leading to a very poor segmentation. WB and I indices suggested in median 5, [range 4-6] and 4, [range 3-6] clusters, respectively. For WB, I, modified Dunn's and Silhouette validity indices the suggested optimal number of clusters was not affected by the number of the voxels. The maximum coefficient of variation of WB, I, modified Dunn's, and Silhouette validity indices were 3 × 10^-2 , 1, 2 × 10^-1 and 3 × 10^-3 , respectively. WB-index showed a single global maximum, whereas the other indices showed also local extrema.

CONCLUSION

From the investigated cluster validity indices, the WB-index is best suited for automated determination of the optimal number of clusters for [¹⁸ F]FET-PET brain images for the investigated image reconstruction algorithm and the used scanner: it yields meaningful results allowing better differentiation of tissues with higher number of clusters, it is simple, reproducible and has an unique global minimum.

Collapse

Ghosh A, De RK. Identification of certain cancer-mediating genes using Gaussian fuzzy cluster validity index. J Biosci 2015;40:741-54. [PMID: 26564976 DOI: 10.1007/s12038-015-9557-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

A formal algorithm for verifying the validity of clustering results based on model checking. PLoS One 2014;9:e90109. [PMID: 24608823 PMCID: PMC3946478 DOI: 10.1371/journal.pone.0090109] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2013] [Accepted: 01/30/2014] [Indexed: 11/19/2022] Open

Canonical PSO Based K-Means Clustering Approach for Real Datasets. INTERNATIONAL SCHOLARLY RESEARCH NOTICES 2014;2014:414013. [PMID: 27355083 PMCID: PMC4897525 DOI: 10.1155/2014/414013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2014] [Revised: 09/19/2014] [Accepted: 10/02/2014] [Indexed: 11/18/2022]

Fazel Zarandi M, Doostparast Torshizi A, Turksen I, Rezaee B. A new indirect approach to the type-2 fuzzy systems modeling and design. Inf Sci (N Y) 2013. [DOI: 10.1016/j.ins.2012.12.017] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

De Mulder W. Optimal clustering in the context of overlapping cluster analysis. Inf Sci (N Y) 2013. [DOI: 10.1016/j.ins.2012.09.051] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]