Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Rodriguez MZ, Comin CH, Casanova D, Bruno OM, Amancio DR, Costa LDF, Rodrigues FA. Clustering algorithms: A comparative approach. PLoS One 2019;14:e0210236. [PMID: 30645617 PMCID: PMC6333366 DOI: 10.1371/journal.pone.0210236] [Citation(s) in RCA: 121] [Impact Index Per Article: 24.2] [Received: 12/26/2016] [Accepted: 12/19/2018] [Indexed: 12/04/2022] Open

Number

Cited by Other Article(s)

101

Cecilia JM, Cano JC, Morales-García J, Llanes A, Imbernón B. Evaluation of Clustering Algorithms on GPU-Based Edge Computing Platforms. SENSORS 2020;20:s20216335. [PMID: 33172017 PMCID: PMC7664181 DOI: 10.3390/s20216335] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 10/12/2020] [Revised: 10/30/2020] [Accepted: 11/03/2020] [Indexed: 11/16/2022]

Abstract

Internet of Things (IoT) is becoming a new socioeconomic revolution in which data and immediacy are the main ingredients. IoT generates large datasets on a daily basis but it is currently considered as "dark data", i.e., data generated but never analyzed. The efficient analysis of this data is mandatory to create intelligent applications for the next generation of IoT applications that benefits society. Artificial Intelligence (AI) techniques are very well suited to identifying hidden patterns and correlations in this data deluge. In particular, clustering algorithms are of the utmost importance for performing exploratory data analysis to identify a set (a.k.a., cluster) of similar objects. Clustering algorithms are computationally heavy workloads and require to be executed on high-performance computing clusters, especially to deal with large datasets. This execution on HPC infrastructures is an energy hungry procedure with additional issues, such as high-latency communications or privacy. Edge computing is a paradigm to enable light-weight computations at the edge of the network that has been proposed recently to solve these issues. In this paper, we provide an in-depth analysis of emergent edge computing architectures that include low-power Graphics Processing Units (GPUs) to speed-up these workloads. Our analysis includes performance and power consumption figures of the latest Nvidia's AGX Xavier to compare the energy-performance ratio of these low-cost platforms with a high-performance cloud-based counterpart version. Three different clustering algorithms (i.e., k-means, Fuzzy Minimals (FM), and Fuzzy C-Means (FCM)) are designed to be optimally executed on edge and cloud platforms, showing a speed-up factor of up to 11× for the GPU code compared to sequential counterpart versions in the edge platforms and energy savings of up to 150% between the edge computing and HPC platforms.

102

Blumenberg L, Ruggles KV. Hypercluster: a flexible tool for parallelized unsupervised clustering optimization. BMC Bioinformatics 2020;21:428. [PMID: 32993491 PMCID: PMC7525959 DOI: 10.1186/s12859-020-03774-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 04/19/2020] [Accepted: 09/22/2020] [Indexed: 12/24/2022] Open

103

Vaura FC, Salomaa VV, Kantola IM, Kaaja R, Lahti L, Niiranen TJ. Unsupervised hierarchical clustering identifies a metabolically challenged subgroup of hypertensive individuals. J Clin Hypertens (Greenwich) 2020;22:1546-1553. [PMID: 33460260 PMCID: PMC8029868 DOI: 10.1111/jch.13984] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 05/18/2020] [Revised: 06/15/2020] [Accepted: 06/27/2020] [Indexed: 11/29/2022]

Abstract

The current classification of hypertension does not reflect the heterogeneity in characteristics or cardiovascular outcomes of hypertensive individuals. Our objective was to identify distinct phenotypes of hypertensive individuals with potentially different cardiovascular risk profiles using data-driven cluster analysis. We performed clustering, a procedure that identifies groups with similar characteristics, in 3726 individuals (mean age 59.4 years, 49% women) with grade 2 hypertension (blood pressure ≥160/100 mmHg or antihypertensive medication) selected from FINRISK 1997, 2002, and 2007 cohorts. We computed clusters based on eight factors associated with hypertension: mean arterial pressure, pulse pressure, non-high-density lipoprotein cholesterol, blood glucose, BMI, C-reactive protein, estimated glomerular filtration rate, and alcohol. After that, we used Cox regression models adjusted for age and sex to assess the relative risk of cardiovascular disease (CVD) outcomes between the clusters and a reference group of 11 020 individuals. We observed two comparable clusters in both men and women. The Metabolically Challenged (MC) cluster was characterized by high blood glucose (Z-score 4.4 ± 1.1 vs 0.2 ± 0.8, men; 3.5 ± 1.1 vs 0.0 ± 0.6, women) and elevated BMI (30.4 ± 4.1 vs 28.9 ± 4.3, men; 32.7 ± 4.9 vs 29.3 ± 5.5, women). Over a 10-year follow-up (1034 CVD events), MC had 1.6-fold (95% CI 1.1-2.4) CVD risk compared to non-MC and 2.5-fold (95% CI 1.7-3.7) CVD risk compared to the reference group (P ≤ .009 for both). Using unsupervised hierarchical clustering, we found two phenotypically distinct hypertension subgroups with different risks of CVD complications. This substratification could be used to design studies that explore the differential effects of antihypertensive therapies among subgroups of hypertensive individuals.

104

Kumar S, Suhaib M, Asjad M. Narrowing the barriers to Industry 4.0 practices through PCA-Fuzzy AHP-K means. JOURNAL OF ADVANCES IN MANAGEMENT RESEARCH 2020. [DOI: 10.1108/jamr-06-2020-0098] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 11/17/2022]

Abstract

PURPOSE

The study aims to analyze the barriers in the adoption of Industry 4.0 (I4.0) practices in terms of prioritization, cluster formation and clustering of empirical responses, and then narrowing them with identification of the most influential barriers for further managerial implications in the adoption of I4.0 practices by developing an enhanced understanding of I4.0.

DESIGN/METHODOLOGY/APPROACH

For the survey-based empirical research, barriers to I4.0 are synthesized from the review of relevant literature and further discussions with academicians and industry persons. Three widely acclaimed statistical techniques, viz. principal component analysis (PCA), fuzzy analytical hierarchical process (fuzzy AHP) and K-means clustering are applied.

FINDINGS

The novel integrated approach shows that lack of transparent cost-benefit analysis with clear comprehension about benefits is the major barrier for the adoption of I4.0, followed by “IT infrastructure,” “Missing standards,” “Lack of properly skilled manpower,” “Fitness of present machines/equipment in the new regime” and “Concern to data security” which are other prominent barriers in adoption of I4.0 practices. The availability of funds, transparent cost-benefit analysis and clear comprehension about benefits will motivate the business owners to adopt it, overcoming the other barriers.

RESEARCH LIMITATIONS/IMPLICATIONS

The present study brings out the new fundamental insights from the barriers to I4.0. The new insights developed here will be helpful for managers and policymakers to understand the concept and barriers hindering its smooth implementation. The factors identified are the major thrust areas for a manager to focus on for the smooth implementation of I4.0 practices. The removal of these barriers will act as a booster in the way of implementing I4.0. Real-world testing of findings is not available yet, and this will be the new direction for further research.

PRACTICAL IMPLICATIONS

The new production paradigm is highly complex and evolving. The study will act as a handy tool for the implementing manager for what to push first and what to push later while implementing the I4.0 practices. It will also empower a manager to assess the implementation capabilities of the industry in advance.

ORIGINALITY/VALUE

PCA, fuzzy AHP and K-means are deployed for identifying the significant barriers to I4.0 for the first time. The paper is the result of the original conceptual work of integrating the three techniques in the domain of prioritizing and narrowing the barriers from 16 to 6.

105

Hwang Y, Um JS, Schlüter S. Evaluating the Mutual Relationship between IPAT/Kaya Identity Index and ODIAC-Based GOSAT Fossil-Fuel CO₂ Flux: Potential and Constraints in Utilizing Decomposed Variables. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020;17:ijerph17165976. [PMID: 32824606 PMCID: PMC7459989 DOI: 10.3390/ijerph17165976] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 06/15/2020] [Revised: 08/12/2020] [Accepted: 08/14/2020] [Indexed: 11/21/2022]

106

Mustafa HMJ, Ayob M, Albashish D, Abu-Taleb S. Solving text clustering problem using a memetic differential evolution algorithm. PLoS One 2020;15:e0232816. [PMID: 32525869 PMCID: PMC7289410 DOI: 10.1371/journal.pone.0232816] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 08/14/2019] [Accepted: 04/22/2020] [Indexed: 12/03/2022] Open

107

Gong Z, Cai T, Thill JC, Hale S, Graham M. Measuring relative opinion from location-based social media: A case study of the 2016 U.S. presidential election. PLoS One 2020;15:e0233660. [PMID: 32442212 PMCID: PMC7244148 DOI: 10.1371/journal.pone.0233660] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 10/29/2019] [Accepted: 05/10/2020] [Indexed: 11/19/2022] Open

108

Bremer PL, De Boer D, Alvarado W, Martinez X, Sorin EJ. Overcoming the Heuristic Nature of k-Means Clustering: Identification and Characterization of Binding Modes from Simulations of Molecular Recognition Complexes. J Chem Inf Model 2020;60:3081-3092. [PMID: 32383869 DOI: 10.1021/acs.jcim.9b01137] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 12/22/2022]

109

Profiling of Chlorogenic Acids from Bidens pilosa and Differentiation of Closely Related Positional Isomers with the Aid of UHPLC-QTOF-MS/MS-Based In-Source Collision-Induced Dissociation. Metabolites 2020;10:metabo10050178. [PMID: 32365739 PMCID: PMC7281500 DOI: 10.3390/metabo10050178] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Received: 03/11/2020] [Revised: 04/20/2020] [Accepted: 04/21/2020] [Indexed: 12/14/2022] Open

110

Nwadiugwu MC. Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH. Bioinform Biol Insights 2020;14:1177932220909851. [PMID: 32284672 PMCID: PMC7133071 DOI: 10.1177/1177932220909851] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 01/29/2020] [Accepted: 02/02/2020] [Indexed: 11/17/2022] Open

111

Licen S, Di Gilio A, Palmisani J, Petraccone S, de Gennaro G, Barbieri P. Pattern Recognition and Anomaly Detection by Self-Organizing Maps in a Multi Month E-nose Survey at an Industrial Site. SENSORS 2020;20:s20071887. [PMID: 32235302 PMCID: PMC7180849 DOI: 10.3390/s20071887] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 02/05/2020] [Revised: 03/21/2020] [Accepted: 03/23/2020] [Indexed: 11/29/2022]

112

Brito ACM, Silva FN, Amancio DR. A complex network approach to political analysis: Application to the Brazilian Chamber of Deputies. PLoS One 2020;15:e0229928. [PMID: 32191720 PMCID: PMC7081992 DOI: 10.1371/journal.pone.0229928] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 09/14/2019] [Accepted: 02/17/2020] [Indexed: 11/29/2022] Open

113

Personalized prediction of smartphone-based psychotherapeutic micro-intervention success using machine learning. J Affect Disord 2020;264:430-437. [PMID: 31787419 DOI: 10.1016/j.jad.2019.11.071] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 06/14/2019] [Revised: 09/18/2019] [Accepted: 11/12/2019] [Indexed: 12/29/2022]

Abstract

BACKGROUND

Tailoring healthcare to patients' individual needs is a central goal of precision medicine. Combining smartphone-based interventions with machine learning approaches may help attaining this goal. The aim of our study was to explore the predictability of the success of smartphone-based psychotherapeutic micro-interventions in eliciting mood changes using machine learning.

METHODS

Participants conducted daily smartphone-based psychotherapeutic micro-interventions, guided by short video clips, for 13 consecutive days. Participants chose one of four intervention techniques used in psychotherapeutic approaches. Mood changes were assessed using the Multidimensional Mood State Questionnaire. Micro-intervention success was predicted using random forest (RF) tree-based mixed-effects logistic regression models. Data from 27 participants were used, totaling 324 micro-interventions, randomly split 100 times into training and test samples, using within-subject and between-subject sampling.

RESULTS

Mood improved from pre- to post-intervention in 137 sessions (initial success-rate: 42.3%). The RF approach resulted in predictions of micro-intervention success significantly better than the initial success-rate within and between subjects (positive predictive value: 0.732 (95%-CI: 0.607; 0.820) and 0.698 (95%-CI: 0.564; 0.805), respectively). Prediction quality was highest using the RF approach within subjects (rand accuracy: 0.75 (95%-CI: 0.641; 0.840), Matthew's correlation coefficient: 0.483 (95%-CI: 0.323; 0.723)).

LIMITATIONS

The RF approach does not allow firm conclusions about the exact contribution of each factor to the algorithm's predictions. We included a limited number of predictors and did not compare whether predictability differed between psychotherapeutic techniques.

CONCLUSIONS

Our findings may pave the way for translation and encourage scrutinizing personalized prediction in the psychotherapeutic context to improve treatment efficacy.

114

Kotiang S, Eslami A. A probabilistic graphical model for system-wide analysis of gene regulatory networks. Bioinformatics 2020;36:3192-3199. [DOI: 10.1093/bioinformatics/btaa122] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 06/28/2019] [Revised: 01/15/2020] [Accepted: 02/18/2020] [Indexed: 01/28/2023] Open

Abstract

MOTIVATION

The inference of gene regulatory networks (GRNs) from DNA microarray measurements forms a core element of systems biology-based phenotyping. In the recent past, numerous computational methodologies have been formalized to enable the deduction of reliable and testable predictions in today’s biology. However, little focus has been aimed at quantifying how well existing state-of-the-art GRNs correspond to measured gene-expression profiles.

RESULTS

Here, we present a computational framework that combines the formulation of probabilistic graphical modeling, standard statistical estimation, and integration of high-throughput biological data to explore the global behavior of biological systems and the global consistency between experimentally verified GRNs and corresponding large microarray compendium data. The model is represented as a probabilistic bipartite graph, which can handle highly complex network systems and accommodates partial measurements of diverse biological entities, e.g. messenger RNAs, proteins, metabolites and various stimulators participating in regulatory networks. This method was tested on microarray expression data from the M3D database, corresponding to sub-networks on one of the best researched model organisms, Escherichia coli. Results show a surprisingly high correlation between the observed states and the inferred system’s behavior under various experimental conditions.

AVAILABILITY AND IMPLEMENTATION

Processed data and software implementation using Matlab are freely available at https://github.com/kotiang54/PgmGRNs. Full dataset available from the M3D database.

115

Weißer T, Saßmannshausen T, Ohrndorf D, Burggräf P, Wagner J. A clustering approach for topic filtering within systematic literature reviews. MethodsX 2020;7:100831. [PMID: 32195145 PMCID: PMC7078380 DOI: 10.1016/j.mex.2020.100831] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 10/07/2019] [Accepted: 02/12/2020] [Indexed: 11/23/2022] Open

116

Rich-Griffin C, Stechemesser A, Finch J, Lucas E, Ott S, Schäfer P. Single-Cell Transcriptomics: A High-Resolution Avenue for Plant Functional Genomics. TRENDS IN PLANT SCIENCE 2020;25:186-197. [PMID: 31780334 DOI: 10.1016/j.tplants.2019.10.008] [Citation(s) in RCA: 93] [Impact Index Per Article: 23.3] [Received: 06/03/2019] [Revised: 09/30/2019] [Accepted: 10/17/2019] [Indexed: 05/19/2023]

117

Kabir KL, Akhter N, Shehu A. From molecular energy landscapes to equilibrium dynamics via landscape analysis and markov state models. J Bioinform Comput Biol 2020;17:1940014. [DOI: 10.1142/s0219720019400146] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 11/18/2022]

118

Defne Z, Aretxabaleta AL, Ganju NK, Kalra TS, Jones DK, Smith KEL. A geospatially resolved wetland vulnerability index: Synthesis of physical drivers. PLoS One 2020;15:e0228504. [PMID: 31999806 PMCID: PMC6992177 DOI: 10.1371/journal.pone.0228504] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 07/11/2019] [Accepted: 01/16/2020] [Indexed: 11/18/2022] Open

Abstract

Assessing wetland vulnerability to chronic and episodic physical drivers is fundamental for establishing restoration priorities. We synthesized multiple data sets from E.B. Forsythe National Wildlife Refuge, New Jersey, to establish a wetland vulnerability metric that integrates a range of physical processes, anthropogenic impact and physical/biophysical features. The geospatial data are based on aerial imagery, remote sensing, regulatory information, and hydrodynamic modeling; and include elevation, tidal range, unvegetated to vegetated marsh ratio (UVVR), shoreline erosion, potential exposure to contaminants, residence time, marsh condition change, change in salinity, salinity exposure and sediment concentration. First, we delineated the wetland complex into individual marsh units based on surface contours, and then defined a wetland vulnerability index that combined contributions from all parameters. We applied principal component and cluster analyses to explore the interrelations between the data layers, and separate regions that exhibited common characteristics. Our analysis shows that the spatial variation of vulnerability in this domain cannot be explained satisfactorily by a smaller subset of the variables. The most influential factor on the vulnerability index was the combined effect of elevation, tide range, residence time, and UVVR. Tide range and residence time had the highest correlation, and similar bay-wide spatial variation. Some variables (e.g., shoreline erosion) had no significant correlation with the rest of the variables. The aggregated index based on the complete dataset allows us to assess the overall state of a given marsh unit and quickly locate the most vulnerable units in a larger marsh complex. The application of geospatially complete datasets and consideration of chronic and episodic physical drivers represents an advance over traditional point-based methods for wetland assessment.

119

Liu J, Zhao M, Kong W. Sub-Graph Regularization on Kernel Regression for Robust Semi-Supervised Dimensionality Reduction. ENTROPY 2019;21:1125. [PMCID: PMC7514469 DOI: 10.3390/e21111125] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 10/07/2019] [Accepted: 11/07/2019] [Indexed: 06/17/2023]

120

Permutation Entropy: Enhancing Discriminating Power by Using Relative Frequencies Vector of Ordinal Patterns Instead of Their Shannon Entropy. ENTROPY 2019. [PMCID: PMC7514234 DOI: 10.3390/e21101013] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 12/14/2022]

121

Classifying fishing behavioral diversity using high-frequency movement data. Proc Natl Acad Sci U S A 2019;116:16811-16816. [PMID: 31399551 DOI: 10.1073/pnas.1906766116] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 11/18/2022] Open

122

Mustafa HMJ, Ayob M, Nazri MZA, Kendall G. An improved adaptive memetic differential evolution optimization algorithms for data clustering problems. PLoS One 2019;14:e0216906. [PMID: 31137034 PMCID: PMC6538400 DOI: 10.1371/journal.pone.0216906] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 02/20/2019] [Accepted: 04/30/2019] [Indexed: 11/23/2022] Open