Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Linderman GC, Steinerberger S. Clustering with t-SNE, provably. SIAM J Math Data Sci 2019;1:313-332. [PMID: 33073204 PMCID: PMC7561036 DOI: 10.1137/18m1216134] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Number

Cited by Other Article(s)

Jang JH, Kim TY, Lim HS, Yoon D. Unsupervised feature learning for electrocardiogram data using the convolutional variational autoencoder. PLoS One 2021;16:e0260612. [PMID: 34852002 PMCID: PMC8635334 DOI: 10.1371/journal.pone.0260612] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 11/13/2021] [Indexed: 11/18/2022] Open

Chi EC. Discovering Geometry in Data Arrays. Comput Sci Eng 2021;23:42-51. [PMID: 35784398 PMCID: PMC9248489 DOI: 10.1109/mcse.2021.3120039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2024]

Sorino P, Campanella A, Bonfiglio C, Mirizzi A, Franco I, Bianco A, Caruso MG, Misciagna G, Aballay LR, Buongiorno C, Liuzzi R, Cisternino AM, Notarnicola M, Chiloiro M, Fallucchi F, Pascoschi G, Osella AR. Development and validation of a neural network for NAFLD diagnosis. Sci Rep 2021;11:20240. [PMID: 34642390 PMCID: PMC8511336 DOI: 10.1038/s41598-021-99400-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 09/24/2021] [Indexed: 12/18/2022] Open

Abstract

Non-Alcoholic Fatty Liver Disease (NAFLD) affects about 20–30% of the adult population in developed countries and is an increasingly important cause of hepatocellular carcinoma. Liver ultrasound (US) is widely used as a noninvasive method to diagnose NAFLD. However, the intensive use of US is not cost-effective and increases the burden on the healthcare system. Electronic medical records facilitate large-scale epidemiological studies and, existing NAFLD scores often require clinical and anthropometric parameters that may not be captured in those databases. Our goal was to develop and validate a simple Neural Network (NN)-based web app that could be used to predict NAFLD particularly its absence. The study included 2970 subjects; training and testing of the neural network using a train–test-split approach was done on 2869 of them. From another population consisting of 2301 subjects, a further 100 subjects were randomly extracted to test the web app. A search was made to find the best parameters for the NN and then this NN was exported for incorporation into a local web app. The percentage of accuracy, area under the ROC curve, confusion matrix, Positive (PPV) and Negative Predicted Value (NPV) values, precision, recall and f1-score were verified. After that, Explainability (XAI) was analyzed to understand the diagnostic reasoning of the NN. Finally, in the local web app, the specificity and sensitivity values were checked. The NN achieved a percentage of accuracy during testing of 77.0%, with an area under the ROC curve value of 0.82. Thus, in the web app the NN evidenced to achieve good results, with a specificity of 1.00 and sensitivity of 0.73. The described approach can be used to support NAFLD diagnosis, reducing healthcare costs. The NN-based web app is easy to apply and the required parameters are easily found in healthcare databases.

Collapse

Affiliation(s)

Paolo Sorino Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Angelo Campanella Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Caterina Bonfiglio Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Antonella Mirizzi Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Isabella Franco Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Antonella Bianco Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Maria Gabriella Caruso Laboratory of Nutritional Biochemistry, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Giovanni Misciagna Scientific and Ethical Committee, Polyclinic Hospital, University of Bari, Piazza Giulio Cesare, 11, 70124, Bari, BA, Italy
Laura R Aballay Human Nutrition Research Center (CenINH), School of Nutrition, Faculty of Medical Sciences, Universidad Nacional de Córdoba, Córdoba, Argentina
Claudia Buongiorno Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Rosalba Liuzzi Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Anna Maria Cisternino Clinical Nutrition Outpatient Clinic, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Maria Notarnicola Laboratory of Nutritional Biochemistry, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy
Marisa Chiloiro San Giacomo Hospital, Largo S. Veneziani, 21, 70043, Monopoli, BA, Italy
Francesca Fallucchi Department of Engineering Sciences, Guglielmo Marconi University, Via plinio 44, 00193, Rome, Italy
Giovanni Pascoschi Department of Electrical and Information Engineering, Polytechnic of Bari, Via Re David, 200, 70125, Bari, BA, Italy
Alberto Rubén Osella Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, "S de Bellis" Research Hospital, Via Turi 27, 70013, Castellana Grotte, BA, Italy.

Collapse

Class distribution-aware adaptive margins and cluster embedding for classification of fruit and vegetables at supermarket self-checkouts. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.07.040] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Bryan de la Peña J, Kunder N, Lou TF, Chase R, Stanowick A, Barragan-Iglesias P, Pancrazio JJ, Campbell ZT. A Role for Translational Regulation by S6 Kinase and a Downstream Target in Inflammatory Pain. Br J Pharmacol 2021;178:4675-4690. [PMID: 34355805 DOI: 10.1111/bph.15646] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 07/23/2021] [Accepted: 07/26/2021] [Indexed: 11/30/2022] Open

Visualization of vibrational spectroscopy for agro-food samples using t-Distributed Stochastic Neighbor Embedding. Food Control 2021. [DOI: 10.1016/j.foodcont.2020.107812] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Zhao Y, Fang ZY, Lin CX, Deng C, Xu YP, Li HD. RFCell: A Gene Selection Approach for scRNA-seq Clustering Based on Permutation and Random Forest. Front Genet 2021;12:665843. [PMID: 34386033 PMCID: PMC8354212 DOI: 10.3389/fgene.2021.665843] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/01/2021] [Indexed: 11/13/2022] Open

Single-Trial Kernel-Based Functional Connectivity for Enhanced Feature Extraction in Motor-Related Tasks. SENSORS 2021;21:s21082750. [PMID: 33924672 PMCID: PMC8069819 DOI: 10.3390/s21082750] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/01/2021] [Accepted: 04/08/2021] [Indexed: 02/06/2023]

Wang P, Zhang G, Li Y, Oad A, Huang G. Stochastic Neighbor Embedding Algorithm and its Application in Molecular Biological Data. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200414093636] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Automatic Image-Based Event Detection for Large-N Seismic Arrays Using a Convolutional Neural Network. REMOTE SENSING 2021. [DOI: 10.3390/rs13030389] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract Passive seismic experiments have been proposed as a cost-effective and non-invasive alternative to controlled-source seismology, allowing body–wave reflections based on seismic interferometry principles to be retrieved. However, from the huge volume of the recorded ambient noise, only selected time periods (noise panels) are contributing constructively to the retrieval of reflections. We address the issue of automatic scanning of ambient noise data recorded by a large-N array in search of body–wave energy (body–wave events) utilizing a convolutional neural network (CNN). It consists of computing first both amplitude and frequency attribute values at each receiver station for all divided portions of the recorded signal (noise panels). The created 2-D attribute maps are then converted to images and used to extract spatial and temporal patterns associated with the body–wave energy present in the data to build binary CNN-based classifiers. The ensemble of two multi-headed CNN models trained separately on the frequency and amplitude attribute maps demonstrates better generalization ability than each of its participating networks. We also compare the prediction performance of our deep learning (DL) framework with a conventional machine learning (ML) algorithm called XGBoost. The DL-based solution applied to 240 h of ambient seismic noise data recorded by the Kylylahti array in Finland demonstrates high detection accuracy and the superiority over the ML-based one. The ensemble of CNN-based models managed to find almost three times more verified body–wave events in the full unlabelled dataset than it was provided at the training stage. Moreover, the high-level abstraction features extracted at the deeper convolution layers can be used to perform unsupervised clustering of the classified panels with respect to their visual characteristics. Collapse

Natural Language Processing Based Method for Clustering and Analysis of Aviation Safety Narratives. AEROSPACE 2020. [DOI: 10.3390/aerospace7100143] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Abstract The complexity of commercial aviation operations has grown substantially in recent years, together with a diversification of techniques for collecting and analyzing flight data. As a result, data-driven frameworks for enhancing flight safety have grown in popularity. Data-driven techniques offer efficient and repeatable exploration of patterns and anomalies in large datasets. Text-based flight safety data presents a unique challenge in its subjectivity, and relies on natural language processing tools to extract underlying trends from narratives. In this paper, a methodology is presented for the analysis of aviation safety narratives based on text-based accounts of in-flight events and categorical metadata parameters which accompany them. An extensive pre-processing routine is presented, including a comparison between numeric models of textual representation for the purposes of document classification. A framework for categorizing and visualizing narratives is presented through a combination of k-means clustering and 2-D mapping with t-Distributed Stochastic Neighbor Embedding (t-SNE). A cluster post-processing routine is developed for identifying driving factors in each cluster and building a hierarchical structure of cluster and sub-cluster labels. The Aviation Safety Reporting System (ASRS), which includes over a million de-identified voluntarily submitted reports describing aviation safety incidents for commercial flights, is analyzed as a case study for the methodology. The method results in the identification of 10 major clusters and a total of 31 sub-clusters. The identified groupings are post-processed through metadata-based statistical analysis of the learned clusters. The developed method shows promise in uncovering trends from clusters that are not evident in existing anomaly labels in the data and offers a new tool for obtaining insights from text-based safety data that complement existing approaches. Collapse

Chen L, Guo Q, Liu Z, Zhang S, Zhang H. Enhanced synchronization-inspired clustering for high-dimensional data. COMPLEX INTELL SYST 2020. [DOI: 10.1007/s40747-020-00191-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Chatzimparmpas A, Martins RM, Kerren A. t-viSNE: Interactive Assessment and Interpretation of t-SNE Projections. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2020;26:2696-2714. [PMID: 32305922 DOI: 10.1109/tvcg.2020.2986996] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Peña-Solórzano CA, Albrecht DW, Bassed RB, Gillam J, Harris PC, Dimmock MR. Semi-supervised labelling of the femur in a whole-body post-mortem CT database using deep learning. Comput Biol Med 2020;122:103797. [PMID: 32658723 DOI: 10.1016/j.compbiomed.2020.103797] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 04/29/2020] [Accepted: 04/29/2020] [Indexed: 01/16/2023]

Aliverti E, Tilson JL, Filer DL, Babcock B, Colaneri A, Ocasio J, Gershon TR, Wilhelmsen KC, Dunson DB. Projected t-SNE for batch correction. Bioinformatics 2020;36:3522-3527. [PMID: 32176244 PMCID: PMC7267829 DOI: 10.1093/bioinformatics/btaa189] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2019] [Revised: 03/02/2020] [Accepted: 03/12/2020] [Indexed: 12/31/2022] Open

Linderman GC, Mishne G, Jaffe A, Kluger Y, Steinerberger S. Randomized near-neighbor graphs, giant components and applications in data science. J Appl Probab 2020;57:458-476. [PMID: 32913373 PMCID: PMC7480951 DOI: 10.1017/jpr.2020.21] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Škvorc U, Eftimov T, Korošec P. Understanding the problem space in single-objective numerical optimization using exploratory landscape analysis. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106138] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Zhang Y, Kim MS, Reichenberger ER, Stear B, Taylor DM. Scedar: A scalable Python package for single-cell RNA-seq exploratory data analysis. PLoS Comput Biol 2020;16:e1007794. [PMID: 32339163 PMCID: PMC7217489 DOI: 10.1371/journal.pcbi.1007794] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 05/12/2020] [Accepted: 03/17/2020] [Indexed: 11/25/2022] Open

Linderman GC, Steinerberger S. NUMERICAL INTEGRATION ON GRAPHS: WHERE TO SAMPLE AND HOW TO WEIGH. MATHEMATICS OF COMPUTATION 2020;89:1933-1952. [PMID: 33927452 PMCID: PMC8081285 DOI: 10.1090/mcom/3515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Chi EC, Gaines BR, Sun WW, Zhou H, Yang J. Provable Convex Co-clustering of Tensors. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2020;21:214. [PMID: 33312074 PMCID: PMC7731944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Kobak D, Berens P. The art of using t-SNE for single-cell transcriptomics. Nat Commun 2019;10:5416. [PMID: 31780648 PMCID: PMC6882829 DOI: 10.1038/s41467-019-13056-x] [Citation(s) in RCA: 446] [Impact Index Per Article: 74.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 08/30/2019] [Indexed: 12/25/2022] Open