Reference Citation Analysis

For: Cohen S, Dror G, Ruppin E. Feature Selection via Coalitional Game Theory. Neural Comput 2007;19:1939-61. [PMID: 17521285 DOI: 10.1162/neco.2007.19.7.1939] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Indexed: 11/04/2022]

Cited by Other Article(s)

Park SH, Song SH, Burton F, Arsan C, Jobst B, Feldman M. Machine learning characterization of a rare neurologic disease via electronic health records: a proof-of-principle study on stiff person syndrome. BMC Neurol 2024;24:272. [PMID: 39097681 PMCID: PMC11297611 DOI: 10.1186/s12883-024-03760-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/01/2024] [Accepted: 07/12/2024] [Indexed: 08/05/2024] Open

Abstract

BACKGROUND

Despite the frequent diagnostic delays of rare neurologic diseases (RND), it remains difficult to study RNDs and their comorbidities due to their rarity and hence the statistical underpowering. Affecting one to two in a million annually, stiff person syndrome (SPS) is an RND characterized by painful muscle spasms and rigidity. Leveraging underutilized electronic health records (EHR), this study showcased a machine-learning-based framework to identify clinical features that optimally characterize the diagnosis of SPS.

METHODS

A machine-learning-based feature selection approach was employed on 319 items from the past medical histories of 48 individuals (23 with a diagnosis of SPS and 25 controls) with elevated serum autoantibodies against glutamic-acid-decarboxylase-65 (anti-GAD65) in Dartmouth Health's EHR to determine features with the highest discriminatory power. Each iteration of the algorithm implemented a Support Vector Machine (SVM) model, generating importance scores (SHapley Additive exPlanation, or SHAP, values) for each feature and removing the least salient one. Evaluation metrics were calculated through repeated stratified cross-validation.

RESULTS

Depression, hypothyroidism, GERD, and joint pain were the most characteristic features of SPS. Utilizing these features, the SVM model attained precision of 0.817 (95% CI 0.795-0.840), sensitivity of 0.766 (95% CI 0.743-0.790), F-score of 0.761 (95% CI 0.744-0.778), AUC of 0.808 (95% CI 0.791-0.825), and accuracy of 0.775 (95% CI 0.759-0.790).

CONCLUSIONS

This framework discerned features that, with further research, may help fully characterize the pathologic mechanism of SPS: depression, hypothyroidism, and GERD may respectively represent comorbidities through common inflammatory, genetic, and dysautonomic links. This methodology could address diagnostic challenges in neurology by uncovering latent associations and generating hypotheses for RNDs.
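The elimination loop described under METHODS (score every feature, drop the least salient one, repeat) can be illustrated with a small sketch. This is not the study's pipeline: there is no SVM, no SHAP library, and no cross-validation here. Instead, exact Shapley values are computed over a hypothetical payoff function `toy_value`, and the feature names `a` to `d` and their gains are invented purely for illustration.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values: average each player's marginal contribution
    over every ordering. Only feasible for a handful of players."""
    players = list(players)
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: t / len(orderings) for p, t in totals.items()}


def backward_elimination(features, value, keep):
    """Repeatedly rescore the remaining features and remove the one
    with the lowest Shapley value until `keep` features are left."""
    remaining = list(features)
    removed = []
    while len(remaining) > keep:
        scores = shapley_values(remaining, value)
        weakest = min(remaining, key=lambda f: scores[f])
        remaining.remove(weakest)
        removed.append(weakest)
    return remaining, removed


# Hypothetical payoff standing in for a model-performance estimate.
TOY_GAIN = {"a": 0.30, "b": 0.20, "c": 0.05, "d": 0.0}


def toy_value(coalition):
    score = sum(TOY_GAIN[f] for f in coalition)
    if "a" in coalition and "b" in coalition:
        score += 0.10  # assumed synergy between features a and b
    return score


kept, dropped = backward_elimination(list(TOY_GAIN), toy_value, keep=2)
print(kept, dropped)
```

In a realistic setting the payoff would come from a fitted model, and the exact enumeration over orderings would have to be replaced by an approximation, since its cost grows factorially with the number of features.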

Wang H, Doumard E, Soule-Dupuy C, Kemoun P, Aligon J, Monsarrat P. Explanations as a New Metric for Feature Selection: A Systematic Approach. IEEE J Biomed Health Inform 2023;27:4131-4142. [PMID: 37220033 DOI: 10.1109/jbhi.2023.3279340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/25/2023]

O'Sullivan CM, Ghahramani A, Deo RC, Pembleton KG. Pattern recognition describing spatio-temporal drivers of catchment classification for water quality. Sci Total Environ 2023;861:160240. [PMID: 36403827 DOI: 10.1016/j.scitotenv.2022.160240] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 09/01/2022] [Revised: 11/12/2022] [Accepted: 11/13/2022] [Indexed: 06/16/2023]

Abstract

Classification using spatial data is foundational for hydrological modelling, particularly for ungauged areas. However, models developed from classified land use drivers deliver inconsistent water quality results for the same land uses and hinder decision-making guided by those models. This paper explores whether the temporal variation of water quality drivers, such as season and flow, influences inconsistency in the classification, and whether variability is captured in spatial datasets that include original vegetation to represent the variability of biotic responses in areas mapped with the same land use. An Artificial Neural Network Pattern Recognition (ANN-PR) method is used to match catchments by Dissolved Inorganic Nitrogen (DIN) patterns in water quality datasets partitioned into Wet vs Dry Seasons and Increasing vs Retreating flows. Explainable artificial intelligence approaches are then used to classify catchments via spatial feature datasets for each catchment. Catchments matched for sharing patterns in both spatial data and DIN datasets were corroborated, and the benefit of partitioning the observed DIN dataset was evaluated using the Kruskal-Wallis method. The highest corroboration rates for spatial data classification with DIN classification were achieved with seasonal partitioning of water quality datasets, and significant independence (p < 0.001 to 0.026) from non-partitioned datasets was achieved. This study demonstrated that DIN patterns fall into three categories suited to classification under differing temporal scales with corresponding vegetation types as the indicators. Categories 1 and 3 were dominated by woodlands in their datasets, and the catchments suited to being classified together change depending on the temporal scale of the data. Category 2 catchments were dominated by vineforest, and the classified catchments did not change under different temporal scales. This demonstrates that including original vegetation as a proxy for differences in DIN patterns will help guide future classification where only spatially mapped data is available for ungauged catchments and will better inform data needs for water modelling.

Balestra C, Maj C, Müller E, Mayr A. Redundancy-aware unsupervised ranking based on game theory: Ranking pathways in collections of gene sets. PLoS One 2023;18:e0282699. [PMID: 36893181 PMCID: PMC9997904 DOI: 10.1371/journal.pone.0282699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2022] [Accepted: 02/13/2023] [Indexed: 03/10/2023] Open

Abstract

In Genetics, gene sets are grouped in collections concerning their biological function. This often leads to high-dimensional, overlapping, and redundant families of sets, thus precluding a straightforward interpretation of their biological meaning. In Data Mining, it is often argued that techniques to reduce the dimensionality of data could increase the maneuverability and consequently the interpretability of large data. In the past years, moreover, we witnessed an increasing consciousness of the importance of understanding data and interpretable models in the machine learning and bioinformatics communities. On the one hand, there exist techniques aiming to aggregate overlapping gene sets to create larger pathways. While these methods could partly solve the problem of the collections' large size, modifying biological pathways is hardly justifiable in this biological context. On the other hand, the representation methods to increase interpretability of collections of gene sets that have been proposed so far have proved to be insufficient. Inspired by this Bioinformatics context, we propose a method to rank sets within a family of sets based on the distribution of the singletons and their size. We obtain sets' importance scores by computing Shapley values; by making use of microarray games, we do not incur the typical exponential computational complexity. Moreover, we address the challenge of constructing redundancy-aware rankings where, in our case, redundancy is a quantity proportional to the size of intersections among the sets in the collections. We use the obtained rankings to reduce the dimension of the families, therefore showing lower redundancy among sets while still preserving a high coverage of their elements. We finally evaluate our approach for collections of gene sets and apply Gene Sets Enrichment Analysis techniques to the now smaller collections: as expected, the unsupervised nature of the proposed rankings allows for unremarkable differences in the number of significant gene sets for specific phenotypic traits. In contrast, the number of performed statistical tests can be drastically reduced. The proposed rankings show a practical utility in bioinformatics to increase interpretability of the collections of gene sets and are a step forward to include redundancy-awareness into Shapley values computations.
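To see why a microarray-style game sidesteps the exponential cost of Shapley values, consider a minimal sketch under one assumption that the abstract does not spell out: the sets are the players, and each element contributes one unanimity game whose support is the collection of sets containing that element. Under that assumption the Shapley value of a set has a closed form, namely the average over elements of 1 divided by the number of sets containing the element, summed over the set's own elements. The function name `microarray_shapley` and the toy family below are hypothetical, and the redundancy-aware ranking step of the paper is not reproduced.

```python
from collections import defaultdict
from fractions import Fraction


def microarray_shapley(family):
    """Closed-form Shapley values for a sum of unanimity games.

    family: dict mapping a set's name to the set of its elements.
    Each element splits an equal share among the sets that contain it,
    so the cost is linear in the total size of the family.
    """
    support = defaultdict(set)  # element -> names of sets containing it
    for name, members in family.items():
        for element in members:
            support[element].add(name)

    n_elements = len(support)
    scores = {name: Fraction(0) for name in family}
    for owners in support.values():
        share = Fraction(1, len(owners) * n_elements)
        for name in owners:
            scores[name] += share
    return scores


# Toy family of overlapping sets (illustrative only).
toy_family = {"A": {1, 2, 3}, "B": {3, 4}, "C": {4}}
toy_scores = microarray_shapley(toy_family)
ranking = sorted(toy_scores, key=toy_scores.get, reverse=True)
print(ranking, toy_scores)
```

Exact fractions are used so that the scores can be checked to sum to 1, which is the efficiency property of the Shapley value for this normalized game.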

Saadat R, Syed-Mohamad SM, Azmi A, Keikhosrokiani P. Enhancing manufacturing process by predicting component failures using machine learning. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07465-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/28/2022]

Hasan MK, Ghazal TM, Alkhalifah A, Abu Bakar KA, Omidvar A, Nafi NS, Agbinya JI. Fischer Linear Discrimination and Quadratic Discrimination Analysis-Based Data Mining Technique for Internet of Things Framework for Healthcare. Front Public Health 2021;9:737149. [PMID: 34712639 PMCID: PMC8545792 DOI: 10.3389/fpubh.2021.737149] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 07/06/2021] [Accepted: 08/20/2021] [Indexed: 11/20/2022] Open

Tan K, Huang W, Liu X, Hu J, Dong S. A Hierarchical Graph Convolution Network for Representation Learning of Gene Expression Data. IEEE J Biomed Health Inform 2021;25:3219-3229. [PMID: 33449889 DOI: 10.1109/jbhi.2021.3052008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 11/09/2022]

Benchmarking Analysis of the Accuracy of Classification Methods Related to Entropy. Entropy (Basel) 2021;23:e23070850. [PMID: 34356391 PMCID: PMC8306704 DOI: 10.3390/e23070850] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 03/28/2021] [Revised: 06/18/2021] [Accepted: 06/24/2021] [Indexed: 11/19/2022]

Carrizosa E, Molero-Río C, Romero Morales D. Mathematical optimization in classification and regression trees. TOP (Berlin, Germany) 2021;29:5-33. [PMID: 38624654 PMCID: PMC7967110 DOI: 10.1007/s11750-021-00594-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 12/10/2020] [Accepted: 01/27/2021] [Indexed: 06/02/2023]

Alzubi OA, Alzubi JA, Alweshah M, Qiqieh I, Al-Shami S, Ramachandran M. An optimal pruning algorithm of classifier ensembles: dynamic programming approach. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-04761-6] [Citation(s) in RCA: 60] [Impact Index Per Article: 15.0] [Indexed: 11/28/2022]

Raimondi D, Orlando G, Vranken WF, Moreau Y. Exploring the limitations of biophysical propensity scales coupled with machine learning for protein sequence analysis. Sci Rep 2019;9:16932. [PMID: 31729443 PMCID: PMC6858301 DOI: 10.1038/s41598-019-53324-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 07/05/2019] [Accepted: 10/25/2019] [Indexed: 11/21/2022] Open

Zaeri-Amirani M, Afghah F, Mousavi S. A Feature Selection Method Based on Shapley Value to False Alarm Reduction in ICUs A Genetic-Algorithm Approach. Annu Int Conf IEEE Eng Med Biol Soc 2019;2018:319-323. [PMID: 30440402 DOI: 10.1109/embc.2018.8512266] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Indexed: 11/06/2022]

Using game theory and decision decomposition to effectively discern and characterise bi-locus diseases. Artif Intell Med 2019;99:101690. [PMID: 31606112 DOI: 10.1016/j.artmed.2019.06.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/11/2018] [Revised: 06/21/2019] [Accepted: 06/30/2019] [Indexed: 01/08/2023]

Liénard JF, Achakulvisut T, Acuna DE, David SV. Intellectual synthesis in mentorship determines success in academic careers. Nat Commun 2018;9:4840. [PMID: 30482900 PMCID: PMC6258699 DOI: 10.1038/s41467-018-07034-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Received: 02/15/2018] [Accepted: 10/11/2018] [Indexed: 11/30/2022] Open

Afghah F, Razi A, Soroushmehr R, Ghanbari H, Najarian K. Game Theoretic Approach for Systematic Feature Selection; Application in False Alarm Detection in Intensive Care Units. Entropy (Basel) 2018;20:E190. [PMID: 33265281 PMCID: PMC7512707 DOI: 10.3390/e20030190] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 01/10/2018] [Revised: 02/27/2018] [Accepted: 03/05/2018] [Indexed: 01/19/2023]

CAFÉ-Map: Context Aware Feature Mapping for mining high dimensional biomedical data. Comput Biol Med 2016;79:68-79. [PMID: 27764717 DOI: 10.1016/j.compbiomed.2016.10.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 05/09/2016] [Revised: 10/05/2016] [Accepted: 10/10/2016] [Indexed: 12/18/2022]

Robust Feature Selection from Microarray Data Based on Cooperative Game Theory and Qualitative Mutual Information. Adv Bioinformatics 2016;2016:1058305. [PMID: 27127506 PMCID: PMC4818815 DOI: 10.1155/2016/1058305] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Received: 11/28/2015] [Revised: 02/20/2016] [Accepted: 02/22/2016] [Indexed: 11/17/2022] Open

Zeng K, She K, Niu X. Feature selection with neighborhood entropy-based cooperative game theory. Comput Intell Neurosci 2014;2014:479289. [PMID: 25276120 PMCID: PMC4158261 DOI: 10.1155/2014/479289] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 05/15/2014] [Revised: 07/27/2014] [Accepted: 08/10/2014] [Indexed: 11/18/2022]

Using cooperative game theory to optimize the feature selection problem. Neurocomputing 2012. [DOI: 10.1016/j.neucom.2012.05.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Indexed: 11/20/2022]

Dor O, Reich Y. Strengthening learning algorithms by feature discovery. Inf Sci (N Y) 2012. [DOI: 10.1016/j.ins.2011.11.039] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Indexed: 11/27/2022]

Roy K, Bhattacharya P, Suen CY. Iris recognition using shape-guided approach and game theory. Pattern Anal Appl 2011. [DOI: 10.1007/s10044-011-0229-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Indexed: 10/17/2022]

Schuster S, Kreft JU, Brenner N, Wessely F, Theissen G, Ruppin E, Schroeter A. Cooperation and cheating in microbial exoenzyme production--theoretical analysis for biotechnological applications. Biotechnol J 2010;5:751-8. [PMID: 20540107 DOI: 10.1002/biot.200900303] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Indexed: 11/07/2022]