Reference Citation Analysis

For: Tsai YS, Lin CT, Tseng GC, Chung IF, Pal NR. Discovery of dominant and dormant genes from expression data using a novel generalization of SNR for multi-class problems. BMC Bioinformatics 2008;9:425. [PMID: 18842155 PMCID: PMC2620271 DOI: 10.1186/1471-2105-9-425] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Received: 01/16/2008] [Accepted: 10/09/2008] [Indexed: 12/14/2022]

Cited by Other Article(s)

Chen J, Wen B. Bi-level gene selection of cancer by combining clustering and sparse learning. Comput Biol Med 2024;172:108236. [PMID: 38471351 DOI: 10.1016/j.compbiomed.2024.108236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/03/2023] [Revised: 02/07/2024] [Accepted: 02/25/2024] [Indexed: 03/14/2024]

Wu Y, Sa Y, Guo Y, Li Q, Zhang N. Identification of WHO II/III gliomas by 16 prognostic-related gene signatures using machine learning methods. Curr Med Chem 2021;29:1622-1639. [PMID: 34455959 DOI: 10.2174/0929867328666210827103049] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 01/08/2021] [Revised: 05/27/2021] [Accepted: 05/28/2021] [Indexed: 11/22/2022]

Abstract

BACKGROUND

Clinical observation shows that the prognosis of gliomas of the same grade varies widely among World Health Organization (WHO) grade II and III tumors. Therefore, a better understanding of the genetics and molecular mechanisms underlying WHO grade II and III gliomas is required, with the aim of developing a classification scheme at the molecular level rather than at the conventional pathological morphology level.

METHOD

We performed survival analysis combined with the Least Absolute Shrinkage and Selection Operator (LASSO) machine learning method, using expression datasets downloaded from the Chinese Glioma Genome Atlas and The Cancer Genome Atlas. Risk scores were calculated from the products of the expression levels of overall survival-related genes and their multivariate Cox proportional hazards regression coefficients. WHO grade II and III gliomas were categorized into low-risk, medium-risk, and high-risk subgroups. We used the 16 prognostic-related genes as input features to build a prognosis-based classification model using a fully connected neural network. Gene function annotations were also performed.
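The risk-score step described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the score is the sum over signature genes of expression level times Cox coefficient, and that subgroups are assigned with two cutoffs. The coefficient values, expression values, and cutoffs below are hypothetical; the abstract does not report them.

```python
from typing import Dict, List


def risk_score(expression: Dict[str, float], coefficients: Dict[str, float]) -> float:
    """Sum of (Cox coefficient x expression level) over the signature genes."""
    return sum(coefficients[gene] * expression[gene] for gene in coefficients)


def assign_subgroups(scores: List[float], low_cut: float, high_cut: float) -> List[str]:
    """Label each score as low-, medium-, or high-risk using two cutoffs (assumed)."""
    labels = []
    for score in scores:
        if score < low_cut:
            labels.append("low-risk")
        elif score < high_cut:
            labels.append("medium-risk")
        else:
            labels.append("high-risk")
    return labels


# Hypothetical coefficients and expression values, for illustration only.
coefs = {"GAS1": 0.5, "EFNB1": -0.25}
patients = [
    {"GAS1": 1.0, "EFNB1": 2.0},
    {"GAS1": 4.0, "EFNB1": 0.0},
    {"GAS1": 8.0, "EFNB1": 4.0},
]

scores = [risk_score(p, coefs) for p in patients]
labels = assign_subgroups(scores, low_cut=1.0, high_cut=2.5)
print(list(zip(scores, labels)))
```

In practice the coefficients would come from a fitted multivariate Cox model, and the cutoffs from whatever rule the study used, which is not stated in the abstract.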

RESULTS

The 16 genes (AKNAD1, C7orf13, CDK20, CHRFAM7A, CHRNA1, EFNB1, GAS1, HIST2H2BE, KCNK3, KLHL4, LRRK2, NXPH3, PIGZ, SAMD5, ERINC2, and SIX6) related to glioma prognosis were screened. The 16 selected genes were associated with the development of gliomas and carcinogenesis. The fully connected neural network model reached an accuracy of 95.5% on an external validation data set from the two cohorts. Our method shows good potential for classifying WHO grade II and III gliomas into low-risk, medium-risk, and high-risk subgroups. The subgroups showed significant (P<0.01) differences in overall survival.

CONCLUSION

This study identified 16 genes related to the prognosis of gliomas. We developed a computational method to divide WHO grade II and III gliomas into three subgroups with distinct prognoses. The gene expression-based method provides a reliable alternative for determining the prognosis of gliomas.

Acharya S, Saha S, Nikhil N. Unsupervised gene selection using biological knowledge: application in sample clustering. BMC Bioinformatics 2017;18:513. [PMID: 29166852 PMCID: PMC5700545 DOI: 10.1186/s12859-017-1933-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Received: 08/08/2017] [Accepted: 11/08/2017] [Indexed: 11/10/2022]

Delgado E, Boisen MM, Laskey R, Chen R, Song C, Sallit J, Yochum ZA, Andersen CL, Sikora MJ, Wagner J, Safe S, Elishaev E, Lee A, Edwards RP, Haluska P, Tseng G, Schurdak M, Oesterreich S. High expression of orphan nuclear receptor NR4A1 in a subset of ovarian tumors with worse outcome. Gynecol Oncol 2016;141:348-356. [PMID: 26946093 DOI: 10.1016/j.ygyno.2016.02.030] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Received: 11/04/2015] [Revised: 02/18/2016] [Accepted: 02/22/2016] [Indexed: 12/21/2022]

Affiliation(s)

Evan Delgado: University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
Michelle M Boisen: Division of Gynecologic Oncology, Magee-Womens Hospital of the University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Robin Laskey: Division of Gynecologic Oncology, Magee-Womens Hospital of the University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Rui Chen: Department of Biostatistics and Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
Chi Song: Department of Biostatistics and Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
Jad Sallit: Ross University School of Medicine
Zachary A Yochum: Department of Medicine, Division of Hematology Oncology, University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
Courtney L Andersen: Department of Pharmacology and Chemical Biology, Womens Cancer Research Center, Magee-Womens Research Institute, and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA; Molecular Pharmacology Training Program, University of Pittsburgh School of Medicine, Pittsburgh, PA
Matthew J Sikora: Department of Pharmacology and Chemical Biology, Womens Cancer Research Center, Magee-Womens Research Institute, and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
Jacob Wagner: University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
Stephen Safe: Department of Veterinary Physiology and Pharmacology, Texas A&M University, College Station, TX, USA
Esther Elishaev: Department of Pathology, Magee-Womens Hospital of the University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Adrian Lee: Department of Pharmacology and Chemical Biology, Womens Cancer Research Center, Magee-Womens Research Institute, and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA
Robert P Edwards: Division of Gynecologic Oncology, Magee-Womens Hospital of the University of Pittsburgh Medical Center, Pittsburgh, PA, USA
Paul Haluska: Department of Oncology and Pharmacology, Mayo Clinic, Rochester, MN, USA
George Tseng: Department of Biostatistics and Department of Human Genetics, University of Pittsburgh, Pittsburgh, PA, USA
Mark Schurdak: University of Pittsburgh Drug Discovery Institute, Pittsburgh, PA, USA
Steffi Oesterreich: Department of Pharmacology and Chemical Biology, Womens Cancer Research Center, Magee-Womens Research Institute, and University of Pittsburgh Cancer Institute, Pittsburgh, PA, USA

Acharya S, Saha S, Thadisina Y. Multiobjective Simulated Annealing-Based Clustering of Tissue Samples for Cancer Diagnosis. IEEE J Biomed Health Inform 2016;20:691-8. [DOI: 10.1109/jbhi.2015.2404971] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Indexed: 11/09/2022]

Acharya S, Saha S. Importance of proximity measures in clustering of cancer and miRNA datasets: proposal of an automated framework. Mol Biosyst 2016;12:3478-3501. [DOI: 10.1039/c6mb00609d] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 01/29/2023]

Diaz-Cano SJ. Pathological bases for a robust application of cancer molecular classification. Int J Mol Sci 2015;16:8655-75. [PMID: 25898411 PMCID: PMC4425102 DOI: 10.3390/ijms16048655] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 01/28/2015] [Accepted: 04/07/2015] [Indexed: 12/12/2022]

Wang HW, Sun HJ, Chang TY, Lo HH, Cheng WC, Tseng GC, Lin CT, Chang SJ, Pal N, Chung IF. Discovering monotonic stemness marker genes from time-series stem cell microarray data. BMC Genomics 2015;16 Suppl 2:S2. [PMID: 25708300 PMCID: PMC4331716 DOI: 10.1186/1471-2164-16-s2-s2] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Indexed: 11/10/2022]

Abstract

Background

Identification of genes with ascending or descending monotonic expression patterns over time or stages of stem cells is an important issue in time-series microarray data analysis. We propose a method named Monotonic Feature Selector (MFSelector), based on the concept of total discriminating error (DE_total), to identify monotonic genes. MFSelector considers the time stages in stage order (i.e., Stage One vs. the other stages, Stages One and Two vs. the remaining stages, and so on) and computes DE_total for each gene. MFSelector can successfully identify genes with monotonic characteristics.
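The stage-ordered comparison described above can be sketched as follows. This is an illustrative sketch rather than the MFSelector implementation: the abstract does not define the discriminating error, so the code assumes it is the smallest number of samples that fall on the wrong side of a single threshold for a given split, and that DE_total is the sum of these errors over all ordered splits. The function names and the toy data are hypothetical.

```python
from typing import List, Sequence


def discriminating_error(low: Sequence[float], high: Sequence[float]) -> int:
    """Fewest samples on the wrong side of any single threshold.

    `low` values are expected at or below the threshold, `high` values above it.
    """
    candidates = sorted(set(low) | set(high))
    # A threshold below every value misplaces all `low` samples.
    best = len(low)
    for threshold in candidates:
        errors = sum(v > threshold for v in low) + sum(v <= threshold for v in high)
        best = min(best, errors)
    return best


def de_total(stages: List[List[float]], ascending: bool = True) -> int:
    """Sum the discriminating error over splits taken in stage order.

    Split k compares stages 1..k against stages k+1..n. For a descending
    pattern the stage order is reversed so the same test applies.
    """
    ordered = stages if ascending else stages[::-1]
    total = 0
    for k in range(1, len(ordered)):
        early = [v for stage in ordered[:k] for v in stage]
        late = [v for stage in ordered[k:] for v in stage]
        total += discriminating_error(early, late)
    return total


# Toy expression values for one gene across three stages (hypothetical).
monotone = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
noisy = [[1.0, 5.5], [3.0, 4.0], [5.0, 6.0]]

print(de_total(monotone))                   # perfectly ascending gene
print(de_total(noisy))                      # one out-of-order sample
print(de_total(monotone, ascending=False))  # tested against the wrong direction
```

Under these assumptions a gene with a perfectly monotonic pattern scores zero, and the score grows as more samples violate the stage ordering, so genes can be ranked by ascending DE_total.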

Results

We have demonstrated the effectiveness of MFSelector on two synthetic data sets and two stem cell differentiation data sets: embryonic stem cell neurogenesis (ESCN) and embryonic stem cell vasculogenesis (ESCV) data sets. We have also performed extensive quantitative comparisons of the three monotonic gene selection approaches. Some of the monotonic marker genes discovered from the ESCN dataset, such as OCT4, NANOG, and BLBP, exhibit behavior consistent with that reported in other studies. The role of monotonic genes found by MFSelector in either stemness or differentiation is validated using information obtained from Gene Ontology analysis and other literature. We justify and demonstrate that descending genes are involved in the proliferation or self-renewal activity of stem cells, while ascending genes are involved in differentiation of stem cells into variant cell lineages.

Conclusions

We have developed a novel system, easy to use even with no pre-existing knowledge, to identify gene sets with monotonic expression patterns in multi-stage as well as in time-series genomics matrices. The case studies on ESCN and ESCV have helped to get a better understanding of stemness and differentiation. The novel monotonic marker genes discovered from a data set are found to exhibit consistent behavior in another independent data set, demonstrating the utility of the proposed method. The MFSelector R function and data sets can be downloaded from: http://microarray.ym.edu.tw/tools/MFSelector/.

Banerjee M, Pal NR. Feature selection with SVD entropy: Some modification and extension. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2013.12.029] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Indexed: 10/25/2022]

Rajapakse JC, Mundra PA. Multiclass gene selection using Pareto-fronts. IEEE/ACM Trans Comput Biol Bioinform 2013;10:87-97. [PMID: 23702546 DOI: 10.1109/tcbb.2013.1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Indexed: 06/02/2023]

Wu MY, Dai DQ, Shi Y, Yan H, Zhang XF. Biomarker identification and cancer classification based on microarray data using Laplace naive Bayes model with mean shrinkage. IEEE/ACM Trans Comput Biol Bioinform 2012;9:1649-1662. [PMID: 22868679 DOI: 10.1109/tcbb.2012.105] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Indexed: 06/01/2023]

Tsai YS, Aguan K, Pal NR, Chung IF. Identification of single- and multiple-class specific signature genes from gene expression profiles by group marker index. PLoS One 2011;6:e24259. [PMID: 21909426 PMCID: PMC3164723 DOI: 10.1371/journal.pone.0024259] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 03/18/2011] [Accepted: 08/06/2011] [Indexed: 01/06/2023]

Abstract

Informative genes from microarray data can be used to construct prediction models and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low computational complexity, and efficient in identifying both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot easily identify multiple-class specific signature genes. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small and when the class sizes differ significantly. To compare effectiveness and robustness, we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distributions. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference it to published studies. We find that the identified genes participate in pathways directly involved in cancer development in the leukemia data. Our method offers a promising way to find genes that may be involved in the pathways of multiple diseases, and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases.