Reference Citation Analysis

For: Anand A, Suganthan PN. Multiclass cancer classification by support vector machines with class-wise optimized genes and probability estimates. J Theor Biol 2009;259:533-40. [PMID: 19406131 DOI: 10.1016/j.jtbi.2009.04.013] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Received: 11/21/2008] [Revised: 02/11/2009] [Accepted: 04/20/2009] [Indexed: 11/15/2022]

Cited by Other Article(s)

A convex multi-class model via distance metric learning based class-to-instance confidence. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]

RNA-Seq-Based Breast Cancer Subtypes Classification Using Machine Learning Approaches. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2020;2020:4737969. [PMID: 33178256 PMCID: PMC7644310 DOI: 10.1155/2020/4737969] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 10/21/2019] [Revised: 05/31/2020] [Accepted: 10/09/2020] [Indexed: 12/20/2022]

Abstract

Background

Breast invasive carcinoma (BRCA) is not a single disease, as each subtype has a distinct morphological structure. Although several computational methods have been proposed for breast cancer subtype identification, the specific interaction mechanisms of the genes involved in each subtype are still incompletely understood. Identifying and exploring these gene interaction mechanisms for each breast cancer subtype can have an important impact on personalized treatment for different patients.

Methods

We integrate the biological importance of genes from gene regulatory networks into the differential expression analysis and thereby obtain weighted differentially expressed genes (weighted DEGs). A gene with a high weight regulates more target genes and thus holds more biological importance. In addition, we constructed gene coexpression networks for the control and experiment groups, and the significantly differentially interacting structures encouraged us to design a corresponding Gene Ontology (GO) enrichment based on gene coexpression networks (GOEGCN). The GOEGCN considers the two-side distinction analysis between the gene coexpression networks of the control and experiment groups. The method allows us to study how the modulated coexpressed gene couples impact biological functions at the GO level.

Results

We modeled the binary classification with weighted DEGs for each subtype. The binary classifier could make a good prediction for an unseen sample, and the experimental results validated the effectiveness of our proposed approaches. The novel enriched GO terms based on GOEGCN for control and experiment groups of each subtype explain the specific biological function changes according to the two-side distinction of coexpression network structures to some extent.

Conclusion

The weighted DEGs contain biological importance derived from the gene regulatory network. Based on the weighted DEGs, five binary classifiers were learned and showed good performance on the “Sensitivity,” “Specificity,” “Accuracy,” “F1,” and “AUC” metrics. The GOEGCN with weighted DEGs for the control and experiment groups produced novel GO enrichment analysis results, and the novel enriched GO terms would, to some extent, further unveil the changes in specific biological functions among the BRCA subtypes. The R code in this research is available at https://github.com/yxchspring/GOEGCN_BRCA_Subtypes.
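For readers unfamiliar with the five metrics named above, the following is a minimal Python sketch of how they are commonly computed for a binary classifier. It is not the authors' R code; the function names `binary_metrics` and `auc` and the toy labels and scores are assumptions made up for illustration only.

```python
def binary_metrics(y_true, y_pred):
    """Sensitivity, specificity, accuracy and F1 for 0/1 labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return {
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,  # true positive rate
        "specificity": tn / (tn + fp) if tn + fp else 0.0,  # true negative rate
        "accuracy": (tp + tn) / len(pairs),
        "f1": 2 * tp / (2 * tp + fp + fn) if tp + fp + fn else 0.0,
    }


def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy example (invented values, not data from the paper).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.1, 0.7]
print(binary_metrics(y_true, y_pred))
print(auc(y_true, scores))
```

The pairwise AUC formulation is quadratic in the number of samples, which is acceptable for a sketch but would usually be replaced by a rank-based computation on larger datasets.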

Yan J, Zhang Z, Lin K, Yang F, Luo X. A hybrid scheme-based one-vs-all decision trees for multi-class classification tasks. Knowl Based Syst 2020. [DOI: 10.1016/j.knosys.2020.105922] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Indexed: 10/24/2022]

Krzhizhanovskaya VV, Závodszky G, Lees MH, Dongarra JJ, Sloot PMA, Brissos S, Teixeira J. Performance Analysis of Binarization Strategies for Multi-class Imbalanced Data Classification. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7303687 DOI: 10.1007/978-3-030-50423-6_11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/28/2022]

Yang L, Gao H, Liu Z, Tang L. Identification of Phage Virion Proteins by Using the g-gap Tripeptide Composition. LETT ORG CHEM 2019. [DOI: 10.2174/1570178615666180910112813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]

Nath A, Subbiah K. The role of pertinently diversified and balanced training as well as testing data sets in achieving the true performance of classifiers in predicting the antifreeze proteins. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2017.07.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Indexed: 11/24/2022]

Mukhopadhyay S, Das NK, Kurmi I, Pradhan A, Ghosh N, Panigrahi PK. Tissue multifractality and hidden Markov model based integrated framework for optimum precancer detection. JOURNAL OF BIOMEDICAL OPTICS 2017;22:1-8. [PMID: 29052373 DOI: 10.1117/1.jbo.22.10.105005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 05/12/2017] [Accepted: 09/29/2017] [Indexed: 06/07/2023]

Zararsız G, Goksuluk D, Korkmaz S, Eldem V, Zararsiz GE, Duru IP, Ozturk A. A comprehensive simulation study on classification of RNA-Seq data. PLoS One 2017;12:e0182507. [PMID: 28832679 PMCID: PMC5568128 DOI: 10.1371/journal.pone.0182507] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Received: 03/18/2017] [Accepted: 07/19/2017] [Indexed: 02/02/2023]

Abstract

RNA sequencing (RNA-Seq) is a powerful technique for the gene-expression profiling of organisms that uses the capabilities of next-generation sequencing technologies. Developing gene-expression-based classification algorithms is an emerging powerful method for diagnosis, disease classification and monitoring at molecular level, as well as providing potential markers of diseases. Most of the statistical methods proposed for the classification of gene-expression data are either based on a continuous scale (e.g., microarray data) or require a normal distribution assumption. Hence, these methods cannot be directly applied to RNA-Seq data since they violate both data structure and distributional assumptions. However, it is possible to apply these algorithms with appropriate modifications to RNA-Seq data. One way is to develop count-based classifiers, such as Poisson linear discriminant analysis (PLDA) and negative binomial linear discriminant analysis (NBLDA). Another way is to bring the data closer to microarrays and apply microarray-based classifiers. In this study, we compared several classifiers including PLDA with and without power transformation, NBLDA, single SVM, bagging SVM (bagSVM), classification and regression trees (CART), and random forests (RF). We also examined the effect of several parameters such as overdispersion, sample size, number of genes, number of classes, differential-expression rate, and the transformation method on model performances. A comprehensive simulation study is conducted and the results are compared with the results of two miRNA and two mRNA experimental datasets. The results revealed that increasing the sample size and differential-expression rate, and decreasing the dispersion parameter and number of groups, lead to an increase in classification accuracy. Similar to differential-expression studies, the classification of RNA-Seq data requires careful attention when handling data overdispersion. We conclude that, as a count-based classifier, the power transformed PLDA and, as a microarray-based classifier, vst or rlog transformed RF and SVM classifiers may be a good choice for classification. An R/BIOCONDUCTOR package, MLSeq, is freely available at https://www.bioconductor.org/packages/release/bioc/html/MLSeq.html.

Gitoee A, Faridi A, France J. Mathematical models for response to amino acids: estimating the response of broiler chickens to branched-chain amino acids using support vector regression and neural network models. Neural Comput Appl 2017. [DOI: 10.1007/s00521-017-2842-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Indexed: 12/01/2022]

Rojas-Moraleda R, Valous NA, Gowen A, Esquerre C, Härtel S, Salinas L, O’Donnell C. A frame-based ANN for classification of hyperspectral images: assessment of mechanical damage in mushrooms. Neural Comput Appl 2016. [DOI: 10.1007/s00521-016-2376-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Indexed: 10/21/2022]

Ranganarayanan P, Thanigesan N, Ananth V, Jayaraman VK, Ramakrishnan V. Identification of Glucose-Binding Pockets in Human Serum Albumin Using Support Vector Machine and Molecular Dynamics Simulations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016;13:148-157. [PMID: 26886739 DOI: 10.1109/tcbb.2015.2415806] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 06/05/2023]

Li P, Hu Y, Yi J, Li J, Yang J, Wang J. Identification of potential biomarkers to differentially diagnose solid pseudopapillary tumors and pancreatic malignancies via a gene regulatory network. J Transl Med 2015;13:361. [PMID: 26578390 PMCID: PMC4650856 DOI: 10.1186/s12967-015-0718-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Received: 07/30/2015] [Accepted: 10/31/2015] [Indexed: 01/18/2023]

Abstract

Background

Solid pseudopapillary neoplasms (SPN) are pancreatic tumors with low malignant potential and good prognosis. However, differential diagnosis between SPN and pancreatic malignancies including pancreatic neuroendocrine tumor (PanNET) and ductal adenocarcinoma (PDAC) is difficult. This study tried to identify candidate biomarkers for the distinction between SPN and the two malignant pancreatic tumors by examining the gene regulatory network of SPN.

Methods

The gene regulatory network for SPN was constructed by a co-expression model. Genes that have been reported to be correlated with SPN were used as the clues to hunt more SPN-related genes in the network according to a shortest path approach. By means of the K-nearest neighbor algorithm (KNN) classifier evaluated by the jackknife test, sets of genes to distinguish SPN and malignant pancreatic tumors were determined.
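The evaluation scheme described above, a KNN classifier assessed by the jackknife (leave-one-out) test, can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not the study's implementation: the helper names `knn_predict` and `jackknife_accuracy`, the Euclidean distance, the value of `k`, and the toy feature vectors and class labels are all invented here.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k=3):
    """Predict the label of x by majority vote among its k nearest neighbours."""
    # Euclidean distance is an assumption; the source does not state the metric.
    dists = sorted((math.dist(row, x), label) for row, label in zip(train_X, train_y))
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def jackknife_accuracy(X, y, k=3):
    """Leave each sample out in turn, train on the rest, and score the prediction."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if knn_predict(train_X, train_y, X[i], k) == y[i]:
            correct += 1
    return correct / len(X)


# Toy expression-like vectors for two hypothetical classes (invented values).
X = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3], [5.0, 5.1], [5.2, 4.9], [4.8, 5.0]]
y = ["tumor_a", "tumor_a", "tumor_a", "tumor_b", "tumor_b", "tumor_b"]
print(jackknife_accuracy(X, y, k=3))
```

Because every sample is held out exactly once, the jackknife estimate is deterministic for a given dataset, which is one reason it is often preferred over random splits on small sample sizes.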

Results

We adopted a new strategy to identify candidate biomarkers for differentiating SPN from the two malignant pancreatic tumors, PanNET and PDAC, by analyzing shortest paths among SPN-related genes in the gene regulatory network. We discovered 43 new SPN-relevant genes. Among these, hsa-miR-194 and hsa-miR-7, along with 7 transcription factors (TFs) such as SOX11, SMAD3 and SOX4, could correctly differentiate SPN from PanNET, while hsa-miR-204 and 4 TFs such as SOX9, TCF7 and PPARD were identified as potential markers for SPN versus PDAC. Fourteen genes were identified as candidate biomarkers for distinguishing SPN from PanNET and PDAC when the two are considered together as malignant pancreatic tumors.

Conclusion

This study provides new candidate genes related to SPN and potential biomarkers to differentiate SPN from PanNET and PDAC, which may help to diagnose patients with SPN in clinical settings. Furthermore, candidate biomarkers such as SOX11 and hsa-miR-204, which could cause cell proliferation but inhibit invasion or metastasis, may be of importance in understanding the molecular mechanism of pancreatic oncogenesis and could be possible therapeutic targets for malignant pancreatic tumors.

Electronic supplementary material

The online version of this article (doi:10.1186/s12967-015-0718-3) contains supplementary material, which is available to authorized users.

Affiliation(s)

Pengping Li: State Key Laboratory of Pharmaceutical Biotechnology, Collaborative Innovation Center of Chemistry for Life Sciences, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, NJU Advanced Institute for Life Sciences (NAILS), School of Life Sciences, Nanjing University, 163 Xianlin Road, Nanjing, 210023, China.
Yuebing Hu: Department of Neurosurgery, Jinling Hospital, School of Medicine, Nanjing University, 305 East Zhongshan Road, Nanjing, 210002, China.
Jiao Yi: State Key Laboratory of Pharmaceutical Biotechnology, Collaborative Innovation Center of Chemistry for Life Sciences, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, NJU Advanced Institute for Life Sciences (NAILS), School of Life Sciences, Nanjing University, 163 Xianlin Road, Nanjing, 210023, China.
Jie Li: State Key Laboratory of Pharmaceutical Biotechnology, Collaborative Innovation Center of Chemistry for Life Sciences, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, NJU Advanced Institute for Life Sciences (NAILS), School of Life Sciences, Nanjing University, 163 Xianlin Road, Nanjing, 210023, China.
Jie Yang: State Key Laboratory of Pharmaceutical Biotechnology, Collaborative Innovation Center of Chemistry for Life Sciences, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, NJU Advanced Institute for Life Sciences (NAILS), School of Life Sciences, Nanjing University, 163 Xianlin Road, Nanjing, 210023, China.
Jin Wang: State Key Laboratory of Pharmaceutical Biotechnology, Collaborative Innovation Center of Chemistry for Life Sciences, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, NJU Advanced Institute for Life Sciences (NAILS), School of Life Sciences, Nanjing University, 163 Xianlin Road, Nanjing, 210023, China.

Reboiro-Jato M, Díaz F, Glez-Peña D, Fdez-Riverola F. A novel ensemble of classifiers that use biological relevant gene sets for microarray classification. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2014.01.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 01/30/2023]

He L, Yang X, Hao Z. An adaptive class pairwise dimensionality reduction algorithm. Neural Comput Appl 2013. [DOI: 10.1007/s00521-012-0897-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/28/2022]

Garcia-Manteiga JM. Data Analysis and Interpretation in Metabolomics. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/09/2022]

Analyzing the presence of noise in multi-class problems: alleviating its influence with the One-vs-One decomposition. Knowl Inf Syst 2012. [DOI: 10.1007/s10115-012-0570-1] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Indexed: 10/27/2022]

Khan A, Majid A, Hayat M. CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition. Comput Biol Chem 2011;35:218-29. [PMID: 21864791 DOI: 10.1016/j.compbiolchem.2011.05.003] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Received: 02/23/2011] [Revised: 05/17/2011] [Accepted: 05/18/2011] [Indexed: 12/18/2022]

Chou KC. Some remarks on protein attribute prediction and pseudo amino acid composition. J Theor Biol 2010;273:236-47. [PMID: 21168420 PMCID: PMC7125570 DOI: 10.1016/j.jtbi.2010.12.024] [Citation(s) in RCA: 966] [Impact Index Per Article: 69.0] [Received: 10/21/2010] [Revised: 12/08/2010] [Accepted: 12/13/2010] [Indexed: 11/29/2022]

Identification and optimization of classifier genes from multi-class earthworm microarray dataset. PLoS One 2010;5:e13715. [PMID: 21060837 PMCID: PMC2965664 DOI: 10.1371/journal.pone.0013715] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Received: 06/28/2010] [Accepted: 10/06/2010] [Indexed: 11/19/2022]