Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen X, Wang L, Smith JD, Zhang B. Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes. Bioinformatics 2008;24:2474-81. [PMID: 18753155 DOI: 10.1093/bioinformatics/btn458] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

For:	Chen X, Wang L, Smith JD, Zhang B. Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes. Bioinformatics 2008;24:2474-81. [PMID: 18753155 DOI: 10.1093/bioinformatics/btn458] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Number

Cited by Other Article(s)

Matsui Y, Togayachi A, Sakamoto K, Angata K, Kadomatsu K, Nishihara S. Integrated Systems Analysis Deciphers Transcriptome and Glycoproteome Links in Alzheimer's Disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.25.573290. [PMID: 38234803 PMCID: PMC10793412 DOI: 10.1101/2023.12.25.573290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]

Smith RN, Rosales IA, Tomaszewski KT, Mahowald GT, Araujo-Medina M, Acheampong E, Bruce A, Rios A, Otsuka T, Tsuji T, Hotta K, Colvin R. Utility of Banff Human Organ Transplant Gene Panel in Human Kidney Transplant Biopsies. Transplantation 2023;107:1188-1199. [PMID: 36525551 PMCID: PMC10132999 DOI: 10.1097/tp.0000000000004389] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Maghsoudi Z, Nguyen H, Tavakkoli A, Nguyen T. A comprehensive survey of the approaches for pathway analysis using multi-omics data integration. Brief Bioinform 2022;23:6761962. [PMID: 36252928 PMCID: PMC9677478 DOI: 10.1093/bib/bbac435] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 08/26/2022] [Accepted: 09/08/2022] [Indexed: 02/07/2023] Open

Zhang LX, Yan H, Liu Y, Xu J, Song J, Yu DJ. Enhancing Characteristic Gene Selection and Tumor Classification by the Robust Laplacian Supervised Discriminative Sparse PCA. J Chem Inf Model 2022;62:1794-1807. [PMID: 35353532 DOI: 10.1021/acs.jcim.1c01403] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]

Gene set inference from single-cell sequencing data using a hybrid of matrix factorization and variational autoencoders. NAT MACH INTELL 2020. [DOI: 10.1038/s42256-020-00269-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Deng Y, Wu S, Fan H. Genome-wide pathway-based quantitative multiple phenotypes analysis. PLoS One 2020;15:e0240910. [PMID: 33175855 PMCID: PMC7657528 DOI: 10.1371/journal.pone.0240910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 10/06/2020] [Indexed: 11/18/2022] Open

Odom GJ, Ban Y, Colaprico A, Liu L, Silva TC, Sun X, Pico AR, Zhang B, Wang L, Chen X. PathwayPCA: an R/Bioconductor Package for Pathway Based Integrative Analysis of Multi-Omics Data. Proteomics 2020;20:e1900409. [PMID: 32430990 PMCID: PMC7677175 DOI: 10.1002/pmic.201900409] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 05/01/2020] [Indexed: 01/01/2023]

Affiliation(s)

Gabriel J. Odom Department of Biostatistics, Florida International University, Stempel College of Public Health, Miami, FL 33199, USA Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Yuguang Ban Sylvester Comprehensive Cancer Center, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Antonio Colaprico Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Lizhong Liu Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Tiago Chedraoui Silva Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Xiaodian Sun Sylvester Comprehensive Cancer Center, University of Miami, Miller School of Medicine, Miami, FL 33136, USA
Alexander R. Pico Institute for Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA 94158, USA
Bing Zhang Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston TX 77030, USA Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
Lily Wang Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA Sylvester Comprehensive Cancer Center, University of Miami, Miller School of Medicine, Miami, FL 33136, USA Dr. John T Macdonald Foundation Department of Human Genetics, University of Miami, Miller School of Medicine, Miami, FL 33136, USA John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL 33136, USA
Xi Chen Division of Biostatistics, Department of Public Health Sciences, University of Miami, Miller School of Medicine, Miami, FL 33136, USA Sylvester Comprehensive Cancer Center, University of Miami, Miller School of Medicine, Miami, FL 33136, USA

Collapse

Yan KK, Wang X, Lam WWT, Vardhanabhuti V, Lee AWM, Pang HH. Radiomics analysis using stability selection supervised component analysis for right-censored survival data. Comput Biol Med 2020;124:103959. [PMID: 32905923 PMCID: PMC7501167 DOI: 10.1016/j.compbiomed.2020.103959] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Revised: 08/02/2020] [Accepted: 08/03/2020] [Indexed: 02/03/2023]

Somani J, Ramchandran S, Lähdesmäki H. A personalised approach for identifying disease-relevant pathways in heterogeneous diseases. NPJ Syst Biol Appl 2020;6:17. [PMID: 32518234 PMCID: PMC7283216 DOI: 10.1038/s41540-020-0130-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 03/12/2020] [Indexed: 11/30/2022] Open

Kim K, Sun H. Incorporating genetic networks into case-control association studies with high-dimensional DNA methylation data. BMC Bioinformatics 2019;20:510. [PMID: 31640538 PMCID: PMC6805595 DOI: 10.1186/s12859-019-3040-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 08/21/2019] [Indexed: 12/23/2022] Open

Abstract

Background

In human genetic association studies with high-dimensional gene expression data, it has been well known that statistical selection methods utilizing prior biological network knowledge such as genetic pathways and signaling pathways can outperform other methods that ignore genetic network structures in terms of true positive selection. In recent epigenetic research on case-control association studies, relatively many statistical methods have been proposed to identify cancer-related CpG sites and their corresponding genes from high-dimensional DNA methylation array data. However, most of existing methods are not designed to utilize genetic network information although methylation levels between linked genes in the genetic networks tend to be highly correlated with each other.

Results

We propose new approach that combines data dimension reduction techniques with network-based regularization to identify outcome-related genes for analysis of high-dimensional DNA methylation data. In simulation studies, we demonstrated that the proposed approach overwhelms other statistical methods that do not utilize genetic network information in terms of true positive selection. We also applied it to the 450K DNA methylation array data of the four breast invasive carcinoma cancer subtypes from The Cancer Genome Atlas (TCGA) project.

Conclusions

The proposed variable selection approach can utilize prior biological network information for analysis of high-dimensional DNA methylation array data. It first captures gene level signals from multiple CpG sites using data a dimension reduction technique and then performs network-based regularization based on biological network graph information. It can select potentially cancer-related genes and genetic pathways that were missed by the existing methods.

Electronic supplementary material

The online version of this article (10.1186/s12859-019-3040-x) contains supplementary material, which is available to authorized users.

Collapse

Radiomics and MGMT promoter methylation for prognostication of newly diagnosed glioblastoma. Sci Rep 2019;9:14435. [PMID: 31594994 PMCID: PMC6783410 DOI: 10.1038/s41598-019-50849-y] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 09/20/2019] [Indexed: 11/16/2022] Open

Bani-Sadr A, Eker OF, Berner LP, Ameli R, Hermier M, Barritault M, Meyronet D, Guyotat J, Jouanneau E, Honnorat J, Ducray F, Berthezene Y. Conventional MRI radiomics in patients with suspected early- or pseudo-progression. Neurooncol Adv 2019;1:vdz019. [PMID: 32642655 PMCID: PMC7212855 DOI: 10.1093/noajnl/vdz019] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

Walsh EE, Mariani TJ, Chu C, Grier A, Gill SR, Qiu X, Wang L, Holden-Wiltse J, Corbett A, Thakar J, Benoodt L, McCall MN, Topham DJ, Falsey AR, Caserta MT. Aims, Study Design, and Enrollment Results From the Assessing Predictors of Infant Respiratory Syncytial Virus Effects and Severity Study. JMIR Res Protoc 2019;8:e12907. [PMID: 31199303 PMCID: PMC6595944 DOI: 10.2196/12907] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Revised: 03/01/2019] [Accepted: 03/03/2019] [Indexed: 01/04/2023] Open

Abstract

Background

The majority of infants hospitalized with primary respiratory syncytial virus (RSV) infection have no obvious risk factors for severe disease.

Objective

The aim of this study (Assessing Predictors of Infant RSV Effects and Severity, AsPIRES) was to identify factors associated with severe disease in full-term healthy infants younger than 10 months with primary RSV infection.

Methods

RSV infected infants were enrolled from 3 cohorts during consecutive winters from August 2012 to April 2016 in Rochester, New York. A birth cohort was prospectively enrolled and followed through their first winter for development of RSV infection. An outpatient supplemental cohort was enrolled in the emergency department or pediatric offices, and a hospital cohort was enrolled on admission with RSV infection. RSV was diagnosed by reverse transcriptase-polymerase chain reaction. Demographic and clinical data were recorded and samples collected for assays: buccal swab (cytomegalovirus polymerase chain reaction, PCR), nasal swab (RSV qualitative PCR, complete viral gene sequence, 16S ribosomal ribonucleic acid [RNA] amplicon microbiota analysis), nasal wash (chemokine and cytokine assays), nasal brush (nasal respiratory epithelial cell gene expression using RNA sequencing [RNAseq]), and 2 to 3 ml of heparinized blood (flow cytometry, RNAseq analysis of purified cluster of differentiation [CD]4+, CD8+, B cells and natural killer cells, and RSV-specific antibody). Cord blood (RSV-specific antibody) was also collected for the birth cohort. Univariate and multivariate logistic regression will be used for analysis of data using a continuous Global Respiratory Severity Score (GRSS) as the outcome variable. Novel statistical methods will be developed for integration of the large complex datasets.

Results

A total of 453 infants were enrolled into the 3 cohorts; 226 in the birth cohort, 60 in the supplemental cohort, and 78 in the hospital cohort. A total of 126 birth cohort infants remained in the study and were evaluated for 150 respiratory illnesses. Of the 60 RSV positive infants in the supplemental cohort, 42 completed the study, whereas all 78 of the RSV positive hospital cohort infants completed the study. A GRSS was calculated for each RSV-infected infant and is being used to analyze each of the complex datasets by correlation with disease severity in univariate and multivariate methods.

Conclusions

The AsPIRES study will provide insights into the complex pathogenesis of RSV infection in healthy full-term infants with primary RSV infection. The analysis will allow assessment of multiple factors potentially influencing the severity of RSV infection including the level of RSV specific antibodies, the innate immune response of nasal epithelial cells, the adaptive response by various lymphocyte subsets, the resident airway microbiota, and viral factors. Results of this study will inform disease interventions such as vaccines and antiviral therapies.

Collapse

Recent Advances in Supervised Dimension Reduction: A Survey. MACHINE LEARNING AND KNOWLEDGE EXTRACTION 2019. [DOI: 10.3390/make1010020] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Majumdar S, Basak SC, Lungu CN, Diudea MV, Grunwald GD. Mathematical structural descriptors and mutagenicity assessment: a study with congeneric and diverse datasets^$. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018;29:579-590. [PMID: 30025481 DOI: 10.1080/1062936x.2018.1496475] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 07/01/2018] [Indexed: 06/08/2023]

Meng Y, Cai XH, Wang L. Potential Genes and Pathways of Neonatal Sepsis Based on Functional Gene Set Enrichment Analyses. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018;2018:6708520. [PMID: 30154914 PMCID: PMC6091373 DOI: 10.1155/2018/6708520] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 06/04/2018] [Accepted: 06/27/2018] [Indexed: 12/16/2022]

Kickingereder P, Götz M, Muschelli J, Wick A, Neuberger U, Shinohara RT, Sill M, Nowosielski M, Schlemmer HP, Radbruch A, Wick W, Bendszus M, Maier-Hein KH, Bonekamp D. Large-scale Radiomic Profiling of Recurrent Glioblastoma Identifies an Imaging Predictor for Stratifying Anti-Angiogenic Treatment Response. Clin Cancer Res 2016;22:5765-5771. [PMID: 27803067 DOI: 10.1158/1078-0432.ccr-16-0702] [Citation(s) in RCA: 193] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Revised: 06/23/2016] [Accepted: 07/07/2016] [Indexed: 01/03/2023]

Chan WH, Mohamad MS, Deris S, Zaki N, Kasim S, Omatu S, Corchado JM, Al Ashwal H. Identification of informative genes and pathways using an improved penalized support vector machine with a weighting scheme. Comput Biol Med 2016;77:102-15. [PMID: 27522238 DOI: 10.1016/j.compbiomed.2016.08.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Revised: 08/03/2016] [Accepted: 08/03/2016] [Indexed: 01/03/2023]

Kickingereder P, Burth S, Wick A, Götz M, Eidel O, Schlemmer HP, Maier-Hein KH, Wick W, Bendszus M, Radbruch A, Bonekamp D. Radiomic Profiling of Glioblastoma: Identifying an Imaging Predictor of Patient Survival with Improved Performance over Established Clinical and Radiologic Risk Models. Radiology 2016;280:880-9. [PMID: 27326665 DOI: 10.1148/radiol.2016160845] [Citation(s) in RCA: 274] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Abstract

Purpose To evaluate whether radiomic feature-based magnetic resonance (MR) imaging signatures allow prediction of survival and stratification of patients with newly diagnosed glioblastoma with improved accuracy compared with that of established clinical and radiologic risk models. Materials and Methods Retrospective evaluation of data was approved by the local ethics committee and informed consent was waived. A total of 119 patients (allocated in a 2:1 ratio to a discovery [n = 79] or validation [n = 40] set) with newly diagnosed glioblastoma were subjected to radiomic feature extraction (12 190 features extracted, including first-order, volume, shape, and texture features) from the multiparametric (contrast material-enhanced T1-weighted and fluid-attenuated inversion-recovery imaging sequences) and multiregional (contrast-enhanced and unenhanced) tumor volumes. Radiomic features of patients in the discovery set were subjected to a supervised principal component (SPC) analysis to predict progression-free survival (PFS) and overall survival (OS) and were validated in the validation set. The performance of a Cox proportional hazards model with the SPC analysis predictor was assessed with C index and integrated Brier scores (IBS, lower scores indicating higher accuracy) and compared with Cox models based on clinical (age and Karnofsky performance score) and radiologic (Gaussian normalized relative cerebral blood volume and apparent diffusion coefficient) parameters. Results SPC analysis allowed stratification based on 11 features of patients in the discovery set into a low- or high-risk group for PFS (hazard ratio [HR], 2.43; P = .002) and OS (HR, 4.33; P < .001), and the results were validated successfully in the validation set for PFS (HR, 2.28; P = .032) and OS (HR, 3.45; P = .004). The performance of the SPC analysis (OS: IBS, 0.149; C index, 0.654; PFS: IBS, 0.138; C index, 0.611) was higher compared with that of the radiologic (OS: IBS, 0.175; C index, 0.603; PFS: IBS, 0.149; C index, 0.554) and clinical risk models (OS: IBS, 0.161, C index, 0.640; PFS: IBS, 0.139; C index, 0.599). The performance of the SPC analysis model was further improved when combined with clinical data (OS: IBS, 0.142; C index, 0.696; PFS: IBS, 0.132; C index, 0.637). Conclusion An 11-feature radiomic signature that allows prediction of survival and stratification of patients with newly diagnosed glioblastoma was identified, and improved performance compared with that of established clinical and radiologic risk models was demonstrated. (©) RSNA, 2016 Online supplemental material is available for this article.

Collapse

Affiliation(s)

Philipp Kickingereder From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Sina Burth From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Antje Wick From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Michael Götz From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Oliver Eidel From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Heinz-Peter Schlemmer From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Klaus H Maier-Hein From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Wolfgang Wick From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Martin Bendszus From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
Alexander Radbruch From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany
David Bonekamp From the Department of Neuroradiology (P.K., S.B., O.E., M.B., A.R., D.B.) and Neurology Clinic (A.W., W.W.), University of Heidelberg Medical Center, Im Neuenheimer Feld 400, 69120 Heidelberg, Germany; Department of Medical Image Computing, Medical and Biological Informatics Division (M.G., K.H.M.H.), Department of Radiology (H.P.S., A.R., D.B.), and Clinical Neuro-oncology Cooperation Unit, German Cancer Consortium (DKTK) (W.W.), German Cancer Research Center (DKFZ), Heidelberg, Germany

Collapse

Zhang Q, Zhao Y, Zhang R, Wei Y, Yi H, Shao F, Chen F. A Comparative Study of Five Association Tests Based on CpG Set for Epigenome-Wide Association Studies. PLoS One 2016;11:e0156895. [PMID: 27258058 PMCID: PMC4892473 DOI: 10.1371/journal.pone.0156895] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2016] [Accepted: 05/20/2016] [Indexed: 11/19/2022] Open

Danaher P, Paul D, Wang P. Covariance-based analyses of biological pathways. Biometrika 2015;102:533-544. [PMID: 26412865 DOI: 10.1093/biomet/asv013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

A novel hybrid dimension reduction technique for undersized high dimensional gene expression data sets using information complexity criterion for cancer classification. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015;2015:370640. [PMID: 25838836 PMCID: PMC4370236 DOI: 10.1155/2015/370640] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Accepted: 02/18/2015] [Indexed: 11/21/2022]

Thomas R, Hubbard AE, McHale CM, Zhang L, Rappaport SM, Lan Q, Rothman N, Vermeulen R, Guyton KZ, Jinot J, Sonawane BR, Smith MT. Characterization of changes in gene expression and biochemical pathways at low levels of benzene exposure. PLoS One 2014;9:e91828. [PMID: 24786086 PMCID: PMC4006721 DOI: 10.1371/journal.pone.0091828] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 02/14/2014] [Indexed: 11/19/2022] Open

Killeen AP, Morris DG, Kenny DA, Mullen MP, Diskin MG, Waters SM. Global gene expression in endometrium of high and low fertility heifers during the mid-luteal phase of the estrous cycle. BMC Genomics 2014;15:234. [PMID: 24669966 PMCID: PMC3986929 DOI: 10.1186/1471-2164-15-234] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 03/14/2014] [Indexed: 01/01/2023] Open

Hira ZM, Trigeorgis G, Gillies DF. An algorithm for finding biologically significant features in microarray data based on a priori manifold learning. PLoS One 2014;9:e90562. [PMID: 24595155 PMCID: PMC3940899 DOI: 10.1371/journal.pone.0090562] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Accepted: 02/02/2014] [Indexed: 11/19/2022] Open

SNP set association analysis for genome-wide association studies. PLoS One 2013;8:e62495. [PMID: 23658731 PMCID: PMC3643925 DOI: 10.1371/journal.pone.0062495] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Accepted: 03/22/2013] [Indexed: 11/29/2022] Open

Chen X, Ishwaran H. Pathway hunting by random survival forests. Bioinformatics 2013;29:99-105. [PMID: 23129299 PMCID: PMC3530909 DOI: 10.1093/bioinformatics/bts643] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Revised: 07/18/2012] [Accepted: 10/17/2012] [Indexed: 01/22/2023] Open

Wang L, Chen X, Zhang B. Statistical Analysis of Patient-Specific Pathway Activities via Mixed Models. ACTA ACUST UNITED AC 2013;Suppl 8:7313. [PMID: 24124644 DOI: 10.4172/2155-6180.s8-001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Yu T, Bai Y. Analyzing LC/MS metabolic profiling data in the context of existing metabolic networks. ACTA ACUST UNITED AC 2012;1:83-91. [PMID: 24010053 DOI: 10.2174/2213235x11301010084] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Comparative evaluation of set-level techniques in predictive classification of gene expression samples. BMC Bioinformatics 2012;13 Suppl 10:S15. [PMID: 22759420 PMCID: PMC3382436 DOI: 10.1186/1471-2105-13-s10-s15] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Abstract

Background

Analysis of gene expression data in terms of a priori-defined gene sets has recently received significant attention as this approach typically yields more compact and interpretable results than those produced by traditional methods that rely on individual genes. The set-level strategy can also be adopted with similar benefits in predictive classification tasks accomplished with machine learning algorithms. Initial studies into the predictive performance of set-level classifiers have yielded rather controversial results. The goal of this study is to provide a more conclusive evaluation by testing various components of the set-level framework within a large collection of machine learning experiments.

Results

Genuine curated gene sets constitute better features for classification than sets assembled without biological relevance. For identifying the best gene sets for classification, the Global test outperforms the gene-set methods GSEA and SAM-GS as well as two generic feature selection methods. To aggregate expressions of genes into a feature value, the singular value decomposition (SVD) method as well as the SetSig technique improve on simple arithmetic averaging. Set-level classifiers learned with 10 features constituted by the Global test slightly outperform baseline gene-level classifiers learned with all original data features although they are slightly less accurate than gene-level classifiers learned with a prior feature-selection step.

Conclusion

Set-level classifiers do not boost predictive accuracy, however, they do achieve competitive accuracy if learned with the right combination of ingredients.

Availability

Open-source, publicly available software was used for classifier learning and testing. The gene expression datasets and the gene set database used are also publicly available. The full tabulation of experimental results is available at http://ida.felk.cvut.cz/CESLT.

Collapse

Adaptive elastic-net sparse principal component analysis for pathway association testing. Stat Appl Genet Mol Biol 2011;10:/j/sagmb.2011.10.issue-1/1544-6115.1697/1544-6115.1697.xml. [PMID: 23089825 DOI: 10.2202/1544-6115.1697] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]

Misman MF, Mohamad MS, Deris S, Abdullah A, Hashim SZM. An improved hybrid of SVM and SCAD for pathway analysis. Bioinformation 2011;7:169-75. [PMID: 22102773 PMCID: PMC3218518 DOI: 10.6026/97320630007169] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Accepted: 10/02/2011] [Indexed: 11/23/2022] Open

Capturing changes in gene expression dynamics by gene set differential coordination analysis. Genomics 2011;98:469-77. [PMID: 21971296 DOI: 10.1016/j.ygeno.2011.09.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2011] [Revised: 09/01/2011] [Accepted: 09/16/2011] [Indexed: 12/31/2022]

Han B, Li L, Chen Y, Zhu L, Dai Q. A two step method to identify clinical outcome relevant genes with microarray data. J Biomed Inform 2011;44:229-38. [DOI: 10.1016/j.jbi.2010.11.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Revised: 10/06/2010] [Accepted: 11/29/2010] [Indexed: 12/29/2022]

Long N, Gianola D, Rosa G, Weigel K. Dimension reduction and variable selection for genomic selection: application to predicting milk yield in Holsteins. J Anim Breed Genet 2011;128:247-57. [DOI: 10.1111/j.1439-0388.2011.00917.x] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Chen X, Wang L, Hu B, Guo M, Barnard J, Zhu X. Pathway-based analysis for genome-wide association studies using supervised principal components. Genet Epidemiol 2011;34:716-24. [PMID: 20842628 DOI: 10.1002/gepi.20532] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Robotti E, Demartini M, Gosetti F, Calabrese G, Marengo E. Development of a classification and ranking method for the identification of possible biomarkers in two-dimensional gel-electrophoresis based on principal component analysis and variable selection procedures. MOLECULAR BIOSYSTEMS 2011;7:677-86. [PMID: 21286649 DOI: 10.1039/c0mb00124d] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Ma S, Dai Y. Principal component analysis based methods in bioinformatics studies. Brief Bioinform 2011;12:714-22. [PMID: 21242203 DOI: 10.1093/bib/bbq090] [Citation(s) in RCA: 125] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Ma S, Kosorok MR, Huang J, Dai Y. Incorporating higher-order representative features improves prediction in network-based cancer prognosis analysis. BMC Med Genomics 2011;4:5. [PMID: 21226928 PMCID: PMC3037289 DOI: 10.1186/1755-8794-4-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 01/12/2011] [Indexed: 01/30/2023] Open

Chen X, Wang L, Ishwaran H. An Integrative Pathway-based Clinical-genomic Model for Cancer Survival Prediction. Stat Probab Lett 2010;80:1313-1319. [PMID: 21731150 DOI: 10.1016/j.spl.2010.04.011] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Montgomery SB, Dermitzakis ET. The resolution of the genetics of gene expression. Hum Mol Genet 2009;18:R211-5. [PMID: 19808798 DOI: 10.1093/hmg/ddp400] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

A unified mixed effects model for gene set analysis of time course microarray experiments. Stat Appl Genet Mol Biol 2009;8:Article 47. [PMID: 19954419 DOI: 10.2202/1544-6115.1484] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Lee YL, Xu X, Wallenstein S, Chen J. Gene expression profiles of the one-carbon metabolism pathway. J Genet Genomics 2009;36:277-82. [PMID: 19447375 DOI: 10.1016/s1673-8527(08)60115-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2008] [Revised: 12/24/2008] [Accepted: 01/20/2009] [Indexed: 10/20/2022]

Nueda MJ, Sebastián P, Tarazona S, García-García F, Dopazo J, Ferrer A, Conesa A. Functional assessment of time course microarray data. BMC Bioinformatics 2009;10 Suppl 6:S9. [PMID: 19534758 PMCID: PMC2697656 DOI: 10.1186/1471-2105-10-s6-s9] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Abstract

Motivation

Time-course microarray experiments study the progress of gene expression along time across one or several experimental conditions. Most developed analysis methods focus on the clustering or the differential expression analysis of genes and do not integrate functional information. The assessment of the functional aspects of time-course transcriptomics data requires the use of approaches that exploit the activation dynamics of the functional categories to where genes are annotated.

Methods

We present three novel methodologies for the functional assessment of time-course microarray data. i) maSigFun derives from the maSigPro method, a regression-based strategy to model time-dependent expression patterns and identify genes with differences across series. maSigFun fits a regression model for groups of genes labeled by a functional class and selects those categories which have a significant model. ii) PCA-maSigFun fits a PCA model of each functional class-defined expression matrix to extract orthogonal patterns of expression change, which are then assessed for their fit to a time-dependent regression model. iii) ASCA-functional uses the ASCA model to rank genes according to their correlation to principal time expression patterns and assess functional enrichment on a GSA fashion. We used simulated and experimental datasets to study these novel approaches. Results were compared to alternative methodologies.

Results

Synthetic and experimental data showed that the different methods are able to capture different aspects of the relationship between genes, functions and co-expression that are biologically meaningful. The methods should not be considered as competitive but they provide different insights into the molecular and functional dynamic events taking place within the biological system under study.

Collapse

Chen X, Wang L. Integrating biological knowledge with gene expression profiles for survival prediction of cancer. J Comput Biol 2009;16:265-78. [PMID: 19183004 DOI: 10.1089/cmb.2008.12tt] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Wu Z, Zhao X, Chen L. Identifying responsive functional modules from protein-protein interaction network. Mol Cells 2009;27:271-7. [PMID: 19326072 DOI: 10.1007/s10059-009-0035-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2009] [Accepted: 01/26/2009] [Indexed: 10/21/2022] Open

Ma S, Kosorok MR. Identification of differential gene pathways with principal component analysis. ACTA ACUST UNITED AC 2009;25:882-9. [PMID: 19223452 DOI: 10.1093/bioinformatics/btp085] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Abstract

MOTIVATION

Development of high-throughput technology makes it possible to measure expressions of thousands of genes simultaneously. Genes have the inherent pathway structure, where pathways are composed of multiple genes with coordinated biological functions. It is of great interest to identify differential gene pathways that are associated with the variations of phenotypes.

RESULTS

We propose the following approach for detecting differential gene pathways. First, we construct gene pathways using databases such as KEGG or GO. Second, for each pathway, we extract a small number of representative features, which are linear combinations of gene expressions and/or their transformations. Specifically, we propose using (i) principal components (PCs) of gene expression sets, (ii) PCs of expanded gene expression sets and (iii) expanded sets of PCs of gene expressions, as the representative features. Third, we identify differential gene pathways as those with representative features significantly associated with the variations of phenotypes, particularly disease clinical outcomes, in regression models. The false discovery rate approach is used to adjust for multiple comparisons. Analysis of three gene expression datasets suggests that (i) the proposed approach can effectively identify differential gene pathways; (ii) PCs that explain only a small amount of variations of gene expressions may bear significant associations between gene pathways and phenotypes; (iii) including second-order terms of gene expressions may lead to identification of new differential gene pathways; (iv) the proposed approach is relatively insensitive to additional noises; and (v) the proposed approach can identify gene pathways missed by alternative approaches.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse