Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wu MC, Zhang L, Wang Z, Christiani DC, Lin X. Sparse linear discriminant analysis for simultaneous testing for the significance of a gene set/pathway and gene selection. Bioinformatics 2009;25:1145-51. [PMID: 19168911 DOI: 10.1093/bioinformatics/btp019] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Cited by Other Article(s)

Hwangbo S, Lee S, Hosain MM, Goo T, Lee S, Kim I, Park T. Kernel-based hierarchical structural component models for pathway analysis on survival phenotype. Genes Genomics 2024:10.1007/s13258-024-01569-9. [PMID: 39327384 DOI: 10.1007/s13258-024-01569-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Accepted: 09/07/2024] [Indexed: 09/28/2024]

Abstract

BACKGROUND

High-throughput sequencing, particularly RNA-sequencing (RNA-seq), has advanced differential gene expression analysis, revealing pathways involved in various biological conditions. Traditional pathway-based methods generally consider pathways independently, overlooking the correlations among them and ignoring quite a few overlapping biomarkers between pathways. In addition, most pathway-based approaches assume that biomarkers have linear effects on the phenotype of interest.

OBJECTIVE

This study aims to develop the HisCoM-KernelS model to identify survival phenotype-related pathways by accommodating complex, nonlinear relationships between genes and survival outcomes, while accounting for inter-pathway correlations.

METHODS

We applied the HisCoM-KernelS model to the TCGA pancreatic ductal adenocarcinoma (PDAC) RNA-seq dataset, comprising 4,498 protein-coding genes mapped to 186 KEGG pathways from 148 PDAC samples. Kernel machine regression was used to model pathway effects on survival outcomes, incorporating hierarchical gene-pathway structures. Model parameters were estimated using the alternating least squares algorithm, and the significance of pathways was assessed through a permutation test.

RESULTS

HisCoM-KernelS identified several pathways significantly associated with pancreatic cancer survival, including those corroborated by previous studies. HisCoM-KernelS, especially with the Gaussian kernel, showed a better balance of detection rate and number of significant pathways compared to four other existing pathway-based methods: HisCoM-PAGE, Global Test, GSEA, and CoxKM.

CONCLUSION

HisCoM-KernelS successfully extends pathway-based analysis to survival outcomes, capturing complex nonlinear gene effects and inter-pathway correlations. Its application to the TCGA PDAC dataset emphasizes its utility in identifying biologically relevant pathways, offering a robust tool for survival phenotype research in high-throughput sequencing data.
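
The permutation test mentioned in the METHODS above can be illustrated with a minimal sketch. It does not reproduce HisCoM-KernelS: the test statistic here is a simple absolute covariance between a per-sample pathway score and an outcome, chosen only for illustration, and the names `permutation_p_value`, `scores`, and `outcomes` are hypothetical.

```python
import random


def permutation_p_value(scores, outcomes, n_perm=999, seed=0):
    """Permutation p-value for association between scores and outcomes.

    Assumption: the statistic is the absolute (unnormalized) covariance,
    a stand-in for whatever pathway statistic a real model would use.
    """
    def abs_cov(x, y):
        mean_x = sum(x) / len(x)
        mean_y = sum(y) / len(y)
        return abs(sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)))

    rng = random.Random(seed)
    observed = abs_cov(scores, outcomes)
    shuffled = list(outcomes)
    hits = 0
    for _ in range(n_perm):
        # Shuffling outcomes breaks any real link to the scores,
        # which simulates the null hypothesis of no association.
        rng.shuffle(shuffled)
        if abs_cov(scores, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    scores = list(range(20))
    outcomes = [2 * s + 1 for s in scores]
    print(permutation_p_value(scores, outcomes))
```

A small p-value means that few shuffled datasets produce a statistic as extreme as the observed one. With many pathways tested at once, a multiple-testing adjustment would normally follow; that step is outside this sketch.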

Atkins S, Einarsson G, Clemmensen L, Ames B. Proximal methods for sparse optimal scoring and discriminant analysis. ADV DATA ANAL CLASSI 2022. [DOI: 10.1007/s11634-022-00530-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Hwangbo S, Lee S, Lee S, Hwang H, Kim I, Park T. Kernel-based hierarchical structural component models for pathway analysis. Bioinformatics 2022;38:3078-3086. [PMID: 35460238 DOI: 10.1093/bioinformatics/btac276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 04/08/2022] [Indexed: 11/14/2022] Open

Bao Y, Liu Y. Varying coefficient linear discriminant analysis for dynamic data. Electron J Stat 2022. [DOI: 10.1214/22-ejs2066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Broś-Konopielko M, Białek A, Oleszczuk-Modzelewska L, Zaleśkiewicz B, Różańska-Walędziak A, Czajkowski K. Nutritional, Anthropometric and Sociodemographic Factors Affecting Fatty Acids Profile of Pregnant Women's Serum at Labour-Chemometric Studies. Nutrients 2021;13:2948. [PMID: 34578833 PMCID: PMC8470577 DOI: 10.3390/nu13092948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 11/16/2022] Open

Li G, Duan X, Wu Z, Wu C. Generalized elastic net optimal scoring problem for feature selection. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.03.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

High-dimensional linear discriminant analysis with moderately clipped LASSO. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2021. [DOI: 10.29220/csam.2021.28.1.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Zhang L, Kim I. Finite mixtures of semiparametric Bayesian survival kernel machine regressions: Application to breast cancer gene pathway subgroup analysis. J R Stat Soc Ser C Appl Stat 2020. [DOI: 10.1111/rssc.12457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Luo S, Chen Z. A procedure of linear discrimination analysis with detected sparsity structure for high-dimensional multi-class classification. J MULTIVARIATE ANAL 2020. [DOI: 10.1016/j.jmva.2020.104641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Chen H, He Y, Ji J, Shi Y. The sparse group lasso for high-dimensional integrative linear discriminant analysis with application to alzheimer's disease prediction. J STAT COMPUT SIM 2020. [DOI: 10.1080/00949655.2020.1800011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Yang X, Tian L, Chen Y, Yang L, Xu S, Wu W. Inverse Projection Representation and Category Contribution Rate for Robust Tumor Recognition. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:1262-1275. [PMID: 30575544 DOI: 10.1109/tcbb.2018.2886334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Lippmann C, Ultsch A, Lötsch J. Computational functional genomics-based reduction of disease-related gene sets to their key components. Bioinformatics 2020;35:2362-2370. [PMID: 30500872 DOI: 10.1093/bioinformatics/bty986] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 09/05/2018] [Accepted: 11/29/2018] [Indexed: 01/21/2023] Open

Abstract

MOTIVATION

The genetic architecture of diseases is becoming increasingly well known. This raises difficulties in picking suitable targets for further research among a growing number of candidates. Although expression-based methods of gene set reduction are applied to laboratory-derived genetic data, the analysis of topical sets of genes gathered from knowledge bases requires a modified approach, as no quantitative information about gene expression is available.

RESULTS

We propose a computational functional genomics-based approach to reducing sets of genes to the most relevant items based on the importance of each gene within the polyhierarchy of biological processes characterizing the disease. Knowledge bases about the biological roles of genes can provide a valid description of traits or diseases represented as a directed acyclic graph (DAG) picturing the polyhierarchy of disease-relevant biological processes. The proposed method uses a gene importance score derived from the location of the gene-related biological processes in the DAG. It attempts to recreate the DAG, and thereby the roles of the original gene set, with the smallest number of genes, taken in descending order of importance. The method achieved precision and recall of over 70% in recreating the components of the DAG characterizing the biological functions of n=540 genes relevant to pain with a subset of only the k=29 best-scoring genes.

CONCLUSIONS

A new method for the reduction of gene sets is shown that reproduces over 70% of the biological processes in which the full gene set is involved while using only ∼5% of the original genes.

AVAILABILITY AND IMPLEMENTATION

The necessary numerical parameters for the calculation of gene importance are implemented in the R package dbtORA at https://github.com/IME-TMP-FFM/dbtORA.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
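
The reduction idea in the RESULTS above (add genes in descending order of importance until the selected subset recovers most of the biological processes) can be sketched as a greedy loop. This is a simplified illustration, not the dbtORA implementation: it tracks only recall over a flat set of terms rather than rebuilding a DAG, and the function name, gene names, term labels, and importance values are all hypothetical.

```python
def reduce_gene_set(gene_terms, importance, target_recall=0.7):
    """Pick genes by descending importance until enough terms are covered.

    gene_terms: dict mapping gene -> set of biological-process terms
    importance: dict mapping gene -> numeric importance score (assumed given)
    Returns the selected genes and the fraction of all terms they cover.
    """
    all_terms = set()
    for terms in gene_terms.values():
        all_terms |= terms
    if not all_terms:
        return [], 0.0

    selected = []
    covered = set()
    ranked = sorted(gene_terms, key=lambda g: importance[g], reverse=True)
    for gene in ranked:
        # Stop as soon as the covered fraction reaches the target.
        if len(covered) / len(all_terms) >= target_recall:
            break
        selected.append(gene)
        covered |= gene_terms[gene]
    return selected, len(covered) / len(all_terms)


if __name__ == "__main__":
    gene_terms = {
        "A": {"t1", "t2", "t3"},
        "B": {"t3", "t4"},
        "C": {"t5"},
        "D": {"t1"},
    }
    importance = {"A": 3.0, "B": 2.0, "C": 1.0, "D": 0.5}
    print(reduce_gene_set(gene_terms, importance))
```

In this toy input, gene A covers three of five terms (0.6), so B is added to reach 0.8 and the loop stops with two of the four genes selected.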

Islam SJ, Kim JH, Topel M, Liu C, Ko YA, Mujahid MS, Sims M, Mubasher M, Ejaz K, Morgan-Billingslea J, Jones K, Waller EK, Jones D, Uppal K, Dunbar SB, Pemu P, Vaccarino V, Searles CD, Baltrus P, Lewis TT, Quyyumi AA, Taylor H. Cardiovascular Risk and Resilience Among Black Adults: Rationale and Design of the MECA Study. J Am Heart Assoc 2020;9:e015247. [PMID: 32340530 PMCID: PMC7428584 DOI: 10.1161/jaha.119.015247] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Abstract

Background

Cardiovascular disease incidence, prevalence, morbidity, and mortality have declined in the past several decades; however, disparities persist among subsets of the population. Notably, blacks have not experienced the same improvements on the whole as whites. Furthermore, frequent reports of relatively poorer health statistics among the black population have led to a broad assumption that black race reliably predicts relatively poorer health outcomes. However, substantial intraethnic and intraracial heterogeneity exists; moreover, individuals with similar risk factors and environmental exposures are often known to experience vastly different cardiovascular health outcomes. Thus, some individuals have good outcomes even in the presence of cardiovascular risk factors, a concept known as resilience.

Methods and Results

The MECA (Morehouse‐Emory Center for Health Equity) Study was designed to investigate the multilevel exposures that contribute to “resilience” in the face of risk for poor cardiovascular health among blacks in the greater Atlanta, GA, metropolitan area. We used census tract data to determine “at‐risk” and “resilient” neighborhoods with high or low prevalence of cardiovascular morbidity and mortality, based on cardiovascular death, hospitalization, and emergency department visits for blacks. More than 1400 individuals from these census tracts assented to demographic, health, and psychosocial questionnaires administered through telephone surveys. Afterwards, ≈500 individuals were recruited to enroll in a clinical study, where risk biomarkers, such as oxidative stress, and inflammatory markers, endothelial progenitor cells, metabolomic and microRNA profiles, and subclinical vascular dysfunction were measured. In addition, comprehensive behavioral questionnaires were collected and ideal cardiovascular health metrics were assessed using the American Heart Association's Life Simple 7 measure. Last, 150 individuals with low Life Simple 7 were recruited and randomized to a behavioral mobile health (eHealth) plus health coach or eHealth only intervention and followed up for improvement.

Conclusions

The MECA Study is investigating socioenvironmental and individual behavioral measures that promote resilience to cardiovascular disease in blacks by assessing biological, functional, and molecular mechanisms.

REGISTRATION

URL: https://www.clinicaltrials.gov. Unique identifier: NCT03308812.

Affiliation(s)

Shabatun J Islam Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Jeong Hwan Kim Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Matthew Topel Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Chang Liu Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA; Department of Epidemiology Rollins School of Public Health Emory University Atlanta GA
Yi-An Ko Department of Biostatistics and Bioinformatics Rollins School of Public Health Emory University Atlanta GA
Mahasin S Mujahid Division of Epidemiology School of Public Health University of California Berkeley CA
Mario Sims Department of Medicine University of Mississippi Medical Center Jackson MS
Mohamed Mubasher Department of Community Health and Preventive Medicine Morehouse School of Medicine Atlanta GA
Kiran Ejaz Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Jan Morgan-Billingslea Department of Community Health and Preventive Medicine Morehouse School of Medicine Atlanta GA
Kia Jones Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Edmund K Waller Department of Hematology and Oncology Winship Cancer Institute Emory University School of Medicine Atlanta GA
Dean Jones Division of Pulmonary, Allergy, Critical Care and Sleep Medicine Department of Medicine Emory University School of Medicine Atlanta GA
Karan Uppal Division of Pulmonary, Allergy, Critical Care and Sleep Medicine Department of Medicine Emory University School of Medicine Atlanta GA
Sandra B Dunbar Nell Hodgson Woodruff School of Nursing Emory University Atlanta GA
Priscilla Pemu Department of Medicine Morehouse School of Medicine Atlanta GA
Viola Vaccarino Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA; Department of Epidemiology Rollins School of Public Health Emory University Atlanta GA
Charles D Searles Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Peter Baltrus Department of Community Health and Preventive Medicine Morehouse School of Medicine Atlanta GA; National Center for Primary Care Morehouse School of Medicine Atlanta GA
Tené T Lewis Department of Epidemiology Rollins School of Public Health Emory University Atlanta GA
Arshed A Quyyumi Division of Cardiology Department of Medicine Emory University School of Medicine Atlanta GA
Herman Taylor Department of Medicine Morehouse School of Medicine Atlanta GA

Tony Cai T, Zhang L. High dimensional linear discriminant analysis: optimality, adaptive algorithm and missing data. J R Stat Soc Series B Stat Methodol 2019. [DOI: 10.1111/rssb.12326] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Jung S, Ahn J, Jeon Y. Penalized Orthogonal Iteration for Sparse Estimation of Generalized Eigenvalue Problem. J Comput Graph Stat 2019. [DOI: 10.1080/10618600.2019.1568014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Gaynanova I, Wang T. Sparse quadratic classification rules via linear dimension reduction. J MULTIVARIATE ANAL 2019;169:278-299. [PMID: 31105355 PMCID: PMC6516858 DOI: 10.1016/j.jmva.2018.09.011] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Liu J, Yu G, Liu Y. Graph-based sparse linear discriminant analysis for high-dimensional classification. J MULTIVARIATE ANAL 2018;171:250-269. [PMID: 31983784 DOI: 10.1016/j.jmva.2018.12.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Li Q, Li L. Integrative linear discriminant analysis with guaranteed error rate improvement. Biometrika 2018;105:917-930. [PMID: 31762476 DOI: 10.1093/biomet/asy047] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

Zhang L, Kim I. Semiparametric Bayesian kernel survival model for evaluating pathway effects. Stat Methods Med Res 2018;28:3301-3317. [PMID: 30289021 DOI: 10.1177/0962280218797360] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Jung S. Continuum directions for supervised dimension reduction. Comput Stat Data Anal 2018. [DOI: 10.1016/j.csda.2018.03.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Wu MC, Kuan PF. A Guide to Illumina BeadChip Data Analysis. Methods Mol Biol 2018;1708:303-330. [PMID: 29224151 DOI: 10.1007/978-1-4939-7481-8_16] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Lu Q, Qiao X. Sparse Fisher's linear discriminant analysis for partially labeled data. Stat Anal Data Min 2017. [DOI: 10.1002/sam.11367] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

One-dimensional vs. two-dimensional based features: Plant identification approach. Journal of Applied Logic 2017. [DOI: 10.1016/j.jal.2016.11.021] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Richardson JB, Lee KY, Mireji P, Enyaru J, Sistrom M, Aksoy S, Zhao H, Caccone A. Genomic analyses of African Trypanozoon strains to assess evolutionary relationships and identify markers for strain identification. PLoS Negl Trop Dis 2017;11:e0005949. [PMID: 28961238 PMCID: PMC5636163 DOI: 10.1371/journal.pntd.0005949] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Revised: 10/11/2017] [Accepted: 09/11/2017] [Indexed: 11/27/2022] Open

Abstract

African trypanosomes (sub-genus Trypanozoon) are eukaryotic parasites that cause disease in either humans or livestock. The development of genomic resources can be of great use to those interested in studying and controlling the spread of these trypanosomes. Here we present a large comparative analysis of Trypanozoon whole genomes, 83 in total, including human and animal infective African trypanosomes: 21 T. brucei brucei, 22 T. b. gambiense, 35 T. b. rhodesiense and 4 T. evansi strains, of which 21 were from Uganda. We constructed a maximum likelihood phylogeny based on 162,210 single nucleotide polymorphisms (SNPs). The three Trypanosoma brucei sub-species and Trypanosoma evansi are not monophyletic, confirming earlier studies that indicated high similarity among Trypanosoma “sub-species”. We also used discriminant analysis of principal components (DAPC) on the same set of SNPs, identifying seven genetic clusters. These clusters do not correspond well with existing taxonomic classifications, in agreement with the phylogenetic analysis. Geographic origin is reflected in both the phylogeny and clustering analysis. Finally, we used sparse linear discriminant analysis to rank SNPs by their informativeness in differentiating the strains in our data set. As few as 84 SNPs can completely distinguish the strains used in our study, and discriminant analysis was still able to detect genetic structure using as few as 10 SNPs. Our results reinforce earlier findings of high genetic similarity among the African Trypanozoon. Despite this, a small subset of SNPs can serve as genetic markers for strain identification or other epidemiological investigations.

Trypanosomes are a major health threat to the people and livestock of Sub-Saharan Africa. Building genomic resources and understanding the genetic structure of these parasites will aid researchers trying to control their spread. To this end, we compared the genomes from 83 trypanosome strains, identifying 162,210 single nucleotide polymorphisms (SNPs) between them. Our analysis shows high genetic similarity between the trypanosomes, and confirms earlier results indicating that the traditional taxonomic classifications do not correspond well with genetic data. Further, we demonstrate that, despite the high genetic similarity, each strain in the study can be distinguished using as few as 84 SNPs, suggesting that a small number of SNPs can be useful for tracking and classifying populations of African trypanosomes.

Tharwat A, Gaber T, Ibrahim A, Hassanien AE. Linear discriminant analysis: A detailed tutorial. AI COMMUN 2017. [DOI: 10.3233/aic-170729] [Citation(s) in RCA: 343] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Inferring Genes and Biological Functions That Are Sensitive to the Severity of Toxicity Symptoms. Int J Mol Sci 2017;18:ijms18040755. [PMID: 28368331 PMCID: PMC5412340 DOI: 10.3390/ijms18040755] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2016] [Revised: 03/23/2017] [Accepted: 03/30/2017] [Indexed: 11/16/2022] Open

He Y, Zhang X, Wang P. Discriminant analysis on high dimensional Gaussian copula model. Stat Probab Lett 2016. [DOI: 10.1016/j.spl.2016.05.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Gaynanova I, Booth JG, Wells MT. Simultaneous Sparse Estimation of Canonical Vectors in the p ≫ N Setting. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2015.1034318] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

He Q, Cai T, Liu Y, Zhao N, Harmon QE, Almli LM, Binder EB, Engel SM, Ressler KJ, Conneely KN, Lin X, Wu MC. Prioritizing individual genetic variants after kernel machine testing using variable selection. Genet Epidemiol 2016;40:722-731. [PMID: 27488097 DOI: 10.1002/gepi.21993] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Revised: 05/28/2016] [Accepted: 06/20/2016] [Indexed: 01/06/2023]

Fan J, Feng Y, Jiang J, Tong X. Feature Augmentation via Nonparametrics and Selection (FANS) in High-Dimensional Classification. J Am Stat Assoc 2016;111:275-287. [PMID: 27185970 DOI: 10.1080/01621459.2015.1005212] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

DC programming and DCA for sparse Fisher linear discriminant analysis. Neural Comput Appl 2016. [DOI: 10.1007/s00521-016-2216-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Ahn J, Jeon Y. Sparse HDLSS discrimination with constrained data piling. Comput Stat Data Anal 2015. [DOI: 10.1016/j.csda.2015.04.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Nguyen T, Khosravi A, Creighton D, Nahavandi S. A novel aggregate gene selection method for microarray data classification. Pattern Recognit Lett 2015. [DOI: 10.1016/j.patrec.2015.03.018] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Zheng Z, Huang X, Chen Z, He X, Liu H, Yang J. Regression analysis of locality preserving projections via sparse penalty. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.01.004] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Mai Q, Zou H. Sparse semiparametric discriminant analysis. J MULTIVARIATE ANAL 2015. [DOI: 10.1016/j.jmva.2014.12.009] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Kolar M, Liu H. Optimal Feature Selection in High-Dimensional Discriminant Analysis. IEEE TRANSACTIONS ON INFORMATION THEORY 2015;61:1063-1083. [PMID: 25620807 PMCID: PMC4302965 DOI: 10.1109/tit.2014.2381241] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Gaynanova I, Kolar M. Optimal variable selection in multi-group sparse discriminant analysis. Electron J Stat 2015. [DOI: 10.1214/15-ejs1064] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Hino H, Fujiki J. ADHERENTLY PENALIZED LINEAR DISCRIMINANT ANALYSIS. JOURNAL JAPANESE SOCIETY OF COMPUTATIONAL STATISTICS 2015. [DOI: 10.5183/jjscs.1412001_219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Zhang Y, Huo L, Lin L, Zeng Y. The Dantzig Discriminant Analysis with High Dimensional Data. COMMUN STAT-THEOR M 2014. [DOI: 10.1080/03610926.2013.878359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Hao N, Dong B, Fan J. Sparsifying the Fisher Linear Discriminant by Rotation. J R Stat Soc Series B Stat Methodol 2014;77:827-851. [PMID: 26512210 DOI: 10.1111/rssb.12092] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Huang H, Huang Y. Improved discriminant sparsity neighborhood preserving embedding for hyperspectral image classification. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2014.01.010] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Zhan X, Epstein MP, Ghosh D. An Adaptive Genetic Association Test Using Double Kernel Machines. STATISTICS IN BIOSCIENCES 2014;7:262-281. [PMID: 26640602 DOI: 10.1007/s12561-014-9116-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Pepe D, Grassi M. Investigating perturbed pathway modules from gene expression data via structural equation models. BMC Bioinformatics 2014;15:132. [PMID: 24885496 PMCID: PMC4052286 DOI: 10.1186/1471-2105-15-132] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Accepted: 04/25/2014] [Indexed: 01/18/2023] Open

Abstract

Background

It is currently accepted that the perturbation of complex intracellular networks, rather than the dysregulation of a single gene, is the basis for phenotypical diversity. High-throughput gene expression data allow the investigation of changes in gene expression profiles among different conditions. Recently, many efforts have been made to identify which biological pathways are perturbed, given a list of differentially expressed genes (DEGs). In order to understand these mechanisms, it is necessary to unveil the variation of genes in relation to each other, considering the different phenotypes. In this paper, we illustrate a pipeline, based on Structural Equation Modeling (SEM), that allows the investigation of pathway modules, considering not only deregulated genes but also the connections between the perturbed ones.

Results

The procedure was tested on microarray experiments relating to two neurological diseases: frontotemporal lobar degeneration with ubiquitinated inclusions (FTLD-U) and multiple sclerosis (MS). Starting from DEGs and dysregulated biological pathways, a model for each pathway was generated using information from biological databases, in order to specify how DEGs were connected in a causal structure. Subsequently, SEM analysis tested whether pathways differ globally between groups and for specific path relationships. The results confirmed the importance of certain genes in the analyzed diseases, and unveiled which connections are modified among them.

Conclusions

We propose a framework to perform differential gene expression analysis on microarray data based on SEM, which is able to: 1) find relevant genes and perturbed biological pathways, investigating putative sub-pathway models based on the concept of disease module; 2) test and improve the generated models; 3) detect a differential expression level of one gene, and differential connection between two genes. This could shed light, not only on the mechanisms affecting variations in gene expression, but also on the causes of gene-gene relationship modifications in diseased phenotypes.

Cai H, Ruan P, Ng M, Akutsu T. Feature weight estimation for gene selection: a local hyperlinear learning approach. BMC Bioinformatics 2014;15:70. [PMID: 24625071 PMCID: PMC4007530 DOI: 10.1186/1471-2105-15-70] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Accepted: 03/06/2014] [Indexed: 11/10/2022] Open

Chi YY, Gribbin MJ, Johnson JL, Muller KE. Power calculation for overall hypothesis testing with high-dimensional commensurate outcomes. Stat Med 2014;33:812-27. [PMID: 24122945 PMCID: PMC4072336 DOI: 10.1002/sim.5986] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2012] [Revised: 08/19/2013] [Accepted: 08/21/2013] [Indexed: 11/07/2022]

An J, Pan Y, Yan Z, Li W, Cui J, Yuan J, Tian L, Xing R, Lu Y. MiR-23a in amplified 19p13.13 loci targets metallothionein 2A and promotes growth in gastric cancer cells. J Cell Biochem 2013;114:2160-9. [PMID: 23553990 DOI: 10.1002/jcb.24565] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2012] [Accepted: 03/28/2013] [Indexed: 12/19/2022]

Wang C, Cao L, Miao B. Optimal feature selection for sparse linear discriminant analysis and its applications in gene expression data. Comput Stat Data Anal 2013. [DOI: 10.1016/j.csda.2013.04.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

Mai Q, Zou H. A Note On the Connection and Equivalence of Three Sparse Linear Discriminant Analysis Methods. Technometrics 2013. [DOI: 10.1080/00401706.2012.746208] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Mai Q. A review of discriminant analysis in high dimensions. Wiley Interdiscip Rev Comput Stat 2013. [DOI: 10.1002/wics.1257] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Yu T, Bai Y. Analyzing LC/MS metabolic profiling data in the context of existing metabolic networks. Curr Metabolomics 2012;1:83-91. [PMID: 24010053 DOI: 10.2174/2213235x11301010084] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]