Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ju JH, Shenoy SA, Crystal RG, Mezey JG. An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci. PLoS Comput Biol 2017;13:e1005537. [PMID: 28505156 PMCID: PMC5448815 DOI: 10.1371/journal.pcbi.1005537] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 05/30/2017] [Accepted: 04/28/2017] [Indexed: 11/19/2022] Open

For:	Ju JH, Shenoy SA, Crystal RG, Mezey JG. An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci. PLoS Comput Biol 2017;13:e1005537. [PMID: 28505156 PMCID: PMC5448815 DOI: 10.1371/journal.pcbi.1005537] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 05/30/2017] [Accepted: 04/28/2017] [Indexed: 11/19/2022] Open

Number

Cited by Other Article(s)

Ravichandran P, Parsana P, Keener R, Hansen KD, Battle A. Aggregation of recount3 RNA-seq data improves inference of consensus and tissue-specific gene co-expression networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.20.576447. [PMID: 38328080 PMCID: PMC10849507 DOI: 10.1101/2024.01.20.576447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]

Abstract

Background

Gene co-expression networks (GCNs) describe relationships among expressed genes key to maintaining cellular identity and homeostasis. However, the small sample size of typical RNA-seq experiments which is several orders of magnitude fewer than the number of genes is too low to infer GCNs reliably. recount3, a publicly available dataset comprised of 316,443 uniformly processed human RNA-seq samples, provides an opportunity to improve power for accurate network reconstruction and obtain biological insight from the resulting networks.

Results

We compared alternate aggregation strategies to identify an optimal workflow for GCN inference by data aggregation and inferred three consensus networks: a universal network, a non-cancer network, and a cancer network in addition to 27 tissue context-specific networks. Central network genes from our consensus networks were enriched for evolutionarily constrained genes and ubiquitous biological pathways, whereas central context-specific network genes included tissue-specific transcription factors and factorization based on the hubs led to clustering of related tissue contexts. We discovered that annotations corresponding to context-specific networks inferred from aggregated data were enriched for trait heritability beyond known functional genomic annotations and were significantly more enriched when we aggregated over a larger number of samples.

Conclusion

This study outlines best practices for network GCN inference and evaluation by data aggregation. We recommend estimating and regressing confounders in each data set before aggregation and prioritizing large sample size studies for GCN reconstruction. Increased statistical power in inferring context-specific networks enabled the derivation of variant annotations that were enriched for concordant trait heritability independent of functional genomic annotations that are context-agnostic. While we observed strictly increasing held-out log-likelihood with data aggregation, we noted diminishing marginal improvements. Future directions aimed at alternate methods for estimating confounders and integrating orthogonal information from modalities such as Hi-C and ChIP-seq can further improve GCN inference.

Collapse

Sun G, Yu H, Wang P, Lopez-Guerrero M, Mural RV, Mizero ON, Grzybowski M, Song B, van Dijk K, Schachtman DP, Zhang C, Schnable JC. A role for heritable transcriptomic variation in maize adaptation to temperate environments. Genome Biol 2023;24:55. [PMID: 36964601 PMCID: PMC10037803 DOI: 10.1186/s13059-023-02891-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 03/06/2023] [Indexed: 03/26/2023] Open

Abstract

Background

Transcription bridges genetic information and phenotypes. Here, we evaluated how changes in transcriptional regulation enable maize (Zea mays), a crop originally domesticated in the tropics, to adapt to temperate environments.

Result

We generated 572 unique RNA-seq datasets from the roots of 340 maize genotypes. Genes involved in core processes such as cell division, chromosome organization and cytoskeleton organization showed lower heritability of gene expression, while genes involved in anti-oxidation activity exhibited higher expression heritability. An expression genome-wide association study (eGWAS) identified 19,602 expression quantitative trait loci (eQTLs) associated with the expression of 11,444 genes. A GWAS for alternative splicing identified 49,897 splicing QTLs (sQTLs) for 7614 genes. Genes harboring both cis-eQTLs and cis-sQTLs in linkage disequilibrium were disproportionately likely to encode transcription factors or were annotated as responding to one or more stresses. Independent component analysis of gene expression data identified loci regulating co-expression modules involved in oxidation reduction, response to water deprivation, plastid biogenesis, protein biogenesis, and plant-pathogen interaction. Several genes involved in cell proliferation, flower development, DNA replication, and gene silencing showed lower gene expression variation explained by genetic factors between temperate and tropical maize lines. A GWAS of 27 previously published phenotypes identified several candidate genes overlapping with genomic intervals showing signatures of selection during adaptation to temperate environments.

Conclusion

Our results illustrate how maize transcriptional regulatory networks enable changes in transcriptional regulation to adapt to temperate regions.

Supplementary information

The online version contains supplementary material available at 10.1186/s13059-023-02891-3.

Collapse

Affiliation(s)

Guangchao Sun grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Huihui Yu grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA
Peng Wang grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Martha Lopez-Guerrero grid.24434.350000 0004 1937 0060Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA
Ravi V. Mural grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Olivier N. Mizero grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Marcin Grzybowski grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Baoxing Song grid.5386.8000000041936877XInstitute for Genomic Diversity, Cornell University, Ithaca, USA
Karin van Dijk grid.24434.350000 0004 1937 0060Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, USA
Daniel P. Schachtman grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA
Chi Zhang grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, USA
James C. Schnable grid.24434.350000 0004 1937 0060Quantitative Life Sciences Initiative, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, USA grid.24434.350000 0004 1937 0060Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, USA

Collapse

Mokou M, Narayanasamy S, Stroggilos R, Balaur IA, Vlahou A, Mischak H, Frantzi M. A Drug Repurposing Pipeline Based on Bladder Cancer Integrated Proteotranscriptomics Signatures. Methods Mol Biol 2023;2684:59-99. [PMID: 37410228 DOI: 10.1007/978-1-0716-3291-8_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]

Cote AC, Young HE, Huckins LM. Comparison of confound adjustment methods in the construction of gene co-expression networks. Genome Biol 2022;23:44. [PMID: 35115012 PMCID: PMC8812044 DOI: 10.1186/s13059-022-02606-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2021] [Accepted: 01/03/2022] [Indexed: 11/23/2022] Open

Yuan K, Zeng T, Chen L. Interpreting Functional Impact of Genetic Variations by Network QTL for Genotype–Phenotype Association Study. Front Cell Dev Biol 2022;9:720321. [PMID: 35155440 PMCID: PMC8826544 DOI: 10.3389/fcell.2021.720321] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 12/13/2021] [Indexed: 12/18/2022] Open

Abstract

An enormous challenge in the post-genome era is to annotate and resolve the consequences of genetic variation on diverse phenotypes. The genome-wide association study (GWAS) is a well-known method to identify potential genetic loci for complex traits from huge genetic variations, following which it is crucial to identify expression quantitative trait loci (eQTL). However, the conventional eQTL methods usually disregard the systematical role of single-nucleotide polymorphisms (SNPs) or genes, thereby overlooking many network-associated phenotypic determinates. Such a problem motivates us to recognize the network-based quantitative trait loci (QTL), i.e., network QTL (nQTL), which is to detect the cascade association as genotype → network → phenotype rather than conventional genotype → expression → phenotype in eQTL. Specifically, we develop the nQTL framework on the theory and approach of single-sample networks, which can identify not only network traits (e.g., the gene subnetwork associated with genotype) for analyzing complex biological processes but also network signatures (e.g., the interactive gene biomarker candidates screened from network traits) for characterizing targeted phenotype and corresponding subtypes. Our results show that the nQTL framework can efficiently capture associations between SNPs and network traits (i.e., edge traits) in various simulated data scenarios, compared with traditional eQTL methods. Furthermore, we have carried out nQTL analysis on diverse biological and biomedical datasets. Our analysis is effective in detecting network traits for various biological problems and can discover many network signatures for discriminating phenotypes, which can help interpret the influence of nQTL on disease subtyping, disease prognosis, drug response, and pathogen factor association. Particularly, in contrast to the conventional approaches, the nQTL framework could also identify many network traits from human bulk expression data, validated by matched single-cell RNA-seq data in an independent or unsupervised manner. All these results strongly support that nQTL and its detection framework can simultaneously explore the global genotype–network–phenotype associations and the underlying network traits or network signatures with functional impact and importance.

Collapse

Jeng XJ, Rhyne J, Zhang T, Tzeng JY. Effective SNP ranking improves the performance of eQTL mapping. Genet Epidemiol 2020;44:611-619. [PMID: 32216117 DOI: 10.1002/gepi.22293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Revised: 02/21/2020] [Accepted: 03/11/2020] [Indexed: 11/06/2022]

A Multi-Omics Perspective of Quantitative Trait Loci in Precision Medicine. Trends Genet 2020;36:318-336. [PMID: 32294413 DOI: 10.1016/j.tig.2020.01.009] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 01/05/2020] [Accepted: 01/21/2020] [Indexed: 02/07/2023]

Saad MN, Mabrouk MS, Eldeib AM, Shaker OG. Comparative study for haplotype block partitioning methods - Evidence from chromosome 6 of the North American Rheumatoid Arthritis Consortium (NARAC) dataset. PLoS One 2019;13:e0209603. [PMID: 30596705 PMCID: PMC6312333 DOI: 10.1371/journal.pone.0209603] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/07/2018] [Indexed: 11/19/2022] Open

Abstract

Haplotype-based methods compete with “one-SNP-at-a-time” approaches on being preferred for association studies. Chromosome 6 contains most of the known genetic biomarkers for rheumatoid arthritis (RA) disease. Therefore, chromosome 6 serves as a benchmark for the haplotype methods testing. The aim of this study is to test the North American Rheumatoid Arthritis Consortium (NARAC) dataset to find out if haplotype block methods or single-locus approaches alone can sufficiently provide the significant single nucleotide polymorphisms (SNPs) associated with RA. In addition, could we be satisfied with only one method of the haplotype block methods for partitioning chromosome 6 of the NARAC dataset? In the NARAC dataset, chromosome 6 comprises 35,574 SNPs for 2,062 individuals (868 cases, 1,194 controls). Individual SNP approach and three haplotype block methods were applied to the NARAC dataset to identify the RA biomarkers. We employed three haplotype partitioning methods which are confidence interval test (CIT), four gamete test (FGT), and solid spine of linkage disequilibrium (SSLD). P-values after stringent Bonferroni correction for multiple testing were measured to assess the strength of association between the genetic variants and RA susceptibility. Moreover, the block size (in base pairs (bp) and number of SNPs included), number of blocks, percentage of uncovered SNPs by the block method, percentage of significant blocks from the total number of blocks, number of significant haplotypes and SNPs were used to compare among the three haplotype block methods. Individual SNP, CIT, FGT, and SSLD methods detected 432, 1,086, 1,099, and 1,322 associated SNPs, respectively. Each method identified significant SNPs that were not detected by any other method (Individual SNP: 12, FGT: 37, CIT: 55, and SSLD: 189 SNPs). 916 SNPs were discovered by all the three haplotype block methods. 367 SNPs were discovered by the haplotype block methods and the individual SNP approach. The P-values of these 367 SNPs were lower than those of the SNPs uniquely detected by only one method. The 367 SNPs detected by all the methods represent promising candidates for RA susceptibility. They should be further investigated for the European population. A hybrid technique including the four methods should be applied to detect the significant SNPs associated with RA for chromosome 6 of the NARAC dataset. Moreover, SSLD method may be preferred for its favored benefits in case of selecting only one method.

Collapse

Lee C. Genome-Wide Expression Quantitative Trait Loci Analysis Using Mixed Models. Front Genet 2018;9:341. [PMID: 30186313 PMCID: PMC6110903 DOI: 10.3389/fgene.2018.00341] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 08/09/2018] [Indexed: 01/22/2023] Open