Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Manor O, Segal E. Robust prediction of expression differences among human individuals using only genotype information. PLoS Genet 2013;9:e1003396. [PMID: 23555302 PMCID: PMC3610805 DOI: 10.1371/journal.pgen.1003396] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Accepted: 01/24/2013] [Indexed: 11/25/2022] Open

For:	Manor O, Segal E. Robust prediction of expression differences among human individuals using only genotype information. PLoS Genet 2013;9:e1003396. [PMID: 23555302 PMCID: PMC3610805 DOI: 10.1371/journal.pgen.1003396] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Accepted: 01/24/2013] [Indexed: 11/25/2022] Open

Number

Cited by Other Article(s)

Ye X, Yang S, Tu J, Xu L, Wang Y, Chen H, Yu R, Huang P. Leveraging baseline transcriptional features and information from single-cell data to power the prediction of influenza vaccine response. Front Cell Infect Microbiol 2024;14:1243586. [PMID: 38384303 PMCID: PMC10879619 DOI: 10.3389/fcimb.2024.1243586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 01/11/2024] [Indexed: 02/23/2024] Open

Abstract

Introduction

Vaccination is still the primary means for preventing influenza virus infection, but the protective effects vary greatly among individuals. Identifying individuals at risk of low response to influenza vaccination is important. This study aimed to explore improved strategies for constructing predictive models of influenza vaccine response using gene expression data.

Methods

We first used gene expression and immune response data from the Immune Signatures Data Resource (IS2) to define influenza vaccine response-related transcriptional expression and alteration features at different time points across vaccination via differential expression analysis. Then, we mapped these features to single-cell resolution using additional published single-cell data to investigate the possible mechanism. Finally, we explored the potential of these identified transcriptional features in predicting influenza vaccine response. We used several modeling strategies and also attempted to leverage the information from single-cell RNA sequencing (scRNA-seq) data to optimize the predictive models.

Results

The results showed that models based on genes showing differential expression (DEGs) or fold change (DFGs) at day 7 post-vaccination performed the best in internal validation, while models based on DFGs had a better performance in external validation than those based on DEGs. In addition, incorporating baseline predictors could improve the performance of models based on days 1-3, while the model based on the expression profile of plasma cells deconvoluted from the model that used DEGs at day 7 as predictors showed an improved performance in external validation.

Conclusion

Our study emphasizes the value of using combination modeling strategy and leveraging information from single-cell levels in constructing influenza vaccine response predictive models.

Collapse

Yao S, Zhang X, Zou SC, Zhu Y, Li B, Kuang WP, Guo Y, Li XS, Li L, Wang XY. A transcriptome-wide association study identifies susceptibility genes for Parkinson's disease. NPJ Parkinsons Dis 2021;7:79. [PMID: 34504106 PMCID: PMC8429416 DOI: 10.1038/s41531-021-00221-7] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 08/10/2021] [Indexed: 12/19/2022] Open

Zhu D, Yao S, Wu H, Ke X, Zhou X, Geng S, Dong S, Chen H, Yang T, Cheng Y, Guo Y. A transcriptome-wide association study identifies novel susceptibility genes for psoriasis. Hum Mol Genet 2021;31:300-308. [PMID: 34409462 DOI: 10.1093/hmg/ddab237] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 08/09/2021] [Accepted: 08/10/2021] [Indexed: 01/17/2023] Open

Affiliation(s)

Dongli Zhu Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Shi Yao Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China.,National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, 710004, P. R. China
Hao Wu Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Xin Ke Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Xiaorong Zhou Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Songmei Geng Department of Dermatology, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, 710004, P. R. China
Shanshan Dong Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Hao Chen Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China.,Research Institute of Xi'an Jiaotong University, Hangzhou, Zhejiang, 311215, P.R. China
Tielin Yang Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China.,National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, 710004, P. R. China
Ying Cheng Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China
Yan Guo Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, P. R. China.,National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi'an Jiaotong University, Xi'an, Shaanxi, 710004, P. R. China

Collapse

Zhang J, Lu H, Zhang S, Wang T, Zhao H, Guan F, Zeng P. Leveraging Methylation Alterations to Discover Potential Causal Genes Associated With the Survival Risk of Cervical Cancer in TCGA Through a Two-Stage Inference Approach. Front Genet 2021;12:667877. [PMID: 34149809 PMCID: PMC8206792 DOI: 10.3389/fgene.2021.667877] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 04/19/2021] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Multiple genes were previously identified to be associated with cervical cancer; however, the genetic architecture of cervical cancer remains unknown and many potential causal genes are yet to be discovered.

METHODS

To explore potential causal genes related to cervical cancer, a two-stage causal inference approach was proposed within the framework of Mendelian randomization, where the gene expression was treated as exposure, with methylations located within the promoter regions of genes serving as instrumental variables. Five prediction models were first utilized to characterize the relationship between the expression and methylations for each gene; then, the methylation-regulated gene expression (MReX) was obtained and the association was evaluated via Cox mixed-effect model based on MReX. We further implemented the aggregated Cauchy association test (ACAT) combination to take advantage of respective strengths of these prediction models while accounting for dependency among the p-values.

RESULTS

A total of 14 potential causal genes were discovered to be associated with the survival risk of cervical cancer in TCGA when the five prediction models were separately employed. The total number of potential causal genes was brought to 23 when conducting ACAT. Some of the newly discovered genes may be novel (e.g., YJEFN3, SPATA5L1, IMMP1L, C5orf55, PPIP5K2, ZNF330, CRYZL1, PPM1A, ESCO2, ZNF605, ZNF225, ZNF266, FICD, and OSTC). Functional analyses showed that these genes were enriched in tumor-associated pathways. Additionally, four genes (i.e., COL6A1, SYDE1, ESCO2, and GIPC1) were differentially expressed between tumor and normal tissues.

CONCLUSION

Our study discovered promising candidate genes that were causally associated with the survival risk of cervical cancer and thus provided new insights into the genetic etiology of cervical cancer.

Collapse

Banerjee S, Simonetti FL, Detrois KE, Kaphle A, Mitra R, Nagial R, Söding J. Tejaas: reverse regression increases power for detecting trans-eQTLs. Genome Biol 2021;22:142. [PMID: 33957961 PMCID: PMC8101255 DOI: 10.1186/s13059-021-02361-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 04/22/2021] [Indexed: 12/18/2022] Open

Okoro PC, Schubert R, Guo X, Johnson WC, Rotter JI, Hoeschele I, Liu Y, Im HK, Luke A, Dugas LR, Wheeler HE. Transcriptome prediction performance across machine learning models and diverse ancestries. HGG ADVANCES 2021;2:100019. [PMID: 33937878 PMCID: PMC8087249 DOI: 10.1016/j.xhgg.2020.100019] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Yao S, Wu H, Liu TT, Wang JH, Ding JM, Guo J, Rong Y, Ke X, Hao RH, Dong SS, Yang TL, Guo Y. Epigenetic Element-Based Transcriptome-Wide Association Study Identifies Novel Genes for Bipolar Disorder. Schizophr Bull 2021;47:1642-1652. [PMID: 33772305 PMCID: PMC8530404 DOI: 10.1093/schbul/sbab023] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Affiliation(s)

Shi Yao National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi 710004, P. R. China,Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Hao Wu Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Tong-Tong Liu Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Jia-Hao Wang Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Jing-Miao Ding Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Jing Guo Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Yu Rong Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Xin Ke Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Ruo-Han Hao Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Shan-Shan Dong Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Tie-Lin Yang National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi 710004, P. R. China,Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China
Yan Guo National and Local Joint Engineering Research Center of Biodiagnosis and Biotherapy, The Second Affiliated Hospital, Xi’an Jiaotong University, Xi’an, Shaanxi 710004, P. R. China,Key Laboratory of Biomedical Information Engineering of Ministry of Education, Biomedical Informatics & Genomics Center, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi 710049, P. R. China,To whom correspondence should be addressed; tel: +86-29-62818386, fax: +86-29-62818386, e-mail:

Collapse

Zeng P, Dai J, Jin S, Zhou X. Aggregating multiple expression prediction models improves the power of transcriptome-wide association studies. Hum Mol Genet 2021;30:939-951. [PMID: 33615361 DOI: 10.1093/hmg/ddab056] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 02/10/2021] [Accepted: 02/15/2021] [Indexed: 12/11/2022] Open

Abstract

Transcriptome-wide association study (TWAS) is an important integrative method for identifying genes that are causally associated with phenotypes. A key step of TWAS involves the construction of expression prediction models for every gene in turn using its cis-SNPs as predictors. Different TWAS methods rely on different models for gene expression prediction, and each such model makes a distinct modeling assumption that is often suitable for a particular genetic architecture underlying expression. However, the genetic architectures underlying gene expression vary across genes throughout the transcriptome. Consequently, different TWAS methods may be beneficial in detecting genes with distinct genetic architectures. Here, we develop a new method, HMAT, which aggregates TWAS association evidence obtained across multiple gene expression prediction models by leveraging the harmonic mean P-value combination strategy. Because each expression prediction model is suited to capture a particular genetic architecture, aggregating TWAS associations across prediction models as in HMAT improves accurate expression prediction and enables subsequent powerful TWAS analysis across the transcriptome. A key feature of HMAT is its ability to accommodate the correlations among different TWAS test statistics and produce calibrated P-values after aggregation. Through numerical simulations, we illustrated the advantage of HMAT over commonly used TWAS methods as well as ad hoc P-value combination rules such as Fisher's method. We also applied HMAT to analyze summary statistics of nine common diseases. In the real data applications, HMAT was on average 30.6% more powerful compared to the next best method, detecting many new disease-associated genes that were otherwise not identified by existing TWAS approaches. In conclusion, HMAT represents a flexible and powerful TWAS method that enjoys robust performance across a range of genetic architectures underlying gene expression.

Collapse

Alpay BA, Demetci P, Istrail S, Aguiar D. Combinatorial and statistical prediction of gene expression from haplotype sequence. Bioinformatics 2020;36:i194-i202. [PMID: 32657373 PMCID: PMC7355230 DOI: 10.1093/bioinformatics/btaa318] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Abstract

MOTIVATION

Genome-wide association studies (GWAS) have discovered thousands of significant genetic effects on disease phenotypes. By considering gene expression as the intermediary between genotype and disease phenotype, expression quantitative trait loci studies have interpreted many of these variants by their regulatory effects on gene expression. However, there remains a considerable gap between genotype-to-gene expression association and genotype-to-gene expression prediction. Accurate prediction of gene expression enables gene-based association studies to be performed post hoc for existing GWAS, reduces multiple testing burden, and can prioritize genes for subsequent experimental investigation.

RESULTS

In this work, we develop gene expression prediction methods that relax the independence and additivity assumptions between genetic markers. First, we consider gene expression prediction from a regression perspective and develop the HAPLEXR algorithm which combines haplotype clusterings with allelic dosages. Second, we introduce the new gene expression classification problem, which focuses on identifying expression groups rather than continuous measurements; we formalize the selection of an appropriate number of expression groups using the principle of maximum entropy. Third, we develop the HAPLEXD algorithm that models haplotype sharing with a modified suffix tree data structure and computes expression groups by spectral clustering. In both models, we penalize model complexity by prioritizing genetic clusters that indicate significant effects on expression. We compare HAPLEXR and HAPLEXD with three state-of-the-art expression prediction methods and two novel logistic regression approaches across five GTEx v8 tissues. HAPLEXD exhibits significantly higher classification accuracy overall; HAPLEXR shows higher prediction accuracy on approximately half of the genes tested and the largest number of best predicted genes (r2>0.1) among all methods. We show that variant and haplotype features selected by HAPLEXR are smaller in size than competing methods (and thus more interpretable) and are significantly enriched in functional annotations related to gene regulation. These results demonstrate the importance of explicitly modeling non-dosage dependent and intragenic epistatic effects when predicting expression.

AVAILABILITY AND IMPLEMENTATION

Source code and binaries are freely available at https://github.com/rapturous/HAPLEX.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Shi W, Fornes O, Wasserman WW. Gene expression models based on transcription factor binding events confer insight into functional cis-regulatory variants. Bioinformatics 2020;35:2610-2617. [PMID: 30541050 PMCID: PMC6662294 DOI: 10.1093/bioinformatics/bty992] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Revised: 10/17/2018] [Accepted: 12/10/2018] [Indexed: 01/03/2023] Open

Petty LE, Highland HM, Gamazon ER, Hu H, Karhade M, Chen HH, de Vries PS, Grove ML, Aguilar D, Bell GI, Huff CD, Hanis CL, Doddapaneni H, Munzy DM, Gibbs RA, Ma J, Parra EJ, Cruz M, Valladares-Salgado A, Arking DE, Barbeira A, Im HK, Morrison AC, Boerwinkle E, Below JE. Functionally oriented analysis of cardiometabolic traits in a trans-ethnic sample. Hum Mol Genet 2019;28:1212-1224. [PMID: 30624610 PMCID: PMC6423424 DOI: 10.1093/hmg/ddy435] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Revised: 11/13/2018] [Accepted: 11/20/2018] [Indexed: 01/02/2023] Open

Affiliation(s)

Lauren E Petty Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.,Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Heather M Highland Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA.,Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA
Eric R Gamazon Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.,Clare Hall, University of Cambridge, Cambridge, UK
Hao Hu Department of Epidemiology, MD Anderson Cancer Center, Houston, TX, USA
Mandar Karhade Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Hung-Hsin Chen Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.,Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Paul S de Vries Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Megan L Grove Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
David Aguilar Department of Cardiology, Baylor College of Medicine Houston, TX, USA
Graeme I Bell Departments of Medicine and Human Genetics, The University of Chicago, Chicago, IL, USA
Chad D Huff Department of Epidemiology, MD Anderson Cancer Center, Houston, TX, USA
Craig L Hanis Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
HarshaVardhan Doddapaneni Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Donna M Munzy Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Richard A Gibbs Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
Jianzhong Ma Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Esteban J Parra Department of Anthropology, University of Toronto at Mississauga, Mississauga, Ontario, Canada
Miguel Cruz Unidad de Investigación Médica en Bioquímica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, IMSS, Mexico City, Mexico
Adan Valladares-Salgado Unidad de Investigación Médica en Bioquímica, Hospital de Especialidades, Centro Médico Nacional Siglo XXI, IMSS, Mexico City, Mexico
Dan E Arking McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Alvaro Barbeira Section of Genetic Medicine, Department of Medicine, University of Chicago, IL, USA
Hae Kyung Im Section of Genetic Medicine, Department of Medicine, University of Chicago, IL, USA
Alanna C Morrison Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Eric Boerwinkle Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Jennifer E Below Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA.,Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA

Collapse

Cannon ME, Mohlke KL. Deciphering the Emerging Complexities of Molecular Mechanisms at GWAS Loci. Am J Hum Genet 2018;103:637-653. [PMID: 30388398 PMCID: PMC6218604 DOI: 10.1016/j.ajhg.2018.10.001] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Mogil LS, Andaleon A, Badalamenti A, Dickinson SP, Guo X, Rotter JI, Johnson WC, Im HK, Liu Y, Wheeler HE. Genetic architecture of gene expression traits across diverse populations. PLoS Genet 2018;14:e1007586. [PMID: 30096133 PMCID: PMC6105030 DOI: 10.1371/journal.pgen.1007586] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Revised: 08/22/2018] [Accepted: 07/24/2018] [Indexed: 01/14/2023] Open

Abstract

For many complex traits, gene regulation is likely to play a crucial mechanistic role. How the genetic architectures of complex traits vary between populations and subsequent effects on genetic prediction are not well understood, in part due to the historical paucity of GWAS in populations of non-European ancestry. We used data from the MESA (Multi-Ethnic Study of Atherosclerosis) cohort to characterize the genetic architecture of gene expression within and between diverse populations. Genotype and monocyte gene expression were available in individuals with African American (AFA, n = 233), Hispanic (HIS, n = 352), and European (CAU, n = 578) ancestry. We performed expression quantitative trait loci (eQTL) mapping in each population and show genetic correlation of gene expression depends on shared ancestry proportions. Using elastic net modeling with cross validation to optimize genotypic predictors of gene expression in each population, we show the genetic architecture of gene expression for most predictable genes is sparse. We found the best predicted gene in each population, TACSTD2 in AFA and CHURC1 in CAU and HIS, had similar prediction performance across populations with R2 > 0.8 in each population. However, we identified a subset of genes that are well-predicted in one population, but poorly predicted in another. We show these differences in predictive performance are due to allele frequency differences between populations. Using genotype weights trained in MESA to predict gene expression in independent populations showed that a training set with ancestry similar to the test set is better at predicting gene expression in test populations, demonstrating an urgent need for diverse population sampling in genomics. Our predictive models and performance statistics in diverse cohorts are made publicly available for use in transcriptome mapping methods at https://github.com/WheelerLab/DivPop.

Collapse

Affiliation(s)

Lauren S. Mogil Department of Biology, Loyola University Chicago, Chicago, Illinois, United States of America
Angela Andaleon Department of Biology, Loyola University Chicago, Chicago, Illinois, United States of America Program in Bioinformatics, Loyola University Chicago, Chicago, Illinois, United States of America
Alexa Badalamenti Program in Bioinformatics, Loyola University Chicago, Chicago, Illinois, United States of America
Scott P. Dickinson Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
Xiuqing Guo Institute for Translational Genomics and Population Sciences, Los Angeles Biomedical Research Institute and Department of Pediatrics at Harbor-UCLA Medical Center, Torrance, California, United States of America
Jerome I. Rotter Institute for Translational Genomics and Population Sciences, Los Angeles Biomedical Research Institute and Department of Pediatrics at Harbor-UCLA Medical Center, Torrance, California, United States of America
W. Craig Johnson Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
Hae Kyung Im Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, Illinois, United States of America
Yongmei Liu Department of Epidemiology & Prevention, Wake Forest School of Medicine, Winston-Salem, North Carolina, United States of America
Heather E. Wheeler Department of Biology, Loyola University Chicago, Chicago, Illinois, United States of America Program in Bioinformatics, Loyola University Chicago, Chicago, Illinois, United States of America Department of Computer Science, Loyola University Chicago, Chicago, Illinois, United States of America Department of Public Health Sciences, Stritch School of Medicine, Loyola University Chicago, Maywood, Illinois, United States of America

Collapse

Heinig M. Using Gene Expression to Annotate Cardiovascular GWAS Loci. Front Cardiovasc Med 2018;5:59. [PMID: 29922679 PMCID: PMC5996083 DOI: 10.3389/fcvm.2018.00059] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 05/15/2018] [Indexed: 01/27/2023] Open

Barbeira AN, Dickinson SP, Bonazzola R, Zheng J, Wheeler HE, Torres JM, Torstenson ES, Shah KP, Garcia T, Edwards TL, Stahl EA, Huckins LM, Nicolae DL, Cox NJ, Im HK. Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics. Nat Commun 2018;9:1825. [PMID: 29739930 PMCID: PMC5940825 DOI: 10.1038/s41467-018-03621-1] [Citation(s) in RCA: 589] [Impact Index Per Article: 98.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2017] [Accepted: 12/27/2017] [Indexed: 12/25/2022] Open

Affiliation(s)

Alvaro N Barbeira Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Scott P Dickinson Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Rodrigo Bonazzola Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Jiamao Zheng Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Heather E Wheeler Department of Biology, Loyola University Chicago, Chicago, IL, 60660, USA.,Department of Computer Science, Loyola University Chicago, Chicago, IL, 60660, USA
Jason M Torres Committee on Molecular Metabolism and Nutrition, The University of Chicago, Chicago, IL, 60637, USA
Eric S Torstenson Vanderbilt Genetic Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Kaanan P Shah Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Tzintzuni Garcia Center for Research Informatics, The University of Chicago, Chicago, IL, 60615, USA
Todd L Edwards Division of Epidemiology, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Eli A Stahl Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, NYC, NY, 10029, USA.,Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, NYC, NY, 10029, USA
Laura M Huckins Division of Psychiatric Genomics, Icahn School of Medicine at Mount Sinai, NYC, NY, 10029, USA.,Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, NYC, NY, 10029, USA

Dan L Nicolae Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Nancy J Cox Vanderbilt Genetic Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Hae Kyung Im Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA.

Collapse

Xie R, Wen J, Quitadamo A, Cheng J, Shi X. A deep auto-encoder model for gene expression prediction. BMC Genomics 2017;18:845. [PMID: 29219072 PMCID: PMC5773895 DOI: 10.1186/s12864-017-4226-0] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Zeng P, Wang T, Huang S. Cis-SNPs Set Testing and PrediXcan Analysis for Gene Expression Data using Linear Mixed Models. Sci Rep 2017;7:15237. [PMID: 29127305 PMCID: PMC5681585 DOI: 10.1038/s41598-017-15055-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Accepted: 10/19/2017] [Indexed: 12/21/2022] Open

Zeng P, Zhou X, Huang S. Prediction of gene expression with cis-SNPs using mixed models and regularization methods. BMC Genomics 2017;18:368. [PMID: 28490319 PMCID: PMC5425981 DOI: 10.1186/s12864-017-3759-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2016] [Accepted: 05/03/2017] [Indexed: 12/25/2022] Open

Abstract

Background

It has been shown that gene expression in human tissues is heritable, thus predicting gene expression using only SNPs becomes possible. The prediction of gene expression can offer important implications on the genetic architecture of individual functional associated SNPs and further interpretations of the molecular basis underlying human diseases.

Methods

We compared three types of methods for predicting gene expression using only cis-SNPs, including the polygenic model, i.e. linear mixed model (LMM), two sparse models, i.e. Lasso and elastic net (ENET), and the hybrid of LMM and sparse model, i.e. Bayesian sparse linear mixed model (BSLMM). The three kinds of prediction methods have very different assumptions of underlying genetic architectures. These methods were evaluated using simulations under various scenarios, and were applied to the Geuvadis gene expression data.

Results

The simulations showed that these four prediction methods (i.e. Lasso, ENET, LMM and BSLMM) behaved best when their respective modeling assumptions were satisfied, but BSLMM had a robust performance across a range of scenarios. According to R² of these models in the Geuvadis data, the four methods performed quite similarly. We did not observe any clustering or enrichment of predictive genes (defined as genes with R² ≥ 0.05) across the chromosomes, and also did not see there was any clear relationship between the proportion of the predictive genes and the proportion of genes in each chromosome. However, an interesting finding in the Geuvadis data was that highly predictive genes (e.g. R² ≥ 0.30) may have sparse genetic architectures since Lasso, ENET and BSLMM outperformed LMM for these genes; and this observation was validated in another gene expression data. We further showed that the predictive genes were enriched in approximately independent LD blocks.

Conclusions

Gene expression can be predicted with only cis-SNPs using well-developed prediction models and these predictive genes were enriched in some approximately independent LD blocks. The prediction of gene expression can shed some light on the functional interpretation for identified SNPs in GWASs.

Collapse

Maricque BB, Dougherty JD, Cohen BA. A genome-integrated massively parallel reporter assay reveals DNA sequence determinants of cis-regulatory activity in neural cells. Nucleic Acids Res 2017;45:e16. [PMID: 28204611 PMCID: PMC5389540 DOI: 10.1093/nar/gkw942] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Revised: 10/05/2016] [Accepted: 10/11/2016] [Indexed: 11/12/2022] Open

Wheeler HE, Shah KP, Brenner J, Garcia T, Aquino-Michaels K, Cox NJ, Nicolae DL, Im HK. Survey of the Heritability and Sparse Architecture of Gene Expression Traits across Human Tissues. PLoS Genet 2016;12:e1006423. [PMID: 27835642 PMCID: PMC5106030 DOI: 10.1371/journal.pgen.1006423] [Citation(s) in RCA: 127] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 10/12/2016] [Indexed: 11/19/2022] Open

Abstract

Understanding the genetic architecture of gene expression traits is key to elucidating the underlying mechanisms of complex traits. Here, for the first time, we perform a systematic survey of the heritability and the distribution of effect sizes across all representative tissues in the human body. We find that local h2 can be relatively well characterized with 59% of expressed genes showing significant h2 (FDR < 0.1) in the DGN whole blood cohort. However, current sample sizes (n ≤ 922) do not allow us to compute distal h2. Bayesian Sparse Linear Mixed Model (BSLMM) analysis provides strong evidence that the genetic contribution to local expression traits is dominated by a handful of genetic variants rather than by the collective contribution of a large number of variants each of modest size. In other words, the local architecture of gene expression traits is sparse rather than polygenic across all 40 tissues (from DGN and GTEx) examined. This result is confirmed by the sparsity of optimal performing gene expression predictors via elastic net modeling. To further explore the tissue context specificity, we decompose the expression traits into cross-tissue and tissue-specific components using a novel Orthogonal Tissue Decomposition (OTD) approach. Through a series of simulations we show that the cross-tissue and tissue-specific components are identifiable via OTD. Heritability and sparsity estimates of these derived expression phenotypes show similar characteristics to the original traits. Consistent properties relative to prior GTEx multi-tissue analysis results suggest that these traits reflect the expected biology. Finally, we apply this knowledge to develop prediction models of gene expression traits for all tissues. The prediction models, heritability, and prediction performance R2 for original and decomposed expression phenotypes are made publicly available (https://github.com/hakyimlab/PrediXcan).

Collapse

Gamazon ER, Wheeler HE, Shah KP, Mozaffari SV, Aquino-Michaels K, Carroll RJ, Eyler AE, Denny JC, Nicolae DL, Cox NJ, Kyung Im H. A gene-based association method for mapping traits using reference transcriptome data. Nat Genet 2015;47:1091-8. [PMID: 26258848 PMCID: PMC4552594 DOI: 10.1038/ng.3367] [Citation(s) in RCA: 1055] [Impact Index Per Article: 117.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Accepted: 07/06/2015] [Indexed: 12/14/2022]

Albert FW, Kruglyak L. The role of regulatory variation in complex traits and disease. Nat Rev Genet 2015;16:197-212. [DOI: 10.1038/nrg3891] [Citation(s) in RCA: 684] [Impact Index Per Article: 76.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Manor O, Segal E. GenoExp: a web tool for predicting gene expression levels from single nucleotide polymorphisms. ACTA ACUST UNITED AC 2015;31:1848-50. [PMID: 25637557 DOI: 10.1093/bioinformatics/btv050] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 01/26/2015] [Indexed: 12/16/2022]

Okser S, Pahikkala T, Airola A, Salakoski T, Ripatti S, Aittokallio T. Regularized machine learning in the genetic prediction of complex traits. PLoS Genet 2014;10:e1004754. [PMID: 25393026 PMCID: PMC4230844 DOI: 10.1371/journal.pgen.1004754] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open

Levo M, Segal E. In pursuit of design principles of regulatory sequences. Nat Rev Genet 2014;15:453-68. [PMID: 24913666 DOI: 10.1038/nrg3684] [Citation(s) in RCA: 153] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]