Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Barbeira AN, Melia OJ, Liang Y, Bonazzola R, Wang G, Wheeler HE, Aguet F, Ardlie KG, Wen X, Im HK. Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification. Genet Epidemiol 2020;44:854-867. [PMID: 32964524 PMCID: PMC7693040 DOI: 10.1002/gepi.22346] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 06/26/2020] [Accepted: 06/26/2020] [Indexed: 01/01/2023]

For:	Barbeira AN, Melia OJ, Liang Y, Bonazzola R, Wang G, Wheeler HE, Aguet F, Ardlie KG, Wen X, Im HK. Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification. Genet Epidemiol 2020;44:854-867. [PMID: 32964524 PMCID: PMC7693040 DOI: 10.1002/gepi.22346] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 06/26/2020] [Accepted: 06/26/2020] [Indexed: 01/01/2023]

Number

Cited by Other Article(s)

Lona-Durazo F, Omachi K, Fermin D, Eichinger F, Troost JP, Lin MH, Dinsmore IR, Mirshahi T, Chang AR, Miner JH, Paterson AD, Barua M, Gagliano Taliun SA. Association of Genetically Predicted Skipping of COL4A4 Exon 27 with Hematuria and Albuminuria. J Am Soc Nephrol 2024:00001751-990000000-00408. [PMID: 39190490 DOI: 10.1681/asn.0000000000000480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 08/22/2024] [Indexed: 08/29/2024] Open

Affiliation(s)

Frida Lona-Durazo Montreal Heart Institute, Montreal, Quebec, Canada Faculty of Medicine, Université de Montréal, Montreal, Quebec, Canada
Kohei Omachi Division of Nephrology, Washington University School of Medicine, St. Louis, Missouri Department of Molecular Medicine, Graduate School of Pharmaceutical Sciences, Kumamoto University, Kumamoto, Japan
Damian Fermin Division of Nephrology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan
Felix Eichinger Division of Nephrology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan
Jonathan P Troost Michigan Institute for Clinical and Health Research, University of Michigan, Ann Arbor, Michigan
Meei-Hua Lin Division of Nephrology, Washington University School of Medicine, St. Louis, Missouri
Ian R Dinsmore Department of Genomic Health, Geisinger, Danville, Pennsylvania
Tooraj Mirshahi Department of Genomic Health, Geisinger, Danville, Pennsylvania
Alexander R Chang Department of Population Health Sciences, Center for Kidney Health Research, Geisinger, Danville, Pennsylvania Department of Nephrology, Geisinger, Danville, Pennsylvania
Jeffrey H Miner Division of Nephrology, Washington University School of Medicine, St. Louis, Missouri
Andrew D Paterson Divisions of Epidemiology and Biostatistics, Dalla Lana School of Public Health, Toronto, Ontario, Canada Genetics and Genome Biology, Research Institute at The Hospital for Sick Children, Toronto, Ontario, Canada Institute of Medical Sciences, University of Toronto, Toronto, Ontario, Canada
Moumita Barua Institute of Medical Sciences, University of Toronto, Toronto, Ontario, Canada Division of Nephrology, University Health Network, Toronto, Ontario, Canada Department of Medicine, University of Toronto, Toronto, Ontario, Canada Toronto General Hospital Research Institute, Toronto, Ontario, Canada
Sarah A Gagliano Taliun Montreal Heart Institute, Montreal, Quebec, Canada Department of Medicine, Université de Montréal, Montreal, Quebec, Canada Department of Neurosciences, Université de Montréal, Montreal, Quebec, Canada

Collapse

Song S, Wang L, Hou L, Liu JS. Partitioning and aggregating cross-tissue and tissue-specific genetic effects to identify gene-trait associations. Nat Commun 2024;15:5769. [PMID: 38982044 PMCID: PMC11233643 DOI: 10.1038/s41467-024-49924-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 06/25/2024] [Indexed: 07/11/2024] Open

Li JL, McClellan JC, Zhang H, Gao G, Huo D. Multi-tissue transcriptome-wide association studies identified 235 genes for intrinsic subtypes of breast cancer. J Natl Cancer Inst 2024;116:1105-1115. [PMID: 38400758 PMCID: PMC11223833 DOI: 10.1093/jnci/djae041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 01/25/2024] [Accepted: 02/20/2024] [Indexed: 02/26/2024] Open

Abstract

BACKGROUND

Although genome-wide association studies (GWAS) of breast cancer (BC) identified common variants which differ between intrinsic subtypes, genes through which these variants act to impact BC risk have not been fully established. Transcriptome-wide association studies (TWAS) have identified genes associated with overall BC risk, but subtype-specific differences are largely unknown.

METHODS

We performed two multi-tissue TWAS for each BC intrinsic subtype, including an expression-based approach that collated TWAS signals from expression quantitative trait loci (eQTLs) across multiple tissues and a novel splicing-based approach that collated signals from splicing QTLs (sQTLs) across intron clusters and subsequently across tissues. We used summary statistics for five intrinsic subtypes including Luminal A-like, Luminal B-like, Luminal B/HER2-negative-like, HER2-enriched-like, and triple-negative BC, generated from 106 278 BC cases and 91 477 controls in the Breast Cancer Association Consortium.

RESULTS

Overall, we identified 235 genes in 88 loci that were associated with at least one of the five intrinsic subtypes. Most genes were subtype-specific, and many have not been reported in previous TWAS. We discovered common variants that modulate expression of CHEK2 confer increased risk to Luminal A-like BC, in contrast to the viewpoint that CHEK2 primarily harbors rare, penetrant mutations. Additionally, our splicing-based TWAS provided population-level support for MDM4 splice variants that increased the risk of triple-negative BC.

CONCLUSION

Our comprehensive, multi-tissue TWAS corroborated previous GWAS loci for overall BC risk and intrinsic subtypes, while underscoring how common variation that impacts expression and splicing of genes in multiple tissue types can be used to further elucidate the etiology of BC.

Collapse

Gao G, McClellan J, Barbeira AN, Fiorica PN, Li JL, Mu Z, Olopade OI, Huo D, Im HK. A multi-tissue, splicing-based joint transcriptome-wide association study identifies susceptibility genes for breast cancer. Am J Hum Genet 2024;111:1100-1113. [PMID: 38733992 PMCID: PMC11179262 DOI: 10.1016/j.ajhg.2024.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 04/13/2024] [Accepted: 04/15/2024] [Indexed: 05/13/2024] Open

Head ST, Dezem F, Todor A, Yang J, Plummer J, Gayther S, Kar S, Schildkraut J, Epstein MP. Cis- and trans-eQTL TWASs of breast and ovarian cancer identify more than 100 susceptibility genes in the BCAC and OCAC consortia. Am J Hum Genet 2024;111:1084-1099. [PMID: 38723630 PMCID: PMC11179407 DOI: 10.1016/j.ajhg.2024.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 04/11/2024] [Accepted: 04/16/2024] [Indexed: 05/21/2024] Open

Durge AR, Shrimankar DD. DHFS-ECM: Design of a Dual Heuristic Feature Selection-based Ensemble Classification Model for the Identification of Bamboo Species from Genomic Sequences. Curr Genomics 2024;25:185-201. [PMID: 39087000 PMCID: PMC11288165 DOI: 10.2174/0113892029268176240125055419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 01/16/2024] [Accepted: 01/16/2024] [Indexed: 08/02/2024] Open

Abstract

Background

Analyzing genomic sequences plays a crucial role in understanding biological diversity and classifying Bamboo species. Existing methods for genomic sequence analysis suffer from limitations such as complexity, low accuracy, and the need for constant reconfiguration in response to evolving genomic datasets.

Aim

This study addresses these limitations by introducing a novel Dual Heuristic Feature Selection-based Ensemble Classification Model (DHFS-ECM) for the precise identification of Bamboo species from genomic sequences.

Methods

The proposed DHFS-ECM method employs a Genetic Algorithm to perform dual heuristic feature selection. This process maximizes inter-class variance, leading to the selection of informative N-gram feature sets. Subsequently, intra-class variance levels are used to create optimal training and validation sets, ensuring comprehensive coverage of class-specific features. The selected features are then processed through an ensemble classification layer, combining multiple stratification models for species-specific categorization.

Results

Comparative analysis with state-of-the-art methods demonstrate that DHFS-ECM achieves remarkable improvements in accuracy (9.5%), precision (5.9%), recall (8.5%), and AUC performance (4.5%). Importantly, the model maintains its performance even with an increased number of species classes due to the continuous learning facilitated by the Dual Heuristic Genetic Algorithm Model.

Conclusion

DHFS-ECM offers several key advantages, including efficient feature extraction, reduced model complexity, enhanced interpretability, and increased robustness and accuracy through the ensemble classification layer. These attributes make DHFS-ECM a promising tool for real-time clinical applications and a valuable contribution to the field of genomic sequence analysis.

Collapse

McClellan JC, Li JL, Gao G, Huo D. Expression- and splicing-based multi-tissue transcriptome-wide association studies identified multiple genes for breast cancer by estrogen-receptor status. Breast Cancer Res 2024;26:51. [PMID: 38515142 PMCID: PMC10958972 DOI: 10.1186/s13058-024-01809-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 03/14/2024] [Indexed: 03/23/2024] Open

Abstract

BACKGROUND

Although several transcriptome-wide association studies (TWASs) have been performed to identify genes associated with overall breast cancer (BC) risk, only a few TWAS have explored the differences in estrogen receptor-positive (ER+) and estrogen receptor-negative (ER-) breast cancer. Additionally, these studies were based on gene expression prediction models trained primarily in breast tissue, and they did not account for alternative splicing of genes.

METHODS

In this study, we utilized two approaches to perform multi-tissue TWASs of breast cancer by ER subtype: (1) an expression-based TWAS that combined TWAS signals for each gene across multiple tissues and (2) a splicing-based TWAS that combined TWAS signals of all excised introns for each gene across tissues. To perform this TWAS, we utilized summary statistics for ER + BC from the Breast Cancer Association Consortium (BCAC) and for ER- BC from a meta-analysis of BCAC and the Consortium of Investigators of Modifiers of BRCA1 and BRCA2 (CIMBA).

RESULTS

In total, we identified 230 genes in 86 loci that were associated with ER + BC and 66 genes in 29 loci that were associated with ER- BC at a Bonferroni threshold of significance. Of these genes, 2 genes associated with ER + BC at the 1q21.1 locus were located at least 1 Mb from published GWAS hits. For several well-studied tumor suppressor genes such as TP53 and CHEK2 which have historically been thought to impact BC risk through rare, penetrant mutations, we discovered that common variants, which modulate gene expression, may additionally contribute to ER + or ER- etiology.

CONCLUSIONS

Our study comprehensively examined how differences in common variation contribute to molecular differences between ER + and ER- BC and introduces a novel, splicing-based framework that can be used in future TWAS studies.

Collapse

Wittich H, Ardlie K, Taylor KD, Durda P, Liu Y, Mikhaylova A, Gignoux CR, Cho MH, Rich SS, Rotter JI, Manichaikul A, Im HK, Wheeler HE. Transcriptome-wide association study of the plasma proteome reveals cis and trans regulatory mechanisms underlying complex traits. Am J Hum Genet 2024;111:445-455. [PMID: 38320554 PMCID: PMC10940016 DOI: 10.1016/j.ajhg.2024.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 01/12/2024] [Accepted: 01/12/2024] [Indexed: 02/08/2024] Open

Wang T, Yan Z, Zhang Y, Lou Z, Zheng X, Mai D, Wang Y, Shang X, Xiao B, Peng J, Chen J. postGWAS: A web server for deciphering the causality post the genome-wide association studies. Comput Biol Med 2024;171:108108. [PMID: 38359659 DOI: 10.1016/j.compbiomed.2024.108108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/23/2024] [Accepted: 02/04/2024] [Indexed: 02/17/2024]

Affiliation(s)

Tao Wang School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Zhihao Yan School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Yiming Zhang School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China
Zhuofei Lou School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China
Xiaozhu Zheng Department of Anesthesiology, The People's Hospital of Yubei District, Chongqing, 401120, China
DuoDuo Mai School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Yongtian Wang School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Xuequn Shang School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Bing Xiao School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
Jiajie Peng School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China; Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, 710072, China
Jing Chen School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, 710048, China.

Collapse

Head ST, Dezem F, Todor A, Yang J, Plummer J, Gayther S, Kar S, Schildkraut J, Epstein MP. Cis- and trans-eQTL TWAS of breast and ovarian cancer identify more than 100 risk associated genes in the BCAC and OCAC consortia. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.566218. [PMID: 38014246 PMCID: PMC10680675 DOI: 10.1101/2023.11.09.566218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]

Ottensmann L, Tabassum R, Ruotsalainen SE, Gerl MJ, Klose C, Widén E, Simons K, Ripatti S, Pirinen M. Genome-wide association analysis of plasma lipidome identifies 495 genetic associations. Nat Commun 2023;14:6934. [PMID: 37907536 PMCID: PMC10618167 DOI: 10.1038/s41467-023-42532-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 10/13/2023] [Indexed: 11/02/2023] Open

Araujo DS, Nguyen C, Hu X, Mikhaylova AV, Gignoux C, Ardlie K, Taylor KD, Durda P, Liu Y, Papanicolaou G, Cho MH, Rich SS, Rotter JI, Im HK, Manichaikul A, Wheeler HE. Multivariate adaptive shrinkage improves cross-population transcriptome prediction and association studies in underrepresented populations. HGG ADVANCES 2023;4:100216. [PMID: 37869564 PMCID: PMC10589725 DOI: 10.1016/j.xhgg.2023.100216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 06/27/2023] [Indexed: 10/24/2023] Open

Abstract

Transcriptome prediction models built with data from European-descent individuals are less accurate when applied to different populations because of differences in linkage disequilibrium patterns and allele frequencies. We hypothesized that methods that leverage shared regulatory effects across different conditions, in this case, across different populations, may improve cross-population transcriptome prediction. To test this hypothesis, we made transcriptome prediction models for use in transcriptome-wide association studies (TWASs) using different methods (elastic net, joint-tissue imputation [JTI], matrix expression quantitative trait loci [Matrix eQTL], multivariate adaptive shrinkage in R [MASHR], and transcriptome-integrated genetic association resource [TIGAR]) and tested their out-of-sample transcriptome prediction accuracy in population-matched and cross-population scenarios. Additionally, to evaluate model applicability in TWASs, we integrated publicly available multiethnic genome-wide association study (GWAS) summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study and Pan-ancestry genetic analysis of the UK Biobank (PanUKBB) with our developed transcriptome prediction models. In regard to transcriptome prediction accuracy, MASHR models performed better or the same as other methods in both population-matched and cross-population transcriptome predictions. Furthermore, in multiethnic TWASs, MASHR models yielded more discoveries that replicate in both PAGE and PanUKBB across all methods analyzed, including loci previously mapped in GWASs and loci previously not found in GWASs. Overall, our study demonstrates the importance of using methods that benefit from different populations' effect size estimates in order to improve TWASs for multiethnic or underrepresented populations.

Collapse

Affiliation(s)

Daniel S. Araujo Program in Bioinformatics, Loyola University Chicago, Chicago, IL 60660, USA
Chris Nguyen Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA
Xiaowei Hu Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
Anna V. Mikhaylova Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
Chris Gignoux Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, UC Denver Anschutz Medical Campus, Aurora, CO 80045, USA
Kristin Ardlie Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Kent D. Taylor The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, the Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502, USA
Peter Durda Laboratory for Clinical Biochemistry Research, University of Vermont, Colchester, VT 05446, USA
Yongmei Liu Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA
George Papanicolaou Epidemiology Branch, Division of Cardiovascular Sciences, National Heart, Lung and Blood Institute, Bethesda, MD 20892, USA
Michael H. Cho Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
Stephen S. Rich Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
Jerome I. Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, the Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502, USA
NHLBI TOPMed Consortium Program in Bioinformatics, Loyola University Chicago, Chicago, IL 60660, USA Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA Department of Biostatistics, University of Washington, Seattle, WA 98195, USA Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, UC Denver Anschutz Medical Campus, Aurora, CO 80045, USA Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, the Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA 90502, USA Laboratory for Clinical Biochemistry Research, University of Vermont, Colchester, VT 05446, USA Department of Medicine, Duke University School of Medicine, Durham, NC 27710, USA Epidemiology Branch, Division of Cardiovascular Sciences, National Heart, Lung and Blood Institute, Bethesda, MD 20892, USA Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA Section of Genetic Medicine, University of Chicago, Chicago, IL 60637, USA
Hae Kyung Im Section of Genetic Medicine, University of Chicago, Chicago, IL 60637, USA
Ani Manichaikul Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
Heather E. Wheeler Program in Bioinformatics, Loyola University Chicago, Chicago, IL 60660, USA Department of Biology, Loyola University Chicago, Chicago, IL 60660, USA

Collapse

Ghaffar A, Nyholt DR. Integrating eQTL and GWAS data characterises established and identifies novel migraine risk loci. Hum Genet 2023;142:1113-1137. [PMID: 37245199 PMCID: PMC10449685 DOI: 10.1007/s00439-023-02568-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 05/02/2023] [Indexed: 05/29/2023]

Abstract

Migraine-a painful, throbbing headache disorder-is the most common complex brain disorder, yet its molecular mechanisms remain unclear. Genome-wide association studies (GWAS) have proven successful in identifying migraine risk loci; however, much work remains to identify the causal variants and genes. In this paper, we compared three transcriptome-wide association study (TWAS) imputation models-MASHR, elastic net, and SMultiXcan-to characterise established genome-wide significant (GWS) migraine GWAS risk loci, and to identify putative novel migraine risk gene loci. We compared the standard TWAS approach of analysing 49 GTEx tissues with Bonferroni correction for testing all genes present across all tissues (Bonferroni), to TWAS in five tissues estimated to be relevant to migraine, and TWAS with Bonferroni correction that took into account the correlation between eQTLs within each tissue (Bonferroni-matSpD). Elastic net models performed in all 49 GTEx tissues using Bonferroni-matSpD characterised the highest number of established migraine GWAS risk loci (n = 20) with GWS TWAS genes having colocalisation (PP4 > 0.5) with an eQTL. SMultiXcan in all 49 GTEx tissues identified the highest number of putative novel migraine risk genes (n = 28) with GWS differential expression at 20 non-GWS GWAS loci. Nine of these putative novel migraine risk genes were later found to be at and in linkage disequilibrium with true (GWS) migraine risk loci in a recent, more powerful migraine GWAS. Across all TWAS approaches, a total of 62 putative novel migraine risk genes were identified at 32 independent genomic loci. Of these 32 loci, 21 were true risk loci in the recent, more powerful migraine GWAS. Our results provide important guidance on the selection, use, and utility of imputation-based TWAS approaches to characterise established GWAS risk loci and identify novel risk gene loci.

Collapse

Vysotskiy M, Weiss LA. Combinations of genes at the 16p11.2 and 22q11.2 CNVs contribute to neurobehavioral traits. PLoS Genet 2023;19:e1010780. [PMID: 37267418 DOI: 10.1371/journal.pgen.1010780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 05/09/2023] [Indexed: 06/04/2023] Open

Gao G, Fiorica PN, McClellan J, Barbeira AN, Li JL, Olopade OI, Im HK, Huo D. A joint transcriptome-wide association study across multiple tissues identifies candidate breast cancer susceptibility genes. Am J Hum Genet 2023;110:950-962. [PMID: 37164006 PMCID: PMC10257003 DOI: 10.1016/j.ajhg.2023.04.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 04/14/2023] [Indexed: 05/12/2023] Open

Araujo DS, Nguyen C, Hu X, Mikhaylova AV, Gignoux C, Ardlie K, Taylor KD, Durda P, Liu Y, Papanicolaou G, Cho MH, Rich SS, Rotter JI, Im HK, Manichaikul A, Wheeler HE. Multivariate adaptive shrinkage improves cross-population transcriptome prediction for transcriptome-wide association studies in underrepresented populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.09.527747. [PMID: 36798214 PMCID: PMC9934635 DOI: 10.1101/2023.02.09.527747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]

Affiliation(s)

Daniel S. Araujo Program in Bioinformatics, Loyola University Chicago, Chicago, IL, 60660, USA
Chris Nguyen Department of Biology, Loyola University Chicago, Chicago, IL, 60660, USA
Xiaowei Hu Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA, 22908, USA
Anna V. Mikhaylova Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
Chris Gignoux Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, UC Denver Anschutz Medical Campus, Aurora, CO, 80045, USA
Kristin Ardlie Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
Kent D. Taylor The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
Peter Durda Laboratory for Clinical Biochemistry Research, University of Vermont, Colchester, VT, 05446, USA
Yongmei Liu Department of Medicine, Duke University School of Medicine, Durham, NC, 27710, USA
George Papanicolaou Epidemiology Branch, Division of Cardiovascular Sciences, National Heart, Lung and Blood Institute, Bethesda, MD, 20892, USA
Michael H. Cho Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, 02115, USA
Stephen S. Rich Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA, 22908, USA
Jerome I. Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
NHLBI TOPMed Consortium
Hae Kyung Im Section of Genetic Medicine, The University of Chicago, Chicago, IL, 60637, USA
Ani Manichaikul Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA, 22908, USA
Heather E. Wheeler Program in Bioinformatics, Loyola University Chicago, Chicago, IL, 60660, USA Department of Biology, Loyola University Chicago, Chicago, IL, 60660, USA

Collapse

Durge AR, Shrimankar DD, Sawarkar AD. Heuristic Analysis of Genomic Sequence Processing Models for High Efficiency Prediction: A Statistical Perspective. Curr Genomics 2022;23:299-317. [PMID: 36778194 PMCID: PMC9878859 DOI: 10.2174/1389202923666220927105311] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 11/22/2022] Open

Díez-Villanueva A, Sanz-Pamplona R, Solé X, Cordero D, Crous-Bou M, Guinó E, Lopez-Doriga A, Berenguer A, Aussó S, Paré-Brunet L, Obón-Santacana M, Moratalla-Navarro F, Salazar R, Sanjuan X, Santos C, Biondo S, Diez-Obrero V, Garcia-Serrano A, Alonso MH, Carreras-Torres R, Closa A, Moreno V. COLONOMICS - integrative omics data of one hundred paired normal-tumoral samples from colon cancer patients. Sci Data 2022;9:595. [PMID: 36182938 PMCID: PMC9526730 DOI: 10.1038/s41597-022-01697-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 08/16/2022] [Indexed: 11/29/2022] Open

Affiliation(s)

Anna Díez-Villanueva Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Rebeca Sanz-Pamplona Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Xavier Solé Molecular Biology CORE, Center for Biomedical Diagnostics, Hospital Clínic de Barcelona, 08036, Barcelona, Spain Translational Genomic and Targeted Therapeutics in Solid Tumors, August Pi i Sunyer Biomedical Research Institute (IDIBAPS), 08036, Barcelona, Spain
David Cordero Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Marta Crous-Bou Unit of Nutrition and Cancer, Cancer Epidemiology Research Program, Catalan Institute of Oncology (ICO) - Bellvitge Biomedical Research Institute (IDIBELL). L'Hospitalet de Llobregat, Barcelona, 08908, Spain Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
Elisabet Guinó Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Adriana Lopez-Doriga Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Antoni Berenguer Rheumatology Department - Parc Taulí Research and Innovation Institute (I3PT), Barcelona, Spain
Susanna Aussó TIC Salut Social Foundation. Ministry of Health of Generalitat de Catalunya, Barcelona, Spain
Laia Paré-Brunet Reveal Genomics. S.L., Barcelona, Spain
Mireia Obón-Santacana Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Ferran Moratalla-Navarro Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain
Ramon Salazar Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain Medical Oncology Department. Catalan Institute of Oncology (ICO), Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Oncology (CIBERONC), Madrid, Spain
Xavier Sanjuan Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain Pathology Service, Bellvitge University Hospital (HUB), Hospitalet de Llobregat, Barcelona, Spain
Cristina Santos Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain Medical Oncology Department. Catalan Institute of Oncology (ICO), Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Oncology (CIBERONC), Madrid, Spain
Sebastiano Biondo Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain Digestive Surgery Service, Bellvitge University Hospital (HUB). Hospitalet de Llobregat, Barcelona, Spain
Virginia Diez-Obrero Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain
Ainhoa Garcia-Serrano Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Maria Henar Alonso Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain
Robert Carreras-Torres Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain
Adria Closa The John Curtin School of Medical Research, Australian National University, Canberra, Australia EMBL Australia Partner Laboratory Network at the Australian National University, Canberra, Australia
Víctor Moreno Oncology Data Analytics Program, Catalan Institute of Oncology (ICO). Hospitalet de Llobregat, Barcelona, Spain. Colorectal Cancer Group, ONCOBELL, Bellvitge Biomedical Research Institute (IDIBELL). Hospitalet de Llobregat, Barcelona, Spain. Biomedical Research Centre Network for Epidemiology and Public Health (CIBERESP), Madrid, Spain. Department of Clinical Sciences, Faculty of Medicine and health Sciences and Universitat de Barcelona Institute of Complex Systems (UBICS), University of Barcelona, Barcelona, Spain.

Collapse

Schubert R, Geoffroy E, Gregga I, Mulford AJ, Aguet F, Ardlie K, Gerszten R, Clish C, Van Den Berg D, Taylor KD, Durda P, Johnson WC, Cornell E, Guo X, Liu Y, Tracy R, Conomos M, Blackwell T, Papanicolaou G, Lappalainen T, Mikhaylova AV, Thornton TA, Cho MH, Gignoux CR, Lange L, Lange E, Rich SS, Rotter JI, Manichaikul A, Im HK, Wheeler HE. Protein prediction for trait mapping in diverse populations. PLoS One 2022;17:e0264341. [PMID: 35202437 PMCID: PMC8870552 DOI: 10.1371/journal.pone.0264341] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 02/08/2022] [Indexed: 11/18/2022] Open

Abstract

Genetically regulated gene expression has helped elucidate the biological mechanisms underlying complex traits. Improved high-throughput technology allows similar interrogation of the genetically regulated proteome for understanding complex trait mechanisms. Here, we used the Trans-omics for Precision Medicine (TOPMed) Multi-omics pilot study, which comprises data from Multi-Ethnic Study of Atherosclerosis (MESA), to optimize genetic predictors of the plasma proteome for genetically regulated proteome-wide association studies (PWAS) in diverse populations. We built predictive models for protein abundances using data collected in TOPMed MESA, for which we have measured 1,305 proteins by a SOMAscan assay. We compared predictive models built via elastic net regression to models integrating posterior inclusion probabilities estimated by fine-mapping SNPs prior to elastic net. In order to investigate the transferability of predictive models across ancestries, we built protein prediction models in all four of the TOPMed MESA populations, African American (n = 183), Chinese (n = 71), European (n = 416), and Hispanic/Latino (n = 301), as well as in all populations combined. As expected, fine-mapping produced more significant protein prediction models, especially in African ancestries populations, potentially increasing opportunity for discovery. When we tested our TOPMed MESA models in the independent European INTERVAL study, fine-mapping improved cross-ancestries prediction for some proteins. Using GWAS summary statistics from the Population Architecture using Genomics and Epidemiology (PAGE) study, which comprises ∼50,000 Hispanic/Latinos, African Americans, Asians, Native Hawaiians, and Native Americans, we applied S-PrediXcan to perform PWAS for 28 complex traits. The most protein-trait associations were discovered, colocalized, and replicated in large independent GWAS using proteome prediction model training populations with similar ancestries to PAGE. At current training population sample sizes, performance between baseline and fine-mapped protein prediction models in PWAS was similar, highlighting the utility of elastic net. Our predictive models in diverse populations are publicly available for use in proteome mapping methods at https://doi.org/10.5281/zenodo.4837327.

Collapse

Affiliation(s)

Ryan Schubert Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, United States of America Department of Biology, Loyola University Chicago, Chicago, IL, United States of America Program in Bioinformatics, Loyola University Chicago, Chicago, IL, United States of America
Elyse Geoffroy Program in Bioinformatics, Loyola University Chicago, Chicago, IL, United States of America
Isabelle Gregga Department of Biology, Loyola University Chicago, Chicago, IL, United States of America
Ashley J. Mulford Department of Biology, Loyola University Chicago, Chicago, IL, United States of America Program in Bioinformatics, Loyola University Chicago, Chicago, IL, United States of America
Francois Aguet Broad Institute, Cambridge, MA, United States of America
Kristin Ardlie Broad Institute, Cambridge, MA, United States of America
Robert Gerszten Beth Israel Deaconess Medical Center, Boston, MA, United States of America
Clary Clish Broad Institute, Cambridge, MA, United States of America
David Van Den Berg University of Southern California, Los Angeles, CA, United States of America
Kent D. Taylor The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, United States of America
Peter Durda Laboratory for Clinical Biochemistry Research, University of Vermont, Burlington, VT, United States of America
W. Craig Johnson Collaborative Health Studies Coordinating Center, University of Washington, Seattle, WA, United States of America
Elaine Cornell Laboratory for Clinical Biochemistry Research, University of Vermont, Burlington, VT, United States of America
Xiuqing Guo The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, United States of America
Yongmei Liu Department of Medicine, Duke University School of Medicine, Durham, NC, United States of America
Russell Tracy Laboratory for Clinical Biochemistry Research, University of Vermont, Burlington, VT, United States of America
Matthew Conomos Department of Biostatistics, University of Washington, Seattle, WA, United States of America
Tom Blackwell Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
George Papanicolaou Epidemiology Branch, National Heart, Lung and Blood Institute, Bethesda, MD, United States of America
Tuuli Lappalainen New York Genome Center and Department of Systems Biology, Columbia University, New York, NY United States of America
Anna V. Mikhaylova Department of Biostatistics, University of Washington, Seattle, WA, United States of America
Timothy A. Thornton Department of Biostatistics, University of Washington, Seattle, WA, United States of America
Michael H. Cho Channing Division of Network Medicine, Brigham and Women’s Hospital, Boston, MA, United States of America
Christopher R. Gignoux Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
Leslie Lange Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
Ethan Lange Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
Stephen S. Rich Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States of America
Jerome I. Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, United States of America
NHLBI TOPMed Consortium
Ani Manichaikul Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States of America
Hae Kyung Im Section of Genetic Medicine, The University of Chicago, Chicago, IL, United States of America
Heather E. Wheeler Department of Biology, Loyola University Chicago, Chicago, IL, United States of America Program in Bioinformatics, Loyola University Chicago, Chicago, IL, United States of America * E-mail:

Collapse

Liang Y, Pividori M, Manichaikul A, Palmer AA, Cox NJ, Wheeler HE, Im HK. Polygenic transcriptome risk scores (PTRS) can improve portability of polygenic risk scores across ancestries. Genome Biol 2022;23:23. [PMID: 35027082 PMCID: PMC8759285 DOI: 10.1186/s13059-021-02591-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/27/2021] [Indexed: 12/17/2022] Open

Mahoney E, Janve V, Hohman TJ, Dumitrescu L. Evaluation of Sex-Aware PrediXcan Models for Predicting Gene Expression. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2022;27:361-372. [PMID: 34890163 PMCID: PMC8924937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Abstract

Gene-based methods such as PrediXcan use expression quantitative trait loci to build tissue-specific gene expression models when only genetic data is available. There are known sex differences in tissue-specific gene expression and in the genetic architecture of gene expression, but such differences have not been incorporated into predicted gene expression models to date. We built sex-aware PrediXcan models using whole blood transcriptomic data from the Genotype-Tissue Expression (GTEx) project (195 females and 371 males) and evaluated their performance in an independent dataset. Specifically, PrediXcan models were built following the method described in Gamazon et al. 2015, but we included both whole-sample and sex-specific models. Validation was evaluated leveraging lymphoblast RNA sequencing data from the EUR cohort of the 1000 Genomes Project (178 females and 171 males). Correlations (R2) between observed and predicted expression were evaluated in 5,283 autosomal genes to determine performance of models. In sum, we successfully predicted 1,149 genes in males and 623 in females, while 3,511 genes appeared to be not sex-specific. Of the sex-specific genes, 15% (189 genes in males and 73 genes in females) exhibited higher R2 in sex-specific models compared to whole-sample models, although the overall gain in predictive power was generally minimal and well within measurement error. Nevertheless, two female-specific genes and six male-specific genes showed significantly better prediction when using the sex-specific weights versus the whole-sample weights; furthermore, several of these genes play a role in mitochondrial metabolism, which is known to be influenced by sex hormones. Taken together, these results support previous reports of the small contribution of genetic architecture to sex-specific expression. Still, sex-aware PrediXcan models were able to provide robust sex-specific prediction signals. Future studies exploring the contribution of the X chromosome and tissue specificity on sex-specific genetically regulated expression will clarify the utility of this method.

Collapse

Díez-Obrero V, Moratalla-Navarro F, Ibáñez-Sanz G, Guardiola J, Rodríguez-Moranta F, Obón-Santacana M, Díez-Villanueva A, Dampier CH, Devall M, Carreras-Torres R, Casey G, Moreno V. Transcriptome-Wide Association Study for Inflammatory Bowel Disease Reveals Novel Candidate Susceptibility Genes in Specific Colon Subsites and Tissue Categories. J Crohns Colitis 2021;16:275-285. [PMID: 34286847 PMCID: PMC8864630 DOI: 10.1093/ecco-jcc/jjab131] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Abstract

BACKGROUND AND AIMS

Genome-wide association studies [GWAS] for inflammatory bowel disease [IBD] have identified 240 risk variants. However, the benefit of understanding the genetic architecture of IBD remains to be exploited. Transcriptome-wide association studies [TWAS] associate gene expression with genetic susceptibility to disease, providing functional insight into risk loci. In this study, we integrate relevant datasets for IBD and perform a TWAS to nominate novel genes implicated in IBD genetic susceptibility.

METHODS

We applied elastic net regression to generate gene expression prediction models for the University of Barcelona and University of Virginia RNA sequencing project [BarcUVa-Seq] and correlated expression and disease association research [CEDAR] datasets. Together with Genotype-Tissue Expression project [GTEx] data, and GWAS results from about 60 000 individuals, we employed Summary-PrediXcan and Summary-MultiXcan for single and joint analyses of TWAS results, respectively.

RESULTS

BarcUVa-Seq TWAS revealed 39 novel genes whose expression in the colon is associated with IBD genetic susceptibility. They included expression markers for specific colon cell types. TWAS meta-analysis including all tissues/cell types provided 186 novel candidate susceptibility genes. Additionally, we identified 78 novel susceptibility genes whose expression is associated with IBD exclusively in immune (N = 19), epithelial (N = 25), mesenchymal (N = 22) and neural (N = 12) tissue categories. Associated genes were involved in relevant molecular pathways, including pathways related to known IBD therapeutics, such as tumour necrosis factor signalling.

CONCLUSION

These findings provide insight into tissue-specific molecular processes underlying IBD genetic susceptibility. Associated genes could be candidate targets for new therapeutics and should be prioritized in functional studies.

Collapse

Affiliation(s)

Virginia Díez-Obrero Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain,Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain
Ferran Moratalla-Navarro Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain,Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain
Gemma Ibáñez-Sanz Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain,Gastroenterology Department, Bellvitge University Hospital, L’Hospitalet de Llobregat, Spain
Jordi Guardiola Gastroenterology Department, Bellvitge University Hospital, L’Hospitalet de Llobregat, Spain
Francisco Rodríguez-Moranta Gastroenterology Department, Bellvitge University Hospital, L’Hospitalet de Llobregat, Spain
Mireia Obón-Santacana Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
Anna Díez-Villanueva Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
Christopher Heaton Dampier Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA,Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Matthew Devall Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA,Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Robert Carreras-Torres Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain
Graham Casey Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA,Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
Victor Moreno Oncology Data Analytics Program, Catalan Institute of Oncology (ICO), L’Hospitalet de Llobregat, Barcelona, Spain,ONCOBELL Program, Bellvitge Biomedical Research Institute (IDIBELL), L’Hospitalet de Llobregat, Barcelona, Spain,Consortium for Biomedical Research in Epidemiology and Public Health (CIBERESP), Madrid, Spain,Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain,Corresponding author: Dr Victor Moreno, Catalan Institute of Oncology, Oncology Data Analytics Program, Hospital Duran i Reynals, Gran Via de l’Hospitalet, 199–203, 08908 L’Hospitalet de Llobregat (Barcelona) Spain. Tel: +34 932 607 434;

Collapse

Feng H, Mancuso N, Pasaniuc B, Kraft P. Multitrait transcriptome-wide association study (TWAS) tests. Genet Epidemiol 2021;45:563-576. [PMID: 34082479 DOI: 10.1002/gepi.22391] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 03/26/2021] [Accepted: 04/05/2021] [Indexed: 12/19/2022]

Okoro PC, Schubert R, Guo X, Johnson WC, Rotter JI, Hoeschele I, Liu Y, Im HK, Luke A, Dugas LR, Wheeler HE. Transcriptome prediction performance across machine learning models and diverse ancestries. HGG ADVANCES 2021;2:100019. [PMID: 33937878 PMCID: PMC8087249 DOI: 10.1016/j.xhgg.2020.100019] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 12/29/2020] [Indexed: 11/18/2022] Open

Lu H, Zhang J, Jiang Z, Zhang M, Wang T, Zhao H, Zeng P. Detection of Genetic Overlap Between Rheumatoid Arthritis and Systemic Lupus Erythematosus Using GWAS Summary Statistics. Front Genet 2021;12:656545. [PMID: 33815486 PMCID: PMC8012913 DOI: 10.3389/fgene.2021.656545] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 03/01/2021] [Indexed: 01/04/2023] Open

Abstract

Background

Clinical and epidemiological studies have suggested systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA) are comorbidities and common genetic etiologies can partly explain such coexistence. However, shared genetic determinations underlying the two diseases remain largely unknown.

Methods

Our analysis relied on summary statistics available from genome-wide association studies of SLE (N = 23,210) and RA (N = 58,284). We first evaluated the genetic correlation between RA and SLE through the linkage disequilibrium score regression (LDSC). Then, we performed a multiple-tissue eQTL (expression quantitative trait loci) weighted integrative analysis for each of the two diseases and aggregated association evidence across these tissues via the recently proposed harmonic mean P-value (HMP) combination strategy, which can produce a single well-calibrated P-value for correlated test statistics. Afterwards, we conducted the pleiotropy-informed association using conjunction conditional FDR (ccFDR) to identify potential pleiotropic genes associated with both RA and SLE.

Results

We found there existed a significant positive genetic correlation (r_g = 0.404, P = 6.01E-10) via LDSC between RA and SLE. Based on the multiple-tissue eQTL weighted integrative analysis and the HMP combination across various tissues, we discovered 14 potential pleiotropic genes by ccFDR, among which four were likely newly novel genes (i.e., INPP5B, OR5K2, RP11-2C24.5, and CTD-3105H18.4). The SNP effect sizes of these pleiotropic genes were typically positively dependent, with an average correlation of 0.579. Functionally, these genes were implicated in multiple auto-immune relevant pathways such as inositol phosphate metabolic process, membrane and glucagon signaling pathway.

Conclusion

This study reveals common genetic components between RA and SLE and provides candidate associated loci for understanding of molecular mechanism underlying the comorbidity of the two diseases.

Collapse

Barbeira AN, Bonazzola R, Gamazon ER, Liang Y, Park Y, Kim-Hellmuth S, Wang G, Jiang Z, Zhou D, Hormozdiari F, Liu B, Rao A, Hamel AR, Pividori MD, Aguet F, Bastarache L, Jordan DM, Verbanck M, Do R, Stephens M, Ardlie K, McCarthy M, Montgomery SB, Segrè AV, Brown CD, Lappalainen T, Wen X, Im HK. Exploiting the GTEx resources to decipher the mechanisms at GWAS loci. Genome Biol 2021;22:49. [PMID: 33499903 PMCID: PMC7836161 DOI: 10.1186/s13059-020-02252-4] [Citation(s) in RCA: 138] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 12/18/2020] [Indexed: 12/12/2022] Open

Affiliation(s)

Alvaro N Barbeira Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
Rodrigo Bonazzola Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
Eric R Gamazon Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA Data Science Institute, Vanderbilt University, Nashville, TN, USA Clare Hall, University of Cambridge, Cambridge, UK MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
Yanyu Liang Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
YoSon Park Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
Sarah Kim-Hellmuth Statistical Genetics, Max Planck Institute of Psychiatry, Munich, Germany New York Genome Center, New York, NY, USA Department of Systems Biology, Columbia University, New York, NY, USA
Gao Wang Department of Human Genetics, University of Chicago, Chicago, IL, USA
Zhuoxun Jiang Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
Dan Zhou Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
Farhad Hormozdiari The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Boxiang Liu Department of Biology, Stanford University, Stanford, 94305, CA, USA
Abhiram Rao Department of Biology, Stanford University, Stanford, 94305, CA, USA
Andrew R Hamel The Broad Institute of MIT and Harvard, Cambridge, MA, USA Ocular Genomics Institute, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA, USA
Milton D Pividori Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA
François Aguet The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Lisa Bastarache Department of Biomedical Informatics, Department of Medicine, Vanderbilt University, Nashville, TN, USA Center for Human Genetics Research, Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA
Daniel M Jordan Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Marie Verbanck Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA Université de Paris - EA 7537 BIOSTM, Paris, France
Ron Do Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Matthew Stephens Department of Human Genetics, University of Chicago, Chicago, IL, USA
Kristin Ardlie The Broad Institute of MIT and Harvard, Cambridge, MA, USA
Mark McCarthy University of Oxford, Oxford, UK
Stephen B Montgomery Department of Genetics, Stanford University, Stanford, CA, USA Department of Pathology, Stanford University, Stanford, CA, USA
Ayellet V Segrè The Broad Institute of MIT and Harvard, Cambridge, MA, USA Ocular Genomics Institute, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA, USA
Christopher D Brown Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
Tuuli Lappalainen New York Genome Center, New York, NY, USA Department of Systems Biology, Columbia University, New York, NY, USA
Xiaoquan Wen Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
Hae Kyung Im Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL, USA.

Collapse

Barbeira AN, Melia OJ, Liang Y, Bonazzola R, Wang G, Wheeler HE, Aguet F, Ardlie KG, Wen X, Im HK. Fine-mapping and QTL tissue-sharing information improves the reliability of causal gene identification. Genet Epidemiol 2020;44:854-867. [PMID: 32964524 PMCID: PMC7693040 DOI: 10.1002/gepi.22346] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 06/26/2020] [Accepted: 06/26/2020] [Indexed: 01/01/2023]