51
|
Acharya S, Liao S, Jung WJ, Kang YS, Moghaddam VA, Feitosa MF, Wojczynski MK, Lin S, Anema JA, Schwander K, Connell JO, Province MA, Brent MR. A methodology for gene level omics-WAS integration identifies genes influencing traits associated with cardiovascular risks: the Long Life Family Study. Hum Genet 2024; 143:1241-1252. [PMID: 39276247 PMCID: PMC11485042 DOI: 10.1007/s00439-024-02701-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 08/15/2024] [Indexed: 09/16/2024]
Abstract
The Long Life Family Study (LLFS) enrolled 4953 participants in 539 pedigrees displaying exceptional longevity. To identify genetic mechanisms that affect cardiovascular risks in the LLFS population, we developed a multi-omics integration pipeline and applied it to 11 traits associated with cardiovascular risks. Using our pipeline, we aggregated gene-level statistics from rare-variant analysis, GWAS, and gene expression-trait association by Correlated Meta-Analysis (CMA). Across all traits, CMA identified 64 significant genes after Bonferroni correction (p ≤ 2.8 × 10-7), 29 of which replicated in the Framingham Heart Study (FHS) cohort. Notably, 20 of the 29 replicated genes do not have a previously known trait-associated variant in the GWAS Catalog within 50 kb. Thirteen modules in Protein-Protein Interaction (PPI) networks are significantly enriched in genes with low meta-analysis p-values for at least one trait, three of which are replicated in the FHS cohort. The functional annotation of genes in these modules showed a significant over-representation of trait-related biological processes including sterol transport, protein-lipid complex remodeling, and immune response regulation. Among major findings, our results suggest a role of triglyceride-associated and mast-cell functional genes FCER1A, MS4A2, GATA2, HDC, and HRH4 in atherosclerosis risks. Our findings also suggest that lower expression of ATG2A, a gene we found to be associated with BMI, may be both a cause and consequence of obesity. Finally, our results suggest that ENPP3 may play an intermediary role in triglyceride-induced inflammation. Our pipeline is freely available and implemented in the Nextflow workflow language, making it easily runnable on any compute platform ( https://nf-co.re/omicsgenetraitassociation ).
Collapse
Affiliation(s)
- Sandeep Acharya
- Division of Computational and Data Sciences, Washington University, St Louis, MO, USA
| | - Shu Liao
- Department of Computer Science and Engineering, Washington University, St Louis, MO, USA
| | - Wooseok J Jung
- Department of Computer Science and Engineering, Washington University, St Louis, MO, USA
| | - Yu S Kang
- Department of Computer Science and Engineering, Washington University, St Louis, MO, USA
| | - Vaha Akbary Moghaddam
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Mary F Feitosa
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Mary K Wojczynski
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Shiow Lin
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Jason A Anema
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Karen Schwander
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Jeff O Connell
- Department of Medicine, University of Maryland, Baltimore, MD, USA
| | - Michael A Province
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO, USA
| | - Michael R Brent
- Department of Computer Science and Engineering, Washington University, St Louis, MO, USA.
| |
Collapse
|
52
|
Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. PLoS Biol 2024; 22:e3002847. [PMID: 39383205 PMCID: PMC11493298 DOI: 10.1371/journal.pbio.3002847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 10/21/2024] [Accepted: 09/17/2024] [Indexed: 10/11/2024] Open
Abstract
In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these 2 fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we lay out a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., genome-wide association studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur analytically and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study, we re-examine an analysis testing for coevolution of expression levels between genes across a fungal phylogeny and show that including eigenvectors of the covariance matrix as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.
Collapse
Affiliation(s)
- Joshua G. Schraiber
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
- Department of Biological Sciences, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
53
|
Clarke B, Holtkamp E, Öztürk H, Mück M, Wahlberg M, Meyer K, Munzlinger F, Brechtmann F, Hölzlwimmer FR, Lindner J, Chen Z, Gagneur J, Stegle O. Integration of variant annotations using deep set networks boosts rare variant association testing. Nat Genet 2024; 56:2271-2280. [PMID: 39322779 PMCID: PMC11525182 DOI: 10.1038/s41588-024-01919-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 08/20/2024] [Indexed: 09/27/2024]
Abstract
Rare genetic variants can have strong effects on phenotypes, yet accounting for rare variants in genetic analyses is statistically challenging due to the limited number of allele carriers and the burden of multiple testing. While rich variant annotations promise to enable well-powered rare variant association tests, methods integrating variant annotations in a data-driven manner are lacking. Here we propose deep rare variant association testing (DeepRVAT), a model based on set neural networks that learns a trait-agnostic gene impairment score from rare variant annotations and phenotypes, enabling both gene discovery and trait prediction. On 34 quantitative and 63 binary traits, using whole-exome-sequencing data from UK Biobank, we find that DeepRVAT yields substantial gains in gene discoveries and improved detection of individuals at high genetic risk. Finally, we demonstrate how DeepRVAT enables calibrated and computationally efficient rare variant tests at biobank scale, aiding the discovery of genetic risk factors for human disease traits.
Collapse
Affiliation(s)
- Brian Clarke
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany.
| | - Eva Holtkamp
- TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Helmholtz Association-Munich School for Data Science (MUDS), Munich, Germany
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany
| | - Hakime Öztürk
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Marcel Mück
- AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Magnus Wahlberg
- AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Kayla Meyer
- AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Felix Munzlinger
- AI Health Innovation Cluster, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Felix Brechtmann
- TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Florian R Hölzlwimmer
- TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Jonas Lindner
- TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Zhifen Chen
- Department of Cardiology, Deutsches Herzzentrum München, Technical University Munich, Munich, Germany
- Deutsches Zentrum für Herz- und Kreislaufforschung (DZHK), Partner Site Munich Heart Alliance, Munich, Germany
| | - Julien Gagneur
- TUM School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany.
- Munich Center for Machine Learning, Munich, Germany.
- Institute of Human Genetics, School of Medicine and Health, Technical University of Munich, Munich, Germany.
| | - Oliver Stegle
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK.
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, UK.
| |
Collapse
|
54
|
Correia Marques M, Rubin D, Shuldiner EG, Datta M, Schmitz E, Gutierrez Cruz G, Patt A, Bennett E, Grom A, Foell D, Gattorno M, Bohnsack J, Yeung RSM, Prahalad S, Mellins E, Anton J, Len CA, Oliveira S, Woo P, Ozen S, Deng Z, Ombrello MJ. Enrichment of Rare Variants of Hemophagocytic Lymphohistiocytosis Genes in Systemic Juvenile Idiopathic Arthritis. Arthritis Rheumatol 2024; 76:1566-1572. [PMID: 38937141 DOI: 10.1002/art.42938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 05/23/2024] [Accepted: 06/12/2024] [Indexed: 06/29/2024]
Abstract
OBJECTIVE Our objective was to evaluate whether there is an enrichment of rare variants in familial hemophagocytic lymphohistiocytosis (HLH)-associated genes among patients with systemic juvenile idiopathic arthritis (sJIA) with or without macrophage activation syndrome (MAS). METHODS Targeted sequencing of HLH genes (LYST, PRF1, RAB27A, STX11, STXBP2, UNC13D) was performed in patients with sJIA from an established cohort. Sequence data from control participants were obtained in silico (database of Genotypes and Phenotypes: phs000280.v8.p2). Rare variant association testing (RVT) was performed with sequence kernel association test package. Significance was defined as P < 0.05 after 100,000 permutations. RESULTS Sequencing data from 524 sJIA cases were jointly called and harmonized with exome-derived target data from 3,000 controls. Quality control operations produced a set of 480 cases and 2,924 ancestrally matched control participants. RVT of cases and controls revealed a significant association with rare protein-altering variants (minor allele frequency [MAF] < 0.01) of STXBP2 (P = 0.020) and ultrarare variants (MAF < 0.001) of STXBP2 (P = 0.006) and UNC13D (P = 0.046). A subanalysis of 32 cases with known MAS and 90 without revealed a significant difference in the distribution of rare UNC13D variants (P = 0.0047) between the groups. Additionally, patients with sJIA more often carried two or more HLH variants than did controls (P = 0.007), driven largely by digenic combinations involving LYST. CONCLUSION We identified an enrichment of rare HLH variants in patients with sJIA compared with controls, driven by STXBP2 and UNC13D. Biallelic variation in HLH genes was associated with sJIA, driven by LYST. Only UNC13D displayed enrichment in patients with MAS. This suggests that HLH variants may contribute to the pathophysiology of sJIA, even without MAS.
Collapse
Affiliation(s)
- Mariana Correia Marques
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Danielle Rubin
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Emily G Shuldiner
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Mallika Datta
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Elizabeth Schmitz
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Gustavo Gutierrez Cruz
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Andrew Patt
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Elizabeth Bennett
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Alexei Grom
- Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Dirk Foell
- University Hospital Muenster, Muenster, Germany
| | | | - John Bohnsack
- University of Utah Eccles School of Medicine, Salt Lake City
| | | | - Sampath Prahalad
- Emory University School of Medicine and Children's Healthcare of Atlanta, Atlanta, Georgia
| | | | - Jordi Anton
- Hospital Sant Joan de Déu, Universitat de Barcelona, Barcelona, Spain
| | | | - Sheila Oliveira
- Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Patricia Woo
- University College London, London, United Kingdom
| | - Seza Ozen
- Hacettepe University, Ankara, Turkey
| | - Zuoming Deng
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| | - Michael J Ombrello
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, NIH, Bethesda, Maryland
| |
Collapse
|
55
|
Das D, Khor ES, Jiang F, He J, Kawakami Y, Wainwright L, Hollinger J, Geiger J, Liu H, Meng F, Porter GA, Jin Z, Murphy P, Yao P. Loss-of-function of RNA-binding protein PRRC2B causes translational defects and congenital cardiovascular malformation. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.09.26.24313895. [PMID: 39398999 PMCID: PMC11469349 DOI: 10.1101/2024.09.26.24313895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Alternative splicing generates variant forms of proteins for a given gene and accounts for functional redundancy or diversification. A novel RNA-binding protein, Pro-rich Coiled-coil Containing Protein 2B (PRRC2B), has been reported by multiple laboratories to mediate uORF-dependent and independent regulation of translation initiation required for cell cycle progression and proliferation. We identified two alternative spliced isoforms in human and mouse hearts and HEK293T cells, full-length (FL) and exon 16-excluded isoform ΔE16. A congenital heart disease-associated human mutation-mimicry knock-in of the equivalent variant in the mouse genome leads to the depletion of the full-length Prrc2b mRNA but not the alternative spliced truncated form ΔE16, does not cause any apparent structural or functional disorders. In contrast, global genetic inactivation of the PRRC2B gene in the mouse genome, nullifying both mRNA isoforms, caused patent ductus arteriosus (PDA) and neonatal lethality in mice. Bulk and single nucleus transcriptome profiling analyses of embryonic mouse hearts demonstrated a significant overall downregulation of multiple smooth muscle-specific genes in Prrc2b mutant mice resulting from reduced smooth muscle cell number. Integrated analysis of proteomic changes in Prrc2b null mouse embryonic hearts and polysome-seq and RNA-seq multi-omics analysis in human HEK293T cells uncover conserved PRRC2B-regulated target mRNAs that encode essential factors required for cardiac and vascular development. Our findings reveal the connection between alternative splicing regulation of PRRC2B, PRRC2B-mediated translational control, and congenital cardiovascular development and disorder. This study may shed light on the significance of PRRC2B in human cardiovascular disease diagnosis and treatment.
Collapse
Affiliation(s)
- Debojyoti Das
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Eng-Soon Khor
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Feng Jiang
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
- Department of Biochemistry & Biophysics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - Jiali He
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Yui Kawakami
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Lindsey Wainwright
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
- Department of Biochemistry & Biophysics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - Jared Hollinger
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Joshua Geiger
- Department of Vascular Surgery, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - Huan Liu
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Fanju Meng
- Department of Biomedical Genetics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - George A. Porter
- Department of Pediatrics, Medicine, and Pharmacology and Physiology, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - Zhenggen Jin
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
| | - Patrick Murphy
- Department of Biomedical Genetics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| | - Peng Yao
- Aab Cardiovascular Research Institute, Department of Medicine, University of Rochester School of Medicine & Dentistry, Rochester, NY 14642
- Department of Biochemistry & Biophysics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
- The Center for RNA Biology, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
- The Center for Biomedical Informatics, University of Rochester School of Medicine & Dentistry, Rochester, New York 14642
| |
Collapse
|
56
|
Tseng YP, Chang YS, Mekala VR, Liu TY, Chang JG, Shieh GS. Whole-genome sequencing reveals rare variants associated with gout in Taiwanese males. Front Genet 2024; 15:1423714. [PMID: 39385933 PMCID: PMC11462091 DOI: 10.3389/fgene.2024.1423714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 08/28/2024] [Indexed: 10/12/2024] Open
Abstract
To identify rare variants (RVs) of gout, we sequenced the whole genomes of 321 male gout patients and combined these with those of 64 male gout patients and 682 normal controls at Taiwan Biobank. We performed ACAT-O to identify 682 significant RVs (p < 3.8 × 10-8) clustered on chromosomes 1, 7, 10, 16, and 18. To prioritize causal variants effectively, we sifted them by Combined Annotation-Dependent Depletion score >10 or |effect size| ≥ 1.5 for those without CADD scores. In particular, to the best of our knowledge, we identified the rare variants rs559954634, rs186763678, and 13-85340782-G-A for the first time to be associated with gout in Taiwanese males. Importantly, the RV rs559954634 positively affects gout, and its neighboring gene NPHS2 is involved in serum urate and expressed in kidney tissues. The kidneys play a major role in regulating uric acid levels. This suggests that rs559954634 may be involved in gout. Furthermore, rs186763678 is in the intron of NFIA that interacts with SLC2A9, which has the most significant effect on serum urate. Note that gene-gene interaction NFIA-SLC2A9 is significantly associated with serum urate in the Italian MICROS population and a Croatian population. Moreover, 13-85340782-G-A significantly affects gout susceptibility (odds ratio 6.0; P = 0.038). The >1% carrier frequencies of these potentially pathogenic (protective) RVs in cases (controls) suggest the revealed associations may be true; these RVs deserve further studies for the mechanism. Finally, multivariate logistic regression analysis shows that the rare variants rs559954634 and 13-85340782-G-A jointly are significantly associated with gout susceptibility.
Collapse
Affiliation(s)
- Yu-Ping Tseng
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| | - Ya-Sian Chang
- Department of Pathology, Chung Shan Medical University Hospital, Taichung, Taiwan
| | | | - Ting-Yuan Liu
- Department of Medical Research, China Medical University Hospital, Taichung, Taiwan
| | - Jan-Gowth Chang
- Department of Laboratory Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Grace S. Shieh
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
- Bioinformatics Program, Taiwan International Graduate Program, Academia Sinica, Taipei, Taiwan
- Data Science Degree Program, Academia Sinica and National Taiwan University, Taipei, Taiwan
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, Taiwan
| |
Collapse
|
57
|
Hou T, Shen X, Zhang S, Liang M, Chen L, Lu Q. AIGen: an artificial intelligence software for complex genetic data analysis. Brief Bioinform 2024; 25:bbae566. [PMID: 39550221 PMCID: PMC11568876 DOI: 10.1093/bib/bbae566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 09/12/2024] [Accepted: 11/11/2024] [Indexed: 11/18/2024] Open
Abstract
The recent development of artificial intelligence (AI) technology, especially the advance of deep neural network (DNN) technology, has revolutionized many fields. While DNN plays a central role in modern AI technology, it has rarely been used in genetic data analysis due to analytical and computational challenges brought by high-dimensional genetic data and an increasing number of samples. To facilitate the use of AI in genetic data analysis, we developed a C++ package, AIGen, based on two newly developed neural networks (i.e. kernel neural networks and functional neural networks) that are capable of modeling complex genotype-phenotype relationships (e.g. interactions) while providing robust performance against high-dimensional genetic data. Moreover, computationally efficient algorithms (e.g. a minimum norm quadratic unbiased estimation approach and batch training) are implemented in the package to accelerate the computation, making them computationally efficient for analyzing large-scale datasets with thousands or even millions of samples. By applying AIGen to the UK Biobank dataset, we demonstrate that it can efficiently analyze large-scale genetic data, attain improved accuracy, and maintain robust performance. Availability: AIGen is developed in C++ and its source code, along with reference libraries, is publicly accessible on GitHub at https://github.com/TingtHou/AIGen.
Collapse
Affiliation(s)
- Tingting Hou
- Department of Experimental Statistics, Louisiana State University, 45 Martin D. Woodin Hall, Baton Rouge, LA 70802, United States
| | - Xiaoxi Shen
- Department of Mathematics, Texas State University, 601 University Drive San Marcos, TX 78666, United States
| | - Shan Zhang
- Department of Biostatistics, University of Florida, 2004 Mowry Road, Gainesville, FL 32611, United States
| | - Muxuan Liang
- Department of Biostatistics, University of Florida, 2004 Mowry Road, Gainesville, FL 32611, United States
| | - Li Chen
- Department of Biostatistics, University of Florida, 2004 Mowry Road, Gainesville, FL 32611, United States
| | - Qing Lu
- Department of Biostatistics, University of Florida, 2004 Mowry Road, Gainesville, FL 32611, United States
| |
Collapse
|
58
|
Svishcheva GR, Belonogova NM, Kirichenko AV, Tsepilov YA, Axenovich TI. A New Method for Conditional Gene-Based Analysis Effectively Accounts for the Regional Polygenic Background. Genes (Basel) 2024; 15:1174. [PMID: 39336765 PMCID: PMC11431718 DOI: 10.3390/genes15091174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 08/27/2024] [Accepted: 09/05/2024] [Indexed: 09/30/2024] Open
Abstract
Gene-based association analysis is a powerful tool for identifying genes that explain trait variability. An essential step of this analysis is a conditional analysis. It aims to eliminate the influence of SNPs outside the gene, which are in linkage disequilibrium with intragenic SNPs. The popular conditional analysis method, GCTA-COJO, accounts for the influence of several top independently associated SNPs outside the gene, correcting the z statistics for intragenic SNPs. We suggest a new TauCOR method for conditional gene-based analysis using summary statistics. This method accounts the influence of the full regional polygenic background, correcting the genotype correlations between intragenic SNPs. As a result, the distribution of z statistics for intragenic SNPs becomes conditionally independent of distribution for extragenic SNPs. TauCOR is compatible with any gene-based association test. TauCOR was tested on summary statistics simulated under different scenarios and on real summary statistics for a 'gold standard' gene list from the Open Targets Genetics project. TauCOR proved to be effective in all modelling scenarios and on real data. The TauCOR's strategy showed comparable sensitivity and higher specificity and accuracy than GCTA-COJO on both simulated and real data. The method can be successfully used to improve the effectiveness of gene-based association analyses.
Collapse
Affiliation(s)
- Gulnara R Svishcheva
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Ave. Lavrentiev, 10, 630090 Novosibirsk, Russia
- Institute of General Genetics, Russian Academy of Sciences, Gubkin St. 3, 119311 Moscow, Russia
| | - Nadezhda M Belonogova
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Ave. Lavrentiev, 10, 630090 Novosibirsk, Russia
| | - Anatoly V Kirichenko
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Ave. Lavrentiev, 10, 630090 Novosibirsk, Russia
| | - Yakov A Tsepilov
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Ave. Lavrentiev, 10, 630090 Novosibirsk, Russia
- Wellcome Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1RQ, UK
| | - Tatiana I Axenovich
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Ave. Lavrentiev, 10, 630090 Novosibirsk, Russia
| |
Collapse
|
59
|
Kontou PI, Bagos PG. The goldmine of GWAS summary statistics: a systematic review of methods and tools. BioData Min 2024; 17:31. [PMID: 39238044 PMCID: PMC11375927 DOI: 10.1186/s13040-024-00385-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/27/2024] [Indexed: 09/07/2024] Open
Abstract
Genome-wide association studies (GWAS) have revolutionized our understanding of the genetic architecture of complex traits and diseases. GWAS summary statistics have become essential tools for various genetic analyses, including meta-analysis, fine-mapping, and risk prediction. However, the increasing number of GWAS summary statistics and the diversity of software tools available for their analysis can make it challenging for researchers to select the most appropriate tools for their specific needs. This systematic review aims to provide a comprehensive overview of the currently available software tools and databases for GWAS summary statistics analysis. We conducted a comprehensive literature search to identify relevant software tools and databases. We categorized the tools and databases by their functionality, including data management, quality control, single-trait analysis, and multiple-trait analysis. We also compared the tools and databases based on their features, limitations, and user-friendliness. Our review identified a total of 305 functioning software tools and databases dedicated to GWAS summary statistics, each with unique strengths and limitations. We provide descriptions of the key features of each tool and database, including their input/output formats, data types, and computational requirements. We also discuss the overall usability and applicability of each tool for different research scenarios. This comprehensive review will serve as a valuable resource for researchers who are interested in using GWAS summary statistics to investigate the genetic basis of complex traits and diseases. By providing a detailed overview of the available tools and databases, we aim to facilitate informed tool selection and maximize the effectiveness of GWAS summary statistics analysis.
Collapse
Affiliation(s)
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.
| |
Collapse
|
60
|
Zhu L, Zhang S, Sha Q. Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts. Front Genet 2024; 15:1359591. [PMID: 39301532 PMCID: PMC11410627 DOI: 10.3389/fgene.2024.1359591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 08/23/2024] [Indexed: 09/22/2024] Open
Abstract
Genome-wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with complex diseases. Standard analysis of a GWAS involves assessing the association between each variant and a disease. However, this approach suffers from limited reproducibility and difficulties in detecting multi-variant and pleiotropic effects. Although joint analysis of multiple phenotypes for GWAS can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits, most of the multiple phenotype association tests are designed for a single variant, resulting in much lower power, especially when their effect sizes are small and only their cumulative effect is associated with multiple phenotypes. To overcome these limitations, set-based multiple phenotype association tests have been developed to enhance statistical power and facilitate the identification and interpretation of pleiotropic regions. In this research, we propose a new method, named Meta-TOW-S, which conducts joint association tests between multiple phenotypes and a set of variants (such as variants in a gene) utilizing GWAS summary statistics from different cohorts. Our approach applies the set-based method that Tests for the effect of an Optimal Weighted combination of variants in a gene (TOW) and accounts for sample size differences across GWAS cohorts by employing the Cauchy combination method. Meta-TOW-S combines the advantages of set-based tests and multi-phenotype association tests, exhibiting computational efficiency and enabling analysis across multiple phenotypes while accommodating overlapping samples from different GWAS cohorts. To assess the performance of Meta-TOW-S, we develop a phenotype simulator package that encompasses a comprehensive simulation scheme capable of modeling multiple phenotypes and multiple variants, including noise structures and diverse correlation patterns among phenotypes. Simulation studies validate that Meta-TOW-S maintains a desirable Type I error rate. Further simulation under different scenarios shows that Meta-TOW-S can improve power compared with other existing meta-analysis methods. When applied to four psychiatric disorders summary data, Meta-TOW-S detects a greater number of significant genes.
Collapse
Affiliation(s)
- Lirong Zhu
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States
| | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, United States
| |
Collapse
|
61
|
He D, He X, Shen D, Liu L, Yang X, Hao M, Wang Y, Li Y, Liu Q, Liu M, Wang J, Zhang X, Cui L. Loss-of-function variants in RNA binding motif protein X-linked induce neuronal defects contributing to amyotrophic lateral sclerosis pathogenesis. MedComm (Beijing) 2024; 5:e712. [PMID: 39263607 PMCID: PMC11387721 DOI: 10.1002/mco2.712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 06/19/2024] [Accepted: 06/27/2024] [Indexed: 09/13/2024] Open
Abstract
Despite being one of the most prevalent RNA modifications, the role of N6-methyladenosine (m6A) in amyotrophic lateral sclerosis (ALS) remains ambiguous. In this investigation, we explore the contribution of genetic defects of m6A-related genes to ALS pathogenesis. We scrutinized the mutation landscape of m6A genes through a comprehensive analysis of whole-exome sequencing cohorts, encompassing 508 ALS patients and 1660 population-matched controls. Our findings reveal a noteworthy enrichment of RNA binding motif protein X-linked (RBMX) variants among ALS patients, with a significant correlation between pathogenic m6A variants and adverse clinical outcomes. Furthermore, Rbmx knockdown in NSC-34 cells overexpressing mutant TDP43Q331K results in cell death mediated by an augmented p53 response. Similarly, RBMX knockdown in ALS motor neurons derived from induced pluripotent stem cells (iPSCs) manifests morphological defects and activation of the p53 pathway. Transcriptional analysis using publicly available single-cell sequencing data from the primary motor cortex indicates that RBMX-regulated genes selectively influence excitatory neurons and exhibit enrichment in ALS-implicated pathways. Through integrated analyses, our study underscores the emerging roles played by RBMX in ALS, suggesting a potential nexus between the disease and dysregulated m6A-mediated mRNA metabolism.
Collapse
Affiliation(s)
- Di He
- Department of Neurology Beijing Tiantan Hospital, Capital Medical University Beijing China
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| | - Xinyi He
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute Fudan University Shanghai China
| | - Dongchao Shen
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| | - Liyang Liu
- McKusick-Zhang Center for Genetic Medicine, State Key Laboratory of Medical Molecular Biology Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College Beijing China
| | - Xunzhe Yang
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| | - Meng Hao
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute Fudan University Shanghai China
| | - Yi Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute Fudan University Shanghai China
| | - Yi Li
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute Fudan University Shanghai China
| | - Qing Liu
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| | - Mingsheng Liu
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute Fudan University Shanghai China
- Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases Chinese Academy of Medical Sciences (2019RU058) Shanghai China
| | - Xue Zhang
- McKusick-Zhang Center for Genetic Medicine, State Key Laboratory of Medical Molecular Biology Institute of Basic Medical Sciences Chinese Academy of Medical Sciences, School of Basic Medicine, Peking Union Medical College Beijing China
- Neuroscience Center Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS) Beijing China
| | - Liying Cui
- Department of Neurology Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College Beijing China
| |
Collapse
|
62
|
Koh H. A general kernel machine regression framework using principal component analysis for jointly testing main and interaction effects: Applications to human microbiome studies. NAR Genom Bioinform 2024; 6:lqae148. [PMID: 39534501 PMCID: PMC11555437 DOI: 10.1093/nargab/lqae148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 09/27/2024] [Accepted: 10/18/2024] [Indexed: 11/16/2024] Open
Abstract
The effect of a treatment on a health or disease response can be modified by genetic or microbial variants. It is the matter of interaction effects between genetic or microbial variants and a treatment. To powerfully discover genetic or microbial biomarkers, it is crucial to incorporate such interaction effects in addition to the main effects. However, in the context of kernel machine regression analysis of its kind, existing methods cannot be utilized in a situation, where a kernel is available but its underlying real variants are unknown. To address such limitations, I introduce a general kernel machine regression framework using principal component analysis for jointly testing main and interaction effects. It begins with extracting principal components from an input kernel through the singular value decomposition. Then, it employs the principal components as surrogate variants to construct three endogenous kernels for the main effects, interaction effects, and both of them, respectively. Hence, it works with a kernel as an input without knowing its underlying real variants, and also detects either the main effects, interaction effects, or both of them robustly. I also introduce its omnibus testing extension to multiple input kernels, named OmniK. I demonstrate its use for human microbiome studies.
Collapse
Affiliation(s)
- Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon 21985, South Korea
| |
Collapse
|
63
|
Huang R, Jin Z, Zhang D, Li L, Zhou J, Xiao L, Li P, Zhang M, Tian C, Zhang W, Zhong L, Quan M, Zhao R, Du L, Liu LJ, Li Z, Zhang D, Du Q. Rare variations within the serine/arginine-rich splicing factor PtoRSZ21 modulate stomatal size to determine drought tolerance in Populus. THE NEW PHYTOLOGIST 2024; 243:1776-1794. [PMID: 38978318 DOI: 10.1111/nph.19934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 06/13/2024] [Indexed: 07/10/2024]
Abstract
Rare variants contribute significantly to the 'missing heritability' of quantitative traits. The genome-wide characteristics of rare variants and their roles in environmental adaptation of woody plants remain unexplored. Utilizing genome-wide rare variant association study (RVAS), expression quantitative trait loci (eQTL) mapping, genetic transformation, and molecular experiments, we explored the impact of rare variants on stomatal morphology and drought adaptation in Populus. Through comparative analysis of five world-wide Populus species, we observed the influence of mutational bias and adaptive selection on the distribution of rare variants. RVAS identified 75 candidate genes correlated with stomatal size (SS)/stomatal density (SD), and a rare haplotype in the promoter of serine/arginine-rich splicing factor PtoRSZ21 emerged as the foremost association signal governing SS. As a positive regulator of drought tolerance, PtoRSZ21 can recruit the core splicing factor PtoU1-70K to regulate alternative splicing (AS) of PtoATG2b (autophagy-related 2). The rare haplotype PtoRSZ21hap2 weakens binding affinity to PtoMYB61, consequently affecting PtoRSZ21 expression and SS, ultimately resulting in differential distribution of Populus accessions in arid and humid climates. This study enhances the understanding of regulatory mechanisms that underlie AS induced by rare variants and might provide targets for drought-tolerant varieties breeding in Populus.
Collapse
Affiliation(s)
- Rui Huang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Zhuoying Jin
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Donghai Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Lianzheng Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Jiaxuan Zhou
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Liang Xiao
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Peng Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Mengjiao Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Chongde Tian
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Wenke Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Leishi Zhong
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Mingyang Quan
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Rui Zhao
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Liang Du
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Li-Jun Liu
- College of Forestry, State Forestry and Grassland Administration Key Laboratory of Silviculture in Downstream Areas of the Yellow River, Shandong Agriculture University, Taian, Shandong, 271018, China
| | - Zhonghai Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Deqiang Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Qingzhang Du
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| |
Collapse
|
64
|
Odden MC, Li Y, Jotwani V, Dobrota S, Tan AX, Cummings SR, Shlipak MG, Scherzer R, Ix JH, Buckwalter MS, Tranah GJ. Joint and Individual Mitochondrial DNA Variation and Cognitive Outcomes in Black and White Older Adults. J Gerontol A Biol Sci Med Sci 2024; 79:glae170. [PMID: 39007867 PMCID: PMC11345514 DOI: 10.1093/gerona/glae170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Indexed: 07/16/2024] Open
Abstract
BACKGROUND Mitochondrial dysfunction manifests in neurodegenerative diseases and other age-associated disorders. In this study, we examined variation in inherited mitochondrial DNA (mtDNA) sequences in Black and White participants from 2 large aging studies to identify variants related to cognitive function. METHODS Participants included self-reported Black and White adults aged ≥70 years in the Lifestyle Interventions and Independence for Elders (LIFE; N = 1 319) and Health Aging and Body Composition (Health ABC; N = 788) studies. Cognitive function was measured by the Digit-Symbol Substitution Test (DSST), and the Modified Mini-Mental State Examination (3MSE) at baseline and over follow-up in LIFE (3.6 years) and Health ABC (10 years). We examined the joint effects of multiple variants across 16 functional mitochondrial regions with cognitive function using a sequence kernel association test. Based on these results, we prioritized meta-analysis of common variants in Black and White participants using mixed effects models. A Bonferroni-adjusted p value of <.05 was considered statistically significant. RESULTS Joint variation in subunits ND1, ND2, and ND5 of Complex I, 12S RNA, and hypervariable region (HVR) were significantly associated with DSST and 3MSE at baseline. In meta-analyses among Black participants, variant m.4216T>C, ND1 was associated with a faster decline in 3MSE, and variant m.462C>T in the HVR was associated with a slower decline in DSST. Variant m.5460G>C, ND2 was associated with slower and m.182C>T in the HVR was associated with faster decline in 3MSE in White participants. CONCLUSIONS Among Black and White adults, oxidative phosphorylation Complex I variants were associated with cognitive function.
Collapse
Affiliation(s)
- Michelle C Odden
- Department of Epidemiology and Population Health, Stanford School of Medicine, Stanford, California, USA
| | - Yongmei Li
- Department of Epidemiology and Population Health, Stanford School of Medicine, Stanford, California, USA
| | - Vasantha Jotwani
- Kidney Health Research Collaborative, Department of Medicine, San Francisco Veterans Affairs Health Care System, San Francisco, California, USA
- Department of Medicine, University of California San Francisco, San Francisco, California, USA
| | - Sylvie Dobrota
- Department of Epidemiology and Population Health, Stanford School of Medicine, Stanford, California, USA
| | - Annabel X Tan
- Department of Epidemiology and Population Health, Stanford School of Medicine, Stanford, California, USA
| | - Steven R Cummings
- California Pacific Medical Center Research Institute, San Francisco, California, USA
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, California, USA
| | - Michael G Shlipak
- Kidney Health Research Collaborative, Department of Medicine, San Francisco Veterans Affairs Health Care System, San Francisco, California, USA
- Department of Medicine, University of California San Francisco, San Francisco, California, USA
| | - Rebecca Scherzer
- Kidney Health Research Collaborative, Department of Medicine, San Francisco Veterans Affairs Health Care System, San Francisco, California, USA
- Department of Medicine, University of California San Francisco, San Francisco, California, USA
| | - Joachim H Ix
- Division of Nephrology-Hypertension, Department of Medicine, University of California San Diego, San Diego, California, USA
| | - Marion S Buckwalter
- Department of Neurology and Neurological Sciences, Stanford School of Medicine, Stanford, California, USA
| | - Gregory J Tranah
- California Pacific Medical Center Research Institute, San Francisco, California, USA
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, California, USA
| |
Collapse
|
65
|
Guo H, Urban AE, Wong WH. Prioritizing disease-related rare variants by integrating gene expression data. PLoS Genet 2024; 20:e1011412. [PMID: 39348415 PMCID: PMC11466430 DOI: 10.1371/journal.pgen.1011412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 10/10/2024] [Accepted: 08/29/2024] [Indexed: 10/02/2024] Open
Abstract
Rare variants, comprising the vast majority of human genetic variations, are likely to have more deleterious impact in the context of human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer's disease reveals 16 rare variants within 15 genes with extreme carrier statistics. We also found strong excess of rare variants among the top prioritized genes in patients compared to that in healthy individuals. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.
Collapse
Affiliation(s)
- Hanmin Guo
- Department of Statistics, Stanford University, Stanford, California, United States of America
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California, United States of America
| | - Alexander Eckehart Urban
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California, United States of America
- Department of Genetics, Stanford University School of Medicine, Stanford, California, United States of America
| | - Wing Hung Wong
- Department of Statistics, Stanford University, Stanford, California, United States of America
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California, United States of America
| |
Collapse
|
66
|
Shao M, Tian M, Chen K, Jiang H, Zhang S, Li Z, Shen Y, Chen F, Shen B, Cao C, Gu N. Leveraging Random Effects in Cistrome-Wide Association Studies for Decoding the Genetic Determinants of Prostate Cancer. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2400815. [PMID: 39099406 PMCID: PMC11423091 DOI: 10.1002/advs.202400815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 07/09/2024] [Indexed: 08/06/2024]
Abstract
Cistrome-wide association studies (CWAS) are pivotal for identifying genetic determinants of diseases by correlating genetically regulated cistrome states with phenotypes. Traditional CWAS typically develops a model based on cistrome and genotype data to associate predicted cistrome states with phenotypes. The random effect cistrome-wide association study (RECWAS), reevaluates the necessity of cistrome state prediction in CWAS. RECWAS utilizes either a linear model or marginal effect for initial feature selection, followed by kernel-based feature aggregation for association testing is introduced. Through simulations and analysis of prostate cancer data, a thorough evaluation of CWAS and RECWAS is conducted. The results suggest that RECWAS offers improved power compared to traditional CWAS, identifying additional genomic regions associated with prostate cancer. CWAS identified 102 significant regions, while RECWAS found 50 additional significant regions compared to CWAS, many of which are validated. Validation encompassed a range of biological evidence, including risk signals from the GWAS catalog, susceptibility genes from the DisGeNET database, and enhancer-domain scores. RECWAS consistently demonstrated improved performance over traditional CWAS in identifying genomic regions associated with prostate cancer. These findings demonstrate the benefits of incorporating kernel methods into CWAS and provide new insights for genetic discovery in complex diseases.
Collapse
Affiliation(s)
- Mengting Shao
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Min Tian
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Kaiyang Chen
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Hangjin Jiang
- Center for Data ScienceZhejiang UniversityHangzhou310058P. R. China
| | - Shuting Zhang
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Zhenghui Li
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Yan Shen
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Feng Chen
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
| | - Baixin Shen
- Department of UrologyThe Second Affiliated Hospital of Nanjing Medical UniversityNanjing210011P. R. China
| | - Chen Cao
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
- Department of UrologyThe Second Affiliated Hospital of Nanjing Medical UniversityNanjing210011P. R. China
| | - Ning Gu
- Key Laboratory for Bio‐Electromagnetic Environment and Advanced Medical TheranosticsSchool of Biomedical Engineering and InformaticsNanjing Medical UniversityNanjing211166P. R. China
- Nanjing Key Laboratory for Cardiovascular Information and Health Engineering MedicineInstitute of Clinical MedicineNanjing Drum Tower HospitalMedical SchoolNanjing UniversityNanjing210093P. R. China
| |
Collapse
|
67
|
Aparo A, Bonnici V, Avesani S, Cascione L, Giugno R. DiGAS: Differential gene allele spectrum as a descriptor in genetic studies. Comput Biol Med 2024; 179:108924. [PMID: 39067286 DOI: 10.1016/j.compbiomed.2024.108924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 07/16/2024] [Accepted: 07/17/2024] [Indexed: 07/30/2024]
Abstract
Diagnosing individuals with complex genetic diseases is a challenging task. Computational methodologies exploit information at the genotype level by taking into account single nucleotide polymorphisms (SNPs) leveraging the results of genome-wide association studies analysis to assign a statistical significance to each SNP. Recent methodologies extend such an approach by aggregating SNP significance at the genetic level to identify genes that are related to the condition under study. However, such methodologies still suffer from the initial SNP analysis limitations. Here, we present DiGAS, a tool for diagnosing genetic conditions by computing significance, by means of SNP information, directly at the complex level of genetic regions. Such an approach is based on a generalized notion of allele spectrum, which evaluates the complete genetic alterations of the SNP set belonging to a genetic region at the population level. The statistical significance of a region is then evaluated through a differential allele spectrum analysis between the conditions of individuals belonging to the population. Tests, performed on well-established datasets regarding Alzheimer's disease, show that DiGAS outperforms the state of the art in distinguishing between sick and healthy subjects.
Collapse
Affiliation(s)
- Antonino Aparo
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy; Research Center LURM (Interdepartmental Laboratory of Medical Research), University of Verona, Ple. L.A. Scuro 10, Verona, 37134, Italy
| | - Vincenzo Bonnici
- University of Parma, Parco Area delle Scienze, 53/A, Parma, 43124, Italy
| | - Simone Avesani
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy
| | - Luciano Cascione
- Institute of Oncology Research (IOR), Via Francesco Chiesa 5, Bellinzona, 6500, Switzerland
| | - Rosalba Giugno
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy.
| |
Collapse
|
68
|
Mbatchou J, McPeek MS. JASPER: Fast, powerful, multitrait association testing in structured samples gives insight on pleiotropy in gene expression. Am J Hum Genet 2024; 111:1750-1769. [PMID: 39025064 PMCID: PMC11339629 DOI: 10.1016/j.ajhg.2024.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 06/19/2024] [Accepted: 06/20/2024] [Indexed: 07/20/2024] Open
Abstract
Joint association analysis of multiple traits with multiple genetic variants can provide insight into genetic architecture and pleiotropy, improve trait prediction, and increase power for detecting association. Furthermore, some traits are naturally high-dimensional, e.g., images, networks, or longitudinally measured traits. Assessing significance for multitrait genetic association can be challenging, especially when the sample has population sub-structure and/or related individuals. Failure to adequately adjust for sample structure can lead to power loss and inflated type 1 error, and commonly used methods for assessing significance can work poorly with a large number of traits or be computationally slow. We developed JASPER, a fast, powerful, robust method for assessing significance of multitrait association with a set of genetic variants, in samples that have population sub-structure, admixture, and/or relatedness. In simulations, JASPER has higher power, better type 1 error control, and faster computation than existing methods, with the power and speed advantage of JASPER increasing with the number of traits. JASPER is potentially applicable to a wide range of association testing applications, including for multiple disease traits, expression traits, image-derived traits, and microbiome abundances. It allows for covariates, ascertainment, and rare variants and is robust to phenotype model misspecification. We apply JASPER to analyze gene expression in the Framingham Heart Study, where, compared to alternative approaches, JASPER finds more significant associations, including several that indicate pleiotropic effects, most of which replicate previous results, while others have not previously been reported. Our results demonstrate the promise of JASPER for powerful multitrait analysis in structured samples.
Collapse
Affiliation(s)
- Joelle Mbatchou
- Regeneron Genetics Center, Tarrytown, NY 10591, USA; Department of Statistics, The University of Chicago, Chicago, IL 60637, USA
| | - Mary Sara McPeek
- Department of Statistics, The University of Chicago, Chicago, IL 60637, USA; Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
69
|
Yuan J, Zhuang YY, Liu X, Zhang Y, Li K, Chen ZJ, Li D, Chen H, Liang J, Yao Y, Yu X, Zhuo R, Zhao F, Zhou X, Yu X, Qu J, Su J. Exome-wide association study identifies KDELR3 mutations in extreme myopia. Nat Commun 2024; 15:6703. [PMID: 39112444 PMCID: PMC11306401 DOI: 10.1038/s41467-024-50580-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 07/15/2024] [Indexed: 08/10/2024] Open
Abstract
Extreme myopia (EM), defined as a spherical equivalent (SE) ≤ -10.00 diopters (D), is one of the leading causes of sight impairment. Known EM-associated variants only explain limited risk and are inadequate for clinical decision-making. To discover risk genes, we performed a whole-exome sequencing (WES) on 449 EM individuals and 9606 controls. We find a significant excess of rare protein-truncating variants (PTVs) in EM cases, enriched in the retrograde vesicle-mediated transport pathway. Employing single-cell RNA-sequencing (scRNA-seq) and a single-cell polygenic burden score (scPBS), we pinpointed PI16 + /SFRP4+ fibroblasts as the most relevant cell type. We observed that KDELR3 is highly expressed in scleral fibroblast and involved in scleral extracellular matrix (ECM) organization. The zebrafish model revealed that kdelr3 downregulation leads to elongated ocular axial length and increased lens diameter. Together, our study provides insight into the genetics of EM in humans and highlights KDELR3's role in EM pathogenesis.
Collapse
Affiliation(s)
- Jian Yuan
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
- Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang, China
| | - You-Yuan Zhuang
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Xiaoyu Liu
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Yue Zhang
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Kai Li
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, China
| | - Zhen Ji Chen
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Dandan Li
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - He Chen
- School of Biomedical Engineering, Hainan University, Haikou, China
| | - Jiacheng Liang
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Yinghao Yao
- Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang, China
| | - Xiangyi Yu
- Institute of PSI Genomics, Wenzhou, China
| | - Ran Zhuo
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Fei Zhao
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
| | - Xiangtian Zhou
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China
- Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang, China
| | | | - Jia Qu
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China.
- Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang, China.
- School of Biomedical Engineering, Hainan University, Haikou, China.
| | - Jianzhong Su
- National Engineering Research Center of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou, China.
- Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang, China.
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, China.
| |
Collapse
|
70
|
Seffernick AE, Cao X, Cheng C, Yang W, Autry RJ, Yang JJ, Pui CH, Teachey DT, Lamba JK, Mullighan CG, Pounds SB. Bootstrap Evaluation of Association Matrices (BEAM) for Integrating Multiple Omics Profiles with Multiple Outcomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.31.605805. [PMID: 39131398 PMCID: PMC11312528 DOI: 10.1101/2024.07.31.605805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Motivation Large datasets containing multiple clinical and omics measurements for each subject motivate the development of new statistical methods to integrate these data to advance scientific discovery. Model We propose bootstrap evaluation of association matrices (BEAM), which integrates multiple omics profiles with multiple clinical endpoints. BEAM associates a set omic features with clinical endpoints via regression models and then uses bootstrap resampling to determine statistical significance of the set. Unlike existing methods, BEAM uniquely accommodates an arbitrary number of omic profiles and endpoints. Results In simulations, BEAM performed similarly to the theoretically best simple test and outperformed other integrated analysis methods. In an example pediatric leukemia application, BEAM identified several genes with biological relevance established by a CRISPR assay that had been missed by univariate screens and other integrated analysis methods. Thus, BEAM is a powerful, flexible, and robust tool to identify genes for further laboratory and/or clinical research evaluation. Availability Source code, documentation, and a vignette for BEAM are available on GitHub at: https://github.com/annaSeffernick/BEAMR. The R package is available from CRAN at: https://cran.r-project.org/package=BEAMR. Contact Stanley.Pounds@stjude.org. Supplementary Information Supplementary data are available at the journal's website.
Collapse
Affiliation(s)
- Anna Eames Seffernick
- Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Xueyuan Cao
- Department of Health Promotion and Disease Prevention, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Cheng Cheng
- Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Wenjian Yang
- Department of Pharmacy & Pharmaceutical Services, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Hematological Malignancies Program, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Robert J. Autry
- Hopp Children's Cancer Center Heidelberg (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Consortium for Translational Cancer Research (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jun J. Yang
- Department of Pharmacy & Pharmaceutical Services, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Hematological Malignancies Program, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Department of Oncology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Ching-Hon Pui
- Hematological Malignancies Program, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Department of Oncology, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Department of Pathology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - David T. Teachey
- Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
- Department of Pediatrics and the Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Division of Oncology and Center for Childhood Cancer Research, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jatinder K. Lamba
- Department of Pharmacotherapy and Translational Research, University of Florida College of Pharmacy, Gainesville, FL, USA
| | - Charles G. Mullighan
- Hematological Malignancies Program, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Department of Pathology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Stanley B. Pounds
- Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, TN, USA
| |
Collapse
|
71
|
Han S. Bayesian Rare Variant Analysis Identifies Novel Schizophrenia Putative Risk Genes. J Pers Med 2024; 14:822. [PMID: 39202013 PMCID: PMC11355493 DOI: 10.3390/jpm14080822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 07/17/2024] [Accepted: 07/30/2024] [Indexed: 09/03/2024] Open
Abstract
The genetics of schizophrenia is so complex that it involves both common variants and rare variants. Rare variant association studies of schizophrenia are challenging because statistical methods for rare variant analysis are under-powered due to the rarity of rare variants. The recent Schizophrenia Exome meta-analysis (SCHEMA) consortium, the largest consortium in this field to date, has successfully identified 10 schizophrenia risk genes from ultra-rare variants by burden test, while more risk genes remain to be discovered by more powerful rare variant association test methods. In this study, we use a recently developed Bayesian rare variant association method that is powerful for detecting sparse rare risk variants that implicates 88 new candidate risk genes associated with schizophrenia from the SCHEMA case-control sample. These newly identified genes are significantly enriched in autism risk genes and GO enrichment analysis indicates that new candidate risk genes are involved in mechanosensory behavior, regulation of cell size, neuron projection morphogenesis, and plasma-membrane-bounded cell projection morphogenesis, that may provide new insights on the etiology of schizophrenia.
Collapse
Affiliation(s)
- Shengtong Han
- School of Dentistry, Marquette University, Milwaukee, WI 53201-1881, USA
| |
Collapse
|
72
|
Khan A, Unlu G, Lin P, Liu Y, Kilic E, Kenny TC, Birsoy K, Gamazon ER. Metabolic gene function discovery platform GeneMAP identifies SLC25A48 as necessary for mitochondrial choline import. Nat Genet 2024; 56:1614-1623. [PMID: 38977856 PMCID: PMC11887816 DOI: 10.1038/s41588-024-01827-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 06/10/2024] [Indexed: 07/10/2024]
Abstract
Organisms maintain metabolic homeostasis through the combined functions of small-molecule transporters and enzymes. While many metabolic components have been well established, a substantial number remains without identified physiological substrates. To bridge this gap, we have leveraged large-scale plasma metabolome genome-wide association studies (GWAS) to develop a multiomic Gene-Metabolite Association Prediction (GeneMAP) discovery platform. GeneMAP can generate accurate predictions and even pinpoint genes that are distant from the variants implicated by GWAS. In particular, our analysis identified solute carrier family 25 member 48 (SLC25A48) as a genetic determinant of plasma choline levels. Mechanistically, SLC25A48 loss strongly impairs mitochondrial choline import and synthesis of its downstream metabolite betaine. Integrative rare variant and polygenic score analyses in UK Biobank provide strong evidence that the SLC25A48 causal effects on human disease may in part be mediated by the effects of choline. Altogether, our study provides a discovery platform for metabolic gene function and proposes SLC25A48 as a mitochondrial choline transporter.
Collapse
Affiliation(s)
- Artem Khan
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Gokhan Unlu
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Phillip Lin
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Yuyang Liu
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Ece Kilic
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Timothy C Kenny
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Kıvanç Birsoy
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA.
| | - Eric R Gamazon
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
73
|
Park J, Peña-Tauber A, Talozzi L, Greicius MD, Guen YL. Genetic associations with human longevity are enriched for oncogenic genes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.30.24311226. [PMID: 39132489 PMCID: PMC11312667 DOI: 10.1101/2024.07.30.24311226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Human lifespan is shaped by both genetic and environmental exposures and their interaction. To enable precision health, it is essential to understand how genetic variants contribute to earlier death or prolonged survival. In this study, we tested the association of common genetic variants and the burden of rare non-synonymous variants in a survival analysis, using age-at-death (N = 35,551, median [min, max] = 72.4 [40.9, 85.2]), and last-known-age (N = 358,282, median [min, max] = 71.9 [52.6, 88.7]), in European ancestry participants of the UK Biobank. The associations we identified seemed predominantly driven by cancer, likely due to the age range of the cohort. Common variant analysis highlighted three longevity-associated loci: APOE, ZSCAN23, and MUC5B. We identified six genes whose burden of loss-of-function variants is significantly associated with reduced lifespan: TET2, ATM, BRCA2, CKMT1B, BRCA1 and ASXL1. Additionally, in eight genes, the burden of pathogenic missense variants was associated with reduced lifespan: DNMT3A, SF3B1, CHL1, TET2, PTEN, SOX21, TP53 and SRSF2. Most of these genes have previously been linked to oncogenic-related pathways and some are linked to and are known to harbor somatic variants that predispose to clonal hematopoiesis. A direction-agnostic (SKAT-O) approach additionally identified significant associations with C1orf52, TERT, IDH2, and RLIM, highlighting a link between telomerase function and longevity as well as identifying additional oncogenic genes. Our results emphasize the importance of understanding genetic factors driving the most prevalent causes of mortality at a population level, highlighting the potential of early genetic testing to identify germline and somatic variants increasing one's susceptibility to cancer and/or early death.
Collapse
Affiliation(s)
- Junyoung Park
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, 94305, USA
| | - Andrés Peña-Tauber
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, 94305, USA
| | - Lia Talozzi
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, 94305, USA
| | - Michael D. Greicius
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, 94305, USA
| | - Yann Le Guen
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94304, USA
| |
Collapse
|
74
|
Yu Z, Coorens THH, Uddin MM, Ardlie KG, Lennon N, Natarajan P. Genetic variation across and within individuals. Nat Rev Genet 2024; 25:548-562. [PMID: 38548833 PMCID: PMC11457401 DOI: 10.1038/s41576-024-00709-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2024] [Indexed: 04/12/2024]
Abstract
Germline variation and somatic mutation are intricately connected and together shape human traits and disease risks. Germline variants are present from conception, but they vary between individuals and accumulate over generations. By contrast, somatic mutations accumulate throughout life in a mosaic manner within an individual due to intrinsic and extrinsic sources of mutations and selection pressures acting on cells. Recent advancements, such as improved detection methods and increased resources for association studies, have drastically expanded our ability to investigate germline and somatic genetic variation and compare underlying mutational processes. A better understanding of the similarities and differences in the types, rates and patterns of germline and somatic variants, as well as their interplay, will help elucidate the mechanisms underlying their distinct yet interlinked roles in human health and biology.
Collapse
Affiliation(s)
- Zhi Yu
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | | | - Md Mesbah Uddin
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | | | - Niall Lennon
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Pradeep Natarajan
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
75
|
Zhao G, Le Y, Sun M, Xu J, Qin Y, Men S, Ye Z, Tan H, Hu H, You J, Li J, Jin S, Wang M, Zhang X, Lin Z, Tu L. A dominant negative mutation of GhMYB25-like alters cotton fiber initiation, reducing lint and fuzz. THE PLANT CELL 2024; 36:2759-2777. [PMID: 38447960 PMCID: PMC11289660 DOI: 10.1093/plcell/koae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/09/2023] [Accepted: 12/11/2023] [Indexed: 03/08/2024]
Abstract
Cotton (Gossypium hirsutum) fibers, vital natural textile materials, are single-cell trichomes that differentiate from the ovule epidermis. These fibers are categorized as lint (longer fibers useful for spinning) or fuzz (shorter, less useful fibers). Currently, developing cotton varieties with high lint yield but without fuzz remains challenging due to our limited knowledge of the molecular mechanisms underlying fiber initiation. This study presents the identification and characterization of a naturally occurring dominant negative mutation GhMYB25-like_AthapT, which results in a reduced lint and fuzzless phenotype. The GhMYB25-like_AthapT protein exerts its dominant negative effect by suppressing the activity of GhMYB25-like during lint and fuzz initiation. Intriguingly, the negative effect of GhMYB25-like_AthapT could be alleviated by high expression levels of GhMYB25-like. We also uncovered the role of GhMYB25-like in regulating the expression of key genes such as GhPDF2 (PROTODERMAL FACTOR 2), CYCD3; 1 (CYCLIN D3; 1), and PLD (Phospholipase D), establishing its significance as a pivotal transcription factor in fiber initiation. We identified other genes within this regulatory network, expanding our understanding of the determinants of fiber cell fate. These findings offer valuable insights for cotton breeding and contribute to our fundamental understanding of fiber development.
Collapse
Affiliation(s)
- Guannan Zhao
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Yu Le
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Mengling Sun
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jiawen Xu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Yuan Qin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - She Men
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Zhengxiu Ye
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Haozhe Tan
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Haiyan Hu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jiaqi You
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jianying Li
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Shuangxia Jin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Maojun Wang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Xianlong Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Zhongxu Lin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Lili Tu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| |
Collapse
|
76
|
Yang H, Wang X, Zhang Z, Chen F, Cao H, Yan L, Gao X, Dong H, Cui Y. A high-dimensional omnibus test for set-based association analysis. Brief Bioinform 2024; 25:bbae456. [PMID: 39288231 PMCID: PMC11407446 DOI: 10.1093/bib/bbae456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 08/21/2024] [Accepted: 09/03/2024] [Indexed: 09/19/2024] Open
Abstract
Set-based association analysis is a valuable tool in studying the etiology of complex diseases in genome-wide association studies, as it allows for the joint testing of variants in a region or group. Two common types of single nucleotide polymorphism (SNP)-disease functional models are recognized when evaluating the joint function of a set of SNP: the cumulative weak signal model, in which multiple functional variants with small effects contribute to disease risk, and the dominating strong signal model, in which a few functional variants with large effects contribute to disease risk. However, existing methods have two main limitations that reduce their power. Firstly, they typically only consider one disease-SNP association model, which can result in significant power loss if the model is misspecified. Secondly, they do not account for the high-dimensional nature of SNPs, leading to low power or high false positives. In this study, we propose a solution to these challenges by using a high-dimensional inference procedure that involves simultaneously fitting many SNPs in a regression model. We also propose an omnibus testing procedure that employs a robust and powerful P-value combination method to enhance the power of SNP-set association. Our results from extensive simulation studies and a real data analysis demonstrate that our set-based high-dimensional inference strategy is both flexible and computationally efficient and can substantially improve the power of SNP-set association analysis. Application to a real dataset further demonstrates the utility of the testing strategy.
Collapse
Affiliation(s)
- Haitao Yang
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
- Hebei Key Laboratory of Environment and Human Health, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
- Hebei Key Laboratory of Forensic Medicine, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Xin Wang
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Zechen Zhang
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
- Hebei Key Laboratory of Environment and Human Health, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Fuzhao Chen
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Hongyan Cao
- Department of Health Statistics, Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, School of Public Health; MOE Key Laboratory of Coal Environmental Pathogenicity and Prevention, Shanxi Medical University, No 56 Xinjian South Rd., Taiyuan, Shanxi 030001, P.R. China
| | - Lina Yan
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
- Hebei Key Laboratory of Environment and Human Health, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Xia Gao
- Division of Health Statistics, School of Public Health, Hebei Medical University, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
- Hebei Key Laboratory of Environment and Human Health, 361 East Zhongshan Road, Shijiazhuang, Hebei 050017, P.R. China
| | - Hui Dong
- Department of Neurology, Second Hospital of Hebei Medical University, 215 West Heping Road, Shijiazhuang, Hebei 050000, P.R. China
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, 619 Red Cedar Rd., East Lansing, MI 48824, United States
| |
Collapse
|
77
|
Tian D, Zhang Z, Huang B, Han B, Li X, Zhao K. Genome-Wide Association Analyses and Population Verification Highlight the Potential Genetic Basis of Horned Morphology during Polled Selection in Tibetan Sheep. Animals (Basel) 2024; 14:2152. [PMID: 39123678 PMCID: PMC11311095 DOI: 10.3390/ani14152152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 07/15/2024] [Accepted: 07/22/2024] [Indexed: 08/12/2024] Open
Abstract
The types and morphology of sheep horns have been extensively researched, yet the genetic foundation underlying the emergence of diverse horn characteristics during the breeding of polled Tibetan sheep has remained elusive. Genome-wide association analysis (GWAS) was performed on 103 subtypes (normal large horn, scurs, and polled) differentiated from G2 (offspring (G2) of parent (G1) of polled) of the polled core herd. Six single nucleotide polymorphisms (SNPs) located on chromosome 10 of the relaxin family peptide receptor 2 (RXFP2) gene exhibited positive correlations with horn length, horn base circumference, and horn base interval. Furthermore, in genotyping 382 G2 individuals, significant variations were observed for each specific horn type. Three additional mutations were identified near the target SNP upstream of the amplification product. Finally, the RXFP2-specific haplotype associated with the horned trait effectively maintained horn length, horn base circumference, and horn base interval in Tibetan sheep, as confirmed by population validation of nine loci in a sample size of 1125 individuals. The present study offers novel insights into the genetic differentiation of the horned type during improvement breeding and evolution, thereby establishing a robust theoretical foundation for polled Tibetan sheep breeding and providing valuable guidance for practical production.
Collapse
Affiliation(s)
- Dehong Tian
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810001, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zian Zhang
- Qinghai Sheep Breeding and Promotion Service Center, Gangcha 812300, China
| | - Bin Huang
- Qinghai Sheep Breeding and Promotion Service Center, Gangcha 812300, China
| | - Buying Han
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810001, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xue Li
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810001, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Kai Zhao
- Qinghai Provincial Key Laboratory of Animal Ecological Genomics, Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xining 810001, China
| |
Collapse
|
78
|
Cirulli ET, Schiabor Barrett KM, Bolze A, Judge DP, Pawloski PA, Grzymski JJ, Lee W, Washington NL. A power-based sliding window approach to evaluate the clinical impact of rare genetic variants in the nucleotide sequence or the spatial position of the folded protein. HGG ADVANCES 2024; 5:100284. [PMID: 38509709 PMCID: PMC11004801 DOI: 10.1016/j.xhgg.2024.100284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 03/14/2024] [Accepted: 03/15/2024] [Indexed: 03/22/2024] Open
Abstract
Systematic determination of novel variant pathogenicity remains a major challenge, even when there is an established association between a gene and phenotype. Here we present Power Window (PW), a sliding window technique that identifies the impactful regions of a gene using population-scale clinico-genomic datasets. By sizing analysis windows on the number of variant carriers, rather than the number of variants or nucleotides, statistical power is held constant, enabling the localization of clinical phenotypes and removal of unassociated gene regions. The windows can be built by sliding across either the nucleotide sequence of the gene (through 1D space) or the positions of the amino acids in the folded protein (through 3D space). Using a training set of 350k exomes from the UK Biobank (UKB), we developed PW models for well-established gene-disease associations and tested their accuracy in two independent cohorts (117k UKB exomes and 65k exomes sequenced at Helix in the Healthy Nevada Project, myGenetics, or In Our DNA SC studies). The significant models retained a median of 49% of the qualifying variant carriers in each gene (range 2%-98%), with quantitative traits showing a median effect size improvement of 66% compared with aggregating variants across the entire gene, and binary traits' odds ratios improving by a median of 2.2-fold. PW showcases that electronic health record-based statistical analyses can accurately distinguish between novel coding variants in established genes that will have high phenotypic penetrance and those that will not, unlocking new potential for human genomics research, drug development, variant interpretation, and precision medicine.
Collapse
Affiliation(s)
| | | | - Alexandre Bolze
- Helix, 101 S Ellsworth Ave Suite 350, San Mateo, CA 94401, USA
| | - Daniel P Judge
- Division of Cardiology, Medical University of South Carolina, 30 Courtenay Drive, MSC 592, Charleston, SC 29425, USA
| | | | - Joseph J Grzymski
- University of Nevada, 2215 Raggio Pkwy, Reno, NV 89512, USA; Renown Institute for Health Innovation, Reno, NV 89512, USA
| | - William Lee
- Helix, 101 S Ellsworth Ave Suite 350, San Mateo, CA 94401, USA
| | | |
Collapse
|
79
|
Akinbiyi T, McPeek MS, Abney M. ADELLE: A global testing method for Trans-eQTL mapping. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.24.581871. [PMID: 38464248 PMCID: PMC10925110 DOI: 10.1101/2024.02.24.581871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Understanding the genetic regulatory mechanisms of gene expression is a challenging and ongoing problem. Genetic variants that are associated with expression levels are readily identified when they are proximal to the gene (i.e., cis-eQTLs), but SNPs distant from the gene whose expression levels they are associated with (i.e., trans-eQTLs) have been much more difficult to discover, even though they account for a majority of the heritability in gene expression levels. A major impediment to the identification of more trans-eQTLs is the lack of statistical methods that are powerful enough to overcome the obstacles of small effect sizes and large multiple testing burden of trans-eQTL mapping. Here, we propose ADELLE, a powerful statistical testing framework that requires only summary statistics and is designed to be most sensitive to SNPs that are associated with multiple gene expression levels, a characteristic of many trans-eQTLs. In simulations, we show that for detecting SNPs that are associated with 0.1%-2% of 10,000 traits, among the 7 methods we consider ADELLE is clearly the most powerful overall, with either the highest power or power not significantly different from the highest for all settings in that range. We apply ADELLE to a mouse advanced intercross line data set and show its ability to find trans-eQTLs that were not significant under a standard analysis. This demonstrates that ADELLE is a powerful tool at uncovering trans regulators of genetic expression.
Collapse
|
80
|
Yang L, Ou YN, Wu BS, Liu WS, Deng YT, He XY, Chen YL, Kang J, Fei CJ, Zhu Y, Tan L, Dong Q, Feng J, Cheng W, Yu JT. Large-scale whole-exome sequencing analyses identified protein-coding variants associated with immune-mediated diseases in 350,770 adults. Nat Commun 2024; 15:5924. [PMID: 39009607 PMCID: PMC11250857 DOI: 10.1038/s41467-024-49782-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 06/18/2024] [Indexed: 07/17/2024] Open
Abstract
The genetic contribution of protein-coding variants to immune-mediated diseases (IMDs) remains underexplored. Through whole exome sequencing of 40 IMDs in 350,770 UK Biobank participants, we identified 162 unique genes in 35 IMDs, among which 124 were novel genes. Several genes, including FLG which is associated with atopic dermatitis and asthma, showed converging evidence from both rare and common variants. 91 genes exerted significant effects on longitudinal outcomes (interquartile range of Hazard Ratio: 1.12-5.89). Mendelian randomization identified five causal genes, of which four were approved drug targets (CDSN, DDR1, LTA, and IL18BP). Proteomic analysis indicated that mutations associated with specific IMDs might also affect protein expression in other IMDs. For example, DXO (celiac disease-related gene) and PSMB9 (alopecia areata-related gene) could modulate CDSN (autoimmune hypothyroidism-, psoriasis-, asthma-, and Graves' disease-related gene) expression. Identified genes predominantly impact immune and biochemical processes, and can be clustered into pathways of immune-related, urate metabolism, and antigen processing. Our findings identified protein-coding variants which are the key to IMDs pathogenesis and provided new insights into tailored innovative therapies.
Collapse
Affiliation(s)
- Liu Yang
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Ya-Nan Ou
- Department of Neurology, Qingdao Municipal Hospital, Qingdao University, Qingdao, 266071, China
| | - Bang-Sheng Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Wei-Shi Liu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Yue-Ting Deng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Xiao-Yu He
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Yi-Lin Chen
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Jujiao Kang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Chen-Jie Fei
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Ying Zhu
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, Fudan University, Shanghai, 200032, China
| | - Lan Tan
- Department of Neurology, Qingdao Municipal Hospital, Qingdao University, Qingdao, 266071, China
| | - Qiang Dong
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Jianfeng Feng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Wei Cheng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Jin-Tai Yu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
81
|
Hop PJ, Lai D, Keagle PJ, Baron DM, Kenna BJ, Kooyman M, Shankaracharya, Halter C, Straniero L, Asselta R, Bonvegna S, Soto-Beasley AI, Wszolek ZK, Uitti RJ, Isaias IU, Pezzoli G, Ticozzi N, Ross OA, Veldink JH, Foroud TM, Kenna KP, Landers JE. Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson's disease. Nat Genet 2024; 56:1371-1376. [PMID: 38858457 PMCID: PMC11250361 DOI: 10.1038/s41588-024-01787-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 05/06/2024] [Indexed: 06/12/2024]
Abstract
Despite substantial progress, causal variants are identified only for a minority of familial Parkinson's disease (PD) cases, leaving high-risk pathogenic variants unidentified1,2. To identify such variants, we uniformly processed exome sequencing data of 2,184 index familial PD cases and 69,775 controls. Exome-wide analyses converged on RAB32 as a novel PD gene identifying c.213C > G/p.S71R as a high-risk variant presenting in ~0.7% of familial PD cases while observed in only 0.004% of controls (odds ratio of 65.5). This variant was confirmed in all cases via Sanger sequencing and segregated with PD in three families. RAB32 encodes a small GTPase known to interact with LRRK2 (refs. 3,4). Functional analyses showed that RAB32 S71R increases LRRK2 kinase activity, as indicated by increased autophosphorylation of LRRK2 S1292. Here our results implicate mutant RAB32 in a key pathological mechanism in PD-LRRK2 kinase activity5-7-and thus provide novel insights into the mechanistic connections between RAB family biology, LRRK2 and PD risk.
Collapse
Affiliation(s)
- Paul J Hop
- Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Dongbing Lai
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Pamela J Keagle
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Desiree M Baron
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Brendan J Kenna
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Maarten Kooyman
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Shankaracharya
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Cheryl Halter
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Letizia Straniero
- Department of Biomedical Sciences, Humanitas University, Milan, Italy
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | - Rosanna Asselta
- Department of Biomedical Sciences, Humanitas University, Milan, Italy
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | | | | | | | - Ryan J Uitti
- Department of Neurology, Mayo Clinic, Jacksonville, FL, USA
| | - Ioannis Ugo Isaias
- Parkinson Institute, ASST Gaetano Pini-CTO, Milan, Italy
- Department of Neurology, University Hospital of Würzburg and Julius Maximilian University of Würzburg, Würzburg, Germany
| | - Gianni Pezzoli
- Parkinson Institute, ASST Gaetano Pini-CTO, Milan, Italy
- Fondazione Grigioni per il Morbo di Parkinson, Milan, Italy
| | - Nicola Ticozzi
- Department of Neurology-Stroke Unit and Laboratory of Neuroscience, Istituto Auxologico Italiano IRCCS, Milan, Italy
- Department of Pathophysiology and Transplantation, 'Dino Ferrari' Center, Università degli Studi di Milano, Milan, Italy
| | - Owen A Ross
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Department of Clinical Genomics, Mayo Clinic, Jacksonville, FL, USA
| | - Jan H Veldink
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Tatiana M Foroud
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kevin P Kenna
- Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - John E Landers
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA.
| |
Collapse
|
82
|
Yaacov O, Mathiyalagan P, Berk-Rauch HE, Ganesh SK, Zhu L, Hoffmann TJ, Iribarren C, Risch N, Lee D, Chakravarti A. Identification of the Molecular Components of Enhancer-Mediated Gene Expression Variation in Multiple Tissues Regulating Blood Pressure. Hypertension 2024; 81:1500-1510. [PMID: 38747164 PMCID: PMC11168860 DOI: 10.1161/hypertensionaha.123.22538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 04/24/2024] [Indexed: 06/14/2024]
Abstract
BACKGROUND Inter-individual variation in blood pressure (BP) arises in part from sequence variants within enhancers modulating the expression of causal genes. We propose that these genes, active in tissues relevant to BP physiology, can be identified from tissue-level epigenomic data and genotypes of BP-phenotyped individuals. METHODS We used chromatin accessibility data from the heart, adrenal, kidney, and artery to identify cis-regulatory elements (CREs) in these tissues and estimate the impact of common human single-nucleotide variants within these CREs on gene expression, using machine learning methods. To identify causal genes, we performed a gene-wise association test. We conducted analyses in 2 separate large-scale cohorts: 77 822 individuals from the Genetic Epidemiology Research on Adult Health and Aging and 315 270 individuals from the UK Biobank. RESULTS We identified 309, 259, 331, and 367 genes (false discovery rate <0.05) for diastolic BP and 191, 184, 204, and 204 genes for systolic BP in the artery, kidney, heart, and adrenal, respectively, in Genetic Epidemiology Research on Adult Health and Aging; 50% to 70% of these genes were replicated in the UK Biobank, significantly higher than the 12% to 15% expected by chance (P<0.0001). These results enabled tissue expression prediction of these 988 to 2875 putative BP genes in individuals of both cohorts to construct an expression polygenic score. This score explained ≈27% of the reported single-nucleotide variant heritability, substantially higher than expected from prior studies. CONCLUSIONS Our work demonstrates the power of tissue-restricted comprehensive CRE analysis, followed by CRE-based expression prediction, for understanding BP regulation in relevant tissues and provides dual-modality supporting evidence, CRE and expression, for the causality genes.
Collapse
Affiliation(s)
- Or Yaacov
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Prabhu Mathiyalagan
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
- Benthos Prime Central, Houston, TX, USA
| | - Hanna E. Berk-Rauch
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Santhi K. Ganesh
- Department of Internal Medicine & Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Luke Zhu
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Thomas J. Hoffmann
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Carlos Iribarren
- Kaiser Permanente Northern California Division of Research, Oakland, CA, USA
| | - Neil Risch
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Kaiser Permanente Northern California Division of Research, Oakland, CA, USA
| | - Dongwon Lee
- Department of Pediatrics, Division of Nephrology, Boston Children’s Hospital, Boston & Harvard Medical School, Boston, MA, USA
| | - Aravinda Chakravarti
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| |
Collapse
|
83
|
Petrazzini BO, Forrest IS, Rocheleau G, Vy HMT, Márquez-Luna C, Duffy Á, Chen R, Park JK, Gibson K, Goonewardena SN, Malick WA, Rosenson RS, Jordan DM, Do R. Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease. Nat Genet 2024; 56:1412-1419. [PMID: 38862854 PMCID: PMC11781350 DOI: 10.1038/s41588-024-01791-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 05/08/2024] [Indexed: 06/13/2024]
Abstract
Coronary artery disease (CAD) exists on a spectrum of disease represented by a combination of risk factors and pathogenic processes. An in silico score for CAD built using machine learning and clinical data in electronic health records captures disease progression, severity and underdiagnosis on this spectrum and could enhance genetic discovery efforts for CAD. Here we tested associations of rare and ultrarare coding variants with the in silico score for CAD in the UK Biobank, All of Us Research Program and BioMe Biobank. We identified associations in 17 genes; of these, 14 show at least moderate levels of prior genetic, biological and/or clinical support for CAD. We also observed an excess of ultrarare coding variants in 321 aggregated CAD genes, suggesting more ultrarare variant associations await discovery. These results expand our understanding of the genetic etiology of CAD and illustrate how digital markers can enhance genetic association investigations for complex diseases.
Collapse
Affiliation(s)
- Ben Omega Petrazzini
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Iain S Forrest
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ghislain Rocheleau
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ha My T Vy
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Carla Márquez-Luna
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Áine Duffy
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert Chen
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joshua K Park
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Kyle Gibson
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sascha N Goonewardena
- Division of Cardiovascular Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Division of Cardiovascular Medicine, VA Ann Arbor Health System, Ann Arbor, MI, USA
| | - Waqas A Malick
- Metabolism and Lipids Program, Mount Sinai Fuster Heart Hospital, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert S Rosenson
- Metabolism and Lipids Program, Mount Sinai Fuster Heart Hospital, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Daniel M Jordan
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ron Do
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
84
|
Pattillo Smith S, Darnell G, Udwin D, Stamp J, Harpak A, Ramachandran S, Crawford L. Discovering non-additive heritability using additive GWAS summary statistics. eLife 2024; 13:e90459. [PMID: 38913556 PMCID: PMC11196113 DOI: 10.7554/elife.90459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Accepted: 04/22/2024] [Indexed: 06/26/2024] Open
Abstract
LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e. interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
Collapse
Affiliation(s)
- Samuel Pattillo Smith
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Ecology and Evolutionary Biology, Brown UniversityProvidenceUnited States
- Department of Integrative Biology, The University of Texas at AustinAustinUnited States
- Department of Population Health, The University of Texas at AustinAustinUnited States
| | - Gregory Darnell
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Institute for Computational and Experimental Research in Mathematics, Brown UniversityProvidenceUnited States
| | - Dana Udwin
- Department of Biostatistics, Brown UniversityProvidenceUnited States
| | - Julian Stamp
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
| | - Arbel Harpak
- Department of Integrative Biology, The University of Texas at AustinAustinUnited States
- Department of Population Health, The University of Texas at AustinAustinUnited States
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Ecology and Evolutionary Biology, Brown UniversityProvidenceUnited States
- Data Science Institute, Brown UniversityProvidenceUnited States
| | - Lorin Crawford
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Biostatistics, Brown UniversityProvidenceUnited States
- MicrosoftCambridgeUnited States
| |
Collapse
|
85
|
Choi J, Xu Z, Sun R. Variance-components tests for genetic association with multiple interval-censored outcomes. Stat Med 2024; 43:2560-2574. [PMID: 38636557 PMCID: PMC11116038 DOI: 10.1002/sim.10081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 02/18/2024] [Accepted: 04/02/2024] [Indexed: 04/20/2024]
Abstract
Massive genetic compendiums such as the UK Biobank have become an invaluable resource for identifying genetic variants that are associated with complex diseases. Due to the difficulties of massive data collection, a common practice of these compendiums is to collect interval-censored data. One challenge in analyzing such data is the lack of methodology available for genetic association studies with interval-censored data. Genetic effects are difficult to detect because of their rare and weak nature, and often the time-to-event outcomes are transformed to binary phenotypes for access to more powerful signal detection approaches. However transforming the data to binary outcomes can result in loss of valuable information. To alleviate such challenges, this work develops methodology to associate genetic variant sets with multiple interval-censored outcomes. Testing sets of variants such as genes or pathways is a common approach in genetic association settings to lower the multiple testing burden, aggregate small effects, and improve interpretations of results. Instead of performing inference with only a single outcome, utilizing multiple outcomes can increase statistical power by aggregating information across multiple correlated phenotypes. Simulations show that the proposed strategy can offer significant power gains over a single outcome approach. We apply the proposed test to the investigation that motivated this study, a search for the genes that perturb risks of bone fractures and falls in the UK Biobank.
Collapse
Affiliation(s)
- Jaihee Choi
- Department of Statistics, Rice University, Texas, USA
| | - Zhichao Xu
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Texas, USA
| | - Ryan Sun
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Texas, USA
| |
Collapse
|
86
|
Zhong Y, Tubbs JD, Leung PBM, Zhan N, Hui TCK, Ho KKY, Hung KSY, Cheung EFC, So HC, Lui SSY, Sham PC. Whole-exome sequencing in a Chinese sample provides preliminary evidence for the link between rare/low-frequency immune-related variants and early-onset schizophrenia. Asian J Psychiatr 2024; 96:104046. [PMID: 38663229 DOI: 10.1016/j.ajp.2024.104046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 04/06/2024] [Indexed: 06/01/2024]
Abstract
Rare and low-frequency variants contribute to schizophrenia (SCZ), and may influence its age-at-onset (AAO). We examined the association of rare or low-frequency deleterious coding variants in Chinese patients with SCZ. We collected DNA samples in 197 patients with SCZ spectrum disorder and 82 healthy controls (HC), and performed exome sequencing. The AAO variable was ascertained in the majority of SCZ participants for identify the early-onset (EOS, AAO<=18) and adult-onset (AOS, AAO>18) subgroups. We examined the overall association of rare/low-frequency, damaging variants in SCZ versus HC, EOS versus HC, and AOS versus HC at the gene and gene-set levels using Sequence Kernel Association Test. The quantitative rare-variant association test of AAO was conducted. Resampling was used to obtain empirical p-values and to control for family-wise error rate (FWER). In binary-trait association tests, we identified 5 potential candidate risk genes and 10 gene ontology biological processes (GOBP) terms, among which PADI2 reached FWER-adjusted significance. In quantitative rare-variant association tests, we found marginally significant correlations of AAO with alterations in 4 candidate risk genes, and 5 GOBP pathways. Together, the biological and functional profiles of these genes and gene sets supported the involvement of perturbations of neural systems in SCZ, and altered immune functions in EOS.
Collapse
Affiliation(s)
- Yuanxin Zhong
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Justin D Tubbs
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Department of Psychiatry, Harvard Medical School, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Perry B M Leung
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Na Zhan
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Tomy C K Hui
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Karen K Y Ho
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Karen S Y Hung
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Eric F C Cheung
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Hon-Cheong So
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region of China; Department of Psychiatry, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region of China.
| | - Simon S Y Lui
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China.
| | - Pak C Sham
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong Special Administrative Region of China.
| |
Collapse
|
87
|
Malik R, Beaufort N, Li J, Tanaka K, Georgakis MK, He Y, Koido M, Terao C, Japan B, Anderson CD, Kamatani Y, Zand R, Dichgans M. Genetically proxied HTRA1 protease activity and circulating levels independently predict risk of ischemic stroke and coronary artery disease. NATURE CARDIOVASCULAR RESEARCH 2024; 3:701-713. [PMID: 39196222 DOI: 10.1038/s44161-024-00475-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 04/23/2024] [Indexed: 08/29/2024]
Abstract
Genetic variants in HTRA1 are associated with stroke risk. However, the mechanisms mediating this remain largely unknown, as does the full spectrum of phenotypes associated with genetic variation in HTRA1. Here we show that rare HTRA1 variants are linked to ischemic stroke in the UK Biobank and BioBank Japan. Integrating data from biochemical experiments, we next show that variants causing loss of protease function associated with ischemic stroke, coronary artery disease and skeletal traits in the UK Biobank and MyCode cohorts. Moreover, a common variant modulating circulating HTRA1 mRNA and protein levels enhances the risk of ischemic stroke and coronary artery disease while lowering the risk of migraine and macular dystrophy in genome-wide association study, UK Biobank, MyCode and BioBank Japan data. We found no interaction between proxied HTRA1 activity and levels. Our findings demonstrate the role of HTRA1 for cardiovascular diseases and identify two mechanisms as potential targets for therapeutic interventions.
Collapse
Affiliation(s)
- Rainer Malik
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig Maximilian University of Munich, Munich, Germany
| | - Nathalie Beaufort
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig Maximilian University of Munich, Munich, Germany
| | - Jiang Li
- Department of Molecular and Functional Genomics, Geisinger Health System, Danville, PA, USA
| | - Koki Tanaka
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Marios K Georgakis
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig Maximilian University of Munich, Munich, Germany
| | - Yunye He
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Masaru Koido
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - BioBank Japan
- Institute of Medical Science, University of Tokyo, Tokyo, Japan
| | - Christopher D Anderson
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and Massachusetts Institute of Technology, Boston, MA, USA
- McCance Center for Brain Health, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Brigham and Women's Hospital, Boston, MA, USA
| | - Yoichiro Kamatani
- Laboratory of Complex Trait Genomics, Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan
| | - Ramin Zand
- Department of Neurology, Pennsylvania State University, Hershey, PA, USA
- Department of Neurology, Neuroscience Institute, Geisinger Health System, Danville, PA, USA
| | - Martin Dichgans
- Institute for Stroke and Dementia Research (ISD), University Hospital, Ludwig Maximilian University of Munich, Munich, Germany.
- German Center for Neurodegenerative Diseases (DZNE), Munich, Germany.
- German Center for Cardiovascular Research (DZHK), Munich, Germany.
- Munich Cluster for Systems Neurology (SyNergy), Munich, Germany.
| |
Collapse
|
88
|
Huang YH, Chen YC, Ho WM, Lee RG, Chung RH, Liu YL, Chang PY, Chang SC, Wang CW, Chung WH, Tsai SJ, Kuo PH, Lee YS, Hsiao CC. Classifying Alzheimer's disease and normal subjects using machine learning techniques and genetic-environmental features. J Formos Med Assoc 2024; 123:701-709. [PMID: 38044212 DOI: 10.1016/j.jfma.2023.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/24/2023] [Accepted: 10/23/2023] [Indexed: 12/05/2023] Open
Abstract
BACKGROUND Alzheimer's disease (AD) is complicated by multiple environmental and polygenetic factors. The accuracy of artificial neural networks (ANNs) incorporating the common factors for identifying AD has not been evaluated. METHODS A total of 184 probable AD patients and 3773 healthy individuals aged 65 and over were enrolled. AD-related genes (51 SNPs) and 8 environmental factors were selected as features for multilayer ANN modeling. Random Forest (RF) and Support Vector Machine with RBF kernel (SVM) were also employed for comparison. Model results were verified using traditional statistics. RESULTS The ANN achieved high accuracy (0.98), sensitivity (0.95), and specificity (0.96) in the intrinsic test for AD classification. Excluding age and genetic data still yielded favorable results (accuracy: 0.97, sensitivity: 0.94, specificity: 0.96). The assigned weights to ANN features highlighted the importance of mental evaluation, years of education, and specific genetic variations (CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650) for AD classification. Receiver operating characteristic analysis revealed AUC values of 0.99 (intrinsic test), 0.60 (TWB-GWA), and 0.72 (CG-WGS), with slightly lower AUC values (0.96, 0.80, 0.52) when excluding age in ANN. The performance of the ANN model in AD classification was comparable to RF, SVM (linear kernel), and SVM (RBF kernel). CONCLUSION The ANN model demonstrated good sensitivity, specificity, and accuracy in AD classification. The top-weighted SNPs for AD prediction were CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650. The ANN model performed similarly to RF and SVM, indicating its capability to handle the complexity of AD as a disease entity.
Collapse
Affiliation(s)
- Yu-Hua Huang
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Yi-Chun Chen
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Wei-Min Ho
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Ren-Guey Lee
- Department of Electronics Engineering, National Taipei University of Technology, Taipei, Taiwan
| | - Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Pi-Yueh Chang
- Department of Laboratory Medicine, Chang Gung Memorial Hospital Linkou Medical Center, Taoyuan, Taiwan; Department of Medical Biotechnology and Laboratory Science, Chang Gung University, Taoyuan, Taiwan
| | - Shih-Cheng Chang
- Department of Laboratory Medicine, Chang Gung Memorial Hospital Linkou Medical Center, Taoyuan, Taiwan; Department of Medical Biotechnology and Laboratory Science, Chang Gung University, Taoyuan, Taiwan
| | - Chaung-Wei Wang
- Department of Dermatology, Drug Hypersensitivity Clinical and Research Center, Chang Gung Memorial Hospital, Linkou, Taipei and Keelung, Taiwan; Cancer Vaccine and Immune Cell Therapy Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan; Whole-Genome Research Core Laboratory of Human Diseases, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Wen-Hung Chung
- Department of Dermatology, Drug Hypersensitivity Clinical and Research Center, Chang Gung Memorial Hospital, Linkou, Taipei and Keelung, Taiwan; Cancer Vaccine and Immune Cell Therapy Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan; Whole-Genome Research Core Laboratory of Human Diseases, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Shih-Jen Tsai
- Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan; Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Department of Public Health, Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan
| | - Yun-Shien Lee
- Department of Biotechnology, Ming Chuan University, Taoyuan, Taiwan; Genomic Medicine Research Core Laboratory, Chang Gung Memorial Hospital, Taoyuan, Taiwan.
| | - Chun-Chieh Hsiao
- Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan; Department of Computer Information and Network Engineering, Lunghwa University of Science and Technology, Taoyuan, Taiwan.
| |
Collapse
|
89
|
Zheng D, Grandgenett PM, Zhang Q, Baine M, Shi Y, Du Q, Liang X, Wong J, Iqbal S, Preuss K, Kamal A, Yu H, Du H, Hollingsworth MA, Zhang C. radioGWAS links radiome to genome to discover driver genes with somatic mutations for heterogeneous tumor image phenotype in pancreatic cancer. Sci Rep 2024; 14:12316. [PMID: 38811597 PMCID: PMC11137018 DOI: 10.1038/s41598-024-62741-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 05/21/2024] [Indexed: 05/31/2024] Open
Abstract
Addressing the significant level of variability exhibited by pancreatic cancer necessitates the adoption of a systems biology approach that integrates molecular data, biological properties of the tumors, medical images, and clinical features of the patients. In this study, a comprehensive multi-omics methodology was employed to examine a distinctive collection of patient dataset containing rapid autopsy tumor and normal tissue samples as well as longitudinal imaging with a focus on pancreatic cancer. By performing a whole exome sequencing analysis on tumor and normal tissues to identify somatic gene variants and a radiomic feature analysis to tumor CT images, the genome-wide association approach established a connection between pancreatic cancer driver genes and relevant radiomic features, enabling a thorough and quantitative assessment of the heterogeneity of pancreatic tumors. The significant association between sets of genes and radiomic features revealed the involvement of genes in shaping tumor morphological heterogeneity. Some results of the association established a connection between the molecular level mechanism and their outcomes at the level of tumor structural heterogeneity. Because tumor structure and tumor structural heterogeneity are related to the patients' overall survival, patients who had pancreatic cancer driver gene mutations with an association to a certain radiomic feature have been observed to experience worse survival rates than cases without these somatic mutations. Furthermore, the association analysis has revealed potential gene mutations and radiomic feature candidates that warrant further investigation in future research endeavors.
Collapse
Affiliation(s)
- Dandan Zheng
- Department of Radiation Oncology, University of Rochester Medical Center, Rochester, NY, USA.
| | - Paul M Grandgenett
- Eppley Institute for Research in Cancer and Allied Diseases, University of Nebraska Medical Center, Omaha, NE, USA
| | - Qi Zhang
- Department of Mathematics and Statistics, University of New Hampshire, Durham, NH, USA
| | - Michael Baine
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Yu Shi
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Qian Du
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Xiaoying Liang
- Department of Radiation Oncology, Mayo Clinic, Jacksonville, FL, USA
| | - Jeffrey Wong
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Subhan Iqbal
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Kiersten Preuss
- Department of Nutrition and Health Sciences, University of Nebraska, Lincoln, NE, USA
| | - Ahsan Kamal
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Hongfeng Yu
- School of Computing, University of Nebraska, Lincoln, NE, USA
| | - Huijing Du
- Department of Mathematics, University of Nebraska, Lincoln, NE, USA
| | - Michael A Hollingsworth
- Eppley Institute for Research in Cancer and Allied Diseases, University of Nebraska Medical Center, Omaha, NE, USA.
| | - Chi Zhang
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA.
| |
Collapse
|
90
|
Yee SW, Macdonald CB, Mitrovic D, Zhou X, Koleske ML, Yang J, Buitrago Silva D, Rockefeller Grimes P, Trinidad DD, More SS, Kachuri L, Witte JS, Delemotte L, Giacomini KM, Coyote-Maestas W. The full spectrum of SLC22 OCT1 mutations illuminates the bridge between drug transporter biophysics and pharmacogenomics. Mol Cell 2024; 84:1932-1947.e10. [PMID: 38703769 PMCID: PMC11382353 DOI: 10.1016/j.molcel.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 01/04/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024]
Abstract
Mutations in transporters can impact an individual's response to drugs and cause many diseases. Few variants in transporters have been evaluated for their functional impact. Here, we combine saturation mutagenesis and multi-phenotypic screening to dissect the impact of 11,213 missense single-amino-acid deletions, and synonymous variants across the 554 residues of OCT1, a key liver xenobiotic transporter. By quantifying in parallel expression and substrate uptake, we find that most variants exert their primary effect on protein abundance, a phenotype not commonly measured alongside function. Using our mutagenesis results combined with structure prediction and molecular dynamic simulations, we develop accurate structure-function models of the entire transport cycle, providing biophysical characterization of all known and possible human OCT1 polymorphisms. This work provides a complete functional map of OCT1 variants along with a framework for integrating functional genomics, biophysical modeling, and human genetics to predict variant effects on disease and drug efficacy.
Collapse
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Darko Mitrovic
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Megan L Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jia Yang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Dina Buitrago Silva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Patrick Rockefeller Grimes
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Donovan D Trinidad
- Department of Medicine, Division of Infectious Disease, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Swati S More
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - Lucie Delemotte
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden.
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA; Chan Zuckerberg Biohub, San Francisco, CA 94148, USA.
| |
Collapse
|
91
|
Zhou W, Cuomo ASE, Xue A, Kanai M, Chau G, Krishna C, Xavier RJ, MacArthur DG, Powell JE, Daly MJ, Neale BM. Efficient and accurate mixed model association tool for single-cell eQTL analysis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.15.24307317. [PMID: 38798318 PMCID: PMC11118640 DOI: 10.1101/2024.05.15.24307317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Understanding the genetic basis of gene expression can help us understand the molecular underpinnings of human traits and disease. Expression quantitative trait locus (eQTL) mapping can help in studying this relationship but have been shown to be very cell-type specific, motivating the use of single-cell RNA sequencing and single-cell eQTLs to obtain a more granular view of genetic regulation. Current methods for single-cell eQTL mapping either rely on the "pseudobulk" approach and traditional pipelines for bulk transcriptomics or do not scale well to large datasets. Here, we propose SAIGE-QTL, a robust and scalable tool that can directly map eQTLs using single-cell profiles without needing aggregation at the pseudobulk level. Additionally, SAIGE-QTL allows for testing the effects of less frequent/rare genetic variation through set-based tests, which is traditionally excluded from eQTL mapping studies. We evaluate the performance of SAIGE-QTL on both real and simulated data and demonstrate the improved power for eQTL mapping over existing pipelines.
Collapse
|
92
|
Ohanele C, Peoples JN, Karlstaedt A, Geiger JT, Gayle AD, Ghazal N, Sohani F, Brown ME, Davis ME, Porter GA, Faundez V, Kwong JQ. Mitochondrial citrate carrier SLC25A1 is a dosage-dependent regulator of metabolic reprogramming and morphogenesis in the developing heart. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.22.541833. [PMID: 37292906 PMCID: PMC10245819 DOI: 10.1101/2023.05.22.541833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The developing mammalian heart undergoes an important metabolic shift from glycolysis toward mitochondrial oxidation, such that oxidative phosphorylation defects may present with cardiac abnormalities. Here, we describe a new mechanistic link between mitochondria and cardiac morphogenesis, uncovered by studying mice with systemic loss of the mitochondrial citrate carrier SLC25A1. Slc25a1 null embryos displayed impaired growth, cardiac malformations, and aberrant mitochondrial function. Importantly, Slc25a1 heterozygous embryos, which are overtly indistinguishable from wild type, exhibited an increased frequency of these defects, suggesting Slc25a1 haploinsuffiency and dose-dependent effects. Supporting clinical relevance, we found a near-significant association between ultrarare human pathogenic SLC25A1 variants and pediatric congenital heart disease. Mechanistically, SLC25A1 may link mitochondria to transcriptional regulation of metabolism through epigenetic control of gene expression to promote metabolic remodeling in the developing heart. Collectively, this work positions SLC25A1 as a novel mitochondrial regulator of ventricular morphogenesis and cardiac metabolic maturation and suggests a role in congenital heart disease.
Collapse
|
93
|
Wu K, Wang W, Cheng Q, Xiao D, Li Y, Chen M, Zheng X. Rare MED12L Variants Are Associated with Susceptibility to Guttate Psoriasis in the Han Chinese Population. Dermatology 2024; 240:606-614. [PMID: 38735287 DOI: 10.1159/000538805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Accepted: 04/08/2024] [Indexed: 05/14/2024] Open
Abstract
INTRODUCTION According to the common disease/rare variant hypothesis, it is important to study the role of rare variants in complex diseases. The association of rare variants with psoriasis has been demonstrated, but the association between rare variants and specific clinical subtypes of psoriasis has not been investigated. METHODS Gene-based and gene-level meta-analyses were performed on data extracted from our previous study data sets (2,483 patients with guttate psoriasis and 8,292 patients with non-guttate psoriasis) for genotyping. Then, haplotype analysis was performed for rare loss-of-function variants located in MED12L, and protein function prediction was performed for MED12L. Gene-based analysis at each stage had a moderate significance threshold (p < 0.05). A χ2 test was then conducted on the three potential genes, and the merged gene-based analysis was used to confirm the results. We also conducted association analysis and meta-analysis for functional variants located on the identified gene. RESULTS Through these gene-level analyses, we determined that MED12L is a guttate psoriasis susceptibility gene (p = 9.99 × 10-5), and the single-nucleotide polymorphism with the strongest association was rs199780529 (p_combine = 1 × 10-3, p_meta = 2 × 10-3). CONCLUSIONS In our study, a guttate psoriasis-specific subtype-associated susceptibility gene was confirmed in a Chinese Han population. These findings contribute to a better genetic understanding of different subtypes of psoriasis.
Collapse
Affiliation(s)
- Kejia Wu
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Wanrong Wang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Qianhui Cheng
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Duncheng Xiao
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- Second Clinical Medical College, Anhui Medical University, Hefei, China
| | - Yunxiao Li
- School of Life Science, Shandong University, Qingdao, China
| | - Mengyun Chen
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Xiaodong Zheng
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| |
Collapse
|
94
|
Guo H, Urban AE, Wong WH. Prioritizing disease-related rare variants by integrating gene expression data. RESEARCH SQUARE 2024:rs.3.rs-4355589. [PMID: 38766095 PMCID: PMC11100897 DOI: 10.21203/rs.3.rs-4355589/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Rare variants, comprising a vast majority of human genetic variations, are likely to have more deleterious impact on human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the diseased patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer's disease reveals 16 rare variants within 15 genes with extreme carrier statistics. We also found strong excess of rare variants among the top prioritized genes in diseased patients compared to that in healthy individuals. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.
Collapse
|
95
|
Littleton SH, Trang KB, Volpe CM, Cook K, DeBruyne N, Maguire JA, Weidekamp MA, Hodge KM, Boehm K, Lu S, Chesi A, Bradfield JP, Pippin JA, Anderson SA, Wells AD, Pahl MC, Grant SFA. Variant-to-function analysis of the childhood obesity chr12q13 locus implicates rs7132908 as a causal variant within the 3' UTR of FAIM2. CELL GENOMICS 2024; 4:100556. [PMID: 38697123 PMCID: PMC11099382 DOI: 10.1016/j.xgen.2024.100556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 03/21/2024] [Accepted: 04/08/2024] [Indexed: 05/04/2024]
Abstract
The ch12q13 locus is among the most significant childhood obesity loci identified in genome-wide association studies. This locus resides in a non-coding region within FAIM2; thus, the underlying causal variant(s) presumably influence disease susceptibility via cis-regulation. We implicated rs7132908 as a putative causal variant by leveraging our in-house 3D genomic data and public domain datasets. Using a luciferase reporter assay, we observed allele-specific cis-regulatory activity of the immediate region harboring rs7132908. We generated isogenic human embryonic stem cell lines homozygous for either rs7132908 allele to assess changes in gene expression and chromatin accessibility throughout a differentiation to hypothalamic neurons, a key cell type known to regulate feeding behavior. The rs7132908 obesity risk allele influenced expression of FAIM2 and other genes and decreased the proportion of neurons produced by differentiation. We have functionally validated rs7132908 as a causal obesity variant that temporally regulates nearby effector genes and influences neurodevelopment and survival.
Collapse
Affiliation(s)
- Sheridan H Littleton
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Khanh B Trang
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Christina M Volpe
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kieona Cook
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Psychiatry, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Nicole DeBruyne
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jean Ann Maguire
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Mary Ann Weidekamp
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kenyaita M Hodge
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Keith Boehm
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Sumei Lu
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Alessandra Chesi
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jonathan P Bradfield
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Quantinuum Research LLC, San Diego, CA 92101, USA
| | - James A Pippin
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Stewart A Anderson
- Department of Child and Adolescent Psychiatry, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew D Wells
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Matthew C Pahl
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Struan F A Grant
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Division of Endocrinology and Diabetes, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
| |
Collapse
|
96
|
Mørup SB, Leung P, Reilly C, Sherman BT, Chang W, Milojevic M, Milinkovic A, Liappis A, Borgwardt L, Petoumenos K, Paredes R, Mistry SS, MacPherson CR, Lundgren J, Helleberg M, Reekie J, Murray DD. The association between single-nucleotide polymorphisms within type 1 interferon pathway genes and human immunodeficiency virus type 1 viral load in antiretroviral-naïve participants. AIDS Res Ther 2024; 21:27. [PMID: 38698440 PMCID: PMC11067292 DOI: 10.1186/s12981-024-00610-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 03/29/2024] [Indexed: 05/05/2024] Open
Abstract
BACKGROUND Human genetic contribution to HIV progression remains inadequately explained. The type 1 interferon (IFN) pathway is important for host control of HIV and variation in type 1 IFN genes may contribute to disease progression. This study assessed the impact of variations at the gene and pathway level of type 1 IFN on HIV-1 viral load (VL). METHODS Two cohorts of antiretroviral (ART) naïve participants living with HIV (PLWH) with either early (START) or advanced infection (FIRST) were analysed separately. Type 1 IFN genes (n = 17) and receptor subunits (IFNAR1, IFNAR2) were examined for both cumulated type 1 IFN pathway analysis and individual gene analysis. SKAT-O was applied to detect associations between the genotype and HIV-1 study entry viral load (log10 transformed) as a proxy for set point VL; P-values were corrected using Bonferroni (P < 0.0025). RESULTS The analyses among those with early infection included 2429 individuals from five continents. The median study entry HIV VL was 14,623 (IQR 3460-45100) copies/mL. Across 673 SNPs within 19 type 1 IFN genes, no significant association with study entry VL was detected. Conversely, examining individual genes in START showed a borderline significant association between IFNW1, and study entry VL (P = 0.0025). This significance remained after separate adjustments for age, CD4+ T-cell count, CD4+/CD8+ T-cell ratio and recent infection. When controlling for population structure using linear mixed effects models (LME), in addition to principal components used in the main model, this was no longer significant (p = 0.0244). In subgroup analyses stratified by geographical region, the association between IFNW1 and study entry VL was only observed among African participants, although, the association was not significant when controlling for population structure using LME. Of the 17 SNPs within the IFNW1 region, only rs79876898 (A > G) was associated with study entry VL (p = 0.0020, beta = 0.32; G associated with higher study entry VL than A) in single SNP association analyses. The findings were not reproduced in FIRST participants. CONCLUSION Across 19 type 1 IFN genes, only IFNW1 was associated with HIV-1 study entry VL in a cohort of ART-naïve individuals in early stages of their infection, however, this was no longer significant in sensitivity analyses that controlled for population structures using LME.
Collapse
Affiliation(s)
- Sara Bohnstedt Mørup
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Preston Leung
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Cavan Reilly
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Brad T Sherman
- Laboratory of Human Retrovirology and Immunoinformatics, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Weizhong Chang
- Laboratory of Human Retrovirology and Immunoinformatics, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Maja Milojevic
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Ana Milinkovic
- Chelsea and Westminster Hospital NHS Foundation Trust, London, UK
| | - Angelike Liappis
- Washington DC Veterans Affairs Medical Center and The George Washington University School of Medicine and Health Sciences, Washington, DC, USA
| | - Line Borgwardt
- Center for Genomic Medicine, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Kathy Petoumenos
- Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Roger Paredes
- Department of Infectious Diseases and IrsiCaixa, Hospital Universitari Germans Trias i Pujol, Badalona, Spain
| | - Shweta S Mistry
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Cameron R MacPherson
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Institut Roche, Boulogne-Billancourt, France
| | - Jens Lundgren
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Department of Infectious Diseases, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Marie Helleberg
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Department of Infectious Diseases, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Joanne Reekie
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Daniel D Murray
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark.
| |
Collapse
|
97
|
Bass AJ, Bian S, Wingo AP, Wingo TS, Cutler DJ, Epstein MP. Identifying latent genetic interactions in genome-wide association studies using multiple traits. Genome Med 2024; 16:62. [PMID: 38664839 PMCID: PMC11044415 DOI: 10.1186/s13073-024-01329-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 04/02/2024] [Indexed: 04/28/2024] Open
Abstract
The "missing" heritability of complex traits may be partly explained by genetic variants interacting with other genes or environments that are difficult to specify, observe, and detect. We propose a new kernel-based method called Latent Interaction Testing (LIT) to screen for genetic interactions that leverages pleiotropy from multiple related traits without requiring the interacting variable to be specified or observed. Using simulated data, we demonstrate that LIT increases power to detect latent genetic interactions compared to univariate methods. We then apply LIT to obesity-related traits in the UK Biobank and detect variants with interactive effects near known obesity-related genes (URL: https://CRAN.R-project.org/package=lit ).
Collapse
Affiliation(s)
- Andrew J Bass
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA.
| | - Shijia Bian
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, 30322, USA
| | - Aliza P Wingo
- Department of Psychiatry, Emory University, Atlanta, GA, 30322, USA
| | - Thomas S Wingo
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA
- Department of Neurology, Emory University, Atlanta, GA, 30322, USA
| | - David J Cutler
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA
| | - Michael P Epstein
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA.
| |
Collapse
|
98
|
Cui Y, Ye W, Li JS, Li JJ, Vilain E, Sallam T, Li W. A genome-wide spectrum of tandem repeat expansions in 338,963 humans. Cell 2024; 187:2336-2341.e5. [PMID: 38582080 PMCID: PMC11065452 DOI: 10.1016/j.cell.2024.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 01/23/2024] [Accepted: 03/05/2024] [Indexed: 04/08/2024]
Abstract
The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.
Collapse
Affiliation(s)
- Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA.
| | - Wenbin Ye
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Jason Sheng Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Eric Vilain
- Institute for Clinical and Translational Science, University of California, Irvine, Irvine, CA 92697, USA; Department of Pediatrics, University of California, Irvine, Irvine, CA 92697, USA
| | - Tamer Sallam
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
99
|
Liu M, Su YR, Liu Y, Hsu L, He Q. Structured testing of genetic association with mixed clinical outcomes. Genet Epidemiol 2024; 48:226-237. [PMID: 38606632 PMCID: PMC11470132 DOI: 10.1002/gepi.22560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 02/15/2024] [Accepted: 03/27/2024] [Indexed: 04/13/2024]
Abstract
Genetic factors play a fundamental role in disease development. Studying the genetic association with clinical outcomes is critical for understanding disease biology and devising novel treatment targets. However, the frequencies of genetic variations are often low, making it difficult to examine the variants one-by-one. Moreover, the clinical outcomes are complex, including patients' survival time and other binary or continuous outcomes such as recurrences and lymph node count, and how to effectively analyze genetic association with these outcomes remains unclear. In this article, we proposed a structured test statistic for testing genetic association with mixed types of survival, binary, and continuous outcomes. The structured testing incorporates known biological information of variants while allowing for their heterogeneous effects and is a powerful strategy for analyzing infrequent genetic factors. Simulation studies show that the proposed test statistic has correct type I error and is highly effective in detecting significant genetic variants. We applied our approach to a uterine corpus endometrial carcinoma study and identified several genetic pathways associated with the clinical outcomes.
Collapse
Affiliation(s)
- Meiling Liu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Yu-Ru Su
- Biostatistics Division, Kaiser Permanente Washington Health Research Institute, Seattle, Washington, USA
| | - Yang Liu
- Department of Mathematics and Statistics, Wright State University, Dayton, Ohio, USA
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Qianchuan He
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| |
Collapse
|
100
|
Nagle MF, Yuan J, Kaur D, Ma C, Peremyslova E, Jiang Y, Niño de Rivera A, Jawdy S, Chen JG, Feng K, Yates TB, Tuskan GA, Muchero W, Fuxin L, Strauss SH. GWAS supported by computer vision identifies large numbers of candidate regulators of in planta regeneration in Populus trichocarpa. G3 (BETHESDA, MD.) 2024; 14:jkae026. [PMID: 38325329 PMCID: PMC10989874 DOI: 10.1093/g3journal/jkae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/18/2024] [Accepted: 01/20/2024] [Indexed: 02/09/2024]
Abstract
Plant regeneration is an important dimension of plant propagation and a key step in the production of transgenic plants. However, regeneration capacity varies widely among genotypes and species, the molecular basis of which is largely unknown. Association mapping methods such as genome-wide association studies (GWAS) have long demonstrated abilities to help uncover the genetic basis of trait variation in plants; however, the performance of these methods depends on the accuracy and scale of phenotyping. To enable a large-scale GWAS of in planta callus and shoot regeneration in the model tree Populus, we developed a phenomics workflow involving semantic segmentation to quantify regenerating plant tissues over time. We found that the resulting statistics were of highly non-normal distributions, and thus employed transformations or permutations to avoid violating assumptions of linear models used in GWAS. We report over 200 statistically supported quantitative trait loci (QTLs), with genes encompassing or near to top QTLs including regulators of cell adhesion, stress signaling, and hormone signaling pathways, as well as other diverse functions. Our results encourage models of hormonal signaling during plant regeneration to consider keystone roles of stress-related signaling (e.g. involving jasmonates and salicylic acid), in addition to the auxin and cytokinin pathways commonly considered. The putative regulatory genes and biological processes we identified provide new insights into the biological complexity of plant regeneration, and may serve as new reagents for improving regeneration and transformation of recalcitrant genotypes and species.
Collapse
Affiliation(s)
- Michael F Nagle
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Jialin Yuan
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Damanpreet Kaur
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Cathleen Ma
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Ekaterina Peremyslova
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Yuan Jiang
- Statistics Department, Oregon State University, 239 Weniger Hall, Corvallis, OR 97331, USA
| | - Alexa Niño de Rivera
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Sara Jawdy
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Jin-Gui Chen
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Kai Feng
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Timothy B Yates
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Wellington Muchero
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Li Fuxin
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Steven H Strauss
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| |
Collapse
|