1
|
Huang R, Jin Z, Zhang D, Li L, Zhou J, Xiao L, Li P, Zhang M, Tian C, Zhang W, Zhong L, Quan M, Zhao R, Du L, Liu LJ, Li Z, Zhang D, Du Q. Rare variations within the serine/arginine-rich splicing factor PtoRSZ21 modulate stomatal size to determine drought tolerance in Populus. THE NEW PHYTOLOGIST 2024; 243:1776-1794. [PMID: 38978318 DOI: 10.1111/nph.19934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 06/13/2024] [Indexed: 07/10/2024]
Abstract
Rare variants contribute significantly to the 'missing heritability' of quantitative traits. The genome-wide characteristics of rare variants and their roles in environmental adaptation of woody plants remain unexplored. Utilizing genome-wide rare variant association study (RVAS), expression quantitative trait loci (eQTL) mapping, genetic transformation, and molecular experiments, we explored the impact of rare variants on stomatal morphology and drought adaptation in Populus. Through comparative analysis of five world-wide Populus species, we observed the influence of mutational bias and adaptive selection on the distribution of rare variants. RVAS identified 75 candidate genes correlated with stomatal size (SS)/stomatal density (SD), and a rare haplotype in the promoter of serine/arginine-rich splicing factor PtoRSZ21 emerged as the foremost association signal governing SS. As a positive regulator of drought tolerance, PtoRSZ21 can recruit the core splicing factor PtoU1-70K to regulate alternative splicing (AS) of PtoATG2b (autophagy-related 2). The rare haplotype PtoRSZ21hap2 weakens binding affinity to PtoMYB61, consequently affecting PtoRSZ21 expression and SS, ultimately resulting in differential distribution of Populus accessions in arid and humid climates. This study enhances the understanding of regulatory mechanisms that underlie AS induced by rare variants and might provide targets for drought-tolerant varieties breeding in Populus.
Collapse
Affiliation(s)
- Rui Huang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Zhuoying Jin
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Donghai Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Lianzheng Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Jiaxuan Zhou
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Liang Xiao
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Peng Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Mengjiao Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Chongde Tian
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Wenke Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Leishi Zhong
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Mingyang Quan
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Rui Zhao
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Liang Du
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Li-Jun Liu
- College of Forestry, State Forestry and Grassland Administration Key Laboratory of Silviculture in Downstream Areas of the Yellow River, Shandong Agriculture University, Taian, Shandong, 271018, China
| | - Zhonghai Li
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Deqiang Zhang
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| | - Qingzhang Du
- State Key Laboratory of Tree Genetics and Breeding, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
- Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education, College of Biological Sciences and Technology, Beijing Forestry University, No. 35, Qinghua East Road, Beijing, 100083, China
| |
Collapse
|
2
|
Shao M, Tian M, Chen K, Jiang H, Zhang S, Li Z, Shen Y, Chen F, Shen B, Cao C, Gu N. Leveraging Random Effects in Cistrome-Wide Association Studies for Decoding the Genetic Determinants of Prostate Cancer. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024:e2400815. [PMID: 39099406 DOI: 10.1002/advs.202400815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 07/09/2024] [Indexed: 08/06/2024]
Abstract
Cistrome-wide association studies (CWAS) are pivotal for identifying genetic determinants of diseases by correlating genetically regulated cistrome states with phenotypes. Traditional CWAS typically develops a model based on cistrome and genotype data to associate predicted cistrome states with phenotypes. The random effect cistrome-wide association study (RECWAS), reevaluates the necessity of cistrome state prediction in CWAS. RECWAS utilizes either a linear model or marginal effect for initial feature selection, followed by kernel-based feature aggregation for association testing is introduced. Through simulations and analysis of prostate cancer data, a thorough evaluation of CWAS and RECWAS is conducted. The results suggest that RECWAS offers improved power compared to traditional CWAS, identifying additional genomic regions associated with prostate cancer. CWAS identified 102 significant regions, while RECWAS found 50 additional significant regions compared to CWAS, many of which are validated. Validation encompassed a range of biological evidence, including risk signals from the GWAS catalog, susceptibility genes from the DisGeNET database, and enhancer-domain scores. RECWAS consistently demonstrated improved performance over traditional CWAS in identifying genomic regions associated with prostate cancer. These findings demonstrate the benefits of incorporating kernel methods into CWAS and provide new insights for genetic discovery in complex diseases.
Collapse
Affiliation(s)
- Mengting Shao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Min Tian
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Kaiyang Chen
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Hangjin Jiang
- Center for Data Science, Zhejiang University, Hangzhou, 310058, P. R. China
| | - Shuting Zhang
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Zhenghui Li
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Yan Shen
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Feng Chen
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
| | - Baixin Shen
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, 210011, P. R. China
| | - Chen Cao
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
- Department of Urology, The Second Affiliated Hospital of Nanjing Medical University, Nanjing, 210011, P. R. China
| | - Ning Gu
- Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China
- Nanjing Key Laboratory for Cardiovascular Information and Health Engineering Medicine, Institute of Clinical Medicine, Nanjing Drum Tower Hospital, Medical School, Nanjing University, Nanjing, 210093, P. R. China
| |
Collapse
|
3
|
Yu Z, Coorens THH, Uddin MM, Ardlie KG, Lennon N, Natarajan P. Genetic variation across and within individuals. Nat Rev Genet 2024; 25:548-562. [PMID: 38548833 DOI: 10.1038/s41576-024-00709-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2024] [Indexed: 04/12/2024]
Abstract
Germline variation and somatic mutation are intricately connected and together shape human traits and disease risks. Germline variants are present from conception, but they vary between individuals and accumulate over generations. By contrast, somatic mutations accumulate throughout life in a mosaic manner within an individual due to intrinsic and extrinsic sources of mutations and selection pressures acting on cells. Recent advancements, such as improved detection methods and increased resources for association studies, have drastically expanded our ability to investigate germline and somatic genetic variation and compare underlying mutational processes. A better understanding of the similarities and differences in the types, rates and patterns of germline and somatic variants, as well as their interplay, will help elucidate the mechanisms underlying their distinct yet interlinked roles in human health and biology.
Collapse
Affiliation(s)
- Zhi Yu
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | | | - Md Mesbah Uddin
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | | | - Niall Lennon
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Pradeep Natarajan
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Cardiovascular Research Center and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
4
|
Zhao G, Le Y, Sun M, Xu J, Qin Y, Men S, Ye Z, Tan H, Hu H, You J, Li J, Jin S, Wang M, Zhang X, Lin Z, Tu L. A dominant negative mutation of GhMYB25-like alters cotton fiber initiation, reducing lint and fuzz. THE PLANT CELL 2024; 36:2759-2777. [PMID: 38447960 PMCID: PMC11289660 DOI: 10.1093/plcell/koae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 11/09/2023] [Accepted: 12/11/2023] [Indexed: 03/08/2024]
Abstract
Cotton (Gossypium hirsutum) fibers, vital natural textile materials, are single-cell trichomes that differentiate from the ovule epidermis. These fibers are categorized as lint (longer fibers useful for spinning) or fuzz (shorter, less useful fibers). Currently, developing cotton varieties with high lint yield but without fuzz remains challenging due to our limited knowledge of the molecular mechanisms underlying fiber initiation. This study presents the identification and characterization of a naturally occurring dominant negative mutation GhMYB25-like_AthapT, which results in a reduced lint and fuzzless phenotype. The GhMYB25-like_AthapT protein exerts its dominant negative effect by suppressing the activity of GhMYB25-like during lint and fuzz initiation. Intriguingly, the negative effect of GhMYB25-like_AthapT could be alleviated by high expression levels of GhMYB25-like. We also uncovered the role of GhMYB25-like in regulating the expression of key genes such as GhPDF2 (PROTODERMAL FACTOR 2), CYCD3; 1 (CYCLIN D3; 1), and PLD (Phospholipase D), establishing its significance as a pivotal transcription factor in fiber initiation. We identified other genes within this regulatory network, expanding our understanding of the determinants of fiber cell fate. These findings offer valuable insights for cotton breeding and contribute to our fundamental understanding of fiber development.
Collapse
Affiliation(s)
- Guannan Zhao
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Yu Le
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Mengling Sun
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jiawen Xu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Yuan Qin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - She Men
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Zhengxiu Ye
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Haozhe Tan
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Haiyan Hu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jiaqi You
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Jianying Li
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Shuangxia Jin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Maojun Wang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Xianlong Zhang
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Zhongxu Lin
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| | - Lili Tu
- National Key Laboratory of Crop Genetic Improvement, Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, Hubei Province 430070, China
| |
Collapse
|
5
|
Aparo A, Bonnici V, Avesani S, Cascione L, Giugno R. DiGAS: Differential gene allele spectrum as a descriptor in genetic studies. Comput Biol Med 2024; 179:108924. [PMID: 39067286 DOI: 10.1016/j.compbiomed.2024.108924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 07/16/2024] [Accepted: 07/17/2024] [Indexed: 07/30/2024]
Abstract
Diagnosing individuals with complex genetic diseases is a challenging task. Computational methodologies exploit information at the genotype level by taking into account single nucleotide polymorphisms (SNPs) leveraging the results of genome-wide association studies analysis to assign a statistical significance to each SNP. Recent methodologies extend such an approach by aggregating SNP significance at the genetic level to identify genes that are related to the condition under study. However, such methodologies still suffer from the initial SNP analysis limitations. Here, we present DiGAS, a tool for diagnosing genetic conditions by computing significance, by means of SNP information, directly at the complex level of genetic regions. Such an approach is based on a generalized notion of allele spectrum, which evaluates the complete genetic alterations of the SNP set belonging to a genetic region at the population level. The statistical significance of a region is then evaluated through a differential allele spectrum analysis between the conditions of individuals belonging to the population. Tests, performed on well-established datasets regarding Alzheimer's disease, show that DiGAS outperforms the state of the art in distinguishing between sick and healthy subjects.
Collapse
Affiliation(s)
- Antonino Aparo
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy; Research Center LURM (Interdepartmental Laboratory of Medical Research), University of Verona, Ple. L.A. Scuro 10, Verona, 37134, Italy
| | - Vincenzo Bonnici
- University of Parma, Parco Area delle Scienze, 53/A, Parma, 43124, Italy
| | - Simone Avesani
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy
| | - Luciano Cascione
- Institute of Oncology Research (IOR), Via Francesco Chiesa 5, Bellinzona, 6500, Switzerland
| | - Rosalba Giugno
- University of Verona, Strada le Grazie, 15, Verona, 37134, Italy.
| |
Collapse
|
6
|
Cirulli ET, Schiabor Barrett KM, Bolze A, Judge DP, Pawloski PA, Grzymski JJ, Lee W, Washington NL. A power-based sliding window approach to evaluate the clinical impact of rare genetic variants in the nucleotide sequence or the spatial position of the folded protein. HGG ADVANCES 2024; 5:100284. [PMID: 38509709 PMCID: PMC11004801 DOI: 10.1016/j.xhgg.2024.100284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 03/14/2024] [Accepted: 03/15/2024] [Indexed: 03/22/2024] Open
Abstract
Systematic determination of novel variant pathogenicity remains a major challenge, even when there is an established association between a gene and phenotype. Here we present Power Window (PW), a sliding window technique that identifies the impactful regions of a gene using population-scale clinico-genomic datasets. By sizing analysis windows on the number of variant carriers, rather than the number of variants or nucleotides, statistical power is held constant, enabling the localization of clinical phenotypes and removal of unassociated gene regions. The windows can be built by sliding across either the nucleotide sequence of the gene (through 1D space) or the positions of the amino acids in the folded protein (through 3D space). Using a training set of 350k exomes from the UK Biobank (UKB), we developed PW models for well-established gene-disease associations and tested their accuracy in two independent cohorts (117k UKB exomes and 65k exomes sequenced at Helix in the Healthy Nevada Project, myGenetics, or In Our DNA SC studies). The significant models retained a median of 49% of the qualifying variant carriers in each gene (range 2%-98%), with quantitative traits showing a median effect size improvement of 66% compared with aggregating variants across the entire gene, and binary traits' odds ratios improving by a median of 2.2-fold. PW showcases that electronic health record-based statistical analyses can accurately distinguish between novel coding variants in established genes that will have high phenotypic penetrance and those that will not, unlocking new potential for human genomics research, drug development, variant interpretation, and precision medicine.
Collapse
Affiliation(s)
| | | | - Alexandre Bolze
- Helix, 101 S Ellsworth Ave Suite 350, San Mateo, CA 94401, USA
| | - Daniel P Judge
- Division of Cardiology, Medical University of South Carolina, 30 Courtenay Drive, MSC 592, Charleston, SC 29425, USA
| | | | - Joseph J Grzymski
- University of Nevada, 2215 Raggio Pkwy, Reno, NV 89512, USA; Renown Institute for Health Innovation, Reno, NV 89512, USA
| | - William Lee
- Helix, 101 S Ellsworth Ave Suite 350, San Mateo, CA 94401, USA
| | | |
Collapse
|
7
|
Akinbiyi T, McPeek MS, Abney M. ADELLE: A global testing method for Trans-eQTL mapping. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.24.581871. [PMID: 38464248 PMCID: PMC10925110 DOI: 10.1101/2024.02.24.581871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Understanding the genetic regulatory mechanisms of gene expression is a challenging and ongoing problem. Genetic variants that are associated with expression levels are readily identified when they are proximal to the gene (i.e., cis-eQTLs), but SNPs distant from the gene whose expression levels they are associated with (i.e., trans-eQTLs) have been much more difficult to discover, even though they account for a majority of the heritability in gene expression levels. A major impediment to the identification of more trans-eQTLs is the lack of statistical methods that are powerful enough to overcome the obstacles of small effect sizes and large multiple testing burden of trans-eQTL mapping. Here, we propose ADELLE, a powerful statistical testing framework that requires only summary statistics and is designed to be most sensitive to SNPs that are associated with multiple gene expression levels, a characteristic of many trans-eQTLs. In simulations, we show that for detecting SNPs that are associated with 0.1%-2% of 10,000 traits, among the 7 methods we consider ADELLE is clearly the most powerful overall, with either the highest power or power not significantly different from the highest for all settings in that range. We apply ADELLE to a mouse advanced intercross line data set and show its ability to find trans-eQTLs that were not significant under a standard analysis. This demonstrates that ADELLE is a powerful tool at uncovering trans regulators of genetic expression.
Collapse
|
8
|
Yang L, Ou YN, Wu BS, Liu WS, Deng YT, He XY, Chen YL, Kang J, Fei CJ, Zhu Y, Tan L, Dong Q, Feng J, Cheng W, Yu JT. Large-scale whole-exome sequencing analyses identified protein-coding variants associated with immune-mediated diseases in 350,770 adults. Nat Commun 2024; 15:5924. [PMID: 39009607 PMCID: PMC11250857 DOI: 10.1038/s41467-024-49782-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 06/18/2024] [Indexed: 07/17/2024] Open
Abstract
The genetic contribution of protein-coding variants to immune-mediated diseases (IMDs) remains underexplored. Through whole exome sequencing of 40 IMDs in 350,770 UK Biobank participants, we identified 162 unique genes in 35 IMDs, among which 124 were novel genes. Several genes, including FLG which is associated with atopic dermatitis and asthma, showed converging evidence from both rare and common variants. 91 genes exerted significant effects on longitudinal outcomes (interquartile range of Hazard Ratio: 1.12-5.89). Mendelian randomization identified five causal genes, of which four were approved drug targets (CDSN, DDR1, LTA, and IL18BP). Proteomic analysis indicated that mutations associated with specific IMDs might also affect protein expression in other IMDs. For example, DXO (celiac disease-related gene) and PSMB9 (alopecia areata-related gene) could modulate CDSN (autoimmune hypothyroidism-, psoriasis-, asthma-, and Graves' disease-related gene) expression. Identified genes predominantly impact immune and biochemical processes, and can be clustered into pathways of immune-related, urate metabolism, and antigen processing. Our findings identified protein-coding variants which are the key to IMDs pathogenesis and provided new insights into tailored innovative therapies.
Collapse
Affiliation(s)
- Liu Yang
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Ya-Nan Ou
- Department of Neurology, Qingdao Municipal Hospital, Qingdao University, Qingdao, 266071, China
| | - Bang-Sheng Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Wei-Shi Liu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Yue-Ting Deng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Xiao-Yu He
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Yi-Lin Chen
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Jujiao Kang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Chen-Jie Fei
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Ying Zhu
- State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, Fudan University, Shanghai, 200032, China
| | - Lan Tan
- Department of Neurology, Qingdao Municipal Hospital, Qingdao University, Qingdao, 266071, China
| | - Qiang Dong
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
| | - Jianfeng Feng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Wei Cheng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200443, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
| | - Jin-Tai Yu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
9
|
Mbatchou J, McPeek MS. JASPER: Fast, powerful, multitrait association testing in structured samples gives insight on pleiotropy in gene expression. Am J Hum Genet 2024:S0002-9297(24)00216-7. [PMID: 39025064 DOI: 10.1016/j.ajhg.2024.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 06/19/2024] [Accepted: 06/20/2024] [Indexed: 07/20/2024] Open
Abstract
Joint association analysis of multiple traits with multiple genetic variants can provide insight into genetic architecture and pleiotropy, improve trait prediction, and increase power for detecting association. Furthermore, some traits are naturally high-dimensional, e.g., images, networks, or longitudinally measured traits. Assessing significance for multitrait genetic association can be challenging, especially when the sample has population sub-structure and/or related individuals. Failure to adequately adjust for sample structure can lead to power loss and inflated type 1 error, and commonly used methods for assessing significance can work poorly with a large number of traits or be computationally slow. We developed JASPER, a fast, powerful, robust method for assessing significance of multitrait association with a set of genetic variants, in samples that have population sub-structure, admixture, and/or relatedness. In simulations, JASPER has higher power, better type 1 error control, and faster computation than existing methods, with the power and speed advantage of JASPER increasing with the number of traits. JASPER is potentially applicable to a wide range of association testing applications, including for multiple disease traits, expression traits, image-derived traits, and microbiome abundances. It allows for covariates, ascertainment, and rare variants and is robust to phenotype model misspecification. We apply JASPER to analyze gene expression in the Framingham Heart Study, where, compared to alternative approaches, JASPER finds more significant associations, including several that indicate pleiotropic effects, most of which replicate previous results, while others have not previously been reported. Our results demonstrate the promise of JASPER for powerful multitrait analysis in structured samples.
Collapse
Affiliation(s)
- Joelle Mbatchou
- Regeneron Genetics Center, Tarrytown, NY 10591, USA; Department of Statistics, The University of Chicago, Chicago, IL 60637, USA
| | - Mary Sara McPeek
- Department of Statistics, The University of Chicago, Chicago, IL 60637, USA; Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA.
| |
Collapse
|
10
|
Khan A, Unlu G, Lin P, Liu Y, Kilic E, Kenny TC, Birsoy K, Gamazon ER. Metabolic gene function discovery platform GeneMAP identifies SLC25A48 as necessary for mitochondrial choline import. Nat Genet 2024:10.1038/s41588-024-01827-2. [PMID: 38977856 DOI: 10.1038/s41588-024-01827-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Accepted: 06/10/2024] [Indexed: 07/10/2024]
Abstract
Organisms maintain metabolic homeostasis through the combined functions of small-molecule transporters and enzymes. While many metabolic components have been well established, a substantial number remains without identified physiological substrates. To bridge this gap, we have leveraged large-scale plasma metabolome genome-wide association studies (GWAS) to develop a multiomic Gene-Metabolite Association Prediction (GeneMAP) discovery platform. GeneMAP can generate accurate predictions and even pinpoint genes that are distant from the variants implicated by GWAS. In particular, our analysis identified solute carrier family 25 member 48 (SLC25A48) as a genetic determinant of plasma choline levels. Mechanistically, SLC25A48 loss strongly impairs mitochondrial choline import and synthesis of its downstream metabolite betaine. Integrative rare variant and polygenic score analyses in UK Biobank provide strong evidence that the SLC25A48 causal effects on human disease may in part be mediated by the effects of choline. Altogether, our study provides a discovery platform for metabolic gene function and proposes SLC25A48 as a mitochondrial choline transporter.
Collapse
Affiliation(s)
- Artem Khan
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Gokhan Unlu
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Phillip Lin
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Yuyang Liu
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Ece Kilic
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Timothy C Kenny
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA
| | - Kıvanç Birsoy
- Laboratory of Metabolic Regulation and Genetics, The Rockefeller University, New York, NY, USA.
| | - Eric R Gamazon
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
11
|
Boedijono FS, Bood V, Eichhorn IA, Hansbro PM, Slebos DJ, van den Berge M, Faiz A, Pouwels SD. Identification of Genetic Factors Associated With DAMP Release in COPD Patients. Arch Bronconeumol 2024:S0300-2896(24)00245-X. [PMID: 39034199 DOI: 10.1016/j.arbres.2024.06.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 06/07/2024] [Accepted: 06/28/2024] [Indexed: 07/23/2024]
Affiliation(s)
- Fia Sabrina Boedijono
- Respiratory Bioinformatics and Molecular Biology Group, University of Technology Sydney, Australia; Centre for Inflammation, Centenary Institute and University of Technology Sydney, Faculty of Science, School of Life Sciences, Sydney, Australia
| | - Verena Bood
- Department of Pulmonary Diseases, University Medical Center Groningen, The Netherlands; Department of Pathology and Medical Biology, University Medical Center Groningen, Groningen, The Netherlands; GRIAC Research Institute, University of Groningen, Groningen, The Netherlands
| | - Ilse A Eichhorn
- Department of Pulmonary Diseases, University Medical Center Groningen, The Netherlands; Department of Pathology and Medical Biology, University Medical Center Groningen, Groningen, The Netherlands; GRIAC Research Institute, University of Groningen, Groningen, The Netherlands
| | - Philip M Hansbro
- Centre for Inflammation, Centenary Institute and University of Technology Sydney, Faculty of Science, School of Life Sciences, Sydney, Australia
| | - Dirk-Jan Slebos
- Department of Pulmonary Diseases, University Medical Center Groningen, The Netherlands; GRIAC Research Institute, University of Groningen, Groningen, The Netherlands
| | - Maarten van den Berge
- Department of Pulmonary Diseases, University Medical Center Groningen, The Netherlands; GRIAC Research Institute, University of Groningen, Groningen, The Netherlands
| | - Alen Faiz
- Respiratory Bioinformatics and Molecular Biology Group, University of Technology Sydney, Australia
| | - Simon D Pouwels
- Department of Pulmonary Diseases, University Medical Center Groningen, The Netherlands; Department of Pathology and Medical Biology, University Medical Center Groningen, Groningen, The Netherlands; GRIAC Research Institute, University of Groningen, Groningen, The Netherlands.
| |
Collapse
|
12
|
Hop PJ, Lai D, Keagle PJ, Baron DM, Kenna BJ, Kooyman M, Shankaracharya, Halter C, Straniero L, Asselta R, Bonvegna S, Soto-Beasley AI, Wszolek ZK, Uitti RJ, Isaias IU, Pezzoli G, Ticozzi N, Ross OA, Veldink JH, Foroud TM, Kenna KP, Landers JE. Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson's disease. Nat Genet 2024; 56:1371-1376. [PMID: 38858457 PMCID: PMC11250361 DOI: 10.1038/s41588-024-01787-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 05/06/2024] [Indexed: 06/12/2024]
Abstract
Despite substantial progress, causal variants are identified only for a minority of familial Parkinson's disease (PD) cases, leaving high-risk pathogenic variants unidentified1,2. To identify such variants, we uniformly processed exome sequencing data of 2,184 index familial PD cases and 69,775 controls. Exome-wide analyses converged on RAB32 as a novel PD gene identifying c.213C > G/p.S71R as a high-risk variant presenting in ~0.7% of familial PD cases while observed in only 0.004% of controls (odds ratio of 65.5). This variant was confirmed in all cases via Sanger sequencing and segregated with PD in three families. RAB32 encodes a small GTPase known to interact with LRRK2 (refs. 3,4). Functional analyses showed that RAB32 S71R increases LRRK2 kinase activity, as indicated by increased autophosphorylation of LRRK2 S1292. Here our results implicate mutant RAB32 in a key pathological mechanism in PD-LRRK2 kinase activity5-7-and thus provide novel insights into the mechanistic connections between RAB family biology, LRRK2 and PD risk.
Collapse
Affiliation(s)
- Paul J Hop
- Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Dongbing Lai
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Pamela J Keagle
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Desiree M Baron
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Brendan J Kenna
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Maarten Kooyman
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Shankaracharya
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
| | - Cheryl Halter
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Letizia Straniero
- Department of Biomedical Sciences, Humanitas University, Milan, Italy
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | - Rosanna Asselta
- Department of Biomedical Sciences, Humanitas University, Milan, Italy
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
| | | | | | | | - Ryan J Uitti
- Department of Neurology, Mayo Clinic, Jacksonville, FL, USA
| | - Ioannis Ugo Isaias
- Parkinson Institute, ASST Gaetano Pini-CTO, Milan, Italy
- Department of Neurology, University Hospital of Würzburg and Julius Maximilian University of Würzburg, Würzburg, Germany
| | - Gianni Pezzoli
- Parkinson Institute, ASST Gaetano Pini-CTO, Milan, Italy
- Fondazione Grigioni per il Morbo di Parkinson, Milan, Italy
| | - Nicola Ticozzi
- Department of Neurology-Stroke Unit and Laboratory of Neuroscience, Istituto Auxologico Italiano IRCCS, Milan, Italy
- Department of Pathophysiology and Transplantation, 'Dino Ferrari' Center, Università degli Studi di Milano, Milan, Italy
| | - Owen A Ross
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Department of Clinical Genomics, Mayo Clinic, Jacksonville, FL, USA
| | - Jan H Veldink
- Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Tatiana M Foroud
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kevin P Kenna
- Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - John E Landers
- Department of Neurology, UMass Chan Medical School, Worcester, MA, USA.
| |
Collapse
|
13
|
Yaacov O, Mathiyalagan P, Berk-Rauch HE, Ganesh SK, Zhu L, Hoffmann TJ, Iribarren C, Risch N, Lee D, Chakravarti A. Identification of the Molecular Components of Enhancer-Mediated Gene Expression Variation in Multiple Tissues Regulating Blood Pressure. Hypertension 2024; 81:1500-1510. [PMID: 38747164 PMCID: PMC11168860 DOI: 10.1161/hypertensionaha.123.22538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 04/24/2024] [Indexed: 06/14/2024]
Abstract
BACKGROUND Inter-individual variation in blood pressure (BP) arises in part from sequence variants within enhancers modulating the expression of causal genes. We propose that these genes, active in tissues relevant to BP physiology, can be identified from tissue-level epigenomic data and genotypes of BP-phenotyped individuals. METHODS We used chromatin accessibility data from the heart, adrenal, kidney, and artery to identify cis-regulatory elements (CREs) in these tissues and estimate the impact of common human single-nucleotide variants within these CREs on gene expression, using machine learning methods. To identify causal genes, we performed a gene-wise association test. We conducted analyses in 2 separate large-scale cohorts: 77 822 individuals from the Genetic Epidemiology Research on Adult Health and Aging and 315 270 individuals from the UK Biobank. RESULTS We identified 309, 259, 331, and 367 genes (false discovery rate <0.05) for diastolic BP and 191, 184, 204, and 204 genes for systolic BP in the artery, kidney, heart, and adrenal, respectively, in Genetic Epidemiology Research on Adult Health and Aging; 50% to 70% of these genes were replicated in the UK Biobank, significantly higher than the 12% to 15% expected by chance (P<0.0001). These results enabled tissue expression prediction of these 988 to 2875 putative BP genes in individuals of both cohorts to construct an expression polygenic score. This score explained ≈27% of the reported single-nucleotide variant heritability, substantially higher than expected from prior studies. CONCLUSIONS Our work demonstrates the power of tissue-restricted comprehensive CRE analysis, followed by CRE-based expression prediction, for understanding BP regulation in relevant tissues and provides dual-modality supporting evidence, CRE and expression, for the causality genes.
Collapse
Affiliation(s)
- Or Yaacov
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Prabhu Mathiyalagan
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
- Benthos Prime Central, Houston, TX, USA
| | - Hanna E. Berk-Rauch
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Santhi K. Ganesh
- Department of Internal Medicine & Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Luke Zhu
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| | - Thomas J. Hoffmann
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Carlos Iribarren
- Kaiser Permanente Northern California Division of Research, Oakland, CA, USA
| | - Neil Risch
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Kaiser Permanente Northern California Division of Research, Oakland, CA, USA
| | - Dongwon Lee
- Department of Pediatrics, Division of Nephrology, Boston Children’s Hospital, Boston & Harvard Medical School, Boston, MA, USA
| | - Aravinda Chakravarti
- Center for Human Genetics and Genomics, NYU Grossman School of Medicine, New York, NY, USA
| |
Collapse
|
14
|
Petrazzini BO, Forrest IS, Rocheleau G, Vy HMT, Márquez-Luna C, Duffy Á, Chen R, Park JK, Gibson K, Goonewardena SN, Malick WA, Rosenson RS, Jordan DM, Do R. Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease. Nat Genet 2024; 56:1412-1419. [PMID: 38862854 DOI: 10.1038/s41588-024-01791-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 05/08/2024] [Indexed: 06/13/2024]
Abstract
Coronary artery disease (CAD) exists on a spectrum of disease represented by a combination of risk factors and pathogenic processes. An in silico score for CAD built using machine learning and clinical data in electronic health records captures disease progression, severity and underdiagnosis on this spectrum and could enhance genetic discovery efforts for CAD. Here we tested associations of rare and ultrarare coding variants with the in silico score for CAD in the UK Biobank, All of Us Research Program and BioMe Biobank. We identified associations in 17 genes; of these, 14 show at least moderate levels of prior genetic, biological and/or clinical support for CAD. We also observed an excess of ultrarare coding variants in 321 aggregated CAD genes, suggesting more ultrarare variant associations await discovery. These results expand our understanding of the genetic etiology of CAD and illustrate how digital markers can enhance genetic association investigations for complex diseases.
Collapse
Affiliation(s)
- Ben Omega Petrazzini
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Iain S Forrest
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ghislain Rocheleau
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ha My T Vy
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Carla Márquez-Luna
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Áine Duffy
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert Chen
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Joshua K Park
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Medical Scientist Training Program, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Kyle Gibson
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sascha N Goonewardena
- Division of Cardiovascular Medicine, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Division of Cardiovascular Medicine, VA Ann Arbor Health System, Ann Arbor, MI, USA
| | - Waqas A Malick
- Metabolism and Lipids Program, Mount Sinai Fuster Heart Hospital, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Robert S Rosenson
- Metabolism and Lipids Program, Mount Sinai Fuster Heart Hospital, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Daniel M Jordan
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ron Do
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Center for Genomic Data Analytics, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
15
|
Pattillo Smith S, Darnell G, Udwin D, Stamp J, Harpak A, Ramachandran S, Crawford L. Discovering non-additive heritability using additive GWAS summary statistics. eLife 2024; 13:e90459. [PMID: 38913556 PMCID: PMC11196113 DOI: 10.7554/elife.90459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Accepted: 04/22/2024] [Indexed: 06/26/2024] Open
Abstract
LD score regression (LDSC) is a method to estimate narrow-sense heritability from genome-wide association study (GWAS) summary statistics alone, making it a fast and popular approach. In this work, we present interaction-LD score (i-LDSC) regression: an extension of the original LDSC framework that accounts for interactions between genetic variants. By studying a wide range of generative models in simulations, and by re-analyzing 25 well-studied quantitative phenotypes from 349,468 individuals in the UK Biobank and up to 159,095 individuals in BioBank Japan, we show that the inclusion of a cis-interaction score (i.e. interactions between a focal variant and proximal variants) recovers genetic variance that is not captured by LDSC. For each of the 25 traits analyzed in the UK Biobank and BioBank Japan, i-LDSC detects additional variation contributed by genetic interactions. The i-LDSC software and its application to these biobanks represent a step towards resolving further genetic contributions of sources of non-additive genetic effects to complex trait variation.
Collapse
Affiliation(s)
- Samuel Pattillo Smith
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Ecology and Evolutionary Biology, Brown UniversityProvidenceUnited States
- Department of Integrative Biology, The University of Texas at AustinAustinUnited States
- Department of Population Health, The University of Texas at AustinAustinUnited States
| | - Gregory Darnell
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Institute for Computational and Experimental Research in Mathematics, Brown UniversityProvidenceUnited States
| | - Dana Udwin
- Department of Biostatistics, Brown UniversityProvidenceUnited States
| | - Julian Stamp
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
| | - Arbel Harpak
- Department of Integrative Biology, The University of Texas at AustinAustinUnited States
- Department of Population Health, The University of Texas at AustinAustinUnited States
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Ecology and Evolutionary Biology, Brown UniversityProvidenceUnited States
- Data Science Institute, Brown UniversityProvidenceUnited States
| | - Lorin Crawford
- Center for Computational Molecular Biology, Brown UniversityProvidenceUnited States
- Department of Biostatistics, Brown UniversityProvidenceUnited States
- MicrosoftCambridgeUnited States
| |
Collapse
|
16
|
Choi J, Xu Z, Sun R. Variance-components tests for genetic association with multiple interval-censored outcomes. Stat Med 2024; 43:2560-2574. [PMID: 38636557 PMCID: PMC11116038 DOI: 10.1002/sim.10081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 02/18/2024] [Accepted: 04/02/2024] [Indexed: 04/20/2024]
Abstract
Massive genetic compendiums such as the UK Biobank have become an invaluable resource for identifying genetic variants that are associated with complex diseases. Due to the difficulties of massive data collection, a common practice of these compendiums is to collect interval-censored data. One challenge in analyzing such data is the lack of methodology available for genetic association studies with interval-censored data. Genetic effects are difficult to detect because of their rare and weak nature, and often the time-to-event outcomes are transformed to binary phenotypes for access to more powerful signal detection approaches. However transforming the data to binary outcomes can result in loss of valuable information. To alleviate such challenges, this work develops methodology to associate genetic variant sets with multiple interval-censored outcomes. Testing sets of variants such as genes or pathways is a common approach in genetic association settings to lower the multiple testing burden, aggregate small effects, and improve interpretations of results. Instead of performing inference with only a single outcome, utilizing multiple outcomes can increase statistical power by aggregating information across multiple correlated phenotypes. Simulations show that the proposed strategy can offer significant power gains over a single outcome approach. We apply the proposed test to the investigation that motivated this study, a search for the genes that perturb risks of bone fractures and falls in the UK Biobank.
Collapse
Affiliation(s)
- Jaihee Choi
- Department of Statistics, Rice University, Texas, USA
| | - Zhichao Xu
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Texas, USA
| | - Ryan Sun
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Texas, USA
| |
Collapse
|
17
|
Zhong Y, Tubbs JD, Leung PBM, Zhan N, Hui TCK, Ho KKY, Hung KSY, Cheung EFC, So HC, Lui SSY, Sham PC. Whole-exome sequencing in a Chinese sample provides preliminary evidence for the link between rare/low-frequency immune-related variants and early-onset schizophrenia. Asian J Psychiatr 2024; 96:104046. [PMID: 38663229 DOI: 10.1016/j.ajp.2024.104046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 04/06/2024] [Indexed: 06/01/2024]
Abstract
Rare and low-frequency variants contribute to schizophrenia (SCZ), and may influence its age-at-onset (AAO). We examined the association of rare or low-frequency deleterious coding variants in Chinese patients with SCZ. We collected DNA samples in 197 patients with SCZ spectrum disorder and 82 healthy controls (HC), and performed exome sequencing. The AAO variable was ascertained in the majority of SCZ participants for identify the early-onset (EOS, AAO<=18) and adult-onset (AOS, AAO>18) subgroups. We examined the overall association of rare/low-frequency, damaging variants in SCZ versus HC, EOS versus HC, and AOS versus HC at the gene and gene-set levels using Sequence Kernel Association Test. The quantitative rare-variant association test of AAO was conducted. Resampling was used to obtain empirical p-values and to control for family-wise error rate (FWER). In binary-trait association tests, we identified 5 potential candidate risk genes and 10 gene ontology biological processes (GOBP) terms, among which PADI2 reached FWER-adjusted significance. In quantitative rare-variant association tests, we found marginally significant correlations of AAO with alterations in 4 candidate risk genes, and 5 GOBP pathways. Together, the biological and functional profiles of these genes and gene sets supported the involvement of perturbations of neural systems in SCZ, and altered immune functions in EOS.
Collapse
Affiliation(s)
- Yuanxin Zhong
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Justin D Tubbs
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Department of Psychiatry, Harvard Medical School, Boston, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Perry B M Leung
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Department of Psychosis Studies, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Na Zhan
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Tomy C K Hui
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China
| | - Karen K Y Ho
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Karen S Y Hung
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Eric F C Cheung
- Department of General Adult Psychiatry, Castle Peak Hospital, Hong Kong Special Administrative Region of China
| | - Hon-Cheong So
- School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region of China; Department of Psychiatry, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region of China.
| | - Simon S Y Lui
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China.
| | - Pak C Sham
- Department of Psychiatry, School of Clinical Medicine, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong Special Administrative Region of China; State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong Special Administrative Region of China.
| |
Collapse
|
18
|
Huang YH, Chen YC, Ho WM, Lee RG, Chung RH, Liu YL, Chang PY, Chang SC, Wang CW, Chung WH, Tsai SJ, Kuo PH, Lee YS, Hsiao CC. Classifying Alzheimer's disease and normal subjects using machine learning techniques and genetic-environmental features. J Formos Med Assoc 2024; 123:701-709. [PMID: 38044212 DOI: 10.1016/j.jfma.2023.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/24/2023] [Accepted: 10/23/2023] [Indexed: 12/05/2023] Open
Abstract
BACKGROUND Alzheimer's disease (AD) is complicated by multiple environmental and polygenetic factors. The accuracy of artificial neural networks (ANNs) incorporating the common factors for identifying AD has not been evaluated. METHODS A total of 184 probable AD patients and 3773 healthy individuals aged 65 and over were enrolled. AD-related genes (51 SNPs) and 8 environmental factors were selected as features for multilayer ANN modeling. Random Forest (RF) and Support Vector Machine with RBF kernel (SVM) were also employed for comparison. Model results were verified using traditional statistics. RESULTS The ANN achieved high accuracy (0.98), sensitivity (0.95), and specificity (0.96) in the intrinsic test for AD classification. Excluding age and genetic data still yielded favorable results (accuracy: 0.97, sensitivity: 0.94, specificity: 0.96). The assigned weights to ANN features highlighted the importance of mental evaluation, years of education, and specific genetic variations (CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650) for AD classification. Receiver operating characteristic analysis revealed AUC values of 0.99 (intrinsic test), 0.60 (TWB-GWA), and 0.72 (CG-WGS), with slightly lower AUC values (0.96, 0.80, 0.52) when excluding age in ANN. The performance of the ANN model in AD classification was comparable to RF, SVM (linear kernel), and SVM (RBF kernel). CONCLUSION The ANN model demonstrated good sensitivity, specificity, and accuracy in AD classification. The top-weighted SNPs for AD prediction were CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650. The ANN model performed similarly to RF and SVM, indicating its capability to handle the complexity of AD as a disease entity.
Collapse
Affiliation(s)
- Yu-Hua Huang
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Yi-Chun Chen
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Wei-Min Ho
- Department of Neurology, Chang Gung Memorial Hospital Linkou Medical Center and College of Medicine, Chang-Gung University, Taoyuan, Taiwan
| | - Ren-Guey Lee
- Department of Electronics Engineering, National Taipei University of Technology, Taipei, Taiwan
| | - Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| | - Yu-Li Liu
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Pi-Yueh Chang
- Department of Laboratory Medicine, Chang Gung Memorial Hospital Linkou Medical Center, Taoyuan, Taiwan; Department of Medical Biotechnology and Laboratory Science, Chang Gung University, Taoyuan, Taiwan
| | - Shih-Cheng Chang
- Department of Laboratory Medicine, Chang Gung Memorial Hospital Linkou Medical Center, Taoyuan, Taiwan; Department of Medical Biotechnology and Laboratory Science, Chang Gung University, Taoyuan, Taiwan
| | - Chaung-Wei Wang
- Department of Dermatology, Drug Hypersensitivity Clinical and Research Center, Chang Gung Memorial Hospital, Linkou, Taipei and Keelung, Taiwan; Cancer Vaccine and Immune Cell Therapy Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan; Whole-Genome Research Core Laboratory of Human Diseases, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Wen-Hung Chung
- Department of Dermatology, Drug Hypersensitivity Clinical and Research Center, Chang Gung Memorial Hospital, Linkou, Taipei and Keelung, Taiwan; Cancer Vaccine and Immune Cell Therapy Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan; Whole-Genome Research Core Laboratory of Human Diseases, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Shih-Jen Tsai
- Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan; Division of Psychiatry, National Yang-Ming University, Taipei, Taiwan
| | - Po-Hsiu Kuo
- Department of Public Health, Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan
| | - Yun-Shien Lee
- Department of Biotechnology, Ming Chuan University, Taoyuan, Taiwan; Genomic Medicine Research Core Laboratory, Chang Gung Memorial Hospital, Taoyuan, Taiwan.
| | - Chun-Chieh Hsiao
- Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan; Department of Computer Information and Network Engineering, Lunghwa University of Science and Technology, Taoyuan, Taiwan.
| |
Collapse
|
19
|
Zheng D, Grandgenett PM, Zhang Q, Baine M, Shi Y, Du Q, Liang X, Wong J, Iqbal S, Preuss K, Kamal A, Yu H, Du H, Hollingsworth MA, Zhang C. radioGWAS links radiome to genome to discover driver genes with somatic mutations for heterogeneous tumor image phenotype in pancreatic cancer. Sci Rep 2024; 14:12316. [PMID: 38811597 PMCID: PMC11137018 DOI: 10.1038/s41598-024-62741-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 05/21/2024] [Indexed: 05/31/2024] Open
Abstract
Addressing the significant level of variability exhibited by pancreatic cancer necessitates the adoption of a systems biology approach that integrates molecular data, biological properties of the tumors, medical images, and clinical features of the patients. In this study, a comprehensive multi-omics methodology was employed to examine a distinctive collection of patient dataset containing rapid autopsy tumor and normal tissue samples as well as longitudinal imaging with a focus on pancreatic cancer. By performing a whole exome sequencing analysis on tumor and normal tissues to identify somatic gene variants and a radiomic feature analysis to tumor CT images, the genome-wide association approach established a connection between pancreatic cancer driver genes and relevant radiomic features, enabling a thorough and quantitative assessment of the heterogeneity of pancreatic tumors. The significant association between sets of genes and radiomic features revealed the involvement of genes in shaping tumor morphological heterogeneity. Some results of the association established a connection between the molecular level mechanism and their outcomes at the level of tumor structural heterogeneity. Because tumor structure and tumor structural heterogeneity are related to the patients' overall survival, patients who had pancreatic cancer driver gene mutations with an association to a certain radiomic feature have been observed to experience worse survival rates than cases without these somatic mutations. Furthermore, the association analysis has revealed potential gene mutations and radiomic feature candidates that warrant further investigation in future research endeavors.
Collapse
Affiliation(s)
- Dandan Zheng
- Department of Radiation Oncology, University of Rochester Medical Center, Rochester, NY, USA.
| | - Paul M Grandgenett
- Eppley Institute for Research in Cancer and Allied Diseases, University of Nebraska Medical Center, Omaha, NE, USA
| | - Qi Zhang
- Department of Mathematics and Statistics, University of New Hampshire, Durham, NH, USA
| | - Michael Baine
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Yu Shi
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Qian Du
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Xiaoying Liang
- Department of Radiation Oncology, Mayo Clinic, Jacksonville, FL, USA
| | - Jeffrey Wong
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Subhan Iqbal
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA
| | - Kiersten Preuss
- Department of Nutrition and Health Sciences, University of Nebraska, Lincoln, NE, USA
| | - Ahsan Kamal
- Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Hongfeng Yu
- School of Computing, University of Nebraska, Lincoln, NE, USA
| | - Huijing Du
- Department of Mathematics, University of Nebraska, Lincoln, NE, USA
| | - Michael A Hollingsworth
- Eppley Institute for Research in Cancer and Allied Diseases, University of Nebraska Medical Center, Omaha, NE, USA.
| | - Chi Zhang
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA.
| |
Collapse
|
20
|
Herrera-Luis E, Benke K, Volk H, Ladd-Acosta C, Wojcik GL. Gene-environment interactions in human health. Nat Rev Genet 2024:10.1038/s41576-024-00731-z. [PMID: 38806721 DOI: 10.1038/s41576-024-00731-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2024] [Indexed: 05/30/2024]
Abstract
Gene-environment interactions (G × E), the interplay of genetic variation with environmental factors, have a pivotal impact on human complex traits and diseases. Statistically, G × E can be assessed by determining the deviation from expectation of predictive models based solely on the phenotypic effects of genetics or environmental exposures. Despite the unprecedented, widespread and diverse use of G × E analytical frameworks, heterogeneity in their application and reporting hinders their applicability in public health. In this Review, we discuss study design considerations as well as G × E analytical frameworks to assess polygenic liability dependent on the environment, to identify specific genetic variants exhibiting G × E, and to characterize environmental context for these dynamics. We conclude with recommendations to address the most common challenges and pitfalls in the conceptualization, methodology and reporting of G × E studies, as well as future directions.
Collapse
Affiliation(s)
- Esther Herrera-Luis
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kelly Benke
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Heather Volk
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Christine Ladd-Acosta
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Genevieve L Wojcik
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
21
|
Li MD, Liu Q, Shi X, Wang Y, Zhu Z, Guan Y, He J, Han H, Mao Y, Ma Y, Yuan W, Yao J, Yang Z. Integrative analysis of genetics, epigenetics and RNA expression data reveal three susceptibility loci for smoking behavior in Chinese Han population. Mol Psychiatry 2024:10.1038/s41380-024-02599-1. [PMID: 38789676 DOI: 10.1038/s41380-024-02599-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 04/18/2024] [Accepted: 05/03/2024] [Indexed: 05/26/2024]
Abstract
Despite numerous studies demonstrate that genetics and epigenetics factors play important roles on smoking behavior, our understanding of their functional relevance and coordinated regulation remains largely unknown. Here we present a multiomics study on smoking behavior for Chinese smoker population with the goal of not only identifying smoking-associated functional variants but also deciphering the pathogenesis and mechanism underlying smoking behavior in this under-studied ethnic population. After whole-genome sequencing analysis of 1329 Chinese Han male samples in discovery phase and OpenArray analysis of 3744 samples in replication phase, we discovered that three novel variants located near FOXP1 (rs7635815), and between DGCR6 and PRODH (rs796774020), and in ARVCF (rs148582811) were significantly associated with smoking behavior. Subsequently cis-mQTL and cis-eQTL analysis indicated that these variants correlated significantly with the differential methylation regions (DMRs) or differential expressed genes (DEGs) located in the regions where these variants present. Finally, our in silico multiomics analysis revealed several hub genes, like DRD2, PTPRD, FOXP1, COMT, CTNNAP2, to be synergistic regulated each other in the etiology of smoking.
Collapse
Affiliation(s)
- Ming D Li
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.
- Research Center for Air Pollution and Health, Zhejiang University, Hangzhou, China.
| | - Qiang Liu
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Xiaoqiang Shi
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Yan Wang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Zhouhai Zhu
- Joint Institute of Tobacco and Health, Kunming, Yunnan, China
| | - Ying Guan
- Joint Institute of Tobacco and Health, Kunming, Yunnan, China
| | - Jingmin He
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- College of Biological Sciences, Shanxi Agricultural University, Taigu, Shanxi, China
| | - Haijun Han
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Ying Mao
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Yunlong Ma
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Wenji Yuan
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Jianhua Yao
- Joint Institute of Tobacco and Health, Kunming, Yunnan, China
| | - Zhongli Yang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, National Medical Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China.
| |
Collapse
|
22
|
Yee SW, Macdonald CB, Mitrovic D, Zhou X, Koleske ML, Yang J, Buitrago Silva D, Rockefeller Grimes P, Trinidad DD, More SS, Kachuri L, Witte JS, Delemotte L, Giacomini KM, Coyote-Maestas W. The full spectrum of SLC22 OCT1 mutations illuminates the bridge between drug transporter biophysics and pharmacogenomics. Mol Cell 2024; 84:1932-1947.e10. [PMID: 38703769 DOI: 10.1016/j.molcel.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 01/04/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024]
Abstract
Mutations in transporters can impact an individual's response to drugs and cause many diseases. Few variants in transporters have been evaluated for their functional impact. Here, we combine saturation mutagenesis and multi-phenotypic screening to dissect the impact of 11,213 missense single-amino-acid deletions, and synonymous variants across the 554 residues of OCT1, a key liver xenobiotic transporter. By quantifying in parallel expression and substrate uptake, we find that most variants exert their primary effect on protein abundance, a phenotype not commonly measured alongside function. Using our mutagenesis results combined with structure prediction and molecular dynamic simulations, we develop accurate structure-function models of the entire transport cycle, providing biophysical characterization of all known and possible human OCT1 polymorphisms. This work provides a complete functional map of OCT1 variants along with a framework for integrating functional genomics, biophysical modeling, and human genetics to predict variant effects on disease and drug efficacy.
Collapse
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Darko Mitrovic
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Megan L Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jia Yang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Dina Buitrago Silva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Patrick Rockefeller Grimes
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Donovan D Trinidad
- Department of Medicine, Division of Infectious Disease, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Swati S More
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - Lucie Delemotte
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden.
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA; Chan Zuckerberg Biohub, San Francisco, CA 94148, USA.
| |
Collapse
|
23
|
Zhou W, Cuomo ASE, Xue A, Kanai M, Chau G, Krishna C, Xavier RJ, MacArthur DG, Powell JE, Daly MJ, Neale BM. Efficient and accurate mixed model association tool for single-cell eQTL analysis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.15.24307317. [PMID: 38798318 PMCID: PMC11118640 DOI: 10.1101/2024.05.15.24307317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Understanding the genetic basis of gene expression can help us understand the molecular underpinnings of human traits and disease. Expression quantitative trait locus (eQTL) mapping can help in studying this relationship but have been shown to be very cell-type specific, motivating the use of single-cell RNA sequencing and single-cell eQTLs to obtain a more granular view of genetic regulation. Current methods for single-cell eQTL mapping either rely on the "pseudobulk" approach and traditional pipelines for bulk transcriptomics or do not scale well to large datasets. Here, we propose SAIGE-QTL, a robust and scalable tool that can directly map eQTLs using single-cell profiles without needing aggregation at the pseudobulk level. Additionally, SAIGE-QTL allows for testing the effects of less frequent/rare genetic variation through set-based tests, which is traditionally excluded from eQTL mapping studies. We evaluate the performance of SAIGE-QTL on both real and simulated data and demonstrate the improved power for eQTL mapping over existing pipelines.
Collapse
|
24
|
Ohanele C, Peoples JN, Karlstaedt A, Geiger JT, Gayle AD, Ghazal N, Sohani F, Brown ME, Davis ME, Porter GA, Faundez V, Kwong JQ. Mitochondrial citrate carrier SLC25A1 is a dosage-dependent regulator of metabolic reprogramming and morphogenesis in the developing heart. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.22.541833. [PMID: 37292906 PMCID: PMC10245819 DOI: 10.1101/2023.05.22.541833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The developing mammalian heart undergoes an important metabolic shift from glycolysis toward mitochondrial oxidation, such that oxidative phosphorylation defects may present with cardiac abnormalities. Here, we describe a new mechanistic link between mitochondria and cardiac morphogenesis, uncovered by studying mice with systemic loss of the mitochondrial citrate carrier SLC25A1. Slc25a1 null embryos displayed impaired growth, cardiac malformations, and aberrant mitochondrial function. Importantly, Slc25a1 heterozygous embryos, which are overtly indistinguishable from wild type, exhibited an increased frequency of these defects, suggesting Slc25a1 haploinsuffiency and dose-dependent effects. Supporting clinical relevance, we found a near-significant association between ultrarare human pathogenic SLC25A1 variants and pediatric congenital heart disease. Mechanistically, SLC25A1 may link mitochondria to transcriptional regulation of metabolism through epigenetic control of gene expression to promote metabolic remodeling in the developing heart. Collectively, this work positions SLC25A1 as a novel mitochondrial regulator of ventricular morphogenesis and cardiac metabolic maturation and suggests a role in congenital heart disease.
Collapse
|
25
|
Wu K, Wang W, Cheng Q, Xiao D, Li Y, Chen M, Zheng X. Rare MED12L Variants Are Associated with Susceptibility to Guttate Psoriasis in the Han Chinese Population. Dermatology 2024:1-9. [PMID: 38735287 DOI: 10.1159/000538805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Accepted: 04/08/2024] [Indexed: 05/14/2024] Open
Abstract
INTRODUCTION According to the common disease/rare variant hypothesis, it is important to study the role of rare variants in complex diseases. The association of rare variants with psoriasis has been demonstrated, but the association between rare variants and specific clinical subtypes of psoriasis has not been investigated. METHODS Gene-based and gene-level meta-analyses were performed on data extracted from our previous study data sets (2,483 patients with guttate psoriasis and 8,292 patients with non-guttate psoriasis) for genotyping. Then, haplotype analysis was performed for rare loss-of-function variants located in MED12L, and protein function prediction was performed for MED12L. Gene-based analysis at each stage had a moderate significance threshold (p < 0.05). A χ2 test was then conducted on the three potential genes, and the merged gene-based analysis was used to confirm the results. We also conducted association analysis and meta-analysis for functional variants located on the identified gene. RESULTS Through these gene-level analyses, we determined that MED12L is a guttate psoriasis susceptibility gene (p = 9.99 × 10-5), and the single-nucleotide polymorphism with the strongest association was rs199780529 (p_combine = 1 × 10-3, p_meta = 2 × 10-3). CONCLUSIONS In our study, a guttate psoriasis-specific subtype-associated susceptibility gene was confirmed in a Chinese Han population. These findings contribute to a better genetic understanding of different subtypes of psoriasis.
Collapse
Affiliation(s)
- Kejia Wu
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Wanrong Wang
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Qianhui Cheng
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- First Clinical Medical College, Anhui Medical University, Hefei, China
| | - Duncheng Xiao
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
- Second Clinical Medical College, Anhui Medical University, Hefei, China
| | - Yunxiao Li
- School of Life Science, Shandong University, Qingdao, China
| | - Mengyun Chen
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| | - Xiaodong Zheng
- Department of Dermatology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Hefei, China
- Anhui Province Laboratory of Inflammation and Immune Mediated Diseases, Hefei, China
- Anhui Provincial Institute of Translational Medicine, Hefei, China
| |
Collapse
|
26
|
Guo H, Urban AE, Wong WH. Prioritizing disease-related rare variants by integrating gene expression data. RESEARCH SQUARE 2024:rs.3.rs-4355589. [PMID: 38766095 PMCID: PMC11100897 DOI: 10.21203/rs.3.rs-4355589/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Rare variants, comprising a vast majority of human genetic variations, are likely to have more deleterious impact on human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the diseased patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer's disease reveals 16 rare variants within 15 genes with extreme carrier statistics. We also found strong excess of rare variants among the top prioritized genes in diseased patients compared to that in healthy individuals. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.
Collapse
|
27
|
Littleton SH, Trang KB, Volpe CM, Cook K, DeBruyne N, Maguire JA, Weidekamp MA, Hodge KM, Boehm K, Lu S, Chesi A, Bradfield JP, Pippin JA, Anderson SA, Wells AD, Pahl MC, Grant SFA. Variant-to-function analysis of the childhood obesity chr12q13 locus implicates rs7132908 as a causal variant within the 3' UTR of FAIM2. CELL GENOMICS 2024; 4:100556. [PMID: 38697123 PMCID: PMC11099382 DOI: 10.1016/j.xgen.2024.100556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 03/21/2024] [Accepted: 04/08/2024] [Indexed: 05/04/2024]
Abstract
The ch12q13 locus is among the most significant childhood obesity loci identified in genome-wide association studies. This locus resides in a non-coding region within FAIM2; thus, the underlying causal variant(s) presumably influence disease susceptibility via cis-regulation. We implicated rs7132908 as a putative causal variant by leveraging our in-house 3D genomic data and public domain datasets. Using a luciferase reporter assay, we observed allele-specific cis-regulatory activity of the immediate region harboring rs7132908. We generated isogenic human embryonic stem cell lines homozygous for either rs7132908 allele to assess changes in gene expression and chromatin accessibility throughout a differentiation to hypothalamic neurons, a key cell type known to regulate feeding behavior. The rs7132908 obesity risk allele influenced expression of FAIM2 and other genes and decreased the proportion of neurons produced by differentiation. We have functionally validated rs7132908 as a causal obesity variant that temporally regulates nearby effector genes and influences neurodevelopment and survival.
Collapse
Affiliation(s)
- Sheridan H Littleton
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Khanh B Trang
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Christina M Volpe
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kieona Cook
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Psychiatry, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Nicole DeBruyne
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jean Ann Maguire
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Mary Ann Weidekamp
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kenyaita M Hodge
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Keith Boehm
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Sumei Lu
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Alessandra Chesi
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jonathan P Bradfield
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Quantinuum Research LLC, San Diego, CA 92101, USA
| | - James A Pippin
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Stewart A Anderson
- Department of Child and Adolescent Psychiatry, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew D Wells
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Institute for Immunology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Matthew C Pahl
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Struan F A Grant
- Center for Spatial and Functional Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Division of Endocrinology and Diabetes, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA.
| |
Collapse
|
28
|
Mørup SB, Leung P, Reilly C, Sherman BT, Chang W, Milojevic M, Milinkovic A, Liappis A, Borgwardt L, Petoumenos K, Paredes R, Mistry SS, MacPherson CR, Lundgren J, Helleberg M, Reekie J, Murray DD. The association between single-nucleotide polymorphisms within type 1 interferon pathway genes and human immunodeficiency virus type 1 viral load in antiretroviral-naïve participants. AIDS Res Ther 2024; 21:27. [PMID: 38698440 PMCID: PMC11067292 DOI: 10.1186/s12981-024-00610-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 03/29/2024] [Indexed: 05/05/2024] Open
Abstract
BACKGROUND Human genetic contribution to HIV progression remains inadequately explained. The type 1 interferon (IFN) pathway is important for host control of HIV and variation in type 1 IFN genes may contribute to disease progression. This study assessed the impact of variations at the gene and pathway level of type 1 IFN on HIV-1 viral load (VL). METHODS Two cohorts of antiretroviral (ART) naïve participants living with HIV (PLWH) with either early (START) or advanced infection (FIRST) were analysed separately. Type 1 IFN genes (n = 17) and receptor subunits (IFNAR1, IFNAR2) were examined for both cumulated type 1 IFN pathway analysis and individual gene analysis. SKAT-O was applied to detect associations between the genotype and HIV-1 study entry viral load (log10 transformed) as a proxy for set point VL; P-values were corrected using Bonferroni (P < 0.0025). RESULTS The analyses among those with early infection included 2429 individuals from five continents. The median study entry HIV VL was 14,623 (IQR 3460-45100) copies/mL. Across 673 SNPs within 19 type 1 IFN genes, no significant association with study entry VL was detected. Conversely, examining individual genes in START showed a borderline significant association between IFNW1, and study entry VL (P = 0.0025). This significance remained after separate adjustments for age, CD4+ T-cell count, CD4+/CD8+ T-cell ratio and recent infection. When controlling for population structure using linear mixed effects models (LME), in addition to principal components used in the main model, this was no longer significant (p = 0.0244). In subgroup analyses stratified by geographical region, the association between IFNW1 and study entry VL was only observed among African participants, although, the association was not significant when controlling for population structure using LME. Of the 17 SNPs within the IFNW1 region, only rs79876898 (A > G) was associated with study entry VL (p = 0.0020, beta = 0.32; G associated with higher study entry VL than A) in single SNP association analyses. The findings were not reproduced in FIRST participants. CONCLUSION Across 19 type 1 IFN genes, only IFNW1 was associated with HIV-1 study entry VL in a cohort of ART-naïve individuals in early stages of their infection, however, this was no longer significant in sensitivity analyses that controlled for population structures using LME.
Collapse
Affiliation(s)
- Sara Bohnstedt Mørup
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Preston Leung
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Cavan Reilly
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Brad T Sherman
- Laboratory of Human Retrovirology and Immunoinformatics, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Weizhong Chang
- Laboratory of Human Retrovirology and Immunoinformatics, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Maja Milojevic
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Ana Milinkovic
- Chelsea and Westminster Hospital NHS Foundation Trust, London, UK
| | - Angelike Liappis
- Washington DC Veterans Affairs Medical Center and The George Washington University School of Medicine and Health Sciences, Washington, DC, USA
| | - Line Borgwardt
- Center for Genomic Medicine, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Kathy Petoumenos
- Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Roger Paredes
- Department of Infectious Diseases and IrsiCaixa, Hospital Universitari Germans Trias i Pujol, Badalona, Spain
| | - Shweta S Mistry
- Division of Biostatistics and Health Data Science, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Cameron R MacPherson
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Institut Roche, Boulogne-Billancourt, France
| | - Jens Lundgren
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Department of Infectious Diseases, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Marie Helleberg
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
- Department of Infectious Diseases, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Joanne Reekie
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark
| | - Daniel D Murray
- Centre of Excellence for Health, Immunity, and Infections, Copenhagen University Hospital, Rigshospitalet, Copenhagen, Denmark.
| |
Collapse
|
29
|
Bass AJ, Bian S, Wingo AP, Wingo TS, Cutler DJ, Epstein MP. Identifying latent genetic interactions in genome-wide association studies using multiple traits. Genome Med 2024; 16:62. [PMID: 38664839 PMCID: PMC11044415 DOI: 10.1186/s13073-024-01329-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 04/02/2024] [Indexed: 04/28/2024] Open
Abstract
The "missing" heritability of complex traits may be partly explained by genetic variants interacting with other genes or environments that are difficult to specify, observe, and detect. We propose a new kernel-based method called Latent Interaction Testing (LIT) to screen for genetic interactions that leverages pleiotropy from multiple related traits without requiring the interacting variable to be specified or observed. Using simulated data, we demonstrate that LIT increases power to detect latent genetic interactions compared to univariate methods. We then apply LIT to obesity-related traits in the UK Biobank and detect variants with interactive effects near known obesity-related genes (URL: https://CRAN.R-project.org/package=lit ).
Collapse
Affiliation(s)
- Andrew J Bass
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA.
| | - Shijia Bian
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA, 30322, USA
| | - Aliza P Wingo
- Department of Psychiatry, Emory University, Atlanta, GA, 30322, USA
| | - Thomas S Wingo
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA
- Department of Neurology, Emory University, Atlanta, GA, 30322, USA
| | - David J Cutler
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA
| | - Michael P Epstein
- Department of Human Genetics, Emory University, Atlanta, GA, 30322, USA.
| |
Collapse
|
30
|
Cui Y, Ye W, Li JS, Li JJ, Vilain E, Sallam T, Li W. A genome-wide spectrum of tandem repeat expansions in 338,963 humans. Cell 2024; 187:2336-2341.e5. [PMID: 38582080 PMCID: PMC11065452 DOI: 10.1016/j.cell.2024.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 01/23/2024] [Accepted: 03/05/2024] [Indexed: 04/08/2024]
Abstract
The Genome Aggregation Database (gnomAD), widely recognized as the gold-standard reference map of human genetic variation, has largely overlooked tandem repeat (TR) expansions, despite the fact that TRs constitute ∼6% of our genome and are linked to over 50 human diseases. Here, we introduce the TR-gnomAD (https://wlcb.oit.uci.edu/TRgnomAD), a biobank-scale reference of 0.86 million TRs derived from 338,963 whole-genome sequencing (WGS) samples of diverse ancestries (39.5% non-European samples). TR-gnomAD offers critical insights into ancestry-specific disease prevalence using disparities in TR unit number frequencies among ancestries. Moreover, TR-gnomAD is able to differentiate between common, presumably benign TR expansions, which are prevalent in TR-gnomAD, from those potentially pathogenic TR expansions, which are found more frequently in disease groups than within TR-gnomAD. Together, TR-gnomAD is an invaluable resource for researchers and physicians to interpret TR expansions in individuals with genetic diseases.
Collapse
Affiliation(s)
- Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA.
| | - Wenbin Ye
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Jason Sheng Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Eric Vilain
- Institute for Clinical and Translational Science, University of California, Irvine, Irvine, CA 92697, USA; Department of Pediatrics, University of California, Irvine, Irvine, CA 92697, USA
| | - Tamer Sallam
- Division of Cardiology, Department of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
31
|
Liu M, Su YR, Liu Y, Hsu L, He Q. Structured testing of genetic association with mixed clinical outcomes. Genet Epidemiol 2024. [PMID: 38606632 DOI: 10.1002/gepi.22560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 02/15/2024] [Accepted: 03/27/2024] [Indexed: 04/13/2024]
Abstract
Genetic factors play a fundamental role in disease development. Studying the genetic association with clinical outcomes is critical for understanding disease biology and devising novel treatment targets. However, the frequencies of genetic variations are often low, making it difficult to examine the variants one-by-one. Moreover, the clinical outcomes are complex, including patients' survival time and other binary or continuous outcomes such as recurrences and lymph node count, and how to effectively analyze genetic association with these outcomes remains unclear. In this article, we proposed a structured test statistic for testing genetic association with mixed types of survival, binary, and continuous outcomes. The structured testing incorporates known biological information of variants while allowing for their heterogeneous effects and is a powerful strategy for analyzing infrequent genetic factors. Simulation studies show that the proposed test statistic has correct type I error and is highly effective in detecting significant genetic variants. We applied our approach to a uterine corpus endometrial carcinoma study and identified several genetic pathways associated with the clinical outcomes.
Collapse
Affiliation(s)
- Meiling Liu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Yu-Ru Su
- Biostatistics Division, Kaiser Permanente Washington Health Research Institute, Seattle, Washington, USA
| | - Yang Liu
- Department of Mathematics and Statistics, Wright State University, Dayton, Ohio, USA
| | - Li Hsu
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Qianchuan He
- Public Health Sciences Division, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| |
Collapse
|
32
|
Nagle MF, Yuan J, Kaur D, Ma C, Peremyslova E, Jiang Y, Niño de Rivera A, Jawdy S, Chen JG, Feng K, Yates TB, Tuskan GA, Muchero W, Fuxin L, Strauss SH. GWAS supported by computer vision identifies large numbers of candidate regulators of in planta regeneration in Populus trichocarpa. G3 (BETHESDA, MD.) 2024; 14:jkae026. [PMID: 38325329 PMCID: PMC10989874 DOI: 10.1093/g3journal/jkae026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/18/2024] [Accepted: 01/20/2024] [Indexed: 02/09/2024]
Abstract
Plant regeneration is an important dimension of plant propagation and a key step in the production of transgenic plants. However, regeneration capacity varies widely among genotypes and species, the molecular basis of which is largely unknown. Association mapping methods such as genome-wide association studies (GWAS) have long demonstrated abilities to help uncover the genetic basis of trait variation in plants; however, the performance of these methods depends on the accuracy and scale of phenotyping. To enable a large-scale GWAS of in planta callus and shoot regeneration in the model tree Populus, we developed a phenomics workflow involving semantic segmentation to quantify regenerating plant tissues over time. We found that the resulting statistics were of highly non-normal distributions, and thus employed transformations or permutations to avoid violating assumptions of linear models used in GWAS. We report over 200 statistically supported quantitative trait loci (QTLs), with genes encompassing or near to top QTLs including regulators of cell adhesion, stress signaling, and hormone signaling pathways, as well as other diverse functions. Our results encourage models of hormonal signaling during plant regeneration to consider keystone roles of stress-related signaling (e.g. involving jasmonates and salicylic acid), in addition to the auxin and cytokinin pathways commonly considered. The putative regulatory genes and biological processes we identified provide new insights into the biological complexity of plant regeneration, and may serve as new reagents for improving regeneration and transformation of recalcitrant genotypes and species.
Collapse
Affiliation(s)
- Michael F Nagle
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Jialin Yuan
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Damanpreet Kaur
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Cathleen Ma
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Ekaterina Peremyslova
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Yuan Jiang
- Statistics Department, Oregon State University, 239 Weniger Hall, Corvallis, OR 97331, USA
| | - Alexa Niño de Rivera
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| | - Sara Jawdy
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Jin-Gui Chen
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Kai Feng
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Timothy B Yates
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
| | - Wellington Muchero
- Biosciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA
- Bredesen Center for Interdisciplinary Research, University of Tennessee-Knoxville, 310 Ferris Hall 1508 Middle Dr, Knoxville, TN 37996, USA
| | - Li Fuxin
- Department of Electrical Engineering and Computer Science, Oregon State University, 1148 Kelley Engineering Center, Corvallis, OR 97331, USA
| | - Steven H Strauss
- Department of Forest Ecosystems and Society, Oregon State University, 321 Richardson Hall, Corvallis, OR 97311, USA
| |
Collapse
|
33
|
He XY, Wu BS, Yang L, Guo Y, Deng YT, Li ZY, Fei CJ, Liu WS, Ge YJ, Kang J, Feng J, Cheng W, Dong Q, Yu JT. Genetic associations of protein-coding variants in venous thromboembolism. Nat Commun 2024; 15:2819. [PMID: 38561338 PMCID: PMC10984941 DOI: 10.1038/s41467-024-47178-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 03/19/2024] [Indexed: 04/04/2024] Open
Abstract
Previous genetic studies of venous thromboembolism (VTE) have been largely limited to common variants, leaving the genetic determinants relatively incomplete. We performed an exome-wide association study of VTE among 14,723 cases and 334,315 controls. Fourteen known and four novel genes (SRSF6, PHPT1, CGN, and MAP3K2) were identified through protein-coding variants, with broad replication in the FinnGen cohort. Most genes we discovered exhibited the potential to predict future VTE events in longitudinal analysis. Notably, we provide evidence for the additive contribution of rare coding variants to known genome-wide polygenic risk in shaping VTE risk. The identified genes were enriched in pathways affecting coagulation and platelet activation, along with liver-specific expression. The pleiotropic effects of these genes indicated the potential involvement of coagulation factors, blood cell traits, liver function, and immunometabolic processes in VTE pathogenesis. In conclusion, our study unveils the valuable contribution of protein-coding variants in VTE etiology and sheds new light on its risk stratification.
Collapse
Affiliation(s)
- Xiao-Yu He
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Bang-Sheng Wu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Liu Yang
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yu Guo
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yue-Ting Deng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Ze-Yu Li
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Chen-Jie Fei
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Wei-Shi Liu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yi-Jun Ge
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China
| | - Jujiao Kang
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Jianfeng Feng
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China
- Department of Computer Science, University of Warwick, Coventry, UK
| | - Wei Cheng
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China.
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Fudan University, Ministry of Education, Shanghai, China.
- Department of Computer Science, University of Warwick, Coventry, UK.
| | - Qiang Dong
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China.
| | - Jin-Tai Yu
- Department of Neurology and National Center for Neurological Disorders, Huashan Hospital, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Shanghai Medical College, Fudan University, Shanghai, China.
| |
Collapse
|
34
|
Peng Q, Gilder DA, Bernert RA, Karriker-Jaffe KJ, Ehlers CL. Genetic factors associated with suicidal behaviors and alcohol use disorders in an American Indian population. Mol Psychiatry 2024; 29:902-913. [PMID: 38177348 PMCID: PMC11176067 DOI: 10.1038/s41380-023-02379-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 12/12/2023] [Accepted: 12/13/2023] [Indexed: 01/06/2024]
Abstract
American Indians (AI) demonstrate the highest rates of both suicidal behaviors (SB) and alcohol use disorders (AUD) among all ethnic groups in the US. Rates of suicide and AUD vary substantially between tribal groups and across different geographical regions, underscoring a need to delineate more specific risk and resilience factors. Using data from over 740 AI living within eight contiguous reservations, we assessed genetic risk factors for SB by investigating: (1) possible genetic overlap with AUD, and (2) impacts of rare and low-frequency genomic variants. Suicidal behaviors included lifetime history of suicidal thoughts and acts, including verified suicide deaths, scored using a ranking variable for the SB phenotype (range 0-4). We identified five loci significantly associated with SB and AUD, two of which are intergenic and three intronic on genes AACSP1, ANK1, and FBXO11. Nonsynonymous rare and low-frequency mutations in four genes including SERPINF1 (PEDF), ZNF30, CD34, and SLC5A9, and non-intronic rare and low-frequency mutations in genes OPRD1, HSD17B3 and one lincRNA were significantly associated with SB. One identified pathway related to hypoxia-inducible factor (HIF) regulation, whose 83 nonsynonymous rare and low-frequency variants on 10 genes were significantly linked to SB as well. Four additional genes, and two pathways related to vasopressin-regulated water metabolism and cellular hexose transport, also were strongly associated with SB. This study represents the first investigation of genetic factors for SB in an American Indian population that has high risk for suicide. Our study suggests that bivariate association analysis between comorbid disorders can increase statistical power; and rare and low-frequency variant analysis in a high-risk population enabled by whole-genome sequencing has the potential to identify novel genetic factors. Although such findings may be population specific, rare functional mutations relating to PEDF and HIF regulation align with past reports and suggest a biological mechanism for suicide risk and a potential therapeutic target for intervention.
Collapse
Affiliation(s)
- Qian Peng
- Department of Neuroscience, The Scripps Research Institute, La Jolla, CA, USA.
| | - David A Gilder
- Department of Neuroscience, The Scripps Research Institute, La Jolla, CA, USA
| | - Rebecca A Bernert
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, USA
| | | | - Cindy L Ehlers
- Department of Neuroscience, The Scripps Research Institute, La Jolla, CA, USA
| |
Collapse
|
35
|
Alfayyadh MM, Maksemous N, Sutherland HG, Lea RA, Griffiths LR. Unravelling the Genetic Landscape of Hemiplegic Migraine: Exploring Innovative Strategies and Emerging Approaches. Genes (Basel) 2024; 15:443. [PMID: 38674378 PMCID: PMC11049430 DOI: 10.3390/genes15040443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Migraine is a severe, debilitating neurovascular disorder. Hemiplegic migraine (HM) is a rare and debilitating neurological condition with a strong genetic basis. Sequencing technologies have improved the diagnosis and our understanding of the molecular pathophysiology of HM. Linkage analysis and sequencing studies in HM families have identified pathogenic variants in ion channels and related genes, including CACNA1A, ATP1A2, and SCN1A, that cause HM. However, approximately 75% of HM patients are negative for these mutations, indicating there are other genes involved in disease causation. In this review, we explored our current understanding of the genetics of HM. The evidence presented herein summarises the current knowledge of the genetics of HM, which can be expanded further to explain the remaining heritability of this debilitating condition. Innovative bioinformatics and computational strategies to cover the entire genetic spectrum of HM are also discussed in this review.
Collapse
Affiliation(s)
| | | | | | | | - Lyn R. Griffiths
- Centre for Genomics and Personalised Health, Genomics Research Centre, School of Biomedical Sciences, Queensland University of Technology (QUT), Brisbane, QLD 4059, Australia; (M.M.A.); (N.M.); (H.G.S.); (R.A.L.)
| |
Collapse
|
36
|
Ruiz-Serra V, Valentini S, Madroñero S, Valencia A, Porta-Pardo E. 3Dmapper: a command line tool for BioBank-scale mapping of variants to protein structures. Bioinformatics 2024; 40:btae171. [PMID: 38565273 PMCID: PMC11018535 DOI: 10.1093/bioinformatics/btae171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 02/09/2024] [Accepted: 03/30/2024] [Indexed: 04/04/2024] Open
Abstract
MOTIVATION The interpretation of genomic data is crucial to understand the molecular mechanisms of biological processes. Protein structures play a vital role in facilitating this interpretation by providing functional context to genetic coding variants. However, mapping genes to proteins is a tedious and error-prone task due to inconsistencies in data formats. Over the past two decades, numerous tools and databases have been developed to automatically map annotated positions and variants to protein structures. However, most of these tools are web-based and not well-suited for large-scale genomic data analysis. RESULTS To address this issue, we introduce 3Dmapper, a stand-alone command-line tool developed in Python and R. It systematically maps annotated protein positions and variants to protein structures, providing a solution that is both efficient and reliable. AVAILABILITY AND IMPLEMENTATION https://github.com/vicruiser/3Dmapper.
Collapse
Affiliation(s)
- Victoria Ruiz-Serra
- Barcelona Supercomputing Center (BSC)
- Josep Carreras Leukaemia Research Institute (IJC), Badalona 08916, Spain
| | - Samuel Valentini
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, Trento 38123, Italy
| | - Sergi Madroñero
- Josep Carreras Leukaemia Research Institute (IJC), Badalona 08916, Spain
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC)
- Institució Catalana de Recerca Avançada (ICREA)
| | - Eduard Porta-Pardo
- Barcelona Supercomputing Center (BSC)
- Josep Carreras Leukaemia Research Institute (IJC), Badalona 08916, Spain
| |
Collapse
|
37
|
Das Adhikari S, Cui Y, Wang J. BayesKAT: bayesian optimal kernel-based test for genetic association studies reveals joint genetic effects in complex diseases. Brief Bioinform 2024; 25:bbae182. [PMID: 38653490 PMCID: PMC11036342 DOI: 10.1093/bib/bbae182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 03/10/2024] [Accepted: 04/05/2024] [Indexed: 04/25/2024] Open
Abstract
Genome-wide Association Studies (GWAS) methods have identified individual single-nucleotide polymorphisms (SNPs) significantly associated with specific phenotypes. Nonetheless, many complex diseases are polygenic and are controlled by multiple genetic variants that are usually non-linearly dependent. These genetic variants are marginally less effective and remain undetected in GWAS analysis. Kernel-based tests (KBT), which evaluate the joint effect of a group of genetic variants, are therefore critical for complex disease analysis. However, choosing different kernel functions in KBT can significantly influence the type I error control and power, and selecting the optimal kernel remains a statistically challenging task. A few existing methods suffer from inflated type 1 errors, limited scalability, inferior power or issues of ambiguous conclusions. Here, we present a new Bayesian framework, BayesKAT (https://github.com/wangjr03/BayesKAT), which overcomes these kernel specification issues by selecting the optimal composite kernel adaptively from the data while testing genetic associations simultaneously. Furthermore, BayesKAT implements a scalable computational strategy to boost its applicability, especially for high-dimensional cases where other methods become less effective. Based on a series of performance comparisons using both simulated and real large-scale genetics data, BayesKAT outperforms the available methods in detecting complex group-level associations and controlling type I errors simultaneously. Applied on a variety of groups of functionally related genetic variants based on biological pathways, co-expression gene modules and protein complexes, BayesKAT deciphers the complex genetic basis and provides mechanistic insights into human diseases.
Collapse
Affiliation(s)
- Sikta Das Adhikari
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| | - Jianrong Wang
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
38
|
Guo H, Urban AE, Wong WH. Prioritizing disease-related rare variants by integrating gene expression data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.19.585836. [PMID: 38562756 PMCID: PMC10983955 DOI: 10.1101/2024.03.19.585836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Rare variants, comprising a vast majority of human genetic variations, are likely to have more deleterious impact on human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the diseased patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer's disease reveals 16 rare variants within 15 genes with extreme carrier statistics. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.
Collapse
Affiliation(s)
- Hanmin Guo
- Department of Statistics, Stanford University, Stanford, California 94305, USA
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Alexander Eckehart Urban
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California 94305, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
| | - Wing Hung Wong
- Department of Statistics, Stanford University, Stanford, California 94305, USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, California 94305, USA
| |
Collapse
|
39
|
Cinciripini PM, Wetter DW, Wang J, Yu R, Kypriotakis G, Kumar T, Robinson JD, Cui Y, Green CE, Bergen AW, Kosten TR, Scherer SE, Shete S. Deep sequencing of candidate genes identified 14 variants associated with smoking abstinence in an ethnically diverse sample. Sci Rep 2024; 14:6385. [PMID: 38493193 PMCID: PMC10944542 DOI: 10.1038/s41598-024-56750-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 03/11/2024] [Indexed: 03/18/2024] Open
Abstract
Despite the large public health toll of smoking, genetic studies of smoking cessation have been limited with few discoveries of risk or protective loci. We investigated common and rare variant associations with success in quitting smoking using a cohort from 8 randomized controlled trials involving 2231 participants and a total of 10,020 common and 24,147 rare variants. We identified 14 novel markers including 6 mapping to genes previously related to psychiatric and substance use disorders, 4 of which were protective (CYP2B6 (rs1175607105), HTR3B (rs1413172952; rs1204720503), rs80210037 on chr15), and 2 of which were associated with reduced cessation (PARP15 (rs2173763), SCL18A2 (rs363222)). The others mapped to areas associated with cancer including FOXP1 (rs1288980) and ZEB1 (rs7349). Network analysis identified significant canonical pathways for the serotonin receptor signaling pathway, nicotine and bupropion metabolism, and several related to tumor suppression. Two novel markers (rs6749438; rs6718083) on chr2 are flanked by genes associated with regulation of bodyweight. The identification of novel loci in this study can provide new targets of pharmacotherapy and inform efforts to develop personalized treatments based on genetic profiles.
Collapse
Affiliation(s)
- Paul M Cinciripini
- Department of Behavioral Science, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
| | - David W Wetter
- Department of Department of Population Health Sciences, University of Utah and Huntsman Cancer Institute, Salt Lake City, Utah, 84112, USA
| | - Jian Wang
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Robert Yu
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - George Kypriotakis
- Department of Behavioral Science, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
| | - Tapsi Kumar
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Jason D Robinson
- Department of Behavioral Science, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Yong Cui
- Department of Behavioral Science, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA
| | - Charles E Green
- Department of Pediatrics, The University of Texas Medical School at Houston, Houston, TX, 77030, USA
| | | | - Thomas R Kosten
- Department of Psychiatry, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Steven E Scherer
- Department of Molecular and Human Genetics, Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Sanjay Shete
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
- Department of Epidemiology, The University of Texas MD Anderson Cancer Center, Houston, TX, 77030, USA.
| |
Collapse
|
40
|
Marques MC, Rubin D, Shuldiner E, Datta M, Schmitz E, Cruz GG, Patt A, Bennett E, Grom A, Foell D, Gattorno M, Bohnsack J, Yeung RSM, Prahalad S, Mellins E, Anton J, Len CA, Oliveira S, Woo P, Ozen S, Deng Z, Ombrello MJ. Enrichment of Rare Variants of Hemophagocytic Lymphohistiocytosis Genes in Systemic Juvenile Idiopathic Arthritis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.13.24304215. [PMID: 38529491 PMCID: PMC10962746 DOI: 10.1101/2024.03.13.24304215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
Objective To evaluate whether there is an enrichment of rare variants in familial hemophagocytic lymphohistiocytosis (HLH) genes and systemic juvenile idiopathic arthritis (sJIA) with or without macrophage activation syndrome (MAS). Methods Targeted sequencing of HLH genes (LYST, PRF1, RAB27A, STX11, STXBP2, UNC13D) was performed in sJIA subjects from an established cohort. Sequence data from control subjects were obtained in silico (dbGaP:phs000280.v8.p2). Rare variant association testing (RVT) was performed with sequence kernel association test (SKAT) package. Significance was defined as p<0.05 after 100,000 permutations. Results Sequencing data from 524 sJIA cases were jointly called and harmonized with exome-derived target data from 3000 controls. Quality control operations produced a set of 481 cases and 2924 ancestrally-matched control subjects. RVT of sJIA cases and controls revealed a significant association with rare protein-altering variants (minor allele frequency [MAF]<0.01) of STXBP2 (p=0.020), and ultra-rare variants (MAF<0.001) of STXBP2 (p=0.007) and UNC13D (p=0.045). A subanalysis of 32 cases with known MAS and 90 without revealed significant association of rare UNC13D variants (p=0.0047). Additionally, sJIA patients more often carried ≥2 HLH variants than did controls (p=0.007), driven largely by digenic combinations involving LYST. Conclusion We identified an enrichment of rare HLH variants in sJIA patients compared with healthy controls, driven by STXBP2 and UNC13D. Biallelic variation in HLH genes was associated with sJIA, driven by LYST. Only UNC13D displayed enrichment in patients with MAS. This suggests that HLH variants may contribute to the pathophysiology of sJIA, even without MAS.
Collapse
Affiliation(s)
- Mariana Correia Marques
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Danielle Rubin
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Emily Shuldiner
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Mallika Datta
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Elizabeth Schmitz
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Gustavo Gutierrez Cruz
- Genomic Technology Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Andrew Patt
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Elizabeth Bennett
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Alexei Grom
- Rheumatology, Cincinnati Children`s Hospital Medical Center, Cincinnati, USA
| | - Dirk Foell
- Department of Pediatric Rheumatology and Immunology, University Hospital Muenster, Muenster, Germany
| | - Marco Gattorno
- Unit Rheumatology and Autoinflammatory Diseases, IRCCS Istituto G. Gaslini, Genoa, Italy
| | - John Bohnsack
- Department of Pediatrics, University of Utah Eccles School of Medicine, Salt Lake City, USA
| | - Rae S M Yeung
- The Hospital for Sick Children, University of Toronto, Toronto, Canada
| | - Sampath Prahalad
- Emory University school of Medicine, Atlanta, USA
- Children's Healthcare of Atlanta, Atlanta, USA
| | - Elizabeth Mellins
- Department of Pediatrics, Program in Immunology, Stanford University, Stanford, USA
| | - Jordi Anton
- Hospital Sant Joan de Déu, Universitat de Barcelona, Barcelona, Spain
| | | | - Sheila Oliveira
- Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | | | - Seza Ozen
- Department of Pediatrics, Hacettepe University, Ankara, Turkey
| | - Zuoming Deng
- Biodata Mining and Discovery Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| | - Michael J Ombrello
- Translational Genetics and Genomics Section, National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, USA
| |
Collapse
|
41
|
He M, Zhao N. A Mixed Effect Similarity Matrix Regression Model (SMRmix) for Integrating Multiple Microbiome Datasets at Community Level and its Application in HIV. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.10.584315. [PMID: 38559012 PMCID: PMC10979838 DOI: 10.1101/2024.03.10.584315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Recent studies have highlighted the importance of human microbiota in our health and diseases. However, in many areas of research, individual microbiome studies often offer inconsistent results due to the limited sample sizes and the heterogeneity in study populations and experimental procedures. Integrative analysis of multiple microbiome datasets is necessary. However, statistical methods that incorporate multiple microbiome datasets and account for the study heterogeneity are not available in the literature. In this paper, we develop a mixed effect similarity matrix regression (SMRmix) approach for identifying community level microbiome shifts between outcomes. SMRmix has a close connection with the microbiome kernel association test, one of the most popular approaches for such a task but is only applicable when we have a single study. Via extensive simulations, we show that SMRmix has well-controlled type I error and higher power than some potential competitors. We also applied SMRmix to data from the HIV-reanalysis consortium, a collective effort that obtained all publicly available data on gut microbiome and HIV at December 2017, and obtained consistent associations of gut microbiome with HIV infection, and with MSM status (i.e. men who have sex with men).
Collapse
|
42
|
Zhang S, Jiang Z, Zeng P. Incorporating genetic similarity of auxiliary samples into eGene identification under the transfer learning framework. J Transl Med 2024; 22:258. [PMID: 38461317 PMCID: PMC10924384 DOI: 10.1186/s12967-024-05053-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 03/01/2024] [Indexed: 03/11/2024] Open
Abstract
BACKGROUND The term eGene has been applied to define a gene whose expression level is affected by at least one independent expression quantitative trait locus (eQTL). It is both theoretically and empirically important to identify eQTLs and eGenes in genomic studies. However, standard eGene detection methods generally focus on individual cis-variants and cannot efficiently leverage useful knowledge acquired from auxiliary samples into target studies. METHODS We propose a multilocus-based eGene identification method called TLegene by integrating shared genetic similarity information available from auxiliary studies under the statistical framework of transfer learning. We apply TLegene to eGene identification in ten TCGA cancers which have an explicit relevant tissue in the GTEx project, and learn genetic effect of variant in TCGA from GTEx. We also adopt TLegene to the Geuvadis project to evaluate its usefulness in non-cancer studies. RESULTS We observed substantial genetic effect correlation of cis-variants between TCGA and GTEx for a larger number of genes. Furthermore, consistent with the results of our simulations, we found that TLegene was more powerful than existing methods and thus identified 169 distinct candidate eGenes, which was much larger than the approach that did not consider knowledge transfer across target and auxiliary studies. Previous studies and functional enrichment analyses provided empirical evidence supporting the associations of discovered eGenes, and it also showed evidence of allelic heterogeneity of gene expression. Furthermore, TLegene identified more eGenes in Geuvadis and revealed that these eGenes were mainly enriched in cells EBV transformed lymphocytes tissue. CONCLUSION Overall, TLegene represents a flexible and powerful statistical method for eGene identification through transfer learning of genetic similarity shared across auxiliary and target studies.
Collapse
Affiliation(s)
- Shuo Zhang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Zhou Jiang
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Ping Zeng
- Department of Biostatistics, School of Public Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Center for Medical Statistics and Data Analysis, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Human Genetics and Environmental Medicine, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Key Laboratory of Environment and Health, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Xuzhou Engineering Research Innovation Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
- Jiangsu Engineering Research Center of Biological Data Mining and Healthcare Transformation, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China.
| |
Collapse
|
43
|
Oh EY, Han KM, Kim A, Kang Y, Tae WS, Han MR, Ham BJ. Integration of whole-exome sequencing and structural neuroimaging analysis in major depressive disorder: a joint study. Transl Psychiatry 2024; 14:141. [PMID: 38461185 PMCID: PMC10924915 DOI: 10.1038/s41398-024-02849-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 02/07/2024] [Accepted: 02/22/2024] [Indexed: 03/11/2024] Open
Abstract
Major depressive disorder (MDD) is a common mental illness worldwide and is triggered by an intricate interplay between environmental and genetic factors. Although there are several studies on common variants in MDD, studies on rare variants are relatively limited. In addition, few studies have examined the genetic contributions to neurostructural alterations in MDD using whole-exome sequencing (WES). We performed WES in 367 patients with MDD and 161 healthy controls (HCs) to detect germline and copy number variations in the Korean population. Gene-based rare variants were analyzed to investigate the association between the genes and individuals, followed by neuroimaging-genetic analysis to explore the neural mechanisms underlying the genetic impact in 234 patients with MDD and 135 HCs using diffusion tensor imaging data. We identified 40 MDD-related genes and observed 95 recurrent regions of copy number variations. We also discovered a novel gene, FRMPD3, carrying rare variants that influence MDD. In addition, the single nucleotide polymorphism rs771995197 in the MUC6 gene was significantly associated with the integrity of widespread white matter tracts. Moreover, we identified 918 rare exonic missense variants in genes associated with MDD susceptibility. We postulate that rare variants of FRMPD3 may contribute significantly to MDD, with a mild penetration effect.
Collapse
Affiliation(s)
- Eun-Young Oh
- Division of Life Sciences, College of Life Sciences and Bioengineering, Incheon National University, Incheon, Republic of Korea
| | - Kyu-Man Han
- Department of Psychiatry, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea
- Brain Convergence Research Center, Korea University College of Medicine, Seoul, Republic of Korea
| | - Aram Kim
- Department of Biomedical Sciences, Korea University College of Medicine, Seoul, Republic of Korea
| | - Youbin Kang
- Department of Biomedical Sciences, Korea University College of Medicine, Seoul, Republic of Korea
| | - Woo-Suk Tae
- Brain Convergence Research Center, Korea University College of Medicine, Seoul, Republic of Korea
| | - Mi-Ryung Han
- Division of Life Sciences, College of Life Sciences and Bioengineering, Incheon National University, Incheon, Republic of Korea.
| | - Byung-Joo Ham
- Department of Psychiatry, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Republic of Korea.
- Brain Convergence Research Center, Korea University College of Medicine, Seoul, Republic of Korea.
| |
Collapse
|
44
|
Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.10.579721. [PMID: 38496530 PMCID: PMC10942266 DOI: 10.1101/2024.02.10.579721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these two fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we derive a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., Genome-Wide Association Studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur using analytical theory and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate this by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study of this, we re-examine an analysis testing for co-evolution of expression levels between genes across a fungal phylogeny, and show that including covariance matrix eigenvectors as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.
Collapse
|
45
|
Acharya S, Liao S, Jung WJ, Kang YS, Moghaddam VA, Feitosa M, Wojczynski M, Lin S, Anema JA, Schwander K, Connell JO, Province M, Brent MR. Multi-omics Integration Identifies Genes Influencing Traits Associated with Cardiovascular Risks: The Long Life Family Study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.03.04.24303657. [PMID: 38496585 PMCID: PMC10942516 DOI: 10.1101/2024.03.04.24303657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
The Long Life Family Study (LLFS) enrolled 4,953 participants in 539 pedigrees displaying exceptional longevity. To identify genetic mechanisms that affect cardiovascular risks in the LLFS population, we developed a multi-omics integration pipeline and applied it to 11 traits associated with cardiovascular risks. Using our pipeline, we aggregated gene-level statistics from rare-variant analysis, GWAS, and gene expression-trait association by Correlated Meta-Analysis (CMA). Across all traits, CMA identified 64 significant genes after Bonferroni correction (p ≤ 2.8×10-7), 29 of which replicated in the Framingham Heart Study (FHS) cohort. Notably, 20 of the 29 replicated genes do not have a previously known trait-associated variant in the GWAS Catalog within 50 kb. Thirteen modules in Protein-Protein Interaction (PPI) networks are significantly enriched in genes with low meta-analysis p-values for at least one trait, three of which are replicated in the FHS cohort. The functional annotation of genes in these modules showed a significant over-representation of trait-related biological processes including sterol transport, protein-lipid complex remodeling, and immune response regulation. Among major findings, our results suggest a role of triglyceride-associated and mast-cell functional genes FCER1A, MS4A2, GATA2, HDC, and HRH4 in atherosclerosis risks. Our findings also suggest that lower expression of ATG2A, a gene we found to be associated with BMI, may be both a cause and consequence of obesity. Finally, our results suggest that ENPP3 may play an intermediary role in triglyceride-induced inflammation. Our pipeline is freely available and implemented in the Nextflow workflow language, making it easily runnable on any compute platform (https://nf-co.re/omicsgenetraitassociation).
Collapse
Affiliation(s)
- Sandeep Acharya
- Division of Computational and Data Sciences, Washington University, St Louis, MO
| | - Shu Liao
- Department of Computer Science and Engineering, Washington University, St Louis, MO
| | - Wooseok J Jung
- Department of Computer Science and Engineering, Washington University, St Louis, MO
| | - Yu S Kang
- Department of Computer Science and Engineering, Washington University, St Louis, MO
| | - Vaha A Moghaddam
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Mary Feitosa
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Mary Wojczynski
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Shiow Lin
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Jason A Anema
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Karen Schwander
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Jeff O Connell
- Department of Medicine, University of Maryland, Baltimore, MD
| | - Mike Province
- Division of Statistical Genomics, Washington University School of Medicine, St Louis, MO
| | - Michael R Brent
- Department of Computer Science and Engineering, Washington University, St Louis, MO
| |
Collapse
|
46
|
He D, Liu Y, Dong S, Shen D, Yang X, Hao M, Yin X, He X, Li Y, Wang Y, Liu M, Wang J, Chen X, Cui L. The prognostic value of systematic genetic screening in amyotrophic lateral sclerosis patients. J Neurol 2024; 271:1385-1396. [PMID: 37980296 DOI: 10.1007/s00415-023-12079-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 10/22/2023] [Accepted: 10/23/2023] [Indexed: 11/20/2023]
Abstract
BACKGROUND Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease with complex genetic architecture. Emerging evidence has indicated comorbidity between ALS and autoimmune conditions, suggesting a potential shared genetic basis. The objective of this study is to assess the prognostic value of systematic screening for rare deleterious mutations in genes associated with ALS and aberrant inflammatory responses. METHODS A discovery cohort of 494 patients and a validation cohort of 69 patients were analyzed in this study, with population-matched healthy subjects (n = 4961) served as controls. Whole exome sequencing (WES) was performed to identify rare deleterious variants in 50 ALS genes and 1177 genes associated with abnormal inflammatory responses. Genotype-phenotype correlation was assessed, and an integrative prognostic model incorporating genetic and clinical factors was constructed. RESULTS In the discovery cohort, 8.1% of patients carried confirmed ALS variants, and an additional 15.2% of patients carried novel ALS variants. Gene burden analysis revealed 303 immune-implicated genes with enriched rare variants, and 13.4% of patients harbored rare deleterious variants in these genes. Patients with ALS variants exhibited a more rapid disease progression (HR 2.87 [95% CI 2.03-4.07], p < 0.0001), while no significant effect was observed for immune-implicated variants. The nomogram model incorporating genetic and clinical information demonstrated improved accuracy in predicting disease outcomes (C-index, 0.749). CONCLUSION Our findings enhance the comprehension of the genetic basis of ALS within the Chinese population. It also appears that rare deleterious mutations occurring in immune-implicated genes exert minimal influence on the clinical trajectories of ALS patients.
Collapse
Affiliation(s)
- Di He
- Department of Neurology, Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS & PUMC), Dongcheng District, Beijing, China
| | - Yining Liu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Siqi Dong
- Department of Neurology, Huashan Hospital and Institute of Neurology, National Center for Neurological Disorders, Fudan University, Shanghai, China
| | - Dongchao Shen
- Department of Neurology, Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS & PUMC), Dongcheng District, Beijing, China
| | - Xunzhe Yang
- Department of Neurology, Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS & PUMC), Dongcheng District, Beijing, China
| | - Meng Hao
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Xianhong Yin
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Xinyi He
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Yi Li
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Yi Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
| | - Mingsheng Liu
- Department of Neurology, Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS & PUMC), Dongcheng District, Beijing, China
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China.
- Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Beijing, China.
| | - Xiangjun Chen
- Department of Neurology, Huashan Hospital and Institute of Neurology, National Center for Neurological Disorders, Fudan University, Shanghai, China.
- Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Beijing, China.
| | - Liying Cui
- Department of Neurology, Peking Union Medical College Hospital (PUMCH), Chinese Academy of Medical Sciences and Peking Union Medical College (CAMS & PUMC), Dongcheng District, Beijing, China.
| |
Collapse
|
47
|
Cao X, Liang X, Zhang S, Sha Q. Gene selection by incorporating genetic networks into case-control association studies. Eur J Hum Genet 2024; 32:270-277. [PMID: 36529820 PMCID: PMC10923938 DOI: 10.1038/s41431-022-01264-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Revised: 11/27/2022] [Accepted: 11/30/2022] [Indexed: 12/23/2022] Open
Abstract
Large-scale genome-wide association studies (GWAS) have been successfully applied to a wide range of genetic variants underlying complex diseases. The network-based regression approach has been developed to incorporate a biological genetic network and to overcome the challenges caused by the computational efficiency for analyzing high-dimensional genomic data. In this paper, we propose a gene selection approach by incorporating genetic networks into case-control association studies for DNA sequence data or DNA methylation data. Instead of using traditional dimension reduction techniques such as principal component analyses and supervised principal component analyses, we use a linear combination of genotypes at SNPs or methylation values at CpG sites in a gene to capture gene-level signals. We employ three linear combination approaches: optimally weighted sum (OWS), beta-based weighted sum (BWS), and LD-adjusted polygenic risk score (LD-PRS). OWS and LD-PRS are supervised approaches that depend on the effect of each SNP or CpG site on the case-control status, while BWS can be extracted without using the case-control status. After using one of the linear combinations of genotypes or methylation values in each gene to capture gene-level signals, we regularize them to perform gene selection based on the biological network. Simulation studies show that the proposed approaches have higher true positive rates than using traditional dimension reduction techniques. We also apply our approaches to DNA methylation data and UK Biobank DNA sequence data for analyzing rheumatoid arthritis. The results show that the proposed methods can select potentially rheumatoid arthritis related genes that are missed by existing methods.
Collapse
Affiliation(s)
- Xuewei Cao
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | - Xiaoyu Liang
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, USA
| | - Shuanglin Zhang
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA
| | - Qiuying Sha
- Department of Mathematical Sciences, Michigan Technological University, Houghton, MI, USA.
| |
Collapse
|
48
|
Clay S, Alladina J, Smith NP, Visness CM, Wood RA, O'Connor GT, Cohen RT, Khurana Hershey GK, Kercsmar CM, Gruchalla RS, Gill MA, Liu AH, Kim H, Kattan M, Bacharier LB, Rastogi D, Rivera-Spoljaric K, Robison RG, Gergen PJ, Busse WW, Villani AC, Cho JL, Medoff BD, Gern JE, Jackson DJ, Ober C, Dapas M. Gene-based association study of rare variants in children of diverse ancestries implicates TNFRSF21 in the development of allergic asthma. J Allergy Clin Immunol 2024; 153:809-820. [PMID: 37944567 PMCID: PMC10939893 DOI: 10.1016/j.jaci.2023.10.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 09/25/2023] [Accepted: 10/12/2023] [Indexed: 11/12/2023]
Abstract
BACKGROUND Most genetic studies of asthma and allergy have focused on common variation in individuals primarily of European ancestry. Studying the role of rare variation in quantitative phenotypes and in asthma phenotypes in populations of diverse ancestries can provide additional, important insights into the development of these traits. OBJECTIVE We sought to examine the contribution of rare variants to different asthma- or allergy-associated quantitative traits in children with diverse ancestries and explore their role in asthma phenotypes. METHODS We examined whole-genome sequencing data from children participants in longitudinal studies of asthma (n = 1035; parent-identified as 67% Black and 25% Hispanic) to identify rare variants (minor allele frequency < 0.01). We assigned variants to genes and tested for associations using an omnibus variant-set test between each of 24,902 genes and 8 asthma-associated quantitative traits. On combining our results with external data on predicted gene expression in humans and mouse knockout studies, we identified 3 candidate genes. A burden of rare variants in each gene and in a combined 3-gene score was tested for its associations with clinical phenotypes of asthma. Finally, published single-cell gene expression data in lower airway mucosal cells after allergen challenge were used to assess transcriptional responses to allergen. RESULTS Rare variants in USF1 were significantly associated with blood neutrophil count (P = 2.18 × 10-7); rare variants in TNFRSF21 with total IgE (P = 6.47 × 10-6) and PIK3R6 with eosinophil count (P = 4.10 × 10-5) reached suggestive significance. These 3 findings were supported by independent data from human and mouse studies. A burden of rare variants in TNFRSF21 and in a 3-gene score was associated with allergy-related phenotypes in cohorts of children with mild and severe asthma. Furthermore, TNFRSF21 was significantly upregulated in bronchial basal epithelial cells from adults with allergic asthma but not in adults with allergies (but not asthma) after allergen challenge. CONCLUSIONS We report novel associations between rare variants in genes and allergic and inflammatory phenotypes in children with diverse ancestries, highlighting TNFRSF21 as contributing to the development of allergic asthma.
Collapse
Affiliation(s)
- Selene Clay
- Department of Human Genetics, University of Chicago, Chicago, Ill.
| | - Jehan Alladina
- Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Mass; Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital and Harvard Medical School, Boston, Mass
| | - Neal P Smith
- Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital and Harvard Medical School, Boston, Mass; Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Mass; Massachusetts General Hospital Cancer Center, Boston, Mass
| | | | - Robert A Wood
- Pediatric Allergy and Immunology Department, Johns Hopkins Bloomberg School of Public Health, Baltimore, Md
| | - George T O'Connor
- Department of Pediatrics, Boston University School of Medicine, Boston, Mass
| | - Robyn T Cohen
- Department of Pediatrics, Boston University School of Medicine, Boston, Mass
| | | | - Carolyn M Kercsmar
- Division of Pulmonary Medicine, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Rebecca S Gruchalla
- Internal Medicine and Pediatrics, University of Texas Southwestern Medical Center, Dallas, Tex
| | - Michelle A Gill
- Pediatric Infectious Diseases, St. Louis Children's Hospital, St Louis, Mo
| | - Andrew H Liu
- Breathing Institute, Children's Hospital Colorado, Aurora, Colo
| | - Haejin Kim
- Allergy and Immunology, Henry Ford Health, Detroit, Mich
| | - Meyer Kattan
- Department of Pediatrics, Columbia University Medical Center, New York, NY
| | - Leonard B Bacharier
- Department of Pediatrics, Monroe Carell Jr Children's Hospital at Vanderbilt University Medical Center, Nashville, Tenn
| | - Deepa Rastogi
- Division of Pulmonology and Sleep Medicine, Children's National Hospital, Washington, DC
| | - Katherine Rivera-Spoljaric
- Department of Pediatric Allergy, Immunology, and Pulmonary Medicine, Washington University School of Medicine, St Louis, Mo
| | - Rachel G Robison
- Department of Pediatrics, Monroe Carell Jr Children's Hospital at Vanderbilt University Medical Center, Nashville, Tenn; Ann & Robert H. Lurie Children's Hospital, Chicago, Ill
| | - Peter J Gergen
- National Institute of Allergy and Infectious Diseases, Rockville, Md
| | - William W Busse
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wis
| | - Alexandra-Chloe Villani
- Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital and Harvard Medical School, Boston, Mass; Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Mass; Massachusetts General Hospital Cancer Center, Boston, Mass
| | - Josalyn L Cho
- Division of Pulmonary, Critical Care and Occupational Medicine, University of Iowa Carver College of Medicine, Iowa City, Iowa
| | - Benjamin D Medoff
- Division of Pulmonary and Critical Care Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Mass; Center for Immunology and Inflammatory Diseases, Massachusetts General Hospital and Harvard Medical School, Boston, Mass
| | - James E Gern
- Department of Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, Wis
| | - Daniel J Jackson
- Department of Pediatrics, University of Wisconsin School of Medicine and Public Health, Madison, Wis
| | - Carole Ober
- Department of Human Genetics, University of Chicago, Chicago, Ill
| | - Matthew Dapas
- Department of Human Genetics, University of Chicago, Chicago, Ill
| |
Collapse
|
49
|
Yoon JG, Jang DG, Cho SG, Lee C, Noh SH, Seo SK, Yu JW, Chung HW, Han K, Kwon SS, Han DH, Oh J, Jang IJ, Kim SH, Jee YK, Lee H, Park DW, Sohn JW, Yoon HJ, Kim CH, Lee JM, Kim SH, Lee MG. Synergistic toxicity with copper contributes to NAT2-associated isoniazid toxicity. Exp Mol Med 2024; 56:570-582. [PMID: 38424191 PMCID: PMC10984958 DOI: 10.1038/s12276-024-01172-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 12/12/2023] [Accepted: 12/15/2023] [Indexed: 03/02/2024] Open
Abstract
Anti-tuberculosis (AT) medications, including isoniazid (INH), can cause drug-induced liver injury (DILI), but the underlying mechanism remains unclear. In this study, we aimed to identify genetic factors that may increase the susceptibility of individuals to AT-DILI and to examine genetic interactions that may lead to isoniazid (INH)-induced hepatotoxicity. We performed a targeted sequencing analysis of 380 pharmacogenes in a discovery cohort of 112 patients (35 AT-DILI patients and 77 controls) receiving AT treatment for active tuberculosis. Pharmacogenome-wide association analysis was also conducted using 1048 population controls (Korea1K). NAT2 and ATP7B genotypes were analyzed in a replication cohort of 165 patients (37 AT-DILI patients and 128 controls) to validate the effects of both risk genotypes. NAT2 ultraslow acetylators (UAs) were found to have a greater risk of AT-DILI than other genotypes (odds ratio [OR] 5.6 [95% confidence interval; 2.5-13.2], P = 7.2 × 10-6). The presence of ATP7B gene 832R/R homozygosity (rs1061472) was found to co-occur with NAT2 UA in AT-DILI patients (P = 0.017) and to amplify the risk in NAT2 UA (OR 32.5 [4.5-1423], P = 7.5 × 10-6). In vitro experiments using human liver-derived cell lines (HepG2 and SNU387 cells) revealed toxic synergism between INH and Cu, which were strongly augmented in cells with defective NAT2 and ATP7B activity, leading to increased mitochondrial reactive oxygen species generation, mitochondrial dysfunction, DNA damage, and apoptosis. These findings link the co-occurrence of ATP7B and NAT2 genotypes to the risk of INH-induced hepatotoxicity, providing novel mechanistic insight into individual AT-DILI susceptibility. Yoon et al. showed that individuals who carry NAT2 UAs and ATP7B 832R/R genotypes are at increased risk of developing isoniazid hepatotoxicity, primarily due to the increased synergistic toxicity between isoniazid and copper, which exacerbates mitochondrial dysfunction-related apoptosis.
Collapse
Affiliation(s)
- Jihoon G Yoon
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
- Department of Genomic Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Dong Geon Jang
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Sung-Gyu Cho
- Department of Microbiology and Immunology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Chaeyoung Lee
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Shin Hye Noh
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Soo Kyung Seo
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jung Woo Yu
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Hyeon Woo Chung
- Department of Microbiology and Immunology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - KyeoRe Han
- Department of Microbiology and Immunology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Soon Sung Kwon
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
- Department of Laboratory Medicine, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Dai Hoon Han
- Division of Hepatobiliary and Pancreatic Surgery, Department of Surgery, Yonsei University College of Medicine, Seoul, Korea
| | - Jaeseong Oh
- Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, Seoul, Republic of Korea
| | - In-Jin Jang
- Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, Seoul, Republic of Korea
| | - Sang-Hoon Kim
- Department of Internal Medicine, Eulji University School of Medicine, Seoul, Republic of Korea
| | - Young-Koo Jee
- Department of Internal Medicine, Dankook University College of Medicine, Cheonan, Republic of Korea
| | - Hyun Lee
- Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea
| | - Dong Won Park
- Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea
| | - Jang Won Sohn
- Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea
| | - Ho Joo Yoon
- Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea
| | - Chul Hoon Kim
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jae Myun Lee
- Department of Microbiology and Immunology, Yonsei University College of Medicine, Seoul, Republic of Korea.
| | - Sang-Heon Kim
- Department of Internal Medicine, Hanyang University College of Medicine, Seoul, Republic of Korea.
| | - Min Goo Lee
- Department of Pharmacology, BK21 Project of Yonsei Advanced Medical Science, Woo Choo Lee Institute for Precision Drug Development, Yonsei University College of Medicine, Seoul, Republic of Korea.
- Severance Biomedical Science Institute, Yonsei University College of Medicine, Seoul, Republic of Korea.
| |
Collapse
|
50
|
Pathan N, Deng WQ, Di Scipio M, Khan M, Mao S, Morton RW, Lali R, Pigeyre M, Chong MR, Paré G. A method to estimate the contribution of rare coding variants to complex trait heritability. Nat Commun 2024; 15:1245. [PMID: 38336875 PMCID: PMC10858280 DOI: 10.1038/s41467-024-45407-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 01/22/2024] [Indexed: 02/12/2024] Open
Abstract
It has been postulated that rare coding variants (RVs; MAF < 0.01) contribute to the "missing" heritability of complex traits. We developed a framework, the Rare variant heritability (RARity) estimator, to assess RV heritability (h2RV) without assuming a particular genetic architecture. We applied RARity to 31 complex traits in the UK Biobank (n = 167,348) and showed that gene-level RV aggregation suffers from 79% (95% CI: 68-93%) loss of h2RV. Using unaggregated variants, 27 traits had h2RV > 5%, with height having the highest h2RV at 21.9% (95% CI: 19.0-24.8%). The total heritability, including common and rare variants, recovered pedigree-based estimates for 11 traits. RARity can estimate gene-level h2RV, enabling the assessment of gene-level characteristics and revealing 11, previously unreported, gene-phenotype relationships. Finally, we demonstrated that in silico pathogenicity prediction (variant-level) and gene-level annotations do not generally enrich for RVs that over-contribute to complex trait variance, and thus, innovative methods are needed to predict RV functionality.
Collapse
Affiliation(s)
- Nazia Pathan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, Canada
| | - Wei Q Deng
- Peter Boris Centre for Addictions Research, St. Joseph's Healthcare Hamilton, Hamilton, Canada
- Department of Psychiatry and Behavioural Neurosciences, McMaster University, Hamilton, Canada
| | - Matteo Di Scipio
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Mohammad Khan
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Shihong Mao
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
| | - Robert W Morton
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, Canada
| | - Ricky Lali
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada
| | - Marie Pigeyre
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Medicine, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Michael R Chong
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, Canada
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada
| | - Guillaume Paré
- Population Health Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton Health Sciences and McMaster University, Hamilton, Canada.
- Department of Pathology and Molecular Medicine, McMaster University, Michael G. DeGroote School of Medicine, Hamilton, Canada.
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada.
- Thrombosis and Atherosclerosis Research Institute, David Braley Cardiac, Vascular and Stroke Research Institute, Hamilton, Canada.
| |
Collapse
|