1
|
Qi T, Song L, Guo Y, Chen C, Yang J. From genetic associations to genes: methods, applications, and challenges. Trends Genet 2024; 40:642-667. [PMID: 38734482 DOI: 10.1016/j.tig.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 04/15/2024] [Accepted: 04/16/2024] [Indexed: 05/13/2024]
Abstract
Genome-wide association studies (GWASs) have identified numerous genetic loci associated with human traits and diseases. However, pinpointing the causal genes remains a challenge, which impedes the translation of GWAS findings into biological insights and medical applications. In this review, we provide an in-depth overview of the methods and technologies used for prioritizing genes from GWAS loci, including gene-based association tests, integrative analysis of GWAS and molecular quantitative trait loci (xQTL) data, linking GWAS variants to target genes through enhancer-gene connection maps, and network-based prioritization. We also outline strategies for generating context-dependent xQTL data and their applications in gene prioritization. We further highlight the potential of gene prioritization in drug repurposing. Lastly, we discuss future challenges and opportunities in this field.
Collapse
Affiliation(s)
- Ting Qi
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China; School of Life Sciences, Westlake University, Hangzhou 310024, China.
| | - Liyang Song
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China; School of Life Sciences, Westlake University, Hangzhou 310024, China
| | - Yazhou Guo
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China; School of Life Sciences, Westlake University, Hangzhou 310024, China
| | - Chang Chen
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China; School of Life Sciences, Westlake University, Hangzhou 310024, China
| | - Jian Yang
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, China; School of Life Sciences, Westlake University, Hangzhou 310024, China.
| |
Collapse
|
2
|
Wirojsirasak W, Songsri P, Jongrungklang N, Tangphatsornruang S, Klomsa-ard P, Ukoskit K. A Large-Scale Candidate-Gene Association Mapping for Drought Tolerance and Agronomic Traits in Sugarcane. Int J Mol Sci 2023; 24:12801. [PMID: 37628982 PMCID: PMC10454574 DOI: 10.3390/ijms241612801] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 08/09/2023] [Accepted: 08/12/2023] [Indexed: 08/27/2023] Open
Abstract
Dissection of the genetic loci controlling drought tolerance traits with a complex genetic inheritance is important for drought-tolerant sugarcane improvement. In this study, we conducted a large-scale candidate gene association study of 649 candidate genes in a sugarcane diversity panel to identify genetic variants underlying agronomic traits and drought tolerance indices evaluated in plant cane and ratoon cane under water-stressed (WS) and non-stressed (NS) environments. We identified 197 significant marker-trait associations (MTAs) in 141 candidate genes associated with 18 evaluated traits with the Bonferroni correction threshold (α = 0.05). Out of the total, 95 MTAs in 78 candidate genes and 62 MTAs in 58 candidate genes were detected under NS and WS conditions, respectively. Most MTAs were found only in specific water regimes and crop seasons. These MTAs explained 7.93-30.52% of phenotypic variation. Association mapping results revealed that 34, 59, and 104 MTAs involved physiological and molecular adaptation, phytohormone metabolism, and drought-inducible genes. They identified 19 pleiotropic genes associated with more than one trait and many genes related to drought tolerance indices. The genetic and genomic resources identified in this study will enable the combining of yield-related traits and sugar-related traits with agronomic value to optimize the yield of sugarcane cultivars grown under drought-stressed and non-stressed environments.
Collapse
Affiliation(s)
- Warodom Wirojsirasak
- Department of Biotechnology, Faculty of Science and Technology, Rangsit Campus, Thammasat University, Pathum Thani 12120, Thailand;
- Mitr Phol Innovation and Research Center, Chaiyaphum 36110, Thailand;
| | - Patcharin Songsri
- Department of Agronomy, Faculty of Agriculture, Khon Kaen University, Khon Kaen 40002, Thailand; (P.S.); (N.J.)
- Northeast Thailand Cane and Sugar Research Center, Faculty of Agriculture, Khon Kaen University, Khon Kaen 40002, Thailand
| | - Nakorn Jongrungklang
- Department of Agronomy, Faculty of Agriculture, Khon Kaen University, Khon Kaen 40002, Thailand; (P.S.); (N.J.)
- Northeast Thailand Cane and Sugar Research Center, Faculty of Agriculture, Khon Kaen University, Khon Kaen 40002, Thailand
| | - Sithichoke Tangphatsornruang
- National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Pathum Thani 12120, Thailand;
| | | | - Kittipat Ukoskit
- Department of Biotechnology, Faculty of Science and Technology, Rangsit Campus, Thammasat University, Pathum Thani 12120, Thailand;
| |
Collapse
|
3
|
Monti R, Ohler U. Toward Identification of Functional Sequences and Variants in Noncoding DNA. Annu Rev Biomed Data Sci 2023; 6:191-210. [PMID: 37262323 DOI: 10.1146/annurev-biodatasci-122120-110102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Understanding the noncoding part of the genome, which encodes gene regulation, is necessary to identify genetic mechanisms of disease and translate findings from genome-wide association studies into actionable results for treatments and personalized care. Here we provide an overview of the computational analysis of noncoding regions, starting from gene-regulatory mechanisms and their representation in data. Deep learning methods, when applied to these data, highlight important regulatory sequence elements and predict the functional effects of genetic variants. These and other algorithms are used to predict damaging sequence variants. Finally, we introduce rare-variant association tests that incorporate functional annotations and predictions in order to increase interpretability and statistical power.
Collapse
Affiliation(s)
- Remo Monti
- Max Delbrück Center for Molecular Medicine (MDC), Helmholtz Association of German Research Centers, Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany;
- Digital Health-Machine Learning, Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | - Uwe Ohler
- Max Delbrück Center for Molecular Medicine (MDC), Helmholtz Association of German Research Centers, Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany;
| |
Collapse
|
4
|
Ma S, Wang C, Khan A, Liu L, Dalgleish J, Kiryluk K, He Z, Ionita-Laza I. BIGKnock: fine-mapping gene-based associations via knockoff analysis of biobank-scale data. Genome Biol 2023; 24:24. [PMID: 36782330 PMCID: PMC9926792 DOI: 10.1186/s13059-023-02864-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 01/23/2023] [Indexed: 02/15/2023] Open
Abstract
We propose BIGKnock (BIobank-scale Gene-based association test via Knockoffs), a computationally efficient gene-based testing approach for biobank-scale data, that leverages long-range chromatin interaction data, and performs conditional genome-wide testing via knockoffs. BIGKnock can prioritize causal genes over proxy associations at a locus. We apply BIGKnock to the UK Biobank data with 405,296 participants for multiple binary and quantitative traits, and show that relative to conventional gene-based tests, BIGKnock produces smaller sets of significant genes that contain the causal gene(s) with high probability. We further illustrate its ability to pinpoint potential causal genes at [Formula: see text] of the associated loci.
Collapse
Affiliation(s)
- Shiyang Ma
- Department of Biostatistics, Columbia University, New York, NY, USA
- Clinical Research Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Chen Wang
- Department of Biostatistics, Columbia University, New York, NY, USA
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY, USA
| | - Linxi Liu
- Department of Statistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - James Dalgleish
- Department of Biostatistics, Columbia University, New York, NY, USA
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University, New York, NY, USA
| | - Zihuai He
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, USA
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA, USA
| | | |
Collapse
|
5
|
Schormair B, Zhao C, Salminen AV, Oexle K, Winkelmann J. Reassessment of candidate gene studies for idiopathic restless legs syndrome in a large genome-wide association study dataset of European ancestry. Sleep 2022; 45:6576194. [PMID: 35486972 PMCID: PMC9366638 DOI: 10.1093/sleep/zsac098] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 04/06/2022] [Indexed: 11/12/2022] Open
Abstract
Abstract
Study Objectives
Several candidate gene studies have been published for idiopathic restless legs syndrome (RLS) in populations of European ancestry, but the reported associations have not been confirmed in independent samples. Our aim was to reassess these findings in a large case–control dataset in order to evaluate their validity.
Methods
We screened PubMed for RLS candidate gene studies. We used the genome-wide association study (GWAS) dataset of the International EU-RLS-GENE Consortium as our replication sample, which provided genome-wide single-variant association data based on at most 17 220 individuals of European ancestry. We performed additional gene-based tests using the software MAGMA and assessed the power of our study using the genpwr R package.
Results
We identified 14 studies conducted in European samples which assessed 45 variants in 27 genes of which 5 variants had been reported as significantly associated. None of these individual variants were replicated in our GWAS-based reassessment (nominal p > 0.05) and gene-based tests for the respective five genes ADH1B, GABRR3, HMOX1, MAOA, and VDR, were also nonsignificant (nominal p > 0.05). Our replication dataset was well powered to detect the reported effects, even when adjusting for effect size overestimation due to winner’s curse. Power estimates were close to 100% for all variants.
Conclusion
In summary, none of the significant single-variant associations from candidate gene studies were confirmed in our GWAS dataset. Therefore, these associations were likely false positive. Our observations emphasize the need for large sample sizes and stringent significance thresholds in future association studies for RLS.
Collapse
Affiliation(s)
- Barbara Schormair
- Institute of Neurogenomics, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH) , Neuherberg , Germany
- Institute of Human Genetics, School of Medicine, Technical University of Munich , Munich , Germany
| | - Chen Zhao
- Institute of Neurogenomics, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH) , Neuherberg , Germany
| | - Aaro V Salminen
- Institute of Neurogenomics, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH) , Neuherberg , Germany
| | - Konrad Oexle
- Institute of Neurogenomics, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH) , Neuherberg , Germany
- Institute of Human Genetics, School of Medicine, Technical University of Munich , Munich , Germany
| | - Juliane Winkelmann
- Institute of Neurogenomics, Helmholtz Zentrum München, German Research Center for Environmental Health (GmbH) , Neuherberg , Germany
- Institute of Human Genetics, School of Medicine, Technical University of Munich , Munich , Germany
- Chair of Neurogenetics, School of Medicine, Technical University of Munich , Munich , Germany
- Munich Cluster for Systems Neurology (SyNergy) , Munich , Germany
| | | |
Collapse
|