1
|
Sun WX, Chang XY, Chen Y, Zhao Q, Zhang YM. The integration of quantile regression with 3VmrMLM identifies more QTNs and QTN-by-environment interactions using SNP- and haplotype-based markers. PLANT COMMUNICATIONS 2024:101196. [PMID: 39580620 DOI: 10.1016/j.xplc.2024.101196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 10/11/2024] [Accepted: 11/20/2024] [Indexed: 11/26/2024]
Abstract
Current methods used in genome-wide association studies frequently lack power owing to their inability to detect heterogeneous associations and rare and multiallelic variants. To address these issues, quantile regression is integrated with a three (compressed) variance component multi-locus random-SNP-effect mixed linear model (3VmrMLM) to propose q3VmrMLM for detecting heterogeneous quantitative trait nucleotides (QTNs) and QTN-by-environment interactions (QEIs), and then design haplotype-based q3VmrMLM (q3VmrMLM-Hap) for identifying multiallelic haplotypes and rare variants. In Monte Carlo simulation studies, q3VmrMLM had higher power than 3VmrMLM, sequence kernel association test (SKAT), and integrated quantile rank test (iQRAT). In a re-analysis of 10 traits in 1439 rice hybrids, 261 known genes were identified only by q3VmrMLM and q3VmrMLM-Hap, whereas 175 known genes were detected by both the new and existing methods. Of all the significant QTNs with known genes, q3VmrMLM (179: 140 variance heterogeneity and 157 quantile effect heterogeneity) found more heterogeneous QTNs than 3VmrMLM (123), SKAT (27), and iQRAT (29); q3VmrMLM-Hap (121) mapped more low-frequency (<0.05) QTNs than q3VmrMLM (51), 3VmrMLM (43), SKAT (11), and iQRAT (12); and q3VmrMLM-Hap (12), q3VmrMLM (16), and 3VmrMLM (12) had similar power in identifying gene-by-environment interactions. All significant and suggested QTNs achieved the highest predictive accuracy (r = 0.9045). In conclusion, this study describes a new and complementary approach to mining genes and unraveling the genetic architecture of complex traits in crops.
Collapse
Affiliation(s)
- Wen-Xian Sun
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiao-Yu Chang
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Ying Chen
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Qiong Zhao
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuan-Ming Zhang
- College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China.
| |
Collapse
|
2
|
Freda PJ, Ghosh A, Bhandary P, Matsumoto N, Chitre AS, Zhou J, Hall MA, Palmer AA, Obafemi-Ajayi T, Moore JH. PAGER: A novel genotype encoding strategy for modeling deviations from additivity in complex trait association studies. BioData Min 2024; 17:41. [PMID: 39394173 PMCID: PMC11468469 DOI: 10.1186/s13040-024-00393-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 09/30/2024] [Indexed: 10/13/2024] Open
Abstract
BACKGROUND The additive model of inheritance assumes that heterozygotes (Aa) are exactly intermediate in respect to homozygotes (AA and aa). While this model is commonly used in single-locus genetic association studies, significant deviations from additivity are well-documented and contribute to phenotypic variance across many traits and systems. This assumption can introduce type I and type II errors by overestimating or underestimating the effects of variants that deviate from additivity. Alternative genotype encoding strategies have been explored to account for different inheritance patterns, but they often incur significant computational or methodological costs. To address these challenges, we introduce PAGER (Phenotype Adjusted Genotype Encoding and Ranking), an efficient pre-processing method that encodes each genetic variant based on normalized mean phenotypic differences between diallelic genotype classes (AA, Aa, and aa). This approach more accurately reflects each variant's true inheritance model, improving model precision while minimizing the costs associated with alternative encoding strategies. RESULTS Through extensive benchmarking on SNPs simulated with both binary and continuous phenotypes, we demonstrate that PAGER accurately represents various inheritance patterns (including additive, dominant, recessive, and heterosis), achieves levels of statistical power that meet or exceed other encoding strategies, and attains computation speeds up to 55 times faster than a similar method, EDGE. We also apply PAGER to publicly available real-world data and identify a novel, relevant putative QTL associated with body mass index in rats (Rattus norvegicus) that is not detected with the additive model. CONCLUSIONS Overall, we show that PAGER is an efficient genotype encoding approach that can uncover sources of missing heritability and reveal novel insights in the study of complex traits while incurring minimal costs.
Collapse
Affiliation(s)
- Philip J Freda
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vincente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
| | - Attri Ghosh
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vincente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
| | - Priyanka Bhandary
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vincente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
| | - Nicholas Matsumoto
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vincente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA
| | - Apurva S Chitre
- Department of Psychiatry, University of California San Diego, 9500 Gilman Dr, La Jolla, CA, 92093-0667, USA
| | - Jiayan Zhou
- Department of Medicine, Stanford University School of Medicine, 291 Campus Dr., Li Ka Shing Building, Stanford, CA, 94305, USA
| | - Molly A Hall
- Department of Genetics and Institute for Biomedical Informatics, University of Pennsylvania, 3700 Hamilton Walk, Richards Building A301, Philadelphia, PA, 19104, USA
| | - Abraham A Palmer
- Department of Psychiatry, University of California San Diego, 9500 Gilman Dr, La Jolla, CA, 92093-0667, USA
- Institute for Genomic Medicine, University of California San Diego, 9500 Gilman Dr, La Jolla, CA, 92093-0667, USA
| | - Tayo Obafemi-Ajayi
- Cooperative Engineering Program, Missouri State University, 901 S. National Ave, Springfield, MO, 65897, USA
| | - Jason H Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, 700 N. San Vincente Blvd., Pacific Design Center, Suite G540, West Hollywood, CA, 90069, USA.
| |
Collapse
|
3
|
de Oliveira LF, Brito LF, Marques DBD, da Silva DA, Lopes PS, Dos Santos CG, Johnson JS, Veroneze R. Investigating the impact of non-additive genetic effects in the estimation of variance components and genomic predictions for heat tolerance and performance traits in crossbred and purebred pig populations. BMC Genom Data 2023; 24:76. [PMID: 38093199 PMCID: PMC10717470 DOI: 10.1186/s12863-023-01174-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/13/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Non-additive genetic effects are often ignored in livestock genetic evaluations. However, fitting them in the models could improve the accuracy of genomic breeding values. Furthermore, non-additive genetic effects contribute to heterosis, which could be optimized through mating designs. Traits related to fitness and adaptation, such as heat tolerance, tend to be more influenced by non-additive genetic effects. In this context, the primary objectives of this study were to estimate variance components and assess the predictive performance of genomic prediction of breeding values based on alternative models and two independent datasets, including performance records from a purebred pig population and heat tolerance indicators recorded in crossbred lactating sows. RESULTS Including non-additive genetic effects when modelling performance traits in purebred pigs had no effect on the residual variance estimates for most of the traits, but lower additive genetic variances were observed, especially when additive-by-additive epistasis was included in the models. Furthermore, including non-additive genetic effects did not improve the prediction accuracy of genomic breeding values, but there was animal re-ranking across the models. For the heat tolerance indicators recorded in a crossbred population, most traits had small non-additive genetic variance with large standard error estimates. Nevertheless, panting score and hair density presented substantial additive-by-additive epistatic variance. Panting score had an epistatic variance estimate of 0.1379, which accounted for 82.22% of the total genetic variance. For hair density, the epistatic variance estimates ranged from 0.1745 to 0.1845, which represent 64.95-69.59% of the total genetic variance. CONCLUSIONS Including non-additive genetic effects in the models did not improve the accuracy of genomic breeding values for performance traits in purebred pigs, but there was substantial re-ranking of selection candidates depending on the model fitted. Except for panting score and hair density, low non-additive genetic variance estimates were observed for heat tolerance indicators in crossbred pigs.
Collapse
Affiliation(s)
- Letícia Fernanda de Oliveira
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil.
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA.
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA
| | | | | | - Paulo Sávio Lopes
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| | | | - Jay S Johnson
- USDA-ARS Livestock Behavior Research Unit, West Lafayette, IN, USA
| | - Renata Veroneze
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| |
Collapse
|
4
|
Cui L, Yang B, Xiao S, Gao J, Baud A, Graham D, McBride M, Dominiczak A, Schafer S, Aumatell RL, Mont C, Teruel AF, Hübner N, Flint J, Mott R, Huang L. Dominance is common in mammals and is associated with trans-acting gene expression and alternative splicing. Genome Biol 2023; 24:215. [PMID: 37773188 PMCID: PMC10540365 DOI: 10.1186/s13059-023-03060-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 09/18/2023] [Indexed: 10/01/2023] Open
Abstract
BACKGROUND Dominance and other non-additive genetic effects arise from the interaction between alleles, and historically these phenomena play a major role in quantitative genetics. However, most genome-wide association studies (GWAS) assume alleles act additively. RESULTS We systematically investigate both dominance-here representing any non-additive within-locus interaction-and additivity across 574 physiological and gene expression traits in three mammalian stocks: F2 intercross pigs, rat heterogeneous stock, and mice heterogeneous stock. Dominance accounts for about one quarter of heritable variance across all physiological traits in all species. Hematological and immunological traits exhibit the highest dominance variance, possibly reflecting balancing selection in response to pathogens. Although most quantitative trait loci (QTLs) are detectable as additive QTLs, we identify 154, 64, and 62 novel dominance QTLs in pigs, rats, and mice respectively that are undetectable as additive QTLs. Similarly, even though most cis-acting expression QTLs are additive, gene expression exhibits a large fraction of dominance variance, and trans-acting eQTLs are enriched for dominance. Genes causal for dominance physiological QTLs are less likely to be physically linked to their QTLs but instead act via trans-acting dominance eQTLs. In addition, thousands of eQTLs are associated with alternatively spliced isoforms with complex additive and dominant architectures in heterogeneous stock rats, suggesting a possible mechanism for dominance. CONCLUSIONS Although heritability is predominantly additive, many mammalian genetic effects are dominant and likely arise through distinct mechanisms. It is therefore advantageous to consider both additive and dominance effects in GWAS to improve power and uncover causality.
Collapse
Affiliation(s)
- Leilei Cui
- National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People's Republic of China
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK
- Human Aging Research Institute and School of Life Science, Nanchang University, and Jiangxi Key Laboratory of Human Aging, Jiangxi, China
- School of Life Sciences, Nanchang University, Nanchang, China
| | - Bin Yang
- National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People's Republic of China
| | - Shijun Xiao
- National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People's Republic of China
| | - Jun Gao
- National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People's Republic of China
| | - Amelie Baud
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Delyth Graham
- BHF Glasgow Cardiovascular Research Centre, University of Glasgow, Glasgow, G12 8TA, UK
| | - Martin McBride
- BHF Glasgow Cardiovascular Research Centre, University of Glasgow, Glasgow, G12 8TA, UK
| | - Anna Dominiczak
- BHF Glasgow Cardiovascular Research Centre, University of Glasgow, Glasgow, G12 8TA, UK
| | - Sebastian Schafer
- Cardiovascular and Metabolic Disorders Program, Duke-National University of Singapore Medical School, Singapore, Singapore
| | - Regina Lopez Aumatell
- Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Carme Mont
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Albert Fernandez Teruel
- Departamento de Psiquiatría y Medicina Legal, Universitat Autonoma de Barcelona, Barcelona, Spain
| | - Norbert Hübner
- Genetics and Genomics of Cardiovascular Diseases Research Group, Max Delbrück Center (MDC) for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- DZHK (German Center for Cardiovascular Research) Partner Site Berlin, Berlin, Germany
- Charité Universitätsmedizin Berlin, Berlin, Germany
| | - Jonathan Flint
- Department of Psychiatry and Behavioral Sciences, Brain Research Institute, University of California, Los Angeles, CA, USA
| | - Richard Mott
- UCL Genetics Institute, University College London, London, WC1E 6BT, UK.
| | - Lusheng Huang
- National Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, People's Republic of China.
| |
Collapse
|