1
|
Zhao Z, Niu Q, Wu J, Wu T, Xie X, Wang Z, Zhang L, Gao H, Gao X, Xu L, Zhu B, Li J. Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle. Biol Direct 2024; 19:147. [PMID: 39741345 DOI: 10.1186/s13062-024-00574-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 12/02/2024] [Indexed: 01/02/2025] Open
Abstract
BACKGROUND Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging. METHODS We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis. RESULTS We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database. CONCLUSIONS Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.
Collapse
Affiliation(s)
- Zhida Zhao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Qunhao Niu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jiayuan Wu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Tianyi Wu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Xueyuan Xie
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Zezhao Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lupei Zhang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Huijiang Gao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Xue Gao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lingyang Xu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Bo Zhu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
- Northern Agriculture and Livestock Husbandry Technology Innovation Center, Hohhot, 010010, China.
| | - Junya Li
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| |
Collapse
|
2
|
de Ronne M, Abed A, Légaré G, Laroche J, Boucher St-Amour VT, Fortier É, Beattie A, Badea A, Khanal R, O'Donoughue L, Rajcan I, Belzile F, Boyle B, Torkamaneh D. Integrating targeted genetic markers to genotyping-by-sequencing for an ultimate genotyping tool. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:247. [PMID: 39365439 DOI: 10.1007/s00122-024-04750-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 09/18/2024] [Indexed: 10/05/2024]
Abstract
New selection methods, using trait-specific markers (marker-assisted selection (MAS)) and/or genome-wide markers (genomic selection (GS)), are becoming increasingly widespread in breeding programs. This new era requires innovative and cost-efficient solutions for genotyping. Reduction in sequencing cost has enhanced the use of high-throughput low-cost genotyping methods such as genotyping-by-sequencing (GBS) for genome-wide single-nucleotide polymorphism (SNP) profiling in large breeding populations. However, the major weakness of GBS methodologies is their inability to genotype targeted markers. Conversely, targeted methods, such as amplicon sequencing (AmpSeq), often face cost constraints, hindering genome-wide genotyping across a large cohort. Although GBS and AmpSeq data can be generated from the same sample, an efficient method to achieve this is lacking. In this study, we present the Genome-wide & Targeted Amplicon (GTA) genotyping platform, an innovative way to integrate multiplex targeted amplicons into the GBS library preparation to provide an all-in-one cost-effective genotyping solution to breeders and research communities. Custom primers were designed to target 23 and 36 high-value markers associated with key agronomical traits in soybean and barley, respectively. The resulting multiplex amplicons were compatible with the GBS library preparation enabling both GBS and targeted genotyping data to be produced efficiently and cost-effectively. To facilitate data analysis, we have introduced Fast-GBS.v3, a user-friendly bioinformatic pipeline that generates comprehensive outputs from data obtained following sequencing of GTA libraries. This high-throughput low-cost approach will greatly facilitate the application of DNA markers as it provides required markers for both MAS and GS in a single assay.
Collapse
Affiliation(s)
- Maxime de Ronne
- Département de Phytologie, Université Laval, Québec, Canada
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
- Centre de Recherche Et d'innovation Sur Les Végétaux (CRIV), Université Laval, Québec, Canada
| | - Amina Abed
- Consortium de Recherche Sur La Pomme de Terre du Québec (CRPTQ), Québec, Canada
| | - Gaétan Légaré
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
| | - Jérôme Laroche
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
| | - Vincent-Thomas Boucher St-Amour
- Département de Phytologie, Université Laval, Québec, Canada
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
- Centre de Recherche Et d'innovation Sur Les Végétaux (CRIV), Université Laval, Québec, Canada
| | - Éric Fortier
- Centre de Recherche Sur Les Grains (CÉROM), Saint-Mathieu-de-Beloeil, Québec, Canada
| | - Aaron Beattie
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Canada
| | - Ana Badea
- Agriculture and Agri-Food Canada, Brandon Research and Development Centre, Brandon, Canada
| | - Raja Khanal
- Agriculture and Agri-Food Canada, Ottawa Research and Development Center, Ottawa, Canada
| | - Louise O'Donoughue
- Centre de Recherche Sur Les Grains (CÉROM), Saint-Mathieu-de-Beloeil, Québec, Canada
| | - Istvan Rajcan
- Department of Plant Agriculture, University of Guelph, Guelph, Canada
| | - François Belzile
- Département de Phytologie, Université Laval, Québec, Canada
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
- Centre de Recherche Et d'innovation Sur Les Végétaux (CRIV), Université Laval, Québec, Canada
| | - Brian Boyle
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada
| | - Davoud Torkamaneh
- Département de Phytologie, Université Laval, Québec, Canada.
- Institut de Biologie Intégrative Et Des Systèmes (IBIS), Université Laval, Québec, Canada.
- Centre de Recherche Et d'innovation Sur Les Végétaux (CRIV), Université Laval, Québec, Canada.
- Institut Intelligence Et Données (IID), Université Laval, Québec, Canada.
| |
Collapse
|
3
|
Teodoro M, Maiorano AM, Campos GS, de Albuquerque LG, de Oliveira HN. Genetic parameters, genomic prediction, and identification of regulatory regions located on chromosome 14 for weight traits in Nellore cattle. J Anim Breed Genet 2024. [PMID: 39189106 DOI: 10.1111/jbg.12895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 08/06/2024] [Accepted: 08/11/2024] [Indexed: 08/28/2024]
Abstract
This study aimed to investigate functional variants in chromosome 14 (BTA14) and its impact in genomic selection for birth weight (BW), weaning weight (WW), and yearling weight (YW) in Nellore cattle. Genetic parameter estimation and the weighted single-step genomic best linear unbiased prediction (WssGBLUP) analyses were performed. Direct additive heritability estimates were high for WW and YW, and moderate for BW. Trait-associated variants distributed across multiple regions on BTA14 were observed in the weighted single-step genome-wide association studies (WssGWAS) results, implying a polygenic genetic architecture for weight in different ages. Several genes have been found in association with the weight traits, including the CUB And Sushi multiple domains 3 (CSMD3), thyroglobulin (TG), and diacylglycerol O-acyltransferase 1 (DGAT1) genes. The variance explained per SNP was higher in six functional classes of gene regulatory regions (5UTR, CpG islands, downstream, upstream, long non-coding RNA, and transcription factor binding sites (TFBS)), highlighting their importance for weight traits in Nellore cattle. A marginal increase in accuracy was observed when the selected functional variants (SV) information was considered in the WssGBLUP method, probably because of the small number of SV available on BTA14. The identified genes, pathways, and functions contribute to a better understanding of the genetic and physiological mechanisms regulating weight traits in the Nellore breed.
Collapse
Affiliation(s)
- Miller Teodoro
- Department of Animal Science, São Paulo State University, Jaboticabal, Brazil
| | | | | | | | | |
Collapse
|
4
|
de Ronne M, Légaré G, Belzile F, Boyle B, Torkamaneh D. 3D-GBS: a universal genotyping-by-sequencing approach for genomic selection and other high-throughput low-cost applications in species with small to medium-sized genomes. PLANT METHODS 2023; 19:13. [PMID: 36740716 PMCID: PMC9899395 DOI: 10.1186/s13007-023-00990-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 01/31/2023] [Indexed: 06/18/2023]
Abstract
Despite the increased efficiency of sequencing technologies and the development of reduced-representation sequencing (RRS) approaches allowing high-throughput sequencing (HTS) of multiplexed samples, the per-sample genotyping cost remains the most limiting factor in the context of large-scale studies. For example, in the context of genomic selection (GS), breeders need genome-wide markers to predict the breeding value of large cohorts of progenies, requiring the genotyping of thousands candidates. Here, we introduce 3D-GBS, an optimized GBS procedure, to provide an ultra-high-throughput and ultra-low-cost genotyping solution for species with small to medium-sized genome and illustrate its use in soybean. Using a combination of three restriction enzymes (PstI/NsiI/MspI), the portion of the genome that is captured was reduced fourfold (compared to a "standard" ApeKI-based protocol) while reducing the number of markers by only 40%. By better focusing the sequencing effort on limited set of restriction fragments, fourfold more samples can be genotyped at the same minimal depth of coverage. This GBS protocol also resulted in a lower proportion of missing data and provided a more uniform distribution of SNPs across the genome. Moreover, we investigated the optimal number of reads per sample needed to obtain an adequate number of markers for GS and QTL mapping (500-1000 markers per biparental cross). This optimization allows sequencing costs to be decreased by ~ 92% and ~ 86% for GS and QTL mapping studies, respectively, compared to previously published work. Overall, 3D-GBS represents a unique and affordable solution for applications requiring extremely high-throughput genotyping where cost remains the most limiting factor.
Collapse
Affiliation(s)
- Maxime de Ronne
- Département de Phytologie, Université Laval, Quebec, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec, Canada
- Centre de recherche et d'innovation sur les végétaux (CRIV), Université Laval, Quebec, Canada
| | - Gaétan Légaré
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec, Canada
| | - François Belzile
- Département de Phytologie, Université Laval, Quebec, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec, Canada
- Centre de recherche et d'innovation sur les végétaux (CRIV), Université Laval, Quebec, Canada
| | - Brian Boyle
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec, Canada
| | - Davoud Torkamaneh
- Département de Phytologie, Université Laval, Quebec, Canada.
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Quebec, Canada.
- Centre de recherche et d'innovation sur les végétaux (CRIV), Université Laval, Quebec, Canada.
- Institut intelligence et données (IID), Université Laval, Quebec, Canada.
| |
Collapse
|
5
|
Rare and population-specific functional variation across pig lines. Genet Sel Evol 2022; 54:39. [PMID: 35659233 PMCID: PMC9164375 DOI: 10.1186/s12711-022-00732-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 05/17/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND It is expected that functional, mainly missense and loss-of-function (LOF), and regulatory variants are responsible for most phenotypic differences between breeds and genetic lines of livestock species that have undergone diverse selection histories. However, there is still limited knowledge about the existing missense and LOF variation in commercial livestock populations, in particular regarding population-specific variation and how it can affect applications such as across-breed genomic prediction. METHODS We re-sequenced the whole genome of 7848 individuals from nine commercial pig lines (average sequencing coverage: 4.1×) and imputed whole-genome genotypes for 440,610 pedigree-related individuals. The called variants were categorized according to predicted functional annotation (from LOF to intergenic) and prevalence level (number of lines in which the variant segregated; from private to widespread). Variants in each category were examined in terms of their distribution along the genome, alternative allele frequency, per-site Wright's fixation index (FST), individual load, and association to production traits. RESULTS Of the 46 million called variants, 28% were private (called in only one line) and 21% were widespread (called in all nine lines). Genomic regions with a low recombination rate were enriched with private variants. Low-prevalence variants (called in one or a few lines only) were enriched for lower allele frequencies, lower FST, and putatively functional and regulatory roles (including LOF and deleterious missense variants). On average, individuals carried fewer private deleterious missense alleles than expected compared to alleles with other predicted consequences. Only a small subset of the low-prevalence variants had intermediate allele frequencies and explained small fractions of phenotypic variance (up to 3.2%) of production traits. The significant low-prevalence variants had higher per-site FST than the non-significant ones. These associated low-prevalence variants were tagged by other more widespread variants in high linkage disequilibrium, including intergenic variants. CONCLUSIONS Most low-prevalence variants have low minor allele frequencies and only a small subset of low-prevalence variants contributed detectable fractions of phenotypic variance of production traits. Accounting for low-prevalence variants is therefore unlikely to noticeably benefit across-breed analyses, such as the prediction of genomic breeding values in a population using reference populations of a different genetic background.
Collapse
|
6
|
Li Y, Ruperao P, Batley J, Edwards D, Martin W, Hobson K, Sutton T. Genomic prediction of preliminary yield trials in chickpea: Effect of functional annotation of SNPs and environment. THE PLANT GENOME 2022; 15:e20166. [PMID: 34786880 DOI: 10.1002/tpg2.20166] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 09/14/2021] [Indexed: 06/13/2023]
Abstract
Achieving yield potential in chickpea (Cicer arietinum L.) is limited by many constraints that include biotic and abiotic stresses. Combining next-generation sequencing technology with advanced statistical modeling has the potential to increase genetic gain efficiently. Whole genome resequencing data was obtained from 315 advanced chickpea breeding lines from the Australian chickpea breeding program resulting in more than 298,000 single nucleotide polymorphisms (SNPs) discovered. Analysis of population structure revealed a distinct group of breeding lines with many alleles that are absent from recently released Australian cultivars. Genome-wide association studies (GWAS) using these Australian breeding lines identified 20 SNPs significantly associated with grain yield in multiple field environments. A reduced level of nucleotide diversity and extended linkage disequilibrium suggested that some regions in these chickpea genomes may have been through selective breeding for yield or other traits. A large introgression segment that introduced from C. echinospermum for phytophthora root rot resistance was identified on chromosome 6, yet it also has unintended consequences of reducing yield due to linkage drag. We further investigated the effect of genotype by environment interaction on genomic prediction of yield. We found that the training set had better prediction accuracy when phenotyped under conditions relevant to the targeted environments. We also investigated the effect of SNP functional annotation on prediction accuracy using different subsets of SNPs based on their genomic locations: regulatory regions, exome, and alternative splice sites. Compared with the whole SNP dataset, a subset of SNPs did not significantly decrease prediction accuracy for grain yield despite consisting of a smaller number of SNPs.
Collapse
Affiliation(s)
- Yongle Li
- School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
| | - Pradeep Ruperao
- Statistics, Bioinformatics and Data Management, ICRISAT, Hyderabad, 502324, India
| | - Jacqueline Batley
- School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
| | - David Edwards
- School of Biological Sciences, The Univ. of Western Australia, Perth, WA, 6001, Australia
| | - William Martin
- Dep. of Agriculture and Fisheries, Warwick, Qld, 4370, Australia
| | - Kristy Hobson
- NSW Dep. of Primary Industries, Tamworth, NSW, 2340, Australia
| | - Tim Sutton
- School of Agriculture, Food and Wine, The Univ. of Adelaide, Adelaide, SA, 5064, Australia
- South Australian Research and Development Institute, Adelaide, SA, 5064, Australia
| |
Collapse
|
7
|
Lai E, Danner AL, Famula TR, Oberbauer AM. Genome-Wide Association Studies Reveal Susceptibility Loci for Noninfectious Claw Lesions in Holstein Dairy Cattle. Front Genet 2021; 12:657375. [PMID: 34122511 PMCID: PMC8194352 DOI: 10.3389/fgene.2021.657375] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 04/15/2021] [Indexed: 01/10/2023] Open
Abstract
Sole ulcers (SUs) and white line disease (WLD) are two common noninfectious claw lesions (NICL) that arise due to a compromised horn production and are frequent causes of lameness in dairy cattle, imposing welfare and profitability concerns. Low to moderate heritability estimates of SU and WLD susceptibility indicate that genetic selection could reduce their prevalence. To identify the susceptibility loci for SU, WLD, SU and/or WLD, and any type of noninfectious claw lesion, genome-wide association studies (GWAS) were performed using generalized linear mixed model (GLMM) regression, chunk-based association testing (CBAT), and a random forest (RF) approach. Cows from five commercial dairies in California were classified as controls having no lameness records and ≥6 years old (n = 102) or cases having SU (n = 152), WLD (n = 117), SU and/or WLD (SU + WLD, n = 198), or any type of noninfectious claw lesion (n = 217). The top single nucleotide polymorphisms (SNPs) were defined as those passing the Bonferroni-corrected suggestive and significance thresholds in the GLMM analysis or those that a validated RF model considered important. Effects of the top SNPs were quantified using Bayesian estimation. Linkage disequilibrium (LD) blocks defined by the top SNPs were explored for candidate genes and previously identified, functionally relevant quantitative trait loci. The GLMM and CBAT approaches revealed the same regions of association on BTA8 for SU and BTA13 common to WLD, SU + WLD, and NICL. These SNPs had effects significantly different from zero, and the LD blocks they defined explained a significant amount of phenotypic variance for each dataset (6.1-8.1%, p < 0.05), indicating the small but notable contribution of these regions to susceptibility. These regions contained candidate genes involved in wound healing, skin lesions, bone growth and mineralization, adipose tissue, and keratinization. The LD block defined by the most significant SNP on BTA8 for SU included a SNP previously associated with SU. The RF models were overfitted, indicating that the SNP effects were very small, thereby preventing meaningful interpretation of SNPs and any downstream analyses. These findings suggested that variants associated with various physiological systems may contribute to susceptibility for NICL, demonstrating the complexity of genetic predisposition.
Collapse
Affiliation(s)
- Ellen Lai
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Alexa L Danner
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Thomas R Famula
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Anita M Oberbauer
- Animal Science Department, University of California, Davis, Davis, CA, United States
| |
Collapse
|
8
|
DNA methylation studies in cattle. J Appl Genet 2021; 62:121-136. [PMID: 33400132 DOI: 10.1007/s13353-020-00604-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Revised: 10/23/2020] [Accepted: 12/22/2020] [Indexed: 12/12/2022]
Abstract
Investigation of the role of epigenetics in cattle breeding is gaining importance. DNA methylation represents an epigenetic modification which is essential for genomic stability and maintenance of development. Recently, DNA methylation research in cattle has intensified. The studies focus on the definition of methylomes in various organs and tissues in relation to the expression of genes underlying economically important traits, and explore methylome changes under developmental, environmental, disease, and diet influences. The investigations further characterize the methylation patterns of gametes in connection with their quality, and study methylome alterations in the developing naturally or assisted produced zygotes, embryos, and fetuses, considering their viability. A wide array of technologies developed for accurate and precise analysis of DNA methylation patterns is employed for both single-gene and genome-wide studies. Overall, the research is directed towards the identification of single methylation markers or their combinations which may be useful in the selection and breeding of animals to ensure cattle improvement.
Collapse
|
9
|
Xiang R, Breen EJ, Prowse-Wilkins CP, Chamberlain AJ, Goddard ME. Bayesian genome-wide analysis of cattle traits using variants with functional and evolutionary significance. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Context
Functional genomics studies have highlighted genomic regions with regulatory and evolutionary significance. Such information independent of association analysis may benefit fine-mapping and genomic selection of economically important traits. However, systematic evaluation of the use of functional information in mapping, and genomic selection of cattle traits, is lacking. Also, single-nucleotide polymorphisms (SNPs) from the high-density (HD) panel are known to tag informative variants, but the performance of genomic prediction using HD SNPs together with variants supported by different functional genomics is unknown.
Aims
We selected six sets of functionally important variants and modelled each set together with HD SNPs in Bayesian models to map and predict protein, fat and milk yield as well as mastitis, somatic cell count and temperament of dairy cattle.
Methods
Two models were used, namely (1) BayesR, which includes priors of four distribution of variant effects, and (2) BayesRC, which includes additional priors of different functional classes of variants. Bayesian models were trained in three breeds of 28 000 cows of Holstein, Jersey and Australian Red and predicted into 2600 independent bulls.
Key results
Adding functionally important variants significantly increased the enrichment of genetic variance explained for mapped variants, suggesting improved genome-wide mapping precision. Such improvement was significantly higher when the same set of variants was modelled by BayesRC than by BayesR. Combining functional variant sets with HD SNPs improves genomic prediction accuracy in the majority of the cases and such improvement was more common and stronger for non-Holstein breeds and traits such as mastitis, somatic cell count and temperament. In contrast, adding a large number of random sequence variants to HD SNPs reduces mapping precision and has a worse or similar prediction accuracy, compared with using HD SNPs alone to map or predict. While BayesRC tended to have better genomic prediction accuracy than did BayesR, the overall difference in prediction accuracy between the two models was insignificant.
Conclusions
Our findings demonstrated the usefulness of functional data in genomic mapping and prediction.
Implications
We have highlighted the need for effective tools exploiting complex functional datasets to improve genomic prediction.
Collapse
|
10
|
Genetic variations associated with long noncoding RNAs. Essays Biochem 2020; 64:867-873. [DOI: 10.1042/ebc20200033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 09/10/2020] [Accepted: 09/21/2020] [Indexed: 12/19/2022]
Abstract
Abstract
Genetic variations, including single nucleotide polymorphisms (SNPs) and structural variations, are widely distributed in the genome, including the long noncoding RNA (lncRNA) regions. The changes at locus might produce numerous effects in a variety of aspects. Multiple bioinformatics resources and tools were also developed for systematically dealing with genetic variations associated with lncRNAs. Moreover, correlation of the genetic variations in lncRNAs with immune disease, cancers, and other disease as well as development process were all included for discussion. In this essay, we summarized how and in what aspects these changes would affect lncRNA functions.
Collapse
|
11
|
Zhang F, Wang Y, Mukiibi R, Chen L, Vinsky M, Plastow G, Basarab J, Stothard P, Li C. Genetic architecture of quantitative traits in beef cattle revealed by genome wide association studies of imputed whole genome sequence variants: I: feed efficiency and component traits. BMC Genomics 2020; 21:36. [PMID: 31931702 PMCID: PMC6956504 DOI: 10.1186/s12864-019-6362-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 12/02/2019] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Genome wide association studies (GWAS) on residual feed intake (RFI) and its component traits including daily dry matter intake (DMI), average daily gain (ADG), and metabolic body weight (MWT) were conducted in a population of 7573 animals from multiple beef cattle breeds based on 7,853,211 imputed whole genome sequence variants. The GWAS results were used to elucidate genetic architectures of the feed efficiency related traits in beef cattle. RESULTS The DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants followed a scaled inverse chi-squared distribution to a greater extent. With a threshold of P-value < 1.00E-05, 16, 72, 88, and 116 lead DNA variants on multiple chromosomes were significantly associated with RFI, DMI, ADG, and MWT, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on DMI, ADG, and MWT were found on chromosomes 6, 14 and 20. On average, missense, 3'UTR, 5'UTR, and other regulatory region variants exhibited larger allele substitution effects in comparison to other functional classes. Intergenic and intron variants captured smaller proportions of additive genetic variance per DNA variant. Instead 3'UTR and synonymous variants explained a greater amount of genetic variance per DNA variant for all the traits examined while missense, 5'UTR and other regulatory region variants accounted for relatively more additive genetic variance per sequence variant for RFI and ADG, respectively. In total, 25 to 27 enriched cellular and molecular functions were identified with lipid metabolism and carbohydrate metabolism being the most significant for the feed efficiency traits. CONCLUSIONS RFI is controlled by many DNA variants with relatively small effects whereas DMI, ADG, and MWT are influenced by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory region and synonymous functional classes play a more important role per sequence variant in determining variation of the feed efficiency traits. The genetic architecture as revealed by the GWAS of the imputed 7,853,211 DNA variants will improve our understanding on the genetic control of feed efficiency traits in beef cattle.
Collapse
Affiliation(s)
- Feng Zhang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.,State Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, Jiangxi, China.,Present Address: Institute of Translational Medicine, Nanchang University, Nanchang, Jiangxi, China
| | - Yining Wang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Robert Mukiibi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Liuhong Chen
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Michael Vinsky
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada
| | - Graham Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - John Basarab
- Alberta Agriculture and Forestry, Lacombe Research and Development Centre, 6000 C&E Trail, Lacombe, AB, Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Changxi Li
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada. .,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
12
|
Wang Y, Zhang F, Mukiibi R, Chen L, Vinsky M, Plastow G, Basarab J, Stothard P, Li C. Genetic architecture of quantitative traits in beef cattle revealed by genome wide association studies of imputed whole genome sequence variants: II: carcass merit traits. BMC Genomics 2020; 21:38. [PMID: 31931697 PMCID: PMC6958779 DOI: 10.1186/s12864-019-6273-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 11/12/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Genome wide association studies (GWAS) were conducted on 7,853,211 imputed whole genome sequence variants in a population of 3354 to 3984 animals from multiple beef cattle breeds for five carcass merit traits including hot carcass weight (HCW), average backfat thickness (AFAT), rib eye area (REA), lean meat yield (LMY) and carcass marbling score (CMAR). Based on the GWAS results, genetic architectures of the carcass merit traits in beef cattle were elucidated. RESULTS The distributions of DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants conformed to a scaled inverse chi-squared distribution to a greater extent. At a threshold of P-value < 10-5, 51, 33, 46, 40, and 38 lead DNA variants on multiple chromosomes were significantly associated with HCW, AFAT, REA, LMY, and CMAR, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on HCW, AFAT, REA, and LMY were found on chromosome 6. On average, missense variants, 3'UTR variants, 5'UTR variants, and other regulatory region variants exhibited larger allele substitution effects on the traits in comparison to other functional classes. The amounts of additive genetic variance explained per DNA variant were smaller for intergenic and intron variants on all the traits whereas synonymous variants, missense variants, 3'UTR variants, 5'UTR variants, downstream and upstream gene variants, and other regulatory region variants captured a greater amount of additive genetic variance per sequence variant for one or more carcass merit traits investigated. In total, 26 enriched cellular and molecular functions were identified with lipid metabolisms, small molecular biochemistry, and carbohydrate metabolism being the most significant for the carcass merit traits. CONCLUSIONS The GWAS results have shown that the carcass merit traits are controlled by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory, synonymous, and missense functional classes have relatively larger impacts per sequence variant on the variation of carcass merit traits. The genetic architecture as revealed by the GWAS will improve our understanding on genetic controls of carcass merit traits in beef cattle.
Collapse
Affiliation(s)
- Yining Wang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Feng Zhang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
- State Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, Jiangxi China
- Present Address: Institute of Translational Medicine, Nanchang University, Nanchang, Jiangxi China
| | - Robert Mukiibi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Liuhong Chen
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Michael Vinsky
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
| | - Graham Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - John Basarab
- Alberta Agriculture and Forestry, Lacombe Research and Development Centre, 6000 C&E Trail, Lacombe, AB Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Changxi Li
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| |
Collapse
|
13
|
Xiang R, Berg IVD, MacLeod IM, Hayes BJ, Prowse-Wilkins CP, Wang M, Bolormaa S, Liu Z, Rochfort SJ, Reich CM, Mason BA, Vander Jagt CJ, Daetwyler HD, Lund MS, Chamberlain AJ, Goddard ME. Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proc Natl Acad Sci U S A 2019; 116:19398-19408. [PMID: 31501319 PMCID: PMC6765237 DOI: 10.1073/pnas.1904159116] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Many genome variants shaping mammalian phenotype are hypothesized to regulate gene transcription and/or to be under selection. However, most of the evidence to support this hypothesis comes from human studies. Systematic evidence for regulatory and evolutionary signals contributing to complex traits in a different mammalian model is needed. Sequence variants associated with gene expression (expression quantitative trait loci [eQTLs]) and concentration of metabolites (metabolic quantitative trait loci [mQTLs]) and under histone-modification marks in several tissues were discovered from multiomics data of over 400 cattle. Variants under selection and evolutionary constraint were identified using genome databases of multiple species. These analyses defined 30 sets of variants, and for each set, we estimated the genetic variance the set explained across 34 complex traits in 11,923 bulls and 32,347 cows with 17,669,372 imputed variants. The per-variant trait heritability of these sets across traits was highly consistent (r > 0.94) between bulls and cows. Based on the per-variant heritability, conserved sites across 100 vertebrate species and mQTLs ranked the highest, followed by eQTLs, young variants, those under histone-modification marks, and selection signatures. From these results, we defined a Functional-And-Evolutionary Trait Heritability (FAETH) score indicating the functionality and predicted heritability of each variant. In additional 7,551 cattle, the high FAETH-ranking variants had significantly increased genetic variances and genomic prediction accuracies in 3 production traits compared to the low FAETH-ranking variants. The FAETH framework combines the information of gene regulation, evolution, and trait heritability to rank variants, and the publicly available FAETH data provide a set of biological priors for cattle genomic selection worldwide.
Collapse
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia;
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Irene van den Berg
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Benjamin J Hayes
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- Centre for Animal Science, The University of Queensland, St. Lucia, QLD 4067, Australia
| | - Claire P Prowse-Wilkins
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Min Wang
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Sunduimijid Bolormaa
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Zhiqian Liu
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Simone J Rochfort
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Coralie M Reich
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083, Australia
| | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| | - Michael E Goddard
- Faculty of Veterinary & Agricultural Science, The University of Melbourne, Parkville, VIC 3052, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC 3083, Australia
| |
Collapse
|
14
|
Zhao H, Zhang S, Wu X, Pan C, Li X, Lei C, Chen H, Lan X. DNA methylation pattern of the goat PITX1 gene and its effects on milk performance. Arch Anim Breed 2019; 62:59-68. [PMID: 31807614 PMCID: PMC6852879 DOI: 10.5194/aab-62-59-2019] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 01/24/2019] [Indexed: 12/23/2022] Open
Abstract
Paired-like homeodomain transcription factor 1 (PITX1) is a pivotal
gene in the hypothalamic–pituitary–adrenal axis, which is a well-known
pathway affecting lactation performance. The aim of this study was to analyze
the DNA methylation profile of the PITX1 gene and its relevance to
milk performance in Xinong Saanen dairy goats; thus, potential epigenetic
markers of lactation performance were identified. A total of 267 goat blood
samples were divided into “low” and “high” groups according to two milk
traits: the average milk yield (AMY) and the average milk density (AMD). One
CpG island in the 3′-flanking region of the PITX1 gene was
identified as being related to milk performance. Fisher's exact test
demonstrated that the methylation rates of the overall CpG island and the 3rd
and 12th CpG-dinucleotide loci in the blood were significantly associated
with the AMY, and the overall methylation rate of the high AMY group was
relative hypomethylation compared with the low AMY group. The overall
methylation rates of this CpG island in mammary gland tissue from dry and
lactation periods again exhibited a significant difference: the lactation
period showed relative hypomethylation compared with the dry period.
Bioinformatic transcription factor binding site prediction identified some
lactation performance related transcription factors in this CpG island, such
as CTCF, STAT, SMAD, CDEF, SP1, and KLFS. Briefly, overall methylation
changes of the CpG island in the PITX1 gene are relevant to
lactation performance, which will be valuable for future studies and
epigenetic marker-assisted selection (eMAS) in the breeding of goats with
respect to lactation performance.
Collapse
Affiliation(s)
- Haiyu Zhao
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Sihuan Zhang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xianfeng Wu
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Chuanying Pan
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xiangchen Li
- Institute of Beijing Animal Science and Veterinary, Chinese Academy of Agricultural Science, Beijing 100194, China
| | - Chuzhao Lei
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Hong Chen
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xianyong Lan
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| |
Collapse
|
15
|
Giuffra E, Tuggle CK. Functional Annotation of Animal Genomes (FAANG): Current Achievements and Roadmap. Annu Rev Anim Biosci 2018; 7:65-88. [PMID: 30427726 DOI: 10.1146/annurev-animal-020518-114913] [Citation(s) in RCA: 107] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Functional annotation of genomes is a prerequisite for contemporary basic and applied genomic research, yet farmed animal genomics is deficient in such annotation. To address this, the FAANG (Functional Annotation of Animal Genomes) Consortium is producing genome-wide data sets on RNA expression, DNA methylation, and chromatin modification, as well as chromatin accessibility and interactions. In addition to informing our understanding of genome function, including comparative approaches to elucidate constrained sequence or epigenetic elements, these annotation maps will improve the precision and sensitivity of genomic selection for animal improvement. A scientific community-driven effort has already created a coordinated data collection and analysis enterprise crucial for the success of this global effort. Although it is early in this continuing process, functional data have already been produced and application to genetic improvement reported. The functional annotation delivered by the FAANG initiative will add value and utility to the greatly improved genome sequences being established for domesticated animal species.
Collapse
Affiliation(s)
- Elisabetta Giuffra
- Génétique Animale et Biologie Intégrative (GABI), Institut National de la Recherche Agronomique (INRA), AgroParisTech, Université Paris Saclay, 78350 Jouy-en-Josas, France;
| | | | | |
Collapse
|