1
|
Schneider H, Segelke D, Tetens J, Thaller G, Bennewitz J. A genomic assessment of the correlation between milk production traits and claw and udder health traits in Holstein dairy cattle. J Dairy Sci 2023; 106:1190-1205. [PMID: 36460501 DOI: 10.3168/jds.2022-22312] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 09/01/2022] [Indexed: 11/30/2022]
Abstract
Claw diseases and mastitis represent the most important disease traits in dairy cattle with increasing incidences and a frequently mentioned connection to milk yield. Yet, many studies aimed to detect the genetic background of both trait complexes via fine-mapping of quantitative trait loci. However, little is known about genomic regions that simultaneously affect milk production and disease traits. For this purpose, several tools to detect local genetic correlations have been developed. In this study, we attempted a detailed analysis of milk production and disease traits as well as their interrelationship using a sample of 34,497 50K genotyped German Holstein cows with milk production and claw and udder disease traits records. We performed a pedigree-based quantitative genetic analysis to estimate heritabilities and genetic correlations. Additionally, we generated GWAS summary statistics, paying special attention to genomic inflation, and used these data to identify shared genomic regions, which affect various trait combinations. The heritability on the liability scale of the disease traits was low, between 0.02 for laminitis and 0.19 for interdigital hyperplasia. The heritabilities for milk production traits were higher (between 0.27 for milk energy yield and 0.48 for fat-protein ratio). Global genetic correlations indicate the shared genetic effect between milk production and disease traits on a whole genome level. Most of these estimates were not significantly different from zero, only mastitis showed a positive one to milk (0.18) and milk energy yield (0.13), as well as a negative one to fat-protein ratio (-0.07). The genomic analysis revealed significant SNPs for milk production traits that were enriched on Bos taurus autosome 5, 6, and 14. For digital dermatitis, we found significant hits, predominantly on Bos taurus autosome 5, 10, 22, and 23, whereas we did not find significantly trait-associated SNPs for the other disease traits. Our results confirm the known genetic background of disease and milk production traits. We further detected 13 regions that harbor strong concordant effects on a trait combination of milk production and disease traits. This detailed investigation of genetic correlations reveals additional knowledge about the localization of regions with shared genetic effects on these trait complexes, which in turn enables a better understanding of the underlying biological pathways and putatively the utilization for a more precise design of breeding schemes.
Collapse
Affiliation(s)
- Helen Schneider
- Institute of Animal Science, University of Hohenheim, 70599 Stuttgart, Germany.
| | - Dierck Segelke
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283 Verden, Germany
| | - Jens Tetens
- Department of Animal Sciences, University of Göttingen, 37077 Göttingen, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts University of Kiel, 24098 Kiel, Germany
| | - Jörn Bennewitz
- Institute of Animal Science, University of Hohenheim, 70599 Stuttgart, Germany
| |
Collapse
|
2
|
Lu X, Arbab AAI, Abdalla IM, Liu D, Zhang Z, Xu T, Su G, Yang Z. Genetic Parameter Estimation and Genome-Wide Association Study-Based Loci Identification of Milk-Related Traits in Chinese Holstein. Front Genet 2022; 12:799664. [PMID: 35154251 PMCID: PMC8836289 DOI: 10.3389/fgene.2021.799664] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 12/07/2021] [Indexed: 11/22/2022] Open
Abstract
Accurately estimating the genetic parameters and revealing more genetic variants underlying milk production and quality are conducive to the genetic improvement of dairy cows. In this study, we estimate the genetic parameters of five milk-related traits of cows-namely, milk yield (MY), milk fat percentage (MFP), milk fat yield (MFY), milk protein percentage (MPP), and milk protein yield (MPY)-based on a random regression test-day model. A total of 95,375 test-day records of 9,834 cows in the lower reaches of the Yangtze River were used for the estimation. In addition, genome-wide association studies (GWASs) for these traits were conducted, based on adjusted phenotypes. The heritability, as well as the standard errors, of MY, MFP, MFY, MPP, and MPY during lactation ranged from 0.22 ± 0.02 to 0.31 ± 0.04, 0.06 ± 0.02 to 0.15 ± 0.03, 0.09 ± 0.02 to 0.28 ± 0.04, 0.07 ± 0.01 to 0.16 ± 0.03, and 0.14 ± 0.02 to 0.27 ± 0.03, respectively, and the genetic correlations between different days in milk (DIM) within lactations decreased as the time interval increased. Two, six, four, six, and three single nucleotide polymorphisms (SNPs) were detected, which explained 5.44, 12.39, 8.89, 10.65, and 7.09% of the phenotypic variation in MY, MFP, MFY, MPP, and MPY, respectively. Ten Kyoto Encyclopedia of Genes and Genomes pathways and 25 Gene Ontology terms were enriched by analyzing the nearest genes and genes within 200 kb of the detected SNPs. Moreover, 17 genes in the enrichment results that may play roles in milk production and quality were selected as candidates, including CAMK2G, WNT3A, WNT9A, PLCB4, SMAD9, PLA2G4A, ARF1, OPLAH, MGST1, CLIP1, DGAT1, PRMT6, VPS28, HSF1, MAF1, TMEM98, and F7. We hope that this study will provide useful information for in-depth understanding of the genetic architecture of milk production and quality traits, as well as contribute to the genomic selection work of dairy cows in the lower reaches of the Yangtze River.
Collapse
Affiliation(s)
- Xubin Lu
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | | | | | - Dingding Liu
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Zhipeng Zhang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| | - Tianle Xu
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, Yangzhou University, Yangzhou, China
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Zhangping Yang
- College of Animal Science and Technology, Yangzhou University, Yangzhou, China
| |
Collapse
|
3
|
Karaman E, Su G, Croue I, Lund MS. Genomic prediction using a reference population of multiple pure breeds and admixed individuals. Genet Sel Evol 2021; 53:46. [PMID: 34058971 PMCID: PMC8168010 DOI: 10.1186/s12711-021-00637-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 05/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In dairy cattle populations in which crossbreeding has been used, animals show some level of diversity in their origins. In rotational crossbreeding, for instance, crossbred dams are mated with purebred sires from different pure breeds, and the genetic composition of crossbred animals is an admixture of the breeds included in the rotation. How to use the data of such individuals in genomic evaluations is still an open question. In this study, we aimed at providing methodologies for the use of data from crossbred individuals with an admixed genetic background together with data from multiple pure breeds, for the purpose of genomic evaluations for both purebred and crossbred animals. A three-breed rotational crossbreeding system was mimicked using simulations based on animals genotyped with the 50 K single nucleotide polymorphism (SNP) chip. RESULTS For purebred populations, within-breed genomic predictions generally led to higher accuracies than those from multi-breed predictions using combined data of pure breeds. Adding admixed population's (MIX) data to the combined pure breed data considering MIX as a different breed led to higher accuracies. When prediction models were able to account for breed origin of alleles, accuracies were generally higher than those from combining all available data, depending on the correlation of quantitative trait loci (QTL) effects between the breeds. Accuracies varied when using SNP effects from any of the pure breeds to predict the breeding values of MIX. Using those breed-specific SNP effects that were estimated separately in each pure breed, while accounting for breed origin of alleles for the selection candidates of MIX, generally improved the accuracies. Models that are able to accommodate MIX data with the breed origin of alleles approach generally led to higher accuracies than models without breed origin of alleles, depending on the correlation of QTL effects between the breeds. CONCLUSIONS Combining all available data, pure breeds' and admixed population's data, in a multi-breed reference population is beneficial for the estimation of breeding values for pure breeds with a small reference population. For MIX, such an approach can lead to higher accuracies than considering breed origin of alleles for the selection candidates, and using breed-specific SNP effects estimated separately in each pure breed. Including MIX data in the reference population of multiple breeds by considering the breed origin of alleles, accuracies can be further improved. Our findings are relevant for breeding programs in which crossbreeding is systematically applied, and also for populations that involve different subpopulations and between which exchange of genetic material is routine practice.
Collapse
Affiliation(s)
- Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | | | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
4
|
Song H, Zhang Q, Ding X. The superiority of multi-trait models with genotype-by-environment interactions in a limited number of environments for genomic prediction in pigs. J Anim Sci Biotechnol 2020; 11:88. [PMID: 32974012 PMCID: PMC7507970 DOI: 10.1186/s40104-020-00493-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2020] [Accepted: 07/07/2020] [Indexed: 11/21/2022] Open
Abstract
Background Different production systems and climates could lead to genotype-by-environment (G × E) interactions between populations, and the inclusion of G × E interactions is becoming essential in breeding decisions. The objective of this study was to investigate the performance of multi-trait models in genomic prediction in a limited number of environments with G × E interactions. Results In total, 2,688 and 1,384 individuals with growth and reproduction phenotypes, respectively, from two Yorkshire pig populations with similar genetic backgrounds were genotyped with the PorcineSNP80 panel. Single- and multi-trait models with genomic best linear unbiased prediction (GBLUP) and BayesC π were implemented to investigate their genomic prediction abilities with 20 replicates of five-fold cross-validation. Our results regarding between-environment genetic correlations of growth and reproductive traits (ranging from 0.618 to 0.723) indicated the existence of G × E interactions between these two Yorkshire pig populations. For single-trait models, genomic prediction with GBLUP was only 1.1% more accurate on average in the combined population than in single populations, and no significant improvements were obtained by BayesC π for most traits. In addition, single-trait models with either GBLUP or BayesC π produced greater bias for the combined population than for single populations. However, multi-trait models with GBLUP and BayesC π better accommodated G × E interactions, yielding 2.2% – 3.8% and 1.0% – 2.5% higher prediction accuracies for growth and reproductive traits, respectively, compared to those for single-trait models of single populations and the combined population. The multi-trait models also yielded lower bias and larger gains in the case of a small reference population. The smaller improvement in prediction accuracy and larger bias obtained by the single-trait models in the combined population was mainly due to the low consistency of linkage disequilibrium between the two populations, which also caused the BayesC π method to always produce the largest standard error in marker effect estimation for the combined population. Conclusions In conclusion, our findings confirmed that directly combining populations to enlarge the reference population is not efficient in improving the accuracy of genomic prediction in the presence of G × E interactions, while multi-trait models perform better in a limited number of environments with G × E interactions.
Collapse
Affiliation(s)
- Hailiang Song
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, 271001 China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, 100193 China
| |
Collapse
|
5
|
Karaman E, Lund MS, Su G. Multi-trait single-step genomic prediction accounting for heterogeneous (co)variances over the genome. Heredity (Edinb) 2020; 124:274-287. [PMID: 31641237 PMCID: PMC6972913 DOI: 10.1038/s41437-019-0273-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 09/05/2019] [Accepted: 09/06/2019] [Indexed: 11/23/2022] Open
Abstract
Widely used genomic prediction models may not properly account for heterogeneous (co)variance structure across the genome. Models such as BayesA and BayesB assume locus-specific variance, which are highly influenced by the prior for (co)variance of single nucleotide polymorphism (SNP) effect, regardless of the size of data. Models such as BayesC or GBLUP assume a common (co)variance for a proportion (BayesC) or all (GBLUP) of the SNP effects. In this study, we propose a multi-trait Bayesian whole genome regression method (BayesN0), which is based on grouping a number of predefined SNPs to account for heterogeneous (co)variance structure across the genome. This model was also implemented in single-step Bayesian regression (ssBayesN0). For practical implementation, we considered multi-trait single-step SNPBLUP models, using (co)variance estimates from BayesN0 or ssBayesN0. Genotype data were simulated using haplotypes on first five chromosomes of 2200 Danish Holstein cattle, and phenotypes were simulated for two traits with heritabilities 0.1 or 0.4, assuming 200 quantitative trait loci (QTL). We compared prediction accuracy from different prediction models and different region sizes (one SNP, 100 SNPs, one chromosome or whole genome). In general, highest accuracies were obtained when 100 adjacent SNPs were grouped together. The ssBayesN0 improved accuracies over BayesN0, and using (co)variance estimates from ssBayesN0 generally yielded higher accuracies than using (co)variance estimates from BayesN0, for the 100 SNPs region size. Our results suggest that it could be a good strategy to estimate (co)variance components from ssBayesN0, and then to use those estimates in genomic prediction using multi-trait single-step SNPBLUP, in routine genomic evaluations.
Collapse
Affiliation(s)
- Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
6
|
Genomic Prediction Using Multi-trait Weighted GBLUP Accounting for Heterogeneous Variances and Covariances Across the Genome. G3-GENES GENOMES GENETICS 2018; 8:3549-3558. [PMID: 30194089 PMCID: PMC6222589 DOI: 10.1534/g3.118.200673] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Implicit assumption of common (co)variance for all loci in multi-trait Genomic Best Linear Unbiased Prediction (GBLUP) results in a genomic relationship matrix (G) that is common to all traits. When this assumption is violated, Bayesian whole genome regression methods may be superior to GBLUP by accounting for unequal (co)variance for all loci or genome regions. This study aimed to develop a strategy to improve the accuracy of GBLUP for multi-trait genomic prediction, using (co)variance estimates of SNP effects from Bayesian whole genome regression methods. Five generations (G1-G5, test populations) of genotype data were available by simulations based on data of 2,200 Danish Holstein cows (G0, reference population). Two correlated traits with heritabilities of 0.1 or 0.4, and a genetic correlation of 0.45 were generated. First, SNP effects and breeding values were estimated using BayesAS method, assuming (co)variance was the same for SNPs within a genome region, and different between regions. Region size was set as one SNP, 100 SNPs, a whole chromosome or whole genome. Second, posterior (co)variances of SNP effects were used to weight SNPs in construction of G matrices. In general, region size of 100 SNPs led to highest prediction accuracies using BayesAS, and wGBLUP outperformed GBLUP at this region size. Our results suggest that when genetic architectures of traits favor Bayesian methods, the accuracy of multi-trait GBLUP can be as high as the Bayesian method if SNPs are weighted by the Bayesian posterior (co)variances.
Collapse
|
7
|
Li X, Liu X, Chen Y. The influence of a first-order antedependence model and hyperparameters in BayesCπ for genomic prediction. ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES 2018; 31:1863-1870. [PMID: 30056688 PMCID: PMC6212739 DOI: 10.5713/ajas.18.0102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/31/2018] [Accepted: 06/02/2018] [Indexed: 11/27/2022]
Abstract
OBJECTIVE The Bayesian first-order antedependence models, which specified single nucleotide polymorphisms (SNP) effects as being spatially correlated in the conventional BayesA/B, had more accurate genomic prediction than their corresponding classical counterparts. Given advantages of BayesCπ over BayesA/B, we have developed hyper-BayesCπ, ante-BayesCπ, and ante-hyper-BayesCπ to evaluate influences of the antedependence model and hyperparameters for vg and sg2 on BayesCπ. METHODS Three public data (two simulated data and one mouse data) were used to validate our proposed methods. Genomic prediction performance of proposed methods was compared to traditional BayesCπ, ante-BayesA and ante-BayesB. RESULTS Through both simulation and real data analyses, we found that hyper-BayesCπ, ante-BayesCπ and ante-hyper-BayesCπ were comparable with BayesCπ, ante-BayesB, and ante-BayesA regarding the prediction accuracy and bias, except the situation in which ante-BayesB performed significantly worse when using a few SNPs and π = 0.95. CONCLUSION Hyper-BayesCπ is recommended because it avoids pre-estimated total genetic variance of a trait compared with BayesCπ and shortens computing time compared with ante-BayesB. Although the antedependence model in BayesCπ did not show the advantages in our study, larger real data with high density chip may be used to validate it again in the future.
Collapse
Affiliation(s)
- Xiujin Li
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, North Third Road, Guangzhou Higher Education Mega Center, Guangzhou, Guangdong 510006, China
| | - Xiaohong Liu
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, North Third Road, Guangzhou Higher Education Mega Center, Guangzhou, Guangdong 510006, China
| | - Yaosheng Chen
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, North Third Road, Guangzhou Higher Education Mega Center, Guangzhou, Guangdong 510006, China
| |
Collapse
|