1
|
Zhao H, MacLeod IM, Keeble-Gagnere G, Barbulescu DM, Tibbits JF, Kaur S, Hayden M. Using genotype imputation to integrate Canola populations for genome-wide association and genomic prediction of blackleg resistance. BMC Genomics 2025; 26:215. [PMID: 40038585 DOI: 10.1186/s12864-025-11250-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 01/16/2025] [Indexed: 03/06/2025] Open
Abstract
BACKGROUND Integrating germplasm populations genotyped by different genotyping platforms via genotype imputation is a way to utilize accumulated genetic resources. In this study, we used 278 canola samples genotyped via whole-genome sequencing (WGS) at 10× coverage to evaluate the imputation accuracy of three imputation approaches. The optimal imputation methods were used to impute and integrate two Canola genotype datasets: a diverse canola collection genotyped by genotyping-by-sequencing via transcriptome (GBS-t) and a double haploid (DH) line collection genotyped with low-coverage WGS (skim-WGS). The genomic predictive ability (GP) and detection power of marker‒trait association (GWAS) of the combined population for blackleg resistance were evaluated. RESULTS The empirical imputation accuracy (r2) measured as the squared correlation between observed and imputed genotypes was moderate for Minimac3 when imputing from the GBS-t density to the WGS. The accuracy dramatically improved from 0.64 to 0.82 by removing SNPs with poor Minimac3-reported Rsq (Rsq < 0.2) quality statistics. The r2 for GLIMPSE was higher than that for Beagle when imputing from different low-coverage to full-coverage WGS. We imputed and integrated the diverse canola collection and the DH lines, and the combined population showed similar or slightly greater predictive ability (PA) for blackleg resistance traits than did each of the single populations with ~ 921 K SNPs. Higher marker-trait association (MTA) detection powers were indicated with the combined population; however, similar numbers of MTAs were discovered when each single population was combined in a meta-GWAS. CONCLUSION It is feasible to impute and integrate germplasms from different sequencing platforms for downstream analyses. However, genetic heterogeneity across populations could add complexity to the analysis. Increasing the sample size by combining datasets showed slightly greater predictive ability and greater detection power in GWASs in the present study.
Collapse
Affiliation(s)
- Huanhuan Zhao
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Gabriel Keeble-Gagnere
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Denise M Barbulescu
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Josquin F Tibbits
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Sukhjiwan Kaur
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Matthew Hayden
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia.
| |
Collapse
|
2
|
Liu B, Liu H, Tu J, Xiao J, Yang J, He X, Zhang H. An investigation of machine learning methods applied to genomic prediction in yellow-feathered broilers. Poult Sci 2025; 104:104489. [PMID: 39571199 PMCID: PMC11647775 DOI: 10.1016/j.psj.2024.104489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 10/10/2024] [Accepted: 10/31/2024] [Indexed: 01/25/2025] Open
Abstract
Machine learning (ML) methods have rapidly developed in various theoretical and practical research areas, including predicting genomic breeding values for large livestock animals. However, few studies have investigated the application of ML in broiler breeding. In this study, seven different ML methods-support vector regression (SVR), random forest (RF), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), kernel ridge regression (KRR) and multilayer perceptron (MLP) were employed to predict the genomic breeding values of laying traits, growth and carcass traits in a yellow-feathered broiler breeding population. The results indicated that classic methods, such as GBLUP and Bayesian, achieved superior prediction accuracy compared to ML methods in five of the eight traits. For half-eviscerated weight (HEW), ML methods showed an average improvement of 54.4% over GBLUP and Bayesian methods. Among the ML methods, SVR, RF, GBDT, and XGBoost exhibited improvements exceeding 60%, with respective values of 61.3%, 61.0%, 60.4%, and 60.7%; while MLP improved by 54.4% and LightGBM by 53.7%, KRR had the lowest improvement at 29.4%. For eviscerated weight (EW), ML methods still outperformed GBLUP and Bayesian methods. MLP gained the largest improvement at 19.0%, while SVR, RF, GBDT, XGBoost, LightGBM, and KRR improved by 15.0%, 16.5%, 9.5%, 7.0%, 1.6%, and 15.9%, respectively. Compared to default hyperparameters, the average improvement of ML methods with tuned hyperparameters was 34.0%, 32.9%, 27.0%, 19.3%, 26.8%, 13.2%, 18.9%, and 46.3%, respectively. The prediction accuracy of above algorithms could be optimized using genome-wide association study (GWAS) to select subsets of significant SNPs. This work provides valuable insights into genomic prediction, aiding genetic breeding in broilers.
Collapse
Affiliation(s)
- Bogong Liu
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Huichao Liu
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Junhao Tu
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Jian Xiao
- Hunan Xiangjia Husbandry Co., Ltd, Changde, Hunan, China
| | - Jie Yang
- Hunan Xiangjia Husbandry Co., Ltd, Changde, Hunan, China
| | - Xi He
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China; Hunan Engineering Research Center of Poultry Production Safety, Changsha, Hunan, China; Yuelushan Laboratory, Changsha 410128, China
| | - Haihan Zhang
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China; Hunan Engineering Research Center of Poultry Production Safety, Changsha, Hunan, China; Yuelushan Laboratory, Changsha 410128, China.
| |
Collapse
|
3
|
Skovbjerg CK, Sarup P, Wahlström E, Jensen JD, Orabi J, Olesen L, Jensen J, Jahoor A, Ramstein G. Multi-population GWAS detects robust marker associations in a newly established six-rowed winter barley breeding program. Heredity (Edinb) 2025; 134:33-48. [PMID: 39609544 PMCID: PMC11724117 DOI: 10.1038/s41437-024-00733-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 10/24/2024] [Accepted: 10/25/2024] [Indexed: 11/30/2024] Open
Abstract
Genome-wide association study (GWAS) is a powerful tool for identifying marker-trait associations that can accelerate breeding progress. Yet, its power is typically constrained in newly established breeding programs where large phenotypic and genotypic datasets have not yet accumulated. Expanding the dataset by inclusion of data from well-established breeding programs with many years of phenotyping and genotyping can potentially address this problem. In this study we performed single- and multi-population GWAS on heading date and lodging in four barley breeding populations with varying combinations of row-type and growth habit. Focusing on a recently established 6-rowed winter (6RW) barley population, single-population GWAS hardly resulted in any significant associations. Nevertheless, the combination of the 6RW target population with other populations in multi-population GWAS detected four and five robust candidate quantitative trait loci for heading date and lodging, respectively. Of these, three remained undetected when analysing the combined populations individually. Further, multi-population GWAS detected markers capturing a larger proportion of genetic variance in 6RW. For multi-population GWAS, we compared the findings of a univariate model (MP1) with a multivariate model (MP2). While both models surpassed single-population GWAS in power, MP2 offered a significant advantage by having more realistic assumptions while pointing towards robust marker-trait associations across populations. Additionally, comparisons of GWAS findings for MP2 and single-population GWAS allowed identification of population-specific loci. In conclusion, our study presents a promising approach to kick-start genomics-based breeding in newly established breeding populations.
Collapse
Affiliation(s)
- Cathrine Kiel Skovbjerg
- Nordic Seed A/S, Odder, Denmark.
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark.
| | | | | | | | | | | | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark
| | - Ahmed Jahoor
- Nordic Seed A/S, Odder, Denmark
- Department of Plant Breeding, The Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Guillaume Ramstein
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus C, Denmark
| |
Collapse
|
4
|
Barani S, Miraie Ashtiani SR, Nejati Javaremi A, Khansefid M, Esfandyari H. Optimizing purebred selection to improve crossbred performance. Front Genet 2024; 15:1384973. [PMID: 39381139 PMCID: PMC11458422 DOI: 10.3389/fgene.2024.1384973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Accepted: 08/29/2024] [Indexed: 10/10/2024] Open
Abstract
Crossbreeding is a widely adopted practice in the livestock industry, leveraging the advantages of heterosis and breed complementarity. The prediction of Crossbred Performance (CP) often relies on Purebred Performance (PB) due to limited crossbred data availability. However, the effective selection of purebred parents for enhancing CP depends on non-additive genetic effects and environmental factors. These factors are encapsulated in the genetic correlation between crossbred and purebred populations (r p c ). In this study, a two-way crossbreeding simulation was employed to investigate various strategies for integrating data from purebred and crossbred populations. The goal was to identify optimal models that maximize CP across different levels ofr p c . Different scenarios involving the selection of genotyped individuals from purebred and crossbred populations were explored using ssGBLUP (single-step Genomic Best Linear Unbiased Prediction) and ssGBLUP-MF (ssGBLUP with metafounders) models. The findings revealed an increase in prediction accuracy across all scenarios asr p c values increased. Notably, in the scenario incorporating genotypes from both purebred parent breeds and their crossbreds, both ssGBLUP and ssGBLUP-MF models exhibited nearly identical predictive accuracy. This scenario achieved maximum accuracy whenr p c was less than 0.5. However, atr p c = 0.8, ssGBLUP, which exclusively included sire breed genotypes in the training set, achieved the highest overall prediction accuracy at 73.2%. In comparison, the BLUP-UPG (BLUP with unknown parent group) model demonstrated lower accuracy than ssGBLUP and ssGBLUP-MF across allr p c levels. Although ssGBLUP and ssGBLUP-MF did not demonstrate a definitive trend in their respective scenarios, the prediction ability for CP increased when incorporating both crossbred and purebred population genotypes at lower levels of r p c . Furthermore, whenr p c was high, utilizing paternal genotype for CP predictions emerged as the most effective strategy. Predicted dispersion remained relatively similar in all scenarios, indicating a slight underestimation of breeding values. Overall, ther p c value emerged as a critical factor in predicting CP based on purebred data. However, the optimal model to maximize CP depends on the factors influencingr p c . Consequently, ongoing research aims to develop models that optimize purebred selection, further enhancing CP.
Collapse
Affiliation(s)
- Somayeh Barani
- Department of Animal Science, Faculty of Agriculture, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Sayed Reza Miraie Ashtiani
- Department of Animal Science, Faculty of Agriculture, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Ardeshir Nejati Javaremi
- Department of Animal Science, Faculty of Agriculture, College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - Majid Khansefid
- Agriculture Victoria Research, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | | |
Collapse
|
5
|
Chu TT, Kristensen PS, Jensen J. Simulation of functional additive and non-additive genetic effects using statistical estimates from quantitative genetic models. Heredity (Edinb) 2024; 133:33-42. [PMID: 38822133 PMCID: PMC11222558 DOI: 10.1038/s41437-024-00690-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 05/08/2024] [Accepted: 05/08/2024] [Indexed: 06/02/2024] Open
Abstract
Stochastic simulation software is commonly used to aid breeders designing cost-effective breeding programs and to validate statistical models used in genetic evaluation. An essential feature of the software is the ability to simulate populations with desired genetic and non-genetic parameters. However, this feature often fails when non-additive effects due to dominance or epistasis are modeled, as the desired properties of simulated populations are estimated from classical quantitative genetic statistical models formulated at the population level. The software simulates underlying functional effects for genotypic values at the individual level, which are not necessarily the same as effects from statistical models in which dominance and epistasis are included. This paper provides the theoretical basis and mathematical formulas for the transformation between functional and statistical effects in such simulations. The transformation is demonstrated with two statistical models analyzing individual phenotypes in a single population (common in animal breeding) and plot phenotypes of three-way hybrids involving two inbred populations (observed in some crop breeding programs). We also describe different methods for the simulation of functional effects for additive genetics, dominance, and epistasis to achieve the desired levels of variance components in classical statistical models used in quantitative genetics.
Collapse
Affiliation(s)
- Thinh Tuan Chu
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark.
- Vietnam National University of Agriculture, Faculty of Animal Science, Trâu Quỳ, Gia Lâm, Hanoi, Vietnam.
| | | | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
| |
Collapse
|
6
|
Yin C, Shi H, Zhou P, Wang Y, Tao X, Yin Z, Zhang X, Liu Y. Genomic Prediction of Growth Traits in Yorkshire Pigs of Different Reference Group Sizes Using Different Estimated Breeding Value Models. Animals (Basel) 2024; 14:1098. [PMID: 38612337 PMCID: PMC11010886 DOI: 10.3390/ani14071098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 03/31/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
The need for sufficient reference population data poses a significant challenge in breeding programs aimed at improving pig farming on a small to medium scale. To overcome this hurdle, investigating the advantages of combing reference populations of varying sizes is crucial for enhancing the accuracy of the genomic estimated breeding value (GEBV). Genomic selection (GS) in populations with limited reference data can be optimized by combining populations of the same breed or related breeds. This study focused on understanding the effect of combing different reference group sizes on the accuracy of GS for determining the growth effectiveness and percentage of lean meat in Yorkshire pigs. Specifically, our study investigated two important traits: the age at 100 kg live weight (AGE100) and the backfat thickness at 100 kg live weight (BF100). This research assessed the efficiency of genomic prediction (GP) using different GEBV models across three Yorkshire populations with varying genetic backgrounds. The GeneSeek 50K GGP porcine high-density array was used for genotyping. A total of 2295 Yorkshire pigs were included, representing three Yorkshire pig populations with different genetic backgrounds-295 from Danish (small) lines from Huaibei City, Anhui Province, 500 from Canadian (medium) lines from Lixin County, Anhui Province, and 1500 from American (large) lines from Shanghai. To evaluate the impact of different population combination scenarios on the GS accuracy, three approaches were explored: (1) combining all three populations for prediction, (2) combining two populations to predict the third, and (3) predicting each population independently. Five GEBV models, including three Bayesian models (BayesA, BayesB, and BayesC), the genomic best linear unbiased prediction (GBLUP) model, and single-step GBLUP (ssGBLUP) were implemented through 20 repetitions of five-fold cross-validation (CV). The results indicate that predicting one target population using the other two populations yielded the highest accuracy, providing a novel approach for improving the genomic selection accuracy in Yorkshire pigs. In this study, it was found that using different populations of the same breed to predict small- and medium-sized herds might be effective in improving the GEBV. This investigation highlights the significance of incorporating population combinations in genetic models for predicting the breeding value, particularly for pig farmers confronted with resource limitations.
Collapse
Affiliation(s)
- Chang Yin
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Haoran Shi
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Peng Zhou
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Yuwei Wang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Xuzhe Tao
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Zongjun Yin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China; (Z.Y.); (X.Z.)
| | - Xiaodong Zhang
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China; (Z.Y.); (X.Z.)
| | - Yang Liu
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| |
Collapse
|
7
|
Yin C, Zhou P, Wang Y, Yin Z, Liu Y. Using genomic selection to improve the accuracy of genomic prediction for multi-populations in pigs. Animal 2024; 18:101062. [PMID: 38211414 DOI: 10.1016/j.animal.2023.101062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 12/11/2023] [Accepted: 12/12/2023] [Indexed: 01/13/2024] Open
Abstract
The size of the reference group is among the most critical determinants of genomic estimated breeding values (GEBVs) accuracy. However, small- and medium-sized pig farms often need help accumulating adequate reference data, posing significant challenges to breeding programs. To solve this problem, exploring the potential benefits of combining reference groups of different sizes is necessary to improve GEBV accuracy. The primary objective of this investigation was to assess a more effective statistical model for combined multi-populations and its potential to enhance the accuracy of GEBVs for small and medium populations. Three populations were simulated using the QMSim software, each consisting of different sizes (300, 600, and 1 500, respectively). To assess the impact of heritability on the accuracy of GEBVs, four different levels of heritability (0.05, 0.15, 0.35, and 0.5) were simulated. Simultaneously, to investigate the impact of kinship on multi-populations, the study created four distinct scenarios for the three sizes of populations. These scenarios included: (1) the three groups are all independent, (2) the large group and the small group with a familial connection (n = 1 800), a middle group (n = 600) acting independently with no kinship, (3) the large group with a familial connection to the middle group (n = 2 100) but no connection to the small group (n = 300), and (4) the small group with a familial connection to the middle group (n = 900), while the large group (n = 1 500) acted independently with no kinship. This study evaluates and compares the accuracy of predicting breeding values using four different methods, including genomic best linear unbiased prediction (GBLUP), single-stepGBLUP (ssGBLUP), and two Bayesian models (Bayes A and Bayes B), with varying sizes of reference groups. In each scenario, three different prediction strategies were compared: (1) Merging all three different sizes of populations for predicting, (2) predicting each independent population separately, and (3) the other two populations predict the population. Our findings reveal that combining populations enhances the Bayesian models, with Bayes B yielding the highest accuracy. In independent populations, the best linear unbiased prediction (BLUP) models demonstrated the highest accuracy. However, in cases where populations were related and the heritability was high, the Bayes B model exhibited the highest overall accuracy (slightly higher than BLUP models) in the independent population. Our results underscore the importance of considering population combinations when using genetic models to predict breeding values, particularly for pig farmers with limited resources.
Collapse
Affiliation(s)
- Chang Yin
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Experimental Teaching Demonstration Centre of Animal Science, Nanjing Agricultural University, Nanjing 210095, PR China
| | - Peng Zhou
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Experimental Teaching Demonstration Centre of Animal Science, Nanjing Agricultural University, Nanjing 210095, PR China
| | - Yuwei Wang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Experimental Teaching Demonstration Centre of Animal Science, Nanjing Agricultural University, Nanjing 210095, PR China
| | - Zongjun Yin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, PR China
| | - Yang Liu
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Experimental Teaching Demonstration Centre of Animal Science, Nanjing Agricultural University, Nanjing 210095, PR China.
| |
Collapse
|
8
|
Eiríksson JH, Su G, Strandén I, Christensen OF. Segregation between breeds and local breed proportions in genetic and genomic models for crossbreds. Genet Sel Evol 2023; 55:45. [PMID: 37407936 DOI: 10.1186/s12711-023-00810-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 05/04/2023] [Indexed: 07/07/2023] Open
Abstract
BACKGROUND The breeding value of a crossbred individual can be expressed as the sum of the contributions from each of the contributing pure breeds. In theory, the breeding value should account for segregation between breeds, which results from the difference in the mean contribution of loci between breeds, which in turn is caused by differences in allele frequencies between breeds. However, with multiple generations of crossbreeding, how to account for breed segregation in genomic models that split the breeding value of crossbreds based on breed origin of alleles (BOA) is not known. Furthermore, local breed proportions (LBP) have been modelled based on BOA and is a concept related to breed segregation. The objectives of this study were to explore the theoretical background of the effect of LBP and how it relates to breed segregation and to investigate how to incorporate breed segregation (co)variance in genomic BOA models. RESULTS We showed that LBP effects result from the difference in the mean contribution of loci between breeds in an additive genetic model, i.e. breed segregation effects. We found that the (co)variance structure for BS effects in genomic BOA models does not lead to relationship matrices that are positive semi-definite in all cases. However, by setting one breed as a reference breed, a valid (co)variance structure can be constructed by including LBP effects for all other breeds and assuming them to be correlated. We successfully estimated variance components for a genomic BOA model with LBP effects in a simulated example. CONCLUSIONS Breed segregation effects and LBP effects are two alternative ways to account for the contribution of differences in the mean effects of loci between breeds. When the covariance between LBP effects across breeds is included in the model, a valid (co)variance structure for LBP effects can be constructed by setting one breed as reference breed and fitting an LBP effect for each of the other breeds.
Collapse
Affiliation(s)
- Jón H Eiríksson
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark.
- Faculty of Agricultural Sciences, Agricultural University of Iceland, 311, Borgarnes, Iceland.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Ismo Strandén
- Natural Resources Institute Finland (Luke), 31600, Jokioinen, Finland
| | - Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| |
Collapse
|
9
|
Clasen JB, Fikse WF, Su G, Karaman E. Multibreed genomic prediction using summary statistics and a breed-origin-of-alleles approach. Heredity (Edinb) 2023; 131:33-42. [PMID: 37231157 PMCID: PMC10313778 DOI: 10.1038/s41437-023-00619-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 04/11/2023] [Accepted: 04/26/2023] [Indexed: 05/27/2023] Open
Abstract
Because of an increasing interest in crossbreeding between dairy breeds in dairy cattle herds, farmers are requesting breeding values for crossbred animals. However, genomically enhanced breeding values are difficult to predict in crossbred populations because the genetic make-up of crossbred individuals is unlikely to follow the same pattern as for purebreds. Furthermore, sharing genotype and phenotype information between breed populations are not always possible, which means that genetic merit (GM) for crossbred animals may be predicted without the information needed from some pure breeds, resulting in low prediction accuracy. This simulation study investigated the consequences of using summary statistics from single-breed genomic predictions for some or all pure breeds in two- and three-breed rotational crosses, rather than their raw data. A genomic prediction model taking into account the breed-origin of alleles (BOA) was considered. Because of a high genomic correlation between the breeds simulated (0.62-0.87), the prediction accuracies using the BOA approach were similar to a joint model, assuming homogeneous SNP effects for these breeds. Having a reference population with summary statistics available from all pure breeds and full phenotype and genotype information from crossbreds yielded almost as high prediction accuracies (0.720-0.768) as having a reference population with full information from all pure breeds and crossbreds (0.753-0.789). Lacking information from the pure breeds yielded much lower prediction accuracies (0.590-0.676). Furthermore, including crossbred animals in a combined reference population also benefitted prediction accuracies in the purebred animals, especially for the smallest breed population.
Collapse
Affiliation(s)
- J B Clasen
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 75007, Uppsala, Sweden.
- Center for Quantitative Genetics and Genomics, Aarhus University, C. F. Møllers Allé 8, DK-8000, Aarhus, Denmark.
| | - W F Fikse
- Växa Sverige, Swedish University of Agricultural Sciences, Ulls väg 26, 756 51, Uppsala, Sweden
| | - G Su
- Center for Quantitative Genetics and Genomics, Aarhus University, C. F. Møllers Allé 8, DK-8000, Aarhus, Denmark
| | - E Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, C. F. Møllers Allé 8, DK-8000, Aarhus, Denmark
| |
Collapse
|
10
|
Guillenea A, Lund MS, Evans R, Boerner V, Karaman E. A breed-of-origin of alleles model that includes crossbred data improves predictive ability for crossbred animals in a multi-breed population. Genet Sel Evol 2023; 55:34. [PMID: 37189059 PMCID: PMC10184430 DOI: 10.1186/s12711-023-00806-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 04/24/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND Recently, crossbred animals have begun to be used as parents in the next generations of dairy and beef cattle systems, which has increased the interest in predicting the genetic merit of those animals. The primary objective of this study was to investigate three available methods for genomic prediction of crossbred animals. In the first two methods, SNP effects from within-breed evaluations are used by weighting them by the average breed proportions across the genome (BPM method) or by their breed-of-origin (BOM method). The third method differs from the BOM in that it estimates breed-specific SNP effects using purebred and crossbred data, considering the breed-of-origin of alleles (BOA method). For within-breed evaluations, and thus for BPM and BOM, 5948 Charolais, 6771 Limousin and 7552 Others (a combined population of other breeds) were used to estimate SNP effects separately within each breed. For the BOA, the purebreds' data were enhanced with data from ~ 4K, ~ 8K or ~ 18K crossbred animals. For each animal, its predictor of genetic merit (PGM) was estimated by considering the breed-specific SNP effects. Predictive ability and absence of bias were estimated for crossbreds and the Limousin and Charolais animals. Predictive ability was measured as the correlation between PGM and the adjusted phenotype, while the regression of the adjusted phenotype on PGM was estimated as a measure of bias. RESULTS With BPM and BOM, the predictive abilities for crossbreds were 0.468 and 0.472, respectively, and with the BOA method, they ranged from 0.490 to 0.510. The performance of the BOA method improved as the number of crossbred animals in the reference increased and with the use of the correlated approach, in which the correlation of SNP effects across the genome of the different breeds was considered. The slopes of regression for PGM on adjusted phenotypes for crossbreds showed overdispersion of the genetic merits for all methods but this bias tended to be reduced by the use of the BOA method and by increasing the number of crossbred animals. CONCLUSIONS For the estimation of the genetic merit of crossbred animals, the results from this study suggest that the BOA method that accommodates crossbred data can yield more accurate predictions than the methods that use SNP effects from separate within-breed evaluations.
Collapse
Affiliation(s)
- Ana Guillenea
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark.
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Ross Evans
- ICBF, Link Road, Carrigrohane, Ballincollig, Co. Cork, P31 D452, Ireland
| | - Vinzent Boerner
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| |
Collapse
|
11
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
12
|
Hu G, Do DN, Manafiazar G, Kelvin AA, Sargolzaei M, Plastow G, Wang Z, Miar Y. Population genomics of American mink using genotype data. Front Genet 2023; 14:1175408. [PMID: 37274788 PMCID: PMC10234291 DOI: 10.3389/fgene.2023.1175408] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 04/14/2023] [Indexed: 06/07/2023] Open
Abstract
Understanding the genetic structure of the target population is critically important to develop an efficient genomic selection program in domestic animals. In this study, 2,973 American mink of six color types from two farms (Canadian Centre for Fur Animal Research (CCFAR), Truro, NS and Millbank Fur Farm (MFF), Rockwood, ON) were genotyped with the Affymetrix Mink 70K panel to compute their linkage disequilibrium (LD) patterns, effective population size (Ne), genetic diversity, genetic distances, and population differentiation and structure. The LD pattern represented by average r 2, decreased to <0.2 when the inter-marker interval reached larger than 350 kb and 650 kb for CCFAR and MFF, respectively, and suggested at least 7,700 and 4,200 single nucleotide polymorphisms (SNPs) be used to obtain adequate accuracy for genomic selection programs in CCFAR and MFF respectively. The Ne for five generations ago was estimated to be 76 and 91 respectively. Our results from genetic distance and diversity analyses showed that American mink of the various color types had a close genetic relationship and low genetic diversity, with most of the genetic variation occurring within rather than between color types. Three ancestral genetic groups was considered the most appropriate number to delineate the genetic structure of these populations. Black (in both CCFAR and MFF) and pastel color types had their own ancestral clusters, while demi, mahogany, and stardust color types were admixed with the three ancestral genetic groups. This study provided essential information to utilize the first medium-density SNP panel for American mink in their genomic studies.
Collapse
Affiliation(s)
- Guoyu Hu
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - Duy Ngoc Do
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - Ghader Manafiazar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - Alyson A. Kelvin
- Vaccine and Infectious Disease Organization (VIDO), University of Saskatchewan, Saskatoon, SK, Canada
| | - Mehdi Sargolzaei
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada
- Select Sires Inc, Plain City, OH, United States
| | - Graham Plastow
- Livestock Gentec, Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Zhiquan Wang
- Livestock Gentec, Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Younes Miar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| |
Collapse
|
13
|
Zhao W, Zhang Z, Ma P, Wang Z, Wang Q, Zhang Z, Pan Y. The effect of high-density genotypic data and different methods on joint genomic prediction: A case study in large white pigs. Anim Genet 2023; 54:45-54. [PMID: 36414135 DOI: 10.1111/age.13275] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 11/07/2022] [Accepted: 11/07/2022] [Indexed: 11/24/2022]
Abstract
Joint genomic prediction (GP) is an attractive method to improve the accuracy of GP by combining information from multiple populations. However, many factors can negatively influence the accuracy of joint GP, such as differences in linkage disequilibrium phasing between single nucleotide polymorphisms (SNPs) and causal variants, minor allele frequencies and causal variants' effect sizes across different populations. The objective of this study was to investigate whether the imputed high-density genotype data can improve the accuracy of joint GP using genomic best linear unbiased prediction (GBLUP), single-step GBLUP (ssGBLUP), multi-trait GBLUP (MT-GBLUP) and GBLUP based on genomic relationship matrix considering heterogenous minor allele frequencies across different populations (wGBLUP). Three traits, including days taken to reach slaughter weight, backfat thickness and loin muscle area, were measured on 67 276 Large White pigs from two different populations, for which 3334 were genotyped by SNP array. The results showed that a combined population could substantially improve the accuracy of GP compared with a single-population GP, especially for the population with a smaller size. The imputed SNP data had no effect for single population GP but helped to yield higher accuracy than the medium-density array data for joint GP. Of the four methods, ssGLBUP performed the best, but the advantage of ssGBLUP decreased as more individuals were genotyped. In some cases, MT-GBLUP and wGBLUP performed better than GBLUP. In conclusion, our results confirmed that joint GP could be beneficial from imputed high-density genotype data, and the wGBLUP and MT-GBLUP methods are promising for joint GP in pig breeding.
Collapse
Affiliation(s)
- Wei Zhao
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Zhenyang Zhang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Peipei Ma
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Zhen Wang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Qishan Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Zhe Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Yuchun Pan
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China.,Hainan Research Institute, Zhejiang University, Sanya, China
| |
Collapse
|
14
|
Ogawa S, Taniguchi Y, Watanabe T, Iwaisaki H. Fitting Genomic Prediction Models with Different Marker Effects among Prefectures to Carcass Traits in Japanese Black Cattle. Genes (Basel) 2022; 14:24. [PMID: 36672767 PMCID: PMC9859149 DOI: 10.3390/genes14010024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 12/16/2022] [Accepted: 12/20/2022] [Indexed: 12/25/2022] Open
Abstract
We fitted statistical models, which assumed single-nucleotide polymorphism (SNP) marker effects differing across the fattened steers marketed into different prefectures, to the records for cold carcass weight (CW) and marbling score (MS) of 1036, 733, and 279 Japanese Black fattened steers marketed into Tottori, Hiroshima, and Hyogo prefectures in Japan, respectively. Genotype data on 33,059 SNPs was used. Five models that assume only common SNP effects to all the steers (model 1), common effects plus SNP effects differing between the steers marketed into Hyogo prefecture and others (model 2), only the SNP effects differing between Hyogo steers and others (model 3), common effects plus SNP effects specific to each prefecture (model 4), and only the effects specific to each prefecture (model 5) were exploited. For both traits, slightly lower values of residual variance than that of model 1 were estimated when fitting all other models. Estimated genetic correlation among the prefectures in models 2 and 4 ranged to 0.53 to 0.71, all <0.8. These results might support that the SNP effects differ among the prefectures to some degree, although we discussed the necessity of careful consideration to interpret the current results.
Collapse
Affiliation(s)
- Shinichiro Ogawa
- Graduate School of Agriculture, Kyoto University, Kyoto 606-8502, Japan
- Division of Meat Animal and Poultry Research, Institute of Livestock and Grassland Science, Tsukuba 305-0901, Japan
| | - Yukio Taniguchi
- Graduate School of Agriculture, Kyoto University, Kyoto 606-8502, Japan
| | - Toshio Watanabe
- National Livestock Breeding Center, Fukushima 961-8511, Japan
- Maebashi Institute of Animal Science, Livestock Improvement Association of Japan, Inc., Maebashi 371-0121, Japan
| | - Hiroaki Iwaisaki
- Graduate School of Agriculture, Kyoto University, Kyoto 606-8502, Japan
- Sado Island Center for Ecological Sustainability, Niigata University, Niigata 952-0103, Japan
| |
Collapse
|
15
|
Eiríksson JH, Strandén I, Su G, Mäntysaari EA, Christensen OF. Local breed proportions and local breed heterozygosity in genomic predictions for crossbred dairy cows. J Dairy Sci 2022; 105:9822-9836. [DOI: 10.3168/jds.2022-22225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 07/19/2022] [Indexed: 11/06/2022]
|
16
|
Schmid M, Stock J, Bennewitz J, Wellmann R. Improving the Accuracy of Multi-Breed Prediction in Admixed Populations by Accounting for the Breed Origin of Haplotype Segments. Front Genet 2022; 13:840815. [PMID: 35401683 PMCID: PMC8987492 DOI: 10.3389/fgene.2022.840815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 03/04/2022] [Indexed: 11/13/2022] Open
Abstract
Numerically small breeds have often been upgraded with mainstream breeds. This historic introgression predisposes the breeds for joint genomic evaluations with mainstream breeds. The linkage disequilibrium structure differs between breeds. The marker effects of a haplotype segment may, therefore, depend on the breed from which the haplotype segment originates. An appropriate method for genomic evaluation would account for this dependency. This study proposes a method for the computation of genomic breeding values for small admixed breeds that incorporate phenotypic and genomic information from large introgressed breeds by considering the breed origin of alleles (BOA) in the evaluation. The proposed BOA model classifies haplotype segments according to their origins and assumes different but correlated SNP effects for the different origins. The BOA model was compared in a simulation study to conventional within-breed genomic best linear unbiased prediction (GBLUP) and conventional multi-breed GBLUP models. The BOA model outperformed within-breed GBLUP as well as multi-breed GBLUP in most cases.
Collapse
Affiliation(s)
- Markus Schmid
- Institute of Animal Science, Department of Animal Genetics and Breeding, University of Hohenheim, Stuttgart, Germany
| | - Joana Stock
- Institute of Animal Science, Department of Animal Genetics and Breeding, University of Hohenheim, Stuttgart, Germany
| | - Jörn Bennewitz
- Institute of Animal Science, Department of Animal Genetics and Breeding, University of Hohenheim, Stuttgart, Germany
| | - Robin Wellmann
- Institute of Animal Science, Department of Animal Genetics and Breeding, University of Hohenheim, Stuttgart, Germany
| |
Collapse
|
17
|
Eiríksson JH, Byskov K, Su G, Thomasen JR, Christensen OF. Genomic predictions for crossbred dairy cows by combining solutions from purebred evaluation based on breed origin of alleles. J Dairy Sci 2022; 105:5178-5191. [DOI: 10.3168/jds.2021-21644] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 02/21/2022] [Indexed: 11/19/2022]
|
18
|
Misztal I, Steyn Y, Lourenco D. Genomic evaluation with multibreed and crossbred data. JDS COMMUNICATIONS 2022; 3:156-159. [PMID: 36339739 PMCID: PMC9623721 DOI: 10.3168/jdsc.2021-0177] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 11/21/2021] [Indexed: 11/19/2022]
Abstract
We found low accuracy of genomic evaluation of crossbreds based on purebred data. We found higher accuracy for crossbreds based on crossbred data. Use of putative sequence variants had a small impact on genomic accuracy.
Several types of multibreed genomic evaluation are in use. These include evaluation of crossbreds based on purebred SNP effects, joint evaluation of all purebreds and crossbreds with a single additive effect, and treating each purebred and crossbred group as a separate trait. Additionally, putative quantitative trait nucleotides can be exploited to increase the accuracy of prediction. Existing studies indicate that the prediction of crossbreds based on purebred data has low accuracy, that a joint evaluation can potentially provide accurate evaluations for crossbreds but could lower accuracy for purebreds compared with single-breed evaluations, and that the use of putative quantitative trait nucleotides only marginally increases the accuracy. One hypothesis is that genomic selection is based on estimation of clusters of independent chromosome segments. Subsequently, predicting a particular group type would require a reference population of the same type, and crosses with same breed percentage but different type (F1 vs. F2) would, at best, use separate reference populations. The genomic selection of multibreed population is still an active research topic.
Collapse
|
19
|
Guillenea A, Su G, Lund MS, Karaman E. Genomic prediction in Nordic Red dairy cattle considering breed origin of alleles. J Dairy Sci 2022; 105:2426-2438. [PMID: 35033341 DOI: 10.3168/jds.2021-21173] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 11/23/2021] [Indexed: 01/02/2023]
Abstract
This study investigated the reliability of genomic prediction (GP) using breed origin of alleles (BOA) approach in the Nordic Red (RDC) population, which has an admixed population structure. The RDC population consists of animals with varying degrees of genetic materials from the Danish Red (RDM), Swedish Red (SRB), Finnish Ayrshire (FAY), and Holstein (HOL) because bulls have been used across the breeds. The BOA approach was tested using 39,550 RDC animals in the reference population and 11,786 in the validation population. Deregressed proofs (DRP) of milk, fat and protein were used as response variable for GP. Direct genomic breeding values (DGV) for animals in the validation population were calculated with (BOA model) or without (joint model) considering breed origin of alleles. The joint model assumed homogeneous marker effects and a single set of marker effects were estimated, whereas BOA model assumed heterogeneous marker effects, and different sets of marker effects were estimated across the breeds. For the BOA approach, we tested scenarios assuming both correlated (BOA_cor) and uncorrelated (BOA_uncor) marker effects between the breeds. Additionally, we investigated GP using a standard Illumina 50K chip and including SNP selected from imputed whole-genome sequencing (50K+WGS). We also studied the effect of estimating (co)variances for genome regions of different sizes to exploit the information of the genome regions contributing to the (co)variance between the breeds. Region sizes were set as 1 SNP, a group of 30 or 100 adjacent SNP, or the whole genome. Reliability of DGV was measured as squared correlations between DGV and DRP divided by the reliability of DRP. Across the 3 traits, in general, RS30 and RS100 SNP yielded the highest reliabilities. Including WGS SNP improved reliabilities in almost all scenarios (0.297 on average for 50K and 0.307 on average for 50K+WGS). The BOA_uncor (0.233 on average) was inferior to the joint model (0.339 on average), but the reliabilities obtained using BOA_cor (0.334 on average) in most cases were not significantly different from those obtained using the joint model. The results indicate that both including additional whole-genome sequencing SNP and dividing the genome into fixed regions improve GP in the RDC. The BOA models have the potential to increase the reliability of GP, but the benefit is limited in populations with a high exchange of genetic material for a long time, as is the case for RDC.
Collapse
Affiliation(s)
- Ana Guillenea
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Mogens Sand Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
20
|
Eiríksson JH, Karaman E, Su G, Christensen OF. Breed of origin of alleles and genomic predictions for crossbred dairy cows. Genet Sel Evol 2021; 53:84. [PMID: 34742238 PMCID: PMC8572482 DOI: 10.1186/s12711-021-00678-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 10/20/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In dairy cattle, genomic selection has been implemented successfully for purebred populations, but, to date, genomic estimated breeding values (GEBV) for crossbred cows are rarely available, although they are valuable for rotational crossbreeding schemes that are promoted as efficient strategies. An attractive approach to provide GEBV for crossbreds is to use estimated marker effects from the genetic evaluation of purebreds. The effects of each marker allele in crossbreds can depend on the breed of origin of the allele (BOA), thus applying marker effects based on BOA could result in more accurate GEBV than applying only proportional contribution of the purebreds. Application of BOA models in rotational crossbreeding requires methods for detecting BOA, but the existing methods have not been developed for rotational crossbreeding. Therefore, the aims of this study were to develop and test methods for detecting BOA in a rotational crossbreeding system, and to investigate methods for calculating GEBV for crossbred cows using estimated marker effects from purebreds. RESULTS For detecting BOA in crossbred cows from rotational crossbreeding for which pedigree is recorded, we developed the AllOr method based on the comparison of haplotypes in overlapping windows. To calculate the GEBV of crossbred cows, two models were compared: a BOA model where marker effects estimated from purebreds are combined based on the detected BOA; and a breed proportion model where marker effects are combined based on estimated breed proportions. The methods were tested on simulated data that mimic the first four generations of rotational crossbreeding between Holstein, Jersey and Red Dairy Cattle. The AllOr method detected BOA correctly for 99.6% of the marker alleles across the four crossbred generations. The reliability of GEBV was higher with the BOA model than with the breed proportion model for the four generations of crossbreeding, with the largest difference observed in the first generation. CONCLUSIONS In rotational crossbreeding for which pedigree is recorded, BOA can be accurately detected using the AllOr method. Combining marker effects estimated from purebreds to predict the breeding value of crossbreds based on BOA is a promising approach to provide GEBV for crossbred dairy cows.
Collapse
Affiliation(s)
- Jón H Eiríksson
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Ole F Christensen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
21
|
Lamonov SA, Skorkina IA. Efficiency of blood “refreshing” method in cattle pure breeding of the Simmental breed. BIO WEB OF CONFERENCES 2021. [DOI: 10.1051/bioconf/20213700102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Over the past two decades, pedigree cattle and sperm products have been imported into the Russian Federation from outstanding breeding bulls of the Simmental breed from Western Europe, including Austria, within the framework of the “AIC Development” national project. Specific productive and technological qualities are inherent in these animals due to the direction peculiarities of selection and breeding work with this breed in Austria. World experience has clearly proved that the future belongs to large dairy complexes and farms, which are fully mechanized and automated. Modern Simmental cattle (mostly cows) do not meet the requirements of intensive milk production technology for a number of productive and technological characteristics. Summarizing the world zootechnical experience, it is possible to determine the basic requirements for cows suitable for operation in conditions of advanced milk production technologies. Based on numerous studies, these animals are characterized by high milk production, good reproductive qualities, suitable for machine milking, with strong hoof horn, resistant to diseases and mastitis. Consequently, a significant role is given to the manning of a milking herd with highly productive and competitive Simmental cows, obtained as a result of improving the system of selection and breeding work to further increase the production of marketable milk. The conducted research result shows zootechnical efficiency of using such a breeding method as “refreshing” blood in the breeding process. In particular, the best indicators of milk yield for the first lactation are observed in cows descended from bullsproducers of Austrian selection, -4153.1 kg of natural fat milk. The morphological and functional characteristics of the udders of all experimental first-calf cows meet the requirements of suitability for machine milking. Cows of the Simmental breed obtained by the method of “blood refreshing” are the most cost-effective in the same conditions of feeding and keeping.
Collapse
|