1
|
Brzáková M, Veselá Z, Vařeka J, Bauer J. Improving Breeding Value Reliability with Genomic Data in Breeding Groups of Charolais. Genes (Basel) 2023; 14:2139. [PMID: 38136964 PMCID: PMC10743247 DOI: 10.3390/genes14122139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 11/24/2023] [Accepted: 11/24/2023] [Indexed: 12/24/2023] Open
Abstract
The aim of this study was to assess the impact of incorporating genomic data using the single-step genomic best linear unbiased prediction (ssGBLUP) method compared to the best linear unbiased prediction (BLUP) method on the reliability of breeding values for age at first calving, calving interval, and productive longevity at 78 months in Charolais cattle. The study included 48,590 purebred Charolais individuals classified into four subgroups based on genotyping and performance records. The results showed that considering genotypes significantly improved genomic estimated breeding values (GEBV) reliability across all categories except nongenotyped individuals. For young genotyped individuals, the increase in reliability was up to 27% for both sexes. The highest average reliability was achieved for genotyped proven bulls and cows with performance records, and the inclusion of genomic data further improved the reliability by up to 22% and 21% for cows and bulls, respectively. The gain in reliability was observed mainly during the first three calvings, and then the differences decreased. The imported individuals showed lower estimated breeding values (EBV) and GEBV reliabilities than the domestic population, probably due to the weak genetic connection with the domestic population. However, when the progeny of imported heifers were sired by domestic bulls, the reliability increased by up to 24%. For nongenotyped individuals, only a slight increase in reliability was observed; however, the number of genotyped individuals in the population was still relatively small.
Collapse
Affiliation(s)
- Michaela Brzáková
- Department of Genetics and Breeding of Farm Animals, Institute of Animal Science, 104 00 Prague, Czech Republic; (Z.V.); (J.V.)
| | - Zdeňka Veselá
- Department of Genetics and Breeding of Farm Animals, Institute of Animal Science, 104 00 Prague, Czech Republic; (Z.V.); (J.V.)
| | - Jan Vařeka
- Department of Genetics and Breeding of Farm Animals, Institute of Animal Science, 104 00 Prague, Czech Republic; (Z.V.); (J.V.)
| | - Jiří Bauer
- Czech-Moravian Breeders’ Corporation, 252 09 Hradištko, Czech Republic;
| |
Collapse
|
2
|
Fernández-González J, Akdemir D, Isidro Y Sánchez J. A comparison of methods for training population optimization in genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:30. [PMID: 36892603 PMCID: PMC9998580 DOI: 10.1007/s00122-023-04265-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 11/21/2022] [Indexed: 06/18/2023]
Abstract
Maximizing CDmean and Avg_GRM_self were the best criteria for training set optimization. A training set size of 50-55% (targeted) or 65-85% (untargeted) is needed to obtain 95% of the accuracy. With the advent of genomic selection (GS) as a widespread breeding tool, mechanisms to efficiently design an optimal training set for GS models became more relevant, since they allow maximizing the accuracy while minimizing the phenotyping costs. The literature described many training set optimization methods, but there is a lack of a comprehensive comparison among them. This work aimed to provide an extensive benchmark among optimization methods and optimal training set size by testing a wide range of them in seven datasets, six different species, different genetic architectures, population structure, heritabilities, and with several GS models to provide some guidelines about their application in breeding programs. Our results showed that targeted optimization (uses information from the test set) performed better than untargeted (does not use test set data), especially when heritability was low. The mean coefficient of determination was the best targeted method, although it was computationally intensive. Minimizing the average relationship within the training set was the best strategy for untargeted optimization. Regarding the optimal training set size, maximum accuracy was obtained when the training set was the entire candidate set. Nevertheless, a 50-55% of the candidate set was enough to reach 95-100% of the maximum accuracy in the targeted scenario, while we needed a 65-85% for untargeted optimization. Our results also suggested that a diverse training set makes GS robust against population structure, while including clustering information was less effective. The choice of the GS model did not have a significant influence on the prediction accuracies.
Collapse
Affiliation(s)
- Javier Fernández-González
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| | - Deniz Akdemir
- CIBMTR (Center for International Blood and Marrow Transplant Research), National Marrow Donor Program/Be The Match, Minneapolis, USA
| | - Julio Isidro Y Sánchez
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| |
Collapse
|
3
|
Ning C, Xie K, Huang J, Di Y, Wang Y, Yang A, Hu J, Zhang Q, Wang D, Fan X. Marker density and statistical model designs to increase accuracy of genomic selection for wool traits in Angora rabbits. Front Genet 2022; 13:968712. [PMID: 36118881 PMCID: PMC9478554 DOI: 10.3389/fgene.2022.968712] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 08/17/2022] [Indexed: 11/13/2022] Open
Abstract
The Angora rabbit, a well-known breed for fiber production, has been undergoing traditional breeding programs relying mainly on phenotypes. Genomic selection (GS) uses genomic information and promises to accelerate genetic gain. Practically, to implement GS in Angora rabbit breeding, it is necessary to evaluate different marker densities and GS models to develop suitable strategies for an optimized breeding pipeline. Considering a lack in microarray, low-coverage sequencing combined with genotype imputation was used to boost the number of SNPs across the rabbit genome. Here, in a population of 629 Angora rabbits, a total of 18,577,154 high-quality SNPs were imputed (imputation accuracy above 98%) based on low-coverage sequencing of 3.84X genomic coverage, and wool traits and body weight were measured at 70, 140 and 210 days of age. From the original markers, 0.5K, 1K, 3K, 5K, 10K, 50K, 100K, 500K, 1M and 2M were randomly selected and evaluated, resulting in 50K markers as the baseline for the heritability estimation and genomic prediction. Comparing to the GS performance of single-trait models, the prediction accuracy of nearly all traits could be improved by multi-trait models, which might because multiple-trait models used information from genetically correlated traits. Furthermore, we observed high significant negative correlation between the increased prediction accuracy from single-trait to multiple-trait models and estimated heritability. The results indicated that low-heritability traits could borrow more information from correlated traits and hence achieve higher prediction accuracy. The research first reported heritability estimation in rabbits by using genome-wide markers, and provided 50K as an optimal marker density for further microarray design, genetic evaluation and genomic selection in Angora rabbits. We expect that the work could provide strategies for GS in early selection, and optimize breeding programs in rabbits.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Dan Wang
- *Correspondence: Dan Wang, ; Xinzhong Fan,
| | | |
Collapse
|
4
|
Salek Ardestani S, Jafarikia M, Sargolzaei M, Sullivan B, Miar Y. Genomic Prediction of Average Daily Gain, Back-Fat Thickness, and Loin Muscle Depth Using Different Genomic Tools in Canadian Swine Populations. Front Genet 2021; 12:665344. [PMID: 34149806 PMCID: PMC8209496 DOI: 10.3389/fgene.2021.665344] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 04/15/2021] [Indexed: 12/12/2022] Open
Abstract
Improvement of prediction accuracy of estimated breeding values (EBVs) can lead to increased profitability for swine breeding companies. This study was performed to compare the accuracy of different popular genomic prediction methods and traditional best linear unbiased prediction (BLUP) for future performance of back-fat thickness (BFT), average daily gain (ADG), and loin muscle depth (LMD) in Canadian Duroc, Landrace, and Yorkshire swine breeds. In this study, 17,019 pigs were genotyped using Illumina 60K and Affymetrix 50K panels. After quality control and imputation steps, a total of 41,304, 48,580, and 49,102 single-nucleotide polymorphisms remained for Duroc (n = 6,649), Landrace (n = 5,362), and Yorkshire (n = 5,008) breeds, respectively. The breeding values of animals in the validation groups (n = 392–774) were predicted before performance test using BLUP, BayesC, BayesCπ, genomic BLUP (GBLUP), and single-step GBLUP (ssGBLUP) methods. The prediction accuracies were obtained using the correlation between the predicted breeding values and their deregressed EBVs (dEBVs) after performance test. The genomic prediction methods showed higher prediction accuracies than traditional BLUP for all scenarios. Although the accuracies of genomic prediction methods were not significantly (P > 0.05) different, ssGBLUP was the most accurate method for Duroc-ADG, Duroc-LMD, Landrace-BFT, Landrace-ADG, and Yorkshire-BFT scenarios, and BayesCπ was the most accurate method for Duroc-BFT, Landrace-LMD, and Yorkshire-ADG scenarios. Furthermore, BayesCπ method was the least biased method for Duroc-LMD, Landrace-BFT, Landrace-ADG, Yorkshire-BFT, and Yorkshire-ADG scenarios. Our findings can be beneficial for accelerating the genetic progress of BFT, ADG, and LMD in Canadian swine populations by selecting more accurate and unbiased genomic prediction methods.
Collapse
Affiliation(s)
| | - Mohsen Jafarikia
- Canadian Centre for Swine Improvement, Ottawa, ON, Canada.,Centre for Genetic Improvement of Livestock (CGIL), Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada
| | - Mehdi Sargolzaei
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada.,Select Sires Inc., Plain City, OH, United States
| | - Brian Sullivan
- Canadian Centre for Swine Improvement, Ottawa, ON, Canada
| | - Younes Miar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| |
Collapse
|
5
|
Karimi K, Ngoc Do D, Sargolzaei M, Miar Y. Population Genomics of American Mink Using Whole Genome Sequencing Data. Genes (Basel) 2021; 12:genes12020258. [PMID: 33670138 PMCID: PMC7916864 DOI: 10.3390/genes12020258] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Revised: 02/04/2021] [Accepted: 02/05/2021] [Indexed: 12/11/2022] Open
Abstract
Characterizing the genetic structure and population history can facilitate the development of genomic breeding strategies for the American mink. In this study, we used the whole genome sequences of 100 mink from the Canadian Centre for Fur Animal Research (CCFAR) at the Dalhousie Faculty of Agriculture (Truro, NS, Canada) and Millbank Fur Farm (Rockwood, ON, Canada) to investigate their population structure, genetic diversity and linkage disequilibrium (LD) patterns. Analysis of molecular variance (AMOVA) indicated that the variation among color-types was significant (p < 0.001) and accounted for 18% of the total variation. The admixture analysis revealed that assuming three ancestral populations (K = 3) provided the lowest cross-validation error (0.49). The effective population size (Ne) at five generations ago was estimated to be 99 and 50 for CCFAR and Millbank Fur Farm, respectively. The LD patterns revealed that the average r2 reduced to <0.2 at genomic distances of >20 kb and >100 kb in CCFAR and Millbank Fur Farm suggesting that the density of 120,000 and 24,000 single nucleotide polymorphisms (SNP) would provide the adequate accuracy of genomic evaluation in these populations, respectively. These results indicated that accounting for admixture is critical for designing the SNP panels for genotype-phenotype association studies of American mink.
Collapse
Affiliation(s)
- Karim Karimi
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS B2N 5E3, Canada; (K.K.); (D.N.D.)
| | - Duy Ngoc Do
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS B2N 5E3, Canada; (K.K.); (D.N.D.)
| | - Mehdi Sargolzaei
- Department of Pathobiology, University of Guelph, Guelph, ON N1G 2W1, Canada;
- Select Sires Inc., Plain City, OH 43064, USA
| | - Younes Miar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS B2N 5E3, Canada; (K.K.); (D.N.D.)
- Correspondence:
| |
Collapse
|
6
|
Karimi K, Farid AH, Myles S, Miar Y. Detection of selection signatures for response to Aleutian mink disease virus infection in American mink. Sci Rep 2021; 11:2944. [PMID: 33536540 PMCID: PMC7859209 DOI: 10.1038/s41598-021-82522-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Accepted: 01/21/2021] [Indexed: 02/06/2023] Open
Abstract
Aleutian disease (AD) is the most significant health issue for farmed American mink. The objective of this study was to identify the genomic regions subjected to selection for response to infection with Aleutian mink disease virus (AMDV) in American mink using genotyping by sequencing (GBS) data. A total of 225 black mink were inoculated with AMDV and genotyped using a GBS assay based on the sequencing of ApeKI-digested libraries. Five AD-characterized phenotypes were used to assign animals to pairwise groups. Signatures of selection were detected using integrated measurement of fixation index (FST) and nucleotide diversity (θπ), that were validated by haplotype-based (hap-FLK) test. The total of 99 putatively selected regions harbouring 63 genes were detected in different groups. The gene ontology revealed numerous genes related to immune response (e.g. TRAF3IP2, WDR7, SWAP70, CBFB, and GPR65), liver development (e.g. SULF2, SRSF5) and reproduction process (e.g. FBXO5, CatSperβ, CATSPER4, and IGF2R). The hapFLK test supported two strongly selected regions that contained five candidate genes related to immune response, virus–host interaction, reproduction and liver regeneration. This study provided the first map of putative selection signals of response to AMDV infection in American mink, bringing new insights into genomic regions controlling the AD phenotypes.
Collapse
Affiliation(s)
- Karim Karimi
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - A Hossain Farid
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - Sean Myles
- Department of Plant, Food, and Environmental Sciences, Dalhousie University, Truro, NS, Canada
| | - Younes Miar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada.
| |
Collapse
|
7
|
Aliakbari A, Delpuech E, Labrune Y, Riquet J, Gilbert H. The impact of training on data from genetically-related lines on the accuracy of genomic predictions for feed efficiency traits in pigs. Genet Sel Evol 2020; 52:57. [PMID: 33028194 PMCID: PMC7539441 DOI: 10.1186/s12711-020-00576-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 09/21/2020] [Indexed: 01/08/2023] Open
Abstract
Background Most genomic predictions use a unique population that is split into a training and a validation set. However, genomic prediction using genetically heterogeneous training sets could provide more flexibility when constructing the training sets in small populations. The aim of our study was to investigate the potential of genomic prediction of feed efficiency related traits using training sets that combine animals from two different, but genetically-related lines. We compared realized prediction accuracy and prediction bias for different training set compositions for five production traits. Results Genomic breeding values (GEBV) were predicted using the single-step genomic best linear unbiased prediction method in six scenarios applied iteratively to two genetically-related lines (i.e. 12 scenarios). The objective for all scenarios was to predict GEBV of pigs in the last three generations (~ 400 pigs, G7 to G9) of a given line. For each line, a control scenario was set up with a training set that included only animals from that line (target line). For all traits, adding more animals from the other line to the training set did not increase prediction accuracy compared to the control scenario. A small decrease in prediction accuracies was found for average daily gain, backfat thickness, and daily feed intake as the number of animals from the target line decreased in the training set. Including more animals from the other line did not decrease prediction accuracy for feed conversion ratio and residual feed intake, which were both highly affected by selection within lines. However, prediction biases were systematic for these cases and might be reduced with bivariate analyses. Conclusions Our results show that genomic prediction using a training set that includes animals from genetically-related lines can be as accurate as genomic prediction using a training set from the target population. With combined reference sets, accuracy increased for traits that were highly affected by selection. Our results provide insights into the design of reference populations, especially to initiate genomic selection in small-sized lines, for which the number of historical samples is small and that are developed simultaneously. This applies especially to poultry and pig breeding and to other crossbreeding schemes.
Collapse
Affiliation(s)
- Amir Aliakbari
- GenPhySE, Université de Toulouse, INRAE, 31326, Castanet-Tolosan, France.
| | - Emilie Delpuech
- GenPhySE, Université de Toulouse, INRAE, 31326, Castanet-Tolosan, France
| | - Yann Labrune
- GenPhySE, Université de Toulouse, INRAE, 31326, Castanet-Tolosan, France
| | - Juliette Riquet
- GenPhySE, Université de Toulouse, INRAE, 31326, Castanet-Tolosan, France
| | - Hélène Gilbert
- GenPhySE, Université de Toulouse, INRAE, 31326, Castanet-Tolosan, France
| |
Collapse
|
8
|
Karimi K, Farid AH, Sargolzaei M, Myles S, Miar Y. Linkage Disequilibrium, Effective Population Size and Genomic Inbreeding Rates in American Mink Using Genotyping-by-Sequencing Data. Front Genet 2020; 11:223. [PMID: 32231688 PMCID: PMC7083153 DOI: 10.3389/fgene.2020.00223] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Accepted: 02/26/2020] [Indexed: 12/11/2022] Open
Abstract
Knowledge of linkage disequilibrium (LD) patterns is necessary to determine the minimum density of markers required for genomic studies and to infer historical changes as well as inbreeding events in the populations. In this study, we used genotyping-by-sequencing (GBS) approach to detect single nucleotide polymorphisms (SNPs) across American mink genome and further to estimate LD, effective population size (Ne), and inbreeding rates based on excess of homozygosity (FHOM) and runs of homozygosity (ROH). A GBS assay was constructed based on the sequencing of ApeKI-digested libraries from 285 American mink using Illumina HiSeq Sequencer. Data of 13,321 SNPs located on 46 scaffolds was used to perform LD analysis. The average LD (r2 ± SD) between adjacent SNPs was 0.30 ± 0.35 over all scaffolds with an average distance of 51 kb between markers. The average r2 < 0.2 was observed at inter-marker distances of >40 kb, suggesting that at least 60,000 informative SNPs would be required for genomic selection in American mink. The Ne was estimated to be 116 at five generations ago. In addition, the most rapid decline of population size was observed between 100 and 200 generations ago. Our results showed that short extensions of homozygous genotypes (500 kb to 1 Mb) were abundant across the genome and accounted for 33% of all ROH identified. The average inbreeding coefficient based on ROH longer than 1 Mb was 0.132 ± 0.042. The estimations of FHOM ranged from −0.44 to 0.34 among different samples with an average of 0.15 over all individuals. This study provided useful insights to determine the density of SNP panel providing enough statistical power and accuracy in genomic studies of American mink. Moreover, these results confirmed that GBS approach can be considered as a useful tool for genomic studies in American mink.
Collapse
Affiliation(s)
- Karim Karimi
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - A Hossain Farid
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| | - Mehdi Sargolzaei
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada.,Select Sires Inc., Plain City, OH, United States
| | - Sean Myles
- Department of Plant, Food, and Environmental Sciences, Dalhousie University, Truro, NS, Canada
| | - Younes Miar
- Department of Animal Science and Aquaculture, Dalhousie University, Truro, NS, Canada
| |
Collapse
|
9
|
Do DN, Miar Y. Evaluation of Growth Curve Models for Body Weight in American Mink. Animals (Basel) 2019; 10:ani10010022. [PMID: 31877627 PMCID: PMC7023449 DOI: 10.3390/ani10010022] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2019] [Revised: 12/04/2019] [Accepted: 12/17/2019] [Indexed: 01/26/2023] Open
Abstract
Modelling the growth curves of animals is important for optimizing the management and efficiency of animal production; however, little is known about the growth curves in American mink (Neovison vison). The study evaluated the performances of four three-parameter (Logistic, Gompertz, von Bertalanffy, and Brody), four four-parameter (Richards, Weibull, Bridges, and Janoscheck) and two polynomial models for describing the growth curves in mink. Body weights were collected from the third week of life to the week 31 in 738 black mink (373 males and 365 females). Models were fitted using the nls and nlsLM functions in stats and minpack.lm packages in R software, respectively. The Akaike's information criterion (AIC) and Bayesian information criterion (BIC) were used for model comparison. Based on these criteria, Logistic and Richards were the best models for males and females, respectively. Four-parameter models had better performance compared to the other models, except Logistic model. The estimated maximum weight and mature growth rate varied among the models and differed between males and females. The results indicated that males and females had different growth curves as males grew faster and reached to the maximum body weight later compared to females. Further studies on genetic parameters and selection response for growth curve parameters are required for development of selection programs based on the shape of growth curves in mink.
Collapse
|