Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Nilforooshan MA. Short Communication: Reduced GBLUP equations to core animals in the algorithm for proven and young (APY). Vet Anim Sci 2024;23:100334. [PMID: 38283332 PMCID: PMC10820638 DOI: 10.1016/j.vas.2024.100334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/02/2024] [Accepted: 01/02/2024] [Indexed: 01/30/2024] Open

Vandenplas J, Ten Napel J, Darbaghshahi SN, Evans R, Calus MPL, Veerkamp R, Cromie A, Mäntysaari EA, Strandén I. Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates. Genet Sel Evol 2023;55:37. [PMID: 37291510 DOI: 10.1186/s12711-023-00808-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 04/28/2023] [Indexed: 06/10/2023] Open

Abstract

BACKGROUND

Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype and progeny, so-called genotyped selection candidates, can become available after genomic breeding values have been estimated by ssGBLUP. In some breeding programmes, genomic estimated breeding values (GEBV) for these animals should be known shortly after obtaining genotype information but recomputing GEBV using the full ssGBLUP takes too much time. In this study, first we compare two equivalent formulations of ssGBLUP models, i.e. one that is based on the Woodbury matrix identity applied to the inverse of the genomic relationship matrix, and one that is based on marker equations. Second, we present computationally-fast approaches to indirectly compute GEBV for genotyped selection candidates, without the need to do the full ssGBLUP evaluation.

RESULTS

The indirect approaches use information from the latest ssGBLUP evaluation and rely on the decomposition of GEBV into its components. The two equivalent ssGBLUP models and indirect approaches were tested on a six-trait calving difficulty model using Irish dairy and beef cattle data that include 2.6 million genotyped animals of which about 500,000 were considered as genotyped selection candidates. When using the same computational approaches, the solving phase of the two equivalent ssGBLUP models showed similar requirements for memory and time per iteration. The computational differences between them were due to the preprocessing phase of the genomic information. Regarding the indirect approaches, compared to GEBV obtained from single-step evaluations including all genotypes, indirect GEBV had correlations higher than 0.99 for all traits while showing little dispersion and level bias.

CONCLUSIONS

In conclusion, ssGBLUP predictions for the genotyped selection candidates were accurately approximated using the presented indirect approaches, which are more memory efficient and computationally fast, compared to solving a full ssGBLUP evaluation. Thus, indirect approaches can be used even on a weekly basis to estimate GEBV for newly genotyped animals, while the full single-step evaluation is done only a few times within a year.

Collapse

Osawa T, Masuda Y, Saburi J, Hirumachi K. Application of single-step single nucleotide polymorphism best linear unbiased predictor model with unknown-parent groups for type traits in Japanese Holsteins. J Dairy Sci 2023:S0022-0302(23)00291-6. [PMID: 37268563 DOI: 10.3168/jds.2022-22541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 01/30/2023] [Indexed: 06/04/2023]

Abstract

The objectives of this study were to investigate the computational performance and the predictive ability and bias of a single-step SNP BLUP model (ssSNPBLUP) in genotyped young animals with unknown-parent groups (UPG) for type traits, using national genetic evaluation data from the Japanese Holstein population. The phenotype, genotype, and pedigree data were the same as those used in a national genetic evaluation of linear type traits classified between April 1984 and December 2020. In the current study, 2 data sets were prepared: the full data set containing all entries up to December 2020 and a truncated data set ending with December 2016. Genotyped animals were classified into 3 types: sires with classified daughters (S), cows with records (C), and young animals (Y). The computing performance and prediction accuracy of ssSNPBLUP were compared for the following 3 groups of genotyped animals: sires with classified daughters and young animals (SY); cows with records and young animals (CY); and sires with classified daughters, cows with records, and young animals (SCY). In addition, we tested 3 parameters of residual polygenic variance in ssSNPBLUP (0.1, 0.2, or 0.3). Daughter yield deviations (DYD) for the validation bulls and phenotypes adjusted for all fixed effects and random effects other than animal and residual (Y_adj) for the validation cows were obtained using the full data set from the pedigree-based BLUP model. The regression coefficients of DYD for bulls (or Y_adj for cows) on the genomic estimated breeding value (GEBV) using the truncated data set were used to measure the inflation of the predictions of young animals. The coefficient of determination of DYD on GEBV was used to measure the predictive ability of the predictions for the validation bulls. The reliability of the predictions for the validation cows was calculated as the square of the correlation between Y_adj and GEBV divided by heritability. The predictive ability was highest in the SCY group and lowest in the CY group. However, minimal difference was found in predictive abilities with or without UPG models using different parameters of residual polygenic variance. The regression coefficients approached 1.0 as the parameter of residual polygenic variance increased, but regression coefficients were mostly similar regardless of the use of UPG across the groups of genotyped animals. The ssSNPBLUP model, including UPG, was demonstrated as feasible for implementation in the national evaluation of type traits in Japanese Holsteins.

Collapse

Bussiman F, Chen CY, Holl J, Bermann M, Legarra A, Misztal I, Lourenco D. Boundaries for genotype, phenotype, and pedigree truncation in genomic evaluations in pigs. J Anim Sci 2023;101:skad273. [PMID: 37584978 PMCID: PMC10464514 DOI: 10.1093/jas/skad273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 08/10/2023] [Indexed: 08/17/2023] Open

Abstract

Historical data collection for genetic evaluation purposes is a common practice in animal populations; however, the larger the dataset, the higher the computing power needed to perform the analyses. Also, fitting the same model to historical and recent data may be inappropriate. Data truncation can reduce the number of equations to solve, consequently decreasing computing costs; however, the large volume of genotypes is responsible for most of the increase in computations. This study aimed to assess the impact of removing genotypes along with phenotypes and pedigree on the computing performance, reliability, and inflation of genomic predicted breeding value (GEBV) from single-step genomic best linear unbiased predictor for selection candidates. Data from two pig lines, a terminal sire (L1) and a maternal line (L2), were analyzed in this study. Four analyses were implemented: growth and "weaning to finish" mortality on L1, pre-weaning and reproductive traits on L2. Four genotype removal scenarios were proposed: removing genotyped animals without phenotypes and progeny (noInfo), removing genotyped animals based on birth year (Age), the combination of noInfo and Age scenarios (noInfo + Age), and no genotype removal (AllGen). In all scenarios, phenotypes were removed, based on birth year, and three pedigree depths were tested: two and three generations traced back and using the entire pedigree. The full dataset contained 1,452,257 phenotypes for growth traits, 324,397 for weaning to finish mortality, 517,446 for pre-weaning traits, and 7,853,629 for reproductive traits in pure and crossbred pigs. Pedigree files for lines L1 and L2 comprised 3,601,369 and 11,240,865 animals, of which 168,734 and 170,121 were genotyped, respectively. In each truncation scenario, the linear regression method was used to assess the reliability and dispersion of GEBV for genotyped parents (born after 2019). The number of years of data that could be removed without harming reliability depended on the number of records, type of analyses (multitrait vs. single trait), the heritability of the trait, and data structure. All scenarios had similar reliabilities, except for noInfo, which performed better in the growth analysis. Based on the data used in this study, considering the last ten years of phenotypes, tracing three generations back in the pedigree, and removing genotyped animals not contributing own or progeny phenotypes, increases computing efficiency with no change in the ability to predict breeding values.

Collapse

Bermann M, Cesarani A, Misztal I, Lourenco D. Past, present, and future developments in single-step genomic models. ITALIAN JOURNAL OF ANIMAL SCIENCE 2022. [DOI: 10.1080/1828051x.2022.2053366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Gutiérrez-Reinoso MA, Aponte PM, García-Herreros M. A review of inbreeding depression in dairy cattle: current status, emerging control strategies, and future prospects. J DAIRY RES 2022;89:1-10. [PMID: 35225176 DOI: 10.1017/s0022029922000188] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Abstract

Dairy cattle breeding has historically focused on relatively small numbers of elite bulls as sires of sons. In recent years, even if generation intervals were reduced and more diverse sires of sons could have been selected, genomic selection has not fundamentally changed the fact that a large number of individuals are being analyzed. However, a relatively small number of elite bulls are still siring those animals. Therefore inbreeding-derived negative consequences in the gene pool have brought concern. The detrimental effects of non-additive genetic changes such as inbreeding depression and dominance have been widely disseminated while seriously affecting bioeconomically important parameters because of an antagonistic relationship between dairy production and reproductive traits. Therefore, the estimation of benefits and limitations of inbreeding and variance of the selection response deserves to be evaluated and discussed to preserve genetic variability, a significant concern in the selection of individuals for reproduction and production. Short-term strategies for genetic merit improvement through modern breeding programs have severely lowered high-producing dairy cattle fertility potential. Since the current selection programs potentially increase long-term costs, genetic diversity has decreased globally as a consequence. Therefore, a greater understanding of the potential that selection programs have for supporting long-term genetic sustainability and genetic diversity among dairy cattle populations should be prioritized in managing farm profitability. The present review provides a broad approach to current inbreeding-derived problems, identifying critical points to be solved and possible alternative strategies to control selection against homozygous haplotypes while maintaining sustained selection pressure. Moreover, this manuscript explores future perspectives, emphasizing theoretical applications and critical points, and strategies to avoid the adverse effects of inbreeding in dairy cattle. Finally, this review provides an overview of challenges that will soon require multidisciplinary approaches to managing dairy cattle populations, intending to combine increases in productive trait phenotypes with improvements in reproductive, health, welfare, linear conformation, and adaptability traits into the foreseeable future.

Collapse

Masuda Y, VanRaden PM, Tsuruta S, Lourenco DAL, Misztal I. Invited review: Unknown-parent groups and metafounders in single-step genomic BLUP. J Dairy Sci 2021;105:923-939. [PMID: 34799109 DOI: 10.3168/jds.2021-20293] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 09/26/2021] [Indexed: 11/19/2022]

Abstract

Single-step genomic BLUP (ssGBLUP) is a method for genomic prediction that integrates matrices of pedigree (A) and genomic (G) relationships into a single unified additive relationship matrix whose inverse is incorporated into a set of mixed model equations (MME) to compute genomic predictions. Pedigree information in dairy cattle is often incomplete. Missing pedigree potentially causes biases and inflation in genomic estimated breeding values (GEBV) obtained with ssGBLUP. Three major issues are associated with missing pedigree in ssGBLUP, namely biased predictions by selection, missing inbreeding in pedigree relationships, and incompatibility between G and A in level and scale. These issues can be solved using a proper model for unknown-parent groups (UPG). The theory behind the use of UPG is well established for pedigree BLUP, but not for ssGBLUP. This study reviews the development of the UPG model in pedigree BLUP, the properties of UPG models in ssGBLUP, and the effect of UPG on genetic trends and genomic predictions. Similarities and differences between UPG and metafounder (MF) models, a generalized UPG model, are also reviewed. A UPG model (QP) derived using a transformation of the MME has a good convergence behavior. However, with insufficient data, the QP model may yield biased genetic trends and may underestimate UPG. The QP model can be altered by removing the genomic relationships linking GEBV and UPG effects from MME. This altered QP model exhibits less bias in genetic trends and less inflation in genomic predictions than the QP model, especially with large data sets. Recently, a new model, which encapsulates the UPG equations into the pedigree relationships for genotyped animals, was proposed in simulated purebred populations. The MF model is a comprehensive solution to the missing pedigree issue. This model can be a choice for multibreed or crossbred evaluations if the data set allows the estimation of a reasonable relationship matrix for MF. Missing pedigree influences genetic trends, but its effect on the predictability of genetic merit for genotyped animals should be negligible when many proven bulls are genotyped. The SNP effects can be back-solved using GEBV from older genotyped animals, and these predicted SNP effects can be used to calculate GEBV for young-genotyped animals with missing parents.

Collapse

Tsuruta S, Lourenco D, Masuda Y, Lawlor T, Misztal I. Reducing computational cost of large-scale genomic evaluation by using indirect genomic prediction. JDS COMMUNICATIONS 2021;2:356-360. [PMID: 36337117 PMCID: PMC9623783 DOI: 10.3168/jdsc.2021-0097] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 06/27/2021] [Indexed: 06/10/2023]

Abstract

Over half a million Holsteins are being genotyped annually in the United States. The computational cost of including all genotypes in single-step genomic (ssG)BLUP is high, although it is feasible to conduct large-scale genomic prediction using an efficient algorithm such as APY (algorithm for proven and young). An effective method to further reduce the computing cost could be the use of indirect genomic predictions (IGP) for genotyped animals when they have neither progeny nor phenotypes. These young genotyped animals have no effect on the other genotyped animals and could have their genomic prediction done indirectly. The main objective of this study was to calculate IGP for various groups of genotyped animals and investigate the reduction in computing time as well as bias and accuracy of the IGP. We compared IGP with genomic (G)EBV for 18 linear type traits in US Holsteins, including 2.3 million (M) genotyped animals. The full data set consisted of 10.9M records for 18 linear type traits up to 2018 calving, 13.6M animals in the pedigree, and 2.3M animals genotyped for 79K SNP. For IGP, ssGBLUP included all genotyped animals except those with neither progeny nor phenotypes by year from 2014 to 2018 (i.e., the target animals). The SNP marker effects were computed based on GEBV for genotyped animals that had progeny, or phenotypes, or both. Further, IGP were calculated for target genotyped animals in each year group. For all genotyped animal groups from 2014 to 2018, the coefficients of determination (R²) of a linear regression of GEBV on IGP were 0.960 for males and 0.954 for females for 18 traits on average. To reduce computing costs, the SNP marker effects were calculated based on GEBV from randomly selected genotyped animals from 15K to 60K. By randomly selecting a small number of genotyped animals, the computing time was dramatically reduced. As more genotyped animals were randomly selected to calculate SNP effects, R² was higher (more accurate) and the regression coefficient was lower (more inflated IGP). In a practical genomic evaluation in US Holsteins, to get sufficient contributions from GEBV, 25K to 35K is a rational number of genotyped animals that can be randomly selected to compute SNP effects and obtain accurate and unbiased IGP. Considering the computing time and both unbiasedness and accuracy of IGP, genomic evaluation can be conducted separately in GEBV for genotyped animals with phenotypes or progeny and in IGP for young genotyped animals. This can be a practical solution when conducting a large-scale genomic evaluation and would enable more frequent evaluation at lower cost, especially when many genotyped animals have neither phenotypes nor progeny.

Collapse

Koivula M, Strandén I, Aamand GP, Mäntysaari EA. Practical implementation of genetic groups in single-step genomic evaluations with Woodbury matrix identity-based genomic relationship inverse. J Dairy Sci 2021;104:10049-10058. [PMID: 34099294 DOI: 10.3168/jds.2020-19821] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 04/22/2021] [Indexed: 11/19/2022]

Gutierrez-Reinoso MA, Aponte PM, Garcia-Herreros M. Genomic Analysis, Progress and Future Perspectives in Dairy Cattle Selection: A Review. Animals (Basel) 2021;11:599. [PMID: 33668747 PMCID: PMC7996307 DOI: 10.3390/ani11030599] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 12/16/2022] Open

Abstract

Genomics comprises a set of current and valuable technologies implemented as selection tools in dairy cattle commercial breeding programs. The intensive progeny testing for production and reproductive traits based on genomic breeding values (GEBVs) has been crucial to increasing dairy cattle productivity. The knowledge of key genes and haplotypes, including their regulation mechanisms, as markers for productivity traits, may improve the strategies on the present and future for dairy cattle selection. Genome-wide association studies (GWAS) such as quantitative trait loci (QTL), single nucleotide polymorphisms (SNPs), or single-step genomic best linear unbiased prediction (ssGBLUP) methods have already been included in global dairy programs for the estimation of marker-assisted selection-derived effects. The increase in genetic progress based on genomic predicting accuracy has also contributed to the understanding of genetic effects in dairy cattle offspring. However, the crossing within inbred-lines critically increased homozygosis with accumulated negative effects of inbreeding like a decline in reproductive performance. Thus, inaccurate-biased estimations based on empirical-conventional models of dairy production systems face an increased risk of providing suboptimal results derived from errors in the selection of candidates of high genetic merit-based just on low-heritability phenotypic traits. This extends the generation intervals and increases costs due to the significant reduction of genetic gains. The remarkable progress of genomic prediction increases the accurate selection of superior candidates. The scope of the present review is to summarize and discuss the advances and challenges of genomic tools for dairy cattle selection for optimizing breeding programs and controlling negative inbreeding depression effects on productivity and consequently, achieving economic-effective advances in food production efficiency. Particular attention is given to the potential genomic selection-derived results to facilitate precision management on modern dairy farms, including an overview of novel genome editing methodologies as perspectives toward the future.

Collapse

Masuda Y, Tsuruta S, Bermann M, Bradford HL, Misztal I. Comparison of models for missing pedigree in single-step genomic prediction. J Anim Sci 2021;99:6119644. [PMID: 33493284 DOI: 10.1093/jas/skab019] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 01/20/2021] [Indexed: 11/14/2022] Open

Abstract

Pedigree information is often missing for some animals in a breeding program. Unknown-parent groups (UPGs) are assigned to the missing parents to avoid biased genetic evaluations. Although the use of UPGs is well established for the pedigree model, it is unclear how UPGs are integrated into the inverse of the unified relationship matrix (H-inverse) required for single-step genomic best linear unbiased prediction. A generalization of the UPG model is the metafounder (MF) model. The objectives of this study were to derive 3 H-inverses and to compare genetic trends among models with UPG and MF H-inverses using a simulated purebred population. All inverses were derived using the joint density function of the random breeding values and genetic groups. The breeding values of genotyped animals (u2) were assumed to be adjusted for UPG effects (g) using matrix Q2 as u2∗=u2+Q2g before incorporating genomic information. The Quaas-Pollak-transformed (QP) H-inverse was derived using a joint density function of u2∗ and g updated with genomic information and assuming nonzero cov(u2∗,g'). The modified QP (altered) H-inverse also assumes that the genomic information updates u2∗ and g, but cov(u2∗,g')=0. The UPG-encapsulated (EUPG) H-inverse assumed genomic information updates the distribution of u2∗. The EUPG H-inverse had the same structure as the MF H-inverse. Fifty percent of the genotyped females in the simulation had a missing dam, and missing parents were replaced with UPGs by generation. The simulation study indicated that u2∗ and g in models using the QP and altered H-inverses may be inseparable leading to potential biases in genetic trends. Models using the EUPG and MF H-inverses showed no genetic trend biases. These 2 H-inverses yielded the same genomic EBV (GEBV). The predictive ability and inflation of GEBVs from young genotyped animals were nearly identical among models using the QP, altered, EUPG, and MF H-inverses. Although the choice of H-inverse in real applications with enough data may not result in biased genetic trends, the EUPG and MF H-inverses are to be preferred because of theoretical justification and possibility to reduce biases.

Collapse

Meyer K. Impact of missing pedigrees in single-step genomic evaluation. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abstract Context A common problem in mixed model-based genetic evaluation schemes for livestock is that cohorts of animals differ systematically in mean genetic merit, for example, due to missing pedigree. This can be modelled by fitting genetic groups. Single-step genomic evaluation (ssGBLUP) combining information from genotyped and non-genotyped individuals has become routine, but little is known of the effects of unknown parents in this context. Aims To investigate the effects of missing pedigrees on accuracy and bias of predicted breeding values for ssGBLUP analyses. Methods A simulation study was used to examine alternative ways to account for genetic groups in ssGBLUP, for multi-generation data with strong selection and rapidly increasing numbers of genotyped animals in the most recent generations. Key results Results demonstrated that missing pedigrees can markedly impair predicted breeding values. With selection, alignment of genomic and pedigree relationship matrices is essential when fitting unknown parent groups (UPG). Genomic relationships are complete; that is, they ‘automatically’ reference the genomic base, which typically differs from the genetic base for pedigreed animals. This can lead to biased comparisons between genotyped and non-genotyped animals with unknown parents when the two categories of animals are assigned to the same UPG. Allocating genotyped individuals to a separate UPG across all generations for each strain or breed was shown to be a simple and effective way to reduce misalignment bias. In contrast, fitting metafounders modified pedigree-based relationships to account for ancestral genomic relationships and inbreeding rather than the genomic relationship matrix. Thus, no bias due to different types of animals assigned to the same metafounders was apparent. Overall, fitting metafounders yielded slightly higher correlations between true and predicted breeding values than did UPG models, which assume genetic groups to be unrelated. Conclusions Missing pedigrees are more problematic with ssGBLUP than for analyses considering pedigree-based relationships only. UPG models with separation of genotyped and non-genotyped individuals and analyses fitting metafounders yielded comparable predictions of breeding values in terms of accuracy and bias. Implications A previously unidentified incompatibility between alignment of founder populations and assignment of genotyped and non-genotyped animals to the same UPG has been reported. Implementation of the proposed strategy to reduce ‘double counting’ is straightforward and can improve results of ssGBLUP analyses. Collapse

Brunes LC, Baldi F, Lopes FB, Narciso MG, Lobo RB, Espigolan R, Costa MFO, Magnabosco CU. Genomic prediction ability for feed efficiency traits using different models and pseudo-phenotypes under several validation strategies in Nelore cattle. Animal 2020;15:100085. [PMID: 33573965 DOI: 10.1016/j.animal.2020.100085] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 09/09/2020] [Accepted: 09/15/2020] [Indexed: 10/22/2022] Open

Abstract

There is a growing interest to improve feed efficiency (FE) traits in cattle. The genomic selection was proposed to improve these traits since they are difficult and expensive to measure. Up to date, there are scarce studies about the implementation of genomic selection for FE traits in indicine cattle under different scenarios of pseudo-phenotypes, models, and validation strategies on a commercial large scale. Thus, the aim was to evaluate the feasibility of genomic selection implementation for FE traits in Nelore cattle applying different models and pseudo-phenotypes under validation strategies. Phenotypic and genotypic information from 4 329 and 3 467 animals were used, respectively, which were tested for residual feed intake, DM intake, feed efficiency, feed conversion ratio, residual BW gain, and residual intake and BW gain. Six prediction methods were used: single-step genomic best linear unbiased prediction, Bayes A, Bayes B, Bayes Cπ, Bayesian least absolute shrinkage and selection operator (BLASSO), and Bayes R. Phenotypes adjusted for fixed effects (Y*), estimated breeding value (EBV), and EBV deregressed (DEBV) were used as pseudo-phenotypes. The validation approaches used were: (1) random: the data was randomly divided into ten subsets and the validation was done in each subset at a time; (2) age: the partition into training and testing sets was based on year of birth and testing animals were born after 2016; and (3) EBV accuracy: the data was split into two groups, being animals with accuracy above 0.45 the training set; and below 0.45 the validation set. In the analyses that used the Y* as pseudo-phenotype, prediction ability (PA) was obtained by dividing the correlation between pseudo-phenotype and genomic EBV (GEBV) by the square root of the heritability of the trait. When EBV and DEBV were used as the pseudo-phenotype, the simple correlation of this quantity with the GEBV was considered as PA. The prediction methods show similar results for PA and bias. The random cross-validation presented higher PA (0.17) than EBV accuracy (0.14) and age (0.13). The PA was higher for Y* than for EBV and DEBV (30.0 and 34.3%, respectively). Random validation presented the highest PA, being indicated for use in populations composed mainly of young animals and traits with few generations of data recording. For high heritability traits, the validation can be done by age, enabling the prediction of the next-generation genetic merit. These results would support breeders to identify genomic approaches that are more viable for genomic prediction for FE-related traits.

Collapse

Misztal I, Tsuruta S, Pocrnic I, Lourenco D. Core-dependent changes in genomic predictions using the Algorithm for Proven and Young in single-step genomic best linear unbiased prediction. J Anim Sci 2020;98:skaa374. [PMID: 33211798 PMCID: PMC7739885 DOI: 10.1093/jas/skaa374] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 11/11/2020] [Indexed: 11/28/2022] Open

Abstract

Single-step genomic best linear unbiased prediction with the Algorithm for Proven and Young (APY) is a popular method for large-scale genomic evaluations. With the APY algorithm, animals are designated as core or noncore, and the computing resources to create the inverse of the genomic relationship matrix (GRM) are reduced by inverting only a portion of that matrix for core animals. However, using different core sets of the same size causes fluctuations in genomic estimated breeding values (GEBVs) up to one additive standard deviation without affecting prediction accuracy. About 2% of the variation in the GRM is noise. In the recursion formula for APY, the error term modeling the noise is different for every set of core animals, creating changes in breeding values. While average changes are small, and correlations between breeding values estimated with different core animals are close to 1.0, based on the normal distribution theory, outliers can be several times bigger than the average. Tests included commercial datasets from beef and dairy cattle and from pigs. Beyond a certain number of core animals, the prediction accuracy did not improve, but fluctuations decreased with more animals. Fluctuations were much smaller than the possible changes based on prediction error variance. GEBVs change over time even for animals with no new data as genomic relationships ties all the genotyped animals, causing reranking of top animals. In contrast, changes in nongenomic models without new data are small. Also, GEBV can change due to details in the model, such as redefinition of contemporary groups or unknown parent groups. In particular, increasing the fraction of blending of the GRM with a pedigree relationship matrix from 5% to 20% caused changes in GEBV up to 0.45 SD, with a correlation of GEBV > 0.99. Fluctuations in genomic predictions are part of genomic evaluation models and are also present without the APY algorithm when genomic evaluations are computed with updated data. The best approach to reduce the impact of fluctuations in genomic evaluations is to make selection decisions not on individual animals with limited individual accuracy but on groups of animals with high average accuracy.

Collapse

Kudinov A, Mäntysaari E, Aamand G, Uimari P, Strandén I. Metafounder approach for single-step genomic evaluations of Red Dairy cattle. J Dairy Sci 2020;103:6299-6310. [DOI: 10.3168/jds.2019-17483] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Accepted: 03/09/2020] [Indexed: 01/01/2023]

Konstantinov KV, Goddard ME. Application of multivariate single-step SNP best linear unbiased predictor model and revised SNP list for genomic evaluation of dairy cattle in Australia. J Dairy Sci 2020;103:8305-8316. [PMID: 32622609 DOI: 10.3168/jds.2020-18242] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 04/21/2020] [Indexed: 11/19/2022]

Abstract

The objectives of this study were (1) to evaluate the computational feasibility of the multitrait test-day single-step SNP-BLUP (ssSNP-BLUP) model using phenotypic records of genotyped and nongenotyped animals, and (2) to compare accuracies (coefficient of determination; R²) and bias of genomic estimated breeding values (GEBV) and de-regressed proofs as response variables in 3 Australian dairy cattle breeds (i.e., Holstein, Jersey, and Red breeds). Additive genomic random regression coefficients for milk, fat, protein yield and somatic cell score were predicted in the first, second, and third lactation. The predicted coefficients were used to derive 305-d GEBV and were compared with the traditional parent averages obtained from a BLUP model without genomic information. Cow fertility traits were evaluated from the 5-trait repeatability model (i.e., calving interval, days from calving to first service, pregnancy diagnosis, first service nonreturn rate, and lactation length). The de-regressed proofs were only for calving interval. Our results showed that ssSNP-BLUP using multitrait test-day model increased reliability and reduced bias of breeding values of young animals when compared with parent average from traditional BLUP in Australian Holsten, Jersey, and Red breeds. The use of a custom selection of approximately 46,000 SNP (custom XT SNP list) increased the reliability of GEBV compared with the results obtained using the commercial Illumina 50K chip (Illumina, San Diego, CA). The use of the second preconditioner substantially improved the convergence rate of the preconditioned conjugate gradient method, but further work is needed to improve the efficiency of the computation of the Kronecker matrix product by vector. Application of ssSNP-BLUP to multitrait random regression models is computationally feasible.

Collapse

Mäntysaari E, Koivula M, Strandén I. Symposium review: Single-step genomic evaluations in dairy cattle. J Dairy Sci 2020;103:5314-5326. [DOI: 10.3168/jds.2019-17754] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 01/21/2020] [Indexed: 11/19/2022]

Vandenplas J, Eding H, Bosmans M, Calus MPL. Computational strategies for the preconditioned conjugate gradient method applied to ssSNPBLUP, with an application to a multivariate maternal model. Genet Sel Evol 2020;52:24. [PMID: 32404053 PMCID: PMC7222437 DOI: 10.1186/s12711-020-00543-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Accepted: 04/27/2020] [Indexed: 01/12/2023] Open

Ben Zaabza H, Mäntysaari EA, Strandén I. Using Monte Carlo method to include polygenic effects in calculation of SNP-BLUP model reliability. J Dairy Sci 2020;103:5170-5182. [PMID: 32253036 DOI: 10.3168/jds.2019-17255] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 02/04/2020] [Indexed: 11/19/2022]

Misztal I, Lourenco D, Legarra A. Current status of genomic evaluation. J Anim Sci 2020;98:skaa101. [PMID: 32267923 PMCID: PMC7183352 DOI: 10.1093/jas/skaa101] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/07/2020] [Indexed: 12/14/2022] Open

Abstract

Early application of genomic selection relied on SNP estimation with phenotypes or de-regressed proofs (DRP). Chips of 50k SNP seemed sufficient for an accurate estimation of SNP effects. Genomic estimated breeding values (GEBV) were composed of an index with parent average, direct genomic value, and deduction of a parental index to eliminate double counting. Use of SNP selection or weighting increased accuracy with small data sets but had minimal to no impact with large data sets. Efforts to include potentially causative SNP derived from sequence data or high-density chips showed limited or no gain in accuracy. After the implementation of genomic selection, EBV by BLUP became biased because of genomic preselection and DRP computed based on EBV required adjustments, and the creation of DRP for females is hard and subject to double counting. Genomic selection was greatly simplified by single-step genomic BLUP (ssGBLUP). This method based on combining genomic and pedigree relationships automatically creates an index with all sources of information, can use any combination of male and female genotypes, and accounts for preselection. To avoid biases, especially under strong selection, ssGBLUP requires that pedigree and genomic relationships are compatible. Because the inversion of the genomic relationship matrix (G) becomes costly with more than 100k genotyped animals, large data computations in ssGBLUP were solved by exploiting limited dimensionality of genomic data due to limited effective population size. With such dimensionality ranging from 4k in chickens to about 15k in cattle, the inverse of G can be created directly (e.g., by the algorithm for proven and young) at a linear cost. Due to its simplicity and accuracy, ssGBLUP is routinely used for genomic selection by the major chicken, pig, and beef industries. Single step can be used to derive SNP effects for indirect prediction and for genome-wide association studies, including computations of the P-values. Alternative single-step formulations exist that use SNP effects for genotyped or for all animals. Although genomics is the new standard in breeding and genetics, there are still some problems that need to be solved. This involves new validation procedures that are unaffected by selection, parameter estimation that accounts for all the genomic data used in selection, and strategies to address reduction in genetic variances after genomic selection was implemented.

Collapse

Aguilar I, Fernandez EN, Blasco A, Ravagnolo O, Legarra A. Effects of ignoring inbreeding in model-based accuracy for BLUP and SSGBLUP. J Anim Breed Genet 2020;137:356-364. [PMID: 32080913 DOI: 10.1111/jbg.12470] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 12/10/2019] [Accepted: 01/11/2020] [Indexed: 11/29/2022]

Boerner V, Johnston DJ. More animals than markers: a study into the application of the single step T-BLUP model in large-scale multi-trait Australian Angus beef cattle genetic evaluation. Genet Sel Evol 2019;51:57. [PMID: 31619157 PMCID: PMC6796474 DOI: 10.1186/s12711-019-0499-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 09/24/2019] [Indexed: 11/25/2022] Open

Abstract

Multi-trait single step genetic evaluation is increasingly facing the situation of having more individuals with genotypes than markers within each genotype. This creates a situation where the genomic relationship matrix (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{G }$$\end{document}G) is not of full rank and its inversion is algebraically impossible. Recently, the SS-T-BLUP method was proposed as a modified version of the single step equations, providing an elegant way to circumvent the inversion of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{G }$$\end{document}G and therefore accommodate the situation described. SS-T-BLUP uses the Woodbury matrix identity, thus it requires an add-on matrix, which is usually the covariance matrix of the residual polygenic effet. In this paper, we examine the application of SS-T-BLUP to a large-scale multi-trait Australian Angus beef cattle dataset using the full BREEDPLAN single step genetic evaluation model and compare the results to the application of two different methods of using \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{G }$$\end{document}G in a single step model. Results clearly show that SS-T-BLUP outperforms other single step formulations in terms of computational speed and avoids approximation of the inverse of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mathbf{G }$$\end{document}G.

Collapse

van Grevenhof EM, Vandenplas J, Calus MPL. Genomic prediction for crossbred performance using metafounders. J Anim Sci 2019;97:548-558. [PMID: 30423111 DOI: 10.1093/jas/sky433] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 11/09/2018] [Indexed: 01/01/2023] Open

Aldridge MN, Vandenplas J, Calus MPL. Efficient and accurate computation of base generation allele frequencies. J Dairy Sci 2018;102:1364-1373. [PMID: 30471906 DOI: 10.3168/jds.2018-15264] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 10/01/2018] [Indexed: 11/19/2022]

Abstract

Allele frequencies are used for several aspects of genomic prediction, with the assumption that these are equal to the allele frequency in the base generation of the pedigree. The current standard method, however, calculates allele frequencies from the current genotyped population. We compared the current standard method with BLUP and general least squares (GLS) methods explicitly targeting the base population to determine whether there is a more accurate and still efficient method of calculating allele frequencies that better represents the base generation. A data set based on a typical dairy population was simulated for 325,266 animals; the last 100,078 animals in generations 9 to 12 of the population were genotyped, with 1,670 SNP markers. For the BLUP method, several SNP genotypes were analyzed with a multitrait model by assuming a heritability of 0.99 and no genetic correlation among them. This method was limited by the time required for each BLUP to converge (approximately 6 min per BLUP run of 15 SNP). The GLS method had 2 implementations. The first implementation, using imputation on the fly and multiplication of sparse matrices, was very efficient and required just 49 s and 1.3 GB of random access memory. The second implementation, using a dense full A₂₂^-1 matrix, was very inefficient and required more than 1 d of wall clock time and more than 118.2 GB of random access memory. When no selection was considered in the simulations, all methods predicted equally well. When selection was introduced, higher correlations between the estimated allele frequency and known base generation allele frequency were observed for BLUP (0.96 ± 0.01) and GLS (0.97 ± 0.01) compared with the current standard method (0.87 ± 0.01). The GLS method decreased in accuracy when introducing incomplete pedigree, with 25% of sires in the first 5 generations randomly replaced as unknown to erroneously identify founder animals (0.93 ± 0.01) and a further decrease for 8 generations (0.91 ± 0.01). There was no change in accuracy when introducing 5% genotyping errors (0.97 ± 0.01), 5% missing genotypes (0.97 ± 0.01), or both 5% genotyping errors and missing genotypes (0.97 ± 0.01). The GLS method provided the most accurate estimates of base generation allele frequency and was only slightly slower compared with the current method. The efficient implementation of the GLS method, therefore, is very well suited for practical application and is recommended for implementation.

Collapse

Gao H, Koivula M, Jensen J, Strandén I, Madsen P, Pitkänen T, Aamand G, Mäntysaari E. Short communication: Genomic prediction using different single-step methods in the Finnish red dairy cattle population. J Dairy Sci 2018;101:10082-10088. [DOI: 10.3168/jds.2018-14913] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 07/09/2018] [Indexed: 12/21/2022]

Matilainen K, Strandén I, Aamand GP, Mäntysaari EA. Single step genomic evaluation for female fertility in Nordic Red dairy cattle. J Anim Breed Genet 2018;135:337-348. [DOI: 10.1111/jbg.12353] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 06/18/2018] [Accepted: 07/03/2018] [Indexed: 11/30/2022]

Meyer K, Tier B, Swan A. Estimates of genetic trend for single-step genomic evaluations. Genet Sel Evol 2018;50:39. [PMID: 30075705 PMCID: PMC6091173 DOI: 10.1186/s12711-018-0410-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 07/10/2018] [Indexed: 01/03/2023] Open

Vandenplas J, Calus MPL, Ten Napel J. Sparse single-step genomic BLUP in crossbreeding schemes. J Anim Sci 2018;96:2060-2073. [PMID: 29873759 DOI: 10.1093/jas/sky136] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 04/16/2018] [Indexed: 12/20/2022] Open

Masuda Y, VanRaden P, Misztal I, Lawlor T. Differing genetic trend estimates from traditional and genomic evaluations of genotyped animals as evidence of preselection bias in US Holsteins. J Dairy Sci 2018;101:5194-5206. [DOI: 10.3168/jds.2017-13310] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Accepted: 12/10/2017] [Indexed: 11/19/2022]

Koivula M, Strandén I, Aamand G, Mäntysaari E. Reducing bias in the dairy cattle single-step genomic evaluation by ignoring bulls without progeny. J Anim Breed Genet 2018;135:107-115. [DOI: 10.1111/jbg.12318] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 01/22/2018] [Indexed: 12/26/2022]

Colleau JJ, Palhière I, Rodríguez-Ramilo ST, Legarra A. A fast indirect method to compute functions of genomic relationships concerning genotyped and ungenotyped individuals, for diversity management. Genet Sel Evol 2017;49:87. [PMID: 29191178 PMCID: PMC5709854 DOI: 10.1186/s12711-017-0363-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Accepted: 11/24/2017] [Indexed: 12/01/2022] Open

Abstract

Background

Pedigree-based management of genetic diversity in populations, e.g., using optimal contributions, involves computation of the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Ax}}$$\end{document}Ax type yielding elements (relationships) or functions (usually averages) of relationship matrices. For pedigree-based relationships \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{A}}$$\end{document}A, a very efficient method exists. When all the individuals of interest are genotyped, genomic management can be addressed using the genomic relationship matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{G}}$$\end{document}G; however, to date, the computational problem of efficiently computing \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Gx}}$$\end{document}Gx has not been well studied. When some individuals of interest are not genotyped, genomic management should consider the relationship matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{H}}$$\end{document}H that combines genotyped and ungenotyped individuals; however, direct computation of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Hx}}$$\end{document}Hx is computationally very demanding, because construction of a possibly huge matrix is required. Our work presents efficient ways of computing \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Gx}}$$\end{document}Gx and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Hx}}$$\end{document}Hx, with applications on real data from dairy sheep and dairy goat breeding schemes.

Results

For genomic relationships, an efficient indirect computation with quadratic instead of cubic cost is \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{x}} = {\mathbf{Z}}\left( {{\mathbf{Z^{\prime}x}}} \right)/k$$\end{document}x=ZZ′x/k, where Z is a matrix relating animals to genotypes. For the relationship matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{H}}$$\end{document}H, we propose an indirect method based on the difference between vectors \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Hx}} - {\mathbf{Ax}}$$\end{document}Hx-Ax, which involves computation of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Ax}}$$\end{document}Ax and of products such as \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{Gw}}$$\end{document}Gw and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{A}}_{22}^{ - 1} {\mathbf{w}}$$\end{document}A22-1w, where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{w}}$$\end{document}w is a working vector derived from \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{x}}$$\end{document}x. The latter computation is the most demanding but can be done using sparse Cholesky decompositions of matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{A}}^{ - 1}$$\end{document}A-1, which allows handling very large genomic and pedigree data files. Studies based on simulations reported in the literature show that the trends of average relationships in \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{H}}$$\end{document}H and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{A}}$$\end{document}A differ as genomic selection proceeds. When selection is based on genomic relationships but management is based on pedigree data, the true genetic diversity is overestimated. However, our tests on real data from sheep and goat obtained before genomic selection started do not show this.

Conclusions

We present efficient methods to compute elements and statistics of the genomic relationships \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{G}}$$\end{document}G and of matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathbf{H}}$$\end{document}H that combines ungenotyped and genotyped individuals. These methods should be useful to monitor and handle genomic diversity.

Collapse

Mäntysaari EA, Evans RD, Strandén I. Efficient single-step genomic evaluation for a multibreed beef cattle population having many genotyped animals. J Anim Sci 2017;95:4728-4737. [PMID: 29293736 PMCID: PMC6292282 DOI: 10.2527/jas2017.1912] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Accepted: 09/10/2017] [Indexed: 01/04/2023] Open

Abstract

An equivalent computational approach called ssGTBLUP was formulated for the original single-step GBLUP (ssGBLUP). In ssGTBLUP, the genomic relationship matrix has the form = ' + , where the (centered and scaled) marker matrix has size x (numbers of genotypes and markers), and the matrix can be easily inverted. The inverse can be written as = - ' where is an by matrix. When the preconditioned conjugate gradient (PCG) method is used to solve the mixed model equations, a matrix vector product needs to be computed. In ssGBLUP, this requires multiplications, but in ssGTBLUP, the product ' has 2 multiplications and has multiplications with the constant independent of or . In an approximate approach called ssGTBLUP(p), the eigendecomposition of ' is used to reduce the number of rows in the matrix. Here, p is the percentage of total variance explained by the accepted eigenvalues. The objective of this study was to compare the performance of ssGBLUP, ssGTBLUP, ssGTBLUP(p), and the APY (algorithm for proven and young) method. In APY, the core had 50,000 (APY50K), 30,000 (APY30K), or 10,000 (APY10K) animals. The approaches were tested on the Irish beef carcass conformation genetic evaluation which has a heterogeneous multibreed population. The pedigree had 13.3 million animals. There were = 54,620 markers available from = 163,277 genotyped animals. For genotyped animals, the correlations of breeding values between ssGBLUP and ssGTBLUP(p) for the 11 traits in the model ranged from 0.999-1.000 for p = 99, 0.998-1.000 for p = 98, and 0.992-0.998 for p = 95 but were 0.994-1.000 for APY50K, 0.969-0.997 for APY30K, and 0.899-0.967 for APY10K. Computing times per iteration were 4.43, 3.30, 2.69, 2.29, 1.55, 1.76, 1.27, and 0.55 min for ssGBLUP, ssGTBLUP, ssGTBLUP(99), ssGTBLUP(98), ssGTBLUP(95), APY50K, APY30K, and APY10K, respectively. The ssGTBLUP(p) approach allowed a well-defined approximation to ssGBLUP and fast computations.

Collapse