1
|
Nilforooshan MA. Short Communication: Reduced GBLUP equations to core animals in the algorithm for proven and young (APY). Vet Anim Sci 2024; 23:100334. [PMID: 38283332 PMCID: PMC10820638 DOI: 10.1016/j.vas.2024.100334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/02/2024] [Accepted: 01/02/2024] [Indexed: 01/30/2024] Open
Abstract
The number of animal genotypes is rapidly increasing, and a major challenge for animal models is inverting the genomic relationship matrix (G). Matrix G has a limited dimensionality, and the algorithm for proven and young (APY) makes inverting a large G possible via the inverse of a block diagonal of G with a size equivalent to the dimensionality of G. APY divides genotyped animals into core and non-core groups, and breeding values of non-core animals are conditioned on the breeding values of core animals. Therefore, there is the possibility of opting out equations for non-core animals from the model. A methodology was presented for a reduced APY genomic BLUP (GBLUP) to equations for core animals. Using a small example dataset, the method was validated by the equality of the full and the reduced model analysis results. Absorption of fixed effect equations into random effect equations was successful in reducing the number of equations to solve and producing the same random effect solutions. Extending the method to APY single-step GBLUP (ssGBLUP) was not computationally justifiable. Other reduction techniques exist for ssGBLUP (regardless of APY or non-APY) that work by reducing the number of equations for non-genotyped animals. The number of equations can further be reduced by data pruning.
Collapse
|
2
|
Vandenplas J, Ten Napel J, Darbaghshahi SN, Evans R, Calus MPL, Veerkamp R, Cromie A, Mäntysaari EA, Strandén I. Efficient large-scale single-step evaluations and indirect genomic prediction of genotyped selection candidates. Genet Sel Evol 2023; 55:37. [PMID: 37291510 DOI: 10.1186/s12711-023-00808-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 04/28/2023] [Indexed: 06/10/2023] Open
Abstract
BACKGROUND Single-step genomic best linear unbiased prediction (ssGBLUP) models allow the combination of genomic, pedigree, and phenotypic data into a single model, which is computationally challenging for large genotyped populations. In practice, genotypes of animals without their own phenotype and progeny, so-called genotyped selection candidates, can become available after genomic breeding values have been estimated by ssGBLUP. In some breeding programmes, genomic estimated breeding values (GEBV) for these animals should be known shortly after obtaining genotype information but recomputing GEBV using the full ssGBLUP takes too much time. In this study, first we compare two equivalent formulations of ssGBLUP models, i.e. one that is based on the Woodbury matrix identity applied to the inverse of the genomic relationship matrix, and one that is based on marker equations. Second, we present computationally-fast approaches to indirectly compute GEBV for genotyped selection candidates, without the need to do the full ssGBLUP evaluation. RESULTS The indirect approaches use information from the latest ssGBLUP evaluation and rely on the decomposition of GEBV into its components. The two equivalent ssGBLUP models and indirect approaches were tested on a six-trait calving difficulty model using Irish dairy and beef cattle data that include 2.6 million genotyped animals of which about 500,000 were considered as genotyped selection candidates. When using the same computational approaches, the solving phase of the two equivalent ssGBLUP models showed similar requirements for memory and time per iteration. The computational differences between them were due to the preprocessing phase of the genomic information. Regarding the indirect approaches, compared to GEBV obtained from single-step evaluations including all genotypes, indirect GEBV had correlations higher than 0.99 for all traits while showing little dispersion and level bias. CONCLUSIONS In conclusion, ssGBLUP predictions for the genotyped selection candidates were accurately approximated using the presented indirect approaches, which are more memory efficient and computationally fast, compared to solving a full ssGBLUP evaluation. Thus, indirect approaches can be used even on a weekly basis to estimate GEBV for newly genotyped animals, while the full single-step evaluation is done only a few times within a year.
Collapse
Affiliation(s)
- Jeremie Vandenplas
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands.
| | - Jan Ten Napel
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | | | - Ross Evans
- Irish Cattle Breeding Federation, Highfield House, Newcestown Road, Bandon, Cork, Ireland
| | - Mario P L Calus
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Roel Veerkamp
- Wageningen University and Research, P.O. Box 338, 6700 AH, Wageningen, The Netherlands
| | - Andrew Cromie
- Irish Cattle Breeding Federation, Highfield House, Newcestown Road, Bandon, Cork, Ireland
| | | | - Ismo Strandén
- Natural Resources Institute Finland (Luke), Jokioinen, Finland
| |
Collapse
|
3
|
Osawa T, Masuda Y, Saburi J, Hirumachi K. Application of single-step single nucleotide polymorphism best linear unbiased predictor model with unknown-parent groups for type traits in Japanese Holsteins. J Dairy Sci 2023:S0022-0302(23)00291-6. [PMID: 37268563 DOI: 10.3168/jds.2022-22541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 01/30/2023] [Indexed: 06/04/2023]
Abstract
The objectives of this study were to investigate the computational performance and the predictive ability and bias of a single-step SNP BLUP model (ssSNPBLUP) in genotyped young animals with unknown-parent groups (UPG) for type traits, using national genetic evaluation data from the Japanese Holstein population. The phenotype, genotype, and pedigree data were the same as those used in a national genetic evaluation of linear type traits classified between April 1984 and December 2020. In the current study, 2 data sets were prepared: the full data set containing all entries up to December 2020 and a truncated data set ending with December 2016. Genotyped animals were classified into 3 types: sires with classified daughters (S), cows with records (C), and young animals (Y). The computing performance and prediction accuracy of ssSNPBLUP were compared for the following 3 groups of genotyped animals: sires with classified daughters and young animals (SY); cows with records and young animals (CY); and sires with classified daughters, cows with records, and young animals (SCY). In addition, we tested 3 parameters of residual polygenic variance in ssSNPBLUP (0.1, 0.2, or 0.3). Daughter yield deviations (DYD) for the validation bulls and phenotypes adjusted for all fixed effects and random effects other than animal and residual (Yadj) for the validation cows were obtained using the full data set from the pedigree-based BLUP model. The regression coefficients of DYD for bulls (or Yadj for cows) on the genomic estimated breeding value (GEBV) using the truncated data set were used to measure the inflation of the predictions of young animals. The coefficient of determination of DYD on GEBV was used to measure the predictive ability of the predictions for the validation bulls. The reliability of the predictions for the validation cows was calculated as the square of the correlation between Yadj and GEBV divided by heritability. The predictive ability was highest in the SCY group and lowest in the CY group. However, minimal difference was found in predictive abilities with or without UPG models using different parameters of residual polygenic variance. The regression coefficients approached 1.0 as the parameter of residual polygenic variance increased, but regression coefficients were mostly similar regardless of the use of UPG across the groups of genotyped animals. The ssSNPBLUP model, including UPG, was demonstrated as feasible for implementation in the national evaluation of type traits in Japanese Holsteins.
Collapse
Affiliation(s)
- Takefumi Osawa
- National Livestock Breeding Center, Nishigo-mura, Fukushima, 961-8511, Japan.
| | - Yutaka Masuda
- Rakuno Gakuen University, Ebetsu, Hokkaido, 069-8501, Japan
| | - Junichi Saburi
- National Livestock Breeding Center, Nishigo-mura, Fukushima, 961-8511, Japan
| | - Keita Hirumachi
- National Livestock Breeding Center, Nishigo-mura, Fukushima, 961-8511, Japan
| |
Collapse
|
4
|
Bussiman F, Chen CY, Holl J, Bermann M, Legarra A, Misztal I, Lourenco D. Boundaries for genotype, phenotype, and pedigree truncation in genomic evaluations in pigs. J Anim Sci 2023; 101:skad273. [PMID: 37584978 PMCID: PMC10464514 DOI: 10.1093/jas/skad273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 08/10/2023] [Indexed: 08/17/2023] Open
Abstract
Historical data collection for genetic evaluation purposes is a common practice in animal populations; however, the larger the dataset, the higher the computing power needed to perform the analyses. Also, fitting the same model to historical and recent data may be inappropriate. Data truncation can reduce the number of equations to solve, consequently decreasing computing costs; however, the large volume of genotypes is responsible for most of the increase in computations. This study aimed to assess the impact of removing genotypes along with phenotypes and pedigree on the computing performance, reliability, and inflation of genomic predicted breeding value (GEBV) from single-step genomic best linear unbiased predictor for selection candidates. Data from two pig lines, a terminal sire (L1) and a maternal line (L2), were analyzed in this study. Four analyses were implemented: growth and "weaning to finish" mortality on L1, pre-weaning and reproductive traits on L2. Four genotype removal scenarios were proposed: removing genotyped animals without phenotypes and progeny (noInfo), removing genotyped animals based on birth year (Age), the combination of noInfo and Age scenarios (noInfo + Age), and no genotype removal (AllGen). In all scenarios, phenotypes were removed, based on birth year, and three pedigree depths were tested: two and three generations traced back and using the entire pedigree. The full dataset contained 1,452,257 phenotypes for growth traits, 324,397 for weaning to finish mortality, 517,446 for pre-weaning traits, and 7,853,629 for reproductive traits in pure and crossbred pigs. Pedigree files for lines L1 and L2 comprised 3,601,369 and 11,240,865 animals, of which 168,734 and 170,121 were genotyped, respectively. In each truncation scenario, the linear regression method was used to assess the reliability and dispersion of GEBV for genotyped parents (born after 2019). The number of years of data that could be removed without harming reliability depended on the number of records, type of analyses (multitrait vs. single trait), the heritability of the trait, and data structure. All scenarios had similar reliabilities, except for noInfo, which performed better in the growth analysis. Based on the data used in this study, considering the last ten years of phenotypes, tracing three generations back in the pedigree, and removing genotyped animals not contributing own or progeny phenotypes, increases computing efficiency with no change in the ability to predict breeding values.
Collapse
Affiliation(s)
- Fernando Bussiman
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | | | | | - Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
5
|
Bermann M, Cesarani A, Misztal I, Lourenco D. Past, present, and future developments in single-step genomic models. ITALIAN JOURNAL OF ANIMAL SCIENCE 2022. [DOI: 10.1080/1828051x.2022.2053366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Alberto Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Ignacy Misztal
- Dipartimento di Agraria, Università degli Studi di Sassari, Sassari, Italy
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| |
Collapse
|
6
|
Gutiérrez-Reinoso MA, Aponte PM, García-Herreros M. A review of inbreeding depression in dairy cattle: current status, emerging control strategies, and future prospects. J DAIRY RES 2022; 89:1-10. [PMID: 35225176 DOI: 10.1017/s0022029922000188] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Dairy cattle breeding has historically focused on relatively small numbers of elite bulls as sires of sons. In recent years, even if generation intervals were reduced and more diverse sires of sons could have been selected, genomic selection has not fundamentally changed the fact that a large number of individuals are being analyzed. However, a relatively small number of elite bulls are still siring those animals. Therefore inbreeding-derived negative consequences in the gene pool have brought concern. The detrimental effects of non-additive genetic changes such as inbreeding depression and dominance have been widely disseminated while seriously affecting bioeconomically important parameters because of an antagonistic relationship between dairy production and reproductive traits. Therefore, the estimation of benefits and limitations of inbreeding and variance of the selection response deserves to be evaluated and discussed to preserve genetic variability, a significant concern in the selection of individuals for reproduction and production. Short-term strategies for genetic merit improvement through modern breeding programs have severely lowered high-producing dairy cattle fertility potential. Since the current selection programs potentially increase long-term costs, genetic diversity has decreased globally as a consequence. Therefore, a greater understanding of the potential that selection programs have for supporting long-term genetic sustainability and genetic diversity among dairy cattle populations should be prioritized in managing farm profitability. The present review provides a broad approach to current inbreeding-derived problems, identifying critical points to be solved and possible alternative strategies to control selection against homozygous haplotypes while maintaining sustained selection pressure. Moreover, this manuscript explores future perspectives, emphasizing theoretical applications and critical points, and strategies to avoid the adverse effects of inbreeding in dairy cattle. Finally, this review provides an overview of challenges that will soon require multidisciplinary approaches to managing dairy cattle populations, intending to combine increases in productive trait phenotypes with improvements in reproductive, health, welfare, linear conformation, and adaptability traits into the foreseeable future.
Collapse
Affiliation(s)
- Miguel A Gutiérrez-Reinoso
- Universidad Técnica de Cotopaxi, Facultad de Ciencias Agropecuarias y Recursos Naturales, Carrera de Medicina Veterinaria (UTC), Latacunga, Ecuador
- Laboratorio de Biotecnología Animal, Departamento de Ciencia Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción, Chillán (UdeC), Chile
| | - Pedro M Aponte
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias Biológicas y Ambientales (COCIBA), Campus Cumbayá, Quito, Ecuador
- Instituto de Investigaciones en Biomedicina, iBioMed, Universidad San Francisco de Quito (USFQ), Campus Cumbayá, Quito, Ecuador
| | | |
Collapse
|
7
|
Masuda Y, VanRaden PM, Tsuruta S, Lourenco DAL, Misztal I. Invited review: Unknown-parent groups and metafounders in single-step genomic BLUP. J Dairy Sci 2021; 105:923-939. [PMID: 34799109 DOI: 10.3168/jds.2021-20293] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 09/26/2021] [Indexed: 11/19/2022]
Abstract
Single-step genomic BLUP (ssGBLUP) is a method for genomic prediction that integrates matrices of pedigree (A) and genomic (G) relationships into a single unified additive relationship matrix whose inverse is incorporated into a set of mixed model equations (MME) to compute genomic predictions. Pedigree information in dairy cattle is often incomplete. Missing pedigree potentially causes biases and inflation in genomic estimated breeding values (GEBV) obtained with ssGBLUP. Three major issues are associated with missing pedigree in ssGBLUP, namely biased predictions by selection, missing inbreeding in pedigree relationships, and incompatibility between G and A in level and scale. These issues can be solved using a proper model for unknown-parent groups (UPG). The theory behind the use of UPG is well established for pedigree BLUP, but not for ssGBLUP. This study reviews the development of the UPG model in pedigree BLUP, the properties of UPG models in ssGBLUP, and the effect of UPG on genetic trends and genomic predictions. Similarities and differences between UPG and metafounder (MF) models, a generalized UPG model, are also reviewed. A UPG model (QP) derived using a transformation of the MME has a good convergence behavior. However, with insufficient data, the QP model may yield biased genetic trends and may underestimate UPG. The QP model can be altered by removing the genomic relationships linking GEBV and UPG effects from MME. This altered QP model exhibits less bias in genetic trends and less inflation in genomic predictions than the QP model, especially with large data sets. Recently, a new model, which encapsulates the UPG equations into the pedigree relationships for genotyped animals, was proposed in simulated purebred populations. The MF model is a comprehensive solution to the missing pedigree issue. This model can be a choice for multibreed or crossbred evaluations if the data set allows the estimation of a reasonable relationship matrix for MF. Missing pedigree influences genetic trends, but its effect on the predictability of genetic merit for genotyped animals should be negligible when many proven bulls are genotyped. The SNP effects can be back-solved using GEBV from older genotyped animals, and these predicted SNP effects can be used to calculate GEBV for young-genotyped animals with missing parents.
Collapse
Affiliation(s)
- Yutaka Masuda
- Department of Animal and Dairy Science, University of Georgia, Athens 30602.
| | - Paul M VanRaden
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, US Department of Agriculture, Beltsville, MD 20705
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| |
Collapse
|
8
|
Tsuruta S, Lourenco D, Masuda Y, Lawlor T, Misztal I. Reducing computational cost of large-scale genomic evaluation by using indirect genomic prediction. JDS COMMUNICATIONS 2021; 2:356-360. [PMID: 36337117 PMCID: PMC9623783 DOI: 10.3168/jdsc.2021-0097] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 06/27/2021] [Indexed: 06/10/2023]
Abstract
Over half a million Holsteins are being genotyped annually in the United States. The computational cost of including all genotypes in single-step genomic (ssG)BLUP is high, although it is feasible to conduct large-scale genomic prediction using an efficient algorithm such as APY (algorithm for proven and young). An effective method to further reduce the computing cost could be the use of indirect genomic predictions (IGP) for genotyped animals when they have neither progeny nor phenotypes. These young genotyped animals have no effect on the other genotyped animals and could have their genomic prediction done indirectly. The main objective of this study was to calculate IGP for various groups of genotyped animals and investigate the reduction in computing time as well as bias and accuracy of the IGP. We compared IGP with genomic (G)EBV for 18 linear type traits in US Holsteins, including 2.3 million (M) genotyped animals. The full data set consisted of 10.9M records for 18 linear type traits up to 2018 calving, 13.6M animals in the pedigree, and 2.3M animals genotyped for 79K SNP. For IGP, ssGBLUP included all genotyped animals except those with neither progeny nor phenotypes by year from 2014 to 2018 (i.e., the target animals). The SNP marker effects were computed based on GEBV for genotyped animals that had progeny, or phenotypes, or both. Further, IGP were calculated for target genotyped animals in each year group. For all genotyped animal groups from 2014 to 2018, the coefficients of determination (R2) of a linear regression of GEBV on IGP were 0.960 for males and 0.954 for females for 18 traits on average. To reduce computing costs, the SNP marker effects were calculated based on GEBV from randomly selected genotyped animals from 15K to 60K. By randomly selecting a small number of genotyped animals, the computing time was dramatically reduced. As more genotyped animals were randomly selected to calculate SNP effects, R2 was higher (more accurate) and the regression coefficient was lower (more inflated IGP). In a practical genomic evaluation in US Holsteins, to get sufficient contributions from GEBV, 25K to 35K is a rational number of genotyped animals that can be randomly selected to compute SNP effects and obtain accurate and unbiased IGP. Considering the computing time and both unbiasedness and accuracy of IGP, genomic evaluation can be conducted separately in GEBV for genotyped animals with phenotypes or progeny and in IGP for young genotyped animals. This can be a practical solution when conducting a large-scale genomic evaluation and would enable more frequent evaluation at lower cost, especially when many genotyped animals have neither phenotypes nor progeny.
Collapse
Affiliation(s)
- S. Tsuruta
- Animal and Dairy Science Department, University of Georgia, Athens 30602
| | - D.A.L. Lourenco
- Animal and Dairy Science Department, University of Georgia, Athens 30602
| | - Y. Masuda
- Animal and Dairy Science Department, University of Georgia, Athens 30602
| | - T.J. Lawlor
- Holstein Association USA Inc., Brattleboro, VT 05301
| | - I. Misztal
- Animal and Dairy Science Department, University of Georgia, Athens 30602
| |
Collapse
|
9
|
Koivula M, Strandén I, Aamand GP, Mäntysaari EA. Practical implementation of genetic groups in single-step genomic evaluations with Woodbury matrix identity-based genomic relationship inverse. J Dairy Sci 2021; 104:10049-10058. [PMID: 34099294 DOI: 10.3168/jds.2020-19821] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 04/22/2021] [Indexed: 11/19/2022]
Abstract
The growing amount of genomic information in dairy cattle has increased computational and modeling challenges in the single-step evaluations. The computational challenges are due to the dense inverses of genomic (G) and pedigree (A22) relationship matrices of genotyped animals in the single-step mixed model equations. An equivalent mixed model equation is given by single-step genomic BLUP that are based on the T matrix (ssGTBLUP), where these inverses are avoided by expressing G-1 through a product of 2 rectangular matrices, and (A22)-1 through sparse matrix blocks of the inverse of full relationship matrix A-1. A proper way to account genetic groups through unknown parent groups (UPG) after the Quaas-Pollak transformation (QP) is one key factor in a single-step model. When the UPG effects are incompletely accounted, the iterative solving method may have convergence problems. In this study, we investigated computational and predictive performance of ssGTBLUP with residual polygenic (RPG) effect and UPG. The QP transformation used A-1 and, in the complete form, T and (A22)-1 matrices as well. The models were tested with official Nordic Holstein milk production test-day data and model. The results show that UPG can be easily implemented in ssGTBLUP having RPG. The complete QP transformation was computationally feasible when preconditioned conjugate gradient iteration and iteration on data without explicitly setting up G or A22 matrices were used. Furthermore, for good convergence of the preconditioned conjugate gradient method, a complete QP transformation was necessary.
Collapse
Affiliation(s)
- M Koivula
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland.
| | - I Strandén
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland
| | - G P Aamand
- Nordic Cattle Genetic Evaluation (NAV), 8200 Aarhus N, Denmark
| | - E A Mäntysaari
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland
| |
Collapse
|
10
|
Gutierrez-Reinoso MA, Aponte PM, Garcia-Herreros M. Genomic Analysis, Progress and Future Perspectives in Dairy Cattle Selection: A Review. Animals (Basel) 2021; 11:599. [PMID: 33668747 PMCID: PMC7996307 DOI: 10.3390/ani11030599] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 11/23/2020] [Accepted: 11/24/2020] [Indexed: 12/16/2022] Open
Abstract
Genomics comprises a set of current and valuable technologies implemented as selection tools in dairy cattle commercial breeding programs. The intensive progeny testing for production and reproductive traits based on genomic breeding values (GEBVs) has been crucial to increasing dairy cattle productivity. The knowledge of key genes and haplotypes, including their regulation mechanisms, as markers for productivity traits, may improve the strategies on the present and future for dairy cattle selection. Genome-wide association studies (GWAS) such as quantitative trait loci (QTL), single nucleotide polymorphisms (SNPs), or single-step genomic best linear unbiased prediction (ssGBLUP) methods have already been included in global dairy programs for the estimation of marker-assisted selection-derived effects. The increase in genetic progress based on genomic predicting accuracy has also contributed to the understanding of genetic effects in dairy cattle offspring. However, the crossing within inbred-lines critically increased homozygosis with accumulated negative effects of inbreeding like a decline in reproductive performance. Thus, inaccurate-biased estimations based on empirical-conventional models of dairy production systems face an increased risk of providing suboptimal results derived from errors in the selection of candidates of high genetic merit-based just on low-heritability phenotypic traits. This extends the generation intervals and increases costs due to the significant reduction of genetic gains. The remarkable progress of genomic prediction increases the accurate selection of superior candidates. The scope of the present review is to summarize and discuss the advances and challenges of genomic tools for dairy cattle selection for optimizing breeding programs and controlling negative inbreeding depression effects on productivity and consequently, achieving economic-effective advances in food production efficiency. Particular attention is given to the potential genomic selection-derived results to facilitate precision management on modern dairy farms, including an overview of novel genome editing methodologies as perspectives toward the future.
Collapse
Affiliation(s)
- Miguel A. Gutierrez-Reinoso
- Facultad de Ciencias Agropecuarias y Recursos Naturales, Carrera de Medicina Veterinaria, Universidad Técnica de Cotopaxi (UTC), Latacunga 05-0150, Ecuador
- Laboratorio de Biotecnología Animal, Departamento de Ciencia Animal, Facultad de Ciencias Veterinarias, Universidad de Concepción (UdeC), Chillán 3780000, Chile
| | - Pedro M. Aponte
- Colegio de Ciencias Biológicas y Ambientales (COCIBA), Universidad San Francisco de Quito (USFQ), Quito 170157, Ecuador
- Campus Cumbayá, Instituto de Investigaciones en Biomedicina “One-health”, Universidad San Francisco de Quito (USFQ), Quito 170157, Ecuador
| | - Manuel Garcia-Herreros
- Instituto Nacional de Investigação Agrária e Veterinária (INIAV), 2005-048 Santarém, Portugal
| |
Collapse
|
11
|
Masuda Y, Tsuruta S, Bermann M, Bradford HL, Misztal I. Comparison of models for missing pedigree in single-step genomic prediction. J Anim Sci 2021; 99:6119644. [PMID: 33493284 DOI: 10.1093/jas/skab019] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Accepted: 01/20/2021] [Indexed: 11/14/2022] Open
Abstract
Pedigree information is often missing for some animals in a breeding program. Unknown-parent groups (UPGs) are assigned to the missing parents to avoid biased genetic evaluations. Although the use of UPGs is well established for the pedigree model, it is unclear how UPGs are integrated into the inverse of the unified relationship matrix (H-inverse) required for single-step genomic best linear unbiased prediction. A generalization of the UPG model is the metafounder (MF) model. The objectives of this study were to derive 3 H-inverses and to compare genetic trends among models with UPG and MF H-inverses using a simulated purebred population. All inverses were derived using the joint density function of the random breeding values and genetic groups. The breeding values of genotyped animals (u2) were assumed to be adjusted for UPG effects (g) using matrix Q2 as u2∗=u2+Q2g before incorporating genomic information. The Quaas-Pollak-transformed (QP) H-inverse was derived using a joint density function of u2∗ and g updated with genomic information and assuming nonzero cov(u2∗,g'). The modified QP (altered) H-inverse also assumes that the genomic information updates u2∗ and g, but cov(u2∗,g')=0. The UPG-encapsulated (EUPG) H-inverse assumed genomic information updates the distribution of u2∗. The EUPG H-inverse had the same structure as the MF H-inverse. Fifty percent of the genotyped females in the simulation had a missing dam, and missing parents were replaced with UPGs by generation. The simulation study indicated that u2∗ and g in models using the QP and altered H-inverses may be inseparable leading to potential biases in genetic trends. Models using the EUPG and MF H-inverses showed no genetic trend biases. These 2 H-inverses yielded the same genomic EBV (GEBV). The predictive ability and inflation of GEBVs from young genotyped animals were nearly identical among models using the QP, altered, EUPG, and MF H-inverses. Although the choice of H-inverse in real applications with enough data may not result in biased genetic trends, the EUPG and MF H-inverses are to be preferred because of theoretical justification and possibility to reduce biases.
Collapse
Affiliation(s)
- Yutaka Masuda
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Matias Bermann
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Heather L Bradford
- Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| |
Collapse
|
12
|
Meyer K. Impact of missing pedigrees in single-step genomic evaluation. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Context
A common problem in mixed model-based genetic evaluation schemes for livestock is that cohorts of animals differ systematically in mean genetic merit, for example, due to missing pedigree. This can be modelled by fitting genetic groups. Single-step genomic evaluation (ssGBLUP) combining information from genotyped and non-genotyped individuals has become routine, but little is known of the effects of unknown parents in this context.
Aims
To investigate the effects of missing pedigrees on accuracy and bias of predicted breeding values for ssGBLUP analyses.
Methods
A simulation study was used to examine alternative ways to account for genetic groups in ssGBLUP, for multi-generation data with strong selection and rapidly increasing numbers of genotyped animals in the most recent generations.
Key results
Results demonstrated that missing pedigrees can markedly impair predicted breeding values. With selection, alignment of genomic and pedigree relationship matrices is essential when fitting unknown parent groups (UPG). Genomic relationships are complete; that is, they ‘automatically’ reference the genomic base, which typically differs from the genetic base for pedigreed animals. This can lead to biased comparisons between genotyped and non-genotyped animals with unknown parents when the two categories of animals are assigned to the same UPG. Allocating genotyped individuals to a separate UPG across all generations for each strain or breed was shown to be a simple and effective way to reduce misalignment bias. In contrast, fitting metafounders modified pedigree-based relationships to account for ancestral genomic relationships and inbreeding rather than the genomic relationship matrix. Thus, no bias due to different types of animals assigned to the same metafounders was apparent. Overall, fitting metafounders yielded slightly higher correlations between true and predicted breeding values than did UPG models, which assume genetic groups to be unrelated.
Conclusions
Missing pedigrees are more problematic with ssGBLUP than for analyses considering pedigree-based relationships only. UPG models with separation of genotyped and non-genotyped individuals and analyses fitting metafounders yielded comparable predictions of breeding values in terms of accuracy and bias.
Implications
A previously unidentified incompatibility between alignment of founder populations and assignment of genotyped and non-genotyped animals to the same UPG has been reported. Implementation of the proposed strategy to reduce ‘double counting’ is straightforward and can improve results of ssGBLUP analyses.
Collapse
|
13
|
Brunes LC, Baldi F, Lopes FB, Narciso MG, Lobo RB, Espigolan R, Costa MFO, Magnabosco CU. Genomic prediction ability for feed efficiency traits using different models and pseudo-phenotypes under several validation strategies in Nelore cattle. Animal 2020; 15:100085. [PMID: 33573965 DOI: 10.1016/j.animal.2020.100085] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 09/09/2020] [Accepted: 09/15/2020] [Indexed: 10/22/2022] Open
Abstract
There is a growing interest to improve feed efficiency (FE) traits in cattle. The genomic selection was proposed to improve these traits since they are difficult and expensive to measure. Up to date, there are scarce studies about the implementation of genomic selection for FE traits in indicine cattle under different scenarios of pseudo-phenotypes, models, and validation strategies on a commercial large scale. Thus, the aim was to evaluate the feasibility of genomic selection implementation for FE traits in Nelore cattle applying different models and pseudo-phenotypes under validation strategies. Phenotypic and genotypic information from 4 329 and 3 467 animals were used, respectively, which were tested for residual feed intake, DM intake, feed efficiency, feed conversion ratio, residual BW gain, and residual intake and BW gain. Six prediction methods were used: single-step genomic best linear unbiased prediction, Bayes A, Bayes B, Bayes Cπ, Bayesian least absolute shrinkage and selection operator (BLASSO), and Bayes R. Phenotypes adjusted for fixed effects (Y*), estimated breeding value (EBV), and EBV deregressed (DEBV) were used as pseudo-phenotypes. The validation approaches used were: (1) random: the data was randomly divided into ten subsets and the validation was done in each subset at a time; (2) age: the partition into training and testing sets was based on year of birth and testing animals were born after 2016; and (3) EBV accuracy: the data was split into two groups, being animals with accuracy above 0.45 the training set; and below 0.45 the validation set. In the analyses that used the Y* as pseudo-phenotype, prediction ability (PA) was obtained by dividing the correlation between pseudo-phenotype and genomic EBV (GEBV) by the square root of the heritability of the trait. When EBV and DEBV were used as the pseudo-phenotype, the simple correlation of this quantity with the GEBV was considered as PA. The prediction methods show similar results for PA and bias. The random cross-validation presented higher PA (0.17) than EBV accuracy (0.14) and age (0.13). The PA was higher for Y* than for EBV and DEBV (30.0 and 34.3%, respectively). Random validation presented the highest PA, being indicated for use in populations composed mainly of young animals and traits with few generations of data recording. For high heritability traits, the validation can be done by age, enabling the prediction of the next-generation genetic merit. These results would support breeders to identify genomic approaches that are more viable for genomic prediction for FE-related traits.
Collapse
Affiliation(s)
- L C Brunes
- Animal Science Department, Goiás Federal University, 74690-900 Goiânia, GO, Brazil; Embrapa Rice and Beans, GO-462, km 12, 75375-000 Santo Antônio de Goiás, GO, Brazil.
| | - F Baldi
- Animal Science Department, São Paulo State University - Júlio de Mesquita Filho (UNESP), Prof. Paulo Donato Castelane, 14884-900 Jaboticabal, SP, Brazil
| | - F B Lopes
- Cobb-Vantress, Inc., 72761 Siloam Springs, AR, USA
| | - M G Narciso
- Embrapa Rice and Beans, GO-462, km 12, 75375-000 Santo Antônio de Goiás, GO, Brazil
| | - R B Lobo
- National Association of Breeders and Researchers, 14020-230 Ribeirão Preto, Brazil
| | - R Espigolan
- Department of Veterinary Medicine, Faculty of Animal Science and Food Engineering, University of Sao Paulo, 13635-900 Pirassununga, SP, Brazil
| | - M F O Costa
- Embrapa Rice and Beans, GO-462, km 12, 75375-000 Santo Antônio de Goiás, GO, Brazil
| | - C U Magnabosco
- Embrapa Cerrados, BR-020, 18 Sobradinho, 70770-901 Brasilia, DF, Brazil
| |
Collapse
|
14
|
Misztal I, Tsuruta S, Pocrnic I, Lourenco D. Core-dependent changes in genomic predictions using the Algorithm for Proven and Young in single-step genomic best linear unbiased prediction. J Anim Sci 2020; 98:skaa374. [PMID: 33211798 PMCID: PMC7739885 DOI: 10.1093/jas/skaa374] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 11/11/2020] [Indexed: 11/28/2022] Open
Abstract
Single-step genomic best linear unbiased prediction with the Algorithm for Proven and Young (APY) is a popular method for large-scale genomic evaluations. With the APY algorithm, animals are designated as core or noncore, and the computing resources to create the inverse of the genomic relationship matrix (GRM) are reduced by inverting only a portion of that matrix for core animals. However, using different core sets of the same size causes fluctuations in genomic estimated breeding values (GEBVs) up to one additive standard deviation without affecting prediction accuracy. About 2% of the variation in the GRM is noise. In the recursion formula for APY, the error term modeling the noise is different for every set of core animals, creating changes in breeding values. While average changes are small, and correlations between breeding values estimated with different core animals are close to 1.0, based on the normal distribution theory, outliers can be several times bigger than the average. Tests included commercial datasets from beef and dairy cattle and from pigs. Beyond a certain number of core animals, the prediction accuracy did not improve, but fluctuations decreased with more animals. Fluctuations were much smaller than the possible changes based on prediction error variance. GEBVs change over time even for animals with no new data as genomic relationships ties all the genotyped animals, causing reranking of top animals. In contrast, changes in nongenomic models without new data are small. Also, GEBV can change due to details in the model, such as redefinition of contemporary groups or unknown parent groups. In particular, increasing the fraction of blending of the GRM with a pedigree relationship matrix from 5% to 20% caused changes in GEBV up to 0.45 SD, with a correlation of GEBV > 0.99. Fluctuations in genomic predictions are part of genomic evaluation models and are also present without the APY algorithm when genomic evaluations are computed with updated data. The best approach to reduce the impact of fluctuations in genomic evaluations is to make selection decisions not on individual animals with limited individual accuracy but on groups of animals with high average accuracy.
Collapse
Affiliation(s)
- Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Ivan Pocrnic
- The Roslin Institute, The University of Edinburgh, Edinburgh, UK
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| |
Collapse
|
15
|
Kudinov A, Mäntysaari E, Aamand G, Uimari P, Strandén I. Metafounder approach for single-step genomic evaluations of Red Dairy cattle. J Dairy Sci 2020; 103:6299-6310. [DOI: 10.3168/jds.2019-17483] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Accepted: 03/09/2020] [Indexed: 01/01/2023]
|
16
|
Konstantinov KV, Goddard ME. Application of multivariate single-step SNP best linear unbiased predictor model and revised SNP list for genomic evaluation of dairy cattle in Australia. J Dairy Sci 2020; 103:8305-8316. [PMID: 32622609 DOI: 10.3168/jds.2020-18242] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 04/21/2020] [Indexed: 11/19/2022]
Abstract
The objectives of this study were (1) to evaluate the computational feasibility of the multitrait test-day single-step SNP-BLUP (ssSNP-BLUP) model using phenotypic records of genotyped and nongenotyped animals, and (2) to compare accuracies (coefficient of determination; R2) and bias of genomic estimated breeding values (GEBV) and de-regressed proofs as response variables in 3 Australian dairy cattle breeds (i.e., Holstein, Jersey, and Red breeds). Additive genomic random regression coefficients for milk, fat, protein yield and somatic cell score were predicted in the first, second, and third lactation. The predicted coefficients were used to derive 305-d GEBV and were compared with the traditional parent averages obtained from a BLUP model without genomic information. Cow fertility traits were evaluated from the 5-trait repeatability model (i.e., calving interval, days from calving to first service, pregnancy diagnosis, first service nonreturn rate, and lactation length). The de-regressed proofs were only for calving interval. Our results showed that ssSNP-BLUP using multitrait test-day model increased reliability and reduced bias of breeding values of young animals when compared with parent average from traditional BLUP in Australian Holsten, Jersey, and Red breeds. The use of a custom selection of approximately 46,000 SNP (custom XT SNP list) increased the reliability of GEBV compared with the results obtained using the commercial Illumina 50K chip (Illumina, San Diego, CA). The use of the second preconditioner substantially improved the convergence rate of the preconditioned conjugate gradient method, but further work is needed to improve the efficiency of the computation of the Kronecker matrix product by vector. Application of ssSNP-BLUP to multitrait random regression models is computationally feasible.
Collapse
Affiliation(s)
- K V Konstantinov
- DataGene Limited, Agriculture Victoria, AgriBio Centre for AgriBusiness, 5 Ring Rd., Bundoora, Victoria 3083, Australia.
| | - M E Goddard
- Melbourne School of Land and Environment, University of Melbourne, Parkville, Victoria 3010, Australia
| |
Collapse
|
17
|
Mäntysaari E, Koivula M, Strandén I. Symposium review: Single-step genomic evaluations in dairy cattle. J Dairy Sci 2020; 103:5314-5326. [DOI: 10.3168/jds.2019-17754] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 01/21/2020] [Indexed: 11/19/2022]
|
18
|
Vandenplas J, Eding H, Bosmans M, Calus MPL. Computational strategies for the preconditioned conjugate gradient method applied to ssSNPBLUP, with an application to a multivariate maternal model. Genet Sel Evol 2020; 52:24. [PMID: 32404053 PMCID: PMC7222437 DOI: 10.1186/s12711-020-00543-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Accepted: 04/27/2020] [Indexed: 01/12/2023] Open
Abstract
Background The single-step single nucleotide polymorphism best linear unbiased prediction (ssSNPBLUP) is one of the single-step evaluations that enable a simultaneous analysis of phenotypic and pedigree information of genotyped and non-genotyped animals with a large number of genotypes. The aim of this study was to develop and illustrate several computational strategies to efficiently solve different ssSNPBLUP systems with a large number of genotypes on current computers. Results The different developed strategies were based on simplified computations of some terms of the preconditioner, and on splitting the coefficient matrix of the different ssSNPBLUP systems into multiple parts to perform its multiplication by a vector more efficiently. Some matrices were computed explicitly and stored in memory (e.g. the inverse of the pedigree relationship matrix), or were stored using a compressed form (e.g. the Plink 1 binary form for the genotype matrix), to permit the use of efficient parallel procedures while limiting the required amount of memory. The developed strategies were tested on a bivariate genetic evaluation for livability of calves for the Netherlands and the Flemish region in Belgium. There were 29,885,286 animals in the pedigree, 25,184,654 calf records, and 131,189 genotyped animals. The ssSNPBLUP system required around 18 GB Random Access Memory and 12 h to be solved with the most performing implementation. Conclusions Based on our proposed approaches and results, we showed that ssSNPBLUP provides a feasible approach in terms of memory and time requirements to estimate genomic breeding values using current computers.
Collapse
Affiliation(s)
- Jeremie Vandenplas
- Animal Breeding and Genomics, Wageningen UR, P.O. 338, 6700 AH, Wageningen, The Netherlands.
| | - Herwin Eding
- CRV BV, Wassenaarweg, 20, 6843 NW, Arnhem, The Netherlands
| | - Maarten Bosmans
- VORtech Scientific Software Engineers, Westlandseweg 40d, 2624 AD, Delft, The Netherlands
| | - Mario P L Calus
- Animal Breeding and Genomics, Wageningen UR, P.O. 338, 6700 AH, Wageningen, The Netherlands
| |
Collapse
|
19
|
Ben Zaabza H, Mäntysaari EA, Strandén I. Using Monte Carlo method to include polygenic effects in calculation of SNP-BLUP model reliability. J Dairy Sci 2020; 103:5170-5182. [PMID: 32253036 DOI: 10.3168/jds.2019-17255] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 02/04/2020] [Indexed: 11/19/2022]
Abstract
An SNP-BLUP model is computationally scalable even for large numbers of genotyped animals. When genetic variation cannot be completely captured by SNP markers, a more accurate model is obtained by fitting a residual polygenic effect (RPG) as well. However, inclusion of the RPG effect increases the size of the SNP-BLUP mixed model equations (MME) by the number of genotyped animals. Consequently, the calculation of model reliabilities requiring elements of the inverted MME coefficient matrix becomes more computationally challenging with increasing numbers of genotyped animals. We present a Monte Carlo (MC)-based sampling method to estimate the reliability of the SNP-BLUP model including the RPG effect, where the MME size depends on the number of markers and MC samples. We compared reliabilities calculated using different RPG proportions and different MC sample sizes in analyzing 2 data sets. Data set 1 (data set 2) contained 19,757 (222,619) genotyped animals, with 11,729 (50,240) SNP markers, and 231,186 (13.35 million) pedigree animals. Correlations between the correct and the MC-calculated reliabilities were above 98% even with 5,000 MC samples and an 80% RPG proportion in both data sets. However, more MC samples were needed to achieve a small maximum absolute difference and mean squared error, particularly when the RPG proportion exceeded 20%. The computing time for MC SNP-BLUP was shorter than for GBLUP. In conclusion, the MC-based approach can be an effective strategy for calculating SNP-BLUP model reliability with an RPG effect included.
Collapse
Affiliation(s)
- H Ben Zaabza
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland.
| | - E A Mäntysaari
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland
| | - I Strandén
- Natural Resources Institute Finland (Luke), FI-31600 Jokioinen, Finland
| |
Collapse
|
20
|
Misztal I, Lourenco D, Legarra A. Current status of genomic evaluation. J Anim Sci 2020; 98:skaa101. [PMID: 32267923 PMCID: PMC7183352 DOI: 10.1093/jas/skaa101] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/07/2020] [Indexed: 12/14/2022] Open
Abstract
Early application of genomic selection relied on SNP estimation with phenotypes or de-regressed proofs (DRP). Chips of 50k SNP seemed sufficient for an accurate estimation of SNP effects. Genomic estimated breeding values (GEBV) were composed of an index with parent average, direct genomic value, and deduction of a parental index to eliminate double counting. Use of SNP selection or weighting increased accuracy with small data sets but had minimal to no impact with large data sets. Efforts to include potentially causative SNP derived from sequence data or high-density chips showed limited or no gain in accuracy. After the implementation of genomic selection, EBV by BLUP became biased because of genomic preselection and DRP computed based on EBV required adjustments, and the creation of DRP for females is hard and subject to double counting. Genomic selection was greatly simplified by single-step genomic BLUP (ssGBLUP). This method based on combining genomic and pedigree relationships automatically creates an index with all sources of information, can use any combination of male and female genotypes, and accounts for preselection. To avoid biases, especially under strong selection, ssGBLUP requires that pedigree and genomic relationships are compatible. Because the inversion of the genomic relationship matrix (G) becomes costly with more than 100k genotyped animals, large data computations in ssGBLUP were solved by exploiting limited dimensionality of genomic data due to limited effective population size. With such dimensionality ranging from 4k in chickens to about 15k in cattle, the inverse of G can be created directly (e.g., by the algorithm for proven and young) at a linear cost. Due to its simplicity and accuracy, ssGBLUP is routinely used for genomic selection by the major chicken, pig, and beef industries. Single step can be used to derive SNP effects for indirect prediction and for genome-wide association studies, including computations of the P-values. Alternative single-step formulations exist that use SNP effects for genotyped or for all animals. Although genomics is the new standard in breeding and genetics, there are still some problems that need to be solved. This involves new validation procedures that are unaffected by selection, parameter estimation that accounts for all the genomic data used in selection, and strategies to address reduction in genetic variances after genomic selection was implemented.
Collapse
Affiliation(s)
- Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Andres Legarra
- Department of Animal Genetics, Institut National de la Recherche Agronomique, Castanet-Tolosan, France
| |
Collapse
|
21
|
Aguilar I, Fernandez EN, Blasco A, Ravagnolo O, Legarra A. Effects of ignoring inbreeding in model-based accuracy for BLUP and SSGBLUP. J Anim Breed Genet 2020; 137:356-364. [PMID: 32080913 DOI: 10.1111/jbg.12470] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 12/10/2019] [Accepted: 01/11/2020] [Indexed: 11/29/2022]
Abstract
Model-based accuracy, defined as the theoretical correlation between true and estimated breeding value, can be obtained for each individual as a function of its prediction error variance (PEV) and inbreeding coefficient F, in BLUP, GBLUP and SSGBLUP genetic evaluations. However, for computational convenience, inbreeding is often ignored in two places. First, in the computation of reliability = 1-PEV/(1 + F). Second, in the set-up, using Henderson's rules, of the inverse of the pedigree-based relationship matrix A. Both approximations have an effect in the computation of model-based accuracy and result in wrong values. In this work, first we present a reminder of the theory and extend it to SSGBLUP. Second, we quantify the error of ignoring inbreeding with real data in three scenarios: BLUP evaluation and SSGBLUP in Uruguayan dairy cattle, and BLUP evaluations in a line of rabbit closed for >40 generations with steady increase of inbreeding up to an average of 0.30. We show that ignoring inbreeding in the set-up of the A-inverse is equivalent to assume that non-inbred animals are actually inbred. This results in an increase of apparent PEV that is negligible for dairy cattle but considerable for rabbit. Ignoring inbreeding in reliability = 1-PEV/(1 + F) leads to underestimation of reliability for BLUP evaluations, and this underestimation is very large for rabbit. For SSGBLUP in dairy cattle, it leads to both underestimation and overestimation of reliability, both for genotyped and non-genotyped animals. We strongly recommend to include inbreeding both in the set-up of A-inverse and in the computation of reliability from PEVs.
Collapse
Affiliation(s)
- Ignacio Aguilar
- Instituto Nacional de Investigación Agropecuaria (INIA), Montevideo, Uruguay
| | - Eduardo N Fernandez
- Cátedra de Mejora y Conservación de Recursos Genéticos e Instituto de Investigación sobre Producción Agropecuaria, Ambiente y Salud, Facultad de Ciencias Agrarias, UNLZ, Buenos Aires, Argentina
| | - Agustin Blasco
- Institute for Animal Science and Technology, Universitat Politècnica de València, València, Spain
| | - Olga Ravagnolo
- Instituto Nacional de Investigación Agropecuaria (INIA), Montevideo, Uruguay
| | | |
Collapse
|
22
|
Boerner V, Johnston DJ. More animals than markers: a study into the application of the single step T-BLUP model in large-scale multi-trait Australian Angus beef cattle genetic evaluation. Genet Sel Evol 2019; 51:57. [PMID: 31619157 PMCID: PMC6796474 DOI: 10.1186/s12711-019-0499-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 09/24/2019] [Indexed: 11/25/2022] Open
Abstract
Multi-trait single step genetic evaluation is increasingly facing the situation of having more individuals with genotypes than markers within each genotype. This creates a situation where the genomic relationship matrix (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathbf{G }$$\end{document}G) is not of full rank and its inversion is algebraically impossible. Recently, the SS-T-BLUP method was proposed as a modified version of the single step equations, providing an elegant way to circumvent the inversion of the \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathbf{G }$$\end{document}G and therefore accommodate the situation described. SS-T-BLUP uses the Woodbury matrix identity, thus it requires an add-on matrix, which is usually the covariance matrix of the residual polygenic effet. In this paper, we examine the application of SS-T-BLUP to a large-scale multi-trait Australian Angus beef cattle dataset using the full BREEDPLAN single step genetic evaluation model and compare the results to the application of two different methods of using \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathbf{G }$$\end{document}G in a single step model. Results clearly show that SS-T-BLUP outperforms other single step formulations in terms of computational speed and avoids approximation of the inverse of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\mathbf{G }$$\end{document}G.
Collapse
Affiliation(s)
- Vinzent Boerner
- Animal Genetics and Breeding Unit (AGBU), Armidale, Australia.
| | | |
Collapse
|
23
|
van Grevenhof EM, Vandenplas J, Calus MPL. Genomic prediction for crossbred performance using metafounders. J Anim Sci 2019; 97:548-558. [PMID: 30423111 DOI: 10.1093/jas/sky433] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 11/09/2018] [Indexed: 01/01/2023] Open
Abstract
Future genomic evaluation models to be used routinely in breeding programs for pigs and poultry need to be able to optimally use information of crossbred (CB) animals to predict breeding values for CB performance of purebred (PB) selection candidates. Important challenges in the commonly used single-step genomic best linear unbiased prediction (ssGBLUP) model are the definition of relationships between the different line compositions and the definition of the base generation per line. The use of metafounders (MFs) in ssGBLUP has been proposed to overcome these issues. When relationships between lines are known to be different from 0, the use of MFs generalizes the concept of genetic groups relying on the genotype data. Our objective was to investigate the effect of using MFs in genomic prediction for CB performance on estimated variance components, and accuracy and bias of GEBV. This was studied using stochastic simulation to generate data representing a three-way crossbreeding scheme in pigs, with the parental lines being either closely related or unrelated. Results show that using MFs, the variance components should be scaled appropriately, especially when basing them on estimates obtained with, for example a pedigree-based model. The accuracies of GEBV that were obtained using MFs were similar to accuracies without using MFs, regardless whether the lines involved in the CB were closely related or unrelated. The use of MFs resulted in a model that had similar or somewhat better convergence properties compared to other models. We recommend the use of MFs in ssGBLUP for genomic evaluations in crossbreeding schemes.
Collapse
Affiliation(s)
| | - Jérémie Vandenplas
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| | - Mario P L Calus
- Wageningen University & Research Animal Breeding and Genomics, Wageningen, The Netherlands
| |
Collapse
|
24
|
Aldridge MN, Vandenplas J, Calus MPL. Efficient and accurate computation of base generation allele frequencies. J Dairy Sci 2018; 102:1364-1373. [PMID: 30471906 DOI: 10.3168/jds.2018-15264] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 10/01/2018] [Indexed: 11/19/2022]
Abstract
Allele frequencies are used for several aspects of genomic prediction, with the assumption that these are equal to the allele frequency in the base generation of the pedigree. The current standard method, however, calculates allele frequencies from the current genotyped population. We compared the current standard method with BLUP and general least squares (GLS) methods explicitly targeting the base population to determine whether there is a more accurate and still efficient method of calculating allele frequencies that better represents the base generation. A data set based on a typical dairy population was simulated for 325,266 animals; the last 100,078 animals in generations 9 to 12 of the population were genotyped, with 1,670 SNP markers. For the BLUP method, several SNP genotypes were analyzed with a multitrait model by assuming a heritability of 0.99 and no genetic correlation among them. This method was limited by the time required for each BLUP to converge (approximately 6 min per BLUP run of 15 SNP). The GLS method had 2 implementations. The first implementation, using imputation on the fly and multiplication of sparse matrices, was very efficient and required just 49 s and 1.3 GB of random access memory. The second implementation, using a dense full A22-1 matrix, was very inefficient and required more than 1 d of wall clock time and more than 118.2 GB of random access memory. When no selection was considered in the simulations, all methods predicted equally well. When selection was introduced, higher correlations between the estimated allele frequency and known base generation allele frequency were observed for BLUP (0.96 ± 0.01) and GLS (0.97 ± 0.01) compared with the current standard method (0.87 ± 0.01). The GLS method decreased in accuracy when introducing incomplete pedigree, with 25% of sires in the first 5 generations randomly replaced as unknown to erroneously identify founder animals (0.93 ± 0.01) and a further decrease for 8 generations (0.91 ± 0.01). There was no change in accuracy when introducing 5% genotyping errors (0.97 ± 0.01), 5% missing genotypes (0.97 ± 0.01), or both 5% genotyping errors and missing genotypes (0.97 ± 0.01). The GLS method provided the most accurate estimates of base generation allele frequency and was only slightly slower compared with the current method. The efficient implementation of the GLS method, therefore, is very well suited for practical application and is recommended for implementation.
Collapse
Affiliation(s)
- M N Aldridge
- Wageningen University & Research, Animal Breeding and Genomics, 6700AH Wageningen, the Netherlands.
| | - J Vandenplas
- Wageningen University & Research, Animal Breeding and Genomics, 6700AH Wageningen, the Netherlands
| | - M P L Calus
- Wageningen University & Research, Animal Breeding and Genomics, 6700AH Wageningen, the Netherlands
| |
Collapse
|
25
|
Gao H, Koivula M, Jensen J, Strandén I, Madsen P, Pitkänen T, Aamand G, Mäntysaari E. Short communication: Genomic prediction using different single-step methods in the Finnish red dairy cattle population. J Dairy Sci 2018; 101:10082-10088. [DOI: 10.3168/jds.2018-14913] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 07/09/2018] [Indexed: 12/21/2022]
|
26
|
Matilainen K, Strandén I, Aamand GP, Mäntysaari EA. Single step genomic evaluation for female fertility in Nordic Red dairy cattle. J Anim Breed Genet 2018; 135:337-348. [DOI: 10.1111/jbg.12353] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 06/18/2018] [Accepted: 07/03/2018] [Indexed: 11/30/2022]
Affiliation(s)
| | - Ismo Strandén
- Natural Resources Institute Finland (Luke); Jokioinen Finland
| | | | | |
Collapse
|
27
|
Meyer K, Tier B, Swan A. Estimates of genetic trend for single-step genomic evaluations. Genet Sel Evol 2018; 50:39. [PMID: 30075705 PMCID: PMC6091173 DOI: 10.1186/s12711-018-0410-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 07/10/2018] [Indexed: 01/03/2023] Open
Abstract
Background A common measure employed to evaluate the efficacy of livestock improvement schemes is the genetic trend, which is calculated as the means of predicted breeding values for animals born in successive time periods. This implies that different cohorts refer to the same base population. For genetic evaluation schemes integrating genomic information with records for all animals, genotyped or not, this is often not the case: expected means for pedigree founders are zero whereas values for genotyped animals are expected to sum to zero at the (mean) time corresponding to the frequencies that are used to center marker allele counts when calculating genomic relationships. Methods The paper examines estimates of genetic trends from single-step genomic evaluations. After a review of methods which propose to align pedigree-based and genomic relationship matrices, simulation is used to illustrate the effects of alignments and choice of assumed gene frequencies on trajectories of genetic trends. Results The results show that methods available to alleviate differences between the founder populations implied by the two types of relationship matrices perform well; in particular, the meta-founder approach is advantageous. An application to data from routine genetic evaluation of Australian sheep is shown, confirming their effectiveness for practical data. Conclusions Aligning pedigree and genomic relationship matrices for single step genetic evaluation for populations under selection is essential. Fitting meta-founders is an effective and simple method to avoid distortion of estimates of genetic trends.
Collapse
Affiliation(s)
- Karin Meyer
- Animal Genetics and Breeding Unit, University of New England, Armidale, NSW, 2351, Australia.
| | - Bruce Tier
- Animal Genetics and Breeding Unit, University of New England, Armidale, NSW, 2351, Australia
| | - Andrew Swan
- Animal Genetics and Breeding Unit, University of New England, Armidale, NSW, 2351, Australia
| |
Collapse
|
28
|
Vandenplas J, Calus MPL, Ten Napel J. Sparse single-step genomic BLUP in crossbreeding schemes. J Anim Sci 2018; 96:2060-2073. [PMID: 29873759 DOI: 10.1093/jas/sky136] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Accepted: 04/16/2018] [Indexed: 12/20/2022] Open
Abstract
The algorithm for proven and young animals (APY) efficiently computes an approximated inverse of the genomic relationship matrix, by dividing genotyped animals in the so-called core and noncore animals. The APY leads to computationally feasible single-step genomic Best Linear Unbiased Prediction (ssGBLUP) with a large number of genotyped animals and was successfully applied to real single-breed or line datasets. This study aimed to assess the quality of genomic estimated breeding values (GEBV) when using the APY (GEBVAPY), in comparison to GEBV when using the directly inverted genomic relationship matrix (GEBVDIRECT), for situations based on crossbreeding schemes, including F1 and F2 crosses, such as the ones for pigs and chickens. Based on simulations of a 3-way crossbreeding program, we compared different approximated inverses of a genomic relationship matrix, by varying the size and the composition of the core group. We showed that GEBVAPY were accurate approximations of GEBVDIRECT for multivariate ssGBLUP involving different breeds and their crosses. GEBVAPY as accurate as GEBVDIRECT were obtained when the core groups included animals from different breed compositions and when the core groups had a size between the numbers of the largest eigenvalues explaining 98% and 99% of the variation in the raw genomic relationship matrix.
Collapse
Affiliation(s)
- Jérémie Vandenplas
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, AH Wageningen, The Netherlands
| | - Mario P L Calus
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, AH Wageningen, The Netherlands
| | - Jan Ten Napel
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, AH Wageningen, The Netherlands
| |
Collapse
|
29
|
Masuda Y, VanRaden P, Misztal I, Lawlor T. Differing genetic trend estimates from traditional and genomic evaluations of genotyped animals as evidence of preselection bias in US Holsteins. J Dairy Sci 2018; 101:5194-5206. [DOI: 10.3168/jds.2017-13310] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Accepted: 12/10/2017] [Indexed: 11/19/2022]
|
30
|
Koivula M, Strandén I, Aamand G, Mäntysaari E. Reducing bias in the dairy cattle single-step genomic evaluation by ignoring bulls without progeny. J Anim Breed Genet 2018; 135:107-115. [DOI: 10.1111/jbg.12318] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 01/22/2018] [Indexed: 12/26/2022]
Affiliation(s)
- M. Koivula
- Natural Resources Institute Finland (Luke); Green Technology; Jokioinen Finland
| | - I. Strandén
- Natural Resources Institute Finland (Luke); Green Technology; Jokioinen Finland
| | - G.P. Aamand
- NAV Nordic Cattle Genetic Evaluation; Aarhus N Denmark
| | - E.A. Mäntysaari
- Natural Resources Institute Finland (Luke); Green Technology; Jokioinen Finland
| |
Collapse
|
31
|
Colleau JJ, Palhière I, Rodríguez-Ramilo ST, Legarra A. A fast indirect method to compute functions of genomic relationships concerning genotyped and ungenotyped individuals, for diversity management. Genet Sel Evol 2017; 49:87. [PMID: 29191178 PMCID: PMC5709854 DOI: 10.1186/s12711-017-0363-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Accepted: 11/24/2017] [Indexed: 12/01/2022] Open
Abstract
Background Pedigree-based management of genetic diversity in populations, e.g., using optimal contributions, involves computation of the \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Ax}}$$\end{document}Ax type yielding elements (relationships) or functions (usually averages) of relationship matrices. For pedigree-based relationships \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{A}}$$\end{document}A, a very efficient method exists. When all the individuals of interest are genotyped, genomic management can be addressed using the genomic relationship matrix \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{G}}$$\end{document}G; however, to date, the computational problem of efficiently computing \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Gx}}$$\end{document}Gx has not been well studied. When some individuals of interest are not genotyped, genomic management should consider the relationship matrix \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{H}}$$\end{document}H that combines genotyped and ungenotyped individuals; however, direct computation of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Hx}}$$\end{document}Hx is computationally very demanding, because construction of a possibly huge matrix is required. Our work presents efficient ways of computing \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Gx}}$$\end{document}Gx and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Hx}}$$\end{document}Hx, with applications on real data from dairy sheep and dairy goat breeding schemes. Results For genomic relationships, an efficient indirect computation with quadratic instead of cubic cost is \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{x}} = {\mathbf{Z}}\left( {{\mathbf{Z^{\prime}x}}} \right)/k$$\end{document}x=ZZ′x/k, where Z is a matrix relating animals to genotypes. For the relationship matrix \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{H}}$$\end{document}H, we propose an indirect method based on the difference between vectors \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Hx}} - {\mathbf{Ax}}$$\end{document}Hx-Ax, which involves computation of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Ax}}$$\end{document}Ax and of products such as \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{Gw}}$$\end{document}Gw and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{A}}_{22}^{ - 1} {\mathbf{w}}$$\end{document}A22-1w, where \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{w}}$$\end{document}w is a working vector derived from \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{x}}$$\end{document}x. The latter computation is the most demanding but can be done using sparse Cholesky decompositions of matrix \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{A}}^{ - 1}$$\end{document}A-1, which allows handling very large genomic and pedigree data files. Studies based on simulations reported in the literature show that the trends of average relationships in \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{H}}$$\end{document}H and \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{A}}$$\end{document}A differ as genomic selection proceeds. When selection is based on genomic relationships but management is based on pedigree data, the true genetic diversity is overestimated. However, our tests on real data from sheep and goat obtained before genomic selection started do not show this. Conclusions We present efficient methods to compute elements and statistics of the genomic relationships \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{G}}$$\end{document}G and of matrix \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$${\mathbf{H}}$$\end{document}H that combines ungenotyped and genotyped individuals. These methods should be useful to monitor and handle genomic diversity.
Collapse
Affiliation(s)
- Jean-Jacques Colleau
- GABI, INRA, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet Tolosan, France
| | | | - Andres Legarra
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet Tolosan, France.
| |
Collapse
|
32
|
Mäntysaari EA, Evans RD, Strandén I. Efficient single-step genomic evaluation for a multibreed beef cattle population having many genotyped animals. J Anim Sci 2017; 95:4728-4737. [PMID: 29293736 PMCID: PMC6292282 DOI: 10.2527/jas2017.1912] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2017] [Accepted: 09/10/2017] [Indexed: 01/04/2023] Open
Abstract
An equivalent computational approach called ssGTBLUP was formulated for the original single-step GBLUP (ssGBLUP). In ssGTBLUP, the genomic relationship matrix has the form = ' + , where the (centered and scaled) marker matrix has size x (numbers of genotypes and markers), and the matrix can be easily inverted. The inverse can be written as = - ' where is an by matrix. When the preconditioned conjugate gradient (PCG) method is used to solve the mixed model equations, a matrix vector product needs to be computed. In ssGBLUP, this requires multiplications, but in ssGTBLUP, the product ' has 2 multiplications and has multiplications with the constant independent of or . In an approximate approach called ssGTBLUP(p), the eigendecomposition of ' is used to reduce the number of rows in the matrix. Here, p is the percentage of total variance explained by the accepted eigenvalues. The objective of this study was to compare the performance of ssGBLUP, ssGTBLUP, ssGTBLUP(p), and the APY (algorithm for proven and young) method. In APY, the core had 50,000 (APY50K), 30,000 (APY30K), or 10,000 (APY10K) animals. The approaches were tested on the Irish beef carcass conformation genetic evaluation which has a heterogeneous multibreed population. The pedigree had 13.3 million animals. There were = 54,620 markers available from = 163,277 genotyped animals. For genotyped animals, the correlations of breeding values between ssGBLUP and ssGTBLUP(p) for the 11 traits in the model ranged from 0.999-1.000 for p = 99, 0.998-1.000 for p = 98, and 0.992-0.998 for p = 95 but were 0.994-1.000 for APY50K, 0.969-0.997 for APY30K, and 0.899-0.967 for APY10K. Computing times per iteration were 4.43, 3.30, 2.69, 2.29, 1.55, 1.76, 1.27, and 0.55 min for ssGBLUP, ssGTBLUP, ssGTBLUP(99), ssGTBLUP(98), ssGTBLUP(95), APY50K, APY30K, and APY10K, respectively. The ssGTBLUP(p) approach allowed a well-defined approximation to ssGBLUP and fast computations.
Collapse
Affiliation(s)
- E. A. Mäntysaari
- Natural Resources Institute Finland (Luke), Green Technology, FI-31600 Jokioinen, Finland
| | - R. D. Evans
- Irish Cattle Breeding Federation, Highfield House, Newcestown Road, Bandon, Cork, Ireland
| | - I. Strandén
- Natural Resources Institute Finland (Luke), Green Technology, FI-31600 Jokioinen, Finland
| |
Collapse
|