Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lourenco DAL, Fragomeni BO, Bradford HL, Menezes IR, Ferraz JBS, Aguilar I, Tsuruta S, Misztal I. Implications of SNP weighting on single-step genomic predictions for different reference population sizes. J Anim Breed Genet 2017;134:463-471. [PMID: 28833593 DOI: 10.1111/jbg.12288] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 07/19/2017] [Indexed: 01/20/2023]

For:	Lourenco DAL, Fragomeni BO, Bradford HL, Menezes IR, Ferraz JBS, Aguilar I, Tsuruta S, Misztal I. Implications of SNP weighting on single-step genomic predictions for different reference population sizes. J Anim Breed Genet 2017;134:463-471. [PMID: 28833593 DOI: 10.1111/jbg.12288] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 07/19/2017] [Indexed: 01/20/2023]

Number

Cited by Other Article(s)

Leite NG, Bermann M, Tsuruta S, Misztal I, Lourenco D. Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations. Genet Sel Evol 2024;56:59. [PMID: 39174924 PMCID: PMC11340074 DOI: 10.1186/s12711-024-00925-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 07/24/2024] [Indexed: 08/24/2024] Open

Abstract

BACKGROUND

Single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV) and used for genome-wide association studies (ssGWAS). However, obtaining p-values for those SNP effects relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present a method to approximate SNP p-values for ssGWAS with many genotyped animals. This method relies on the combination of a sparse approximation of the inverse of the genomic relationship matrix ( G A P Y - 1 ) built with the algorithm for proven and young ( APY ) and an approximation of the prediction error variance of SNP effects which does not require the inversion of the left-hand side (LHS) of the mixed model equations. To test the proposed p-value computing method, we used a reduced genotyped population of 50K genotyped animals and compared the approximated SNP p-values with benchmark p-values obtained with the direct inverse of LHS built with an exact genomic relationship matrix (G - 1 ) . Then, we applied the proposed approximation method to obtain SNP p-values for a larger genotyped population composed of 450K genotyped animals.

RESULTS

The same genomic regions on chromosomes 7 and 20 were identified across all p-value computing methods when using 50K genotyped animals. In terms of computational requirements, obtaining p-values with the proposed approximation reduced the wall-clock time by 38 times and the memory requirement by ten times compared to using the exact inversion of the LHS. When the approximation was applied to a population of 450K genotyped animals, two new significant regions on chromosomes 6 and 14 were uncovered, indicating an increase in GWAS detection power when including more genotypes in the analyses. The process of obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66GB of memory, which is expected to increase linearly with the addition of noncore genotyped individuals.

CONCLUSIONS

With the proposed method, obtaining p-values for SNP effects in ssGWAS is computationally feasible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS may no longer be a limitation in extensive populations with many genotyped animals.

Collapse

Pocrnic I, Lourenco D, Misztal I. Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions. Genetics 2024;227:iyae103. [PMID: 38913695 PMCID: PMC11304960 DOI: 10.1093/genetics/iyae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024] Open

Dominguez-Castaño P, Fortes M, Tan WLA, Toro-Ospina AM, Silva JAIV. Genome-wide association study for milk yield, frame, and udder conformation traits of Gir dairy cattle. J Dairy Sci 2024:S0022-0302(24)01031-2. [PMID: 39067750 DOI: 10.3168/jds.2024-24648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 07/09/2024] [Indexed: 07/30/2024]

Santana BF, Riser M, Hay EHA, Fragomeni BDO. Alternative SNP weighting for multi-step and single-step genomic BLUP in the presence of causative variants. J Anim Breed Genet 2023;140:679-694. [PMID: 37551047 DOI: 10.1111/jbg.12817] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 06/29/2023] [Accepted: 07/02/2023] [Indexed: 08/09/2023]

Wicki M, Raoul J, Legarra A. Effect of subdivision of the Lacaune dairy sheep breed on the accuracy of genomic prediction. J Dairy Sci 2023;106:5570-5581. [PMID: 37349212 DOI: 10.3168/jds.2022-23114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 02/16/2023] [Indexed: 06/24/2023]

Abstract

Genomic selection was deployed in Lacaune dairy breed in 2015. Lacaune population split in 1972 into 2 breeding companies with associated flocks, and there have been very few exchanges of animals between the subpopulations, leading to divergence of the 2 subpopulations. In spite of that, there is a joint genomic prediction. The objective of this study is to understand how this structuring affects prediction accuracy. We analyzed all the data available from Lacaune breeding program for milk yield: around 6 million phenotypes, 2 million animals in the pedigree and more than 29,000 genotyped animals, including 3,434 and 2,868 AI rams for each company. To consider missing pedigree, we set up genetic groups using the theory of metafounders. First, we studied the pedigree and genomic structures of the 2 subpopulations calculating Fst, evolution of average pedigree relationships across time and principal components analysis of genomic relationships. In a second part, we compared the reliability between different scenarios: an evaluation with a single reference population (Alone), an evaluation with a joint reference population (Together) and an evaluation of one subpopulation based on the reference population of the other group (Indirect). The low Fst value (0.02) reveals that the 2 subpopulations are still genetically close. Nevertheless, a low and constant average relationship between the animals of the 2 subpopulations confirms the absence of recent connections between them. We can see with principal component analysis results that even if they are close, they diverge over time. Finally, we observe small gains in accuracy of Together versus Alone, in spite of whereas doubling the reference population size in Together. These gains vary across years and subpopulations: less than 0.08 (0.46 to 0.54; ratio of accuracy for the partial and whole evaluations-corresponding to the greatest change in this ratio for breeding company 1, observed for the cohort 2016) for one subpopulation and between 0.03 (0.55 to 0.58) and 0.17 (0.48 to 0.65) for the other. To conclude, the 2 subpopulations remain close enough genetically so that their combined evaluation is advantageous, even if only slightly.

Collapse

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Holl J, Herring WO, Misztal I, Lourenco D. Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs. Genet Sel Evol 2023;55:55. [PMID: 37495982 PMCID: PMC10373252 DOI: 10.1186/s12711-023-00831-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 07/18/2023] [Indexed: 07/28/2023] Open

Abstract

BACKGROUND

Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line.

METHODS

Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip.

RESULTS

In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits.

CONCLUSIONS

The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.

Collapse

Jang S, Tsuruta S, Leite NG, Misztal I, Lourenco D. Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study. Genet Sel Evol 2023;55:49. [PMID: 37460964 DOI: 10.1186/s12711-023-00823-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction.

METHODS

We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP).

RESULTS

Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel.

CONCLUSIONS

Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.

Collapse

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Herring WO, Holl J, Misztal I, Lourenco D. Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs. Front Genet 2023;14:1163626. [PMID: 37252662 PMCID: PMC10213539 DOI: 10.3389/fgene.2023.1163626] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 05/03/2023] [Indexed: 05/31/2023] Open

Abstract

Genomic evaluations in pigs could benefit from using multi-line data along with whole-genome sequencing (WGS) if the data are large enough to represent the variability across populations. The objective of this study was to investigate strategies to combine large-scale data from different terminal pig lines in a multi-line genomic evaluation (MLE) through single-step GBLUP (ssGBLUP) models while including variants preselected from whole-genome sequence (WGS) data. We investigated single-line and multi-line evaluations for five traits recorded in three terminal lines. The number of sequenced animals in each line ranged from 731 to 1,865, with 60k to 104k imputed to WGS. Unknown parent groups (UPG) and metafounders (MF) were explored to account for genetic differences among the lines and improve the compatibility between pedigree and genomic relationships in the MLE. Sequence variants were preselected based on multi-line genome-wide association studies (GWAS) or linkage disequilibrium (LD) pruning. These preselected variant sets were used for ssGBLUP predictions without and with weights from BayesR, and the performances were compared to that of a commercial porcine single-nucleotide polymorphisms (SNP) chip. Using UPG and MF in MLE showed small to no gain in prediction accuracy (up to 0.02), depending on the lines and traits, compared to the single-line genomic evaluation (SLE). Likewise, adding selected variants from the GWAS to the commercial SNP chip resulted in a maximum increase of 0.02 in the prediction accuracy, only for average daily feed intake in the most numerous lines. In addition, no benefits were observed when using preselected sequence variants in multi-line genomic predictions. Weights from BayesR did not help improve the performance of ssGBLUP. This study revealed limited benefits of using preselected whole-genome sequence variants for multi-line genomic predictions, even when tens of thousands of animals had imputed sequence data. Correctly accounting for line differences with UPG or MF in MLE is essential to obtain predictions similar to SLE; however, the only observed benefit of an MLE is to have comparable predictions across lines. Further investigation into the amount of data and novel methods to preselect whole-genome causative variants in combined populations would be of significant interest.

Collapse

First large-scale genomic prediction in the honey bee. Heredity (Edinb) 2023;130:320-328. [PMID: 36878945 PMCID: PMC10163272 DOI: 10.1038/s41437-023-00606-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 02/23/2023] [Accepted: 02/23/2023] [Indexed: 03/08/2023] Open

Exploring the statistical nature of independent chromosome segments. Livest Sci 2023. [DOI: 10.1016/j.livsci.2023.105207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2023]

Čítek J, Brzáková M, Bauer J, Tichý L, Sztankóová Z, Vostrý L, Steyn Y. Genome-Wide Association Study for Body Conformation Traits and Fitness in Czech Holsteins. Animals (Basel) 2022;12:ani12243522. [PMID: 36552441 PMCID: PMC10375906 DOI: 10.3390/ani12243522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 12/07/2022] [Accepted: 12/12/2022] [Indexed: 12/15/2022] Open

Brzáková M, Bauer J, Steyn Y, Šplíchal J, Fulínová D. The prediction accuracies of linear-type traits in Czech Holstein cattle when using ssGBLUP or wssGBLUP. J Anim Sci 2022;100:skac369. [PMID: 36334266 PMCID: PMC9746800 DOI: 10.1093/jas/skac369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 11/04/2022] [Indexed: 11/07/2022] Open

Luo H, Hu L, Brito LF, Dou J, Sammad A, Chang Y, Ma L, Guo G, Liu L, Zhai L, Xu Q, Wang Y. Weighted single-step GWAS and RNA sequencing reveals key candidate genes associated with physiological indicators of heat stress in Holstein cattle. J Anim Sci Biotechnol 2022;13:108. [PMID: 35986427 PMCID: PMC9392250 DOI: 10.1186/s40104-022-00748-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/24/2022] [Indexed: 12/15/2022] Open

Abstract

Background

The study of molecular processes regulating heat stress response in dairy cattle is paramount for developing mitigation strategies to improve heat tolerance and animal welfare. Therefore, we aimed to identify quantitative trait loci (QTL) regions associated with three physiological indicators of heat stress response in Holstein cattle, including rectal temperature (RT), respiration rate score (RS), and drooling score (DS). We estimated genetic parameters for all three traits. Subsequently, a weighted single-step genome-wide association study (WssGWAS) was performed based on 3200 genotypes, 151,486 phenotypic records, and 38,101 animals in the pedigree file. The candidate genes located within the identified QTL regions were further investigated through RNA sequencing (RNA-seq) analyses of blood samples for four cows collected in April (non-heat stress group) and four cows collected in July (heat stress group).

Results

The heritability estimates for RT, RS, and DS were 0.06, 0.04, and 0.03, respectively. Fourteen, 19, and 20 genomic regions explained 2.94%, 3.74%, and 4.01% of the total additive genetic variance of RT, RS, and DS, respectively. Most of these genomic regions are located in the Bos taurus autosome (BTA) BTA3, BTA6, BTA8, BTA12, BTA14, BTA21, and BTA24. No genomic regions overlapped between the three indicators of heat stress, indicating the polygenic nature of heat tolerance and the complementary mechanisms involved in heat stress response. For the RNA-seq analyses, 2627 genes were significantly upregulated and 369 downregulated in the heat stress group in comparison to the control group. When integrating the WssGWAS, RNA-seq results, and existing literature, the key candidate genes associated with physiological indicators of heat stress in Holstein cattle are: PMAIP1, SBK1, TMEM33, GATB, CHORDC1, RTN4IP1, and BTBD7.

Conclusions

Physiological indicators of heat stress are heritable and can be improved through direct selection. Fifty-three QTL regions associated with heat stress indicators confirm the polygenic nature and complex genetic determinism of heat tolerance in dairy cattle. The identified candidate genes will contribute for optimizing genomic evaluation models by assigning higher weights to genetic markers located in these regions as well as to the design of SNP panels containing polymorphisms located within these candidate genes.

Graphical Abstract

Supplementary Information

The online version contains supplementary material available at 10.1186/s40104-022-00748-6.

Collapse

Weighted Single-Step Genomic Best Linear Unbiased Prediction Method Application for Assessing Pigs on Meat Productivity and Reproduction Traits. Animals (Basel) 2022;12:ani12131693. [PMID: 35804591 PMCID: PMC9264777 DOI: 10.3390/ani12131693] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Revised: 06/10/2022] [Accepted: 06/28/2022] [Indexed: 11/16/2022] Open

Abstract Changes in the accuracy of the genomic estimates obtained by the ssGBLUP and wssGBLUP methods were evaluated using different reference groups. The weighting procedure’s reasonableness of application Pwas considered to improve the accuracy of genomic predictions for meat, fattening and reproduction traits in pigs. Six reference groups were formed to assess the genomic data quantity impact on the accuracy of predicted values (groups of genotyped animals). The datasets included 62,927 records of meat and fattening productivity (fat thickness over 6–7 ribs (BF1, mm)), muscle depth (MD, mm) and precocity up to 100 kg (age, days) and 16,070 observations of reproductive qualities (the number of all born piglets (TNB) and the number of live-born piglets (NBA), according to the results of the first farrowing). The wssGBLUP method has an advantage over ssGBLUP in terms of estimation reliability. When using a small reference group, the difference in the accuracy of ssGBLUP over BLUP AM is from −1.9 to +7.3 percent points, while for wssGBLUP, the change in accuracy varies from +18.2 to +87.3 percent points. Furthermore, the superiority of the wssGBLUP is also maintained for the largest group of genotyped animals: from +4.7 to +15.9 percent points for ssGBLUP and from +21.1 to +90.5 percent points for wssGBLUP. However, for all analyzed traits, the number of markers explaining 5% of genetic variability varied from 71 to 108, and the number of such SNPs varied depending on the size of the reference group (79–88 for BF1, 72–81 for MD, 71–108 for age). The results of the genetic variation distribution have the greatest similarity between groups of about 1000 and about 1500 individuals. Thus, the size of the reference group of more than 1000 individuals gives more stable results for the estimation based on the wssGBLUP method, while using the reference group of 500 individuals can lead to distorted results of GEBV. Collapse

Botelho ME, Lopes MS, Mathur PK, Knol EF, e Silva FF, Lopes PS, Gimarães SEF, Marques DB, Veroneze R. Weighted genome-wide association study reveals new candidate genes related to boar taint compounds 1. Livest Sci 2022. [DOI: 10.1016/j.livsci.2022.104845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Buaban S, Lengnudum K, Boonkum W, Phakdeedindan P. Genome-wide association study on milk production and somatic cell score for Thai dairy cattle using weighted single-step approach with random regression test-day model. J Dairy Sci 2021;105:468-494. [PMID: 34756438 DOI: 10.3168/jds.2020-19826] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Accepted: 08/24/2021] [Indexed: 12/26/2022]

Abstract

Genome-wide association studies are a powerful tool to identify genomic regions and variants associated with phenotypes. However, only limited mutual confirmation from different studies is available. The objectives of this study were to identify genomic regions as well as genes and pathways associated with the first-lactation milk, fat, protein, and total solid yields; fat, protein, and total solid percentage; and somatic cell score (SCS) in a Thai dairy cattle population. Effects of SNPs were estimated by a weighted single-step GWAS, which back-solved the genomic breeding values predicted using single-step genomic BLUP (ssGBLUP) fitting a single-trait random regression test-day model. Genomic regions that explained at least 0.5% of the total genetic variance were selected for further analyses of candidate genes. Despite the small number of genotyped animals, genomic predictions led to an improvement in the accuracy over the traditional BLUP. Genomic predictions using weighted ssGBLUP were slightly better than the ssGBLUP. The genomic regions associated with milk production traits contained 210 candidate genes on 19 chromosomes [Bos taurus autosome (BTA) 1 to 7, 9, 11 to 16, 20 to 21, 26 to 27 and 29], whereas 21 candidate genes on 3 chromosomes (BTA 11, 16, and 21) were associated with SCS. Many genomic regions explained a small fraction of the genetic variance, indicating polygenic inheritance of the studied traits. Several candidate genes coincided with previous reports for milk production traits in Holstein cattle, especially a large region of genes on BTA14. We identified 141 and 5 novel genes related to milk production and SCS, respectively. These novel genes were also found to be functionally related to heat tolerance (e.g., SLC45A2, IRAG1, and LOC101902172), longevity (e.g., SYT10 and LOC101903327), and fertility (e.g., PAG1). These findings may be attributed to indirect selection in our population. Identified biological networks including intracellular cell transportation and protein catabolism implicate milk production, whereas the immunological pathways such as lymphocyte activation are closely related to SCS. Further studies are required to validate our findings before exploiting them in genomic selection.

Collapse

Genomic Prediction in Local Breeds: The Rendena Cattle as a Case Study. Animals (Basel) 2021;11:ani11061815. [PMID: 34207091 PMCID: PMC8234894 DOI: 10.3390/ani11061815] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 06/16/2021] [Accepted: 06/16/2021] [Indexed: 01/26/2023] Open

Abstract

Simple Summary

Although genomic selection is being used in many livestock species, it has not yet been considered in local breeds due to the lower population size and the potential less effective impact on the genetic evaluation of these breeds. The current research aims to investigate how genomic data can impact the accuracy of genetic predictions for beef traits in Rendena, a small local cattle breed of the North-East of Italy selected for a dual purpose. Classical animal models using only phenotypic information were compared with two models that integrated genomic data with pedigree information. The genomic models presented better accuracy in estimated breeding values of the animals than the ‘classical’ animal model, especially the ‘simpler’ one assuming homogeneous variances of single nucleotide polymorphisms. Our results show that the inclusion of genomic information can be successfully applied to breeding selection scenarios even in small local cattle breeds such as Rendena.

Abstract

The maintenance of local cattle breeds is key to selecting for efficient food production, landscape protection, and conservation of biodiversity and local cultural heritage. Rendena is an indigenous cattle breed from the alpine North-East of Italy, selected for dual purpose, but with lesser emphasis given to beef traits. In this situation, increasing accuracy for beef traits could prevent detrimental effects due to the antagonism with milk production. Our study assessed the impact of genomic information on estimated breeding values (EBVs) in Rendena performance-tested bulls. Traits considered were average daily gain, in vivo EUROP score, and in vivo estimate of dressing percentage. The final dataset contained 1691 individuals with phenotypes and 8372 animals in pedigree, 1743 of which were genotyped. Using the cross-validation method, three models were compared: (i) Pedigree-BLUP (PBLUP); (ii) single-step GBLUP (ssGBLUP), and (iii) weighted single-step GBLUP (WssGBLUP). Models including genomic information presented higher accuracy, especially WssGBLUP. However, the model with the best overall properties was the ssGBLUP, showing higher accuracy than PBLUP and optimal values of bias and dispersion parameters. Our study demonstrated that integrating phenotypes for beef traits with genomic data can be helpful to estimate EBVs, even in a small local breed.

Collapse

Cesarani A, Biffani S, Garcia A, Lourenco D, Bertolini G, Neglia G, Misztal I, Macciotta NPP. Genomic investigation of milk production in Italian buffalo. ITALIAN JOURNAL OF ANIMAL SCIENCE 2021. [DOI: 10.1080/1828051x.2021.1902404] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Lázaro SF, Tonhati H, Oliveira HR, Silva AA, Nascimento AV, Santos DJA, Stefani G, Brito LF. Genomic studies of milk-related traits in water buffalo (Bubalus bubalis) based on single-step genomic best linear unbiased prediction and random regression models. J Dairy Sci 2021;104:5768-5793. [PMID: 33685677 DOI: 10.3168/jds.2020-19534] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 01/02/2021] [Indexed: 01/14/2023]

Abstract

Genomic selection has been widely implemented in many livestock breeding programs, but it remains incipient in buffalo. Therefore, this study aimed to (1) estimate variance components incorporating genomic information in Murrah buffalo; (2) evaluate the performance of genomic prediction for milk-related traits using single- and multitrait random regression models (RRM) and the single-step genomic best linear unbiased prediction approach; and (3) estimate longitudinal SNP effects and candidate genes potentially associated with time-dependent variation in milk, fat, and protein yields, as well as somatic cell score (SCS) in multiple parities. The data used to estimate the genetic parameters consisted of a total of 323,140 test-day records. The average daily heritability estimates were moderate (0.35 ± 0.02 for milk yield, 0.22 ± 0.03 for fat yield, 0.42 ± 0.03 for protein yield, and 0.16 ± 0.03 for SCS). The highest heritability estimates, considering all traits studied, were observed between 20 and 280 d in milk (DIM). The genetic correlation estimates at different DIM among the evaluated traits ranged from -0.10 (156 to 185 DIM for SCS) to 0.61 (36 to 65 DIM for fat yield). In general, direct selection for any of the traits evaluated is expected to result in indirect genetic gains for milk yield, fat yield, and protein yield but also increase SCS at certain lactation stages, which is undesirable. The predicted RRM coefficients were used to derive the genomic estimated breeding values (GEBV) for each time point (from 5 to 305 DIM). In general, the tuning parameters evaluated when constructing the hybrid genomic relationship matrices had a small effect on the GEBV accuracy and a greater effect on the bias estimates. The SNP solutions were back-solved from the GEBV predicted from the Legendre random regression coefficients, which were then used to estimate the longitudinal SNP effects (from 5 to 305 DIM). The daily SNP effect for 3 different lactation stages were performed considering 3 different lactation stages for each trait and parity: from 5 to 70, from 71 to 150, and from 151 to 305 DIM. Important genomic regions related to the analyzed traits and parities that explain more than 0.50% of the total additive genetic variance were selected for further analyses of candidate genes. In general, similar potential candidate genes were found between traits, but our results suggest evidence of differential sets of candidate genes underlying the phenotypic expression of the traits across parities. These results contribute to a better understanding of the genetic architecture of milk production traits in dairy buffalo and reinforce the relevance of incorporating genomic information to genetically evaluate longitudinal traits in dairy buffalo. Furthermore, the candidate genes identified can be used as target genes in future functional genomics studies.

Collapse

Cesarani A, Garcia A, Hidalgo J, Degano L, Vicario D, Macciotta NPP, Lourenco D. Genomic information allows for more accurate breeding values for milkability in dual-purpose Italian Simmental cattle. J Dairy Sci 2021;104:5719-5727. [PMID: 33612221 DOI: 10.3168/jds.2020-19838] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 12/14/2020] [Indexed: 02/01/2023]

Abstract

Milkability is a trait related to the milking efficiency of an animal, and it is a component of the herd profitability. Due to its economic importance, milkability is currently included in the selection index of the Italian Simmental cattle breed with a weight of 7.5%. This lowly heritable trait is measured on a subjective scale from 1 to 3 (1 = slow, 3 = fast), and genetic evaluations are performed by pedigree-based BLUP. Genomic information is now available for some animals in the Italian Simmental population, and its inclusion in the genetic evaluation system could increase accuracy of breeding values and genetic progress for milkability. The aim of this study was to test the feasibility and advantages of having a genomic evaluation for this trait in the Italian Simmental population. Phenotypes were available for 131,308 cows. A total of 9,526 animals had genotypes for 42,152 loci; among the genotyped animals, 2,455 were cows with phenotypes, and the other were their relatives. The youngest cows with both phenotypes and genotypes (n = 900) were identified as selection candidates. Variance components and heritability were estimated using pedigree information, whereas genetic and genomic evaluations were carried out using BLUP and single-step genomic BLUP (ssGBLUP), respectively. In addition, a weighted ssGBLUP was assessed using genomic regions from a genome-wide association study. Evaluation models were validated using theoretical and realized accuracies. The estimated heritability for milkability was 0.12 ± 0.01. The mean theoretical accuracies for selection candidates were 0.43 ± 0.08 (BLUP) and 0.53 ± 0.06 (ssGBLUP). The mean realized accuracies based on linear regression statistics were 0.29 (BLUP) and 0.40 (ssGBLUP). No genomic regions were significantly associated with milkability, thus no improvements in accuracy were observed when using weighted ssGBLUP. Results indicated that genomic information could improve the accuracy of breeding values and increase genetic progress for milkability in Italian Simmental.

Collapse

Mehrban H, Naserkheil M, Lee DH, Cho C, Choi T, Park M, Ibáñez-Escriche N. Genomic Prediction Using Alternative Strategies of Weighted Single-Step Genomic BLUP for Yearling Weight and Carcass Traits in Hanwoo Beef Cattle. Genes (Basel) 2021;12:genes12020266. [PMID: 33673102 PMCID: PMC7917987 DOI: 10.3390/genes12020266] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 02/08/2021] [Accepted: 02/11/2021] [Indexed: 01/20/2023] Open

Botelho ME, Lopes MS, Mathur PK, Knol EF, Guimarães SEF, Marques DBD, Lopes PS, Silva FF, Veroneze R. Applying an association weight matrix in weighted genomic prediction of boar taint compounds. J Anim Breed Genet 2020;138:442-453. [PMID: 33285013 DOI: 10.1111/jbg.12528] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 10/13/2020] [Accepted: 11/14/2020] [Indexed: 12/14/2022]

Abstract

Biological information regarding markers and gene association may be used to attribute different weights for single nucleotide polymorphism (SNP) in genome-wide selection. Therefore, we aimed to evaluate the predictive ability and the bias of genomic prediction using models that allow SNP weighting in the genomic relationship matrix (G) building, with and without incorporating biological information to obtain the weights. Firstly, we performed a genome-wide association studies (GWAS) in data set containing single- (SL) or a multi-line (ML) pig population for androstenone, skatole and indole levels. Secondly, 1%, 2%, 5%, 10%, 30% and 50% of the markers explaining the highest proportions of the genetic variance for each trait were selected to build gene networks through the association weight matrix (AWM) approach. The number of edges in the network was computed and used to derive weights for G (AWM-WssGBLUP). The single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) were used as standard scenarios. All scenarios presented predictive abilities different from zero; however, the great overlap in their confidences interval suggests no differences among scenarios. Most of scenarios of based on AWM provide overestimations for skatole in both SL and ML populations. On the other hand, the skatole and indole prediction were no biased in the ssGBLUP (S1) in both SL and ML populations. Most of scenarios based on AWM provide no biased predictions for indole in both SL and ML populations. In summary, using biological information through AWM matrix and gene networks to derive weights for genomic prediction resulted in no increase in predictive ability for boar taint compounds. In addition, this approach increased the number of analyses steps. Thus, we can conclude that ssGBLUP is most appropriate for the analysis of boar taint compounds in comparison with the weighted strategies used in the present work.

Collapse

Paiva JT, Peixoto MGCD, Bruneli FAT, Alvarenga AB, Oliveira HR, Silva AA, Silva DA, Veroneze R, Silva FF, Lopes PS. Genetic parameters, genome-wide association and gene networks for milk and reproductive traits in Guzerá cattle. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Baller JL, Kachman SD, Kuehn LA, Spangler ML. Genomic prediction using pooled data in a single-step genomic best linear unbiased prediction framework. J Anim Sci 2020;98:5851497. [PMID: 32497209 DOI: 10.1093/jas/skaa184] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 06/01/2020] [Indexed: 01/16/2023] Open

Abstract

Economically relevant traits are routinely collected within the commercial segments of the beef industry but are rarely included in genetic evaluations because of unknown pedigrees. Individual relationships could be resurrected with genomics, but this would be costly; therefore, pooling DNA and phenotypic data provide a cost-effective solution. Pedigree, phenotypic, and genomic data were simulated for a beef cattle population consisting of 15 generations. Genotypes mimicked a 50k marker panel (841 quantitative trait loci were located across the genome, approximately once per 3 Mb) and the phenotype was moderately heritable. Individuals from generation 15 were included in pools (observed genotype and phenotype were mean values of a group). Estimated breeding values (EBV) were generated from a single-step genomic best linear unbiased prediction model. The effects of pooling strategy (random and minimizing or uniformly maximizing phenotypic variation within pools), pool size (1, 2, 10, 20, 50, 100, or no data from generation 15), and generational gaps of genotyping on EBV accuracy (correlation of EBV with true breeding values) were quantified. Greatest EBV accuracies of sires and dams were observed when there was no gap between genotyped parents and pooled offspring. The EBV accuracies resulting from pools were usually greater than no data from generation 15 regardless of sire or dam genotyping. Minimizing phenotypic variation increased EBV accuracy by 8% and 9% over random pooling and uniformly maximizing phenotypic variation, respectively. A pool size of 2 was the only scenario that did not significantly decrease EBV accuracy compared with individual data when pools were formed randomly or by uniformly maximizing phenotypic variation (P > 0.05). Pool sizes of 2, 10, 20, or 50 did not generally lead to statistical differences in EBV accuracy than individual data when pools were constructed to minimize phenotypic variation (P > 0.05). Largest numerical increases in EBV accuracy resulting from pooling compared with no data from generation 15 were seen with sires with prior low EBV accuracy (those born in generation 14). Pooling of any size led to larger EBV accuracies of the pools than individual data when minimizing phenotypic variation. Resulting EBV for the pools could be used to inform management decisions of those pools. Pooled genotyping to garner commercial-level phenotypes for genetic evaluations seems plausible although differences exist depending on pool size and pool formation strategy.

Collapse

Garcia BF, Melo TPD, Neves HHDR, Carvalheiro R. Comparison of GWA statistical methods for traits under different genetic structures: A simulation study. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104213] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Takeda M, Uemoto Y, Satoh M. Effect of genotyped bulls with different numbers of phenotyped progenies on quantitative trait loci detection and genomic evaluation in a simulated cattle population. Anim Sci J 2020;91:e13432. [PMID: 32779330 PMCID: PMC7507195 DOI: 10.1111/asj.13432] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/18/2020] [Accepted: 07/01/2020] [Indexed: 02/02/2023]

Liu A, Lund MS, Boichard D, Karaman E, Guldbrandtsen B, Fritz S, Aamand GP, Nielsen US, Sahana G, Wang Y, Su G. Weighted single-step genomic best linear unbiased prediction integrating variants selected from sequencing data by association and bioinformatics analyses. Genet Sel Evol 2020;52:48. [PMID: 32799816 PMCID: PMC7429790 DOI: 10.1186/s12711-020-00568-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 08/07/2020] [Indexed: 11/30/2022] Open

Abstract

Background

Sequencing data enable the detection of causal loci or single nucleotide polymorphisms (SNPs) highly linked to causal loci to improve genomic prediction. However, until now, studies on integrating such SNPs using a single-step genomic best linear unbiased prediction (ssGBLUP) model are scarce. We investigated the integration of sequencing SNPs selected by association (1262 SNPs) and bioinformatics (2359 SNPs) analyses into the currently used 54K-SNP chip, using three ssGBLUP models which make different assumptions on the distribution of SNP effects: a basic ssGBLUP model, a so-called featured ssGBLUP (ssFGBLUP) model that considered selected sequencing SNPs as a feature genetic component, and a weighted ssGBLUP (ssWGBLUP) model in which the genomic relationship matrix was weighted by the SNP variances estimated from a Bayesian whole-genome regression model, with every 1, 30, or 100 adjacent SNPs within a chromosome region sharing the same variance. We used data on milk production and female fertility in Danish Jersey. In total, 15,823 genotyped and 528,981‬ non-genotyped females born between 1990 and 2013 were used as reference population and 7415 genotyped females and 33,040 non-genotyped females born between 2014 and 2016 were used as validation population.

Results

With basic ssGBLUP, integrating SNPs selected from sequencing data improved prediction reliabilities for milk and protein yields, but resulted in limited or no improvement for fat yield and female fertility. Model performances depended on the SNP set used. When using ssWGBLUP with the 54K SNPs, reliabilities for milk and protein yields improved by 0.028 for genotyped animals and by 0.006 for non-genotyped animals compared with ssGBLUP. However, with the SNP set that included SNPs selected from sequencing data, no statistically significant difference in prediction reliability was observed between the three ssGBLUP models.

Conclusions

In summary, when using 54K SNPs, a ssWGBLUP model with a common weight on the SNPs in a given region is a feasible approach for single-trait genetic evaluation. Integrating relevant SNPs selected from sequencing data into the standard SNP chip can improve the reliability of genomic prediction. Based on such SNP data, a basic ssGBLUP model was suggested since no significant improvement was observed from using alternative models such as ssWGBLUP and ssFGBLUP.

Collapse

Gualdrón Duarte JL, Gori AS, Hubin X, Lourenco D, Charlier C, Misztal I, Druet T. Performances of Adaptive MultiBLUP, Bayesian regressions, and weighted-GBLUP approaches for genomic predictions in Belgian Blue beef cattle. BMC Genomics 2020;21:545. [PMID: 32762654 PMCID: PMC7430838 DOI: 10.1186/s12864-020-06921-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 07/17/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Genomic selection has been successfully implemented in many livestock and crop species. The genomic best linear unbiased predictor (GBLUP) approach, assigning equal variance to all SNP effects, is one of the reference methods. When large-effect variants contribute to complex traits, it has been shown that genomic prediction methods that assign a higher variance to subsets of SNP effects can achieve higher prediction accuracy. We herein compared the efficiency of several such approaches, including the Adaptive MultiBLUP (AM-BLUP) that uses local genomic relationship matrices (GRM) to automatically identify and weight genomic regions with large effects, to predict genetic merit in Belgian Blue beef cattle.

RESULTS

We used a population of approximately 10,000 genotyped cows and their phenotypes for 14 traits, mostly related to muscular development and body dimensions. According to the trait, we found that 4 to 25% of the genetic variance could be associated with 2 to 12 genomic regions harbouring large-effect variants. Noteworthy, three previously identified recessive deleterious variants presented heterozygote advantage and were among the most significant SNPs for several traits. The AM-BLUP resulted in increased reliability of genomic predictions compared to GBLUP (+ 2%), but Bayesian methods proved more efficient (+ 3%). Overall, the reliability gains remained thus limited although higher gains were observed for skin thickness, a trait affected by two genomic regions having particularly large effects. Higher accuracies than those from the original AM-BLUP were achieved when applying the Bayesian Sparse Linear Mixed Model to pre-select groups of SNPs with large effects and subsequently use their estimated variance to build a weighted GRM. Finally, the single-step GBLUP performed best and could be further improved (+ 3% prediction accuracy) by using these weighted GRM.

CONCLUSIONS

The AM-BLUP is an attractive method to automatically identify and weight genomic regions with large effects on complex traits. However, the method was less accurate than Bayesian methods. Overall, weighted methods achieved modest accuracy gains compared to GBLUP. Nevertheless, the computational efficiency of the AM-BLUP might be valuable at higher marker density, including with whole-genome sequencing data. Furthermore, weighted GRM are particularly useful to account for large variance loci in the single-step GBLUP.

Collapse

Lourenco D, Legarra A, Tsuruta S, Masuda Y, Aguilar I, Misztal I. Single-Step Genomic Evaluations from Theory to Practice: Using SNP Chips and Sequence Data in BLUPF90. Genes (Basel) 2020;11:E790. [PMID: 32674271 PMCID: PMC7397237 DOI: 10.3390/genes11070790] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/03/2020] [Accepted: 07/06/2020] [Indexed: 11/16/2022] Open

Freitas P, Oliveira H, Silva F, Fleming A, Miglior F, Schenkel F, Brito L. Genomic analyses for predicted milk fatty acid composition throughout lactation in North American Holstein cattle. J Dairy Sci 2020;103:6318-6331. [DOI: 10.3168/jds.2019-17628] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 03/12/2020] [Indexed: 12/12/2022]

Alvarenga AB, Veroneze R, Oliveira HR, Marques DBD, Lopes PS, Silva FF, Brito LF. Comparing Alternative Single-Step GBLUP Approaches and Training Population Designs for Genomic Evaluation of Crossbred Animals. Front Genet 2020;11:263. [PMID: 32328083 PMCID: PMC7162606 DOI: 10.3389/fgene.2020.00263] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 03/05/2020] [Indexed: 02/06/2023] Open

Abstract

As crossbreeding is extensively used in some livestock species, we aimed to evaluate the performance of single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) methods to predict Genomic Estimated Breeding Values (GEBVs) of crossbred animals. Different training population scenarios were evaluated: (SC1) ssGBLUP based on a single-trait model considering purebred and crossbred animals in a joint training population; (SC2) ssGBLUP based on a multiple-trait model to enable considering phenotypes recorded in purebred and crossbred training animals as different traits; (SC3) WssGBLUP based on a single-trait model considering purebred and crossbred animals jointly in the training population (both populations were used for SNP weights' estimation); (SC4) WssGBLUP based on a single-trait model considering only purebred animals in the training population (crossbred population only used for SNP weights' estimation); (SC5) WssGBLUP based on a single-trait model and the training population characterized by purebred animals (purebred population used for SNP weights' estimation). A complex trait was simulated assuming alternative genetic architectures. Different scaling factors to blend the inverse of the genomic (G -1) and pedigree (A 22 - 1 ) relationship matrices were also tested. The predictive performance of each scenario was evaluated based on the validation accuracy and regression coefficient. The genetic correlations across simulated populations in the different scenarios ranged from moderate to high (0.71-0.99). The scenario mimicking a completely polygenic trait (h Q T L 2 = 0) yielded the lowest validation accuracy (0.12; for SC3 and SC4). The simulated scenarios assuming 4,500 QTLs affecting the trait andh Q T L 2 = h 2 resulted in the greatest GEBV accuracies (0.47; for SC1 and SC2). The regression coefficients ranged from 0.28 (for SC3 assuming polygenic effect) to 1.27 (for SC2 considering 4,500 QTLs). In general, SC3 and SC5 resulted in inflated GEBVs, whereas other scenarios yielded deflated GEBVs. The scaling factors used to combine G -1 andA 22 - 1 had a small influence on the validation accuracies, but a greater effect on the regression coefficients. Due to the complexity of multiple-trait models and WssGBLUP analyses, and a similar predictive performance across the methods evaluated, SC1 is recommended for genomic evaluation in crossbred populations with similar genetic structures [moderate-to-high (0.71-0.99) genetic correlations between purebred and crossbred populations].

Collapse

Misztal I, Lourenco D, Legarra A. Current status of genomic evaluation. J Anim Sci 2020;98:skaa101. [PMID: 32267923 PMCID: PMC7183352 DOI: 10.1093/jas/skaa101] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/07/2020] [Indexed: 12/14/2022] Open

Abstract

Early application of genomic selection relied on SNP estimation with phenotypes or de-regressed proofs (DRP). Chips of 50k SNP seemed sufficient for an accurate estimation of SNP effects. Genomic estimated breeding values (GEBV) were composed of an index with parent average, direct genomic value, and deduction of a parental index to eliminate double counting. Use of SNP selection or weighting increased accuracy with small data sets but had minimal to no impact with large data sets. Efforts to include potentially causative SNP derived from sequence data or high-density chips showed limited or no gain in accuracy. After the implementation of genomic selection, EBV by BLUP became biased because of genomic preselection and DRP computed based on EBV required adjustments, and the creation of DRP for females is hard and subject to double counting. Genomic selection was greatly simplified by single-step genomic BLUP (ssGBLUP). This method based on combining genomic and pedigree relationships automatically creates an index with all sources of information, can use any combination of male and female genotypes, and accounts for preselection. To avoid biases, especially under strong selection, ssGBLUP requires that pedigree and genomic relationships are compatible. Because the inversion of the genomic relationship matrix (G) becomes costly with more than 100k genotyped animals, large data computations in ssGBLUP were solved by exploiting limited dimensionality of genomic data due to limited effective population size. With such dimensionality ranging from 4k in chickens to about 15k in cattle, the inverse of G can be created directly (e.g., by the algorithm for proven and young) at a linear cost. Due to its simplicity and accuracy, ssGBLUP is routinely used for genomic selection by the major chicken, pig, and beef industries. Single step can be used to derive SNP effects for indirect prediction and for genome-wide association studies, including computations of the P-values. Alternative single-step formulations exist that use SNP effects for genotyped or for all animals. Although genomics is the new standard in breeding and genetics, there are still some problems that need to be solved. This involves new validation procedures that are unaffected by selection, parameter estimation that accounts for all the genomic data used in selection, and strategies to address reduction in genetic variances after genomic selection was implemented.

Collapse

Fragomeni BO, Lourenco DAL, Legarra A, VanRaden PM, Misztal I. Alternative SNP weighting for single-step genomic best linear unbiased predictor evaluation of stature in US Holsteins in the presence of selected sequence variants. J Dairy Sci 2019;102:10012-10019. [PMID: 31495612 DOI: 10.3168/jds.2019-16262] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 07/16/2019] [Indexed: 11/19/2022]

Abstract

Causal variants inferred from sequence data analysis are expected to increase accuracy of genomic selection. In this work we evaluated the gain in reliability of genomic predictions, for stature in US Holsteins, when adding selected sequence variants to a pre-existent SNP chip. Two prediction methods were tested: de-regressed proofs assuming heterogeneous (genomic BLUP; GBLUP) residual variances and by single-step GBLUP (ssGBLUP) using actual phenotypes. Phenotypic data included 3,999,631 records for stature on 3,027,304 Holstein cows. Genotypes on 54,087 SNP markers (54k) were available for 26,877 bulls. Additionally, 16,648 selected sequence variants were combined with the 54k markers, for a total of 70,735 (70k) markers. In all methods, SNP in the genomic relationship matrix (G) were unweighted or weighted iteratively, with weights derived either by SNP effects squared or by a nonlinear method that resembles BayesA (nonlinear A). Reliability of genomic predictions were obtained by cross validation. With unweighted G derived from 54k markers, the reliabilities (× 100) were 72.4 for GBLUP and 75.3 for ssGBLUP. With unweighted G derived from 70k markers, the reliabilities were 73.4 and 76.0, respectively. Weighting by nonlinear A changed reliabilities to 73.3, and 75.9, respectively. Addition of selected sequence variants had a small effect on reliabilities. Weighting by quadratic functions reduced reliabilities. Weighting by nonlinear A increased reliabilities for GBLUP but had only a small effect in ssGBLUP. Reliabilities for direct genomic values extracted from ssGBLUP using unweighted G with 54k were higher than reliabilities by any GBLUP. Thus, ssGBLUP seems to capture more information than GBLUP and there is less room for extra reliability. Improvements in GBLUP may be because the weights in G change the covariance structure, which can explain a proportion of the variance that is accounted for when a heterogeneous residual variance is assumed by considering a different number of daughters per bull.

Collapse

Silva RMO, Evenhuis JP, Vallejo RL, Gao G, Martin KE, Leeds TD, Palti Y, Lourenco DAL. Whole-genome mapping of quantitative trait loci and accuracy of genomic predictions for resistance to columnaris disease in two rainbow trout breeding populations. Genet Sel Evol 2019;51:42. [PMID: 31387519 PMCID: PMC6683352 DOI: 10.1186/s12711-019-0484-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 07/30/2019] [Indexed: 01/09/2023] Open

Abstract

Background

Columnaris disease (CD) is an emerging problem for the rainbow trout aquaculture industry in the US. The objectives of this study were to: (1) identify common genomic regions that explain a large proportion of the additive genetic variance for resistance to CD in two rainbow trout (Oncorhynchus mykiss) populations; and (2) estimate the gains in prediction accuracy when genomic information is used to evaluate the genetic potential of survival to columnaris infection in each population.

Methods

Two aquaculture populations were investigated: the National Center for Cool and Cold Water Aquaculture (NCCCWA) odd-year line and the Troutlodge, Inc., May odd-year (TLUM) nucleus breeding population. Fish that survived to 21 days post-immersion challenge were recorded as resistant. Single nucleotide polymorphism (SNP) genotypes were available for 1185 and 1137 fish from NCCCWA and TLUM, respectively. SNP effects and variances were estimated using the weighted single-step genomic best linear unbiased prediction (BLUP) for genome-wide association. Genomic regions that explained more than 1% of the additive genetic variance were considered to be associated with resistance to CD. Predictive ability was calculated in a fivefold cross-validation scheme and using a linear regression method.

Results

Validation on adjusted phenotypes provided a prediction accuracy close to zero, due to the binary nature of the trait. Using breeding values computed from the complete data as benchmark improved prediction accuracy of genomic models by about 40% compared to the pedigree-based BLUP. Fourteen windows located on six chromosomes were associated with resistance to CD in the NCCCWA population, of which two windows on chromosome Omy 17 jointly explained more than 10% of the additive genetic variance. Twenty-six windows located on 13 chromosomes were associated with resistance to CD in the TLUM population. Only four associated genomic regions overlapped with quantitative trait loci (QTL) between both populations.

Conclusions

Our results suggest that genome-wide selection for resistance to CD in rainbow trout has greater potential than selection for a few target genomic regions that were found to be associated to resistance to CD due to the polygenic architecture of this trait, and because the QTL associated with resistance to CD are not sufficiently informative for selection decisions across populations.

Electronic supplementary material

The online version of this article (10.1186/s12711-019-0484-4) contains supplementary material, which is available to authorized users.

Collapse

Oliveira H, Lourenco D, Masuda Y, Misztal I, Tsuruta S, Jamrozik J, Brito L, Silva F, Schenkel F. Application of single-step genomic evaluation using multiple-trait random regression test-day models in dairy cattle. J Dairy Sci 2019;102:2365-2377. [DOI: 10.3168/jds.2018-15466] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Accepted: 11/20/2018] [Indexed: 01/30/2023]

Garcia ALS, Bosworth B, Waldbieser G, Misztal I, Tsuruta S, Lourenco DAL. Development of genomic predictions for harvest and carcass weight in channel catfish. Genet Sel Evol 2018;50:66. [PMID: 30547740 PMCID: PMC6295041 DOI: 10.1186/s12711-018-0435-5] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 11/29/2018] [Indexed: 12/30/2022] Open

Abstract

Background

Catfish farming is the largest segment of US aquaculture and research is ongoing to improve production efficiency, including genetic selection programs to improve economically important traits. The objectives of this study were to investigate the use of genomic selection to improve breeding value accuracy and to identify major single nucleotide polymorphisms (SNPs) associated with harvest weight and residual carcass weight in a channel catfish population. Phenotypes were available for harvest weight (n = 27,160) and residual carcass weight (n = 6020), and 36,365 pedigree records were available. After quality control, genotypes for 54,837 SNPs were available for 2911 fish. Estimated breeding values (EBV) were obtained with traditional pedigree-based best linear unbiased prediction (BLUP) and genomic (G)EBV were estimated with single-step genomic BLUP (ssGBLUP). EBV and GEBV prediction accuracies were evaluated using different validation strategies. The ability to predict future performance was calculated as the correlation between EBV or GEBV and adjusted phenotypes.

Results

Compared to the pedigree BLUP, ssGBLUP increased predictive ability up to 28% and 36% for harvest weight and residual carcass weight, respectively; and GEBV were superior to EBV for all validation strategies tested. Breeding value inflation was assessed as the regression coefficient of adjusted phenotypes on breeding values, and the results indicated that genomic information reduced breeding value inflation. Genome-wide association studies based on windows of 20 adjacent SNPs indicated that both harvest weight and residual carcass weight have a polygenic architecture with no major SNPs (the largest SNPs explained 0.96 and 1.19% of the additive genetic variation for harvest weight and residual carcass weight respectively).

Conclusions

Genomic evaluation improves the ability to predict future performance relative to traditional BLUP and will allow more accurate identification of genetically superior individuals within catfish families.

Collapse

Guarini AR, Lourenco DAL, Brito LF, Sargolzaei M, Baes CF, Miglior F, Misztal I, Schenkel FS. Genetics and genomics of reproductive disorders in Canadian Holstein cattle. J Dairy Sci 2018;102:1341-1353. [PMID: 30471913 DOI: 10.3168/jds.2018-15038] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 09/29/2018] [Indexed: 01/25/2023]