1
|
Leite NG, Bermann M, Tsuruta S, Misztal I, Lourenco D. Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations. Genet Sel Evol 2024; 56:59. [PMID: 39174924 PMCID: PMC11340074 DOI: 10.1186/s12711-024-00925-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 07/24/2024] [Indexed: 08/24/2024] Open
Abstract
BACKGROUND Single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV) and used for genome-wide association studies (ssGWAS). However, obtaining p-values for those SNP effects relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present a method to approximate SNP p-values for ssGWAS with many genotyped animals. This method relies on the combination of a sparse approximation of the inverse of the genomic relationship matrix ( G A P Y - 1 ) built with the algorithm for proven and young ( APY ) and an approximation of the prediction error variance of SNP effects which does not require the inversion of the left-hand side (LHS) of the mixed model equations. To test the proposed p-value computing method, we used a reduced genotyped population of 50K genotyped animals and compared the approximated SNP p-values with benchmark p-values obtained with the direct inverse of LHS built with an exact genomic relationship matrix (G - 1 ) . Then, we applied the proposed approximation method to obtain SNP p-values for a larger genotyped population composed of 450K genotyped animals. RESULTS The same genomic regions on chromosomes 7 and 20 were identified across all p-value computing methods when using 50K genotyped animals. In terms of computational requirements, obtaining p-values with the proposed approximation reduced the wall-clock time by 38 times and the memory requirement by ten times compared to using the exact inversion of the LHS. When the approximation was applied to a population of 450K genotyped animals, two new significant regions on chromosomes 6 and 14 were uncovered, indicating an increase in GWAS detection power when including more genotypes in the analyses. The process of obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66GB of memory, which is expected to increase linearly with the addition of noncore genotyped individuals. CONCLUSIONS With the proposed method, obtaining p-values for SNP effects in ssGWAS is computationally feasible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS may no longer be a limitation in extensive populations with many genotyped animals.
Collapse
Affiliation(s)
- Natália Galoro Leite
- 1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Matias Bermann
- 1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Shogo Tsuruta
- 1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Ignacy Misztal
- 1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- 1Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
2
|
Pocrnic I, Lourenco D, Misztal I. Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions. Genetics 2024; 227:iyae103. [PMID: 38913695 PMCID: PMC11304960 DOI: 10.1093/genetics/iyae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024] Open
Abstract
Increasing SNP density by incorporating sequence information only marginally increases prediction accuracies of breeding values in livestock. To find out why, we used statistical models and simulations to investigate the shape of distribution of estimated SNP effects (a profile) around quantitative trait nucleotides (QTNs) in populations with a small effective population size (Ne). A QTN profile created by averaging SNP effects around each QTN was similar to the shape of expected pairwise linkage disequilibrium (PLD) based on Ne and genetic distance between SNP, with a distinct peak for the QTN. Populations with smaller Ne showed lower but wider QTN profiles. However, adding more genotyped individuals with phenotypes dragged the profile closer to the QTN. The QTN profile was higher and narrower for populations with larger compared to smaller Ne. Assuming the PLD curve for the QTN profile, 80% of the additive genetic variance explained by each QTN was contained in ± 1/Ne Morgan interval around the QTN, corresponding to 2 Mb in cattle and 5 Mb in pigs and chickens. With such large intervals, identifying QTN is difficult even if all of them are in the data and the assumed genetic architecture is simplistic. Additional complexity in QTN detection arises from confounding of QTN profiles with signals due to relationships, overlapping profiles with closely spaced QTN, and spurious signals. However, small Ne allows for accurate predictions with large data even without QTN identification because QTNs are accounted for by QTN profiles if SNP density is sufficient to saturate the segments.
Collapse
Affiliation(s)
- Ivan Pocrnic
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
3
|
Dominguez-Castaño P, Fortes M, Tan WLA, Toro-Ospina AM, Silva JAIV. Genome-wide association study for milk yield, frame, and udder conformation traits of Gir dairy cattle. J Dairy Sci 2024:S0022-0302(24)01031-2. [PMID: 39067750 DOI: 10.3168/jds.2024-24648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 07/09/2024] [Indexed: 07/30/2024]
Abstract
Genome-wide association studies (GWAS) are employed to identify genomic regions and candidate genes associated with several traits. The aim of this study was to perform a GWAS to identify causative variants and genes associated with milk yield, frame, and udder conformation traits in Gir dairy cattle. Body conformation traits were classified as "frame," and "udder" traits for this study. After genotyping imputation and quality control 42,105 polymorphisms were available for analyses and 24,489 cows with pedigree information had phenotypes. First, P-value was calculated based on the variance of the prediction error of the SNP-effects on the first iteration. After that, 2 more iterations were performed to carry out the weighted single-step genome-wide association methodology, performed using genomic moving windows defined based on linkage disequilibrium. The significant SNPs and top 10 windows explaining the highest percentage of additive genetic variance were selected and used for QTL and gene annotation. The variants identified in our work overlapped with QTLs from the animal QTL database on chromosomes 1 to 23, except for chromosome 4. The Gir breed is less studied than the Holstein breed and as such the animal QTL database is biased to Holstein results. Hence it is noteworthy that our GWAS had similarities with previously described QTLs. These previously known QTLs were related to milk yield, body height, rump angle, udder width, and udder depth. In total, 5 genes were annotated. Of these genes, FAM13A and CMSS1 had been previously related to bone and carcass weight in cattle.
Collapse
Affiliation(s)
- P Dominguez-Castaño
- Faculdade de Ciências Agrárias e Veterinárias (FCAV), Universidade Estadual Paulista, Jaboticabal, São Paulo 14884-900, Brasil; Facultad de Ciencias Agrarias, Fundación Universitaria Agraria de Colombia-UNIAGRARIA, Bogotá 111166, Colombia; The University of Queensland, School of Chemistry and Molecular Biosciences, St Lucia, Brisbane, Queensland 4072, Australia.
| | - M Fortes
- The University of Queensland, School of Chemistry and Molecular Biosciences, St Lucia, Brisbane, Queensland 4072, Australia
| | - W L A Tan
- The University of Queensland, School of Chemistry and Molecular Biosciences, St Lucia, Brisbane, Queensland 4072, Australia
| | - A M Toro-Ospina
- Science and Humanities Faculty, Digital University Institute of Antioquia, IUDigital, Medellin 50010, Colombia
| | - J A Ii V Silva
- Faculdade de Ciências Agrárias e Veterinárias (FCAV), Universidade Estadual Paulista, Jaboticabal, São Paulo 14884-900, Brasil; Faculdade de Medicina Veterinária e Zootecnia (FMVZ), Universidade Estadual Paulista, Botucatu, São Paulo 18618-000, Brasil.
| |
Collapse
|
4
|
Santana BF, Riser M, Hay EHA, Fragomeni BDO. Alternative SNP weighting for multi-step and single-step genomic BLUP in the presence of causative variants. J Anim Breed Genet 2023; 140:679-694. [PMID: 37551047 DOI: 10.1111/jbg.12817] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 06/29/2023] [Accepted: 07/02/2023] [Indexed: 08/09/2023]
Abstract
The accuracy of genetic selection in dairy can be increased by the adoption of new technologies, such as the inclusion of sequence data. In simulation studies, assigning different weights to causative single-nucleotide polymorphism (SNP) markers led to better predictions depending on the genomic prediction method used. However, it is still not clear how the weights should be calculated. Our objective was to evaluate the accuracy of a multi-step method (GBLUP) and single-step GBLUP with simulated data using regular SNP, causatives variants (QTN) and the combination of both. Additionally, we compared the accuracies of all previous scenarios using alternatives for SNP weighting. The data were simulated assuming a single trait with a heritability of 0.3. The effective population size (Ne) was approximately 200. The pedigree contained 440,000 animals, and approximately 16,800 individuals were genotyped. A total of 49,974 SNP markers were evenly placed throughout the genome, and 100, 1000 and 2000 causative QTN were simulated. Both GBLUP and ssGBLUP were used in this study. We evaluated quadratic and nonlinear SNP weights in addition to the unweighted G. The inclusion of QTN to panels led to significant accuracy gains. Nonlinear A was demonstrated to be superior to quadratic weighting and unweighted approaches; however, results from Nonlinear A were dependent on the equation parameters. The unweighted approach was more suitable for less polygenic scenarios. Finally, SNP weighting might help elucidate trait architecture features based on changes in the accuracy of genomic prediction.
Collapse
Affiliation(s)
| | - Molly Riser
- Animal Science Department, University of Connecticut, Storrs, Connecticut, USA
| | - El Hamidi A Hay
- Fort Keogh Livestock and Range Research Laboratory, USDA/ARS, Miles City, Montana, USA
| | - Breno de Oliveira Fragomeni
- Animal Science Department, University of Connecticut, Storrs, Connecticut, USA
- Institute for System Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
5
|
Wicki M, Raoul J, Legarra A. Effect of subdivision of the Lacaune dairy sheep breed on the accuracy of genomic prediction. J Dairy Sci 2023; 106:5570-5581. [PMID: 37349212 DOI: 10.3168/jds.2022-23114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 02/16/2023] [Indexed: 06/24/2023]
Abstract
Genomic selection was deployed in Lacaune dairy breed in 2015. Lacaune population split in 1972 into 2 breeding companies with associated flocks, and there have been very few exchanges of animals between the subpopulations, leading to divergence of the 2 subpopulations. In spite of that, there is a joint genomic prediction. The objective of this study is to understand how this structuring affects prediction accuracy. We analyzed all the data available from Lacaune breeding program for milk yield: around 6 million phenotypes, 2 million animals in the pedigree and more than 29,000 genotyped animals, including 3,434 and 2,868 AI rams for each company. To consider missing pedigree, we set up genetic groups using the theory of metafounders. First, we studied the pedigree and genomic structures of the 2 subpopulations calculating Fst, evolution of average pedigree relationships across time and principal components analysis of genomic relationships. In a second part, we compared the reliability between different scenarios: an evaluation with a single reference population (Alone), an evaluation with a joint reference population (Together) and an evaluation of one subpopulation based on the reference population of the other group (Indirect). The low Fst value (0.02) reveals that the 2 subpopulations are still genetically close. Nevertheless, a low and constant average relationship between the animals of the 2 subpopulations confirms the absence of recent connections between them. We can see with principal component analysis results that even if they are close, they diverge over time. Finally, we observe small gains in accuracy of Together versus Alone, in spite of whereas doubling the reference population size in Together. These gains vary across years and subpopulations: less than 0.08 (0.46 to 0.54; ratio of accuracy for the partial and whole evaluations-corresponding to the greatest change in this ratio for breeding company 1, observed for the cohort 2016) for one subpopulation and between 0.03 (0.55 to 0.58) and 0.17 (0.48 to 0.65) for the other. To conclude, the 2 subpopulations remain close enough genetically so that their combined evaluation is advantageous, even if only slightly.
Collapse
Affiliation(s)
- M Wicki
- INRAE, INP, UMR 1388 GenPhySE, F-31326 Castanet-Tolosan, France; Institut de l'Elevage, Castanet-Tolosan 31321, France.
| | - J Raoul
- INRAE, INP, UMR 1388 GenPhySE, F-31326 Castanet-Tolosan, France; Institut de l'Elevage, Castanet-Tolosan 31321, France
| | - A Legarra
- INRAE, INP, UMR 1388 GenPhySE, F-31326 Castanet-Tolosan, France
| |
Collapse
|
6
|
Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Holl J, Herring WO, Misztal I, Lourenco D. Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs. Genet Sel Evol 2023; 55:55. [PMID: 37495982 PMCID: PMC10373252 DOI: 10.1186/s12711-023-00831-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 07/18/2023] [Indexed: 07/28/2023] Open
Abstract
BACKGROUND Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line. METHODS Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip. RESULTS In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits. CONCLUSIONS The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Roger Ros-Freixedes
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio-CERCA Center, Lleida, Spain
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus Plc, Hendersonville, TN, USA
| | - Justin Holl
- The Pig Improvement Company, Genus Plc, Hendersonville, TN, USA
| | | | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
7
|
Jang S, Tsuruta S, Leite NG, Misztal I, Lourenco D. Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study. Genet Sel Evol 2023; 55:49. [PMID: 37460964 DOI: 10.1186/s12711-023-00823-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction. METHODS We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP). RESULTS Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel. CONCLUSIONS Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Natalia Galoro Leite
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
8
|
Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Herring WO, Holl J, Misztal I, Lourenco D. Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs. Front Genet 2023; 14:1163626. [PMID: 37252662 PMCID: PMC10213539 DOI: 10.3389/fgene.2023.1163626] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 05/03/2023] [Indexed: 05/31/2023] Open
Abstract
Genomic evaluations in pigs could benefit from using multi-line data along with whole-genome sequencing (WGS) if the data are large enough to represent the variability across populations. The objective of this study was to investigate strategies to combine large-scale data from different terminal pig lines in a multi-line genomic evaluation (MLE) through single-step GBLUP (ssGBLUP) models while including variants preselected from whole-genome sequence (WGS) data. We investigated single-line and multi-line evaluations for five traits recorded in three terminal lines. The number of sequenced animals in each line ranged from 731 to 1,865, with 60k to 104k imputed to WGS. Unknown parent groups (UPG) and metafounders (MF) were explored to account for genetic differences among the lines and improve the compatibility between pedigree and genomic relationships in the MLE. Sequence variants were preselected based on multi-line genome-wide association studies (GWAS) or linkage disequilibrium (LD) pruning. These preselected variant sets were used for ssGBLUP predictions without and with weights from BayesR, and the performances were compared to that of a commercial porcine single-nucleotide polymorphisms (SNP) chip. Using UPG and MF in MLE showed small to no gain in prediction accuracy (up to 0.02), depending on the lines and traits, compared to the single-line genomic evaluation (SLE). Likewise, adding selected variants from the GWAS to the commercial SNP chip resulted in a maximum increase of 0.02 in the prediction accuracy, only for average daily feed intake in the most numerous lines. In addition, no benefits were observed when using preselected sequence variants in multi-line genomic predictions. Weights from BayesR did not help improve the performance of ssGBLUP. This study revealed limited benefits of using preselected whole-genome sequence variants for multi-line genomic predictions, even when tens of thousands of animals had imputed sequence data. Correctly accounting for line differences with UPG or MF in MLE is essential to obtain predictions similar to SLE; however, the only observed benefit of an MLE is to have comparable predictions across lines. Further investigation into the amount of data and novel methods to preselect whole-genome causative variants in combined populations would be of significant interest.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Roger Ros-Freixedes
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio-CERCA Center, Lleida, Spain
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - William O Herring
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - Justin Holl
- The Pig Improvement Company, Genus plc, Hendersonville, TN, United States
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, United States
| |
Collapse
|
9
|
First large-scale genomic prediction in the honey bee. Heredity (Edinb) 2023; 130:320-328. [PMID: 36878945 PMCID: PMC10163272 DOI: 10.1038/s41437-023-00606-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 02/23/2023] [Accepted: 02/23/2023] [Indexed: 03/08/2023] Open
Abstract
Genomic selection has increased genetic gain in several livestock species, but due to the complicated genetics and reproduction biology not yet in honey bees. Recently, 2970 queens were genotyped to gather a reference population. For the application of genomic selection in honey bees, this study analyzes the accuracy and bias of pedigree-based and genomic breeding values for honey yield, three workability traits, and two traits for resistance against the parasite Varroa destructor. For breeding value estimation, we use a honey bee-specific model with maternal and direct effects, to account for the contributions of the workers and the queen of a colony to the phenotypes. We conducted a validation for the last generation and a five-fold cross-validation. In the validation for the last generation, the accuracy of pedigree-based estimated breeding values was 0.12 for honey yield, and ranged from 0.42 to 0.61 for the workability traits. The inclusion of genomic marker data improved these accuracies to 0.23 for honey yield, and a range from 0.44 to 0.65 for the workability traits. The inclusion of genomic data did not improve the accuracy of the disease-related traits. Traits with high heritability for maternal effects compared to the heritability for direct effects showed the most promising results. For all traits except the Varroa resistance traits, the bias with genomic methods was on a similar level compared to the bias with pedigree-based BLUP. The results show that genomic selection can successfully be applied to honey bees.
Collapse
|
10
|
Exploring the statistical nature of independent chromosome segments. Livest Sci 2023. [DOI: 10.1016/j.livsci.2023.105207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2023]
|
11
|
Čítek J, Brzáková M, Bauer J, Tichý L, Sztankóová Z, Vostrý L, Steyn Y. Genome-Wide Association Study for Body Conformation Traits and Fitness in Czech Holsteins. Animals (Basel) 2022; 12:ani12243522. [PMID: 36552441 PMCID: PMC10375906 DOI: 10.3390/ani12243522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 12/07/2022] [Accepted: 12/12/2022] [Indexed: 12/15/2022] Open
Abstract
The aim of this study was a genome-wide association study (GWAS) on conformation traits using 25,486 genotyped Czech Holsteins, with 35,227 common SNPs for each genotype. Linear trait records were collected between 1995 and 2020. The Interbull information from Multiple Across Country Evaluation (MACE) was included for bulls that mostly had daughter records in a foreign country. When using the Bonferroni correction, the number of SNPs that were either significant or approached the significance threshold was low-dairy capacity composite on BTA4, feet and legs composite BTA21, total score BTA10, stature BTA24, body depth BTA6, angularity BTA20, fore udder attachment BTA10. Without the Bonferroni correction, the total number of significant or near of significance SNPs was 32. The SNPs were localized on BTA1,2,4,5,6,7,8,18,22,25,26,28 for dairy capacity composite, BTA15,21 for feet and legs composite, BTA10 for total score, BTA24 stature, BTA6,23 body depth, BTA20 angularity, BTA2 rump angle, BTA9,10 rear legs rear view, BTA2,19 rear legs side view, BTA10 fore udder attachment, BTA2 udder depth, BTA10 rear udder height, BTA12 central alignment, BTA24 rear teat placement, BTA8,29 rear udder width. The results provide biological information for the improvement of body conformation and fitness in the Holstein population.
Collapse
Affiliation(s)
- Jindřich Čítek
- Department of Genetics and Agricultural Biotechnology, Faculty of Agriculture, University of South Bohemia in České Budějovice, Studentská 1668, 370 05 České Budějovice, Czech Republic
- Veterinary Research Institute, Hudcova 296, 621 00 Brno, Czech Republic
| | - Michaela Brzáková
- Institute of Animal Science, Přátelství 815, 104 00 Praha, Czech Republic
| | - Jiří Bauer
- Czech Moravian Breeders' Corporation, Benešovská 123, 252 09 Hradištko, Czech Republic
| | - Ladislav Tichý
- Institute of Animal Science, Přátelství 815, 104 00 Praha, Czech Republic
- Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, 165 00 Praha, Czech Republic
| | - Zuzana Sztankóová
- Institute of Animal Science, Přátelství 815, 104 00 Praha, Czech Republic
| | - Luboš Vostrý
- Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, 165 00 Praha, Czech Republic
| | - Yvette Steyn
- Department of Animal and Dairy Science, University of Georgia, 425 River Road, Athens, GA 30602, USA
| |
Collapse
|
12
|
Brzáková M, Bauer J, Steyn Y, Šplíchal J, Fulínová D. The prediction accuracies of linear-type traits in Czech Holstein cattle when using ssGBLUP or wssGBLUP. J Anim Sci 2022; 100:skac369. [PMID: 36334266 PMCID: PMC9746800 DOI: 10.1093/jas/skac369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 11/04/2022] [Indexed: 11/07/2022] Open
Abstract
The aim of this study was to assess the contribution of the weighted single-step genomic best linear unbiased prediction (wssGBLUP) method compared to the single-step genomic best linear unbiased prediction (ssGBLUP) method for genomic evaluation of 25 linear-type traits in the Czech Holstein cattle population. The nationwide database of linear-type traits with 6,99,681 records combined with deregressed proofs from Interbull (MACE method) was used as the input data. Genomic breeding values (GEBVs) were predicted based on these phenotypes using ssGBLUP and wssGBLUP methods using the BLUPF90 software. The bull validation test was employed which was based on comparing GEBVs of young bulls (N = 334) with no progeny in 2016. A minimum of 50 daughters with their own performance in 2020 was chosen to verify the contribution to the GEBV prediction, GEBV reliability, validation reliabilities (R2), and regression coefficients (b1). The results showed that the differences between the two methods were negligible. The low benefit of wssGBLUP may be due to the inclusion of a small number of SNPs; therefore, most predictions rely on polygenic relationships between animals. Nevertheless, the benefits of wssGBLUP analysis should be assessed with respect to specific population structures and given traits.
Collapse
Affiliation(s)
- Michaela Brzáková
- Department of Genetics and Breeding of Farm Animals, Institute of Animal Science, Prague-Uhříněves 104 00, Czech Republic
| | - Jiří Bauer
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| | - Yvette Steyn
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Jiří Šplíchal
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| | - Daniela Fulínová
- Czech-Moravian Breeders’ Corporation, Hradištko 252 09, Czech Republic
| |
Collapse
|
13
|
Luo H, Hu L, Brito LF, Dou J, Sammad A, Chang Y, Ma L, Guo G, Liu L, Zhai L, Xu Q, Wang Y. Weighted single-step GWAS and RNA sequencing reveals key candidate genes associated with physiological indicators of heat stress in Holstein cattle. J Anim Sci Biotechnol 2022; 13:108. [PMID: 35986427 PMCID: PMC9392250 DOI: 10.1186/s40104-022-00748-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 06/24/2022] [Indexed: 12/15/2022] Open
Abstract
Background The study of molecular processes regulating heat stress response in dairy cattle is paramount for developing mitigation strategies to improve heat tolerance and animal welfare. Therefore, we aimed to identify quantitative trait loci (QTL) regions associated with three physiological indicators of heat stress response in Holstein cattle, including rectal temperature (RT), respiration rate score (RS), and drooling score (DS). We estimated genetic parameters for all three traits. Subsequently, a weighted single-step genome-wide association study (WssGWAS) was performed based on 3200 genotypes, 151,486 phenotypic records, and 38,101 animals in the pedigree file. The candidate genes located within the identified QTL regions were further investigated through RNA sequencing (RNA-seq) analyses of blood samples for four cows collected in April (non-heat stress group) and four cows collected in July (heat stress group). Results The heritability estimates for RT, RS, and DS were 0.06, 0.04, and 0.03, respectively. Fourteen, 19, and 20 genomic regions explained 2.94%, 3.74%, and 4.01% of the total additive genetic variance of RT, RS, and DS, respectively. Most of these genomic regions are located in the Bos taurus autosome (BTA) BTA3, BTA6, BTA8, BTA12, BTA14, BTA21, and BTA24. No genomic regions overlapped between the three indicators of heat stress, indicating the polygenic nature of heat tolerance and the complementary mechanisms involved in heat stress response. For the RNA-seq analyses, 2627 genes were significantly upregulated and 369 downregulated in the heat stress group in comparison to the control group. When integrating the WssGWAS, RNA-seq results, and existing literature, the key candidate genes associated with physiological indicators of heat stress in Holstein cattle are: PMAIP1, SBK1, TMEM33, GATB, CHORDC1, RTN4IP1, and BTBD7. Conclusions Physiological indicators of heat stress are heritable and can be improved through direct selection. Fifty-three QTL regions associated with heat stress indicators confirm the polygenic nature and complex genetic determinism of heat tolerance in dairy cattle. The identified candidate genes will contribute for optimizing genomic evaluation models by assigning higher weights to genetic markers located in these regions as well as to the design of SNP panels containing polymorphisms located within these candidate genes. Graphical Abstract ![]()
Supplementary Information The online version contains supplementary material available at 10.1186/s40104-022-00748-6.
Collapse
|
14
|
Weighted Single-Step Genomic Best Linear Unbiased Prediction Method Application for Assessing Pigs on Meat Productivity and Reproduction Traits. Animals (Basel) 2022; 12:ani12131693. [PMID: 35804591 PMCID: PMC9264777 DOI: 10.3390/ani12131693] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Revised: 06/10/2022] [Accepted: 06/28/2022] [Indexed: 11/16/2022] Open
Abstract
Changes in the accuracy of the genomic estimates obtained by the ssGBLUP and wssGBLUP methods were evaluated using different reference groups. The weighting procedure’s reasonableness of application Pwas considered to improve the accuracy of genomic predictions for meat, fattening and reproduction traits in pigs. Six reference groups were formed to assess the genomic data quantity impact on the accuracy of predicted values (groups of genotyped animals). The datasets included 62,927 records of meat and fattening productivity (fat thickness over 6–7 ribs (BF1, mm)), muscle depth (MD, mm) and precocity up to 100 kg (age, days) and 16,070 observations of reproductive qualities (the number of all born piglets (TNB) and the number of live-born piglets (NBA), according to the results of the first farrowing). The wssGBLUP method has an advantage over ssGBLUP in terms of estimation reliability. When using a small reference group, the difference in the accuracy of ssGBLUP over BLUP AM is from −1.9 to +7.3 percent points, while for wssGBLUP, the change in accuracy varies from +18.2 to +87.3 percent points. Furthermore, the superiority of the wssGBLUP is also maintained for the largest group of genotyped animals: from +4.7 to +15.9 percent points for ssGBLUP and from +21.1 to +90.5 percent points for wssGBLUP. However, for all analyzed traits, the number of markers explaining 5% of genetic variability varied from 71 to 108, and the number of such SNPs varied depending on the size of the reference group (79–88 for BF1, 72–81 for MD, 71–108 for age). The results of the genetic variation distribution have the greatest similarity between groups of about 1000 and about 1500 individuals. Thus, the size of the reference group of more than 1000 individuals gives more stable results for the estimation based on the wssGBLUP method, while using the reference group of 500 individuals can lead to distorted results of GEBV.
Collapse
|
15
|
Botelho ME, Lopes MS, Mathur PK, Knol EF, e Silva FF, Lopes PS, Gimarães SEF, Marques DB, Veroneze R. Weighted genome-wide association study reveals new candidate genes related to boar taint compounds 1. Livest Sci 2022. [DOI: 10.1016/j.livsci.2022.104845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
16
|
Buaban S, Lengnudum K, Boonkum W, Phakdeedindan P. Genome-wide association study on milk production and somatic cell score for Thai dairy cattle using weighted single-step approach with random regression test-day model. J Dairy Sci 2021; 105:468-494. [PMID: 34756438 DOI: 10.3168/jds.2020-19826] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Accepted: 08/24/2021] [Indexed: 12/26/2022]
Abstract
Genome-wide association studies are a powerful tool to identify genomic regions and variants associated with phenotypes. However, only limited mutual confirmation from different studies is available. The objectives of this study were to identify genomic regions as well as genes and pathways associated with the first-lactation milk, fat, protein, and total solid yields; fat, protein, and total solid percentage; and somatic cell score (SCS) in a Thai dairy cattle population. Effects of SNPs were estimated by a weighted single-step GWAS, which back-solved the genomic breeding values predicted using single-step genomic BLUP (ssGBLUP) fitting a single-trait random regression test-day model. Genomic regions that explained at least 0.5% of the total genetic variance were selected for further analyses of candidate genes. Despite the small number of genotyped animals, genomic predictions led to an improvement in the accuracy over the traditional BLUP. Genomic predictions using weighted ssGBLUP were slightly better than the ssGBLUP. The genomic regions associated with milk production traits contained 210 candidate genes on 19 chromosomes [Bos taurus autosome (BTA) 1 to 7, 9, 11 to 16, 20 to 21, 26 to 27 and 29], whereas 21 candidate genes on 3 chromosomes (BTA 11, 16, and 21) were associated with SCS. Many genomic regions explained a small fraction of the genetic variance, indicating polygenic inheritance of the studied traits. Several candidate genes coincided with previous reports for milk production traits in Holstein cattle, especially a large region of genes on BTA14. We identified 141 and 5 novel genes related to milk production and SCS, respectively. These novel genes were also found to be functionally related to heat tolerance (e.g., SLC45A2, IRAG1, and LOC101902172), longevity (e.g., SYT10 and LOC101903327), and fertility (e.g., PAG1). These findings may be attributed to indirect selection in our population. Identified biological networks including intracellular cell transportation and protein catabolism implicate milk production, whereas the immunological pathways such as lymphocyte activation are closely related to SCS. Further studies are required to validate our findings before exploiting them in genomic selection.
Collapse
Affiliation(s)
- S Buaban
- Bureau of Animal Husbandry and Genetic Improvement, Department of Livestock Development, Pathum Thani 12000, Thailand
| | - K Lengnudum
- Bureau of Biotechnology in Livestock Production, Department of Livestock Development, Pathum Thani 12000, Thailand
| | - W Boonkum
- Department of Animal Science, Faculty of Agriculture, Khon Kaen University, Khon Kaen 40002, Thailand
| | - P Phakdeedindan
- Department of Animal Husbandry, Faculty of Veterinary Science, Chulalongkorn University, Bangkok 10330, Thailand; Genomics and Precision Dentistry Research Unit, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok 10330, Thailand.
| |
Collapse
|
17
|
Genomic Prediction in Local Breeds: The Rendena Cattle as a Case Study. Animals (Basel) 2021; 11:ani11061815. [PMID: 34207091 PMCID: PMC8234894 DOI: 10.3390/ani11061815] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Revised: 06/16/2021] [Accepted: 06/16/2021] [Indexed: 01/26/2023] Open
Abstract
Simple Summary Although genomic selection is being used in many livestock species, it has not yet been considered in local breeds due to the lower population size and the potential less effective impact on the genetic evaluation of these breeds. The current research aims to investigate how genomic data can impact the accuracy of genetic predictions for beef traits in Rendena, a small local cattle breed of the North-East of Italy selected for a dual purpose. Classical animal models using only phenotypic information were compared with two models that integrated genomic data with pedigree information. The genomic models presented better accuracy in estimated breeding values of the animals than the ‘classical’ animal model, especially the ‘simpler’ one assuming homogeneous variances of single nucleotide polymorphisms. Our results show that the inclusion of genomic information can be successfully applied to breeding selection scenarios even in small local cattle breeds such as Rendena. Abstract The maintenance of local cattle breeds is key to selecting for efficient food production, landscape protection, and conservation of biodiversity and local cultural heritage. Rendena is an indigenous cattle breed from the alpine North-East of Italy, selected for dual purpose, but with lesser emphasis given to beef traits. In this situation, increasing accuracy for beef traits could prevent detrimental effects due to the antagonism with milk production. Our study assessed the impact of genomic information on estimated breeding values (EBVs) in Rendena performance-tested bulls. Traits considered were average daily gain, in vivo EUROP score, and in vivo estimate of dressing percentage. The final dataset contained 1691 individuals with phenotypes and 8372 animals in pedigree, 1743 of which were genotyped. Using the cross-validation method, three models were compared: (i) Pedigree-BLUP (PBLUP); (ii) single-step GBLUP (ssGBLUP), and (iii) weighted single-step GBLUP (WssGBLUP). Models including genomic information presented higher accuracy, especially WssGBLUP. However, the model with the best overall properties was the ssGBLUP, showing higher accuracy than PBLUP and optimal values of bias and dispersion parameters. Our study demonstrated that integrating phenotypes for beef traits with genomic data can be helpful to estimate EBVs, even in a small local breed.
Collapse
|
18
|
Cesarani A, Biffani S, Garcia A, Lourenco D, Bertolini G, Neglia G, Misztal I, Macciotta NPP. Genomic investigation of milk production in Italian buffalo. ITALIAN JOURNAL OF ANIMAL SCIENCE 2021. [DOI: 10.1080/1828051x.2021.1902404] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Alberto Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Stefano Biffani
- Consiglio Nazionale delle Ricerche (CNR), Istituto di biologia e biotecnologia agraria (IBBA), Milano, Italy
| | - Andre Garcia
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Giacomo Bertolini
- Associazione Nazionale Allevatori Specie Bufalina (ANASB), Caserta, Italy
| | - Gianluca Neglia
- Dipartimento di Medicina Veterinaria e Produzioni Animali, II University of Naples, Napoli, Italy
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | | |
Collapse
|
19
|
Lázaro SF, Tonhati H, Oliveira HR, Silva AA, Nascimento AV, Santos DJA, Stefani G, Brito LF. Genomic studies of milk-related traits in water buffalo (Bubalus bubalis) based on single-step genomic best linear unbiased prediction and random regression models. J Dairy Sci 2021; 104:5768-5793. [PMID: 33685677 DOI: 10.3168/jds.2020-19534] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 01/02/2021] [Indexed: 01/14/2023]
Abstract
Genomic selection has been widely implemented in many livestock breeding programs, but it remains incipient in buffalo. Therefore, this study aimed to (1) estimate variance components incorporating genomic information in Murrah buffalo; (2) evaluate the performance of genomic prediction for milk-related traits using single- and multitrait random regression models (RRM) and the single-step genomic best linear unbiased prediction approach; and (3) estimate longitudinal SNP effects and candidate genes potentially associated with time-dependent variation in milk, fat, and protein yields, as well as somatic cell score (SCS) in multiple parities. The data used to estimate the genetic parameters consisted of a total of 323,140 test-day records. The average daily heritability estimates were moderate (0.35 ± 0.02 for milk yield, 0.22 ± 0.03 for fat yield, 0.42 ± 0.03 for protein yield, and 0.16 ± 0.03 for SCS). The highest heritability estimates, considering all traits studied, were observed between 20 and 280 d in milk (DIM). The genetic correlation estimates at different DIM among the evaluated traits ranged from -0.10 (156 to 185 DIM for SCS) to 0.61 (36 to 65 DIM for fat yield). In general, direct selection for any of the traits evaluated is expected to result in indirect genetic gains for milk yield, fat yield, and protein yield but also increase SCS at certain lactation stages, which is undesirable. The predicted RRM coefficients were used to derive the genomic estimated breeding values (GEBV) for each time point (from 5 to 305 DIM). In general, the tuning parameters evaluated when constructing the hybrid genomic relationship matrices had a small effect on the GEBV accuracy and a greater effect on the bias estimates. The SNP solutions were back-solved from the GEBV predicted from the Legendre random regression coefficients, which were then used to estimate the longitudinal SNP effects (from 5 to 305 DIM). The daily SNP effect for 3 different lactation stages were performed considering 3 different lactation stages for each trait and parity: from 5 to 70, from 71 to 150, and from 151 to 305 DIM. Important genomic regions related to the analyzed traits and parities that explain more than 0.50% of the total additive genetic variance were selected for further analyses of candidate genes. In general, similar potential candidate genes were found between traits, but our results suggest evidence of differential sets of candidate genes underlying the phenotypic expression of the traits across parities. These results contribute to a better understanding of the genetic architecture of milk production traits in dairy buffalo and reinforce the relevance of incorporating genomic information to genetically evaluate longitudinal traits in dairy buffalo. Furthermore, the candidate genes identified can be used as target genes in future functional genomics studies.
Collapse
Affiliation(s)
- Sirlene F Lázaro
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907; Department of Animal Science, College of Agricultural and Veterinary Sciences, São Paulo State University (UNESP), Jaboticabal, 14884-900, SP, Brazil
| | - Humberto Tonhati
- Department of Animal Science, College of Agricultural and Veterinary Sciences, São Paulo State University (UNESP), Jaboticabal, 14884-900, SP, Brazil
| | - Hinayah R Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907; Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, N1G 2W1, ON, Canada
| | - Alessandra A Silva
- Department of Animal Science, College of Agricultural and Veterinary Sciences, São Paulo State University (UNESP), Jaboticabal, 14884-900, SP, Brazil
| | - André V Nascimento
- Department of Animal Science, College of Agricultural and Veterinary Sciences, São Paulo State University (UNESP), Jaboticabal, 14884-900, SP, Brazil
| | - Daniel J A Santos
- Department of Animal and Avian Science, University of Maryland, College Park 20742
| | - Gabriela Stefani
- Department of Animal Science, College of Agricultural and Veterinary Sciences, São Paulo State University (UNESP), Jaboticabal, 14884-900, SP, Brazil
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907.
| |
Collapse
|
20
|
Cesarani A, Garcia A, Hidalgo J, Degano L, Vicario D, Macciotta NPP, Lourenco D. Genomic information allows for more accurate breeding values for milkability in dual-purpose Italian Simmental cattle. J Dairy Sci 2021; 104:5719-5727. [PMID: 33612221 DOI: 10.3168/jds.2020-19838] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 12/14/2020] [Indexed: 02/01/2023]
Abstract
Milkability is a trait related to the milking efficiency of an animal, and it is a component of the herd profitability. Due to its economic importance, milkability is currently included in the selection index of the Italian Simmental cattle breed with a weight of 7.5%. This lowly heritable trait is measured on a subjective scale from 1 to 3 (1 = slow, 3 = fast), and genetic evaluations are performed by pedigree-based BLUP. Genomic information is now available for some animals in the Italian Simmental population, and its inclusion in the genetic evaluation system could increase accuracy of breeding values and genetic progress for milkability. The aim of this study was to test the feasibility and advantages of having a genomic evaluation for this trait in the Italian Simmental population. Phenotypes were available for 131,308 cows. A total of 9,526 animals had genotypes for 42,152 loci; among the genotyped animals, 2,455 were cows with phenotypes, and the other were their relatives. The youngest cows with both phenotypes and genotypes (n = 900) were identified as selection candidates. Variance components and heritability were estimated using pedigree information, whereas genetic and genomic evaluations were carried out using BLUP and single-step genomic BLUP (ssGBLUP), respectively. In addition, a weighted ssGBLUP was assessed using genomic regions from a genome-wide association study. Evaluation models were validated using theoretical and realized accuracies. The estimated heritability for milkability was 0.12 ± 0.01. The mean theoretical accuracies for selection candidates were 0.43 ± 0.08 (BLUP) and 0.53 ± 0.06 (ssGBLUP). The mean realized accuracies based on linear regression statistics were 0.29 (BLUP) and 0.40 (ssGBLUP). No genomic regions were significantly associated with milkability, thus no improvements in accuracy were observed when using weighted ssGBLUP. Results indicated that genomic information could improve the accuracy of breeding values and increase genetic progress for milkability in Italian Simmental.
Collapse
Affiliation(s)
- A Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602.
| | - A Garcia
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602
| | - J Hidalgo
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602
| | - L Degano
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), 33100 Udine, Italy
| | - D Vicario
- Associazione Nazionale Allevatori Pezzata Rossa Italiana (ANAPRI), 33100 Udine, Italy
| | - N P P Macciotta
- Department of Agricultural Sciences, University of Sassari, 07100 Sassari, Italy
| | - D Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, 30602
| |
Collapse
|
21
|
Mehrban H, Naserkheil M, Lee DH, Cho C, Choi T, Park M, Ibáñez-Escriche N. Genomic Prediction Using Alternative Strategies of Weighted Single-Step Genomic BLUP for Yearling Weight and Carcass Traits in Hanwoo Beef Cattle. Genes (Basel) 2021; 12:genes12020266. [PMID: 33673102 PMCID: PMC7917987 DOI: 10.3390/genes12020266] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 02/08/2021] [Accepted: 02/11/2021] [Indexed: 01/20/2023] Open
Abstract
The weighted single-step genomic best linear unbiased prediction (GBLUP) method has been proposed to exploit information from genotyped and non-genotyped relatives, allowing the use of weights for single-nucleotide polymorphism in the construction of the genomic relationship matrix. The purpose of this study was to investigate the accuracy of genetic prediction using the following single-trait best linear unbiased prediction methods in Hanwoo beef cattle: pedigree-based (PBLUP), un-weighted (ssGBLUP), and weighted (WssGBLUP) single-step genomic methods. We also assessed the impact of alternative single and window weighting methods according to their effects on the traits of interest. The data was comprised of 15,796 phenotypic records for yearling weight (YW) and 5622 records for carcass traits (backfat thickness: BFT, carcass weight: CW, eye muscle area: EMA, and marbling score: MS). Also, the genotypic data included 6616 animals for YW and 5134 for carcass traits on the 43,950 single-nucleotide polymorphisms. The ssGBLUP showed significant improvement in genomic prediction accuracy for carcass traits (71%) and yearling weight (99%) compared to the pedigree-based method. The window weighting procedures performed better than single SNP weighting for CW (11%), EMA (11%), MS (3%), and YW (6%), whereas no gain in accuracy was observed for BFT. Besides, the improvement in accuracy between window WssGBLUP and the un-weighted method was low for BFT and MS, while for CW, EMA, and YW resulted in a gain of 22%, 15%, and 20%, respectively, which indicates the presence of relevant quantitative trait loci for these traits. These findings indicate that WssGBLUP is an appropriate method for traits with a large quantitative trait loci effect.
Collapse
Affiliation(s)
- Hossein Mehrban
- Department of Animal Science, Shahrekord University, Shahrekord 88186-34141, Iran;
| | - Masoumeh Naserkheil
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj 77871-31587, Iran;
- Department of Animal Life and Environment Sciences, Hankyong National University, Jungang-ro 327, Anseong-si 17579, Gyeonggi-do, Korea
| | - Deuk Hwan Lee
- Department of Animal Life and Environment Sciences, Hankyong National University, Jungang-ro 327, Anseong-si 17579, Gyeonggi-do, Korea
- Correspondence: ; Tel.: +82-316-705-091
| | - Chungil Cho
- Hanwoo Genetic Improvement Center, NongHyup Agribusiness Group Inc., Seosan 31948, Korea;
| | - Taejeong Choi
- Animal Breeding and Genetics Division, National Institute of Animal Science, Rural Development Administration, Cheonan 31000, Korea; (T.C.); (M.P.)
| | - Mina Park
- Animal Breeding and Genetics Division, National Institute of Animal Science, Rural Development Administration, Cheonan 31000, Korea; (T.C.); (M.P.)
| | - Noelia Ibáñez-Escriche
- Institute for Animal Science and Technology, Universitat Politècnica de València, 46022 València, Spain;
| |
Collapse
|
22
|
Botelho ME, Lopes MS, Mathur PK, Knol EF, Guimarães SEF, Marques DBD, Lopes PS, Silva FF, Veroneze R. Applying an association weight matrix in weighted genomic prediction of boar taint compounds. J Anim Breed Genet 2020; 138:442-453. [PMID: 33285013 DOI: 10.1111/jbg.12528] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 10/13/2020] [Accepted: 11/14/2020] [Indexed: 12/14/2022]
Abstract
Biological information regarding markers and gene association may be used to attribute different weights for single nucleotide polymorphism (SNP) in genome-wide selection. Therefore, we aimed to evaluate the predictive ability and the bias of genomic prediction using models that allow SNP weighting in the genomic relationship matrix (G) building, with and without incorporating biological information to obtain the weights. Firstly, we performed a genome-wide association studies (GWAS) in data set containing single- (SL) or a multi-line (ML) pig population for androstenone, skatole and indole levels. Secondly, 1%, 2%, 5%, 10%, 30% and 50% of the markers explaining the highest proportions of the genetic variance for each trait were selected to build gene networks through the association weight matrix (AWM) approach. The number of edges in the network was computed and used to derive weights for G (AWM-WssGBLUP). The single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) were used as standard scenarios. All scenarios presented predictive abilities different from zero; however, the great overlap in their confidences interval suggests no differences among scenarios. Most of scenarios of based on AWM provide overestimations for skatole in both SL and ML populations. On the other hand, the skatole and indole prediction were no biased in the ssGBLUP (S1) in both SL and ML populations. Most of scenarios based on AWM provide no biased predictions for indole in both SL and ML populations. In summary, using biological information through AWM matrix and gene networks to derive weights for genomic prediction resulted in no increase in predictive ability for boar taint compounds. In addition, this approach increased the number of analyses steps. Thus, we can conclude that ssGBLUP is most appropriate for the analysis of boar taint compounds in comparison with the weighted strategies used in the present work.
Collapse
Affiliation(s)
- Margareth E Botelho
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Marcos S Lopes
- Topigs Norsvin, Curitiba, Brazil.,Topigs Norsvin Research Center, Beuningen, the Netherlands
| | | | - Egbert F Knol
- Topigs Norsvin Research Center, Beuningen, the Netherlands
| | | | - Daniele B D Marques
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Paulo S Lopes
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Fabyano F Silva
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| | - Renata Veroneze
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa, Brazil
| |
Collapse
|
23
|
Paiva JT, Peixoto MGCD, Bruneli FAT, Alvarenga AB, Oliveira HR, Silva AA, Silva DA, Veroneze R, Silva FF, Lopes PS. Genetic parameters, genome-wide association and gene networks for milk and reproductive traits in Guzerá cattle. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
24
|
Baller JL, Kachman SD, Kuehn LA, Spangler ML. Genomic prediction using pooled data in a single-step genomic best linear unbiased prediction framework. J Anim Sci 2020; 98:5851497. [PMID: 32497209 DOI: 10.1093/jas/skaa184] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 06/01/2020] [Indexed: 01/16/2023] Open
Abstract
Economically relevant traits are routinely collected within the commercial segments of the beef industry but are rarely included in genetic evaluations because of unknown pedigrees. Individual relationships could be resurrected with genomics, but this would be costly; therefore, pooling DNA and phenotypic data provide a cost-effective solution. Pedigree, phenotypic, and genomic data were simulated for a beef cattle population consisting of 15 generations. Genotypes mimicked a 50k marker panel (841 quantitative trait loci were located across the genome, approximately once per 3 Mb) and the phenotype was moderately heritable. Individuals from generation 15 were included in pools (observed genotype and phenotype were mean values of a group). Estimated breeding values (EBV) were generated from a single-step genomic best linear unbiased prediction model. The effects of pooling strategy (random and minimizing or uniformly maximizing phenotypic variation within pools), pool size (1, 2, 10, 20, 50, 100, or no data from generation 15), and generational gaps of genotyping on EBV accuracy (correlation of EBV with true breeding values) were quantified. Greatest EBV accuracies of sires and dams were observed when there was no gap between genotyped parents and pooled offspring. The EBV accuracies resulting from pools were usually greater than no data from generation 15 regardless of sire or dam genotyping. Minimizing phenotypic variation increased EBV accuracy by 8% and 9% over random pooling and uniformly maximizing phenotypic variation, respectively. A pool size of 2 was the only scenario that did not significantly decrease EBV accuracy compared with individual data when pools were formed randomly or by uniformly maximizing phenotypic variation (P > 0.05). Pool sizes of 2, 10, 20, or 50 did not generally lead to statistical differences in EBV accuracy than individual data when pools were constructed to minimize phenotypic variation (P > 0.05). Largest numerical increases in EBV accuracy resulting from pooling compared with no data from generation 15 were seen with sires with prior low EBV accuracy (those born in generation 14). Pooling of any size led to larger EBV accuracies of the pools than individual data when minimizing phenotypic variation. Resulting EBV for the pools could be used to inform management decisions of those pools. Pooled genotyping to garner commercial-level phenotypes for genetic evaluations seems plausible although differences exist depending on pool size and pool formation strategy.
Collapse
Affiliation(s)
- Johnna L Baller
- Department of Animal Science, University of Nebraska, Lincoln, NE
| | | | - Larry A Kuehn
- USDA, ARS, U.S. Meat Animal Research Center, Clay Center, NE
| | | |
Collapse
|
25
|
Garcia BF, Melo TPD, Neves HHDR, Carvalheiro R. Comparison of GWA statistical methods for traits under different genetic structures: A simulation study. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104213] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
26
|
Takeda M, Uemoto Y, Satoh M. Effect of genotyped bulls with different numbers of phenotyped progenies on quantitative trait loci detection and genomic evaluation in a simulated cattle population. Anim Sci J 2020; 91:e13432. [PMID: 32779330 PMCID: PMC7507195 DOI: 10.1111/asj.13432] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/18/2020] [Accepted: 07/01/2020] [Indexed: 02/02/2023]
Abstract
The objective of this study was to assess the effect of genotyped bulls with different numbers of phenotyped progenies on quantitative trait loci (QTL) detection and genomic evaluation using a simulated cattle population. Twelve generations (G1–G12) were simulated from the base generation (G0). The recent population had different effective population sizes, heritability, and number of QTL. G0–G4 were used for pedigree information. A total of 300 genotyped bulls from G5–G10 were randomly selected. Their progenies were generated in G6–G11 with different numbers of progeny per bull. Scenarios were considered according to the number of progenies and whether the genotypes were possessed by the bulls or the progenies. A genome‐wide association study and genomic evaluation were performed with a single‐step genomic best linear unbiased prediction method to calculate the power of QTL detection and the genomic estimated breeding value (GEBV). We found that genotyped bulls could be available for QTL detection depending on conditions. Additionally, using a reference population, including genotyped bulls, which had more progeny phenotypes, enabled a more accurate prediction of GEBV. However, it is desirable to have more than 4,500 individuals consisting of both genotypes and phenotypes for practical genomic evaluation.
Collapse
Affiliation(s)
- Masayuki Takeda
- National Livestock Breeding Center, Nishigo, Japan.,Graduate School of Agricultural Science, Tohoku University, Sendai, Japan
| | - Yoshinobu Uemoto
- Graduate School of Agricultural Science, Tohoku University, Sendai, Japan
| | - Masahiro Satoh
- Graduate School of Agricultural Science, Tohoku University, Sendai, Japan
| |
Collapse
|
27
|
Liu A, Lund MS, Boichard D, Karaman E, Guldbrandtsen B, Fritz S, Aamand GP, Nielsen US, Sahana G, Wang Y, Su G. Weighted single-step genomic best linear unbiased prediction integrating variants selected from sequencing data by association and bioinformatics analyses. Genet Sel Evol 2020; 52:48. [PMID: 32799816 PMCID: PMC7429790 DOI: 10.1186/s12711-020-00568-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 08/07/2020] [Indexed: 11/30/2022] Open
Abstract
Background Sequencing data enable the detection of causal loci or single nucleotide polymorphisms (SNPs) highly linked to causal loci to improve genomic prediction. However, until now, studies on integrating such SNPs using a single-step genomic best linear unbiased prediction (ssGBLUP) model are scarce. We investigated the integration of sequencing SNPs selected by association (1262 SNPs) and bioinformatics (2359 SNPs) analyses into the currently used 54K-SNP chip, using three ssGBLUP models which make different assumptions on the distribution of SNP effects: a basic ssGBLUP model, a so-called featured ssGBLUP (ssFGBLUP) model that considered selected sequencing SNPs as a feature genetic component, and a weighted ssGBLUP (ssWGBLUP) model in which the genomic relationship matrix was weighted by the SNP variances estimated from a Bayesian whole-genome regression model, with every 1, 30, or 100 adjacent SNPs within a chromosome region sharing the same variance. We used data on milk production and female fertility in Danish Jersey. In total, 15,823 genotyped and 528,981 non-genotyped females born between 1990 and 2013 were used as reference population and 7415 genotyped females and 33,040 non-genotyped females born between 2014 and 2016 were used as validation population. Results With basic ssGBLUP, integrating SNPs selected from sequencing data improved prediction reliabilities for milk and protein yields, but resulted in limited or no improvement for fat yield and female fertility. Model performances depended on the SNP set used. When using ssWGBLUP with the 54K SNPs, reliabilities for milk and protein yields improved by 0.028 for genotyped animals and by 0.006 for non-genotyped animals compared with ssGBLUP. However, with the SNP set that included SNPs selected from sequencing data, no statistically significant difference in prediction reliability was observed between the three ssGBLUP models. Conclusions In summary, when using 54K SNPs, a ssWGBLUP model with a common weight on the SNPs in a given region is a feasible approach for single-trait genetic evaluation. Integrating relevant SNPs selected from sequencing data into the standard SNP chip can improve the reliability of genomic prediction. Based on such SNP data, a basic ssGBLUP model was suggested since no significant improvement was observed from using alternative models such as ssWGBLUP and ssFGBLUP.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Bernt Guldbrandtsen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | | | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
28
|
Gualdrón Duarte JL, Gori AS, Hubin X, Lourenco D, Charlier C, Misztal I, Druet T. Performances of Adaptive MultiBLUP, Bayesian regressions, and weighted-GBLUP approaches for genomic predictions in Belgian Blue beef cattle. BMC Genomics 2020; 21:545. [PMID: 32762654 PMCID: PMC7430838 DOI: 10.1186/s12864-020-06921-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 07/17/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genomic selection has been successfully implemented in many livestock and crop species. The genomic best linear unbiased predictor (GBLUP) approach, assigning equal variance to all SNP effects, is one of the reference methods. When large-effect variants contribute to complex traits, it has been shown that genomic prediction methods that assign a higher variance to subsets of SNP effects can achieve higher prediction accuracy. We herein compared the efficiency of several such approaches, including the Adaptive MultiBLUP (AM-BLUP) that uses local genomic relationship matrices (GRM) to automatically identify and weight genomic regions with large effects, to predict genetic merit in Belgian Blue beef cattle. RESULTS We used a population of approximately 10,000 genotyped cows and their phenotypes for 14 traits, mostly related to muscular development and body dimensions. According to the trait, we found that 4 to 25% of the genetic variance could be associated with 2 to 12 genomic regions harbouring large-effect variants. Noteworthy, three previously identified recessive deleterious variants presented heterozygote advantage and were among the most significant SNPs for several traits. The AM-BLUP resulted in increased reliability of genomic predictions compared to GBLUP (+ 2%), but Bayesian methods proved more efficient (+ 3%). Overall, the reliability gains remained thus limited although higher gains were observed for skin thickness, a trait affected by two genomic regions having particularly large effects. Higher accuracies than those from the original AM-BLUP were achieved when applying the Bayesian Sparse Linear Mixed Model to pre-select groups of SNPs with large effects and subsequently use their estimated variance to build a weighted GRM. Finally, the single-step GBLUP performed best and could be further improved (+ 3% prediction accuracy) by using these weighted GRM. CONCLUSIONS The AM-BLUP is an attractive method to automatically identify and weight genomic regions with large effects on complex traits. However, the method was less accurate than Bayesian methods. Overall, weighted methods achieved modest accuracy gains compared to GBLUP. Nevertheless, the computational efficiency of the AM-BLUP might be valuable at higher marker density, including with whole-genome sequencing data. Furthermore, weighted GRM are particularly useful to account for large variance loci in the single-step GBLUP.
Collapse
Affiliation(s)
- José Luis Gualdrón Duarte
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium.
| | - Ann-Stephan Gori
- Innovation Department, Elevéo asbl and Inovéo, Awé Group, 5590, Ciney, Belgium
| | - Xavier Hubin
- Innovation Department, Elevéo asbl and Inovéo, Awé Group, 5590, Ciney, Belgium
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, 425 River Rd, Athens, GA, 30602, USA
| | - Carole Charlier
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, 425 River Rd, Athens, GA, 30602, USA
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium
| |
Collapse
|
29
|
Lourenco D, Legarra A, Tsuruta S, Masuda Y, Aguilar I, Misztal I. Single-Step Genomic Evaluations from Theory to Practice: Using SNP Chips and Sequence Data in BLUPF90. Genes (Basel) 2020; 11:E790. [PMID: 32674271 PMCID: PMC7397237 DOI: 10.3390/genes11070790] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/03/2020] [Accepted: 07/06/2020] [Indexed: 11/16/2022] Open
Abstract
Single-step genomic evaluation became a standard procedure in livestock breeding, and the main reason is the ability to combine all pedigree, phenotypes, and genotypes available into one single evaluation, without the need of post-analysis processing. Therefore, the incorporation of data on genotyped and non-genotyped animals in this method is straightforward. Since 2009, two main implementations of single-step were proposed. One is called single-step genomic best linear unbiased prediction (ssGBLUP) and uses single nucleotide polymorphism (SNP) to construct the genomic relationship matrix; the other is the single-step Bayesian regression (ssBR), which is a marker effect model. Under the same assumptions, both models are equivalent. In this review, we focus solely on ssGBLUP. The implementation of ssGBLUP into the BLUPF90 software suite was done in 2009, and since then, several changes were made to make ssGBLUP flexible to any model, number of traits, number of phenotypes, and number of genotyped animals. Single-step GBLUP from the BLUPF90 software suite has been used for genomic evaluations worldwide. In this review, we will show theoretical developments and numerical examples of ssGBLUP using SNP data from regular chips to sequence data.
Collapse
Affiliation(s)
- Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Andres Legarra
- Institut National de la Recherche Agronomique, UMR1388 GenPhySE, 31326 Castanet Tolosan, France;
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Yutaka Masuda
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Ignacio Aguilar
- Instituto Nacional de Investigación Agropecuaria (INIA), 11500 Montevideo, Uruguay;
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| |
Collapse
|
30
|
Freitas P, Oliveira H, Silva F, Fleming A, Miglior F, Schenkel F, Brito L. Genomic analyses for predicted milk fatty acid composition throughout lactation in North American Holstein cattle. J Dairy Sci 2020; 103:6318-6331. [DOI: 10.3168/jds.2019-17628] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 03/12/2020] [Indexed: 12/12/2022]
|
31
|
Alvarenga AB, Veroneze R, Oliveira HR, Marques DBD, Lopes PS, Silva FF, Brito LF. Comparing Alternative Single-Step GBLUP Approaches and Training Population Designs for Genomic Evaluation of Crossbred Animals. Front Genet 2020; 11:263. [PMID: 32328083 PMCID: PMC7162606 DOI: 10.3389/fgene.2020.00263] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 03/05/2020] [Indexed: 02/06/2023] Open
Abstract
As crossbreeding is extensively used in some livestock species, we aimed to evaluate the performance of single-step GBLUP (ssGBLUP) and weighted ssGBLUP (WssGBLUP) methods to predict Genomic Estimated Breeding Values (GEBVs) of crossbred animals. Different training population scenarios were evaluated: (SC1) ssGBLUP based on a single-trait model considering purebred and crossbred animals in a joint training population; (SC2) ssGBLUP based on a multiple-trait model to enable considering phenotypes recorded in purebred and crossbred training animals as different traits; (SC3) WssGBLUP based on a single-trait model considering purebred and crossbred animals jointly in the training population (both populations were used for SNP weights' estimation); (SC4) WssGBLUP based on a single-trait model considering only purebred animals in the training population (crossbred population only used for SNP weights' estimation); (SC5) WssGBLUP based on a single-trait model and the training population characterized by purebred animals (purebred population used for SNP weights' estimation). A complex trait was simulated assuming alternative genetic architectures. Different scaling factors to blend the inverse of the genomic (G -1) and pedigree (A 22 - 1 ) relationship matrices were also tested. The predictive performance of each scenario was evaluated based on the validation accuracy and regression coefficient. The genetic correlations across simulated populations in the different scenarios ranged from moderate to high (0.71-0.99). The scenario mimicking a completely polygenic trait (h Q T L 2 = 0) yielded the lowest validation accuracy (0.12; for SC3 and SC4). The simulated scenarios assuming 4,500 QTLs affecting the trait andh Q T L 2 = h 2 resulted in the greatest GEBV accuracies (0.47; for SC1 and SC2). The regression coefficients ranged from 0.28 (for SC3 assuming polygenic effect) to 1.27 (for SC2 considering 4,500 QTLs). In general, SC3 and SC5 resulted in inflated GEBVs, whereas other scenarios yielded deflated GEBVs. The scaling factors used to combine G -1 andA 22 - 1 had a small influence on the validation accuracies, but a greater effect on the regression coefficients. Due to the complexity of multiple-trait models and WssGBLUP analyses, and a similar predictive performance across the methods evaluated, SC1 is recommended for genomic evaluation in crossbred populations with similar genetic structures [moderate-to-high (0.71-0.99) genetic correlations between purebred and crossbred populations].
Collapse
Affiliation(s)
- Amanda B. Alvarenga
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Renata Veroneze
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Hinayah R. Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
- Department of Animal Biosciences, Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, ON, Canada
| | | | - Paulo S. Lopes
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Fabyano F. Silva
- Department of Animal Science, Federal University of Viçosa, Viçosa, Brazil
| | - Luiz F. Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, United States
| |
Collapse
|
32
|
Misztal I, Lourenco D, Legarra A. Current status of genomic evaluation. J Anim Sci 2020; 98:skaa101. [PMID: 32267923 PMCID: PMC7183352 DOI: 10.1093/jas/skaa101] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/07/2020] [Indexed: 12/14/2022] Open
Abstract
Early application of genomic selection relied on SNP estimation with phenotypes or de-regressed proofs (DRP). Chips of 50k SNP seemed sufficient for an accurate estimation of SNP effects. Genomic estimated breeding values (GEBV) were composed of an index with parent average, direct genomic value, and deduction of a parental index to eliminate double counting. Use of SNP selection or weighting increased accuracy with small data sets but had minimal to no impact with large data sets. Efforts to include potentially causative SNP derived from sequence data or high-density chips showed limited or no gain in accuracy. After the implementation of genomic selection, EBV by BLUP became biased because of genomic preselection and DRP computed based on EBV required adjustments, and the creation of DRP for females is hard and subject to double counting. Genomic selection was greatly simplified by single-step genomic BLUP (ssGBLUP). This method based on combining genomic and pedigree relationships automatically creates an index with all sources of information, can use any combination of male and female genotypes, and accounts for preselection. To avoid biases, especially under strong selection, ssGBLUP requires that pedigree and genomic relationships are compatible. Because the inversion of the genomic relationship matrix (G) becomes costly with more than 100k genotyped animals, large data computations in ssGBLUP were solved by exploiting limited dimensionality of genomic data due to limited effective population size. With such dimensionality ranging from 4k in chickens to about 15k in cattle, the inverse of G can be created directly (e.g., by the algorithm for proven and young) at a linear cost. Due to its simplicity and accuracy, ssGBLUP is routinely used for genomic selection by the major chicken, pig, and beef industries. Single step can be used to derive SNP effects for indirect prediction and for genome-wide association studies, including computations of the P-values. Alternative single-step formulations exist that use SNP effects for genotyped or for all animals. Although genomics is the new standard in breeding and genetics, there are still some problems that need to be solved. This involves new validation procedures that are unaffected by selection, parameter estimation that accounts for all the genomic data used in selection, and strategies to address reduction in genetic variances after genomic selection was implemented.
Collapse
Affiliation(s)
- Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA
| | - Andres Legarra
- Department of Animal Genetics, Institut National de la Recherche Agronomique, Castanet-Tolosan, France
| |
Collapse
|
33
|
Fragomeni BO, Lourenco DAL, Legarra A, VanRaden PM, Misztal I. Alternative SNP weighting for single-step genomic best linear unbiased predictor evaluation of stature in US Holsteins in the presence of selected sequence variants. J Dairy Sci 2019; 102:10012-10019. [PMID: 31495612 DOI: 10.3168/jds.2019-16262] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 07/16/2019] [Indexed: 11/19/2022]
Abstract
Causal variants inferred from sequence data analysis are expected to increase accuracy of genomic selection. In this work we evaluated the gain in reliability of genomic predictions, for stature in US Holsteins, when adding selected sequence variants to a pre-existent SNP chip. Two prediction methods were tested: de-regressed proofs assuming heterogeneous (genomic BLUP; GBLUP) residual variances and by single-step GBLUP (ssGBLUP) using actual phenotypes. Phenotypic data included 3,999,631 records for stature on 3,027,304 Holstein cows. Genotypes on 54,087 SNP markers (54k) were available for 26,877 bulls. Additionally, 16,648 selected sequence variants were combined with the 54k markers, for a total of 70,735 (70k) markers. In all methods, SNP in the genomic relationship matrix (G) were unweighted or weighted iteratively, with weights derived either by SNP effects squared or by a nonlinear method that resembles BayesA (nonlinear A). Reliability of genomic predictions were obtained by cross validation. With unweighted G derived from 54k markers, the reliabilities (× 100) were 72.4 for GBLUP and 75.3 for ssGBLUP. With unweighted G derived from 70k markers, the reliabilities were 73.4 and 76.0, respectively. Weighting by nonlinear A changed reliabilities to 73.3, and 75.9, respectively. Addition of selected sequence variants had a small effect on reliabilities. Weighting by quadratic functions reduced reliabilities. Weighting by nonlinear A increased reliabilities for GBLUP but had only a small effect in ssGBLUP. Reliabilities for direct genomic values extracted from ssGBLUP using unweighted G with 54k were higher than reliabilities by any GBLUP. Thus, ssGBLUP seems to capture more information than GBLUP and there is less room for extra reliability. Improvements in GBLUP may be because the weights in G change the covariance structure, which can explain a proportion of the variance that is accounted for when a heterogeneous residual variance is assumed by considering a different number of daughters per bull.
Collapse
Affiliation(s)
- B O Fragomeni
- Department of Animal Science, University of Connecticut, Storrs-Mansfield 06269.
| | - D A L Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | - A Legarra
- Institut National de la Recherche Agronomique, UMR1388 GenPhySE, Castanet Tolosan, France 31326
| | - P M VanRaden
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705
| | - I Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| |
Collapse
|
34
|
Silva RMO, Evenhuis JP, Vallejo RL, Gao G, Martin KE, Leeds TD, Palti Y, Lourenco DAL. Whole-genome mapping of quantitative trait loci and accuracy of genomic predictions for resistance to columnaris disease in two rainbow trout breeding populations. Genet Sel Evol 2019; 51:42. [PMID: 31387519 PMCID: PMC6683352 DOI: 10.1186/s12711-019-0484-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 07/30/2019] [Indexed: 01/09/2023] Open
Abstract
Background Columnaris disease (CD) is an emerging problem for the rainbow trout aquaculture industry in the US. The objectives of this study were to: (1) identify common genomic regions that explain a large proportion of the additive genetic variance for resistance to CD in two rainbow trout (Oncorhynchus mykiss) populations; and (2) estimate the gains in prediction accuracy when genomic information is used to evaluate the genetic potential of survival to columnaris infection in each population. Methods Two aquaculture populations were investigated: the National Center for Cool and Cold Water Aquaculture (NCCCWA) odd-year line and the Troutlodge, Inc., May odd-year (TLUM) nucleus breeding population. Fish that survived to 21 days post-immersion challenge were recorded as resistant. Single nucleotide polymorphism (SNP) genotypes were available for 1185 and 1137 fish from NCCCWA and TLUM, respectively. SNP effects and variances were estimated using the weighted single-step genomic best linear unbiased prediction (BLUP) for genome-wide association. Genomic regions that explained more than 1% of the additive genetic variance were considered to be associated with resistance to CD. Predictive ability was calculated in a fivefold cross-validation scheme and using a linear regression method. Results Validation on adjusted phenotypes provided a prediction accuracy close to zero, due to the binary nature of the trait. Using breeding values computed from the complete data as benchmark improved prediction accuracy of genomic models by about 40% compared to the pedigree-based BLUP. Fourteen windows located on six chromosomes were associated with resistance to CD in the NCCCWA population, of which two windows on chromosome Omy 17 jointly explained more than 10% of the additive genetic variance. Twenty-six windows located on 13 chromosomes were associated with resistance to CD in the TLUM population. Only four associated genomic regions overlapped with quantitative trait loci (QTL) between both populations. Conclusions Our results suggest that genome-wide selection for resistance to CD in rainbow trout has greater potential than selection for a few target genomic regions that were found to be associated to resistance to CD due to the polygenic architecture of this trait, and because the QTL associated with resistance to CD are not sufficiently informative for selection decisions across populations. Electronic supplementary material The online version of this article (10.1186/s12711-019-0484-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rafael M O Silva
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA.,Department of Animal and Dairy Science, University of Georgia, Athens, 425 River Road, Athens, GA, 30602, USA.,Zoetis, Sao Paulo, Sao Paulo, 04711-130, Brazil
| | - Jason P Evenhuis
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA
| | - Roger L Vallejo
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA
| | - Guangtu Gao
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA
| | - Kyle E Martin
- Troutloged, Inc., P.O. Box 1290, Sumner, WA, 98390, USA
| | - Tim D Leeds
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA
| | - Yniv Palti
- National Center for Cool and Cold Water Aquaculture, Agricultural Research Service, United States Department of Agriculture, 11861 Leetown Road, Leetown, WV, 25430, USA.
| | - Daniela A L Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, 425 River Road, Athens, GA, 30602, USA
| |
Collapse
|
35
|
Oliveira H, Lourenco D, Masuda Y, Misztal I, Tsuruta S, Jamrozik J, Brito L, Silva F, Schenkel F. Application of single-step genomic evaluation using multiple-trait random regression test-day models in dairy cattle. J Dairy Sci 2019; 102:2365-2377. [DOI: 10.3168/jds.2018-15466] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2018] [Accepted: 11/20/2018] [Indexed: 01/30/2023]
|
36
|
Garcia ALS, Bosworth B, Waldbieser G, Misztal I, Tsuruta S, Lourenco DAL. Development of genomic predictions for harvest and carcass weight in channel catfish. Genet Sel Evol 2018; 50:66. [PMID: 30547740 PMCID: PMC6295041 DOI: 10.1186/s12711-018-0435-5] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 11/29/2018] [Indexed: 12/30/2022] Open
Abstract
Background Catfish farming is the largest segment of US aquaculture and research is ongoing to improve production efficiency, including genetic selection programs to improve economically important traits. The objectives of this study were to investigate the use of genomic selection to improve breeding value accuracy and to identify major single nucleotide polymorphisms (SNPs) associated with harvest weight and residual carcass weight in a channel catfish population. Phenotypes were available for harvest weight (n = 27,160) and residual carcass weight (n = 6020), and 36,365 pedigree records were available. After quality control, genotypes for 54,837 SNPs were available for 2911 fish. Estimated breeding values (EBV) were obtained with traditional pedigree-based best linear unbiased prediction (BLUP) and genomic (G)EBV were estimated with single-step genomic BLUP (ssGBLUP). EBV and GEBV prediction accuracies were evaluated using different validation strategies. The ability to predict future performance was calculated as the correlation between EBV or GEBV and adjusted phenotypes. Results Compared to the pedigree BLUP, ssGBLUP increased predictive ability up to 28% and 36% for harvest weight and residual carcass weight, respectively; and GEBV were superior to EBV for all validation strategies tested. Breeding value inflation was assessed as the regression coefficient of adjusted phenotypes on breeding values, and the results indicated that genomic information reduced breeding value inflation. Genome-wide association studies based on windows of 20 adjacent SNPs indicated that both harvest weight and residual carcass weight have a polygenic architecture with no major SNPs (the largest SNPs explained 0.96 and 1.19% of the additive genetic variation for harvest weight and residual carcass weight respectively). Conclusions Genomic evaluation improves the ability to predict future performance relative to traditional BLUP and will allow more accurate identification of genetically superior individuals within catfish families.
Collapse
Affiliation(s)
- Andre L S Garcia
- Animal and Dairy Science Department, University of Georgia, Athens, GA, 30602, USA.
| | - Brian Bosworth
- Warmwater Aquaculture Research Unit (WARU), USDA-ARS, Stoneville, MS, 30776, USA
| | - Geoffrey Waldbieser
- Warmwater Aquaculture Research Unit (WARU), USDA-ARS, Stoneville, MS, 30776, USA
| | - Ignacy Misztal
- Animal and Dairy Science Department, University of Georgia, Athens, GA, 30602, USA
| | - Shogo Tsuruta
- Animal and Dairy Science Department, University of Georgia, Athens, GA, 30602, USA
| | - Daniela A L Lourenco
- Animal and Dairy Science Department, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
37
|
Guarini AR, Lourenco DAL, Brito LF, Sargolzaei M, Baes CF, Miglior F, Misztal I, Schenkel FS. Genetics and genomics of reproductive disorders in Canadian Holstein cattle. J Dairy Sci 2018; 102:1341-1353. [PMID: 30471913 DOI: 10.3168/jds.2018-15038] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 09/29/2018] [Indexed: 01/25/2023]
Abstract
In Canada, reproductive disorders known to affect the profitability of dairy cattle herds have been recorded by producers on a voluntary basis since 2007. Previous studies have shown the feasibility of using producer-recorded health data for genetic evaluations. Despite low heritability estimates and limited availability of phenotypic information, sufficient genetic variation has been observed for those traits to indicate that genetic progress, although slow, can be achieved. Pedigree- and genomic-based analyses were performed on producer-recorded health data of reproductive disorders, including retained placenta (RETP), metritis (METR), and cystic ovaries (CYST) using traditional BLUP and single-step genomic BLUP. Genome-wide association studies and functional analyses were carried out to unravel significant genomic regions and biological pathways, and to better understand the genetic mechanisms underlying RETP, METR, and CYST. Heritability estimates (posterior standard deviation in parentheses) were 0.02 (0.003), 0.01 (0.004), and 0.02 (0.003) for CYST, METR, and RETP, respectively. A moderate to strong genetic correlation of 0.69 (0.102) was found between METR and RETP. Averaged over all traits, sire proof reliabilities increased by approximately 11 percentage points with the incorporation of genomic data using a multiple-trait linear model. Biological pathways and associated genes underlying the studied traits were identified and will contribute to a better understanding of the biology of these 3 health disorders in dairy cattle.
Collapse
Affiliation(s)
- A R Guarini
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1
| | - D A L Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | - L F Brito
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1
| | - M Sargolzaei
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1; The Semex Alliance, Guelph, ON, Canada N1H 6J2
| | - C F Baes
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1
| | - F Miglior
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1; Canadian Dairy Network, Guelph, ON, Canada N1K 1E5
| | - I Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens 30602
| | - F S Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON, Canada N1G 2W1.
| |
Collapse
|