1
|
Comparison of methods for predicting genomic breeding values for growth traits in Nellore cattle. Trop Anim Health Prod 2021; 53:349. [PMID: 34101031 DOI: 10.1007/s11250-021-02785-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 05/23/2021] [Indexed: 10/21/2022]
Abstract
The objective of this study was to evaluate the accuracy of genomic predictions of growth traits in Nellore cattle. Data from 5064 animals belonging to farms that participate in the Conexão DeltaGen and PAINT breeding programs were used. Genotyping was performed with the Illumina BovineHD BeadChip (777,962 SNPs). After quality control of the genomic data, 412,993 SNPs were used. Deregressed EBVs (DEBVs) were calculated using the estimated breeding values (EBVs) and accuracies of birth weight (BW), weight gain from birth to weaning (GBW), postweaning weight gain (PWG), yearling height (YH), and cow weight (CW) provided by GenSys. Three models were used to estimate marker effects: genomic best linear unbiased prediction (GBLUP), BayesCπ, and improved Bayesian least absolute shrinkage and selection operator (IBLASSO). The prediction ability of genomic estimated breeding value (GEBVs) was estimated by the average Pearson correlation between DEBVs and GEBVs, predicted with the different methodologies in the validation populations. The regression coefficients of DEBVs on GEBVs in the validation population were calculated and used as indicators of prediction bias of GEBV. In general, the Bayesian methods provided slightly more accurate predictions of genomic breeding values than GBLUP. The BayesCπ and IBLASSO were similar for all traits (BW, GBW, PWG, and YH), except for CW. Thus, there does not seem to be a more suitable method for the estimation of SNP effects and genomic breeding values. Bayesian regression models are of interest for future applications of genomic selection in this population, but further improvements are needed to reduce deflation of their predictions.
Collapse
|
2
|
Estimation of Variance Components and Genomic Prediction for Individual Birth Weight Using Three Different Genome-Wide SNP Platforms in Yorkshire Pigs. Animals (Basel) 2020; 10:ani10122219. [PMID: 33256056 PMCID: PMC7761447 DOI: 10.3390/ani10122219] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 11/20/2020] [Accepted: 11/23/2020] [Indexed: 11/17/2022] Open
Abstract
Simple Summary The individual birth weight (IBW) of pigs is an important trait regarding its relevance to mortality at weaning, sow prolificacy, and growth performance. This study investigates the variance component estimation, informative window regions, and the efficiency of genomic predictions associated with IBW traits in Yorkshire pigs. The low heritability (0.13) is estimated on the basis of a full model including individual genetic, sow genetic, and common environmental effects. Two common window regions of the genome are identified under three different genotyping platforms found within the ARAP2 and TSN genes concerning the IBW trait. The genomic prediction ability is improved using deregressed estimated breeding values including parental information as a response variable despite Bayesian methods and genotyping platforms for the IBW trait in Korean Yorkshire pigs. Abstract This study estimates the individual birth weight (IBW) trait heritability and investigates the genomic prediction efficiency using three types of high-density single nucleotide polymorphism (SNP) genotyping panels in Korean Yorkshire pigs. We use 38,864 IBW phenotypic records to identify a suitable model for statistical genetics, where 698 genotypes match our phenotypic records. During our genomic analysis, the deregressed estimated breeding values (DEBVs) and their reliabilities are used as derived response variables from the estimated breeding values (EBVs). Bayesian methods identify the informative regions and perform the genomic prediction using the IBW trait, in which two common significant window regions (SSC8 27 Mb and SSC15 29 Mb) are identified using the three genotyping platforms. Higher prediction ability is observed using the DEBV-including parent average as a response variable, regardless of the SNP genotyping panels and the Bayesian methods, relative to the DEBV-excluding parent average. Hence, we suggest that fine-mapping studies targeting the identified informative regions in this study are necessary to find the causal mutations to improve the IBW trait’s prediction ability. Furthermore, studying the IBW trait using a genomic prediction model with a larger genomic dataset may improve the genomic prediction accuracy in Korean Yorkshire pigs.
Collapse
|
3
|
Genomic Analysis Using Bayesian Methods under Different Genotyping Platforms in Korean Duroc Pigs. Animals (Basel) 2020; 10:ani10050752. [PMID: 32344859 PMCID: PMC7277155 DOI: 10.3390/ani10050752] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 04/16/2020] [Accepted: 04/22/2020] [Indexed: 12/03/2022] Open
Abstract
Simple Summary This study investigated the informative regions and the efficiency of genomic predictions for backfat thickness, days to 90 kg body weight, loin muscle area, and lean percentage in Korean Duroc pigs. The several regions of the genome were identified and a significant marker was found near the MC4R gene for growth and production-related traits. No differences in genomic accuracy were identified on the basis of the Bayesian approaches in these four growth and production-related traits. The genomic accuracy is improved by using deregressed estimated breeding values including parental information as a response variable in Korean Duroc pigs. Abstract Genomic evaluation has been widely applied to several species using commercial single nucleotide polymorphism (SNP) genotyping platforms. This study investigated the informative genomic regions and the efficiency of genomic prediction by using two Bayesian approaches (BayesB and BayesC) under two moderate-density SNP genotyping panels in Korean Duroc pigs. Growth and production records of 1026 individuals were genotyped using two medium-density, SNP genotyping platforms: Illumina60K and GeneSeek80K. These platforms consisted of 61,565 and 68,528 SNP markers, respectively. The deregressed estimated breeding values (DEBVs) derived from estimated breeding values (EBVs) and their reliabilities were taken as response variables. Two Bayesian approaches were implemented to perform the genome-wide association study (GWAS) and genomic prediction. Multiple significant regions for days to 90 kg (DAYS), lean muscle area (LMA), and lean percent (PCL) were detected. The most significant SNP marker, located near the MC4R gene, was detected using GeneSeek80K. Accuracy of genomic predictions was higher using the GeneSeek80K SNP panel for DAYS (Δ2%) and LMA (Δ2–3%) with two response variables, with no gains in accuracy by the Bayesian approaches in four growth and production-related traits. Genomic prediction is best derived from DEBVs including parental information as a response variable between two DEBVs regardless of the genotyping platform and the Bayesian method for genomic prediction accuracy in Korean Duroc pig breeding.
Collapse
|
4
|
Baller JL, Howard JT, Kachman SD, Spangler ML. The impact of clustering methods for cross-validation, choice of phenotypes, and genotyping strategies on the accuracy of genomic predictions. J Anim Sci 2019; 97:1534-1549. [PMID: 30721970 DOI: 10.1093/jas/skz055] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Accepted: 02/04/2019] [Indexed: 01/22/2023] Open
Abstract
For genomic predictors to be of use in genetic evaluation, their predicted accuracy must be a reliable indicator of their utility, and thus unbiased. The objective of this paper was to evaluate the accuracy of prediction of genomic breeding values (GBV) using different clustering strategies and response variables. Red Angus genotypes (n = 9,763) were imputed to a reference 50K panel. The influence of clustering method [k-means, k-medoids, principal component (PC) analysis on the numerator relationship matrix (A) and the identical-by-state genomic relationship matrix (G) as both data and covariance matrices, and random] and response variables [deregressed estimated breeding values (DEBV) and adjusted phenotypes] were evaluated for cross-validation. The GBV were estimated using a Bayes C model for all traits. Traits for DEBV included birth weight (BWT), marbling (MARB), rib-eye area (REA), and yearling weight (YWT). Adjusted phenotypes included BWT, YWT, and ultrasonically measured intramuscular fat percentage and REA. Prediction accuracies were estimated using the genetic correlation between GBV and associated response variable using a bivariate animal model. A simulation mimicking a cattle population, replicated 5 times, was conducted to quantify differences between true and estimated accuracies. The simulation used the same clustering methods and response variables, with the addition of 2 genotyping strategies (random and top 25% of individuals), and forward validation. The prediction accuracies were estimated similarly, and true accuracies were estimated as the correlation between the residuals of a bivariate model including true breeding value (TBV) and GBV. Using the adjusted Rand index, random clusters were clearly different from relationship-based clustering methods. In both real and simulated data, random clustering consistently led to the largest estimates of accuracy, while no method was consistently associated with more or less bias than other methods. In simulation, random genotyping led to higher estimated accuracies than selection of the top 25% of individuals. Interestingly, random genotyping seemed to overpredict true accuracy while selective genotyping tended to underpredict accuracy. When forward in time validation was used, DEBV led to less biased estimates of GBV accuracy. Results suggest the highest, least biased GBV accuracies are associated with random genotyping and DEBV.
Collapse
Affiliation(s)
- Johnna L Baller
- Department of Animal Science, University of Nebraska, Lincoln, NE
| | - Jeremy T Howard
- Department of Animal Science, University of Nebraska, Lincoln, NE
| | | | | |
Collapse
|
5
|
Lee S, Dang C, Choy Y, Do C, Cho K, Kim J, Kim Y, Lee J. Comparison of genome-wide association and genomic prediction methods for milk production traits in Korean Holstein cattle. ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES 2019; 32:913-921. [PMID: 30744323 PMCID: PMC6601072 DOI: 10.5713/ajas.18.0847] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Accepted: 01/11/2019] [Indexed: 11/27/2022]
Abstract
OBJECTIVE The objectives of this study were to compare identified informative regions through two genome-wide association study (GWAS) approaches and determine the accuracy and bias of the direct genomic value (DGV) for milk production traits in Korean Holstein cattle, using two genomic prediction approaches: single-step genomic best linear unbiased prediction (ss-GBLUP) and Bayesian Bayes-B. METHODS Records on production traits such as adjusted 305-day milk (MY305), fat (FY305), and protein (PY305) yields were collected from 265,271 first parity cows. After quality control, 50,765 single-nucleotide polymorphic genotypes were available for analysis. In GWAS for ss-GBLUP (ssGWAS) and Bayes-B (BayesGWAS), the proportion of genetic variance for each 1-Mb genomic window was calculated and used to identify informative genomic regions. Accuracy of the DGV was estimated by a five-fold cross-validation with random clustering. As a measure of accuracy for DGV, we also assessed the correlation between DGV and deregressed-estimated breeding value (DEBV). The bias of DGV for each method was obtained by determining regression coefficients. RESULTS A total of nine and five significant windows (1 Mb) were identified for MY305 using ssGWAS and BayesGWAS, respectively. Using ssGWAS and BayesGWAS, we also detected multiple significant regions for FY305 (12 and 7) and PY305 (14 and 2), respectively. Both single-step DGV and Bayes DGV also showed somewhat moderate accuracy ranges for MY305 (0.32 to 0.34), FY305 (0.37 to 0.39), and PY305 (0.35 to 0.36) traits, respectively. The mean biases of DGVs determined using the single-step and Bayesian methods were 1.50±0.21 and 1.18±0.26 for MY305, 1.75±0.33 and 1.14±0.20 for FY305, and 1.59±0.20 and 1.14±0.15 for PY305, respectively. CONCLUSION From the bias perspective, we believe that genomic selection based on the application of Bayesian approaches would be more suitable than application of ss-GBLUP in Korean Holstein populations.
Collapse
Affiliation(s)
- SeokHyun Lee
- Animal Breeding and Genetics Division, National Institute of Animal Science, RDA, Cheonan 31000, Korea
| | - ChangGwon Dang
- Animal Breeding and Genetics Division, National Institute of Animal Science, RDA, Cheonan 31000, Korea
| | - YunHo Choy
- Animal Breeding and Genetics Division, National Institute of Animal Science, RDA, Cheonan 31000, Korea
| | - ChangHee Do
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Kwanghyun Cho
- Department of Dairy Science, Korea National College of Agriculture and Fisheries, Jeonju 54874, Korea
| | - Jongjoo Kim
- Division of Applied Life Science, Yeungnam University, Gyeongsan 38541, Korea
| | - Yousam Kim
- Division of Applied Life Science, Yeungnam University, Gyeongsan 38541, Korea
| | - Jungjae Lee
- Jun P&C Institute, INC., Yongin 16950, Korea
| |
Collapse
|
6
|
Speidel SE, Buckley BA, Boldt RJ, Enns RM, Lee J, Spangler ML, Thomas MG. Genome-wide association study of Stayability and Heifer Pregnancy in Red Angus cattle. J Anim Sci 2018; 96:846-853. [PMID: 29471369 PMCID: PMC6093520 DOI: 10.1093/jas/sky041] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 02/15/2018] [Indexed: 11/12/2022] Open
Abstract
Reproductive performance is the most important component of cattle production from the standpoint of economic sustainability of commercial beef enterprises. Heifer Pregnancy (HPG) and Stayability (STAY) genetic predictions are 2 selection tools published by the Red Angus Association of America (RAAA) to assist with improvements in reproductive performance. Given the importance of HPG and STAY to the profitability of commercial beef enterprises, the objective of this study was to identify QTL associated with both HPG and STAY in Red Angus cattle. A genome-wide association study (GWAS) was performed using deregressed HPG and STAY EBV, calculated using a single-trait animal model and a 3-generation pedigree with data from the Spring 2015 RAAA National Cattle Evaluation. Each individual animal possessed 74,659 SNP genotypes. Individual animals with a deregressed EBV reliability > 0.05 were merged with the genotype file and marker quality control was performed. Criteria for sifting genotypes consisted of removing those markers where any of the following were found: average call rate less than 0.85, minor allele frequency < 0.01, lack of Hardy-Weinberg equilibrium (P < 0.0001), or extreme linkage disequilibrium (r2 > 0.99). These criteria resulted in 2,664 animals with 62,807 SNP available for GWAS. Association studies were performed using a Bayes Cπ model in the BOLT software package. Marker significance was calculated as the posterior probability of inclusion (PPI), or the number of instances a specific marker was sampled divided by the total number of samples retained from the Markov chain Monte Carlo chains. Nine markers, with a PPI ≥ 3% were identified as QTL associated with HPG on BTA 1, 11, 13, 23, and 29. Twelve markers, with a PPI ≥ 75% were identified as QTL associated with STAY on BTA 6, 8, 9, 12, 15, 18, 22, and 23.
Collapse
Affiliation(s)
- S E Speidel
- Department of Animal Sciences, Colorado State University, Fort Collins, CO
| | - B A Buckley
- Department of Human Nutrition, Food and Animal Sciences, University of Hawaii at Manoa, Honolulu, HI
| | - R J Boldt
- Department of Animal Sciences, Colorado State University, Fort Collins, CO
| | - R M Enns
- Department of Animal Sciences, Colorado State University, Fort Collins, CO
| | - J Lee
- Department of Animal Science, University of Nebraska-Lincoln, Lincoln, NE
| | - M L Spangler
- Department of Animal Science, University of Nebraska-Lincoln, Lincoln, NE
| | - M G Thomas
- Department of Animal Sciences, Colorado State University, Fort Collins, CO
| |
Collapse
|