1
|
Li Z, He J, Yang F, Yin S, Gao Z, Chen W, Sun C, Tait RG, Bauck S, Guo W, Wu XL. A look under the hood of genomic-estimated breed compositions for brangus cattle: What have we learned? Front Genet 2023; 14:1080279. [PMID: 37056284 PMCID: PMC10086375 DOI: 10.3389/fgene.2023.1080279] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 03/14/2023] [Indexed: 04/05/2023] Open
Abstract
The Brangus cattle were developed to utilize the superior traits of Angus and Brahman cattle. Their genetic compositions are expected to be stabilized at 3/8 Brahman and 5/8 Angus. Previous studies have shown more than expected Angus lineage with Brangus cattle, and the reasons are yet to be investigated. In this study, we revisited the breed compositions for 3,605 Brangus cattle from three perspectives: genome-wise (GBC), per chromosomes (CBC), and per chromosome segments (SBC). The former (GBC) depicted an overall picture of the “mosaic” genome of the Brangus attributable to their ancestors, whereas the latter two criteria (CBC and SBC) corresponded to local ancestral contributions. The average GBC for the 3,605 Brangus cattle were 70.2% Angus and 29.8% Brahman. The K-means clustering supported the postulation of the mixture of 1/2 Ultrablack (UB) animals in Brangus. For the non-UB Brangus animals, the average GBC were estimated to be 67.4% Angus and 32.6% Brahman. The 95% confidence intervals of their overall GBC were 60.4%–73.5% Angus and 26.5%–39.6% Brahman. Possibly, genetic selection and drifting have resulted in an approximately 5% average deviation toward Angus lineage. The estimated ancestral contributions by chromosomes were heavily distributed toward Angus, with 27 chromosomes having an average Angus CBC greater than 62.5% but only two chromosomes (5 and 20) having Brahman CBC greater than 37.5%. The chromosomal regions with high Angus breed proportions were prevalent, tending to form larger blocks on most chromosomes. In contrast, chromosome segments with high Brahman breed proportion were relatively few and isolated, presenting only on seven chromosomes. Hence, genomic hitchhiking effects were strong where Angus favorable alleles resided but weak where Brahman favorable alleles were present. The functions of genes identified in the chromosomal regions with high (≥75%) Angus compositions were diverse yet may were related to growth and body development. In contrast, the genes identified in the regions with high (≥37.5%) Brahman compositions were primarily responsible for disease resistance. In conclusion, we have addressed the questions concerning the Brangus genetic make-ups. The results can help form a dynamic picture of the Brangus breed formation and the genomic reshaping.
Collapse
Affiliation(s)
- Zhi Li
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Jun He
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
- *Correspondence: Jun He, ; Xiao-Lin Wu,
| | - Fang Yang
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Shishu Yin
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Zhendong Gao
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Wenwu Chen
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, China
| | - Chuanyu Sun
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, United States
| | - Richard G. Tait
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, United States
| | - Stewart Bauck
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, United States
| | - Wei Guo
- Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, United States
| | - Xiao-Lin Wu
- Department of Animal and Dairy Sciences, University of Wisconsin, Madison, WI, United States
- Council on Dairy Cattle Breeding, Bowie, MD, United States
- *Correspondence: Jun He, ; Xiao-Lin Wu,
| |
Collapse
|
2
|
Wilmot H, Glorieux G, Hubin X, Gengler N. A genomic breed assignment test for traceability of meat of Dual-Purpose Blue. Livest Sci 2022. [DOI: 10.1016/j.livsci.2022.104996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
3
|
Wilmot H, Bormann J, Soyeurt H, Hubin X, Glorieux G, Mayeres P, Bertozzi C, Gengler N. Development of a genomic tool for breed assignment by comparison of different classification models: Application to three local cattle breeds. J Anim Breed Genet 2021; 139:40-61. [PMID: 34427366 DOI: 10.1111/jbg.12643] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 08/06/2021] [Accepted: 08/08/2021] [Indexed: 12/11/2022]
Abstract
Assignment of individual cattle to a specific breed can often not rely on pedigree information. This is especially the case for local breeds for which the development of genomic assignment tools is required to allow individuals of unknown origin to be included to their herd books. A breed assignment model can be based on two specific stages: (a) the selection of breed-informative markers and (b) the assignment of individuals to a breed with a classification method. However, the performance of combination of methods used in these two stages has been rarely studied until now. In this study, the combination of 16 different SNP panels with four classification methods was developed on 562 reference genotypes from 12 cattle breeds. Based on their performances, best models were validated on three local breeds of interest. In cross-validation, 14 models had a global cross-validation accuracy higher than 90%, with a maximum of 98.22%. In validation, best models used 7,153 or 2,005 SNPs, based on a partial least squares-discriminant analysis (PLS-DA) and assigned individuals to breeds based on nearest shrunken centroids. The average validation sensitivity of the first two best models for the three local breeds of interest were 98.33% and 97.5%. Moreover, results reported in this study suggest that further studies should consider the PLS-DA method when selecting breed-informative SNPs.
Collapse
Affiliation(s)
- Hélène Wilmot
- National Fund for Scientific Research (F.R.S.-FNRS), Brussels, Belgium.,TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium
| | - Jeanne Bormann
- Administration of Technical Agricultural Services (ASTA), Luxembourg, Grand Duchy of Luxembourg
| | - Hélène Soyeurt
- TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium
| | | | | | | | | | - Nicolas Gengler
- TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, University of Liège, Gembloux, Belgium
| |
Collapse
|
4
|
Gao Z, Zhang Y, Li Z, Zeng Q, Yang F, Song Y, Song Y, He J. Genomic breed composition of Ningxiang pig via different SNP panels. J Anim Physiol Anim Nutr (Berl) 2021; 106:783-791. [PMID: 34260785 DOI: 10.1111/jpn.13603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 06/17/2021] [Accepted: 06/21/2021] [Indexed: 11/30/2022]
Abstract
The genomic breed composition (GBC) reflects the genetic relationship between individual animal and ancestor breeds in composite or hybrid breeds. Also, it can estimate the genomic contribution of each breed (ancestor) to the genome of each individual animal. Using genomic SNP information to estimate Ningxiang pig GBC is of great significance. First of all, GBC was widely used in cattle and had significant effects, but there is almost no using experience in Chinese endemic pig breeds. Importantly, High-density SNPs are expensive but can be economized by deploying a relatively small number of highly informative SNP scattered evenly across the genome. Moreover, the impact of low-density SNPs selection strategy on estimating the GBC of individual animals has not been fully explained. Using SNP data from different databases and organizations, we established reference (N = 2015) and verification (N = 302) data sets. Twelve successively smaller SNP panels (500, 1K, 5K, 10K) were built from those SNP in the reference data by three selection methods (uniform, maximized the Euclidean distance (MED) and random distribution method). For each panel, the GBC of Ningxiang pigs in the reference dataset was estimated. Then combining Shannon entropy and the GBC results, the optimal panel (the 10K SNP panel constructed by MED method) was picked out to estimate the GBC of verification Ningxiang pig, which detected that 230 individuals were purebred Ningxiang pigs and the remaining 72 impure individuals contained 6.44% blood related with Rongchang pigs and 4.09% with Bamaxiang pigs in the verification Ningxiang population. Finally, the genetic structure analysis of verification population was performed combining with the results of GBC, multi-dimensional scaling (MDS) analysis and hierarchical cluster analysis. These results showed: (a) GBC could accurately identify purebred Ningxiang pigs and, scientifically, calculate the genomic contribution of each breed of each hybrid animal. (b) GBC could carry out population genetic structure and understand the genetic background of Ningxiang pigs. Such findings highlight a variety of opportunities to better protect and identify other endangered local breeds in China facing the same situation as Ningxiang pig and provide more accurate, economical and efficient new technical support in GBC estimation breeding work.
Collapse
Affiliation(s)
- Zhendong Gao
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Yuebo Zhang
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Zhi Li
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Qinhua Zeng
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Fang Yang
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Yuexiang Song
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Yukun Song
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Jun He
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| |
Collapse
|
5
|
Estimating breed composition for pigs: A case study focused on Mangalitsa pigs and two methods. Livest Sci 2021. [DOI: 10.1016/j.livsci.2021.104398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
6
|
Li Z, Wu XL, Guo W, He J, Li H, Rosa GJM, Gianola D, Tait RG, Parham J, Genho J, Schultz T, Bauck S. Estimation of genomic breed composition of individual animals in composite beef cattle. Anim Genet 2020; 51:457-460. [PMID: 32239777 DOI: 10.1111/age.12928] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/10/2020] [Indexed: 02/01/2023]
Abstract
Three statistical models (an admixture model, linear regression, and ridge-regression BLUP) and two strategies for selecting SNP panels (uniformly spaced vs. maximum Euclidean distance of SNP allele frequencies between ancestral breeds) were compared for estimating genomic-estimated breed composition (GBC) in Brangus and Santa Gertrudis cattle, respectively. Animals were genotyped with a GeneSeek Genomic Profiler bovine low-density version 4 SNP chip. The estimated GBC was consistent among the uniformly spaced SNP panels, and values were similar between the three models. However, estimated GBC varied considerably between the three methods when using fewer than 10 000 SNPs that maximized the Euclidean distance of allele frequencies between the ancestral breeds. The admixture model performed most consistently across various SNP panel sizes. For the other two models, stabilized estimates were obtained with an SNP panel size of 20 000 SNPs or more. Based on the uniformly spaced 20K SNP panel, the estimated GBC was 69.8-70.5% Angus and 29.5-30.2% Brahman for Brangus, and 63.9-65.3% Shorthorn and 34.7-36.1% Brahman in Santa Gertrudis. The estimated GBC of ancestries for Santa Gertrudis roughly agreed with the pedigree-expected values. However, the estimated GBC in Brangus showed a considerably larger Angus composition than the pedigree-expected value (62.5%). The elevated Angus composition in the Brangus could be due to the mixture of some 1/2 Ultrablack animals (Brangus × Angus). Another reason could be the consequences of selection in Brangus cattle for phenotypes where the Angus breed has advantages.
Collapse
Affiliation(s)
- Z Li
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA.,Department of Animal Science, University of Wyoming, Laramie, WY, 82071, USA.,College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, 410128, China
| | - X-L Wu
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA.,Department of Animal Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - W Guo
- Department of Animal Science, University of Wyoming, Laramie, WY, 82071, USA
| | - J He
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA.,College of Animal Science and Technology, Hunan Agricultural University, Changsha, Hunan, 410128, China
| | - H Li
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA.,Department of Animal Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - G J M Rosa
- Department of Animal Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - D Gianola
- Department of Animal Sciences, University of Wisconsin, Madison, WI, 53706, USA
| | - R G Tait
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA
| | - J Parham
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA
| | - J Genho
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA
| | - T Schultz
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA
| | - S Bauck
- Biostatistics and Bioinformatics, Neogen GeneSeek, Lincoln, NE, 68504, USA
| |
Collapse
|
7
|
Hulsegge I, Schoon M, Windig J, Neuteboom M, Hiemstra SJ, Schurink A. Development of a genetic tool for determining breed purity of cattle. Livest Sci 2019. [DOI: 10.1016/j.livsci.2019.03.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
8
|
Gobena M, Elzo MA, Mateescu RG. Population Structure and Genomic Breed Composition in an Angus-Brahman Crossbred Cattle Population. Front Genet 2018; 9:90. [PMID: 29636769 PMCID: PMC5881247 DOI: 10.3389/fgene.2018.00090] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 03/05/2018] [Indexed: 12/27/2022] Open
Abstract
Crossbreeding is a common strategy used in tropical and subtropical regions to enhance beef production, and having accurate knowledge of breed composition is essential for the success of a crossbreeding program. Although pedigree records have been traditionally used to obtain the breed composition of crossbred cattle, the accuracy of pedigree-based breed composition can be reduced by inaccurate and/or incomplete records and Mendelian sampling. Breed composition estimation from genomic data has multiple advantages including higher accuracy without being affected by missing, incomplete, or inaccurate records and the ability to be used as independent authentication of breed in breed-labeled beef products. The present study was conducted with 676 Angus–Brahman crossbred cattle with genotype and pedigree information to evaluate the feasibility and accuracy of using genomic data to determine breed composition. We used genomic data in parametric and non-parametric methods to detect population structure due to differences in breed composition while accounting for the confounding effect of close familial relationships. By applying principal component analysis (PCA) and the maximum likelihood method of ADMIXTURE to genomic data, it was possible to successfully characterize population structure resulting from heterogeneous breed ancestry, while accounting for close familial relationships. PCA results offered additional insight into the different hierarchies of genetic variation structuring. The first principal component was strongly correlated with Angus–Brahman proportions, and the second represented variation within animals that have a relatively more extended Brangus lineage—indicating the presence of a distinct pattern of genetic variation in these cattle. Although there was strong agreement between breed proportions estimated from pedigree and genetic information, there were significant discrepancies between these two methods for certain animals. This was most likely due to inaccuracies in the pedigree-based estimation of breed composition, which supported the case for using genomic information to complement and/or replace pedigree information when estimating breed composition. Comparison with a supervised analysis where purebreds are used as the training set suggest that accurate predictions can be achieved even in the absence of purebred population information.
Collapse
Affiliation(s)
- Mesfin Gobena
- Department of Animal Sciences, University of Florida, Gainesville, FL, United States
| | - Mauricio A Elzo
- Department of Animal Sciences, University of Florida, Gainesville, FL, United States
| | - Raluca G Mateescu
- Department of Animal Sciences, University of Florida, Gainesville, FL, United States
| |
Collapse
|
9
|
Martínez CA, Khare K, Elzo MA. BIBI: Bayesian inference of breed composition. J Anim Breed Genet 2017; 135:54-61. [PMID: 29164684 DOI: 10.1111/jbg.12305] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 10/24/2017] [Indexed: 11/28/2022]
Abstract
The aim of this paper was to develop statistical models to estimate individual breed composition based on the previously proposed idea of regressing discrete random variables corresponding to counts of reference alleles of biallelic molecular markers located across the genome on the allele frequencies of each marker in the pure (base) breeds. Some of the existing regression-based methods do not guarantee that estimators of breed composition will lie in the appropriate parameter space, and none of them account for uncertainty about allele frequencies in the pure breeds, that is, uncertainty about the design matrix. To overcome these limitations, we proposed two Bayesian generalized linear models. For each individual, both models assume that the counts of the reference allele at each marker locus follow independent Binomial distributions, use the logit link and pose a Dirichlet prior over the vector of regression coefficients (which corresponds to breed composition). This prior guarantees that point estimators of breed composition such as the posterior mean pertain to the appropriate space. The difference between these models is that model termed BIBI does not account for uncertainty about the design matrix, while model termed BIBI2 accounts for such an uncertainty by assigning independent Beta priors to the entries of this matrix. We implemented these models in a data set from the University of Florida's multibreed Angus-Brahman population. Posterior means were used as point estimators of breed composition. In addition, the ordinary least squares estimator proposed by Kuehn et al. () (OLSK) was also computed. BIBI and BIBI2 estimated breed composition more accurately than OLSK, and BIBI2 had a 7.69% improvement in accuracy as compared to BIBI.
Collapse
Affiliation(s)
- C A Martínez
- Department of Animal Sciences, University of Florida, Gainesville, FL, USA
| | - K Khare
- Department of Statistics, University of Florida, Gainesville, FL, USA
| | - M A Elzo
- Department of Animal Sciences, University of Florida, Gainesville, FL, USA
| |
Collapse
|