Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Brøndum RF, Su G, Janss L, Sahana G, Guldbrandtsen B, Boichard D, Lund MS. Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. J Dairy Sci 2015;98:4107-16. [PMID: 25892697 DOI: 10.3168/jds.2014-9005] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2014] [Accepted: 03/12/2015] [Indexed: 12/30/2022]

For:	Brøndum RF, Su G, Janss L, Sahana G, Guldbrandtsen B, Boichard D, Lund MS. Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. J Dairy Sci 2015;98:4107-16. [PMID: 25892697 DOI: 10.3168/jds.2014-9005] [Citation(s) in RCA: 106] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2014] [Accepted: 03/12/2015] [Indexed: 12/30/2022]

Number

Cited by Other Article(s)

Chegini A, Strandén I, Karaman E, Iso-Touru T, Pösö J, Aamand GP, Lidauer MH. Marker weighting improves single-step genomic prediction reliabilities of udder health traits in Nordic Red and Jersey dairy cattle populations. J Dairy Sci 2024:S0022-0302(24)01196-2. [PMID: 39369893 DOI: 10.3168/jds.2024-25374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/04/2024] [Indexed: 10/08/2024]

Abstract

The standard single-step genomic prediction model assumes that all SNP markers explain an equal amount of genetic variance, which, however, may not be true. This is because SNPs are located in or near different genes with different functions. Therefore, it seems logical to consider SNP marker-specific weights when predicting genomic breeding values. We hypothesized that allowing differences in the amount of genetic variance explained by each SNP marker will improve prediction reliability and response to selection. To investigate this hypothesis, we first developed multi-trait standard single-step genomic models based on the current multi-trait random regression evaluation models for udder health traits of the Nordic Red (RDC) and Jersey (JER) dairy cattle populations. The models included 4 clinical mastitis (CM) traits, 3 test-day somatic cell score (SCS) traits, and the conformation traits fore udder attachment and udder depth. In the second step, we investigated the effect of applying different SNP marker weighting scenarios in the single-step genomic prediction models, for which a single-step SNP best linear unbiased prediction model was applied. We investigated the prediction reliability of the different models by forward prediction, where the last 4 years of the data were removed to estimate breeding values for validation candidates. In addition, genetic trends of the pedigree-based estimated breeding values (PEBV) and genomic enhanced breeding values (GEBV) were examined. The data sets for RDC and JER included 6.9 and 1.2 million animals of which 5.6 and 0.9 million cows had records, respectively. The number of genotyped animals was 125,789 and 64,777 for RDC and JER, respectively. Cows had repeated SCS observations but only single observations for all other traits and breeding values for all traits were modeled by one covariance function. This required modeling 12 eigenvalue breeding value coefficients for each cow and developing SNP marker weights for the principal components rather than for the biological traits. We investigated 3 SNP marker weighting scenarios: 1) a nonlinear method similar to BayesA, 2) using the classical formula 2pqû2 that accounts for allele heterozygosity, and 3) applying a mean SNP weight calculated by 2pqû2 for every 20 adjacent SNP markers. Bias, dispersion, and prediction reliability were calculated using PEBV or GEBV from the evaluation based on the full data set on those using the reduced data set. We found that the recent favorable genetic trend in CM and SCS has been accelerated since the introduction of genomic selection. The study also shows that a significant increase in prediction reliability, i.e., 0.74 vs. 0.48 for RDC and 0.72 vs. 0.41 for JER cows for CM, can be achieved with a standard single-step genomic prediction model compared with a pedigree-based prediction model. Almost all scenarios with SNP marker weighting further improved the prediction reliability between 0.5% and 12.7%. The highest improvement was achieved by weighing the SNP markers based on the 2pqû2 formula.

Collapse

Strandén I, Jenko J. A computationally feasible multi-trait single-step genomic prediction model with trait-specific marker weights. Genet Sel Evol 2024;56:58. [PMID: 39152403 PMCID: PMC11328383 DOI: 10.1186/s12711-024-00926-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 07/26/2024] [Indexed: 08/19/2024] Open

Pocrnic I, Lourenco D, Misztal I. Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions. Genetics 2024;227:iyae103. [PMID: 38913695 PMCID: PMC11304960 DOI: 10.1093/genetics/iyae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024] Open

van den Berg I, Chamberlain AJ, MacLeod IM, Nguyen TV, Goddard ME, Xiang R, Mason B, Meier S, Phyn CVC, Burke CR, Pryce JE. Using expression data to fine map QTL associated with fertility in dairy cattle. Genet Sel Evol 2024;56:42. [PMID: 38844868 PMCID: PMC11154999 DOI: 10.1186/s12711-024-00912-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 05/13/2024] [Indexed: 06/09/2024] Open

Abstract

BACKGROUND

Female fertility is an important trait in dairy cattle. Identifying putative causal variants associated with fertility may help to improve the accuracy of genomic prediction of fertility. Combining expression data (eQTL) of genes, exons, gene splicing and allele specific expression is a promising approach to fine map QTL to get closer to the causal mutations. Another approach is to identify genomic differences between cows selected for high and low fertility and a selection experiment in New Zealand has created exactly this resource. Our objective was to combine multiple types of expression data, fertility traits and allele frequency in high- (POS) and low-fertility (NEG) cows with a genome-wide association study (GWAS) on calving interval in Australian cows to fine-map QTL associated with fertility in both Australia and New Zealand dairy cattle populations.

RESULTS

Variants that were significantly associated with calving interval (CI) were strongly enriched for variants associated with gene, exon, gene splicing and allele-specific expression, indicating that there is substantial overlap between QTL associated with CI and eQTL. We identified 671 genes with significant differential expression between POS and NEG cows, with the largest fold change detected for the CCDC196 gene on chromosome 10. Our results provide numerous candidate genes associated with female fertility in dairy cattle, including GYS2 and TIGAR on chromosome 5 and SYT3 and HSD17B14 on chromosome 18. Multiple QTL regions were located in regions with large numbers of copy number variants (CNV). To identify the causal mutations for these variants, long read sequencing may be useful.

CONCLUSIONS

Variants that were significantly associated with CI were highly enriched for eQTL. We detected 671 genes that were differentially expressed between POS and NEG cows. Several QTL detected for CI overlapped with eQTL, providing candidate genes for fertility in dairy cattle.

Collapse

Du A, Guo Z, Chen A, Xu L, Sun D, Han B. PC Gene Affects Milk Production Traits in Dairy Cattle. Genes (Basel) 2024;15:708. [PMID: 38927644 PMCID: PMC11202589 DOI: 10.3390/genes15060708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 05/20/2024] [Accepted: 05/27/2024] [Indexed: 06/28/2024] Open

Zhang M, Xu L, Lu H, Luo H, Zhou J, Wang D, Zhang X, Huang X, Wang Y. Genomic prediction based on a joint reference population for the Xinjiang Brown cattle. Front Genet 2024;15:1394636. [PMID: 38737126 PMCID: PMC11082323 DOI: 10.3389/fgene.2024.1394636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 04/10/2024] [Indexed: 05/14/2024] Open

Abstract

Introduction: Xinjiang Brown cattle constitute the largest breed of cattle in Xinjiang. Therefore, it is crucial to establish a genomic evaluation system, especially for those with low levels of breed improvement. Methods: This study aimed to establish a cross breed joint reference population by analyzing the genetic structure of 485 Xinjiang Brown cattle and 2,633 Chinese Holstein cattle (Illumina GeneSeek GGP bovine 150 K chip). The Bayes method single-step genome-wide best linear unbiased prediction was used to conduct a genomic evaluation of the joint reference population for the milk traits of Xinjiang Brown cattle. The reference population of Chinese Holstein cattle was randomly divided into groups to construct the joint reference population. By comparing the prediction accuracy, estimation bias, and inflation coefficient of the validation population, the optimal number of joint reference populations was determined. Results and Discussion: The results indicated a distinct genetic structure difference between the two breeds of adult cows, and both breeds should be considered when constructing multi-breed joint reference and validation populations. The reliability range of genome prediction of milk traits in the joint reference population was 0.142-0.465. Initially, it was determined that the inclusion of 600 and 900 Chinese Holstein cattle in the joint reference population positively impacted the genomic prediction of Xinjiang Brown cattle to certain extent. It was feasible to incorporate the Chinese Holstein into Xinjiang Brown cattle population to form a joint reference population for multi-breed genomic evaluation. However, for different Xinjiang Brown cattle populations, a fixed number of Chinese Holstein cattle cannot be directly added during multi-breed genomic selection. Pre-evaluation analysis based on the genetic structure, kinship, and other factors of the current population is required to ensure the authenticity and reliability of genomic predictions and improve estimation accuracy.

Collapse

Ćeran M, Đorđević V, Miladinović J, Vasiljević M, Đukić V, Ranđelović P, Jaćimović S. Selective Genotyping and Phenotyping for Optimization of Genomic Prediction Models for Populations with Different Diversity. PLANTS (BASEL, SWITZERLAND) 2024;13:975. [PMID: 38611503 PMCID: PMC11013471 DOI: 10.3390/plants13070975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 03/22/2024] [Accepted: 03/24/2024] [Indexed: 04/14/2024]

Abstract

To overcome the different challenges to food security caused by a growing population and climate change, soybean (Glycine max (L.) Merr.) breeders are creating novel cultivars that have the potential to improve productivity while maintaining environmental sustainability. Genomic selection (GS) is an advanced approach that may accelerate the rate of genetic gain in breeding using genome-wide molecular markers. The accuracy of genomic selection can be affected by trait architecture and heritability, marker density, linkage disequilibrium, statistical models, and training set. The selection of a minimal and optimal marker set with high prediction accuracy can lower genotyping costs, computational time, and multicollinearity. Selective phenotyping could reduce the number of genotypes tested in the field while preserving the genetic diversity of the initial population. This study aimed to evaluate different methods of selective genotyping and phenotyping on the accuracy of genomic prediction for soybean yield. The evaluation was performed on three populations: recombinant inbred lines, multifamily diverse lines, and germplasm collection. Strategies adopted for marker selection were as follows: SNP (single nucleotide polymorphism) pruning, estimation of marker effects, randomly selected markers, and genome-wide association study. Reduction of the number of genotypes was performed by selecting a core set from the initial population based on marker data, yet maintaining the original population's genetic diversity. Prediction ability using all markers and genotypes was different among examined populations. The subsets obtained by the model-based strategy can be considered the most suitable for marker selection for all populations. The selective phenotyping based on makers in all cases had higher values of prediction ability compared to minimal values of prediction ability of multiple cycles of random selection, with the highest values of prediction obtained using AN approach and 75% population size. The obtained results indicate that selective genotyping and phenotyping hold great potential and can be integrated as tools for improving or retaining selection accuracy by reducing genotyping or phenotyping costs for genomic selection.

Collapse

Meuwissen T, Eikje LS, Gjuvsland AB. GWABLUP: genome-wide association assisted best linear unbiased prediction of genetic values. Genet Sel Evol 2024;56:17. [PMID: 38429665 PMCID: PMC11234632 DOI: 10.1186/s12711-024-00881-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 01/31/2024] [Indexed: 03/03/2024] Open

Abstract

BACKGROUND

Since the very beginning of genomic selection, researchers investigated methods that improved upon SNP-BLUP (single nucleotide polymorphism best linear unbiased prediction). SNP-BLUP gives equal weight to all SNPs, whereas it is expected that many SNPs are not near causal variants and thus do not have substantial effects. A recent approach to remedy this is to use genome-wide association study (GWAS) findings and increase the weights of GWAS-top-SNPs in genomic predictions. Here, we employ a genome-wide approach to integrate GWAS results into genomic prediction, called GWABLUP.

RESULTS

GWABLUP consists of the following steps: (1) performing a GWAS in the training data which results in likelihood ratios; (2) smoothing the likelihood ratios over the SNPs; (3) combining the smoothed likelihood ratio with the prior probability of SNPs having non-zero effects, which yields the posterior probability of the SNPs; (4) calculating a weighted genomic relationship matrix using the posterior probabilities as weights; and (5) performing genomic prediction using the weighted genomic relationship matrix. Using high-density genotypes and milk, fat, protein and somatic cell count phenotypes on dairy cows, GWABLUP was compared to GBLUP, GBLUP (topSNPs) with extra weights for GWAS top-SNPs, and BayesGC, i.e. a Bayesian variable selection model. The GWAS resulted in six, five, four, and three genome-wide significant peaks for milk, fat and protein yield and somatic cell count, respectively. GWABLUP genomic predictions were 10, 6, 7 and 1% more reliable than those of GBLUP for milk, fat and protein yield and somatic cell count, respectively. It was also more reliable than GBLUP (topSNPs) for all four traits, and more reliable than BayesGC for three of the traits. Although GWABLUP showed a tendency towards inflation bias for three of the traits, this was not statistically significant. In a multitrait analysis, GWABLUP yielded the highest accuracy for two of the traits. However, for SCC, which was relatively unrelated to the yield traits, including yield trait GWAS-results reduced the reliability compared to a single trait analysis.

CONCLUSIONS

GWABLUP uses GWAS results to differentially weigh all the SNPs in a weighted GBLUP genomic prediction analysis. GWABLUP yielded up to 10% and 13% more reliable genomic predictions than GBLUP for single and multitrait analyses, respectively. Extension of GWABLUP to single-step analyses is straightforward.

Collapse

Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023;13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open

Affiliation(s)

Yiyi Liu National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Yuling Zhang National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Fuchen Zhou National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Zekai Yao National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Yuexin Zhan National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Zhenfei Fan National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Xianglun Meng National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Zebin Zhang National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Langqing Liu National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Jie Yang National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
Zhenfang Wu National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
Gengyuan Cai National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
Enqin Zheng National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.) Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China

Collapse

Wei C, Chang C, Zhang W, Ren D, Cai X, Zhou T, Shi S, Wu X, Si J, Yuan X, Li J, Zhang Z. Preselecting Variants from Large-Scale Genome-Wide Association Study Meta-Analyses Increases the Genomic Prediction Accuracy of Growth and Carcass Traits in Large White Pigs. Animals (Basel) 2023;13:3746. [PMID: 38136785 PMCID: PMC10740834 DOI: 10.3390/ani13243746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/30/2023] [Accepted: 11/30/2023] [Indexed: 12/24/2023] Open

Affiliation(s)

Chen Wei National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Chengjie Chang National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Wenjing Zhang National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Duanyang Ren National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Xiaodian Cai National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Tianru Zhou National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Shaolei Shi National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Xibo Wu Guangxi State Farms Yongxin Animal Husbandry Group Co., Ltd., Nanning 530022, China; (X.W.); (J.S.)
Jinglei Si Guangxi State Farms Yongxin Animal Husbandry Group Co., Ltd., Nanning 530022, China; (X.W.); (J.S.)
Xiaolong Yuan National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Jiaqi Li National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)
Zhe Zhang National Engineering Research Centre for Swine Breeding Industry, Provincial Key Laboratory of Agricultural Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510640, China; (C.W.); (C.C.); (W.Z.); (D.R.); (X.C.); (T.Z.); (S.S.); (X.Y.); (J.L.)

Collapse

Liu Y, Han B, Zheng W, Peng P, Yang C, Jiang G, Ma Y, Li J, Ni J, Sun D. Identification of genetic associations and functional SNPs of bovine KLF6 gene on milk production traits in Chinese holstein. BMC Genom Data 2023;24:72. [PMID: 38017423 PMCID: PMC10685595 DOI: 10.1186/s12863-023-01175-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 11/13/2023] [Indexed: 11/30/2023] Open

Abstract

BACKGROUND

Our previous research identified the Kruppel like factor 6 (KLF6) gene as a prospective candidate for milk production traits in dairy cattle. The expression of KLF6 in the livers of Holstein cows during the peak of lactation was significantly higher than that during the dry and early lactation periods. Notably, it plays an essential role in activating peroxisome proliferator-activated receptor α (PPARα) signaling pathways. The primary aim of this study was to further substantiate whether the KLF6 gene has significant genetic effects on milk traits in dairy cattle.

RESULTS

Through direct sequencing of PCR products with pooled DNA, we totally identified 12 single nucleotide polymorphisms (SNPs) within the KLF6 gene. The set of SNPs encompasses 7 located in 5' flanking region, 2 located in exon 2 and 3 located in 3' untranslated region (UTR). Of these, the g.44601035G > A is a missense mutation that resulting in the replacement of arginine (CGG) with glutamine (CAG), consequently leading to alterations in the secondary structure of the KLF6 protein, as predicted by SOPMA. The remaining 7 regulatory SNPs significantly impacted the transcriptional activity of KLF6 following mutation (P < 0.005), manifesting as changes in transcription factor binding sites. Additionally, 4 SNPs located in both the UTR and exons were predicted to influence the secondary structure of KLF6 mRNA using the RNAfold web server. Furthermore, we performed the genotype-phenotype association analysis using SAS 9.2 which found all the 12 SNPs were significantly correlated to milk yield, fat yield, fat percentage, protein yield and protein percentage within both the first and second lactations (P < 0.0001 ~ 0.0441). Also, with Haploview 4.2 software, we found the 12 SNPs linked closely and formed a haplotype block, which was strongly associated with five milk traits (P < 0.0001 ~ 0.0203).

CONCLUSIONS

In summary, our study represented the KLF6 gene has significant impacts on milk yield and composition traits in dairy cattle. Among the identified SNPs, 7 were implicated in modulating milk traits by impacting transcriptional activity, 4 by altering mRNA secondary structure, and 1 by affecting the protein secondary structure of KLF6. These findings provided valuable molecular insights for genomic selection program of dairy cattle.

Collapse

Affiliation(s)

Yanan Liu Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing, 100193, China
Bo Han Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing, 100193, China
Weijie Zheng Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing, 100193, China
Peng Peng Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing, 100193, China
Chendong Yang Hebei Province Animal Husbandry and Fine Breeds Work Station, No. 7 Xuefu Road, Changan District, Shijiazhuang, 050000, China
Guie Jiang Hebei Province Animal Husbandry and Fine Breeds Work Station, No. 7 Xuefu Road, Changan District, Shijiazhuang, 050000, China
Yabin Ma Hebei Province Animal Husbandry and Fine Breeds Work Station, No. 7 Xuefu Road, Changan District, Shijiazhuang, 050000, China
Jianming Li Hebei Province Animal Husbandry and Fine Breeds Work Station, No. 7 Xuefu Road, Changan District, Shijiazhuang, 050000, China
Junqing Ni Hebei Province Animal Husbandry and Fine Breeds Work Station, No. 7 Xuefu Road, Changan District, Shijiazhuang, 050000, China.
Dongxiao Sun Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, Department of Animal Genetics, Breeding and Reproduction, College of Animal Science and Technology, China Agricultural University, No. 2 Yuanmingyuan West Road, Haidian District, Beijing, 100193, China.

Collapse

Id-Lahoucine S, Cánovas A, Legarra A, Casellas J. Transmission ratio distortion regions in the context of genomic evaluation and their effects on reproductive traits in cattle. J Dairy Sci 2023;106:7786-7798. [PMID: 37210358 DOI: 10.3168/jds.2022-23062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 04/19/2023] [Indexed: 05/22/2023]

Abstract

Transmission ratio distortion (TRD), which is a deviation from Mendelian expectations, has been associated with basic mechanisms of life such as sperm and ova fertility and viability at developmental stages of the reproductive cycle. In this study different models including TRD regions were tested for different reproductive traits [days from first service to conception (FSTC), number of services, first service nonreturn rate (NRR), and stillbirth (SB)]. Thus, in addition to a basic model with systematic and random effects, including genetic effects modeled through a genomic relationship matrix, we developed 2 additional models, including a second genomic relationship matrix based on TRD regions, and TRD regions as a random effect assuming heterogeneous variances. The analyses were performed with 10,623 cows and 1,520 bulls genotyped for 47,910 SNPs, 590 TRD regions, and several records ranging from 9,587 (FSTC) to 19,667 (SB). The results of this study showed the ability of TRD regions to capture some additional genetic variance for some traits; however, this did not translate into higher accuracy for genomic prediction. This could be explained by the nature of TRD itself, which may arise in different stages of the reproductive cycle. Nevertheless, important effects of TRD regions were found on SB (31 regions) and NRR (18 regions) when comparing at-risk versus control matings, especially for regions with allelic TRD pattern. Particularly for NRR, the probability of observing nonpregnant cow increases by up to 27% for specific TRD regions, and the probability of observing stillbirth increased by up to 254%. These results support the relevance of several TRD regions on some reproductive traits, especially those with allelic patterns that have not received as much attention as recessive TRD patterns.

Collapse

Xie L, Qin J, Rao L, Cui D, Tang X, Chen L, Xiao S, Zhang Z, Huang L. Genetic dissection and genomic prediction for pork cuts and carcass morphology traits in pig. J Anim Sci Biotechnol 2023;14:116. [PMID: 37660101 PMCID: PMC10475202 DOI: 10.1186/s40104-023-00914-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 07/02/2023] [Indexed: 09/04/2023] Open

Abstract

BACKGROUND

As pre-cut and pre-packaged chilled meat becomes increasingly popular, integrating the carcass-cutting process into the pig industry chain has become a trend. Identifying quantitative trait loci (QTLs) of pork cuts would facilitate the selection of pigs with a higher overall value. However, previous studies solely focused on evaluating the phenotypic and genetic parameters of pork cuts, neglecting the investigation of QTLs influencing these traits. This study involved 17 pork cuts and 12 morphology traits from 2,012 pigs across four populations genotyped using CC1 PorcineSNP50 BeadChips. Our aim was to identify QTLs and evaluate the accuracy of genomic estimated breed values (GEBVs) for pork cuts.

RESULTS

We identified 14 QTLs and 112 QTLs for 17 pork cuts by GWAS using haplotype and imputation genotypes, respectively. Specifically, we found that HMGA1, VRTN and BMP2 were associated with body length and weight. Subsequent analysis revealed that HMGA1 primarily affects the size of fore leg bones, VRTN primarily affects the number of vertebrates, and BMP2 primarily affects the length of vertebrae and the size of hind leg bones. The prediction accuracy was defined as the correlation between the adjusted phenotype and GEBVs in the validation population, divided by the square root of the trait's heritability. The prediction accuracy of GEBVs for pork cuts varied from 0.342 to 0.693. Notably, ribs, boneless picnic shoulder, tenderloin, hind leg bones, and scapula bones exhibited prediction accuracies exceeding 0.600. Employing better models, increasing marker density through genotype imputation, and pre-selecting markers significantly improved the prediction accuracy of GEBVs.

CONCLUSIONS

We performed the first study to dissect the genetic mechanism of pork cuts and identified a large number of significant QTLs and potential candidate genes. These findings carry significant implications for the breeding of pork cuts through marker-assisted and genomic selection. Additionally, we have constructed the first reference populations for genomic selection of pork cuts in pigs.

Collapse

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Holl J, Herring WO, Misztal I, Lourenco D. Using pre-selected variants from large-scale whole-genome sequence data for single-step genomic predictions in pigs. Genet Sel Evol 2023;55:55. [PMID: 37495982 PMCID: PMC10373252 DOI: 10.1186/s12711-023-00831-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 07/18/2023] [Indexed: 07/28/2023] Open

Abstract

BACKGROUND

Whole-genome sequence (WGS) data harbor causative variants that may not be present in standard single nucleotide polymorphism (SNP) chip data. The objective of this study was to investigate the impact of using preselected variants from WGS for single-step genomic predictions in maternal and terminal pig lines with up to 1.8k sequenced and 104k sequence imputed animals per line.

METHODS

Two maternal and four terminal lines were investigated for eight and seven traits, respectively. The number of sequenced animals ranged from 1365 to 1491 for the maternal lines and 381 to 1865 for the terminal lines. Imputation to sequence occurred within each line for 66k to 76k animals for the maternal lines and 29k to 104k animals for the terminal lines. Two preselected SNP sets were generated based on a genome-wide association study (GWAS). Top40k included the SNPs with the lowest p-value in each of the 40k genomic windows, and ChipPlusSign included significant variants integrated into the porcine SNP chip used for routine genotyping. We compared the performance of single-step genomic predictions between using preselected SNP sets assuming equal or different variances and the standard porcine SNP chip.

RESULTS

In the maternal lines, ChipPlusSign and Top40k showed an average increase in accuracy of 0.6 and 4.9%, respectively, compared to the regular porcine SNP chip. The greatest increase was obtained with Top40k, particularly for fertility traits, for which the initial accuracy based on the standard SNP chip was low. However, in the terminal lines, Top40k resulted in an average loss of accuracy of 1%. ChipPlusSign provided a positive, although small, gain in accuracy (0.9%). Assigning different variances for the SNPs slightly improved accuracies when using variances obtained from BayesR. However, increases were inconsistent across the lines and traits.

CONCLUSIONS

The benefit of using sequence data depends on the line, the size of the genotyped population, and how the WGS variants are preselected. When WGS data are available on hundreds of thousands of animals, using sequence data presents an advantage but this remains limited in pigs.

Collapse

Jang S, Ros-Freixedes R, Hickey JM, Chen CY, Herring WO, Holl J, Misztal I, Lourenco D. Multi-line ssGBLUP evaluation using preselected markers from whole-genome sequence data in pigs. Front Genet 2023;14:1163626. [PMID: 37252662 PMCID: PMC10213539 DOI: 10.3389/fgene.2023.1163626] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 05/03/2023] [Indexed: 05/31/2023] Open

Abstract

Genomic evaluations in pigs could benefit from using multi-line data along with whole-genome sequencing (WGS) if the data are large enough to represent the variability across populations. The objective of this study was to investigate strategies to combine large-scale data from different terminal pig lines in a multi-line genomic evaluation (MLE) through single-step GBLUP (ssGBLUP) models while including variants preselected from whole-genome sequence (WGS) data. We investigated single-line and multi-line evaluations for five traits recorded in three terminal lines. The number of sequenced animals in each line ranged from 731 to 1,865, with 60k to 104k imputed to WGS. Unknown parent groups (UPG) and metafounders (MF) were explored to account for genetic differences among the lines and improve the compatibility between pedigree and genomic relationships in the MLE. Sequence variants were preselected based on multi-line genome-wide association studies (GWAS) or linkage disequilibrium (LD) pruning. These preselected variant sets were used for ssGBLUP predictions without and with weights from BayesR, and the performances were compared to that of a commercial porcine single-nucleotide polymorphisms (SNP) chip. Using UPG and MF in MLE showed small to no gain in prediction accuracy (up to 0.02), depending on the lines and traits, compared to the single-line genomic evaluation (SLE). Likewise, adding selected variants from the GWAS to the commercial SNP chip resulted in a maximum increase of 0.02 in the prediction accuracy, only for average daily feed intake in the most numerous lines. In addition, no benefits were observed when using preselected sequence variants in multi-line genomic predictions. Weights from BayesR did not help improve the performance of ssGBLUP. This study revealed limited benefits of using preselected whole-genome sequence variants for multi-line genomic predictions, even when tens of thousands of animals had imputed sequence data. Correctly accounting for line differences with UPG or MF in MLE is essential to obtain predictions similar to SLE; however, the only observed benefit of an MLE is to have comparable predictions across lines. Further investigation into the amount of data and novel methods to preselect whole-genome causative variants in combined populations would be of significant interest.

Collapse

Zhang R, Zhang Y, Liu T, Jiang B, Li Z, Qu Y, Chen Y, Li Z. Utilizing Variants Identified with Multiple Genome-Wide Association Study Methods Optimizes Genomic Selection for Growth Traits in Pigs. Animals (Basel) 2023;13:ani13040722. [PMID: 36830509 PMCID: PMC9952664 DOI: 10.3390/ani13040722] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Revised: 02/09/2023] [Accepted: 02/15/2023] [Indexed: 02/22/2023] Open

Jones HE, Wilson PB. Progress and opportunities through use of genomics in animal production. Trends Genet 2022;38:1228-1252. [PMID: 35945076 DOI: 10.1016/j.tig.2022.06.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 06/08/2022] [Accepted: 06/17/2022] [Indexed: 01/24/2023]

Ribeiro G, Baldi F, Cesar ASM, Alexandre PA, Peripolli E, Ferraz JBS, Fukumasu H. Detection of potential functional variants based on systems-biology: the case of feed efficiency in beef cattle. BMC Genomics 2022;23:774. [PMID: 36434498 PMCID: PMC9700932 DOI: 10.1186/s12864-022-08958-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 10/20/2022] [Indexed: 11/26/2022] Open

Abstract

BACKGROUND

Potential functional variants (PFVs) can be defined as genetic variants responsible for a given phenotype. Ultimately, these are the best DNA markers for animal breeding and selection, especially for polygenic and complex phenotypes. Herein, we described the identification of PFVs for complex phenotypes (in this case, Feed Efficiency in beef cattle) using a systems-biology driven approach based on RNA-seq data from physiologically relevant organs.

RESULTS

The systems-biology coupled with deep molecular phenotyping by RNA-seq of liver, muscle, hypothalamus, pituitary, and adrenal glands of animals with high and low feed efficiency (FE) measured by residual feed intake (RFI) identified 2,000,936 uniquely variants. Among them, 9986 variants were significantly associated with FE and only 78 had a high impact on protein expression and were considered as PFVs. A set of 169 significant uniquely variants were expressed in all five organs, however, only 27 variants had a moderate impact and none of them a had high impact on protein expression. These results provide evidence of tissue-specific effects of high-impact PFVs. The PFVs were enriched (FDR < 0.05) for processing and presentation of MHC Class I and II mediated antigens, which are an important part of the adaptive immune response. The experimental validation of these PFVs was demonstrated by the increased prediction accuracy for RFI using the weighted G matrix (ssGBLUP+wG; Acc = 0.10 and b = 0.48) obtained in the ssGWAS in comparison to the unweighted G matrix (ssGBLUP; Acc = 0.29 and b = 1.10).

CONCLUSION

Here we identified PFVs for FE in beef cattle using a strategy based on systems-biology and deep molecular phenotyping. This approach has great potential to be used in genetic prediction programs, especially for polygenic phenotypes.

Collapse

Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. GENETICS SELECTION EVOLUTION 2022;54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]

Abstract

Background

Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage.

Methods

We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests.

Results

The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected.

Conclusions

Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12711-022-00756-0.

Collapse

Genomic Selection in Chinese Holsteins Using Regularized Regression Models for Feature Selection of Whole Genome Sequencing Data. Animals (Basel) 2022;12:ani12182419. [PMID: 36139283 PMCID: PMC9495168 DOI: 10.3390/ani12182419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/09/2022] [Accepted: 09/12/2022] [Indexed: 11/30/2022] Open

Abstract

Simple Summary

Genomic selection (GS) is increasingly widely used in animal breeding, owing to its high efficiency in the genetic improvement of economic traits. In China, GS has been implemented for genetic evaluation of young bulls in dairy cattle breeding programs since 2012. GS is commonly based on single nucleotide polymorphism (SNP) chips. The cost of whole genome sequencing (WGS) has decreased tremendously in recent years, allowing increased studies of WGS-based GS. In this study, based on the imputed WGS data of approximately 8000 Chinese Holsteins, we investigated the performance of GS of milk production traits using the feature selection method of regularized regression. The results showed that WGS-based GS using regularized regression models and the commonly used linear mixed models achieved comparable prediction accuracies. For milk and protein yields, GS using a combination of SNPs selected with a regularized regression model and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with a linear mixed model combined with 50K SNP chip data performed best for fat yield. The proposed method of GS based on WGS data, i.e., feature selection using regularization regression models, provides a valuable novel strategy for genomic selection.

Abstract

Genomic selection (GS) is an efficient method to improve genetically economic traits. Feature selection is an important method for GS based on whole-genome sequencing (WGS) data. We investigated the prediction performance of GS of milk production traits using imputed WGS data on 7957 Chinese Holsteins. We used two regularized regression models, least absolute shrinkage and selection operator (LASSO) and elastic net (EN) for feature selection. For comparison, we performed genome-wide association studies based on a linear mixed model (LMM), and the N single nucleotide polymorphisms (SNPs) with the lowest p-values were selected (LMM_LASSO and LMM_EN), where N was the number of non-zero effect SNPs selected by LASSO or EN. GS was conducted using a genomic best linear unbiased prediction (GBLUP) model and several sets of SNPs: (1) selected WGS SNPs; (2) 50K SNP chip data; (3) WGS data; and (4) a combined set of selected WGS SNPs and 50K SNP chip data. The results showed that the prediction accuracies of GS with features selected using LASSO or EN were comparable to those using features selected with LMM_LASSO or LMM_EN. For milk and protein yields, GS using a combination of SNPs selected with LASSO and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with LMM_LASSO combined with 50K SNP chip data performed best for fat yield. The proposed method, feature selection using regularization regression models, provides a valuable novel strategy for WGS-based GS.

Collapse

Bolormaa S, MacLeod IM, Khansefid M, Marett LC, Wales WJ, Miglior F, Baes CF, Schenkel FS, Connor EE, Manzanilla-Pech CIV, Stothard P, Herman E, Nieuwhof GJ, Goddard ME, Pryce JE. Sharing of either phenotypes or genetic variants can increase the accuracy of genomic prediction of feed efficiency. Genet Sel Evol 2022;54:60. [PMID: 36068488 PMCID: PMC9450441 DOI: 10.1186/s12711-022-00749-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 08/17/2022] [Indexed: 11/16/2022] Open

Abstract

BACKGROUND

Sharing individual phenotype and genotype data between countries is complex and fraught with potential errors, while sharing summary statistics of genome-wide association studies (GWAS) is relatively straightforward, and thus would be especially useful for traits that are expensive or difficult-to-measure, such as feed efficiency. Here we examined: (1) the sharing of individual cow data from international partners; and (2) the use of sequence variants selected from GWAS of international cow data to evaluate the accuracy of genomic estimated breeding values (GEBV) for residual feed intake (RFI) in Australian cows.

RESULTS

GEBV for RFI were estimated using genomic best linear unbiased prediction (GBLUP) with 50k or high-density single nucleotide polymorphisms (SNPs), from a training population of 3797 individuals in univariate to trivariate analyses where the three traits were RFI phenotypes calculated using 584 Australian lactating cows (AUSc), 824 growing heifers (AUSh), and 2526 international lactating cows (OVE). Accuracies of GEBV in AUSc were evaluated by either cohort-by-birth-year or fourfold random cross-validations. GEBV of AUSc were also predicted using only the AUS training population with a weighted genomic relationship matrix constructed with SNPs from the 50k array and sequence variants selected from a meta-GWAS that included only international datasets. The genomic heritabilities estimated using the AUSc, OVE and AUSh datasets were moderate, ranging from 0.20 to 0.36. The genetic correlations (rg) of traits between heifers and cows ranged from 0.30 to 0.95 but were associated with large standard errors. The mean accuracies of GEBV in Australian cows were up to 0.32 and almost doubled when either overseas cows, or both overseas cows and AUS heifers were included in the training population. They also increased when selected sequence variants were combined with 50k SNPs, but with a smaller relative increase.

CONCLUSIONS

The accuracy of RFI GEBV increased when international data were used or when selected sequence variants were combined with 50k SNP array data. This suggests that if direct sharing of data is not feasible, a meta-analysis of summary GWAS statistics could provide selected SNPs for custom panels to use in genomic selection programs. However, since this finding is based on a small cross-validation study, confirmation through a larger study is recommended.

Collapse

Affiliation(s)

Sunduimijid Bolormaa Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
Iona M. MacLeod Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
Majid Khansefid Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
Leah C. Marett Agriculture Victoria Research, Ellinbank Centre, Ellinbank, Gippsland, VIC 3821 Australia School of Agriculture and Food, University of Melbourne, Parkville, VIC 3010 Australia
William J. Wales Agriculture Victoria Research, Ellinbank Centre, Ellinbank, Gippsland, VIC 3821 Australia School of Agriculture and Food, University of Melbourne, Parkville, VIC 3010 Australia
Filippo Miglior LACTANET, Sainte-Anne-de-Bellevue, QC H9X 3R4 Canada CGIL, University of Guelph, Guelph, ON N1G 2W1 Canada
Christine F. Baes CGIL, University of Guelph, Guelph, ON N1G 2W1 Canada Institute of Genetics, Vetsuisse Faculty, University of Bern, 3002 Bern, Switzerland
Flavio S. Schenkel CGIL, University of Guelph, Guelph, ON N1G 2W1 Canada
Erin E. Connor Animal Genomics and Improvement Laboratory, USDA, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD 20705 USA Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716 USA
Coralia I. V. Manzanilla-Pech Center for Quantitative Genetics and Genomics, Aarhus University, Blichers Alle 20, 8830 Tjele, Denmark
Paul Stothard Faculty of Agricultural, Life & Environmental Sciences, University of Alberta, Edmonton, AB T6G 2R3 Canada
Emily Herman Faculty of Agricultural, Life & Environmental Sciences, University of Alberta, Edmonton, AB T6G 2R3 Canada
Gert J. Nieuwhof Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia DataGene Ltd, Agribio, Bundoora, VIC 3083 Australia
Michael E. Goddard Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia School of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, VIC 3052 Australia
Jennie E. Pryce Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083 Australia

Collapse

Du A, Zhao F, Liu Y, Xu L, Chen K, Sun D, Han B. Genetic polymorphisms of PKLR gene and their associations with milk production traits in Chinese Holstein cows. Front Genet 2022;13:1002706. [PMID: 36118870 PMCID: PMC9479125 DOI: 10.3389/fgene.2022.1002706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 08/12/2022] [Indexed: 11/13/2022] Open

Reich P, Falker-Gieske C, Pook T, Tetens J. Development and validation of a horse reference panel for genotype imputation. Genet Sel Evol 2022;54:49. [PMID: 35787788 PMCID: PMC9252005 DOI: 10.1186/s12711-022-00740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open

Abstract

Background

Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses.

Results

Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation.

Conclusions

The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12711-022-00740-8.

Collapse

Zhang Q, Zhang Q, Jensen J. Association Studies and Genomic Prediction for Genetic Improvements in Agriculture. FRONTIERS IN PLANT SCIENCE 2022;13:904230. [PMID: 35720549 PMCID: PMC9201771 DOI: 10.3389/fpls.2022.904230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 05/16/2022] [Indexed: 06/15/2023]

van den Berg I, Ho PN, Nguyen TV, Haile-Mariam M, Luke TDW, Pryce JE. Using mid-infrared spectroscopy to increase GWAS power to detect QTL associated with blood urea nitrogen. Genet Sel Evol 2022;54:27. [PMID: 35436852 PMCID: PMC9014603 DOI: 10.1186/s12711-022-00719-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 04/05/2022] [Indexed: 11/20/2022] Open

Yoshida GM, Yáñez JM. Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data. Evol Appl 2022;15:537-552. [PMID: 35505881 PMCID: PMC9046923 DOI: 10.1111/eva.13240] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 04/01/2021] [Accepted: 04/03/2021] [Indexed: 02/07/2023] Open

Abstract

Through imputation of genotypes, genome-wide association study (GWAS) and genomic prediction (GP) using whole-genome sequencing (WGS) data are cost-efficient and feasible in aquaculture breeding schemes. The objective was to dissect the genetic architecture of growth traits under chronic heat stress in rainbow trout (Oncorhynchus mykiss) and to assess the accuracy of GP based on imputed WGS and different preselected single nucleotide polymorphism (SNP) arrays. A total of 192 and 764 fish challenged to a heat stress experiment for 62 days were genotyped using a customized 1 K and 26 K SNP panels, respectively, and then, genotype imputation was performed from a low-density chip to WGS using 102 parents (36 males and 66 females) as the reference population. Imputed WGS data were used to perform GWAS and test GP accuracy under different preselected SNP scenarios. Heritability was estimated for body weight (BW), body length (BL) and average daily gain (ADG). Estimates using imputed WGS data ranged from 0.33 ± 0.05 to 0.55 ± 0.05 for growth traits under chronic heat stress. GWAS revealed that the top five cumulatively SNPs explained a maximum of 0.94%, 0.86% and 0.51% of genetic variance for BW, BL and ADG, respectively. Some important functional candidate genes associated with growth-related traits were found among the most important SNPs, including signal transducer and activator of transcription 5B and 3 (STAT5B and STAT3, respectively) and cytokine-inducible SH2-containing protein (CISH). WGS data resulted in a slight increase in prediction accuracy compared with pedigree-based method, whereas preselected SNPs based on the top GWAS hits improved prediction accuracies, with values ranging from 1.2 to 13.3%. Our results support the evidence of the polygenic nature of growth traits when measured under heat stress. The accuracies of GP can be improved using preselected variants from GWAS, and the use of WGS marginally increases prediction accuracy.

Collapse

Misztal I, Steyn Y, Lourenco D. Genomic evaluation with multibreed and crossbred data. JDS COMMUNICATIONS 2022;3:156-159. [PMID: 36339739 PMCID: PMC9623721 DOI: 10.3168/jdsc.2021-0177] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 11/21/2021] [Indexed: 11/19/2022]

Cheruiyot EK, Haile-Mariam M, Cocks BG, MacLeod IM, Mrode R, Pryce JE. Functionally prioritised whole-genome sequence variants improve the accuracy of genomic prediction for heat tolerance. Genet Sel Evol 2022;54:17. [PMID: 35183109 PMCID: PMC8858496 DOI: 10.1186/s12711-022-00708-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 02/03/2022] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Heat tolerance is a trait of economic importance in the context of warm climates and the effects of global warming on livestock production, reproduction, health, and well-being. This study investigated the improvement in prediction accuracy for heat tolerance when selected sets of sequence variants from a large genome-wide association study (GWAS) were combined with a standard 50k single nucleotide polymorphism (SNP) panel used by the dairy industry.

METHODS

Over 40,000 dairy cattle with genotype and phenotype data were analysed. The phenotypes used to measure an individual's heat tolerance were defined as the rate of decline in milk production traits with rising temperature and humidity. We used Holstein and Jersey cows to select sequence variants linked to heat tolerance. The prioritised sequence variants were the most significant SNPs passing a GWAS p-value threshold selected based on sliding 100-kb windows along each chromosome. We used a bull reference set to develop the genomic prediction equations, which were then validated in an independent set of Holstein, Jersey, and crossbred cows. Prediction analyses were performed using the BayesR, BayesRC, and GBLUP methods.

RESULTS

The accuracy of genomic prediction for heat tolerance improved by up to 0.07, 0.05, and 0.10 units in Holstein, Jersey, and crossbred cows, respectively, when sets of selected sequence markers from Holstein cows were added to the 50k SNP panel. However, in some scenarios, the prediction accuracy decreased unexpectedly with the largest drop of - 0.10 units for the heat tolerance fat yield trait observed in Jersey cows when 50k plus pre-selected SNPs from Holstein cows were used. Using pre-selected SNPs discovered on a combined set of Holstein and Jersey cows generally improved the accuracy, especially in the Jersey validation. In addition, combining Holstein and Jersey bulls in the reference set generally improved prediction accuracy in most scenarios compared to using only Holstein bulls as the reference set.

CONCLUSIONS

Informative sequence markers can be prioritised to improve the genomic prediction of heat tolerance in different breeds. In addition to providing biological insight, these variants could also have a direct application for developing customized SNP arrays or can be used via imputation in current industry SNP panels.

Collapse

Cai Z, Christensen OF, Lund MS, Ostersen T, Sahana G. Large-scale association study on daily weight gain in pigs reveals overlap of genetic factors for growth in humans. BMC Genomics 2022;23:133. [PMID: 35168569 PMCID: PMC8845347 DOI: 10.1186/s12864-022-08373-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 02/08/2022] [Indexed: 01/10/2023] Open

Abstract

Background

Imputation from genotyping array to whole-genome sequence variants using resequencing of representative reference populations enhances our ability to map genetic factors affecting complex phenotypes in livestock species. The accumulation of knowledge about gene function in human and laboratory animals can provide substantial advantage for genomic research in livestock species.

Results

In this study, 201,388 pigs from three commercial Danish breeds genotyped with low to medium (8.5k to 70k) SNP arrays were imputed to whole genome sequence variants using a two-step approach. Both imputation steps achieved high accuracies, and in total this yielded 26,447,434 markers on 18 autosomes. The average estimated imputation accuracy of markers with minor allele frequency ≥ 0.05 was 0.94. To overcome the memory consumption of running genome-wide association study (GWAS) for each breed, we performed within-breed subpopulation GWAS then within-breed meta-analysis for average daily weight gain (ADG), followed by a multi-breed meta-analysis of GWAS summary statistics. We identified 15 quantitative trait loci (QTL). Our post-GWAS analysis strategy to prioritize of candidate genes including information like gene ontology, mammalian phenotype database, differential expression gene analysis of high and low feed efficiency pig and human GWAS catalog for height, obesity, and body mass index, we proposed MRAP2, LEPROT, PMAIP1, ENSSSCG00000036234, BMP2, ELFN1, LIG4 and FAM155A as the candidate genes with biological support for ADG in pigs.

Conclusion

Our post-GWAS analysis strategy helped to identify candidate genes not just by distance to the lead SNP but also by multiple sources of biological evidence. Besides, the identified QTL overlap with genes which are known for their association with human growth-related traits. The GWAS with this large data set showed the power to map the genetic factors associated with ADG in pigs and have added to our understanding of the genetics of growth across mammalian species.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12864-022-08373-3.

Collapse

Teng J, Zhao C, Wang D, Chen Z, Tang H, Li J, Mei C, Yang Z, Ning C, Zhang Q. Assessment of the performance of different imputation methods for low-coverage sequencing in Holstein cattle. J Dairy Sci 2022;105:3355-3366. [DOI: 10.3168/jds.2021-21360] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 12/13/2021] [Indexed: 12/27/2022]

Guillenea A, Su G, Lund MS, Karaman E. Genomic prediction in Nordic Red dairy cattle considering breed origin of alleles. J Dairy Sci 2022;105:2426-2438. [PMID: 35033341 DOI: 10.3168/jds.2021-21173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 11/23/2021] [Indexed: 01/02/2023]

Abstract

This study investigated the reliability of genomic prediction (GP) using breed origin of alleles (BOA) approach in the Nordic Red (RDC) population, which has an admixed population structure. The RDC population consists of animals with varying degrees of genetic materials from the Danish Red (RDM), Swedish Red (SRB), Finnish Ayrshire (FAY), and Holstein (HOL) because bulls have been used across the breeds. The BOA approach was tested using 39,550 RDC animals in the reference population and 11,786 in the validation population. Deregressed proofs (DRP) of milk, fat and protein were used as response variable for GP. Direct genomic breeding values (DGV) for animals in the validation population were calculated with (BOA model) or without (joint model) considering breed origin of alleles. The joint model assumed homogeneous marker effects and a single set of marker effects were estimated, whereas BOA model assumed heterogeneous marker effects, and different sets of marker effects were estimated across the breeds. For the BOA approach, we tested scenarios assuming both correlated (BOA_cor) and uncorrelated (BOA_uncor) marker effects between the breeds. Additionally, we investigated GP using a standard Illumina 50K chip and including SNP selected from imputed whole-genome sequencing (50K+WGS). We also studied the effect of estimating (co)variances for genome regions of different sizes to exploit the information of the genome regions contributing to the (co)variance between the breeds. Region sizes were set as 1 SNP, a group of 30 or 100 adjacent SNP, or the whole genome. Reliability of DGV was measured as squared correlations between DGV and DRP divided by the reliability of DRP. Across the 3 traits, in general, RS30 and RS100 SNP yielded the highest reliabilities. Including WGS SNP improved reliabilities in almost all scenarios (0.297 on average for 50K and 0.307 on average for 50K+WGS). The BOA_uncor (0.233 on average) was inferior to the joint model (0.339 on average), but the reliabilities obtained using BOA_cor (0.334 on average) in most cases were not significantly different from those obtained using the joint model. The results indicate that both including additional whole-genome sequencing SNP and dividing the genome into fixed regions improve GP in the RDC. The BOA models have the potential to increase the reliability of GP, but the benefit is limited in populations with a high exchange of genetic material for a long time, as is the case for RDC.

Collapse

Bedhane M, van der Werf J, de las Heras-Saldana S, Lim D, Park B, Na Park M, Seung Hee R, Clark S. The accuracy of genomic prediction for meat quality traits in Hanwoo cattle when using genotypes from different SNP densities and preselected variants from imputed whole genome sequence. ANIMAL PRODUCTION SCIENCE 2022. [DOI: 10.1071/an20659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abstract Context Genomic prediction is the use of genomic data in the estimation of genomic breeding values (GEBV) in animal breeding. In beef cattle breeding programs, genomic prediction increases the rates of genetic gain by increasing the accuracy of selection at earlier ages. Aims The objectives of the study were to examine the effect of single-nucleotide polymorphism (SNP) density and to evaluate the effect of using SNPs preselected from imputed whole-genome sequence for genomic prediction. Methods Genomic and phenotypic data from 2110 Hanwoo steers were used to predict GEBV for marbling score (MS), meat texture (MT), and meat colour (MC) traits. Three types of SNP densities including 50k, high-density (HD), and whole-genome sequence data and preselected SNPs from genome-wide association study (GWAS) were used for genomic prediction analyses. Two scenarios (independent and dependent discovery populations) were used to select top significant SNPs. The accuracy of GEBV was assessed using random cross-validation. Genomic best linear unbiased prediction (GBLUP) was used to predict the breeding values for each trait. Key results Our result showed that very similar prediction accuracies were observed across all SNP densities used in the study. The prediction accuracy among traits ranged from 0.29±0.05 for MC to 0.46±0.04 for MS. Depending on the studied traits, up to 5% of prediction accuracy improvement was obtained when the preselected SNPs from GWAS analysis were included in the prediction analysis. Conclusions High SNP density such as HD and the whole-genome sequence data yielded a similar prediction accuracy in Hanwoo beef cattle. Therefore, the 50K SNP chip panel is sufficient to capture the relationships in a breed with a small effective population size such as the Hanwoo cattle population. Preselected variants improved prediction accuracy when they were included in the genomic prediction model. Implications The estimated genomic prediction accuracies are moderately accurate in Hanwoo cattle and for searching for SNPs that are more productive could increase the accuracy of estimated breeding values for the studied traits. Collapse

Mesbah-Uddin M, Guldbrandtsen B, Capitan A, Lund MS, Boichard D, Sahana G. Genome-wide association study with imputed whole-genome sequence variants including large deletions for female fertility in 3 Nordic dairy cattle breeds. J Dairy Sci 2021;105:1298-1313. [PMID: 34955274 DOI: 10.3168/jds.2021-20655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Accepted: 09/22/2021] [Indexed: 11/19/2022]

Zhang Z, Ma P, Zhang Z, Wang Z, Wang Q, Pan Y. The construction of a haplotype reference panel using extremely low coverage whole genome sequences and its application in genome-wide association studies and genomic prediction in Duroc pigs. Genomics 2021;114:340-350. [PMID: 34929285 DOI: 10.1016/j.ygeno.2021.12.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 10/11/2021] [Accepted: 12/15/2021] [Indexed: 12/30/2022]

Pilon B, Hinterneder K, Hay EHA, Fragomeni B. Inbreeding Calculated with Runs of Homozygosity Suggests Chromosome-Specific Inbreeding Depression Regions in Line 1 Hereford. Animals (Basel) 2021;11:ani11113105. [PMID: 34827837 PMCID: PMC8614356 DOI: 10.3390/ani11113105] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/19/2021] [Accepted: 10/25/2021] [Indexed: 01/12/2023] Open

Yang R, Xu Z, Wang Q, Zhu D, Bian C, Ren J, Huang Z, Zhu X, Tian Z, Wang Y, Jiang Z, Zhao Y, Zhang D, Li N, Hu X. Genome‑wide association study and genomic prediction for growth traits in yellow-plumage chicken using genotyping-by-sequencing. Genet Sel Evol 2021;53:82. [PMID: 34706641 PMCID: PMC8555081 DOI: 10.1186/s12711-021-00672-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/08/2021] [Indexed: 12/25/2022] Open

Abstract

Background

Growth traits are of great importance for poultry breeding and production and have been the topic of extensive investigation, with many quantitative trait loci (QTL) detected. However, due to their complex genetic background, few causative genes have been confirmed and the underlying molecular mechanisms remain unclear, thus limiting our understanding of QTL and their potential use for the genetic improvement of poultry. Therefore, deciphering the genetic architecture is a promising avenue for optimising genomic prediction strategies and exploiting genomic information for commercial breeding. The objectives of this study were to: (1) conduct a genome-wide association study to identify key genetic factors and explore the polygenicity of chicken growth traits; (2) investigate the efficiency of genomic prediction in broilers; and (3) evaluate genomic predictions that harness genomic features.

Results

We identified five significant QTL, including one on chromosome 4 with major effects and four on chromosomes 1, 2, 17, and 27 with minor effects, accounting for 14.5 to 34.1% and 0.2 to 2.6% of the genomic additive genetic variance, respectively, and 23.3 to 46.7% and 0.6 to 4.5% of the observed predictive accuracy of breeding values, respectively. Further analysis showed that the QTL with minor effects collectively had a considerable influence, reflecting the polygenicity of the genetic background. The accuracy of genomic best linear unbiased predictions (BLUP) was improved by 22.0 to 70.3% compared to that of the conventional pedigree-based BLUP model. The genomic feature BLUP model further improved the observed prediction accuracy by 13.8 to 15.2% compared to the genomic BLUP model.

Conclusions

A major QTL and four minor QTL were identified for growth traits; the remaining variance was due to QTL effects that were too small to be detected. The genomic BLUP and genomic feature BLUP models yielded considerably higher prediction accuracy compared to the pedigree-based BLUP model. This study revealed the polygenicity of growth traits in yellow-plumage chickens and demonstrated that the predictive ability can be greatly improved by using genomic information and related features.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12711-021-00672-9.

Collapse

Affiliation(s)

Ruifei Yang State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
Zhenqiang Xu Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
Qi Wang State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Di Zhu State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Cheng Bian State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Jiangli Ren State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Zhuolin Huang State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Xiaoning Zhu State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Zhixin Tian State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Yuzhe Wang State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Ziqin Jiang Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
Yiqiang Zhao State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Dexiang Zhang Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China.
Ning Li State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
Xiaoxiang Hu State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.

Collapse

Mollandin F, Rau A, Croiseau P. An evaluation of the predictive performance and mapping power of the BayesR model for genomic prediction. G3 GENES|GENOMES|GENETICS 2021;11:6317672. [PMID: 34849780 PMCID: PMC8527474 DOI: 10.1093/g3journal/jkab225] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/27/2021] [Indexed: 12/02/2022]

Ling AS, Hay EH, Aggrey SE, Rekaya R. Dissection of the impact of prioritized QTL-linked and -unlinked SNP markers on the accuracy of genomic selection¹. BMC Genom Data 2021;22:26. [PMID: 34380418 PMCID: PMC8356450 DOI: 10.1186/s12863-021-00979-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 07/18/2021] [Indexed: 12/01/2022] Open

Abstract

Background

Use of genomic information has resulted in an undeniable improvement in prediction accuracies and an increase in genetic gain in animal and plant genetic selection programs in spite of oversimplified assumptions about the true biological processes. Even for complex traits, a large portion of markers do not segregate with or effectively track genomic regions contributing to trait variation; yet it is not clear how genomic prediction accuracies are impacted by such potentially nonrelevant markers. In this study, a simulation was carried out to evaluate genomic predictions in the presence of markers unlinked with trait-relevant QTL. Further, we compared the ability of the population statistic F_ST and absolute estimated marker effect as preselection statistics to discriminate between linked and unlinked markers and the corresponding impact on accuracy.

Results

We found that the accuracy of genomic predictions decreased as the proportion of unlinked markers used to calculate the genomic relationships increased. Using all, only linked, and only unlinked marker sets yielded prediction accuracies of 0.62, 0.89, and 0.22, respectively. Furthermore, it was found that prediction accuracies are severely impacted by unlinked markers with large spurious associations. F_ST-preselected marker sets of 10 k and larger yielded accuracies 8.97 to 17.91% higher than those achieved using preselection by absolute estimated marker effects, despite selecting 5.1 to 37.7% more unlinked markers and explaining 2.4 to 5.0% less of the genetic variance. This was attributed to false positives selected by absolute estimated marker effects having a larger spurious association with the trait of interest and more negative impact on predictions. The Pearson correlation between F_ST scores and absolute estimated marker effects was 0.77 and 0.27 among only linked and only unlinked markers, respectively. The sensitivity of F_ST scores to detect truly linked markers is comparable to absolute estimated marker effects but the consistency between the two statistics regarding false positives is weak.

Conclusion

Identification and exclusion of markers that have little to no relevance to the trait of interest may significantly increase genomic prediction accuracies. The population statistic F_ST presents an efficient and effective tool for preselection of trait-relevant markers.

Collapse

Moghaddar N, Brown DJ, Swan AA, Gurman PM, Li L, van der Werf JH. Genomic prediction in a numerically small breed population using prioritized genetic markers from whole-genome sequence data. J Anim Breed Genet 2021;139:71-83. [PMID: 34374454 DOI: 10.1111/jbg.12638] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/08/2021] [Accepted: 07/19/2021] [Indexed: 11/30/2022]

Abstract

The objective of this study was to investigate the accuracy of genomic prediction of body weight and eating quality traits in a numerically small sheep population (Dorper sheep). Prediction was based on a large multi-breed/admixed reference population and using (a) 50k or 500k single nucleotide polymorphism (SNP) genotypes, (b) imputed whole-genome sequencing data (~31 million), (c) selected SNPs from whole genome sequence data and (d) 50k SNP genotypes plus selected SNPs from whole-genome sequence data. Furthermore, the impact of using a breed-adjusted genomic relationship matrix on accuracy of genomic breeding value was assessed. The selection of genetic variants was based on an association study performed on imputed whole-genome sequence data in an independent population, which was chosen either randomly from the base population or according to higher genetic proximity to the target population. Genomic prediction was based on genomic best linear unbiased prediction (GBLUP), and the accuracy of genomic prediction was assessed according to the correlation between genomic breeding value and corrected phenotypes divided by the square root of trait heritability. The accuracy of genomic prediction was between 0.20 and 0.30 across different traits based on common 50k SNP genotypes, which improved on average by 0.06 (absolute value) on average based on using prioritized genetic markers from whole-genome sequence data. Using prioritized genetic markers from a genetically more related GWAS population resulted in slightly higher prediction accuracy (0.02 absolute value) compared to genetic markers derived from a random GWAS population. Using high-density SNP genotypes or imputed whole-genome sequence data in GBLUP showed almost no improvement in genomic prediction accuracy however, accounting for different marker allele frequencies in reference population according to a breed-adjusted GRM resulted to on average 0.024 (absolute value) increase in accuracy of genomic prediction.

Collapse

Emebiri L, Hildebrand S, Tan MK, Juliana P, Singh PK, Fuentes-Davila G, Singh RP. Pre-emptive Breeding Against Karnal Bunt Infection in Common Wheat: Combining Genomic and Agronomic Information to Identify Suitable Parents. FRONTIERS IN PLANT SCIENCE 2021;12:675859. [PMID: 34394138 PMCID: PMC8358121 DOI: 10.3389/fpls.2021.675859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 05/18/2021] [Indexed: 06/13/2023]

Abstract

Wheat (Triticum aestivum L.) is the most widely grown cereal crop in the world and is staple food to half the world's population. The current world population is expected to reach 9.8 billion people by 2050, but food production is not expected to keep pace with demand in developing countries. Significant opportunities exist for traditional grain exporters to produce and export greater amounts of wheat to fill the gap. Karnal bunt, however, is a major threat, due to its use as a non-tariff trade barrier by several wheat-importing countries. The cultivation of resistant varieties remains the most cost-effective approach to manage the disease, but in countries that are free of the disease, genetic improvement is difficult due to quarantine restrictions. Here we report a study on pre-emptive breeding designed to identify linked molecular markers, evaluate the prospects of genomic selection as a tool, and prioritise wheat genotypes suitable for use as parents. In a genome-wide association (GWAS) study, we identified six DArTseq markers significantly linked to Karnal bunt resistance, which explained between 7.6 and 29.5% of the observed phenotypic variation. The accuracy of genomic prediction was estimated to vary between 0.53 and 0.56, depending on whether it is based solely on the identified Quantitative trait loci (QTL) markers or the use of genome-wide markers. As genotypes used as parents would be required to possess good yield and phenology, further research was conducted to assess the agronomic value of Karnal bunt resistant germplasm from the International Maize and Wheat Improvement Center (CIMMYT). We identified an ideal genotype, ZVS13_385, which possessed similar agronomic attributes to the highly successful Australian wheat variety, Mace. It is phenotypically resistant to Karnal bunt infection (<1% infection) and carried all the favourable alleles detected for resistance in this study. The identification of a genotype combining Karnal bunt resistance with adaptive agronomic traits overcomes the concerns of breeders regarding yield penalty in the absence of the disease.

Collapse

Gebreyesus G, Lund MS, Sahana G, Su G. Reliabilities of Genomic Prediction for Young Stock Survival Traits Using 54K SNP Chip Augmented With Additional Single-Nucleotide Polymorphisms Selected From Imputed Whole-Genome Sequencing Data. Front Genet 2021;12:667300. [PMID: 34349779 PMCID: PMC8326759 DOI: 10.3389/fgene.2021.667300] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open

Abstract

This study investigated effects of integrating single-nucleotide polymorphisms (SNPs) selected based on previous genome-wide association studies (GWASs), from imputed whole-genome sequencing (WGS) data, in the conventional 54K chip on genomic prediction reliability of young stock survival (YSS) traits in dairy cattle. The WGS SNPs included two groups of SNP sets that were selected based on GWAS in the Danish Holstein for YSS index (YSS_SNPs, n = 98) and SNPs chosen as peaks of quantitative trait loci for the traits of Nordic total merit index in Denmark–Finland–Sweden dairy cattle populations (DFS_SNPs, n = 1,541). Additionally, the study also investigated the possibility of improving genomic prediction reliability for survival traits by modeling the SNPs within recessive lethal haplotypes (LET_SNP, n = 130) detected from the 54K chip in the Nordic Holstein. De-regressed proofs (DRPs) were obtained from 6,558 Danish Holstein bulls genotyped with either 54K chip or customized LD chip that includes SNPs in the standard LD chip and some of the selected WGS SNPs. The chip data were subsequently imputed to 54K SNP together with the selected WGS SNPs. Genomic best linear unbiased prediction (GBLUP) models were implemented to predict breeding values through either pooling the 54K and selected WGS SNPs together as one genetic component (a one-component model) or considering 54K SNPs and selected WGS SNPs as two separate genetic components (a two-component model). Across all the traits, inclusion of each of the selected WGS SNP sets led to negligible improvements in prediction accuracies (0.17 percentage points on average) compared to prediction using only 54K. Similarly, marginal improvement in prediction reliability was obtained when all the selected WGS SNPs were included (0.22 percentage points). No further improvement in prediction reliability was observed when considering random regression on genotype code of recessive lethal alleles in the model including both groups of the WGS SNPs. Additionally, there was no difference in prediction reliability from integrating the selected WGS SNP sets through the two-component model compared to the one-component GBLUP.

Collapse

Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle. Animals (Basel) 2021;11:ani11071992. [PMID: 34359120 PMCID: PMC8300388 DOI: 10.3390/ani11071992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/27/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022] Open

Abstract

Simple Summary

The usefulness of genomic prediction (GP) has been widely proofed by breeding analysis in livestock, plants and aquatic populations. It is well known that ‘marker density’ is a critical factor that affects the accuracy of GP, however, how to properly measure ‘marker density’ in GP is yet to be determined. With population-level whole-genome sequence data or high-density single nucleotide polymorphism (SNP) data available, this question seems to be answered more convincingly. In this study, we investigated and discussed the impact of four ‘marker density’ measures that reflect genetic or physical distances between SNPs on the accuracy of GP in a Germany Holstein dairy cattle population. Our results showed that the degree of variation of physical distance between adjacent SNPs had significant effects on the accuracy of GP, while the genetic distance between SNPs had no relationship with the accuracy of GP. Therefore, for studies based on high-density SNP data, the default strategy of pruning SNPs based on genetic distance is detrimental to heritability estimation and genomic prediction. The results extended the communities knowledge of ‘marker density’ and provided useful suggestions for the application and research on genome prediction.

Abstract

With the availability of high-density single-nucleotide polymorphism (SNP) data and the development of genotype imputation methods, high-density panel-based genomic prediction (GP) has become possible in livestock breeding. It is generally considered that the genomic estimated breeding value (GEBV) accuracy increases with the marker density, while studies have shown that the GEBV accuracy does not increase or even decrease when high-density panels were used. Therefore, in addition to the SNP number, other measurements of ‘marker density’ seem to have impacts on the GEBV accuracy, and exploring the relationship between the GEBV accuracy and the measurements of ‘marker density’ based on high-density SNP or whole-genome sequence data is important for the field of GP. In this study, we constructed different SNP panels with certain SNP numbers (e.g., 1 k) by using the physical distance (PhyD), genetic distance (GenD) and random distance (RanD) between SNPs respectively based on the high-density SNP data of a Germany Holstein dairy cattle population. Therefore, there are three different panels at a certain SNP number level. These panels were used to construct GP models to predict fat percentage, milk yield and somatic cell score. Meanwhile, the mean (d¯) and variance (σd2) of the physical distance between SNPs and the mean (r2¯) and variance (σr22) of the genetic distance between SNPs in each panel were used as marker density-related measurements and their influence on the GEBV accuracy was investigated. At the same SNP number level, the d¯ of all panels is basically the same, but the σd2, r2¯ and σr22 are different. Therefore, we only investigated the effects of σd2, r2¯ and σr22 on the GEBV accuracy. The results showed that at a certain SNP number level, the GEBV accuracy was negatively correlated with σd2, but not with r2¯ and σr22. Compared with GenD and RanD, the σd2 of panels constructed by PhyD is smaller. The low and moderate-density panels (< 50 k) constructed by RanD or GenD have large σd2, which is not conducive to genomic prediction. The GEBV accuracy of the low and moderate-density panels constructed by PhyD is 3.8~34.8% higher than that of the low and moderate-density panels constructed by RanD and GenD. Panels with 20–30 k SNPs constructed by PhyD can achieve the same or slightly higher GEBV accuracy than that of high-density SNP panels for all three traits. In summary, the smaller the variation degree of physical distance between adjacent SNPs, the higher the GEBV accuracy. The low and moderate-density panels construct by physical distance are beneficial to genomic prediction, while pruning high-density SNP data based on genetic distance is detrimental to genomic prediction. The results provide suggestions for the development of SNP panels and the research of genome prediction based on whole-genome sequence data.

Collapse

Gebreyesus G, Lund MS, Kupisiewicz K, Su G. Genetic parameters of semen quality traits and genetic correlations with service sire nonreturn rate in Nordic Holstein bulls. J Dairy Sci 2021;104:10010-10019. [PMID: 34099302 DOI: 10.3168/jds.2021-20403] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 04/28/2021] [Indexed: 11/19/2022]

Lund TB, Gamborg C, Secher J, Sand E P. Danish dairy farmers' acceptance of and willingness to use semen from bulls produced by means of in vitro embryo production and genomic selection. J Dairy Sci 2021;104:8023-8038. [PMID: 33934865 DOI: 10.3168/jds.2020-19210] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 03/06/2021] [Indexed: 11/19/2022]

Abstract

A novel technology combining in vitro production and genomic embryo selection is currently under development in dairy cattle breeding. Adoption of this technology will probably accelerate genetic progress toward the main breeding goals of economic interest, as well as allow selection for traits of societal concern such as decreased methane emissions and improved animal welfare. However, dairy farmers, and especially organic farmers, could find the technology morally questionable and reject its use. This cross-sectional study surveyed Danish dairy farmers' general acceptance of the combined technology and their reported likelihood of using semen produced with it. Drawing on diffusion theory, a questionnaire was developed to examine the way farmers discover and communicate about new technological breeding options, and to measure the factors which predict acceptance and likelihood of adopting the technology. The questionnaire was sent to a randomly selected sample of organic and conventional dairy farmers in Denmark, and 85 organic and 71 conventional farmers (41% response rate) completed it. Seventy-six percent of farmers reported that they would be likely to use semen from bulls derived from the technology. A majority (61%) also found the technology acceptable, but many (33%) were unsure or undecided. Most farmers saw the technology as beneficial, but ethical reservations were aired by around a fifth of the farmers. There were no differences between organic and conventional farmers in likelihood of using, perceived utility, and ethical reservations about the technology. Self-reported idealistic organic farmers showed lower acceptance of the technology, but reported similar likelihood of using semen produced by it. Young farmers (20-39 yr) exhibited higher acceptance of the technology. Larger producers (in terms of number of cows) were more likely to report that they will use and accept the technology. We conclude that it is likely that semen from the technology combining in vitro production and genomic selection would be widely used by both organic and conventional farmers provided that costs can be kept low, and that there are advantages in terms of achieving breeding goals. Structural developments, growth in size of dairy farms, acceptance by young farmers, and the fact that economic incentives (and even ethical arguments) seem to favor the technology all point to this conclusion.

Collapse

Fernandes Júnior GA, Carvalheiro R, de Oliveira HN, Sargolzaei M, Costilla R, Ventura RV, Fonseca LFS, Neves HHR, Hayes BJ, de Albuquerque LG. Imputation accuracy to whole-genome sequence in Nellore cattle. Genet Sel Evol 2021;53:27. [PMID: 33711929 PMCID: PMC7953568 DOI: 10.1186/s12711-021-00622-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 03/05/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

A cost-effective strategy to explore the complete DNA sequence in animals for genetic evaluation purposes is to sequence key ancestors of a population, followed by imputation mechanisms to infer marker genotypes that were not originally reported in a target population of animals genotyped with single nucleotide polymorphism (SNP) panels. The feasibility of this process relies on the accuracy of the genotype imputation in that population, particularly for potential causal mutations which may be at low frequency and either within genes or regulatory regions. The objective of the present study was to investigate the imputation accuracy to the sequence level in a Nellore beef cattle population, including that for variants in annotation classes which are more likely to be functional.

METHODS

Information of 151 key sequenced Nellore sires were used to assess the imputation accuracy from bovine HD BeadChip SNP (~ 777 k) to whole-genome sequence. The choice of the sires aimed at optimizing the imputation accuracy of a genotypic database, comprised of about 10,000 genotyped Nellore animals. Genotype imputation was performed using two computational approaches: FImpute3 and Minimac4 (after using Eagle for phasing). The accuracy of the imputation was evaluated using a fivefold cross-validation scheme and measured by the squared correlation between observed and imputed genotypes, calculated by individual and by SNP. SNPs were classified into a range of annotations, and the accuracy of imputation within each annotation classification was also evaluated.

RESULTS

High average imputation accuracies per animal were achieved using both FImpute3 (0.94) and Minimac4 (0.95). On average, common variants (minor allele frequency (MAF) > 0.03) were more accurately imputed by Minimac4 and low-frequency variants (MAF ≤ 0.03) were more accurately imputed by FImpute3. The inherent Minimac4 Rsq imputation quality statistic appears to be a good indicator of the empirical Minimac4 imputation accuracy. Both software provided high average SNP-wise imputation accuracy for all classes of biological annotations.

CONCLUSIONS

Our results indicate that imputation to whole-genome sequence is feasible in Nellore beef cattle since high imputation accuracies per individual are expected. SNP-wise imputation accuracy is software-dependent, especially for rare variants. The accuracy of imputation appears to be relatively independent of annotation classification.

Collapse

Meuwissen T, van den Berg I, Goddard M. On the use of whole-genome sequence data for across-breed genomic prediction and fine-scale mapping of QTL. Genet Sel Evol 2021;53:19. [PMID: 33637049 PMCID: PMC7908738 DOI: 10.1186/s12711-021-00607-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Accepted: 01/25/2021] [Indexed: 11/10/2022] Open

Abstract

Background

Whole-genome sequence (WGS) data are increasingly available on large numbers of individuals in animal and plant breeding and in human genetics through second-generation resequencing technologies, 1000 genomes projects, and large-scale genotype imputation from lower marker densities. Here, we present a computationally fast implementation of a variable selection genomic prediction method, that could handle WGS data on more than 35,000 individuals, test its accuracy for across-breed predictions and assess its quantitative trait locus (QTL) mapping precision.

Methods

The Monte Carlo Markov chain (MCMC) variable selection model (Bayes GC) fits simultaneously a genomic best linear unbiased prediction (GBLUP) term, i.e. a polygenic effect whose correlations are described by a genomic relationship matrix (G), and a Bayes C term, i.e. a set of single nucleotide polymorphisms (SNPs) with large effects selected by the model. Computational speed is improved by a Metropolis–Hastings sampling that directs computations to the SNPs, which are, a priori, most likely to be included into the model. Speed is also improved by running many relatively short MCMC chains. Memory requirements are reduced by storing the genotype matrix in binary form. The model was tested on a WGS dataset containing Holstein, Jersey and Australian Red cattle. The data contained 4,809,520 genotypes on 35,549 individuals together with their milk, fat and protein yields, and fat and protein percentage traits.

Results

The prediction accuracies of the Jersey individuals improved by 1.5% when using across-breed GBLUP compared to within-breed predictions. Using WGS instead of 600 k SNP-chip data yielded on average a 3% accuracy improvement for Australian Red cows. QTL were fine-mapped by locating the SNP with the highest posterior probability of being included in the model. Various QTL known from the literature were rediscovered, and a new SNP affecting milk production was discovered on chromosome 20 at 34.501126 Mb. Due to the high mapping precision, it was clear that many of the discovered QTL were the same across the five dairy traits.

Conclusions

Across-breed Bayes GC genomic prediction improved prediction accuracies compared to GBLUP. The combination of across-breed WGS data and Bayesian genomic prediction proved remarkably effective for the fine-mapping of QTL.

Collapse

Al-Khudhair A, VanRaden PM, Null DJ, Li B. Marker selection and genomic prediction of economically important traits using imputed high-density genotypes for 5 breeds of dairy cattle. J Dairy Sci 2021;104:4478-4485. [PMID: 33612229 DOI: 10.3168/jds.2020-19260] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 11/22/2020] [Indexed: 11/19/2022]

Abstract

Marker sets used in US dairy genomic predictions were previously expanded by including high-density (HD) or sequence markers with the largest effects for Holstein breed only. Other non-Holstein breeds lacked enough HD genotyped animals to be used as a reference population at that time, and thus were not included in the genomic prediction. Recently, numbers of non-Holstein breeds genotyped using HD panels reached an acceptable level for imputation and marker selection, allowing HD genomic prediction and HD marker selection for Holstein plus 4 other breeds. Genotypes for 351,461 Holsteins, 347,570 Jerseys, 42,346 Brown Swiss, 9,364 Ayrshires (including Red dairy cattle), and 4,599 Guernseys were imputed to the HD marker list that included 643,059 SNP. The separate HD reference populations included Illumina BovineHD (San Diego, CA) genotypes for 4,012 Holsteins, 407 Jerseys, 181 Brown Swiss, 527 Ayrshires, and 147 Guernseys. The 643,059 variants included the HD SNP and all 79,254 (80K) genetic markers and QTL used in routine national genomic evaluations. Before imputation, approximately 91 to 97% of genotypes were unknown for each breed; after imputation, 1.1% of Holstein, 3.2% of Jersey, 6.7% of Brown Swiss, 4.8% of Ayrshire, and 4.2% of Guernsey alleles remained unknown due to lower density haplotypes that had no matching HD haplotype. The higher remaining missing rates in non-Holstein breeds are mainly due to fewer HD genotyped animals in the imputation reference populations. Allele effects for up to 39 traits were estimated separately within each breed using phenotypic reference populations that included up to 6,157 Jersey males and 110,130 Jersey females. Correlations of HD with 80K genomic predictions for young animals averaged 0.986, 0.989, 0.985, 0.992, and 0.978 for Jersey, Ayrshire, Brown Swiss, Guernsey, and Holstein breeds, respectively. Correlations were highest for yield traits (about 0.991) and lowest for foot angle and rear legs-side view (0.981and 0.982, respectively). Some HD effects were more than twice as large as the largest 80K SNP effect, and HD markers had larger effects than nearby 80K markers for many breed-trait combinations. Previous studies selected and included markers with large effects for Holstein traits; the newly selected HD markers should also improve non-Holstein and crossbred genomic predictions and were added to official US genomic predictions in April 2020.

Collapse

Lopez BIM, An N, Srikanth K, Lee S, Oh JD, Shin DH, Park W, Chai HH, Park JE, Lim D. Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle. Front Genet 2021;11:603822. [PMID: 33552124 PMCID: PMC7859490 DOI: 10.3389/fgene.2020.603822] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open

Abstract

Whole-genome sequence (WGS) data are increasingly being applied into genomic predictions, offering a higher predictive ability by including causal mutations or single-nucleotide polymorphisms (SNPs) putatively in strong linkage disequilibrium with causal mutations affecting the trait. This study aimed to improve the predictive performance of the customized Hanwoo 50 k SNP panel for four carcass traits in commercial Hanwoo population by adding highly predictive variants from sequence data. A total of 16,892 Hanwoo cattle with phenotypes (i.e., backfat thickness, carcass weight, longissimus muscle area, and marbling score), 50 k genotypes, and WGS imputed genotypes were used. We partitioned imputed WGS data according to functional annotation [intergenic (IGR), intron (ITR), regulatory (REG), synonymous (SYN), and non-synonymous (NSY)] to characterize the genomic regions that will deliver higher predictive power for the traits investigated. Animals were assigned into two groups, the discovery set (7324 animals) used for predictive variant detection and the cross-validation set for genomic prediction. Genome-wide association studies were performed by trait to every genomic region and entire WGS data for the pre-selection of variants. Each set of pre-selected SNPs with different density (1000, 3000, 5000, or 10,000) were added to the 50 k genotypes separately and the predictive performance of each set of genotypes was assessed using the genomic best linear unbiased prediction (GBLUP). Results showed that the predictive performance of the customized Hanwoo 50 k SNP panel can be improved by the addition of pre-selected variants from the WGS data, particularly 3000 variants from each trait, which is then sufficient to improve the prediction accuracy for all traits. When 12,000 pre-selected variants (3000 variants from each trait) were added to the 50 k genotypes, the prediction accuracies increased by 9.9, 9.2, 6.4, and 4.7% for backfat thickness, carcass weight, longissimus muscle area, and marbling score compared to the regular 50 k SNP panel, respectively. In terms of prediction bias, regression coefficients for all sets of genotypes in all traits were close to 1, indicating an unbiased prediction. The strategy used to select variants based on functional annotation did not show a clear advantage compared to using whole-genome. Nonetheless, such pre-selected SNPs from the IGR region gave the highest improvement in prediction accuracy among genomic regions and the values were close to those obtained using the WGS data for all traits. We concluded that additional gain in prediction accuracy when using pre-selected variants appears to be trait-dependent, and using WGS data remained more accurate compared to using a specific genomic region.

Collapse

van den Berg I, Ho PN, Haile-Mariam M, Beatson PR, O'Connor E, Pryce JE. Genetic parameters of blood urea nitrogen and milk urea nitrogen concentration in dairy cattle managed in pasture-based production systems of New Zealand and Australia. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21049] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Abstract Context Urinary nitrogen excretion by grazing cattle causes environmental pollution. Selecting for cows with a lower concentration of urinary nitrogen excretion may reduce the environmental impact. While urinary nitrogen excretion is difficult to measure, blood urea nitrogen (BUN), mid-infrared spectroscopy (MIR)-predicted BUN (MBUN), which is predicted from MIR spectra measured on milk samples, and milk urea nitrogen (MUN) are potential indicator traits. Australia and New Zealand have increasing datasets of cows with urea records, with 18 120 and 15 754 cows with urea records in Australia and New Zealand respectively. A collaboration between Australia and New Zealand could further increase the size of the dataset by sharing data. Aims Our aims were to estimate genetic parameters for urea traits within country, and genetic correlations between countries to gauge the benefit of having a joint reference population for genomic prediction of an indicator trait that is potentially suitable for selection to reduce urinary nitrogen excretion for both countries. Methods Genetic parameters were estimated within country (Australia and New Zealand) in Holstein, Jersey and a multibreed population, for BUN, MBUN and MUN in Australia and MUN in New Zealand, using high-density genotypes. Genetic correlations were also estimated between the urea traits recorded in Australia and MUN in New Zealand. Analyses used the first record available for each cow or within days-in-milk (DIM) intervals. Key results Heritabilities ranged from 0.08 to 0.32 for the various urea traits. Higher heritabilities were obtained for Jersey than for Holstein, and for the New Zealand cows than for the Australian cows. While urea traits were highly correlated within Australia (0.71–0.94), genetic correlations between Australia and New Zealand were small to moderate (0.08–0.58). Conclusions Our results showed that the heritability for urea traits differs among trait, breed, and country. While urea traits are highly correlated within country, genetic correlations between urea traits in Australia and MUN in New Zealand were only low to moderate. Implications Further study is required to identify the underlying causes of the difference in heritabilities observed, to compare the accuracies of different reference populations, and to estimate genetic correlations between urea traits and other traits such as fertility and feed intake. Larger datasets may help estimate genetic correlations more accurately between countries. Collapse

Khansefid M, Goddard ME, Haile-Mariam M, Konstantinov KV, Schrooten C, de Jong G, Jewell EG, O’Connor E, Pryce JE, Daetwyler HD, MacLeod IM. Improving Genomic Prediction of Crossbred and Purebred Dairy Cattle. Front Genet 2020;11:598580. [PMID: 33381150 PMCID: PMC7767986 DOI: 10.3389/fgene.2020.598580] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 11/19/2020] [Indexed: 11/17/2022] Open

Abstract

This study assessed the accuracy and bias of genomic prediction (GP) in purebred Holstein (H) and Jersey (J) as well as crossbred (H and J) validation cows using different reference sets and prediction strategies. The reference sets were made up of different combinations of 36,695 H and J purebreds and crossbreds. Additionally, the effect of using different sets of marker genotypes on GP was studied (conventional panel: 50k, custom panel enriched with, or close to, causal mutations: XT_50k, and conventional high-density with a limited custom set: pruned HDnGBS). We also compared the use of genomic best linear unbiased prediction (GBLUP) and Bayesian (emBayesR) models, and the traits tested were milk, fat, and protein yields. On average, by including crossbred cows in the reference population, the prediction accuracies increased by 0.01-0.08 and were less biased (regression coefficient closer to 1 by 0.02-0.16), and the benefit was greater for crossbreds compared to purebreds. The accuracy of prediction increased by 0.02 using XT_50k compared to 50k genotypes without affecting the bias. Although using pruned HDnGBS instead of 50k also increased the prediction accuracy by about 0.02, it increased the bias for purebred predictions in emBayesR models. Generally, emBayesR outperformed GBLUP for prediction accuracy when using 50k or pruned HDnGBS genotypes, but the benefits diminished with XT_50k genotypes. Crossbred predictions derived from a joint pure H and J reference were similar in accuracy to crossbred predictions derived from the two separate purebred reference sets and combined proportional to breed composition. However, the latter approach was less biased by 0.13. Most interestingly, using an equalized breed reference instead of an H-dominated reference, on average, reduced the bias of prediction by 0.16-0.19 and increased the accuracy by 0.04 for crossbred and J cows, with a little change in the H accuracy. In conclusion, we observed improved genomic predictions for both crossbreds and purebreds by equalizing breed contributions in a mixed breed reference that included crossbred cows. Furthermore, we demonstrate, that compared to the conventional 50k or high-density panels, our customized set of 50k sequence markers improved or matched the prediction accuracy and reduced bias with both GBLUP and Bayesian models.

Collapse