1
|
Chegini A, Strandén I, Karaman E, Iso-Touru T, Pösö J, Aamand GP, Lidauer MH. Marker weighting improves single-step genomic prediction reliabilities of udder health traits in Nordic Red and Jersey dairy cattle populations. J Dairy Sci 2024:S0022-0302(24)01196-2. [PMID: 39369893 DOI: 10.3168/jds.2024-25374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/04/2024] [Indexed: 10/08/2024]
Abstract
The standard single-step genomic prediction model assumes that all SNP markers explain an equal amount of genetic variance, which, however, may not be true. This is because SNPs are located in or near different genes with different functions. Therefore, it seems logical to consider SNP marker-specific weights when predicting genomic breeding values. We hypothesized that allowing differences in the amount of genetic variance explained by each SNP marker will improve prediction reliability and response to selection. To investigate this hypothesis, we first developed multi-trait standard single-step genomic models based on the current multi-trait random regression evaluation models for udder health traits of the Nordic Red (RDC) and Jersey (JER) dairy cattle populations. The models included 4 clinical mastitis (CM) traits, 3 test-day somatic cell score (SCS) traits, and the conformation traits fore udder attachment and udder depth. In the second step, we investigated the effect of applying different SNP marker weighting scenarios in the single-step genomic prediction models, for which a single-step SNP best linear unbiased prediction model was applied. We investigated the prediction reliability of the different models by forward prediction, where the last 4 years of the data were removed to estimate breeding values for validation candidates. In addition, genetic trends of the pedigree-based estimated breeding values (PEBV) and genomic enhanced breeding values (GEBV) were examined. The data sets for RDC and JER included 6.9 and 1.2 million animals of which 5.6 and 0.9 million cows had records, respectively. The number of genotyped animals was 125,789 and 64,777 for RDC and JER, respectively. Cows had repeated SCS observations but only single observations for all other traits and breeding values for all traits were modeled by one covariance function. This required modeling 12 eigenvalue breeding value coefficients for each cow and developing SNP marker weights for the principal components rather than for the biological traits. We investigated 3 SNP marker weighting scenarios: 1) a nonlinear method similar to BayesA, 2) using the classical formula 2pqû2 that accounts for allele heterozygosity, and 3) applying a mean SNP weight calculated by 2pqû2 for every 20 adjacent SNP markers. Bias, dispersion, and prediction reliability were calculated using PEBV or GEBV from the evaluation based on the full data set on those using the reduced data set. We found that the recent favorable genetic trend in CM and SCS has been accelerated since the introduction of genomic selection. The study also shows that a significant increase in prediction reliability, i.e., 0.74 vs. 0.48 for RDC and 0.72 vs. 0.41 for JER cows for CM, can be achieved with a standard single-step genomic prediction model compared with a pedigree-based prediction model. Almost all scenarios with SNP marker weighting further improved the prediction reliability between 0.5% and 12.7%. The highest improvement was achieved by weighing the SNP markers based on the 2pqû2 formula.
Collapse
Affiliation(s)
- Arash Chegini
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland.
| | - Ismo Strandén
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Terhi Iso-Touru
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| | | | - Gert P Aamand
- Nordic Cattle Genetic Evaluation (NAV), Agro Food Park 15, 8200 Aarhus, Denmark
| | - Martin H Lidauer
- Natural Resources Institute Finland (Luke), Tietotie 4, 31600 Jokioinen, Finland
| |
Collapse
|
2
|
Wang H, Wu H, Zhang W, Jiang J, Qian H, Man C, Gao H, Chen Q, Du L, Chen S, Wang F. Development and validation of a 5K low-density SNP chip for Hainan cattle. BMC Genomics 2024; 25:873. [PMID: 39294563 PMCID: PMC11409743 DOI: 10.1186/s12864-024-10753-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 09/02/2024] [Indexed: 09/20/2024] Open
Abstract
BACKGROUND This study aimed to design and develop a 5K low-density liquid chip for Hainan cattle utilizing targeted capture sequencing technology. The chip incorporates a substantial number of functional single nucleotide polymorphism (SNP) loci derived from public literature, including SNP loci significantly associated with immunity, heat stress, meat quality, reproduction, and other traits. Additionally, SNPs located in the coding regions of immune-related genes from the Bovine Genome Variation Database (BGVD) and Hainan cattle-specific SNP loci were included. RESULTS A total of 5,293 SNPs were selected, resulting in 9,837 DNA probes with a coverage rate of 85.69%, thereby creating a Hainan cattle-specific 5K Genotyping by Target Sequencing (GBTS) liquid chip. Evaluation with 152 cattle samples demonstrated excellent clustering performance and a detection rate ranging from 96.60 to 99.07%, with 94.5% of SNP sites exhibiting polymorphism. The chip achieved 100% gender coverage and displayed a heterozygosity rate between 14.20% and 29.65%, with a repeatability rate of 99.65-99.85%. Analyses using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) revealed the potential regulatory roles of exonic SNPs in immune response pathways. CONCLUSION The development and validation of the 5K GBTS liquid chip for Hainan cattle represent a valuable tool for genome analysis and genetic diversity assessment. Furthermore, it facilitates breed identification, gender determination, and kinship analysis, providing a foundation for the efficient utilization and development of local cattle genetic resources.
Collapse
Affiliation(s)
- Huan Wang
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Hui Wu
- Xinjiang Barkol Kazakh Autonomous County Animal Husbandry Veterinary Station, Barkol Kazakh Autonomous County, Xinjiang, PR China
| | - Wencan Zhang
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Junming Jiang
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Hejie Qian
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Churiga Man
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Hongyan Gao
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Qiaoling Chen
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Li Du
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China
| | - Si Chen
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China.
| | - Fengyang Wang
- Hainan Key Lab of Tropical Animal Reproduction, Breeding and Epidemic Disease Research, College of Tropical Agriculture and Forestry, Hainan University, Haikou, Hainan, PR China.
| |
Collapse
|
3
|
Lamb HJ, Nguyen LT, Copley JP, Engle BN, Hayes BJ, Ross EM. Imputation strategies for genomic prediction using nanopore sequencing. BMC Biol 2023; 21:286. [PMID: 38066581 PMCID: PMC10709982 DOI: 10.1186/s12915-023-01782-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 11/27/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Genomic prediction describes the use of SNP genotypes to predict complex traits and has been widely applied in humans and agricultural species. Genotyping-by-sequencing, a method which uses low-coverage sequence data paired with genotype imputation, is becoming an increasingly popular SNP genotyping method for genomic prediction. The development of Oxford Nanopore Technologies' (ONT) MinION sequencer has now made genotyping-by-sequencing portable and rapid. Here we evaluate the speed and accuracy of genomic predictions using low-coverage ONT sequence data in a population of cattle using four imputation approaches. We also investigate the effect of SNP reference panel size on imputation performance. RESULTS SNP array genotypes and ONT sequence data for 62 beef heifers were used to calculate genomic estimated breeding values (GEBVs) from 641 k SNP for four traits. GEBV accuracy was much higher when genome-wide flanking SNP from sequence data were used to help impute the 641 k panel used for genomic predictions. Using the imputation package QUILT, correlations between ONT and low-density SNP array genomic breeding values were greater than 0.91 and up to 0.97 for sequencing coverages as low as 0.1 × using a reference panel of 48 million SNP. Imputation time was significantly reduced by decreasing the number of flanking sequence SNP used in imputation for all methods. When compared to high-density SNP arrays, genotyping accuracy and genomic breeding value correlations at 0.5 × coverage were also found to be higher than those imputed from low-density arrays. CONCLUSIONS Here we demonstrated accurate genomic prediction is possible with ONT sequence data from sequencing coverages as low as 0.1 × , and imputation time can be as short as 10 min per sample. We also demonstrate that in this population, genotyping-by-sequencing at 0.1 × coverage can be more accurate than imputation from low-density SNP arrays.
Collapse
Affiliation(s)
- H J Lamb
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4067, Australia.
| | - L T Nguyen
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4067, Australia
| | - J P Copley
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4067, Australia
| | - B N Engle
- USDA, ARS, U.S. Meat Animal Research Centre, Clay Centre, NE, 68933, USA
| | - B J Hayes
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4067, Australia
| | - E M Ross
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4067, Australia
| |
Collapse
|
4
|
Guillenea A, Lund MS, Evans R, Boerner V, Karaman E. A breed-of-origin of alleles model that includes crossbred data improves predictive ability for crossbred animals in a multi-breed population. Genet Sel Evol 2023; 55:34. [PMID: 37189059 PMCID: PMC10184430 DOI: 10.1186/s12711-023-00806-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 04/24/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND Recently, crossbred animals have begun to be used as parents in the next generations of dairy and beef cattle systems, which has increased the interest in predicting the genetic merit of those animals. The primary objective of this study was to investigate three available methods for genomic prediction of crossbred animals. In the first two methods, SNP effects from within-breed evaluations are used by weighting them by the average breed proportions across the genome (BPM method) or by their breed-of-origin (BOM method). The third method differs from the BOM in that it estimates breed-specific SNP effects using purebred and crossbred data, considering the breed-of-origin of alleles (BOA method). For within-breed evaluations, and thus for BPM and BOM, 5948 Charolais, 6771 Limousin and 7552 Others (a combined population of other breeds) were used to estimate SNP effects separately within each breed. For the BOA, the purebreds' data were enhanced with data from ~ 4K, ~ 8K or ~ 18K crossbred animals. For each animal, its predictor of genetic merit (PGM) was estimated by considering the breed-specific SNP effects. Predictive ability and absence of bias were estimated for crossbreds and the Limousin and Charolais animals. Predictive ability was measured as the correlation between PGM and the adjusted phenotype, while the regression of the adjusted phenotype on PGM was estimated as a measure of bias. RESULTS With BPM and BOM, the predictive abilities for crossbreds were 0.468 and 0.472, respectively, and with the BOA method, they ranged from 0.490 to 0.510. The performance of the BOA method improved as the number of crossbred animals in the reference increased and with the use of the correlated approach, in which the correlation of SNP effects across the genome of the different breeds was considered. The slopes of regression for PGM on adjusted phenotypes for crossbreds showed overdispersion of the genetic merits for all methods but this bias tended to be reduced by the use of the BOA method and by increasing the number of crossbred animals. CONCLUSIONS For the estimation of the genetic merit of crossbred animals, the results from this study suggest that the BOA method that accommodates crossbred data can yield more accurate predictions than the methods that use SNP effects from separate within-breed evaluations.
Collapse
Affiliation(s)
- Ana Guillenea
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark.
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Ross Evans
- ICBF, Link Road, Carrigrohane, Ballincollig, Co. Cork, P31 D452, Ireland
| | - Vinzent Boerner
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus C, Denmark
| |
Collapse
|
5
|
Wolf MJ, Neumann GB, Kokuć P, Yin T, Brockmann GA, König S, May K. Genetic evaluations for endangered dual-purpose German Black Pied cattle using 50K SNPs, a breed-specific 200K chip, and whole-genome sequencing. J Dairy Sci 2023; 106:3345-3358. [PMID: 37028956 DOI: 10.3168/jds.2022-22665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 12/16/2022] [Indexed: 04/09/2023]
Abstract
Genetic evaluations of local cattle breeds are hampered due to small reference groups or biased due to the utilization of SNP effects estimated in other large populations. Against this background, there is a lack of studies addressing the possible advantage of whole-genome sequences (WGS) or consideration of specific variants from WGS data in genomic predictions for local breeds with small population size. Consequently, the aim of this study was to compare genetic parameters and accuracies of genomic estimated breeding values (GEBV) for 305-d production traits, fat-to protein ratio (FPR), and somatic cell score (SCS) at the first test date after calving and confirmation traits of the endangered German Black Pied cattle (DSN) breed using 4 different marker panels: (1) the commercial 50K Illumina BovineSNP50 BeadChip, (2) a customized 200K chip designed for DSN (DSN200K) which considers the most important variants for DSN from WGS, (3) randomly generated 200K chips based on WGS data, and (4) a WGS panel. The same number of animals was considered for all marker panel analyses (i.e., 1,811 genotyped or sequenced cows for conformation traits, 2,383 cows for lactation production traits, and 2,420 cows for FPR and SCS). Mixed models for the estimation of genetic parameters directly included the respective genomic relationship matrix from the different marker panels plus the trait-specific fixed effects. For the calculation of GEBV accuracies, we applied repeated random subsampling validation. In the process of separate cross-validations per trait, we created a validation set including 20% of cows with masked phenotypes, and a training set comprising 80% of the cows. The cows were selected randomly in a procedure with 10 replicates considering replacements in the different scenarios. The accuracy was defined as the correlation between the direct GEBV and the phenotypes with subtracted corresponding fixed effects for the cows in the validation set. For FPR and SCS, as well as for lactation production traits, heritabilities were largest based on WGS data, but the increase compared with the 50K or DSN200K applications was quite small in the range from 0.01 to 0.03. Also, for most of the conformation traits, heritabilities were largest based on WGS and DSN200K data, but the increase was in the range of the corresponding standard error. Accordingly, GEBV accuracies for most of the studied traits were highest based on WGS data or when utilizing the DSN200K chip, but the accuracy differences across the marker panels were quite small and nonsignificant. In conclusion, WGS data and the DSN200K chip only contributed to minor improvements in genomic predictions, still justifying the use of the commercial 50K chip. Nevertheless, WGS and the 200KDSN chip harbor breed-specific variants, which are valuable for studying causal genetic mechanisms in the endangered DSN population.
Collapse
Affiliation(s)
- Manuel J Wolf
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, 35390 Gießen, Germany
| | - Guilherme B Neumann
- Animal Breeding Biology and Molecular Genetics, Albrecht Daniel Thaer-Institute for Agricultural and Horticultural Sciences, Humboldt Universität zu Berlin, 10115 Berlin, Germany
| | - Paula Kokuć
- Animal Breeding Biology and Molecular Genetics, Albrecht Daniel Thaer-Institute for Agricultural and Horticultural Sciences, Humboldt Universität zu Berlin, 10115 Berlin, Germany
| | - Tong Yin
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, 35390 Gießen, Germany
| | - Gudrun A Brockmann
- Animal Breeding Biology and Molecular Genetics, Albrecht Daniel Thaer-Institute for Agricultural and Horticultural Sciences, Humboldt Universität zu Berlin, 10115 Berlin, Germany
| | - Sven König
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, 35390 Gießen, Germany.
| | - Katharina May
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, 35390 Gießen, Germany
| |
Collapse
|
6
|
Teng J, Wang D, Zhao C, Zhang X, Chen Z, Liu J, Sun D, Tang H, Wang W, Li J, Mei C, Yang Z, Ning C, Zhang Q. Longitudinal genome-wide association studies of milk production traits in Holstein cattle using whole-genome sequence data imputed from medium-density chip data. J Dairy Sci 2023; 106:2535-2550. [PMID: 36797187 DOI: 10.3168/jds.2022-22277] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 10/20/2022] [Indexed: 02/16/2023]
Abstract
Longitudinal traits, such as milk production traits in dairy cattle, are featured by having phenotypic values at multiple time points, which change dynamically over time. In this study, we first imputed SNP chip (50-100K) data to whole-genome sequence (WGS) data in a Chinese Holstein population consisting of 6,470 cows. The imputation accuracies were 0.88 to 0.97 on average after quality control. We then performed longitudinal GWAS in this population based on a random regression test-day model using the imputed WGS data. The longitudinal GWAS revealed 16, 39, and 75 quantitative trait locus regions associated with milk yield, fat percentage, and protein percentage, respectively. We estimated the 95% confidence intervals (CI) for these quantitative trait locus regions using the logP drop method and identified 581 genes involved in these CI. Further, we focused on the CI that covered or overlapped with only 1 gene or the CI that contained an extremely significant top SNP. Twenty-eight candidate genes were identified in these CI. Most of them have been reported in the literature to be associated with milk production traits, such as DGAT1, HSF1, MGST1, GHR, ABCG2, ADCK5, and CSN1S1. Among the unreported novel genes, some also showed good potential as candidate genes, such as CCSER1, CUX2, SNTB1, RGS7, OSR2, and STK3, and are worth being further investigated. Our study provided not only new insights into the candidate genes for milk production traits, but also a general framework for longitudinal GWAS based on random regression test-day model using WGS data.
Collapse
Affiliation(s)
- Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Zhi Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Jianfeng Liu
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Dongxiao Sun
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Hui Tang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Wenwen Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Jianbin Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, Jinan 250100, China
| | - Cheng Mei
- Dongying Shenzhou AustAsia Modern Dairy Farm Co. Ltd., Dongying 257200, China
| | - Zhangping Yang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| |
Collapse
|
7
|
Tahir MS, Porto-Neto LR, Reverter-Gomez T, Olasege BS, Sajid MR, Wockner KB, Tan AWL, Fortes MRS. Utility of multi-omics data to inform genomic prediction of heifer fertility traits. J Anim Sci 2022; 100:skac340. [PMID: 36239447 PMCID: PMC9733504 DOI: 10.1093/jas/skac340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 10/12/2022] [Indexed: 12/15/2022] Open
Abstract
Biologically informed single nucleotide polymorphisms (SNPs) impact genomic prediction accuracy of the target traits. Our previous genomics, proteomics, and transcriptomics work identified candidate genes related to puberty and fertility in Brahman heifers. We aimed to test this biological information for capturing heritability and predicting heifer fertility traits in another breed i.e., Tropical Composite. The SNP from the identified genes including 10 kilobases (kb) region on either side were selected as biologically informed SNP set. The SNP from the rest of the Bos taurus genes including 10-kb region on either side were selected as biologically uninformed SNP set. Bovine high-density (HD) complete SNP set (628,323 SNP) was used as a control. Two populations-Tropical Composites (N = 1331) and Brahman (N = 2310)-had records for three traits: pregnancy after first mating season (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5). Using the best linear unbiased prediction method, effectiveness of each SNP set to predict the traits was tested in two scenarios: a 5-fold cross-validation within Tropical Composites using biological information from Brahman studies, and application of prediction equations from one breed to the other. The accuracy of prediction was calculated as the correlation between genomic estimated breeding values and adjusted phenotypes. Results show that biologically informed SNP set estimated heritabilities not significantly better than the control HD complete SNP set in Tropical Composites; however, it captured all the observed genetic variance in PREG1 and FCS when modeled together with the biologically uninformed SNP set. In 5-fold cross-validation within Tropical Composites, the biologically informed SNP set performed marginally better (statistically insignificant) in terms of prediction accuracies (PREG1: 0.20, FCS: 0.13, and REB: 0.12) as compared to HD complete SNP set (PREG1: 0.17, FCS: 0.10, and REB: 0.11), and biologically uninformed SNP set (PREG1: 0.16, FCS: 0.10, and REB: 0.11). Across-breed use of prediction equations still remained a challenge: accuracies by all SNP sets dropped to around zero for all traits. The performance of biologically informed SNP was not significantly better than other sets in Tropical Composites. However, results indicate that biological information obtained from Brahman was successful to predict the fertility traits in Tropical Composite population.
Collapse
Affiliation(s)
- Muhammad S Tahir
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Laercio R Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Toni Reverter-Gomez
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Babatunde S Olasege
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Mirza R Sajid
- Department of Statistics, University of Gujrat, 50700 Punjab, Pakistan
| | - Kimberley B Wockner
- Queensland Department of Agriculture and Fisheries, Brisbane 4072, QLD, Australia
| | - Andre W L Tan
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Marina R S Fortes
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| |
Collapse
|
8
|
Ma S, Li P, Liu H, Xi Y, Xu Q, Qi J, Wang J, Li L, Wang J, Hu J, He H, Han C, Bai L. Genome-wide association analysis of the primary feather growth traits of duck: identification of potential Loci for growth regulation. Poult Sci 2022; 102:102243. [PMID: 36334470 PMCID: PMC9636485 DOI: 10.1016/j.psj.2022.102243] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 10/05/2022] [Accepted: 10/05/2022] [Indexed: 11/28/2022] Open
Abstract
The feather is an important epidermal appendage, plays an important role in the life activities of avian specie, and has important economic value. Revealing the molecular regulation mechanism of feather growth has a significant meaning in studying adaptive evolution, physiology, and mating of avian species and also provides a theoretical reference for poultry breeding. In this study, the genome-wide association analysis (GWAS) of 358 ducks was based on primary feather length phenotypic data (28-60 d), length growth rates (LGRs), and maturity scores (60 d) to explore the genetic basis affecting feather growth and maturation. The results showed that, among the primary feather 1 to 5 in ducks, the mean LGR of primary feather 2 was the fastest, with the longest length. The primary feathers in males grew and matured slightly faster than in females. The mean maturity scores of primary feather 10∼7 were higher than primary feather 1 to 3 in ducks. GWAS further showed 116 SNPs associated with feather length traits. In addition, 2 candidate regions (Chr1: 127,407,230-127,524,879 bp and Chr21: 182,061,707-183,616,298 bp) were associated with LGR, which contain total 13 candidate genes (The extremely significant SNPs were mainly located in 2 genes: Chr1: REPS2 and Chr21: PTPRT). Four candidate regions (Chr1: 29,113,036-28,675,018 bp, Chr2: 18,253,612-149,111,290 bp, Chr15: 6,489,774 to 12,138,221 bp and Chr21: 6,578,021-8,472,904 bp) were associated with feather maturity, which contain total 24 candidate genes (The extremely significant SNPs were mainly located in 4 genes: Chr1: IMMP2L, DOCK4 and DDX10, Chr2: LDLRAD4). In conclusion, sex factors influence feather growth and maturity, and the genetic basis of the growth /maturity trait between different feathers is similar. REPS2, PTPRT genes, and IMMP2L, DOCK4, DDX10, and LDLRAD4 are important candidate genes that influence feather growth and maturity, respectively.
Collapse
Affiliation(s)
- Shengchao Ma
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China,College of Animal Science, Xinjiang Agricultural University, P. R. China
| | - Pengcheng Li
- Berry Genomics Corporation, Beijing 100015, P. R. China
| | - Hehe Liu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China,Corresponding author:
| | - Yang Xi
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Qian Xu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jingjing Qi
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jianmei Wang
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Liang Li
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jiwen Wang
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jiwei Hu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Hua He
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Chunchun Han
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Lili Bai
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| |
Collapse
|
9
|
Genomic Selection in Chinese Holsteins Using Regularized Regression Models for Feature Selection of Whole Genome Sequencing Data. Animals (Basel) 2022; 12:ani12182419. [PMID: 36139283 PMCID: PMC9495168 DOI: 10.3390/ani12182419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/09/2022] [Accepted: 09/12/2022] [Indexed: 11/30/2022] Open
Abstract
Simple Summary Genomic selection (GS) is increasingly widely used in animal breeding, owing to its high efficiency in the genetic improvement of economic traits. In China, GS has been implemented for genetic evaluation of young bulls in dairy cattle breeding programs since 2012. GS is commonly based on single nucleotide polymorphism (SNP) chips. The cost of whole genome sequencing (WGS) has decreased tremendously in recent years, allowing increased studies of WGS-based GS. In this study, based on the imputed WGS data of approximately 8000 Chinese Holsteins, we investigated the performance of GS of milk production traits using the feature selection method of regularized regression. The results showed that WGS-based GS using regularized regression models and the commonly used linear mixed models achieved comparable prediction accuracies. For milk and protein yields, GS using a combination of SNPs selected with a regularized regression model and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with a linear mixed model combined with 50K SNP chip data performed best for fat yield. The proposed method of GS based on WGS data, i.e., feature selection using regularization regression models, provides a valuable novel strategy for genomic selection. Abstract Genomic selection (GS) is an efficient method to improve genetically economic traits. Feature selection is an important method for GS based on whole-genome sequencing (WGS) data. We investigated the prediction performance of GS of milk production traits using imputed WGS data on 7957 Chinese Holsteins. We used two regularized regression models, least absolute shrinkage and selection operator (LASSO) and elastic net (EN) for feature selection. For comparison, we performed genome-wide association studies based on a linear mixed model (LMM), and the N single nucleotide polymorphisms (SNPs) with the lowest p-values were selected (LMMLASSO and LMMEN), where N was the number of non-zero effect SNPs selected by LASSO or EN. GS was conducted using a genomic best linear unbiased prediction (GBLUP) model and several sets of SNPs: (1) selected WGS SNPs; (2) 50K SNP chip data; (3) WGS data; and (4) a combined set of selected WGS SNPs and 50K SNP chip data. The results showed that the prediction accuracies of GS with features selected using LASSO or EN were comparable to those using features selected with LMMLASSO or LMMEN. For milk and protein yields, GS using a combination of SNPs selected with LASSO and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with LMMLASSO combined with 50K SNP chip data performed best for fat yield. The proposed method, feature selection using regularization regression models, provides a valuable novel strategy for WGS-based GS.
Collapse
|
10
|
Marina H, Pelayo R, Gutiérrez-Gil B, Suárez-Vega A, Esteban-Blanco C, Reverter A, Arranz JJ. Low-density SNP panel for efficient imputation and genomic selection of milk production and technological traits in dairy sheep. J Dairy Sci 2022; 105:8199-8217. [PMID: 36028350 DOI: 10.3168/jds.2021-21601] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 05/30/2022] [Indexed: 11/19/2022]
Abstract
The present study aimed to ascertain how different strategies for leveraging genomic information enhance the accuracy of estimated breeding values for milk and cheese-making traits and to evaluate the implementation of a low-density (LowD) SNP chip designed explicitly for that aim. Thus, milk samples from a total of 2,020 dairy ewes from 2 breeds (1,039 Spanish Assaf and 981 Churra) were collected and analyzed to determine 3 milk production and composition traits and 2 traits related to milk coagulation properties and cheese yield. The 2 studied populations were genotyped with a customized 50K Affymetrix SNP chip (Affymetrix Inc.) containing 55,627 SNP markers. The prediction accuracies were obtained using different multitrait methodologies, such as the BLUP model based on pedigree information, the genomic BLUP (GBLUP), and the BLUP at the SNP level (SNP-BLUP), which are based on genotypic data, and the single-step GBLUP (ssGBLUP), which combines both sources of information. All of these methods were analyzed by cross-validation, comparing predictions of the whole population with the test population sets. Additionally, we describe the design of a LowD SNP chip (3K) and its prediction accuracies through the different methods mentioned previously. Furthermore, the results obtained using the LowD SNP chip were compared with those based on the 50K SNP chip data sets. Finally, we conclude that implementing genomic selection through the ssGBLUP model in the current breeding programs would increase the accuracy of the estimated breeding values compared with the BLUP methodology in the Assaf (from 0.19 to 0.39) and Churra (from 0.27 to 0.44) dairy sheep populations. The LowD SNP chip is cost-effective and has proven to be an accurate tool for estimating genomic breeding values for milk and cheese-making traits, microsatellite imputation, and parentage verification. The results presented here suggest that the routine use of this LowD SNP chip could potentially increase the genetic gains of the breeding selection programs of the 2 Spanish dairy sheep breeds considered here.
Collapse
Affiliation(s)
- H Marina
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain
| | - R Pelayo
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain
| | - B Gutiérrez-Gil
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain
| | - A Suárez-Vega
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain
| | - C Esteban-Blanco
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain
| | - A Reverter
- CSIRO Agriculture & Food, 306 Carmody Rd., St. Lucia, Brisbane, QLD 4067, Australia
| | - J J Arranz
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad de León, Campus de Vegazana s/n, León 24071, Spain.
| |
Collapse
|
11
|
Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:113-138. [PMID: 35451774 DOI: 10.1007/978-1-0716-2205-6_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Imputation has become a standard practice in modern genetic research to increase genome coverage and improve accuracy of genomic selection and genome-wide association study as a large number of samples can be genotyped at lower density (and lower cost) and, imputed up to denser marker panels or to sequence level, using information from a limited reference population. Most genotype imputation algorithms use information from relatives and population linkage disequilibrium. A number of software for imputation have been developed originally for human genetics and, more recently, for animal and plant genetics considering pedigree information and very sparse SNP arrays or genotyping-by-sequencing data. In comparison to human populations, the population structures in farmed species and their limited effective sizes allow to accurately impute high-density genotypes or sequences from very low-density SNP panels and a limited set of reference individuals. Whatever the imputation method, the imputation accuracy, measured by the correct imputation rate or the correlation between true and imputed genotypes, increased with the increasing relatedness of the individual to be imputed with its denser genotyped ancestors and as its own genotype density increased. Increasing the imputation accuracy pushes up the genomic selection accuracy whatever the genomic evaluation method. Given the marker densities, the most important factors affecting imputation accuracy are clearly the size of the reference population and the relationship between individuals in the reference and target populations.
Collapse
|
12
|
Teng J, Zhao C, Wang D, Chen Z, Tang H, Li J, Mei C, Yang Z, Ning C, Zhang Q. Assessment of the performance of different imputation methods for low-coverage sequencing in Holstein cattle. J Dairy Sci 2022; 105:3355-3366. [DOI: 10.3168/jds.2021-21360] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 12/13/2021] [Indexed: 12/27/2022]
|
13
|
Guillenea A, Su G, Lund MS, Karaman E. Genomic prediction in Nordic Red dairy cattle considering breed origin of alleles. J Dairy Sci 2022; 105:2426-2438. [PMID: 35033341 DOI: 10.3168/jds.2021-21173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 11/23/2021] [Indexed: 01/02/2023]
Abstract
This study investigated the reliability of genomic prediction (GP) using breed origin of alleles (BOA) approach in the Nordic Red (RDC) population, which has an admixed population structure. The RDC population consists of animals with varying degrees of genetic materials from the Danish Red (RDM), Swedish Red (SRB), Finnish Ayrshire (FAY), and Holstein (HOL) because bulls have been used across the breeds. The BOA approach was tested using 39,550 RDC animals in the reference population and 11,786 in the validation population. Deregressed proofs (DRP) of milk, fat and protein were used as response variable for GP. Direct genomic breeding values (DGV) for animals in the validation population were calculated with (BOA model) or without (joint model) considering breed origin of alleles. The joint model assumed homogeneous marker effects and a single set of marker effects were estimated, whereas BOA model assumed heterogeneous marker effects, and different sets of marker effects were estimated across the breeds. For the BOA approach, we tested scenarios assuming both correlated (BOA_cor) and uncorrelated (BOA_uncor) marker effects between the breeds. Additionally, we investigated GP using a standard Illumina 50K chip and including SNP selected from imputed whole-genome sequencing (50K+WGS). We also studied the effect of estimating (co)variances for genome regions of different sizes to exploit the information of the genome regions contributing to the (co)variance between the breeds. Region sizes were set as 1 SNP, a group of 30 or 100 adjacent SNP, or the whole genome. Reliability of DGV was measured as squared correlations between DGV and DRP divided by the reliability of DRP. Across the 3 traits, in general, RS30 and RS100 SNP yielded the highest reliabilities. Including WGS SNP improved reliabilities in almost all scenarios (0.297 on average for 50K and 0.307 on average for 50K+WGS). The BOA_uncor (0.233 on average) was inferior to the joint model (0.339 on average), but the reliabilities obtained using BOA_cor (0.334 on average) in most cases were not significantly different from those obtained using the joint model. The results indicate that both including additional whole-genome sequencing SNP and dividing the genome into fixed regions improve GP in the RDC. The BOA models have the potential to increase the reliability of GP, but the benefit is limited in populations with a high exchange of genetic material for a long time, as is the case for RDC.
Collapse
Affiliation(s)
- Ana Guillenea
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Mogens Sand Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
14
|
Bedhane M, van der Werf J, de las Heras-Saldana S, Lim D, Park B, Na Park M, Seung Hee R, Clark S. The accuracy of genomic prediction for meat quality traits in Hanwoo cattle when using genotypes from different SNP densities and preselected variants from imputed whole genome sequence. ANIMAL PRODUCTION SCIENCE 2022. [DOI: 10.1071/an20659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Context
Genomic prediction is the use of genomic data in the estimation of genomic breeding values (GEBV) in animal breeding. In beef cattle breeding programs, genomic prediction increases the rates of genetic gain by increasing the accuracy of selection at earlier ages.
Aims
The objectives of the study were to examine the effect of single-nucleotide polymorphism (SNP) density and to evaluate the effect of using SNPs preselected from imputed whole-genome sequence for genomic prediction.
Methods
Genomic and phenotypic data from 2110 Hanwoo steers were used to predict GEBV for marbling score (MS), meat texture (MT), and meat colour (MC) traits. Three types of SNP densities including 50k, high-density (HD), and whole-genome sequence data and preselected SNPs from genome-wide association study (GWAS) were used for genomic prediction analyses. Two scenarios (independent and dependent discovery populations) were used to select top significant SNPs. The accuracy of GEBV was assessed using random cross-validation. Genomic best linear unbiased prediction (GBLUP) was used to predict the breeding values for each trait.
Key results
Our result showed that very similar prediction accuracies were observed across all SNP densities used in the study. The prediction accuracy among traits ranged from 0.29±0.05 for MC to 0.46±0.04 for MS. Depending on the studied traits, up to 5% of prediction accuracy improvement was obtained when the preselected SNPs from GWAS analysis were included in the prediction analysis.
Conclusions
High SNP density such as HD and the whole-genome sequence data yielded a similar prediction accuracy in Hanwoo beef cattle. Therefore, the 50K SNP chip panel is sufficient to capture the relationships in a breed with a small effective population size such as the Hanwoo cattle population. Preselected variants improved prediction accuracy when they were included in the genomic prediction model.
Implications
The estimated genomic prediction accuracies are moderately accurate in Hanwoo cattle and for searching for SNPs that are more productive could increase the accuracy of estimated breeding values for the studied traits.
Collapse
|
15
|
Mestav B. Assessment of the relationship between the postpartum diseases susceptibility and the bovine monocyte subsets via Bayesian logistic regression, under various prior distributions. Res Vet Sci 2022; 145:1-12. [DOI: 10.1016/j.rvsc.2022.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 01/21/2022] [Accepted: 01/24/2022] [Indexed: 10/19/2022]
|
16
|
Yang R, Xu Z, Wang Q, Zhu D, Bian C, Ren J, Huang Z, Zhu X, Tian Z, Wang Y, Jiang Z, Zhao Y, Zhang D, Li N, Hu X. Genome‑wide association study and genomic prediction for growth traits in yellow-plumage chicken using genotyping-by-sequencing. Genet Sel Evol 2021; 53:82. [PMID: 34706641 PMCID: PMC8555081 DOI: 10.1186/s12711-021-00672-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/08/2021] [Indexed: 12/25/2022] Open
Abstract
Background Growth traits are of great importance for poultry breeding and production and have been the topic of extensive investigation, with many quantitative trait loci (QTL) detected. However, due to their complex genetic background, few causative genes have been confirmed and the underlying molecular mechanisms remain unclear, thus limiting our understanding of QTL and their potential use for the genetic improvement of poultry. Therefore, deciphering the genetic architecture is a promising avenue for optimising genomic prediction strategies and exploiting genomic information for commercial breeding. The objectives of this study were to: (1) conduct a genome-wide association study to identify key genetic factors and explore the polygenicity of chicken growth traits; (2) investigate the efficiency of genomic prediction in broilers; and (3) evaluate genomic predictions that harness genomic features. Results We identified five significant QTL, including one on chromosome 4 with major effects and four on chromosomes 1, 2, 17, and 27 with minor effects, accounting for 14.5 to 34.1% and 0.2 to 2.6% of the genomic additive genetic variance, respectively, and 23.3 to 46.7% and 0.6 to 4.5% of the observed predictive accuracy of breeding values, respectively. Further analysis showed that the QTL with minor effects collectively had a considerable influence, reflecting the polygenicity of the genetic background. The accuracy of genomic best linear unbiased predictions (BLUP) was improved by 22.0 to 70.3% compared to that of the conventional pedigree-based BLUP model. The genomic feature BLUP model further improved the observed prediction accuracy by 13.8 to 15.2% compared to the genomic BLUP model. Conclusions A major QTL and four minor QTL were identified for growth traits; the remaining variance was due to QTL effects that were too small to be detected. The genomic BLUP and genomic feature BLUP models yielded considerably higher prediction accuracy compared to the pedigree-based BLUP model. This study revealed the polygenicity of growth traits in yellow-plumage chickens and demonstrated that the predictive ability can be greatly improved by using genomic information and related features. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00672-9.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhenqiang Xu
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Qi Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoning Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhixin Tian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ziqin Jiang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Dexiang Zhang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China.
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
17
|
Mollandin F, Rau A, Croiseau P. An evaluation of the predictive performance and mapping power of the BayesR model for genomic prediction. G3 GENES|GENOMES|GENETICS 2021; 11:6317672. [PMID: 34849780 PMCID: PMC8527474 DOI: 10.1093/g3journal/jkab225] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/27/2021] [Indexed: 12/02/2022]
Abstract
Technological advances and decreasing costs have led to the rise of increasingly dense genotyping data, making feasible the identification of potential causal markers. Custom genotyping chips, which combine medium-density genotypes with a custom genotype panel, can capitalize on these candidates to potentially yield improved accuracy and interpretability in genomic prediction. A particularly promising model to this end is BayesR, which divides markers into four effect size classes. BayesR has been shown to yield accurate predictions and promise for quantitative trait loci (QTL) mapping in real data applications, but an extensive benchmarking in simulated data is currently lacking. Based on a set of real genotypes, we generated simulated data under a variety of genetic architectures and phenotype heritabilities, and we evaluated the impact of excluding or including causal markers among the genotypes. We define several statistical criteria for QTL mapping, including several based on sliding windows to account for linkage disequilibrium (LD). We compare and contrast these statistics and their ability to accurately prioritize known causal markers. Overall, we confirm the strong predictive performance for BayesR in moderately to highly heritable traits, particularly for 50k custom data. In cases of low heritability or weak LD with the causal marker in 50k genotypes, QTL mapping is a challenge, regardless of the criterion used. BayesR is a promising approach to simultaneously obtain accurate predictions and interpretable classifications of SNPs into effect size classes. We illustrated the performance of BayesR in a variety of simulation scenarios, and compared the advantages and limitations of each.
Collapse
Affiliation(s)
- Fanny Mollandin
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
| | - Andrea Rau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
- BioEcoAgro Joint Research Unit, INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, Peronne 80203, France
| | - Pascal Croiseau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
| |
Collapse
|
18
|
Gebreyesus G, Lund MS, Sahana G, Su G. Reliabilities of Genomic Prediction for Young Stock Survival Traits Using 54K SNP Chip Augmented With Additional Single-Nucleotide Polymorphisms Selected From Imputed Whole-Genome Sequencing Data. Front Genet 2021; 12:667300. [PMID: 34349779 PMCID: PMC8326759 DOI: 10.3389/fgene.2021.667300] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 06/23/2021] [Indexed: 11/16/2022] Open
Abstract
This study investigated effects of integrating single-nucleotide polymorphisms (SNPs) selected based on previous genome-wide association studies (GWASs), from imputed whole-genome sequencing (WGS) data, in the conventional 54K chip on genomic prediction reliability of young stock survival (YSS) traits in dairy cattle. The WGS SNPs included two groups of SNP sets that were selected based on GWAS in the Danish Holstein for YSS index (YSS_SNPs, n = 98) and SNPs chosen as peaks of quantitative trait loci for the traits of Nordic total merit index in Denmark–Finland–Sweden dairy cattle populations (DFS_SNPs, n = 1,541). Additionally, the study also investigated the possibility of improving genomic prediction reliability for survival traits by modeling the SNPs within recessive lethal haplotypes (LET_SNP, n = 130) detected from the 54K chip in the Nordic Holstein. De-regressed proofs (DRPs) were obtained from 6,558 Danish Holstein bulls genotyped with either 54K chip or customized LD chip that includes SNPs in the standard LD chip and some of the selected WGS SNPs. The chip data were subsequently imputed to 54K SNP together with the selected WGS SNPs. Genomic best linear unbiased prediction (GBLUP) models were implemented to predict breeding values through either pooling the 54K and selected WGS SNPs together as one genetic component (a one-component model) or considering 54K SNPs and selected WGS SNPs as two separate genetic components (a two-component model). Across all the traits, inclusion of each of the selected WGS SNP sets led to negligible improvements in prediction accuracies (0.17 percentage points on average) compared to prediction using only 54K. Similarly, marginal improvement in prediction reliability was obtained when all the selected WGS SNPs were included (0.22 percentage points). No further improvement in prediction reliability was observed when considering random regression on genotype code of recessive lethal alleles in the model including both groups of the WGS SNPs. Additionally, there was no difference in prediction reliability from integrating the selected WGS SNP sets through the two-component model compared to the one-component GBLUP.
Collapse
Affiliation(s)
- Grum Gebreyesus
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| |
Collapse
|
19
|
Karaman E, Su G, Croue I, Lund MS. Genomic prediction using a reference population of multiple pure breeds and admixed individuals. Genet Sel Evol 2021; 53:46. [PMID: 34058971 PMCID: PMC8168010 DOI: 10.1186/s12711-021-00637-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 05/11/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In dairy cattle populations in which crossbreeding has been used, animals show some level of diversity in their origins. In rotational crossbreeding, for instance, crossbred dams are mated with purebred sires from different pure breeds, and the genetic composition of crossbred animals is an admixture of the breeds included in the rotation. How to use the data of such individuals in genomic evaluations is still an open question. In this study, we aimed at providing methodologies for the use of data from crossbred individuals with an admixed genetic background together with data from multiple pure breeds, for the purpose of genomic evaluations for both purebred and crossbred animals. A three-breed rotational crossbreeding system was mimicked using simulations based on animals genotyped with the 50 K single nucleotide polymorphism (SNP) chip. RESULTS For purebred populations, within-breed genomic predictions generally led to higher accuracies than those from multi-breed predictions using combined data of pure breeds. Adding admixed population's (MIX) data to the combined pure breed data considering MIX as a different breed led to higher accuracies. When prediction models were able to account for breed origin of alleles, accuracies were generally higher than those from combining all available data, depending on the correlation of quantitative trait loci (QTL) effects between the breeds. Accuracies varied when using SNP effects from any of the pure breeds to predict the breeding values of MIX. Using those breed-specific SNP effects that were estimated separately in each pure breed, while accounting for breed origin of alleles for the selection candidates of MIX, generally improved the accuracies. Models that are able to accommodate MIX data with the breed origin of alleles approach generally led to higher accuracies than models without breed origin of alleles, depending on the correlation of QTL effects between the breeds. CONCLUSIONS Combining all available data, pure breeds' and admixed population's data, in a multi-breed reference population is beneficial for the estimation of breeding values for pure breeds with a small reference population. For MIX, such an approach can lead to higher accuracies than considering breed origin of alleles for the selection candidates, and using breed-specific SNP effects estimated separately in each pure breed. Including MIX data in the reference population of multiple breeds by considering the breed origin of alleles, accuracies can be further improved. Our findings are relevant for breeding programs in which crossbreeding is systematically applied, and also for populations that involve different subpopulations and between which exchange of genetic material is routine practice.
Collapse
Affiliation(s)
- Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | | | - Mogens S Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
20
|
Zhang H, Liu A, Wang Y, Luo H, Yan X, Guo X, Li X, Liu L, Su G. Genetic Parameters and Genome-Wide Association Studies of Eight Longevity Traits Representing Either Full or Partial Lifespan in Chinese Holsteins. Front Genet 2021; 12:634986. [PMID: 33719343 PMCID: PMC7947242 DOI: 10.3389/fgene.2021.634986] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Accepted: 02/05/2021] [Indexed: 11/17/2022] Open
Abstract
Due to the complexity of longevity trait in dairy cattle, two groups of trait definitions are widely used to measure longevity, either covering the full lifespan or representing only a part of it to achieve an early selection. Usually, only one group of longevity definition is used in breeding program for one population, and genetic studies on the comparisons of two groups of trait definitions are scarce. Based on the data of eight traits well representing the both groups of trait definitions, the current study investigated genetic parameters and genetic architectures of longevity in Holsteins. Heritabilities and correlations of eight longevity traits were estimated using single-trait and multi-trait animal models, with the data from 103,479 cows. Among the cows with phenotypes, 2,630 cows were genotyped with the 150K-SNP panel. A single-trait fixed and random Circuitous Probability Unification model was performed to detect candidate genes for eight longevity traits. Generally, all eight longevity traits had low heritabilities, ranging from 0.038 for total productive life and herd life to 0.090 for days from the first calving to the end of first lactation or culling. High genetic correlations were observed among the traits within the same definition group: from 0.946 to 0.997 for three traits reflecting full lifespan and from 0.666 to 0.997 for five traits reflecting partial productive life. Genetic correlations between two groups of traits ranged from 0.648 to 0.963, and increased gradually with the extension of lactations number regarding the partial productive life traits. A total of 55 SNPs located on 25 chromosomes were found genome-wide significantly associated with longevity, in which 12 SNPs were associated with more than one trait, even across traits of different definition groups. This is the first study to investigate the genetic architecture of longevity representing both full and the partial lifespan simultaneously, which will assist the selection of an appropriate trait definition for genetic improvement of longevity. Because of high genetic correlations with the full lifespan traits and higher heritability, the partial productive life trait measured as the days from the first calving to the end of the third lactation or culling could be a good alternative for early selection on longevity. The candidate genes identified by this study, such as RPRM, GRIA3, GTF2H5, CA5A, CACNA2D1, FGF10, and DNAJA3, could be used to pinpoint causative mutations for longevity and further benefit the genomic improvement of longevity in dairy cattle.
Collapse
Affiliation(s)
- Hailiang Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Aoxing Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China.,Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hanpeng Luo
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Xinyi Yan
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Xiangyu Guo
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| | - Xiang Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lin Liu
- Beijing Dairy Cattle Center, Beijing, China
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, Tjele, Denmark
| |
Collapse
|
21
|
Lopez BIM, An N, Srikanth K, Lee S, Oh JD, Shin DH, Park W, Chai HH, Park JE, Lim D. Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle. Front Genet 2021; 11:603822. [PMID: 33552124 PMCID: PMC7859490 DOI: 10.3389/fgene.2020.603822] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open
Abstract
Whole-genome sequence (WGS) data are increasingly being applied into genomic predictions, offering a higher predictive ability by including causal mutations or single-nucleotide polymorphisms (SNPs) putatively in strong linkage disequilibrium with causal mutations affecting the trait. This study aimed to improve the predictive performance of the customized Hanwoo 50 k SNP panel for four carcass traits in commercial Hanwoo population by adding highly predictive variants from sequence data. A total of 16,892 Hanwoo cattle with phenotypes (i.e., backfat thickness, carcass weight, longissimus muscle area, and marbling score), 50 k genotypes, and WGS imputed genotypes were used. We partitioned imputed WGS data according to functional annotation [intergenic (IGR), intron (ITR), regulatory (REG), synonymous (SYN), and non-synonymous (NSY)] to characterize the genomic regions that will deliver higher predictive power for the traits investigated. Animals were assigned into two groups, the discovery set (7324 animals) used for predictive variant detection and the cross-validation set for genomic prediction. Genome-wide association studies were performed by trait to every genomic region and entire WGS data for the pre-selection of variants. Each set of pre-selected SNPs with different density (1000, 3000, 5000, or 10,000) were added to the 50 k genotypes separately and the predictive performance of each set of genotypes was assessed using the genomic best linear unbiased prediction (GBLUP). Results showed that the predictive performance of the customized Hanwoo 50 k SNP panel can be improved by the addition of pre-selected variants from the WGS data, particularly 3000 variants from each trait, which is then sufficient to improve the prediction accuracy for all traits. When 12,000 pre-selected variants (3000 variants from each trait) were added to the 50 k genotypes, the prediction accuracies increased by 9.9, 9.2, 6.4, and 4.7% for backfat thickness, carcass weight, longissimus muscle area, and marbling score compared to the regular 50 k SNP panel, respectively. In terms of prediction bias, regression coefficients for all sets of genotypes in all traits were close to 1, indicating an unbiased prediction. The strategy used to select variants based on functional annotation did not show a clear advantage compared to using whole-genome. Nonetheless, such pre-selected SNPs from the IGR region gave the highest improvement in prediction accuracy among genomic regions and the values were close to those obtained using the WGS data for all traits. We concluded that additional gain in prediction accuracy when using pre-selected variants appears to be trait-dependent, and using WGS data remained more accurate compared to using a specific genomic region.
Collapse
Affiliation(s)
- Bryan Irvine M Lopez
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Narae An
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Krishnamoorthy Srikanth
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Seunghwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Jae-Don Oh
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, South Korea
| | - Dong-Hyun Shin
- Department of Agricultural Convergence Technology, Chonbuk National University, Jeonju, South Korea
| | - Woncheoul Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Han-Ha Chai
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Jong-Eun Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| |
Collapse
|
22
|
Paiva JT, Peixoto MGCD, Bruneli FAT, Alvarenga AB, Oliveira HR, Silva AA, Silva DA, Veroneze R, Silva FF, Lopes PS. Genetic parameters, genome-wide association and gene networks for milk and reproductive traits in Guzerá cattle. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104273] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
23
|
Calleja-Rodriguez A, Pan J, Funda T, Chen Z, Baison J, Isik F, Abrahamsson S, Wu HX. Evaluation of the efficiency of genomic versus pedigree predictions for growth and wood quality traits in Scots pine. BMC Genomics 2020; 21:796. [PMID: 33198692 PMCID: PMC7667760 DOI: 10.1186/s12864-020-07188-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 10/26/2020] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Genomic selection (GS) or genomic prediction is a promising approach for tree breeding to obtain higher genetic gains by shortening time of progeny testing in breeding programs. As proof-of-concept for Scots pine (Pinus sylvestris L.), a genomic prediction study was conducted with 694 individuals representing 183 full-sib families that were genotyped with genotyping-by-sequencing (GBS) and phenotyped for growth and wood quality traits. 8719 SNPs were used to compare different genomic with pedigree prediction models. Additionally, four prediction efficiency methods were used to evaluate the impact of genomic breeding value estimations by assigning diverse ratios of training and validation sets, as well as several subsets of SNP markers. RESULTS Genomic Best Linear Unbiased Prediction (GBLUP) and Bayesian Ridge Regression (BRR) combined with expectation maximization (EM) imputation algorithm showed slightly higher prediction efficiencies than Pedigree Best Linear Unbiased Prediction (PBLUP) and Bayesian LASSO, with some exceptions. A subset of approximately 6000 SNP markers, was enough to provide similar prediction efficiencies as the full set of 8719 markers. Additionally, prediction efficiencies of genomic models were enough to achieve a higher selection response, that varied between 50-143% higher than the traditional pedigree-based selection. CONCLUSIONS Although prediction efficiencies were similar for genomic and pedigree models, the relative selection response was doubled for genomic models by assuming that earlier selections can be done at the seedling stage, reducing the progeny testing time, thus shortening the breeding cycle length roughly by 50%.
Collapse
Affiliation(s)
- Ainhoa Calleja-Rodriguez
- Skogforsk (The Forestry Research Institute of Sweden), Box 3, Sävar, SE 918 21 Sweden
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - Jin Pan
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - Tomas Funda
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- Department of Genetics and Breeding, Faculty of Agrobiology and Natural Resources, Czech University of Life Sciences Prague, Prague, 165 00 Czech Republic
- Key Laboratory of Forest Genetics and Biotechnology, Nanjing Forestry University, Nanjing, 210037 China
| | - Zhiqiang Chen
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - John Baison
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- RAGT Seeds, Essex, CB 101TA United Kingdom
| | - Fikret Isik
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC 27695 USA
| | - Sara Abrahamsson
- Skogforsk (The Forestry Research Institute of Sweden), Box 3, Sävar, SE 918 21 Sweden
| | - Harry X. Wu
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083 China
- National Research Collection Australia, CSIRO, Canberra, ACT 2601 Australia
| |
Collapse
|
24
|
Tribout T, Croiseau P, Lefebvre R, Barbat A, Boussaha M, Fritz S, Boichard D, Hoze C, Sanchez MP. Confirmed effects of candidate variants for milk production, udder health, and udder morphology in dairy cattle. Genet Sel Evol 2020; 52:55. [PMID: 32998688 PMCID: PMC7529513 DOI: 10.1186/s12711-020-00575-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 09/18/2020] [Indexed: 11/10/2022] Open
Abstract
Background Over the last years, genome-wide association studies (GWAS) based on imputed whole-genome sequences (WGS) have been used to detect quantitative trait loci (QTL) and highlight candidate genes for important traits. However, in general this approach does not allow to validate the effects of candidate mutations or determine if they are truly causative for the trait(s) in question. To address these questions, we applied a two-step, within-breed GWAS approach on 15 traits (5 linked with milk production, 2 with udder health, and 8 with udder morphology) in Montbéliarde (MON), Normande (NOR), and Holstein (HOL) cattle. We detected the most-promising candidate variants (CV) using imputed WGS of 2515 MON, 2203 NOR, and 6321 HOL bulls, and validated their effects in three younger populations of 23,926 MON, 9400 NOR, and 51,977 HOL cows. Results Bull sequence-based GWAS detected 84 QTL: 13, 10, and 30 for milk production traits; 3, 0, and 2 for somatic cell score (SCS); and 8, 2 and 16 for udder morphology traits, in MON, NOR, and HOL respectively. Five genomic regions with effects on milk production traits were shared among the three breeds whereas six (2 for production and 4 for udder morphology and health traits) had effects in two breeds. In 80 of these QTL, 855 CV were highlighted based on the significance of their effects and functional annotation. The subsequent GWAS on MON, NOR, and HOL cows validated 8, 9, and 23 QTL for production traits; 0, 0, and 1 for SCS; and 4, 1, and 8 for udder morphology traits, respectively. In 47 of the 54 confirmed QTL, the CV identified in bulls had more significant effects than single nucleotide polymorphisms (SNPs) from the standard 50K chip. The best CV for each validated QTL was located in a gene that was functionally related to production (36 QTL) or udder (9 QTL) traits. Conclusions Using this two-step GWAS approach, we identified and validated 54 QTL that included CV mostly located within functional candidate genes and explained up to 6.3% (udder traits) and 37% (production traits) of the genetic variance of economically important dairy traits. These CV are now included in the chip used to evaluate French dairy cattle and can be integrated into routine genomic evaluation.
Collapse
Affiliation(s)
- Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Pascal Croiseau
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Rachel Lefebvre
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Anne Barbat
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Mekki Boussaha
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Sébastien Fritz
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France.,Allice, 75012, Paris, France
| | - Didier Boichard
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Chris Hoze
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France.,Allice, 75012, Paris, France
| | - Marie-Pierre Sanchez
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France.
| |
Collapse
|
25
|
Liu A, Lund MS, Boichard D, Karaman E, Guldbrandtsen B, Fritz S, Aamand GP, Nielsen US, Sahana G, Wang Y, Su G. Weighted single-step genomic best linear unbiased prediction integrating variants selected from sequencing data by association and bioinformatics analyses. Genet Sel Evol 2020; 52:48. [PMID: 32799816 PMCID: PMC7429790 DOI: 10.1186/s12711-020-00568-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Accepted: 08/07/2020] [Indexed: 11/30/2022] Open
Abstract
Background Sequencing data enable the detection of causal loci or single nucleotide polymorphisms (SNPs) highly linked to causal loci to improve genomic prediction. However, until now, studies on integrating such SNPs using a single-step genomic best linear unbiased prediction (ssGBLUP) model are scarce. We investigated the integration of sequencing SNPs selected by association (1262 SNPs) and bioinformatics (2359 SNPs) analyses into the currently used 54K-SNP chip, using three ssGBLUP models which make different assumptions on the distribution of SNP effects: a basic ssGBLUP model, a so-called featured ssGBLUP (ssFGBLUP) model that considered selected sequencing SNPs as a feature genetic component, and a weighted ssGBLUP (ssWGBLUP) model in which the genomic relationship matrix was weighted by the SNP variances estimated from a Bayesian whole-genome regression model, with every 1, 30, or 100 adjacent SNPs within a chromosome region sharing the same variance. We used data on milk production and female fertility in Danish Jersey. In total, 15,823 genotyped and 528,981 non-genotyped females born between 1990 and 2013 were used as reference population and 7415 genotyped females and 33,040 non-genotyped females born between 2014 and 2016 were used as validation population. Results With basic ssGBLUP, integrating SNPs selected from sequencing data improved prediction reliabilities for milk and protein yields, but resulted in limited or no improvement for fat yield and female fertility. Model performances depended on the SNP set used. When using ssWGBLUP with the 54K SNPs, reliabilities for milk and protein yields improved by 0.028 for genotyped animals and by 0.006 for non-genotyped animals compared with ssGBLUP. However, with the SNP set that included SNPs selected from sequencing data, no statistically significant difference in prediction reliability was observed between the three ssGBLUP models. Conclusions In summary, when using 54K SNPs, a ssWGBLUP model with a common weight on the SNPs in a given region is a feasible approach for single-trait genetic evaluation. Integrating relevant SNPs selected from sequencing data into the standard SNP chip can improve the reliability of genomic prediction. Based on such SNP data, a basic ssGBLUP model was suggested since no significant improvement was observed from using alternative models such as ssWGBLUP and ssFGBLUP.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Bernt Guldbrandtsen
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | | | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
26
|
Gualdrón Duarte JL, Gori AS, Hubin X, Lourenco D, Charlier C, Misztal I, Druet T. Performances of Adaptive MultiBLUP, Bayesian regressions, and weighted-GBLUP approaches for genomic predictions in Belgian Blue beef cattle. BMC Genomics 2020; 21:545. [PMID: 32762654 PMCID: PMC7430838 DOI: 10.1186/s12864-020-06921-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 07/17/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genomic selection has been successfully implemented in many livestock and crop species. The genomic best linear unbiased predictor (GBLUP) approach, assigning equal variance to all SNP effects, is one of the reference methods. When large-effect variants contribute to complex traits, it has been shown that genomic prediction methods that assign a higher variance to subsets of SNP effects can achieve higher prediction accuracy. We herein compared the efficiency of several such approaches, including the Adaptive MultiBLUP (AM-BLUP) that uses local genomic relationship matrices (GRM) to automatically identify and weight genomic regions with large effects, to predict genetic merit in Belgian Blue beef cattle. RESULTS We used a population of approximately 10,000 genotyped cows and their phenotypes for 14 traits, mostly related to muscular development and body dimensions. According to the trait, we found that 4 to 25% of the genetic variance could be associated with 2 to 12 genomic regions harbouring large-effect variants. Noteworthy, three previously identified recessive deleterious variants presented heterozygote advantage and were among the most significant SNPs for several traits. The AM-BLUP resulted in increased reliability of genomic predictions compared to GBLUP (+ 2%), but Bayesian methods proved more efficient (+ 3%). Overall, the reliability gains remained thus limited although higher gains were observed for skin thickness, a trait affected by two genomic regions having particularly large effects. Higher accuracies than those from the original AM-BLUP were achieved when applying the Bayesian Sparse Linear Mixed Model to pre-select groups of SNPs with large effects and subsequently use their estimated variance to build a weighted GRM. Finally, the single-step GBLUP performed best and could be further improved (+ 3% prediction accuracy) by using these weighted GRM. CONCLUSIONS The AM-BLUP is an attractive method to automatically identify and weight genomic regions with large effects on complex traits. However, the method was less accurate than Bayesian methods. Overall, weighted methods achieved modest accuracy gains compared to GBLUP. Nevertheless, the computational efficiency of the AM-BLUP might be valuable at higher marker density, including with whole-genome sequencing data. Furthermore, weighted GRM are particularly useful to account for large variance loci in the single-step GBLUP.
Collapse
Affiliation(s)
- José Luis Gualdrón Duarte
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium.
| | - Ann-Stephan Gori
- Innovation Department, Elevéo asbl and Inovéo, Awé Group, 5590, Ciney, Belgium
| | - Xavier Hubin
- Innovation Department, Elevéo asbl and Inovéo, Awé Group, 5590, Ciney, Belgium
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, 425 River Rd, Athens, GA, 30602, USA
| | - Carole Charlier
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, 425 River Rd, Athens, GA, 30602, USA
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R, 11 Avenue de l'Hôpital (B34), University of Liège, 4000, Liège, Belgium
| |
Collapse
|
27
|
Lourenco D, Legarra A, Tsuruta S, Masuda Y, Aguilar I, Misztal I. Single-Step Genomic Evaluations from Theory to Practice: Using SNP Chips and Sequence Data in BLUPF90. Genes (Basel) 2020; 11:E790. [PMID: 32674271 PMCID: PMC7397237 DOI: 10.3390/genes11070790] [Citation(s) in RCA: 65] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/03/2020] [Accepted: 07/06/2020] [Indexed: 11/16/2022] Open
Abstract
Single-step genomic evaluation became a standard procedure in livestock breeding, and the main reason is the ability to combine all pedigree, phenotypes, and genotypes available into one single evaluation, without the need of post-analysis processing. Therefore, the incorporation of data on genotyped and non-genotyped animals in this method is straightforward. Since 2009, two main implementations of single-step were proposed. One is called single-step genomic best linear unbiased prediction (ssGBLUP) and uses single nucleotide polymorphism (SNP) to construct the genomic relationship matrix; the other is the single-step Bayesian regression (ssBR), which is a marker effect model. Under the same assumptions, both models are equivalent. In this review, we focus solely on ssGBLUP. The implementation of ssGBLUP into the BLUPF90 software suite was done in 2009, and since then, several changes were made to make ssGBLUP flexible to any model, number of traits, number of phenotypes, and number of genotyped animals. Single-step GBLUP from the BLUPF90 software suite has been used for genomic evaluations worldwide. In this review, we will show theoretical developments and numerical examples of ssGBLUP using SNP data from regular chips to sequence data.
Collapse
Affiliation(s)
- Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Andres Legarra
- Institut National de la Recherche Agronomique, UMR1388 GenPhySE, 31326 Castanet Tolosan, France;
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Yutaka Masuda
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| | - Ignacio Aguilar
- Instituto Nacional de Investigación Agropecuaria (INIA), 11500 Montevideo, Uruguay;
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA; (S.T.); (Y.M.); (I.M.)
| |
Collapse
|
28
|
Liu A, Lund MS, Boichard D, Mao X, Karaman E, Fritz S, Aamand GP, Wang Y, Su G. Imputation for sequencing variants preselected to a customized low-density chip. Sci Rep 2020; 10:9524. [PMID: 32533087 PMCID: PMC7293337 DOI: 10.1038/s41598-020-66523-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 05/19/2020] [Indexed: 12/27/2022] Open
Abstract
The sequencing variants preselected from association analyses and bioinformatics analyses could improve genomic prediction. In this study, the imputation of sequencing SNPs preselected from major dairy breeds in Denmark-Finland-Sweden (DFS) and France (FRA) was investigated for both contemporary animals and old bulls in Danish Jersey. For contemporary animals, a two-step imputation which first imputed to 54 K and then to 54 K + DFS + FRA SNPs achieved highest accuracy. Correlations between observed and imputed genotypes were 91.6% for DFS SNPs and 87.6% for FRA SNPs, while concordance rates were 96.6% for DFS SNPs and 93.5% for FRA SNPs. The SNPs with lower minor allele frequency (MAF) tended to have lower correlations but higher concordance rates. For old bulls, imputation for DFS and FRA SNPs were relatively accurate even for bulls without progenies (correlations higher than 97.2% and concordance rates higher than 98.4%). For contemporary animals, given limited imputation accuracy of preselected sequencing SNPs especially for SNPs with low MAF, it would be a good strategy to directly genotype preselected sequencing SNPs with a customized SNP chip. For old bulls, given high imputation accuracy for preselected sequencing SNPs with all MAF ranges, it would be unnecessary to re-genotype preselected sequencing SNPs.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France
| | - Xiaowei Mao
- Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 100044, Beijing, P.R. China.,CAS Center for Excellence in Life and Paleoenvironment, 100044, Beijing, P.R. China
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| |
Collapse
|
29
|
Bayesian modeling reveals host genetics associated with rumen microbiota jointly influence methane emission in dairy cows. ISME JOURNAL 2020; 14:2019-2033. [PMID: 32366970 PMCID: PMC7368015 DOI: 10.1038/s41396-020-0663-x] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 03/25/2020] [Accepted: 04/15/2020] [Indexed: 12/11/2022]
Abstract
Reducing methane emissions from livestock production is of great importance for the sustainable management of the Earth’s environment. Rumen microbiota play an important role in producing biogenic methane. However, knowledge of how host genetics influences variation in ruminal microbiota and their joint effects on methane emission is limited. We analyzed data from 750 dairy cows, using a Bayesian model to simultaneously assess the impact of host genetics and microbiota on host methane emission. We estimated that host genetics and microbiota explained 24% and 7%, respectively, of variation in host methane levels. In this Bayesian model, one bacterial genus explained up to 1.6% of the total microbiota variance. Further analysis was performed by a mixed linear model to estimate variance explained by host genomics in abundances of microbial genera and operational taxonomic units (OTU). Highest estimates were observed for a bacterial OTU with 33%, for an archaeal OTU with 26%, and for a microbial genus with 41% heritability. However, after multiple testing correction for the number of genera and OTUs modeled, none of the effects remained significant. We also used a mixed linear model to test effects of individual host genetic markers on microbial genera and OTUs. In this analysis, genetic markers inside host genes ABS4 and DNAJC10 were found associated with microbiota composition. We show that a Bayesian model can be utilized to model complex structure and relationship between microbiota simultaneously and their interaction with host genetics on methane emission. The host genome explains a significant fraction of between-individual variation in microbial abundance. Individual microbial taxonomic groups each only explain a small amount of variation in methane emissions. The identification of genes and genetic markers suggests that it is possible to design strategies for breeding cows with desired microbiota composition associated with phenotypes.
Collapse
|