1
|
Forutan M, Engle BN, Chamberlain AJ, Ross EM, Nguyen LT, D'Occhio MJ, Snr AC, Kho EA, Fordyce G, Speight S, Goddard ME, Hayes BJ. Genome-wide association and expression quantitative trait loci in cattle reveals common genes regulating mammalian fertility. Commun Biol 2024; 7:724. [PMID: 38866948 PMCID: PMC11169601 DOI: 10.1038/s42003-024-06403-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 05/31/2024] [Indexed: 06/14/2024] Open
Abstract
Most genetic variants associated with fertility in mammals fall in non-coding regions of the genome and it is unclear how these variants affect fertility. Here we use genome-wide association summary statistics for Heifer puberty (pubertal or not at 600 days) from 27,707 Bos indicus, Bos taurus and crossbred cattle; multi-trait GWAS signals from 2119 indicine cattle for four fertility traits, including days to calving, age at first calving, pregnancy status, and foetus age in weeks (assessed by rectal palpation of the foetus); and expression quantitative trait locus for whole blood from 489 indicine cattle, to identify 87 putatively functional genes affecting cattle fertility. Our analysis reveals a significant overlap between the set of cattle and previously reported human fertility-related genes, impling the existence of a shared pool of genes that regulate fertility in mammals. These findings are crucial for developing approaches to improve fertility in cattle and potentially other mammals.
Collapse
Affiliation(s)
- Mehrnush Forutan
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia.
| | - Bailey N Engle
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
- USDA,ARS, U.S. Meat Animal Research Center, Clay Center, NE, 68933, USA
| | - Amanda J Chamberlain
- Agriculture Victoria, Centre for AgriBiosciences, Bundoora, VIC, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Elizabeth M Ross
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Loan T Nguyen
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Michael J D'Occhio
- School of Life and Environmental Sciences, Faculty of Science, The University of Sydney, Sydney, NSW, Australia
| | - Alf Collins Snr
- Collins Belah Valley Brahman Stud, Marlborough, 4705, QLD, Australia
| | - Elise A Kho
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Geoffry Fordyce
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | | | - Michael E Goddard
- Agriculture Victoria, Centre for AgriBiosciences, Bundoora, VIC, Australia
- University of Melbourne, Melbourne, Australia
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| |
Collapse
|
2
|
Stephen MA, Burke CR, Steele N, Pryce JE, Meier S, Amer PR, Phyn CVC, Garrick DJ. Genome-wide association study of age at puberty and its (co)variances with fertility and stature in growing and lactating Holstein-Friesian dairy cattle. J Dairy Sci 2024; 107:3700-3715. [PMID: 38135043 DOI: 10.3168/jds.2023-23963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 11/24/2023] [Indexed: 12/24/2023]
Abstract
Reproductive performance is a key determinant of cow longevity in a pasture-based, seasonal dairy system. Unfortunately, direct fertility phenotypes such as intercalving interval or pregnancy rate tend to have low heritabilities and occur relatively late in an animal's life. In contrast, age at puberty (AGEP) is a moderately heritable, early-in-life trait that may be estimated using an animal's age at first measured elevation in blood plasma progesterone (AGEP4) concentrations. Understanding the genetic architecture of AGEP4 in addition to genetic relationships between AGEP4 and fertility traits in lactating cows is important, as is its relationship with body size in the growing animal. Thus, the objectives of this research were 3-fold. First, to estimate the genetic and phenotypic (co)variances between AGEP4 and subsequent fertility during first and second lactations. Second, to quantify the associations between AGEP4 and height, length, and BW measured when animals were approximately 11 mo old (standard deviation = 0.5). Third, to identify genomic regions that are likely to be associated with variation in AGEP4. We measured AGEP4, height, length, and BW in approximately 5,000 Holstein-Friesian or Holstein-Friesian × Jersey crossbred yearling heifers across 54 pasture-based herds managed in seasonal calving farm systems. We also obtained calving rate (CR42, success or failure to calve within the first 42 d of the seasonal calving period), breeding rate (PB21, success or failure to be presented for breeding within the first 21 d of the seasonal breeding period) and pregnancy rate (PR42, success or failure to become pregnant within the first 42 d of the seasonal breeding period) phenotypes from their first and second lactations. The animals were genotyped using the Weatherby's Versa 50K SNP array (Illumina, San Diego, CA). The estimated heritabilities of AGEP4, height, length, and BW were 0.34 (90% credibility interval [CRI]: 0.30, 0.37), 0.28 (90% CRI: 0.25, 0.31), 0.21 (90% CRI: 0.18, 0.23), and 0.33 (90% CRI: 0.30, 0.36), respectively. In contrast, the heritabilities of CR42, PB21 and PR42 were all <0.05 in both first and second lactations. The genetic correlations between AGEP4 and these fertility traits were generally moderate, ranging from 0.11 to 0.60, whereas genetic correlations between AGEP4 and yearling body-conformation traits ranged from 0.02 to 0.28. Our GWAS highlighted a genomic window on chromosome 5 that was strongly associated with variation in AGEP4. We also identified 4 regions, located on chromosomes 14, 6, 1, and 11 (in order of decreasing importance), that exhibited suggestive associations with AGEP4. Our results show that AGEP4 is a reasonable predictor of estimated breeding values for fertility traits in lactating cows. Although the GWAS provided insights into genetic mechanisms underpinning AGEP4, further work is required to test genomic predictions of fertility that use this information.
Collapse
Affiliation(s)
- M A Stephen
- DairyNZ Ltd., Hamilton 3240, New Zealand; AL Rae Centre for Genetics and Breeding-Massey University, Ruakura, Hamilton 3214, New Zealand.
| | - C R Burke
- DairyNZ Ltd., Hamilton 3240, New Zealand
| | - N Steele
- DairyNZ Ltd., Hamilton 3240, New Zealand
| | - J E Pryce
- Agriculture Victoria Research, AgriBio, Centre for AgriBioscience, Bundoora, Victoria 3083, Australia; School of Applied Systems Biology, La Trobe University, Bundoora, Victoria 3083, Australia
| | - S Meier
- DairyNZ Ltd., Hamilton 3240, New Zealand
| | - P R Amer
- AbacusBio, 442 Moray Place, Dunedin 9016, New Zealand
| | - C V C Phyn
- DairyNZ Ltd., Hamilton 3240, New Zealand
| | - D J Garrick
- AL Rae Centre for Genetics and Breeding-Massey University, Ruakura, Hamilton 3214, New Zealand
| |
Collapse
|
3
|
Ma H, Li H, Ge F, Zhao H, Zhu B, Zhang L, Gao H, Xu L, Li J, Wang Z. Improving Genomic Predictions in Multi-Breed Cattle Populations: A Comparative Analysis of BayesR and GBLUP Models. Genes (Basel) 2024; 15:253. [PMID: 38397242 PMCID: PMC10887749 DOI: 10.3390/genes15020253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 02/09/2024] [Accepted: 02/16/2024] [Indexed: 02/25/2024] Open
Abstract
Numerous studies have shown that combining populations from similar or closely related genetic breeds improves the accuracy of genomic predictions (GP). Extensive experimentation with diverse Bayesian and genomic best linear unbiased prediction (GBLUP) models have been developed to explore multi-breed genomic selection (GS) in livestock, ultimately establishing them as successful approaches for predicting genomic estimated breeding value (GEBV). This study aimed to assess the effectiveness of using BayesR and GBLUP models with linkage disequilibrium (LD)-weighted genomic relationship matrices (GRMs) for genomic prediction in three different beef cattle breeds to identify the best approach for enhancing the accuracy of multi-breed genomic selection in beef cattle. Additionally, a comparison was conducted to evaluate the predictive precision of different marker densities and genetic correlations among the three breeds of beef cattle. The GRM between Yunling cattle (YL) and other breeds demonstrated modest affinity and highlighted a notable genetic concordance of 0.87 between Chinese Wagyu (WG) and Huaxi (HX) cattle. In the within-breed GS, BayesR demonstrated an advantage over GBLUP. The prediction accuracies for HX cattle using the BayesR model were 0.52 with BovineHD BeadChip data (HD) and 0.46 with whole-genome sequencing data (WGS). In comparison to the GBLUP model, the accuracy increased by 26.8% for HD data and 9.5% for WGS data. For WG and YL, BayesR doubled the within-breed prediction accuracy to 14.3% from 7.1%, outperforming GBLUP across both HD and WGS datasets. Moreover, analyzing multiple breeds using genomic selection showed that BayesR consistently outperformed GBLUP in terms of predictive accuracy, especially when using WGS. For instance, in a mixed reference population of HX and WG, BayesR achieved a significant accuracy of 0.53 using WGS for HX, which was a substantial enhancement over the accuracies obtained with GBLUP models. The research further highlights the benefit of including various breeds in the reference group, leading to enhanced accuracy in predictions and emphasizing the importance of comprehensive genomic selection methods. Our research findings indicate that BayesR exhibits superior performance compared to GBLUP in multi-breed genomic prediction accuracy, achieving a maximum improvement of 33.3%, especially in genetically diverse breeds. The improvement can be attributed to the effective utilization of higher single nucleotide polymorphism (SNP) marker density by BayesR, resulting in enhanced prediction accuracy. This evidence conclusively demonstrates the significant impact of BayesR on enhancing genomic predictions in diverse cattle populations, underscoring the crucial role of genetic relatedness in selection methodologies. In parallel, subsequent studies should focus on refining GRM and exploring alternative models for GP.
Collapse
Affiliation(s)
- Haoran Ma
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Hongwei Li
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB 510632, Canada
| | - Fei Ge
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Huqiong Zhao
- College of Animal Science, Shanxi Agricultural University, Jinzhong 030801, China
| | - Bo Zhu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Lupei Zhang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Huijiang Gao
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Lingyang Xu
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Junya Li
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| | - Zezhao Wang
- Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China; (H.M.); (H.L.); (L.Z.); (J.L.)
| |
Collapse
|
4
|
Hayes BJ, Duff CJ, Hine BC, Mahony TJ. Genomic estimated breeding values for bovine respiratory disease resistance in Angus feedlot cattle. J Anim Sci 2024; 102:skae113. [PMID: 38659364 PMCID: PMC11107116 DOI: 10.1093/jas/skae113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 04/23/2024] [Indexed: 04/26/2024] Open
Abstract
Bovine respiratory disease (BRD) causes major losses in feedlot cattle worldwide. A genetic component for BRD resistance in feedlot cattle and calves has been reported in a number of studies, with heritabilities ranging from 0.04 to 0.2. These results suggest selection could be used to reduce the incidence of BRD. Genomic selection could be an attractive approach for breeding for BRD resistance, given the phenotype is not likely to be recorded on breeding animals. In this study, we derived GEBVs for BRD resistance and assessed their accuracy in a reasonably large data set recorded for feedlot treatment of BRD (1213 Angus steers, in two feedlots). In fivefold cross validation, genomic predictions were moderately accurate (0.23 ± 0.01) when a BayesR approach was used. Expansion of this approach to include more animals and a diversity of breeds is recommended to successfully develop a GEBV for BRD resistance in feedlots for the beef industry.
Collapse
Affiliation(s)
- Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072, Australia
| | | | - Bradley C Hine
- CSIRO, F.D. McMaster Laboratory, Armidale, NSW 2350, Australia
| | - Timothy J Mahony
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD 4072, Australia
| |
Collapse
|
5
|
Déru V, Tiezzi F, VanRaden PM, Lozada-Soto EA, Toghiani S, Maltecca C. Imputation accuracy from low- to medium-density SNP chips for US crossbred dairy cattle. J Dairy Sci 2024; 107:398-411. [PMID: 37641298 DOI: 10.3168/jds.2023-23250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 06/16/2023] [Indexed: 08/31/2023]
Abstract
This study aimed at evaluating the quality of imputation accuracy (IA) by marker (IAm) and by individual (IAi) in US crossbred dairy cattle. Holstein × Jersey crossbreds were used to evaluate IA from a low- (7K) to a medium-density (50K) SNP chip. Crossbred animals, as well as their sires (53), dams (77), and maternal grandsires (63), were all genotyped with a 78K SNP chip. Seven different scenarios of reference populations were tested, in which some scenarios used different family relationships and others added random unrelated purebred and crossbred individuals to those different family relationship scenarios. The same scenarios were tested on Holstein and Jersey purebred animals to compare these outcomes against those attained in crossbred animals. The genotype imputation was performed with findhap (version 4) software (VanRaden, 2015). There were no significant differences in IA results depending on whether the sire of imputed individuals was Holstein and the dam was Jersey, or vice versa. The IA increased significantly with the addition of related individuals in the reference population, from 86.70 ± 0.06% when only sires or dams were included in the reference population to 90.09 ± 0.06% when sire (S), dam (D), and maternal grandsire genomic data were combined in the reference population. In all scenarios including related individuals in the reference population, IAm and IAi were significantly superior in purebred Jersey and Holstein animals than in crossbreds, ranging from 90.75 ± 0.06 to 94.02 ± 0.06%, and from 90.88 ± 0.11 to 94.04 ± 0.10%, respectively. Additionally, a scenario called SPB+DLD(where PB indicates purebread and LD indicates low density), similar to the genomic evaluations performed on US crossbred dairy, was tested. In this scenario, the information from the 5 evaluated breeds (Ayrshire, Brown Swiss, Guernsey, Holstein, and Jersey) genotyped with a 50K SNP chip and genomic information from the dams genotyped with a 7K SNP chip were combined in the reference population, and the IAm and IAi were 80.87 ± 0.06% and 80.85 ± 0.08%, respectively. Adding randomly nonrelated genotyped individuals in the reference population reduced IA for both purebred and crossbred cows, except for scenario SPB+DLD, where adding crossbreds to the reference population increased IA values. Our findings demonstrate that IA for US Holstein × Jersey crossbred ranged from 85 to 90%, and emphasize the significance of designing and defining the reference population for improved IA.
Collapse
Affiliation(s)
- Vanille Déru
- Department of Animal Science, North Carolina State University, Raleigh, NC 27607.
| | - Francesco Tiezzi
- Department of Agriculture, Food, Environment and Forestry, University of Florence, Florence, 50144, Italy
| | - Paul M VanRaden
- USDA, Agricultural Research Service, Animal Genomics and Improvement Laboratory, Beltsville, MD 20705-2350
| | | | - Sajjad Toghiani
- USDA, Agricultural Research Service, Animal Genomics and Improvement Laboratory, Beltsville, MD 20705-2350
| | - Christian Maltecca
- Department of Animal Science, North Carolina State University, Raleigh, NC 27607
| |
Collapse
|
6
|
Hayes BJ, Copley J, Dodd E, Ross EM, Speight S, Fordyce G. Multi-breed genomic evaluation for tropical beef cattle when no pedigree information is available. Genet Sel Evol 2023; 55:71. [PMID: 37845626 PMCID: PMC10578004 DOI: 10.1186/s12711-023-00847-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 10/04/2023] [Indexed: 10/18/2023] Open
Abstract
BACKGROUND It has been challenging to implement genomic selection in multi-breed tropical beef cattle populations. If commercial (often crossbred) animals could be used in the reference population for these genomic evaluations, this could allow for very large reference populations. In tropical beef systems, such animals often have no pedigree information. Here we investigate potential models for such data, using marker heterozygosity (to model heterosis) and breed composition derived from genetic markers, as covariates in the model. Models treated breed effects as either fixed or random, and included genomic best linear unbiased prediction (GBLUP) and BayesR. A tropically-adapted beef cattle dataset of 29,391 purebred, crossbred and composite commercial animals was used to evaluate the models. RESULTS Treating breed effects as random, in an approach analogous to genetic groups allowed partitioning of the genetic variance into within-breed and across breed-components (even with a large number of breeds), and estimation of within-breed and across-breed genomic estimated breeding values (GEBV). We demonstrate that moderately-accurate (0.30-0.43) GEBV can be calculated using these models. Treating breed effects as random gave more accurate GEBV than treating breed as fixed. A simple GBLUP model where no breed effects were fitted gave the same accuracy (and correlations of GEBV very close to 1) as a model where GEBV for within-breed and the GEBV for (random) across-breed effects were included. When GEBV were predicted for herds with no data in the reference population, BayesR resulted in the highest accuracy, with 3% accuracy improvement averaged across traits, especially when the validation population was less related to the reference population. Estimates of heterosis from our models were in line with previous estimates from beef cattle. A method for estimating the number of effective breed comparisons for each breed combination accumulated across contemporary groups is presented. CONCLUSIONS When no pedigree is available, breed composition and heterosis for inclusion in multi-breed genomic evaluation can be estimated from genotypes. When GEBV were predicted for herds with no data in the reference population, BayesR resulted in the highest accuracy.
Collapse
Affiliation(s)
- Ben J Hayes
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia.
| | - James Copley
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia
| | - Elsie Dodd
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia
| | - Elizabeth M Ross
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia
| | - Shannon Speight
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia
- BlackBox Co, Mareeba, QLD, 4880, Australia
| | - Geoffry Fordyce
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, University of Queensland, St Lucia, QLD, 4067, Australia
| |
Collapse
|
7
|
Herry F, Hérault F, Lecerf F, Lagoutte L, Doublet M, Picard-Druet D, Bardou P, Varenne A, Burlot T, Le Roy P, Allais S. Restriction site-associated DNA sequencing technologies as an alternative to low-density SNP chips for genomic selection: a simulation study in layer chickens. BMC Genomics 2023; 24:271. [PMID: 37208589 DOI: 10.1186/s12864-023-09321-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 04/18/2023] [Indexed: 05/21/2023] Open
Abstract
BACKGROUND To reduce the cost of genomic selection, a low-density (LD) single nucleotide polymorphism (SNP) chip can be used in combination with imputation for genotyping selection candidates instead of using a high-density (HD) SNP chip. Next-generation sequencing (NGS) techniques have been increasingly used in livestock species but remain expensive for routine use for genomic selection. An alternative and cost-efficient solution is to use restriction site-associated DNA sequencing (RADseq) techniques to sequence only a fraction of the genome using restriction enzymes. From this perspective, use of RADseq techniques followed by an imputation step on HD chip as alternatives to LD chips for genomic selection was studied in a pure layer line. RESULTS Genome reduction and sequencing fragments were identified on reference genome using four restriction enzymes (EcoRI, TaqI, AvaII and PstI) and a double-digest RADseq (ddRADseq) method (TaqI-PstI). The SNPs contained in these fragments were detected from the 20X sequence data of the individuals in our population. Imputation accuracy on HD chip with these genotypes was assessed as the mean correlation between true and imputed genotypes. Several production traits were evaluated using single-step GBLUP methodology. The impact of imputation errors on the ranking of the selection candidates was assessed by comparing a genomic evaluation based on ancestry using true HD or imputed HD genotyping. The relative accuracy of genomic estimated breeding values (GEBVs) was investigated by considering the GEBVs estimated on offspring as a reference. With AvaII or PstI and ddRADseq with TaqI and PstI, more than 10 K SNPs were detected in common with the HD SNP chip, resulting in an imputation accuracy greater than 0.97. The impact of imputation errors on genomic evaluation of the breeders was reduced, with a Spearman correlation greater than 0.99. Finally, the relative accuracy of GEBVs was equivalent. CONCLUSIONS RADseq approaches can be interesting alternatives to low-density SNP chips for genomic selection. With more than 10 K SNPs in common with the SNPs of the HD SNP chip, good imputation and genomic evaluation results can be obtained. However, with real data, heterogeneity between individuals with missing data must be considered.
Collapse
Affiliation(s)
- Florian Herry
- NOVOGEN, 5 rue des Compagnons, Secteur du Vau Ballier, Plédran, 22960, France
- PEGASE, INRAE, Institut Agro, Saint-Gilles, 35590, France
| | | | | | | | | | | | - Philippe Bardou
- SIGENAE, GenPhySE, Université de Toulouse, INRA, ENVT, 24 chemin de Borde-Rouge - Auzeville Tolosane, Castanet Tolosan, 31326, France
| | - Amandine Varenne
- NOVOGEN, 5 rue des Compagnons, Secteur du Vau Ballier, Plédran, 22960, France
| | - Thierry Burlot
- NOVOGEN, 5 rue des Compagnons, Secteur du Vau Ballier, Plédran, 22960, France
| | - Pascale Le Roy
- PEGASE, INRAE, Institut Agro, Saint-Gilles, 35590, France
| | - Sophie Allais
- PEGASE, INRAE, Institut Agro, Saint-Gilles, 35590, France.
| |
Collapse
|
8
|
Kriaridou C, Tsairidou S, Fraslin C, Gorjanc G, Looseley ME, Johnston IA, Houston RD, Robledo D. Evaluation of low-density SNP panels and imputation for cost-effective genomic selection in four aquaculture species. Front Genet 2023; 14:1194266. [PMID: 37252666 PMCID: PMC10213886 DOI: 10.3389/fgene.2023.1194266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Accepted: 04/26/2023] [Indexed: 05/31/2023] Open
Abstract
Genomic selection can accelerate genetic progress in aquaculture breeding programmes, particularly for traits measured on siblings of selection candidates. However, it is not widely implemented in most aquaculture species, and remains expensive due to high genotyping costs. Genotype imputation is a promising strategy that can reduce genotyping costs and facilitate the broader uptake of genomic selection in aquaculture breeding programmes. Genotype imputation can predict ungenotyped SNPs in populations genotyped at a low-density (LD), using a reference population genotyped at a high-density (HD). In this study, we used datasets of four aquaculture species (Atlantic salmon, turbot, common carp and Pacific oyster), phenotyped for different traits, to investigate the efficacy of genotype imputation for cost-effective genomic selection. The four datasets had been genotyped at HD, and eight LD panels (300-6,000 SNPs) were generated in silico. SNPs were selected to be: i) evenly distributed according to physical position ii) selected to minimise the linkage disequilibrium between adjacent SNPs or iii) randomly selected. Imputation was performed with three different software packages (AlphaImpute2, FImpute v.3 and findhap v.4). The results revealed that FImpute v.3 was faster and achieved higher imputation accuracies. Imputation accuracy increased with increasing panel density for both SNP selection methods, reaching correlations greater than 0.95 in the three fish species and 0.80 in Pacific oyster. In terms of genomic prediction accuracy, the LD and the imputed panels performed similarly, reaching values very close to the HD panels, except in the pacific oyster dataset, where the LD panel performed better than the imputed panel. In the fish species, when LD panels were used for genomic prediction without imputation, selection of markers based on either physical or genetic distance (instead of randomly) resulted in a high prediction accuracy, whereas imputation achieved near maximal prediction accuracy independently of the LD panel, showing higher reliability. Our results suggests that, in fish species, well-selected LD panels may achieve near maximal genomic selection prediction accuracy, and that the addition of imputation will result in maximal accuracy independently of the LD panel. These strategies represent effective and affordable methods to incorporate genomic selection into most aquaculture settings.
Collapse
Affiliation(s)
- Christina Kriaridou
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
| | - Smaragda Tsairidou
- Global Academy of Agriculture and Food Systems, University of Edinburgh, Edinburgh, United Kingdom
| | - Clémence Fraslin
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
| | | | | | - Ross D. Houston
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
- Benchmark Genetics, Penicuik, United Kingdom
| | - Diego Robledo
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
9
|
Teng J, Wang D, Zhao C, Zhang X, Chen Z, Liu J, Sun D, Tang H, Wang W, Li J, Mei C, Yang Z, Ning C, Zhang Q. Longitudinal genome-wide association studies of milk production traits in Holstein cattle using whole-genome sequence data imputed from medium-density chip data. J Dairy Sci 2023; 106:2535-2550. [PMID: 36797187 DOI: 10.3168/jds.2022-22277] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 10/20/2022] [Indexed: 02/16/2023]
Abstract
Longitudinal traits, such as milk production traits in dairy cattle, are featured by having phenotypic values at multiple time points, which change dynamically over time. In this study, we first imputed SNP chip (50-100K) data to whole-genome sequence (WGS) data in a Chinese Holstein population consisting of 6,470 cows. The imputation accuracies were 0.88 to 0.97 on average after quality control. We then performed longitudinal GWAS in this population based on a random regression test-day model using the imputed WGS data. The longitudinal GWAS revealed 16, 39, and 75 quantitative trait locus regions associated with milk yield, fat percentage, and protein percentage, respectively. We estimated the 95% confidence intervals (CI) for these quantitative trait locus regions using the logP drop method and identified 581 genes involved in these CI. Further, we focused on the CI that covered or overlapped with only 1 gene or the CI that contained an extremely significant top SNP. Twenty-eight candidate genes were identified in these CI. Most of them have been reported in the literature to be associated with milk production traits, such as DGAT1, HSF1, MGST1, GHR, ABCG2, ADCK5, and CSN1S1. Among the unreported novel genes, some also showed good potential as candidate genes, such as CCSER1, CUX2, SNTB1, RGS7, OSR2, and STK3, and are worth being further investigated. Our study provided not only new insights into the candidate genes for milk production traits, but also a general framework for longitudinal GWAS based on random regression test-day model using WGS data.
Collapse
Affiliation(s)
- Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Zhi Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Jianfeng Liu
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Dongxiao Sun
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Hui Tang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Wenwen Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Jianbin Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, Jinan 250100, China
| | - Cheng Mei
- Dongying Shenzhou AustAsia Modern Dairy Farm Co. Ltd., Dongying 257200, China
| | - Zhangping Yang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| |
Collapse
|
10
|
Forutan M, Lynn A, Aliloo H, Clark SA, McGilchrist P, Polkinghorne R, Hayes BJ. Predicting phenotypes of beef eating quality traits. Front Genet 2023; 14:1089490. [PMID: 36816029 PMCID: PMC9936823 DOI: 10.3389/fgene.2023.1089490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 01/19/2023] [Indexed: 02/04/2023] Open
Abstract
Introduction: Phenotype predictions of beef eating quality for individual animals could be used to allocate animals to longer and more expensive feeding regimes as they enter the feedlot if they are predicted to have higher eating quality, and to sort carcasses into consumer or market value categories. Phenotype predictions can include genetic effects (breed effects, heterosis and breeding value), predicted from genetic markers, as well as fixed effects such as days aged and carcass weight, hump height, ossification, and hormone growth promotant (HGP) status. Methods: Here we assessed accuracy of phenotype predictions for five eating quality traits (tenderness, juiciness, flavour, overall liking and MQ4) in striploins from 1701 animals from a wide variety of backgrounds, including Bos indicus and Bos taurus breeds, using genotypes and simple fixed effects including days aged and carcass weight. The genetic components were predicted based on 709k single nucleotide polymorphism (SNP) using BayesR model, which assumes some markers may have a moderate to large effect. Fixed effects in the prediction included principal components of the genomic relationship matrix, to account for breed effects, heterosis, days aged and carcass weight. Results and Discussion: A model which allowed breed effects to be captured in the SNP effects (e.g., not explicitly fitting these effects) tended to have slightly higher accuracies (0.43-0.50) compared to when these effects were explicitly fitted as fixed effects (0.42-0.49), perhaps because breed effects when explicitly fitted were estimated with more error than when incorporated into the (random) SNP effects. Adding estimates of effects of days aged and carcass weight did not increase the accuracy of phenotype predictions in this particular analysis. The accuracy of phenotype prediction for beef eating quality traits was sufficiently high that such predictions could be useful in predicting eating quality from DNA samples taken from an animal/carcass as it enters the processing plant, to enable optimal supply chain value extraction by sorting product into markets with different quality. The BayesR predictions identified several novel genes potentially associated with beef eating quality.
Collapse
Affiliation(s)
- Mehrnush Forutan
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia,*Correspondence: Mehrnush Forutan,
| | - Andrew Lynn
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Hassan Aliloo
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Samuel A. Clark
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Peter McGilchrist
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | | | - Ben J. Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
11
|
Ortega MS, Bickhart DM, Lockhart KN, Null DJ, Hutchison JL, McClure JC, Cole JB. Truncation of IFT80 causes early embryonic loss in Holstein cattle associated with Holstein haplotype 2. J Dairy Sci 2022; 105:9001-9011. [PMID: 36085107 DOI: 10.3168/jds.2022-21853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Accepted: 05/31/2022] [Indexed: 11/19/2022]
Abstract
Recessive alleles represent genetic risk in populations that have undergone bottleneck events. We present a comprehensive framework for identification and validation of these genetic defects, including haplotype-based detection, variant selection from sequence data, and validation using knockout embryos. Holstein haplotype 2 (HH2), which causes embryonic death, was used to demonstrate the approach. Holstein haplotype 2 was identified using a deficiency-of-homozygotes approach and confirmed to negatively affect conception rate and stillbirths. Five carriers were present in a group of 183 sequenced Holstein bulls selected to maximize the coverage of unique haplotypes. Three variants concordant with haplotype calls were found in HH2: a high-priority frameshift mutation resulting, and 2 low-priority variants (1 synonymous variant, 1 premature stop codon). The frameshift in intraflagellar 80 (IFT80) was confirmed in a separate group of Holsteins from the 1000 Bull Genomes Project that shared no animals with the discovery set. IFT80-null embryos were generated by truncating the IFT80 transcript at exon 2 or 11 using a CRISPR-Cas9 system. Abattoir-derived oocytes were fertilized in vitro, and zygotes were injected at the one-cell stage either with a guide RNA and CAS9 mRNA complex (n = 100) or Cas9 mRNA (control, n = 100) before return to culture, and replicated 3 times. IFT80 is activated at the 8-cell stage, and IFT80-null embryos arrested at this stage of development, which is consistent with data from mouse hypomorphs and HH2 carrier-to-carrier matings. This frameshift in IFT80 on chromosome 1 at 107,172,615 bp (p.Leu381fs) disrupts WNT and hedgehog signaling, and is responsible for the death of homozygous embryos.
Collapse
Affiliation(s)
- M Sofía Ortega
- Division of Animal Sciences, College of Agriculture, Food, and Natural Resources, University of Missouri, Columbia 65211
| | - Derek M Bickhart
- Cell Wall Biology and Utilization Research Laboratory, U.S. Dairy Forage Research Center, Agricultural Research Service, United States Department of Agriculture, Madison, WI 53706
| | - Kelsey N Lockhart
- Division of Animal Sciences, College of Agriculture, Food, and Natural Resources, University of Missouri, Columbia 65211
| | - Daniel J Null
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705-2350
| | - Jana L Hutchison
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705-2350
| | - Jennifer C McClure
- Cell Wall Biology and Utilization Research Laboratory, U.S. Dairy Forage Research Center, Agricultural Research Service, United States Department of Agriculture, Madison, WI 53706
| | - John B Cole
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705-2350.
| |
Collapse
|
12
|
Riggio V, Tijjani A, Callaby R, Talenti A, Wragg D, Obishakin ET, Ezeasor C, Jongejan F, Ogo NI, Aboagye-Antwi F, Toure A, Nzalawahej J, Diallo B, Missohou A, Belem AMG, Djikeng A, Juleff N, Fourie J, Labuschagne M, Madder M, Marshall K, Prendergast JGD, Morrison LJ. Assessment of genotyping array performance for genome-wide association studies and imputation in African cattle. Genet Sel Evol 2022; 54:58. [PMID: 36057548 PMCID: PMC9441065 DOI: 10.1186/s12711-022-00751-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 08/17/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In cattle, genome-wide association studies (GWAS) have largely focused on European or Asian breeds, using genotyping arrays that were primarily designed for European cattle. Because there is growing interest in performing GWAS in African breeds, we have assessed the performance of 23 commercial bovine genotyping arrays for capturing the diversity across African breeds and performing imputation. We used 409 whole-genome sequences (WGS) spanning global cattle breeds, and a real cohort of 2481 individuals (including African breeds) that were genotyped with the Illumina high-density (HD) array and the GeneSeek bovine 50 k array. RESULTS We found that commercially available arrays were not effective in capturing variants that segregate among African indicine animals. Only 6% of these variants in high linkage disequilibrium (LD) (r2 > 0.8) were on the best performing arrays, which contrasts with the 17% and 25% in African and European taurine cattle, respectively. However, imputation from available HD arrays can successfully capture most variants (accuracies up to 0.93), mainly when using a global, not continent-specific, reference panel, which partially reflects the unusually high levels of admixture on the continent. When considering functional variants, the GGPF250 array performed best for tagging WGS variants and imputation. Finally, we show that imputation from low-density arrays can perform almost as well as HD arrays, if a two-stage imputation approach is adopted, i.e. first imputing to HD and then to WGS, which can potentially reduce the costs of GWAS. CONCLUSIONS Our results show that the choice of an array should be based on a balance between the objective of the study and the breed/population considered, with the HD and BOS1 arrays being the best choice for both taurine and indicine breeds when performing GWAS, and the GGPF250 being preferable for fine-mapping studies. Moreover, our results suggest that there is no advantage to using the indicus-specific arrays for indicus breeds, regardless of the objective. Finally, we show that using a reference panel that better represents global bovine diversity improves imputation accuracy, particularly for non-European taurine populations.
Collapse
Affiliation(s)
- Valentina Riggio
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK. .,Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK.
| | - Abdulfatai Tijjani
- Centre for Tropical Livestock Genetics and Health (CTLGH), ILRI Ethiopia, P.O Box 5689, Addis Ababa, Ethiopia
| | - Rebecca Callaby
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.,Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Andrea Talenti
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - David Wragg
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - Emmanuel T Obishakin
- Biotechnology Division, National Veterinary Research Institute, Vom, Plateau State, Nigeria.,Biomedical Research Centre, Ghent University Global Campus, Songdo, Incheon, South Korea
| | - Chukwunonso Ezeasor
- Department of Veterinary Pathology and Microbiology, University of Nigeria, Nsukka, Enugu State, Nigeria
| | - Frans Jongejan
- Department of Veterinary Tropical Diseases, Faculty of Veterinary Science, University of Pretoria, Onderstepoort, South Africa
| | - Ndudim I Ogo
- National Veterinary Research Institute, Vom, Nigeria
| | - Fred Aboagye-Antwi
- Department of Animal Biology and Conservation Sciences, University of Ghana, Accra, Ghana
| | - Alassane Toure
- Laboratoire National d'Appui Au Dévéloppement Agricole(LANADA)/Laboratoire Central Vétérinaire de Bingerville, Bp: 206, Bingerville, Côte d'Ivoire
| | - Jahashi Nzalawahej
- Department of Microbiology, Parasitology and Biotechnology, Sokoine University of Agriculture, Morogoro, Tanzania
| | | | - Ayao Missohou
- Ecole Inter-Etats des Sciences et Médecine Vétérinaires (EISMV) de Dakar, Dakar, Senegal
| | - Adrien M G Belem
- Université Polytechnique de Bobo-Dioulasso (UPB), Bobo -Dioulasso, Burkina Faso
| | - Appolinaire Djikeng
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.,Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Nick Juleff
- Bill & Melinda Gates Foundation, Seattle, WA, USA
| | | | - Michel Labuschagne
- Clinomics, Uitzich Road, Bainsvlei, Bloemfontein, 9338, South Africa.,Clinvet, Uitzich Road, Bainsvlei, Bloemfontein, 9338, South Africa
| | - Maxime Madder
- Clinglobal, B03/04, The Tamarin Commercial Hub, Jacaranda Avenue, Tamarin, 90903, Mauritius
| | - Karen Marshall
- Centre for Tropical Livestock Genetics and Health (CTLGH), ILRI Kenya, P.O. Box 30709, Nairobi, 00100, Kenya.,International Livestock Research Institute, P.O. Box 30709, Nairobi, 00100, Kenya
| | - James G D Prendergast
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.,Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Liam J Morrison
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK.,Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| |
Collapse
|
13
|
Nawaz MY, Bernardes PA, Savegnago RP, Lim D, Lee SH, Gondro C. Evaluation of Whole-Genome Sequence Imputation Strategies in Korean Hanwoo Cattle. Animals (Basel) 2022; 12:ani12172265. [PMID: 36077985 PMCID: PMC9454883 DOI: 10.3390/ani12172265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 08/25/2022] [Accepted: 08/30/2022] [Indexed: 11/29/2022] Open
Abstract
Simple Summary In this study, we evaluated various imputation strategies for the Korean Hanwoo cattle. We observed that a large reference panel consisting of many cattle breeds did not improve the imputation accuracy when compared to a proportionally small purebred Hanwoo reference. This was because the multi-breed reference did not contain animals sufficiently related to the Hanwoo to improve the accuracies and, although not detrimental, in effect, only added to the computational burden of the imputation. Despite the large multi-breed reference, when the Hanwoo were removed from the reference, the imputation accuracies were low. These results suggest additional sequencing efforts are needed for underrepresented breeds, particularly those less genetically related to the main European breeds. Abstract This study evaluated the accuracy of sequence imputation in Hanwoo beef cattle using different reference panels: a large multi-breed reference with no Hanwoo (n = 6269), a much smaller Hanwoo purebred reference (n = 88), and both datasets combined (n = 6357). The target animals were 136 cattle both sequenced and genotyped with the Illumina BovineSNP50 v2 (50K). The average imputation accuracy measured by the Pearson correlation (R) was 0.695 with the multi-breed reference, 0.876 with the purebred Hanwoo, and 0.887 with the combined data; the average concordance rates (CR) were 88.16%, 94.49%, and 94.84%, respectively. The accuracy gains from adding a large multi-breed reference of 6269 samples to only 88 Hanwoo was marginal; however, the concordance rate for the heterozygotes decreased from 85% to 82%, and the concordance rate for fixed SNPs in Hanwoo also decreased from 99.98% to 98.73%. Although the multi-breed panel was large, it was not sufficiently representative of the breed for accurate imputation without the Hanwoo animals. Additionally, we evaluated the value of high-density 700K genotypes (n = 991) as an intermediary step in the imputation process. The imputation accuracy differences were negligible between a single-step imputation strategy from 50K directly to sequence and a two-step imputation approach (50K-700K-sequence). We also observed that imputed sequence data can be used as a reference panel for imputation (mean R = 0.9650, mean CR = 98.35%). Finally, we identified 31 poorly imputed genomic regions in the Hanwoo genome and demonstrated that imputation accuracies were particularly lower at the chromosomal ends.
Collapse
Affiliation(s)
- Muhammad Yasir Nawaz
- Genetics and Genome Sciences Graduate Program, Michigan State University, East Lansing, MI 48824, USA
- Correspondence: (M.Y.N.); (C.G.)
| | - Priscila Arrigucci Bernardes
- Department of Animal Science and Rural Development, Federal University of Santa Catarina, Florianopolis 88034-000, SC, Brazil
| | | | - Dajeong Lim
- Animal Genome & Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea
| | - Seung Hwan Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 305764, Korea
| | - Cedric Gondro
- Department of Animal Science, Michigan State University, East Lansing, MI 48824, USA
- Correspondence: (M.Y.N.); (C.G.)
| |
Collapse
|
14
|
Performance and muscle lipogenesis of calves born to Nellore cows with different residual feed intake classification. PLoS One 2022; 17:e0272236. [PMID: 35905086 PMCID: PMC9337683 DOI: 10.1371/journal.pone.0272236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 07/14/2022] [Indexed: 11/19/2022] Open
Abstract
This study aimed to evaluate relationships among maternal residual feed intake (RFI) with growth performance and expression of genes involved in lipid metabolism in offspring of Nellore cattle. Fifty-three cows classified as negative or positive RFI by genomic prediction were exposed to fixed-time artificial insemination (FTAI) protocols at 2 and 3 years of age using semen from the same bull. In the first year, cows gestated under grazing conditions and nursed their calves in the feedlot. In the second year, the opposite occurred. Cows were weighed every 28 days during pregnancy and calves were weighed at birth and every 28 days until weaning. Ultrasound images were collected from the carcass of cows and calves. Muscle gene expression was evaluated in calves at birth and weaning. Data were analyzed by year considering the fixed effects of RFI class and FTAI protocol for variables measured in cows, and RFI class, FTAI protocol and sex for variables measured in calves. There was no effect of maternal RFI on calves performance in the first year. Lower expression of FABP4 gene and trend towards lower expression of SREBF1 and LPL genes were detected in samples collected after birth from calves born to negative RFI cows, indicating that adipogenesis was reduced during the fetal and neonatal period. In the second year, negative RFI cows had greater subcutaneous fat thickness than positive RFI cows, and their calves tended to be heavier at birth and to have less rump fat thickness at weaning. No significant differences in expression of genes studied were detected between cow RFI classes. Nellore cows classified as negative RFI consume less feed and produce calves whose growth potential is similar to that of calves produced by positive RFI cows.
Collapse
|
15
|
Smith JL, Wilson ML, Nilson SM, Rowan TN, Schnabel RD, Decker JE, Seabury CM. Genome-wide association and genotype by environment interactions for growth traits in U.S. Red Angus cattle. BMC Genomics 2022; 23:517. [PMID: 35842584 PMCID: PMC9287884 DOI: 10.1186/s12864-022-08667-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 05/27/2022] [Indexed: 11/10/2022] Open
Abstract
Background Genotypic information produced from single nucleotide polymorphism (SNP) arrays has routinely been used to identify genomic regions associated with complex traits in beef and dairy cattle. Herein, we assembled a dataset consisting of 15,815 Red Angus beef cattle distributed across the continental U.S. and a union set of 836,118 imputed SNPs to conduct genome-wide association analyses (GWAA) for growth traits using univariate linear mixed models (LMM); including birth weight, weaning weight, and yearling weight. Genomic relationship matrix heritability estimates were produced for all growth traits, and genotype-by-environment (GxE) interactions were investigated. Results Moderate to high heritabilities with small standard errors were estimated for birth weight (0.51 ± 0.01), weaning weight (0.25 ± 0.01), and yearling weight (0.42 ± 0.01). GWAA revealed 12 pleiotropic QTL (BTA6, BTA14, BTA20) influencing Red Angus birth weight, weaning weight, and yearling weight which met a nominal significance threshold (P ≤ 1e-05) for polygenic traits using 836K imputed SNPs. Moreover, positional candidate genes associated with Red Angus growth traits in this study (i.e., LCORL, LOC782905, NCAPG, HERC6, FAM184B, SLIT2, MMRN1, KCNIP4, CCSER1, GRID2, ARRDC3, PLAG1, IMPAD1, NSMAF, PENK, LOC112449660, MOS, SH3PXD2B, STC2, CPEB4) were also previously associated with feed efficiency, growth, and carcass traits in beef cattle. Collectively, 14 significant GxE interactions were also detected, but were less consistent among the investigated traits at a nominal significance threshold (P ≤ 1e-05); with one pleiotropic GxE interaction detected on BTA28 (24 Mb) for Red Angus weaning weight and yearling weight. Conclusions Sixteen well-supported QTL regions detected from the GWAA and GxE GWAA for growth traits (birth weight, weaning weight, yearling weight) in U.S. Red Angus cattle were found to be pleiotropic. Twelve of these pleiotropic QTL were also identified in previous studies focusing on feed efficiency and growth traits in multiple beef breeds and/or their composites. In agreement with other beef cattle GxE studies our results implicate the role of vasodilation, metabolism, and the nervous system in the genetic sensitivity to environmental stress. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08667-6.
Collapse
Affiliation(s)
- Johanna L Smith
- Department of Veterinary Pathobiology, Texas A&M University, College Station, 77843, USA
| | - Miranda L Wilson
- Department of Veterinary Pathobiology, Texas A&M University, College Station, 77843, USA
| | - Sara M Nilson
- Division of Animal Sciences, University of Missouri, Columbia, 65211, USA
| | - Troy N Rowan
- Division of Animal Sciences, University of Missouri, Columbia, 65211, USA.,Genetics Area Program, University of Missouri, Columbia, 65211, USA
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, 65211, USA.,Genetics Area Program, University of Missouri, Columbia, 65211, USA.,Informatics Institute, University of Missouri, Columbia, 65211, USA
| | - Jared E Decker
- Division of Animal Sciences, University of Missouri, Columbia, 65211, USA.,Genetics Area Program, University of Missouri, Columbia, 65211, USA.,Informatics Institute, University of Missouri, Columbia, 65211, USA
| | - Christopher M Seabury
- Department of Veterinary Pathobiology, Texas A&M University, College Station, 77843, USA.
| |
Collapse
|
16
|
Breeding Sustainable Beef Cows: Reducing Weight and Increasing Productivity. Animals (Basel) 2022; 12:ani12141745. [PMID: 35883292 PMCID: PMC9311566 DOI: 10.3390/ani12141745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/01/2022] [Accepted: 07/03/2022] [Indexed: 11/16/2022] Open
Abstract
Programs for sustainable beef production are established, but the specific role of beef cows in these systems is not well defined. This work characterized cows for two traits related to sustainability, cow weight (CW) and cumulative weight weaned (WtW). Cow weight indicates nutrient requirements and enteric methane emissions. Cumulative weight weaned reflects reproductive performance and avoidance of premature culling for characteristics related to animal health, welfare, and worker safety. Both traits were evaluated with random regression models with records from a crossbred population representing 18 breeds that conduct US national cattle evaluations. The genomic REML analyses included additive and dominance components, with relationships among 22,776 animals constructed from genotypes of 181,286 potentially functional variants imputed from a low-pass sequence. Projected to 8 years of age, the additive heritability estimate for CW was 0.57 and 0.11 for WtW. Dominance heritability was 0.02 for CW and 0.19 for WtW. Many variants with significant associations with CW were within previously described quantitative trait loci (QTL) for growth-related production, meat, and carcass traits. Significant additive WtW variants were covered by QTL for traits related to reproduction and structural soundness. All breeds contributed to groups of cows with high and low total genetic values (additive + dominance effects) for both traits. The high WtW cows and cows above the WtW mean but below the CW mean had larger heterosis values and fewer bases in runs of homozygosity. The high additive heritability of CW and dominance effects on WtW indicate that breeding to improve beef cow sustainability should involve selection to reduce CW and mate selection to maintain heterosis and reduce runs of homozygosity.
Collapse
|
17
|
Reich P, Falker-Gieske C, Pook T, Tetens J. Development and validation of a horse reference panel for genotype imputation. Genet Sel Evol 2022; 54:49. [PMID: 35787788 PMCID: PMC9252005 DOI: 10.1186/s12711-022-00740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open
Abstract
Background Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. Results Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. Conclusions The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00740-8.
Collapse
Affiliation(s)
- Paula Reich
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.
| | - Clemens Falker-Gieske
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Torsten Pook
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| |
Collapse
|
18
|
Häfliger IM, Spengeler M, Seefried FR, Drögemüller C. Four novel candidate causal variants for deficient homozygous haplotypes in Holstein cattle. Sci Rep 2022; 12:5435. [PMID: 35361830 PMCID: PMC8971413 DOI: 10.1038/s41598-022-09403-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 03/07/2022] [Indexed: 12/23/2022] Open
Abstract
Mendelian variants can determine both insemination success and neonatal survival and thus influence fertility and rearing success of cattle. We present 24 deficient homozygous haplotype regions in the Holstein population of Switzerland and provide an overview of the previously identified haplotypes in the global Holstein breed. This study encompasses massive genotyping, whole-genome sequencing (WGS) and phenotype association analyses. We performed haplotype screenings on almost 53 thousand genotyped animals including 114 k SNP data with two different approaches. We revealed significant haplotype associations to several survival, birth and fertility traits. Within haplotype regions, we mined WGS data of hundreds of bovine genomes for candidate causal variants, which were subsequently evaluated by using a custom genotyping array in several thousand breeding animals. With this approach, we confirmed the known deleterious SMC2:p.Phe1135Ser missense variant associated with Holstein haplotype (HH) 3. For two previously reported deficient homozygous haplotypes that show negative associations to female fertility traits, we propose candidate causative loss-of-function variants: the HH13-related KIR2DS1:p.Gln159* nonsense variant and the HH21-related NOTCH3:p.Cys44del deletion. In addition, we propose the RIOX1:p.Ala133_Glu142del deletion as well as the PCDH15:p.Leu867Val missense variant to explain the unexpected low number of homozygous haplotype carriers for HH25 and HH35, respectively. In conclusion, we demonstrate that with mining massive SNP data in combination with WGS data, we can map several haplotype regions and unravel novel recessive protein-changing variants segregating at frequencies of 1 to 5%. Our findings both confirm previously identified loci and expand the spectrum of undesired alleles impairing reproduction success in Holstein cattle, the world's most important dairy breed.
Collapse
Affiliation(s)
- Irene M Häfliger
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3001, Bern, Switzerland.
| | | | | | - Cord Drögemüller
- Institute of Genetics, Vetsuisse Faculty, University of Bern, Bremgartenstrasse 109a, 3001, Bern, Switzerland
| |
Collapse
|
19
|
Teng J, Zhao C, Wang D, Chen Z, Tang H, Li J, Mei C, Yang Z, Ning C, Zhang Q. Assessment of the performance of different imputation methods for low-coverage sequencing in Holstein cattle. J Dairy Sci 2022; 105:3355-3366. [DOI: 10.3168/jds.2021-21360] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 12/13/2021] [Indexed: 12/27/2022]
|
20
|
Dauria BD, Sigdel A, Petrini J, Bóscollo PP, Pilonetto F, Salvian M, Rezende FM, Pedrosa VB, Bittar CMM, Machado PF, Coutinho LL, Wiggans GR, Mourão GB. Genetic effects of heat stress on milk fatty acids in a Brazilian Holstein cattle. J Dairy Sci 2022; 105:3296-3305. [PMID: 35094861 DOI: 10.3168/jds.2021-20914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 10/13/2021] [Indexed: 11/19/2022]
Abstract
The present study aimed to estimate covariance components of milk fatty acids (FA) and to compare the genomic estimated breeding values under general and heat-stress effects. Data consisted of 38,762 test-day records from 6,344 Holstein cows obtained from May 2012 through January 2018 on 4 dairy herds from Brazil. Single-trait repeatability test-day models with random regressions as a function of temperature-humidity index values were used for genetic analyses. The models included contemporary groups, parity order (1-6), and days in milk classes as fixed effects, and general and thermotolerance additive genetic and permanent environmental as random effects. Notably, differences in heritability estimates between environments (general and heat stress) increased (0.03 to 0.06) for unsaturated FA traits, such as unsaturated, monounsaturated, and polyunsaturated, at higher heat-stress levels. In contrast, heritability estimated between environments for saturated FA traits, including saturated FA, palmitic acid (C16:0), and stearic acid (C18:0) did not observe significant differences between environments. In addition, our study revealed negative genetic correlations between general and heat-stress additive genetic effects (antagonistic effect) for the saturated FA, C16:0, C18:0, and C18:1, which ranged from -0.007 to -0.32. Spearman's ranking correlation between genomic estimated breeding values ranged from -0.27 to 0.99. Results indicated a moderate to strong interaction of genotype by the environment for most FA traits comparing a heat-stress environment with thermoneutral conditions. Our findings point out novel opportunities to explore the use of FA milk profile and heat-stress models.
Collapse
Affiliation(s)
- B D Dauria
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - A Sigdel
- Department of Animal and Dairy Science, University of Wisconsin, Madison 53706
| | - J Petrini
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - P P Bóscollo
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - F Pilonetto
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - M Salvian
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - F M Rezende
- Department of Animal Sciences, University of Florida, Gainesville 32608
| | - V B Pedrosa
- Department of Animal Science, State University of Ponta Grossa, Ponta Grossa, PR 84030-900, Brazil
| | - C M M Bittar
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - P F Machado
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - L L Coutinho
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil
| | - G R Wiggans
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - G B Mourão
- Department of Animal Science, University of São Paulo, Piracicaba, SP 13418-900, Brazil.
| |
Collapse
|
21
|
Zhao C, Teng J, Zhang X, Wang D, Zhang X, Li S, Jiang X, Li H, Ning C, Zhang Q. Towards a Cost-Effective Implementation of Genomic Prediction Based on Low Coverage Whole Genome Sequencing in Dezhou Donkey. Front Genet 2021; 12:728764. [PMID: 34804115 PMCID: PMC8595392 DOI: 10.3389/fgene.2021.728764] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 09/20/2021] [Indexed: 11/25/2022] Open
Abstract
Low-coverage whole genome sequencing is a low-cost genotyping technology. Combined with genotype imputation approaches, it is likely to become a critical component of cost-effective genomic selection programs in agricultural livestock. Here, we used the low-coverage sequence data of 617 Dezhou donkeys to investigate the performance of genotype imputation for low-coverage whole genome sequence data and genomic prediction based on the imputed genotype data. The specific aims were as follows: 1) to measure the accuracy of genotype imputation under different sequencing depths, sample sizes, minor allele frequency (MAF), and imputation pipelines and 2) to assess the accuracy of genomic prediction under different marker densities derived from the imputed sequence data, different strategies for constructing the genomic relationship matrixes, and single-vs. multi-trait models. We found that a high imputation accuracy (>0.95) can be achieved for sequence data with a sequencing depth as low as 1x and the number of sequenced individuals ≥400. For genomic prediction, the best performance was obtained by using a marker density of 410K and a G matrix constructed using expected marker dosages. Multi-trait genomic best linear unbiased prediction (GBLUP) performed better than single-trait GBLUP. Our study demonstrates that low-coverage whole genome sequencing would be a cost-effective approach for genomic prediction in Dezhou donkey.
Collapse
Affiliation(s)
- Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinhao Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China.,National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Shiyin Li
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xin Jiang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Haijing Li
- National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| |
Collapse
|
22
|
Impact of genotypic errors with equal and unequal family contribution on accuracy of genomic prediction in aquaculture using simulation. Sci Rep 2021; 11:18318. [PMID: 34526591 PMCID: PMC8443606 DOI: 10.1038/s41598-021-97873-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 08/31/2021] [Indexed: 11/08/2022] Open
Abstract
Genotypic errors, conflict between recorded genotype and the true genotype, can lead to false or biased population genetic parameters. Here, the effect of genotypic errors on accuracy of genomic predictions and genomic relationship matrix are investigated using a simulation study based on population and genomic structure comparable to black tiger prawn, Penaeus monodon. Fifty full-sib families across five generations with phenotypic and genotypic information on 53 K SNPs were simulated. Ten replicates of different scenarios with three heritability estimates, equal and unequal family contributions were generated. Within each scenario, four SNP densities and three genotypic error rates in each SNP density were implemented. Results showed that family contribution did not have a substantial impact on accuracy of predictions across different datasets. In the absence of genotypic errors, 3 K SNP density was found to be efficient in estimating the accuracy, whilst increasing the SNP density from 3 to 20 K resulted in a marginal increase in accuracy of genomic predictions using the current population and genomic parameters. In addition, results showed that the presence of even 10% errors in a 10 and 20 K SNP panel might not have a severe impact on accuracy of predictions. However, below 10 K marker density, even a 5% error can result in lower accuracy of predictions.
Collapse
|
23
|
Umlai UKI, Bangarusamy DK, Estivill X, Jithesh PV. Genome sequencing data analysis for rare disease gene discovery. Brief Bioinform 2021; 23:6366880. [PMID: 34498682 DOI: 10.1093/bib/bbab363] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 07/24/2021] [Accepted: 08/17/2021] [Indexed: 12/14/2022] Open
Abstract
Rare diseases occur in a smaller proportion of the general population, which is variedly defined as less than 200 000 individuals (US) or in less than 1 in 2000 individuals (Europe). Although rare, they collectively make up to approximately 7000 different disorders, with majority having a genetic origin, and affect roughly 300 million people globally. Most of the patients and their families undergo a long and frustrating diagnostic odyssey. However, advances in the field of genomics have started to facilitate the process of diagnosis, though it is hindered by the difficulty in genome data analysis and interpretation. A major impediment in diagnosis is in the understanding of the diverse approaches, tools and datasets available for variant prioritization, the most important step in the analysis of millions of variants to select a few potential variants. Here we present a review of the latest methodological developments and spectrum of tools available for rare disease genetic variant discovery and recommend appropriate data interpretation methods for variant prioritization. We have categorized the resources based on various steps of the variant interpretation workflow, starting from data processing, variant calling, annotation, filtration and finally prioritization, with a special emphasis on the last two steps. The methods discussed here pertain to elucidating the genetic basis of disease in individual patient cases via trio- or family-based analysis of the genome data. We advocate the use of a combination of tools and datasets and to follow multiple iterative approaches to elucidate the potential causative variant.
Collapse
Affiliation(s)
- Umm-Kulthum Ismail Umlai
- Division of Genomics & Translational Biomedicine, College of Health & Life Sciences, Hamad Bin Khalifa University, B-147, Penrose House, PO Box 34110, Education City, Doha, Qatar
| | - Dhinoth Kumar Bangarusamy
- Division of Genomics & Translational Biomedicine, College of Health & Life Sciences, Hamad Bin Khalifa University, B-147, Penrose House, PO Box 34110, Education City, Doha, Qatar
| | - Xavier Estivill
- Quantitative Genomics Laboratories (qGenomics), Barcelona, Catalonia, Spain
| | - Puthen Veettil Jithesh
- Division of Genomics & Translational Biomedicine, College of Health & Life Sciences, Hamad Bin Khalifa University, B-147, Penrose House, PO Box 34110, Education City, Doha, Qatar
| |
Collapse
|
24
|
Moghaddar N, Brown DJ, Swan AA, Gurman PM, Li L, van der Werf JH. Genomic prediction in a numerically small breed population using prioritized genetic markers from whole-genome sequence data. J Anim Breed Genet 2021; 139:71-83. [PMID: 34374454 DOI: 10.1111/jbg.12638] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/08/2021] [Accepted: 07/19/2021] [Indexed: 11/30/2022]
Abstract
The objective of this study was to investigate the accuracy of genomic prediction of body weight and eating quality traits in a numerically small sheep population (Dorper sheep). Prediction was based on a large multi-breed/admixed reference population and using (a) 50k or 500k single nucleotide polymorphism (SNP) genotypes, (b) imputed whole-genome sequencing data (~31 million), (c) selected SNPs from whole genome sequence data and (d) 50k SNP genotypes plus selected SNPs from whole-genome sequence data. Furthermore, the impact of using a breed-adjusted genomic relationship matrix on accuracy of genomic breeding value was assessed. The selection of genetic variants was based on an association study performed on imputed whole-genome sequence data in an independent population, which was chosen either randomly from the base population or according to higher genetic proximity to the target population. Genomic prediction was based on genomic best linear unbiased prediction (GBLUP), and the accuracy of genomic prediction was assessed according to the correlation between genomic breeding value and corrected phenotypes divided by the square root of trait heritability. The accuracy of genomic prediction was between 0.20 and 0.30 across different traits based on common 50k SNP genotypes, which improved on average by 0.06 (absolute value) on average based on using prioritized genetic markers from whole-genome sequence data. Using prioritized genetic markers from a genetically more related GWAS population resulted in slightly higher prediction accuracy (0.02 absolute value) compared to genetic markers derived from a random GWAS population. Using high-density SNP genotypes or imputed whole-genome sequence data in GBLUP showed almost no improvement in genomic prediction accuracy however, accounting for different marker allele frequencies in reference population according to a breed-adjusted GRM resulted to on average 0.024 (absolute value) increase in accuracy of genomic prediction.
Collapse
Affiliation(s)
- Nasir Moghaddar
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Daniel J Brown
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Andrew A Swan
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Phillip M Gurman
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Li Li
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Julius H van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| |
Collapse
|
25
|
Al-Khudhair A, VanRaden PM, Null DJ, Li B. Marker selection and genomic prediction of economically important traits using imputed high-density genotypes for 5 breeds of dairy cattle. J Dairy Sci 2021; 104:4478-4485. [PMID: 33612229 DOI: 10.3168/jds.2020-19260] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 11/22/2020] [Indexed: 11/19/2022]
Abstract
Marker sets used in US dairy genomic predictions were previously expanded by including high-density (HD) or sequence markers with the largest effects for Holstein breed only. Other non-Holstein breeds lacked enough HD genotyped animals to be used as a reference population at that time, and thus were not included in the genomic prediction. Recently, numbers of non-Holstein breeds genotyped using HD panels reached an acceptable level for imputation and marker selection, allowing HD genomic prediction and HD marker selection for Holstein plus 4 other breeds. Genotypes for 351,461 Holsteins, 347,570 Jerseys, 42,346 Brown Swiss, 9,364 Ayrshires (including Red dairy cattle), and 4,599 Guernseys were imputed to the HD marker list that included 643,059 SNP. The separate HD reference populations included Illumina BovineHD (San Diego, CA) genotypes for 4,012 Holsteins, 407 Jerseys, 181 Brown Swiss, 527 Ayrshires, and 147 Guernseys. The 643,059 variants included the HD SNP and all 79,254 (80K) genetic markers and QTL used in routine national genomic evaluations. Before imputation, approximately 91 to 97% of genotypes were unknown for each breed; after imputation, 1.1% of Holstein, 3.2% of Jersey, 6.7% of Brown Swiss, 4.8% of Ayrshire, and 4.2% of Guernsey alleles remained unknown due to lower density haplotypes that had no matching HD haplotype. The higher remaining missing rates in non-Holstein breeds are mainly due to fewer HD genotyped animals in the imputation reference populations. Allele effects for up to 39 traits were estimated separately within each breed using phenotypic reference populations that included up to 6,157 Jersey males and 110,130 Jersey females. Correlations of HD with 80K genomic predictions for young animals averaged 0.986, 0.989, 0.985, 0.992, and 0.978 for Jersey, Ayrshire, Brown Swiss, Guernsey, and Holstein breeds, respectively. Correlations were highest for yield traits (about 0.991) and lowest for foot angle and rear legs-side view (0.981and 0.982, respectively). Some HD effects were more than twice as large as the largest 80K SNP effect, and HD markers had larger effects than nearby 80K markers for many breed-trait combinations. Previous studies selected and included markers with large effects for Holstein traits; the newly selected HD markers should also improve non-Holstein and crossbred genomic predictions and were added to official US genomic predictions in April 2020.
Collapse
Affiliation(s)
- A Al-Khudhair
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - P M VanRaden
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350.
| | - D J Null
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - B Li
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| |
Collapse
|
26
|
Lopez BIM, An N, Srikanth K, Lee S, Oh JD, Shin DH, Park W, Chai HH, Park JE, Lim D. Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle. Front Genet 2021; 11:603822. [PMID: 33552124 PMCID: PMC7859490 DOI: 10.3389/fgene.2020.603822] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open
Abstract
Whole-genome sequence (WGS) data are increasingly being applied into genomic predictions, offering a higher predictive ability by including causal mutations or single-nucleotide polymorphisms (SNPs) putatively in strong linkage disequilibrium with causal mutations affecting the trait. This study aimed to improve the predictive performance of the customized Hanwoo 50 k SNP panel for four carcass traits in commercial Hanwoo population by adding highly predictive variants from sequence data. A total of 16,892 Hanwoo cattle with phenotypes (i.e., backfat thickness, carcass weight, longissimus muscle area, and marbling score), 50 k genotypes, and WGS imputed genotypes were used. We partitioned imputed WGS data according to functional annotation [intergenic (IGR), intron (ITR), regulatory (REG), synonymous (SYN), and non-synonymous (NSY)] to characterize the genomic regions that will deliver higher predictive power for the traits investigated. Animals were assigned into two groups, the discovery set (7324 animals) used for predictive variant detection and the cross-validation set for genomic prediction. Genome-wide association studies were performed by trait to every genomic region and entire WGS data for the pre-selection of variants. Each set of pre-selected SNPs with different density (1000, 3000, 5000, or 10,000) were added to the 50 k genotypes separately and the predictive performance of each set of genotypes was assessed using the genomic best linear unbiased prediction (GBLUP). Results showed that the predictive performance of the customized Hanwoo 50 k SNP panel can be improved by the addition of pre-selected variants from the WGS data, particularly 3000 variants from each trait, which is then sufficient to improve the prediction accuracy for all traits. When 12,000 pre-selected variants (3000 variants from each trait) were added to the 50 k genotypes, the prediction accuracies increased by 9.9, 9.2, 6.4, and 4.7% for backfat thickness, carcass weight, longissimus muscle area, and marbling score compared to the regular 50 k SNP panel, respectively. In terms of prediction bias, regression coefficients for all sets of genotypes in all traits were close to 1, indicating an unbiased prediction. The strategy used to select variants based on functional annotation did not show a clear advantage compared to using whole-genome. Nonetheless, such pre-selected SNPs from the IGR region gave the highest improvement in prediction accuracy among genomic regions and the values were close to those obtained using the WGS data for all traits. We concluded that additional gain in prediction accuracy when using pre-selected variants appears to be trait-dependent, and using WGS data remained more accurate compared to using a specific genomic region.
Collapse
Affiliation(s)
- Bryan Irvine M Lopez
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Narae An
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Krishnamoorthy Srikanth
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Seunghwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Jae-Don Oh
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, South Korea
| | - Dong-Hyun Shin
- Department of Agricultural Convergence Technology, Chonbuk National University, Jeonju, South Korea
| | - Woncheoul Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Han-Ha Chai
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Jong-Eun Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| |
Collapse
|
27
|
Khansefid M, Goddard ME, Haile-Mariam M, Konstantinov KV, Schrooten C, de Jong G, Jewell EG, O'Connor E, Pryce JE, Daetwyler HD, MacLeod IM. Improving Genomic Prediction of Crossbred and Purebred Dairy Cattle. Front Genet 2020; 11:598580. [PMID: 33381150 PMCID: PMC7767986 DOI: 10.3389/fgene.2020.598580] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Accepted: 11/19/2020] [Indexed: 11/17/2022] Open
Abstract
This study assessed the accuracy and bias of genomic prediction (GP) in purebred Holstein (H) and Jersey (J) as well as crossbred (H and J) validation cows using different reference sets and prediction strategies. The reference sets were made up of different combinations of 36,695 H and J purebreds and crossbreds. Additionally, the effect of using different sets of marker genotypes on GP was studied (conventional panel: 50k, custom panel enriched with, or close to, causal mutations: XT_50k, and conventional high-density with a limited custom set: pruned HDnGBS). We also compared the use of genomic best linear unbiased prediction (GBLUP) and Bayesian (emBayesR) models, and the traits tested were milk, fat, and protein yields. On average, by including crossbred cows in the reference population, the prediction accuracies increased by 0.01–0.08 and were less biased (regression coefficient closer to 1 by 0.02–0.16), and the benefit was greater for crossbreds compared to purebreds. The accuracy of prediction increased by 0.02 using XT_50k compared to 50k genotypes without affecting the bias. Although using pruned HDnGBS instead of 50k also increased the prediction accuracy by about 0.02, it increased the bias for purebred predictions in emBayesR models. Generally, emBayesR outperformed GBLUP for prediction accuracy when using 50k or pruned HDnGBS genotypes, but the benefits diminished with XT_50k genotypes. Crossbred predictions derived from a joint pure H and J reference were similar in accuracy to crossbred predictions derived from the two separate purebred reference sets and combined proportional to breed composition. However, the latter approach was less biased by 0.13. Most interestingly, using an equalized breed reference instead of an H-dominated reference, on average, reduced the bias of prediction by 0.16–0.19 and increased the accuracy by 0.04 for crossbred and J cows, with a little change in the H accuracy. In conclusion, we observed improved genomic predictions for both crossbreds and purebreds by equalizing breed contributions in a mixed breed reference that included crossbred cows. Furthermore, we demonstrate, that compared to the conventional 50k or high-density panels, our customized set of 50k sequence markers improved or matched the prediction accuracy and reduced bias with both GBLUP and Bayesian models.
Collapse
Affiliation(s)
- Majid Khansefid
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia
| | - Michael E Goddard
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia.,Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, VIC, Australia
| | - Mekonnen Haile-Mariam
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia
| | | | | | | | | | | | - Jennie E Pryce
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Hans D Daetwyler
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Iona M MacLeod
- AgriBio Centre for AgriBioscience, Agriculture Victoria Services, Bundoora, VIC, Australia
| |
Collapse
|
28
|
Li B, VanRaden PM, Null DJ, O'Connell JR, Cole JB. Major quantitative trait loci influencing milk production and conformation traits in Guernsey dairy cattle detected on Bos taurus autosome 19. J Dairy Sci 2020; 104:550-560. [PMID: 33189290 DOI: 10.3168/jds.2020-18766] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 09/07/2020] [Indexed: 01/30/2023]
Abstract
The goal of this study was to identify potential quantitative trait loci (QTL) for 27 production, fitness, and conformation traits of Guernsey cattle through genome-wide association (GWA) analyses, with extra emphasis on BTA19, where major QTL were observed for several traits. Animals' de-regressed predicted transmitting abilities (PTA) from the December 2018 traditional US evaluation were used as phenotypes. All of the Guernsey cattle included in the QTL analyses were predictor animals in the reference population, ranging from 1,077 to 1,685 animals for different traits. Single-trait GWA analyses were carried out by a mixed-model approach for all 27 traits using imputed high-density genotypes. A major QTL was detected on BTA19, influencing several milk production traits, conformation traits, and livability of Guernsey cattle, and the most significant SNP lie in the region of 26.2 to 28.3 Mb. The myosin heavy chain 10 (MYH10) gene residing within this region was found to be highly associated with milk production and body conformation traits of dairy cattle. After the initial GWA analyses, which suggested that many significant SNP are in linkage with one another, conditional analyses were used for fine mapping. The top significant SNP on BTA19 were fixed as covariables in the model, one at a time, until no more significant SNP were detected on BTA19. After this fine-mapping approach was applied, only 1 significant SNP was detected on BTA19 for most traits, but multiple, independent significant SNP were found for protein yield, dairy form, and stature. In addition, the haplotype that hosts the major QTL on BTA19 was traced to a US Guernsey born in 1954. The haplotype is common in the breed, indicating a long-term influence of this QTL on the US Guernsey population.
Collapse
Affiliation(s)
- B Li
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, USDA Agricultural Research Service, Beltsville, MD 20705-2350
| | - P M VanRaden
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, USDA Agricultural Research Service, Beltsville, MD 20705-2350
| | - D J Null
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, USDA Agricultural Research Service, Beltsville, MD 20705-2350
| | - J R O'Connell
- School of Medicine, University of Maryland, Baltimore 21201
| | - J B Cole
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, USDA Agricultural Research Service, Beltsville, MD 20705-2350.
| |
Collapse
|
29
|
Snelling WM, Hoff JL, Li JH, Kuehn LA, Keel BN, Lindholm-Perry AK, Pickrell JK. Assessment of Imputation from Low-Pass Sequencing to Predict Merit of Beef Steers. Genes (Basel) 2020; 11:E1312. [PMID: 33167493 PMCID: PMC7716200 DOI: 10.3390/genes11111312] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 10/28/2020] [Accepted: 11/02/2020] [Indexed: 01/27/2023] Open
Abstract
Decreasing costs are making low coverage sequencing with imputation to a comprehensive reference panel an attractive alternative to obtain functional variant genotypes that can increase the accuracy of genomic prediction. To assess the potential of low-pass sequencing, genomic sequence of 77 steers sequenced to >10X coverage was downsampled to 1X and imputed to a reference of 946 cattle representing multiple Bos taurus and Bos indicus-influenced breeds. Genotypes for nearly 60 million variants detected in the reference were imputed from the downsampled sequence. The imputed genotypes strongly agreed with the SNP array genotypes (r¯=0.99) and the genotypes called from the transcript sequence (r¯=0.97). Effects of BovineSNP50 and GGP-F250 variants on birth weight, postweaning gain, and marbling were solved without the steers' phenotypes and genotypes, then applied to their genotypes, to predict the molecular breeding values (MBV). The steers' MBV were similar when using imputed and array genotypes. Replacing array variants with functional sequence variants might allow more robust MBV. Imputation from low coverage sequence offers a viable, low-cost approach to obtain functional variant genotypes that could improve genomic prediction.
Collapse
Affiliation(s)
- Warren M. Snelling
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Meat Animal Research Center, Clay Center, NE 68933, USA; (L.A.K.); (B.N.K.); (A.K.L.-P.)
| | - Jesse L. Hoff
- Gencove, Inc., New York, NY 10016, USA; (J.L.H.); (J.H.L.); (J.K.P.)
| | - Jeremiah H. Li
- Gencove, Inc., New York, NY 10016, USA; (J.L.H.); (J.H.L.); (J.K.P.)
| | - Larry A. Kuehn
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Meat Animal Research Center, Clay Center, NE 68933, USA; (L.A.K.); (B.N.K.); (A.K.L.-P.)
| | - Brittney N. Keel
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Meat Animal Research Center, Clay Center, NE 68933, USA; (L.A.K.); (B.N.K.); (A.K.L.-P.)
| | - Amanda K. Lindholm-Perry
- U.S. Department of Agriculture, Agricultural Research Service, U.S. Meat Animal Research Center, Clay Center, NE 68933, USA; (L.A.K.); (B.N.K.); (A.K.L.-P.)
| | | |
Collapse
|
30
|
Liu L, Zhou J, Chen CJ, Zhang J, Wen W, Tian J, Zhang Z, Gu Y. GWAS-Based Identification of New Loci for Milk Yield, Fat, and Protein in Holstein Cattle. Animals (Basel) 2020; 10:ani10112048. [PMID: 33167458 PMCID: PMC7694478 DOI: 10.3390/ani10112048] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 11/01/2020] [Accepted: 11/03/2020] [Indexed: 12/20/2022] Open
Abstract
Simple Summary Understanding the genetic architecture underlying milk production traits in cattle is beneficial so that genetic variants can be targeted toward the genetic improvement. In this study, we performed a genome-wide association study for milk production and quality traits in Holstein cattle. In the total of ten significant single-nucleotide polymorphisms (SNPs) associated with milk fat and protein, six are located in previously reported quantitative traits locus (QTL) regions. The study not only identified the effect of DGAT1 gene on milk fat and protein but also found several novel candidate genes. In addition, some pleiotropic SNPs and QTLs were identified that associated with more than two traits, these results could provide some basis for molecular breeding in dairy cattle. Abstract High-yield and high-quality of milk are the primary goals of dairy production. Understanding the genetic architecture underlying these milk-related traits is beneficial so that genetic variants can be targeted toward the genetic improvement. In this study, we measured five milk production and quality traits in Holstein cattle population from China. These traits included milk yield, fat, and protein. We used the estimated breeding values as dependent variables to conduct the genome-wide association studies (GWAS). Breeding values were estimated through pedigree relationships by using a linear mixed model. Genotyping was carried out on the individuals with phenotypes by using the Illumina BovineSNP150 BeadChip. The association analyses were conducted by using the fixed and random model Circulating Probability Unification (FarmCPU) method. A total of ten single-nucleotide polymorphisms (SNPs) were detected above the genome-wide significant threshold (p < 4.0 × 10−7), including six located in previously reported quantitative traits locus (QTL) regions. We found eight candidate genes within distances of 120 kb upstream or downstream to the associated SNPs. The study not only identified the effect of DGAT1 gene on milk fat and protein, but also discovered novel genetic loci and candidate genes related to milk traits. These novel genetic loci would be an important basis for molecular breeding in dairy cattle.
Collapse
Affiliation(s)
- Liyuan Liu
- School of Agriculture, Ningxia University, Yinchuan 750021, Ningxia, China; (L.L.); (J.Z.); (J.Z.)
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, DC 99164, USA;
| | - Jinghang Zhou
- School of Agriculture, Ningxia University, Yinchuan 750021, Ningxia, China; (L.L.); (J.Z.); (J.Z.)
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, DC 99164, USA;
| | - Chunpeng James Chen
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, DC 99164, USA;
| | - Juan Zhang
- School of Agriculture, Ningxia University, Yinchuan 750021, Ningxia, China; (L.L.); (J.Z.); (J.Z.)
| | - Wan Wen
- Animal Husbandry Workstation, Yinchuan 750001, Ningxia, China; (W.W.); (J.T.)
| | - Jia Tian
- Animal Husbandry Workstation, Yinchuan 750001, Ningxia, China; (W.W.); (J.T.)
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, Washington, DC 99164, USA;
- Correspondence: (Z.Z.); (Y.G.)
| | - Yaling Gu
- School of Agriculture, Ningxia University, Yinchuan 750021, Ningxia, China; (L.L.); (J.Z.); (J.Z.)
- Correspondence: (Z.Z.); (Y.G.)
| |
Collapse
|
31
|
Accuracy of genomic evaluation using imputed high-density genotypes for carcass traits in commercial Hanwoo population. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104256] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
32
|
Money D, Wilson D, Jenko J, Whalen A, Thorn S, Gorjanc G, Hickey JM. Extending long-range phasing and haplotype library imputation algorithms to large and heterogeneous datasets. Genet Sel Evol 2020; 52:38. [PMID: 32640985 PMCID: PMC7346379 DOI: 10.1186/s12711-020-00558-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Accepted: 06/26/2020] [Indexed: 12/12/2022] Open
Abstract
Background We describe the latest improvements to the long-range phasing (LRP) and haplotype library imputation (HLI) algorithms for successful phasing of both datasets with one million individuals and datasets genotyped using different sets of single nucleotide polymorphisms (SNPs). Previous publicly available implementations of the LRP algorithm implemented in AlphaPhase could not phase large datasets due to the computational cost of defining surrogate parents by exhaustive all-against-all searches. Furthermore, the AlphaPhase implementations of LRP and HLI were not designed to deal with large amounts of missing data that are inherent when using multiple SNP arrays. Methods We developed methods that avoid the need for all-against-all searches by performing LRP on subsets of individuals and then concatenating the results. We also extended LRP and HLI algorithms to enable the use of different sets of markers, including missing values, when determining surrogate parents and identifying haplotypes. We implemented and tested these extensions in an updated version of AlphaPhase, and compared its performance to the software package Eagle2. Results A simulated dataset with one million individuals genotyped with the same 6711 SNPs for a single chromosome took less than a day to phase, compared to more than seven days for Eagle2. The percentage of correctly phased alleles at heterozygous loci was 90.2 and 99.9% for AlphaPhase and Eagle2, respectively. A larger dataset with one million individuals genotyped with 49,579 SNPs for a single chromosome took AlphaPhase 23 days to phase, with 89.9% of alleles at heterozygous loci phased correctly. The phasing accuracy was generally lower for datasets with different sets of markers than with one set of markers. For a simulated dataset with three sets of markers, 1.5% of alleles at heterozygous positions were phased incorrectly, compared to 0.4% with one set of markers. Conclusions The improved LRP and HLI algorithms enable AlphaPhase to quickly and accurately phase very large and heterogeneous datasets. AlphaPhase is an order of magnitude faster than the other tested packages, although Eagle2 showed a higher level of phasing accuracy. The speed gain will make phasing achievable for very large genomic datasets in livestock, enabling more powerful breeding and genetics research and application.
Collapse
Affiliation(s)
- Daniel Money
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - David Wilson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Janez Jenko
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Steve Thorn
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.
| |
Collapse
|
33
|
Pralle RS, Schultz NE, White HM, Weigel KA. Hyperketonemia GWAS and parity-dependent SNP associations in Holstein dairy cows intensively sampled for blood β-hydroxybutyrate concentration. Physiol Genomics 2020; 52:347-357. [PMID: 32628084 DOI: 10.1152/physiolgenomics.00016.2020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Hyperketonemia (HYK) is a metabolic disorder that affects early postpartum dairy cows; however, there has been limited success in identifying genomic variants contributing to HYK susceptibility. We conducted a genome-wide association study (GWAS) using HYK phenotypes based on an intensive screening protocol, interrogated genotype interactions with parity group (GWIS), and evaluated the enrichment of annotated metabolic pathways. Holstein cows were enrolled into the experiment after parturition, and blood samples were collected at four timepoints between 5 and 18 days postpartum. Concentration of blood β-hydroxybutyrate (BHB) was quantified cow-side via a handheld BHB meter. Cows were labeled as a HYK case when at least one blood sample had BHB ≥ 1.2 mmol/L, and all other cows were considered non-HYK controls. After quality control procedures, 1,710 cows and 58,699 genotypes were available for further analysis. The GWAS and GWIS were performed using the forward feature select linear mixed model method. There was evidence for an association between ARS-BFGL-NGS-91238 and HYK susceptibility, as well as parity-dependent associations to HYK for BovineHD0600024247 and BovineHD1400023753. Candidate genes annotated to these single nuclear polymorphism associations have been previously associated with obesity, diabetes, insulin resistance, and fatty liver in humans and rodent models. Enrichment analysis revealed focal adhesion and axon guidance as metabolic pathways contributing to HYK etiology, while genetic variation in pathways related to insulin secretion and sensitivity may affect HYK susceptibility in a parity-dependent matter. In conclusion, the present work proposes several novel marker associations and metabolic pathways contributing to genetic risk for HYK susceptibility.
Collapse
Affiliation(s)
- Ryan S Pralle
- Department of Dairy Science, University of Wisconsin-Madison, Madison, Wisconsin
| | - Nichol E Schultz
- Department of Dairy Science, University of Wisconsin-Madison, Madison, Wisconsin
| | - Heather M White
- Department of Dairy Science, University of Wisconsin-Madison, Madison, Wisconsin
| | - Kent A Weigel
- Department of Dairy Science, University of Wisconsin-Madison, Madison, Wisconsin
| |
Collapse
|
34
|
Liu A, Lund MS, Boichard D, Mao X, Karaman E, Fritz S, Aamand GP, Wang Y, Su G. Imputation for sequencing variants preselected to a customized low-density chip. Sci Rep 2020; 10:9524. [PMID: 32533087 PMCID: PMC7293337 DOI: 10.1038/s41598-020-66523-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 05/19/2020] [Indexed: 12/27/2022] Open
Abstract
The sequencing variants preselected from association analyses and bioinformatics analyses could improve genomic prediction. In this study, the imputation of sequencing SNPs preselected from major dairy breeds in Denmark-Finland-Sweden (DFS) and France (FRA) was investigated for both contemporary animals and old bulls in Danish Jersey. For contemporary animals, a two-step imputation which first imputed to 54 K and then to 54 K + DFS + FRA SNPs achieved highest accuracy. Correlations between observed and imputed genotypes were 91.6% for DFS SNPs and 87.6% for FRA SNPs, while concordance rates were 96.6% for DFS SNPs and 93.5% for FRA SNPs. The SNPs with lower minor allele frequency (MAF) tended to have lower correlations but higher concordance rates. For old bulls, imputation for DFS and FRA SNPs were relatively accurate even for bulls without progenies (correlations higher than 97.2% and concordance rates higher than 98.4%). For contemporary animals, given limited imputation accuracy of preselected sequencing SNPs especially for SNPs with low MAF, it would be a good strategy to directly genotype preselected sequencing SNPs with a customized SNP chip. For old bulls, given high imputation accuracy for preselected sequencing SNPs with all MAF ranges, it would be unnecessary to re-genotype preselected sequencing SNPs.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France
| | - Xiaowei Mao
- Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 100044, Beijing, P.R. China.,CAS Center for Excellence in Life and Paleoenvironment, 100044, Beijing, P.R. China
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| |
Collapse
|
35
|
Bickhart DM, McClure JC, Schnabel RD, Rosen BD, Medrano JF, Smith TPL. Symposium review: Advances in sequencing technology herald a new frontier in cattle genomics and genome-enabled selection. J Dairy Sci 2020; 103:5278-5290. [PMID: 32331872 DOI: 10.3168/jds.2019-17693] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 12/03/2019] [Indexed: 11/19/2022]
Abstract
The cattle reference genome assembly has underpinned major innovations in beef and dairy genetics through genome-enabled selection, including removal of deleterious recessive variants and selection for favorable alleles affecting quantitative production traits. The initial reference assemblies, up to and including UMD3.1 and Btau4.1, were based on a combination of clone-by-clone sequencing of bacterial artificial chromosome clones generated from blood DNA of a Hereford bull and whole-genome shotgun sequencing of blood DNA from his inbred daughter/granddaughter named L1 Dominette 01449 (Dominette). The approach introduced assembly gaps, misassemblies, and errors, and it limited the ability to assemble regions that undergo rearrangement in blood cells, such as immune gene clusters. Nonetheless, the reference supported the creation of genotyping tools and provided a basis for many studies of gene expression. Recently, long-read sequencing technologies have emerged that facilitated a re-assembly of the reference genome, using lung tissue from Dominette to resolve many of the problems and providing a bridge to place historical studies in common context. The new reference, ARS-UCD1.2, successfully assembled germline immune gene clusters and improved overall continuity (i.e., reduction of gaps and inversions) by over 250-fold. This reference properly places nearly all of the legacy genetic markers used for over a decade in the industry. In this review, we discuss the improvements made to the cattle reference; remaining issues present in the assembly; tools developed to support genome-based studies in beef and dairy cattle; and the emergence of newer genome assembly methods that are producing even higher-quality assemblies for other breeds of cattle at a fraction of the cost. The new frontier for cattle genomics research will likely include a transition from the individual Hereford reference genome, to a "pan-genome" reference, representing all the DNA segments existing in commonly used cattle breeds, bringing the cattle reference into line with the current direction of human genome research.
Collapse
Affiliation(s)
- D M Bickhart
- US Dairy Forage Research Center, Agricultural Research Service, USDA, Madison, WI 53705.
| | - J C McClure
- US Dairy Forage Research Center, Agricultural Research Service, USDA, Madison, WI 53705
| | - R D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, 65211; MU Institute for Data Science and Informatics, University of Missouri, Columbia, 65211
| | - B D Rosen
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705
| | - J F Medrano
- Department of Animal Science, University of California Davis, 95616
| | - T P L Smith
- Meat Animal Research Center, Agricultural Research Service, USDA, Clay Center, NE 68933
| |
Collapse
|
36
|
Interest of using imputation for genomic evaluation in layer chicken. Poult Sci 2020; 99:2324-2336. [PMID: 32359567 PMCID: PMC7597443 DOI: 10.1016/j.psj.2020.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 12/27/2019] [Accepted: 01/01/2020] [Indexed: 11/21/2022] Open
Abstract
With the availability of the 600K Affymetrix Axiom high-density (HD) single nucleotide polymorphism (SNP) chip, genomic selection has been implemented in broiler and layer chicken. However, the cost of this SNP chip is too high to genotype all selection candidates. A solution is to develop a low-density SNP chip, at a lower price, and to impute all missing markers. But to routinely implement this solution, the impact of imputation on genomic evaluation accuracy must be studied. It is also interesting to study the consequences of the use of low-density SNP chips in genomic evaluation accuracy. In this perspective, the interest of using imputation in genomic selection was studied in a pure layer line. Two low-density SNP chip designs were compared: an equidistant methodology and a methodology based on linkage disequilibrium. Egg weight, egg shell color, egg shell strength, and albumen height were evaluated with single-step genomic best linear unbiased prediction methodology. The impact of imputation errors or the absence of imputation on the ranking of the male selection candidates was assessed with a genomic evaluation based on ancestry. Thus, genomic estimated breeding values (GEBV) obtained with imputed HD genotypes or low-density genotypes were compared with GEBV obtained with the HD SNP chip. The relative accuracy of GEBV was also investigated by considering as reference GEBV estimated on the offspring. A limited reordering of the breeders, selected on a multitrait index, was observed. Spearman correlations between GEBV on HD genotypes and GEBV on low-density genotypes (with or without imputation) were always higher than 0.94 with more than 3K SNP. For the genetically closer, top 150 individuals for a specific trait, with imputation, the reordering was reduced with correlation higher than 0.94 with more than 3K SNP. Without imputation, the correlations remained lower than 0.85 with less than 3K and 16K SNP for equidistant and linkage disequilibrium methodology, respectively. The differences in GEBV correlations between both methodologies were never significant. The conclusions were the same for all studied traits.
Collapse
|
37
|
Talouarn E, Bardou P, Palhière I, Oget C, Clément V, Tosser-Klopp G, Rupp R, Robert-Granié C. Genome wide association analysis on semen volume and milk yield using different strategies of imputation to whole genome sequence in French dairy goats. BMC Genet 2020; 21:19. [PMID: 32085723 PMCID: PMC7035711 DOI: 10.1186/s12863-020-0826-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 02/13/2020] [Indexed: 01/17/2023] Open
Abstract
Background Goats were domesticated 10,500 years ago to supply humans with useful resources. Since then, specialized breeds that are adapted to their local environment have been developed and display specific genetic profiles. The VarGoats project is a 1000 genomes resequencing program designed to cover the genetic diversity of the Capra genus. In this study, our main objective was to assess the use of sequence data to detect genomic regions associated with traits of interest in French Alpine and Saanen breeds. Results Direct imputation from the GoatSNP50 BeadChip genotypes to sequence level was investigated in these breeds using FImpute and different reference panels: within-breed, all Capra hircus sequenced individuals, European goats and French mainland goats. The best results were obtained with the French goat panel with allele and genotype concordance rates reaching 0.86 and 0.75 in the Alpine and 0.86 and 0.73 in the Saanen breed respectively. Mean correlations tended to be low in both breeds due to the high proportion of variants with low frequencies. For association analysis, imputation was performed using FImpute for 1129 French Alpine and Saanen males using within-breed and French panels on 23,338,436 filtered variants. The association results of both imputation scenarios were then compared. In Saanen goats, a large region on chromosome 19 was significantly linked to semen volume and milk yield in both scenarios. Significant variants for milk yield were annotated for 91 genes on chromosome 19 in Saanen goats. For semen volume, the annotated genes include YBOX2 which is related to azoospermia or oligospermia in other species. New signals for milk yield were detected on chromosome 2 in Alpine goats and on chromosome 5 in Saanen goats when using a multi-breed panel. Conclusion Even with very small reference populations, an acceptable imputation quality can be achieved in French dairy goats. GWAS on imputed sequences confirmed the existence of QTLs and identified new regions of interest in dairy goats. Adding identified candidates to a genotyping array and sequencing more individuals might corroborate the involvement of identified regions while removing potential imputation errors.
Collapse
Affiliation(s)
- Estelle Talouarn
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.
| | - Philippe Bardou
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.,Sigenae, INRAE, 31326, Castanet-Tolosan, France
| | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Claire Oget
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | | | | - Gwenola Tosser-Klopp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | |
Collapse
|
38
|
Keel BN, Snelling WM, Lindholm-Perry AK, Oliver WT, Kuehn LA, Rohrer GA. Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs. Front Genet 2020; 10:1339. [PMID: 32038708 PMCID: PMC6985563 DOI: 10.3389/fgene.2019.01339] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Accepted: 12/09/2019] [Indexed: 01/24/2023] Open
Abstract
The "large p small n" problem has posed a significant challenge in the analysis and interpretation of genome-wide association studies (GWAS). The use of prior information to rank genomic regions and perform SNP selection could increase the power of GWAS. In this study, we propose the use of gene expression data from RNA-Seq of multiple tissues as prior information to assign weights to SNP, select SNP based on a weight threshold, and utilize weighted hypothesis testing to conduct a GWAS. RNA-Seq libraries from hypothalamus, duodenum, ileum, and jejunum tissue of 30 pigs with divergent feed efficiency phenotypes were sequenced, and a three-way gene x individual x tissue clustering analysis was performed, using constrained tensor decomposition, to obtain a total of 10 gene expression modules. Loading values from each gene module were used to assign weights to 49,691 commercial SNP markers, and SNP were selected using a weight threshold, resulting in 10 SNP sets ranging in size from 101 to 955 markers. Weighted GWAS for feed intake in 4,200 pigs was performed separately for each of the 10 SNP sets. A total of 36 unique significant SNP associations were identified across the ten gene modules (SNP sets). For comparison, a standard unweighted GWAS using all 49,691 SNP was performed, and only 2 SNP were significant. None of the SNP from the unweighted analysis resided in known QTL related to swine feed efficiency (feed intake, average daily gain, and feed conversion ratio) compared to 29 (80.6%) in the weighted analyses, with 9 SNP residing in feed intake QTL. These results suggest that the heritability of feed intake is driven by many SNP that individually do not attain genome-wide significance in GWAS. Hence, the proposed procedure for prioritizing SNP based on gene expression data across multiple tissues provides a promising approach for improving the power of GWAS.
Collapse
Affiliation(s)
- Brittney N. Keel
- USDA, ARS, U.S. Meat Animal Research Center, Clay Center, NE, United States
| | | | | | | | | | | |
Collapse
|
39
|
Ventura RV, Brito LF, Oliveira GA, Daetwyler HD, Schenkel FS, Sargolzaei M, Vandervoort G, Fonseca e Silva F, Miller SP, Carvalho ME, Santana MHA, Mattos EC, Fonseca P, Eler JP, Ferraz JBS. A comprehensive comparison of high-density SNP panels and an alternative ultra-high-density panel for genomic analyses in Nellore cattle. ANIMAL PRODUCTION SCIENCE 2020. [DOI: 10.1071/an18305] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
There is evidence that some genotyping platforms might not work very well for Zebu cattle when compared with Taurine breeds. In addition, the availability of panels with low to moderate number of overlapping markers is a limitation for combining datasets for genomic evaluations, especially when animals are genotyped using different SNP panels. In the present study, we compared the performance of medium- and high-density (HD) commercially available panels and investigated the feasibility of developing an ultra-HD panel (SP) containing markers from an Illumina (HD_I) and an Affymetrix (HD_A) panels. The SP panel contained 1123442 SNPs. After performing SNP pruning on the basis of linkage disequilibrium, HD_A, HD_I and SP contained 429624, 365225 and 658770 markers distributed across the whole genome. The overall mean proportion of markers pruned out per chromosome for HD_A, HD_I and SP was 15.17%, 43.18%, 38.63% respectively. The HD_I panel presented the highest mean number of runs-of-homozygosity segments per animal (45.48%, an increment of 5.11% compared with SP) and longer segments, on average (3057.95 kb per segment), than did both HD_A and SP. HD_I also showed the highest mean number of SNPs per run-of-homozygosity segment. Consequently, the majority of animals presented the highest genomic inbreeding levels when genotyped using HD_I. The visual examination of marker distribution along the genome illustrated uncovered regions among the different panels. Haplotype-block comparison among panels and the average haplotype size constructed on the basis of HD_A were smaller than those from HD_I. The average number of SNPs per haplotype was different between HD_A and HD_I. Both HD_A and HD_I panels achieved high imputation accuracies when used as the lower-density panels for imputing to SP. However, imputation accuracy from HD_A to SP was greater than was imputation from HD_I to SP. Imputation from one HD panel to the other is also feasible. Low- and medium-density panels, composed of markers that are subsets of both HD_A and HD_I panels, should be developed to achieve better imputation accuracies to both HD levels. Therefore, the genomic analyses performed in the present study showed significant differences among the SNP panels used.
Collapse
|
40
|
Rowan TN, Hoff JL, Crum TE, Taylor JF, Schnabel RD, Decker JE. A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle. Genet Sel Evol 2019; 51:77. [PMID: 31878893 PMCID: PMC6933688 DOI: 10.1186/s12711-019-0519-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 12/16/2019] [Indexed: 01/08/2023] Open
Abstract
Background During the last decade, the use of common-variant array-based single nucleotide polymorphism (SNP) genotyping in the beef and dairy industries has produced an astounding amount of medium-to-low density genomic data. Although low-density assays work well in the context of genomic prediction, they are less useful for detecting and mapping causal variants and the effects of rare variants are not captured. The objective of this project was to maximize the accuracies of genotype imputation from medium- and low-density assays to the marker set obtained by combining two high-density research assays (~ 850,000 SNPs), the Illumina BovineHD and the GGP-F250 assays, which contains a large proportion of rare and potentially functional variants and for which the assay design is described here. This 850 K SNP set is useful for both imputation to sequence-level genotypes and direct downstream analysis. Results We found that a large multi-breed composite imputation reference panel that includes 36,131 samples with either BovineHD and/or GGP-F250 genotypes significantly increased imputation accuracy compared with a within-breed reference panel, particularly at variants with low minor allele frequencies. Individual animal imputation accuracies were maximized when more genetically similar animals were represented in the composite reference panel, particularly with complete 850 K genotypes. The addition of rare variants from the GGP-F250 assay to our composite reference panel significantly increased the imputation accuracy of rare variants that are exclusively present on the BovineHD assay. In addition, we show that an assay marker density of 50 K SNPs balances cost and accuracy for imputation to 850 K. Conclusions Using high-density genotypes on all available individuals in a multi-breed reference panel maximized imputation accuracy for tested cattle populations. Admixed animals or those from breeds with a limited representation in the composite reference panel were still imputed at high accuracy, which is expected to further increase as the reference panel expands. We anticipate that the addition of rare variants from the GGP-F250 assay will increase the accuracy of imputation to sequence level.
Collapse
Affiliation(s)
- Troy N Rowan
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jesse L Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Tamar E Crum
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA. .,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| | - Jared E Decker
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA. .,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
41
|
Jiang D, Xin C, Ye J, Yuan Y, Fang M. ICGRM: integrative construction of genomic relationship matrix combining multiple genomic regions for big dataset. BMC Bioinformatics 2019; 20:731. [PMID: 31878869 PMCID: PMC6933885 DOI: 10.1186/s12859-019-3319-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2019] [Accepted: 12/16/2019] [Indexed: 12/13/2022] Open
Abstract
Background Genomic prediction is an advanced method for estimating genetic values, which has been widely accepted for genetic evaluation in animal and disease-risk prediction in human. It estimates genetic values with genome-wide distributed SNPs instead of pedigree. The key step of it is to construct genomic relationship matrix (GRM) via genome-wide SNPs; however, usually the calculation of GRM needs huge computer memory especially when the SNP number and sample size are big, so that sometimes it will become computationally prohibitive even for super computer clusters. We herein developed an integrative algorithm to compute GRM. To avoid calculating GRM for the whole genome, ICGRM freely divides the genome-wide SNPs into several segments and computes the summary statistics related to GRM for each segment that requires quite few computer RAM; then it integrates these summary statistics to produce GRM for whole genome. Results It showed that the computer memory of ICGRM was reduced by 15 times (from 218Gb to 14Gb) after the genome SNPs were split into 5 to 200 parts in terms of the number of SNPs in our simulation dataset, making it computationally feasible for almost all kinds of computer servers. ICGRM is implemented in C/C++ and freely available via https://github.com/mingfang618/CLGRM. Conclusions ICGRM is computationally efficient software to build GRM and can be used for big dataset.
Collapse
Affiliation(s)
- Dan Jiang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Fisheries College, Jimei University, Xiamen, 361021, Fujian Province, China
| | - Cong Xin
- Institute of Dermatology and Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei, 230032, China
| | - Jinhua Ye
- College of Science, Heilongjiang Bayi Agricultural University, Daqing, 163319, China
| | - Yingbo Yuan
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Fisheries College, Jimei University, Xiamen, 361021, Fujian Province, China
| | - Ming Fang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Fisheries College, Jimei University, Xiamen, 361021, Fujian Province, China.
| |
Collapse
|
42
|
Evaluation of imputation accuracy using the combination of two high-density panels in Nelore beef cattle. Sci Rep 2019; 9:17920. [PMID: 31784673 PMCID: PMC6884513 DOI: 10.1038/s41598-019-54382-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 11/12/2019] [Indexed: 11/17/2022] Open
Abstract
This study compared imputation from lower-density commercial and customized panels to high-density panels and a combined panel (Illumina and Affymetrix) in Nelore beef cattle. Additionally, linkage disequilibrium and haplotype block conformation were estimated in individual high-density panels and compared with corresponding values in the combined panel after imputation. Overall, 814 animals were genotyped using BovineHD BeadChip (IllumHD), and 93 of these animals were also genotyped using the Axion Genome-Wide BOS 1 Array Plate (AffyHD). In general, customization considering linkage disequilibrium and minor allele frequency had the highest accuracies. The IllumHD panel had higher values of linkage disequilibrium for short distances between SNPs than AffyHD and the combined panel. The combined panel had an increased number of small haplotype blocks. The use of a combined panel is recommended due to its increased density and number of haplotype blocks, which in turn increase the probability of a marker being close to a quantitative trait locus of interest. Considering common SNPs between IllumHD and AffyHD for the customization of a low-density panel increases the imputation accuracy for IllumHD, AffyHD and the combined panel.
Collapse
|
43
|
Song H, Ye S, Jiang Y, Zhang Z, Zhang Q, Ding X. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet Sel Evol 2019; 51:58. [PMID: 31638889 PMCID: PMC6805481 DOI: 10.1186/s12711-019-0500-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 10/07/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs. RESULTS In total, 2089 and 2043 individuals with production and reproduction phenotypes, respectively, from three Yorkshire populations with different genetic backgrounds were genotyped with the PorcineSNP80 panel. Imputation accuracy from 80K to WGS variants reached 92%. The results showed that use of the WGS data compared to the 80K SNP panel did not increase the accuracy of genomic prediction in a single population, but using WGS data with LD pruning and GFBLUP with prior information did yield higher accuracy than the 80K SNP panel. For the 80K SNP panel genotypes, using the combined population resulted in a slight improvement, no change, or even a slight decrease in accuracy in comparison with the single population for GBLUP and ssGBLUP, while accuracy increased by 1 to 2.4% when using WGS data. Notably, the GFBLUP method did not perform well for both the combined population and the single populations. CONCLUSIONS The use of WGS data was beneficial for combined population genomic prediction. Simply increasing the number of SNPs to the WGS level did not increase accuracy for a single population, while using pruned WGS data based on LD and GFBLUP with prior information could yield higher accuracy than the 80K SNP panel.
Collapse
Affiliation(s)
- Hailiang Song
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Yifan Jiang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
44
|
Atashi H, Salavati M, De Koster J, Ehrlich J, Crowe M, Opsomer G, Hostens M. Genome-wide association for milk production and lactation curve parameters in Holstein dairy cows. J Anim Breed Genet 2019; 137:292-304. [PMID: 31576624 PMCID: PMC7217222 DOI: 10.1111/jbg.12442] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 09/07/2019] [Accepted: 09/12/2019] [Indexed: 12/31/2022]
Abstract
The aim of this study was to identify genomic regions associated with 305‐day milk yield and lactation curve parameters on primiparous (n = 9,910) and multiparous (n = 11,158) Holstein cows. The SNP solutions were estimated using a weighted single‐step genomic BLUP approach and imputed high‐density panel (777k) genotypes. The proportion of genetic variance explained by windows of 50 consecutive SNP (with an average of 165 Kb) was calculated, and regions that accounted for more than 0.50% of the variance were used to search for candidate genes. Estimated heritabilities were 0.37, 0.34, 0.17, 0.12, 0.30 and 0.19, respectively, for 305‐day milk yield, peak yield, peak time, ramp, scale and decay for primiparous cows. Genetic correlations of 305‐day milk yield with peak yield, peak time, ramp, scale and decay in primiparous cows were 0.99, 0.63, 0.20, 0.97 and −0.52, respectively. The results identified three windows on BTA14 associated with 305‐day milk yield and the parameters of lactation curve in primi‐ and multiparous cows. Previously proposed candidate genes for milk yield supported by this work include GRINA, CYHR1, FOXH1, TONSL, PPP1R16A, ARHGAP39, MAF1, OPLAH and MROH1, whereas newly identified candidate genes are MIR2308, ZNF7, ZNF34, SLURP1, MAFA and KIFC2 (BTA14). The protein lipidation biological process term, which plays a key role in controlling protein localization and function, was identified as the most important term enriched by the identified genes.
Collapse
Affiliation(s)
- Hadi Atashi
- Department of Reproduction, Obstetrics and Herd Health, Ghent University, Merelbeke, Belgium.,Department of Animal Science, Shiraz University, Shiraz, Iran
| | - Mazdak Salavati
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, UK
| | - Jenne De Koster
- Department of Reproduction, Obstetrics and Herd Health, Ghent University, Merelbeke, Belgium
| | | | - Mark Crowe
- University College Dublin, Dublin, Ireland
| | - Geert Opsomer
- Department of Reproduction, Obstetrics and Herd Health, Ghent University, Merelbeke, Belgium
| | | | - Miel Hostens
- Department of Reproduction, Obstetrics and Herd Health, Ghent University, Merelbeke, Belgium
| |
Collapse
|
45
|
Bojnord NR, Honarvar M, Aminafshara M, Kashan NEJ. Imputation of non-genotyped individuals using their genotyped progeny implementing machine learning algorithm. GENE REPORTS 2019. [DOI: 10.1016/j.genrep.2019.100435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
46
|
Phasing quality assessment in a brown layer population through family- and population-based software. BMC Genet 2019; 20:57. [PMID: 31311514 PMCID: PMC6636125 DOI: 10.1186/s12863-019-0759-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Accepted: 06/23/2019] [Indexed: 01/05/2023] Open
Abstract
Background Haplotype data contains more information than genotype data and provides possibilities such as imputing low frequency variants, inferring points of recombination, detecting recurrent mutations, mapping linkage disequilibrium (LD), studying selection signatures, estimating IBD probabilities, etc. In addition, haplotype structure is used to assess genetic diversity and expected accuracy in genomic selection programs. Nevertheless, the quality and efficiency of phasing has rarely been a subject of thorough study but was assessed mainly as a by-product in imputation quality studies. Moreover, phasing studies based on data of a poultry population are non-existent. The aim of this study was to evaluate the phasing quality of FImpute and Beagle, two of the most used phasing software. Results We simulated ten replicated samples of a layer population comprising 888 individuals from a real SNP dataset of 580 k and a pedigree of 12 generations. Chromosomes analyzed were 1, 7 and 20. We measured the percentage of SNPs that were phased equally between true and phased haplotypes (Eqp), proportion of individuals completely correctly phased, number of incorrectly phased SNPs or Breakpoints (Bkp) and the length of inverted haplotype segments. Results were obtained for three different groups of individuals, with no parents or offspring genotyped in the dataset, with only one parent, and with both parents, respectively. The phasing was performed with Beagle (v3.3 and v4.1) and FImpute v2.2 (with and without pedigree). Eqp values ranged from 88 to 100%, with the best results from haplotypes phased with Beagle v4.1 and FImpute with pedigree information and at least one parent genotyped. FImpute haplotypes showed a higher number of Bkp than Beagle. As a consequence, switched haplotype segments were longer for Beagle than for FImpute. Conclusion We concluded that for the dataset applied in this study Beagle v4.1 or FImpute with pedigree information and at least one parent genotyped in the data set were the best alternatives for obtaining high quality phased haplotypes. Electronic supplementary material The online version of this article (10.1186/s12863-019-0759-3) contains supplementary material, which is available to authorized users.
Collapse
|
47
|
Nayeri S, Schenkel F, Fleming A, Kroezen V, Sargolzaei M, Baes C, Cánovas A, Squires J, Miglior F. Genome-wide association analysis for β-hydroxybutyrate concentration in Milk in Holstein dairy cattle. BMC Genet 2019; 20:58. [PMID: 31311492 PMCID: PMC6636026 DOI: 10.1186/s12863-019-0761-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 06/28/2019] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Ketosis in dairy cattle has been shown to cause a high morbidity in the farm and substantial financial losses to dairy farmers. Ketosis symptoms, however, are difficult to identify, therefore, the amount of ketone bodies (mainly β-hydroxybutyric acid, BHB) is used as an indicator of subclinical ketosis in cows. It has also been shown that milk BHB concentrations have a strong correlation with ketosis in dairy cattle. Mid-infrared spectroscopy (MIR) has recently became a fast, cheap and high-throughput method for analyzing milk components. The aim of this study was to perform a genome-wide association study (GWAS) on the MIR-predicted milk BHB to identify genomic regions, genes and pathways potentially affecting subclinical ketosis in North American Holstein dairy cattle. RESULTS Several significant regions were identified associated with MIR-predicted milk BHB concentrations (indicator of subclinical ketosis) in the first lactation (SCK1) and second and later lactations (SCK2) in Holstein dairy cows. The strongest association was located on BTA6 for SCK1 and BTA14 on SCK2. Several SNPs on BTA6 were identified in regions and variants reported previously to be associated with susceptibility to ketosis and clinical mastitis in Jersey and Holstein dairy cattle, respectively. One highly significant SNP on BTA14 was found within the DGAT1 gene with known functions on fat metabolism and inflammatory response in dairy cattle. A region on BTA6 and three SNPs on BTA20 were found to overlap between SCK1 and SCK2. However, a novel region on BTA20 (55-63 Mb) for SCK2 was also identified, which was not reported in previous association studies. Enrichment analysis of the list of candidate genes within the identified regions for MIR-predicted milk BHB concentrations yielded molecular functions and biological processes that may be involved in the inflammatory response and lipid metabolism in dairy cattle. CONCLUSIONS The results of this study confirmed several SNPs and genes identified in previous studies as associated with ketosis susceptibility and immune response, and also found a novel region that can be used for further analysis to identify causal variations and key regulatory genes that affect clinical/ subclinical ketosis.
Collapse
Affiliation(s)
- S. Nayeri
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - F. Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - A. Fleming
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
- Canadian Dairy Network, Guelph, ON N1K 1E5 Canada
| | - V. Kroezen
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - M. Sargolzaei
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
- Select Sires Inc., Plain City, OH 43064 USA
| | - C. Baes
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - A. Cánovas
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - J. Squires
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - F. Miglior
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G 2W1 Canada
| |
Collapse
|
48
|
Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data. Heredity (Edinb) 2019; 124:37-49. [PMID: 31278370 PMCID: PMC6906477 DOI: 10.1038/s41437-019-0246-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 05/11/2019] [Accepted: 06/17/2019] [Indexed: 11/10/2022] Open
Abstract
The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark–Finland–Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk.
Collapse
|
49
|
Nani JP, Rezende FM, Peñagaricano F. Predicting male fertility in dairy cattle using markers with large effect and functional annotation data. BMC Genomics 2019; 20:258. [PMID: 30940077 PMCID: PMC6444482 DOI: 10.1186/s12864-019-5644-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Accepted: 03/25/2019] [Indexed: 11/22/2022] Open
Abstract
Background Fertility is among the most important economic traits in dairy cattle. Genomic prediction for cow fertility has received much attention in the last decade, while bull fertility has been largely overlooked. The goal of this study was to assess genomic prediction of dairy bull fertility using markers with large effect and functional annotation data. Sire conception rate (SCR) was used as a measure of service sire fertility. Dataset consisted of 11.5 k U.S. Holstein bulls with SCR records and about 300 k single nucleotide polymorphism (SNP) markers. The analyses included the use of both single-kernel and multi-kernel predictive models fitting either all SNPs, markers with large effect, or markers with presumed functional roles, such as non-synonymous, synonymous, or non-coding regulatory variants. Results The entire set of SNPs yielded predictive correlations of 0.340. Five markers located on chromosomes BTA8, BTA9, BTA13, BTA17, and BTA27 showed marked dominance effects. Interestingly, the inclusion of these five major markers as fixed effects in the predictive models increased predictive correlations to 0.403, representing an increase in accuracy of about 19% compared with the standard model. Single-kernel models fitting functional SNP classes outperformed their counterparts using random sets of SNPs, suggesting that the predictive power of these functional variants is driven in part by their biological roles. Multi-kernel models fitting all the functional SNP classes together with the five major markers exhibited predictive correlations around 0.405. Conclusions The inclusion of markers with large effect markedly improved the prediction of dairy sire fertility. Functional variants exhibited higher predictive ability than random variants, but did not outperform the standard whole-genome approach. This research is the foundation for the development of novel strategies that could help the dairy industry make accurate genome-guided selection decisions on service sire fertility.
Collapse
Affiliation(s)
- Juan Pablo Nani
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA.,Estación Experimental Agropecuaria Rafaela, Instituto Nacional de Tecnología Agropecuaria, 22-2300, Rafaela, SF, Argentina
| | - Fernanda M Rezende
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA.,Faculdade de Medicina Veterinária, Universidade Federal de Uberlândia, Uberlândia, MG, 38410-337, Brazil
| | - Francisco Peñagaricano
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA. .,University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32610, USA.
| |
Collapse
|
50
|
Iung LHS, Petrini J, Ramírez-Díaz J, Salvian M, Rovadoscki GA, Pilonetto F, Dauria BD, Machado PF, Coutinho LL, Wiggans GR, Mourão GB. Genome-wide association study for milk production traits in a Brazilian Holstein population. J Dairy Sci 2019; 102:5305-5314. [PMID: 30904307 DOI: 10.3168/jds.2018-14811] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Accepted: 10/19/2018] [Indexed: 12/19/2022]
Abstract
Advances in the molecular area of selection have expanded knowledge of the genetic architecture of complex traits through genome-wide association studies (GWAS). Several GWAS have been performed so far, but confirming these results is not always possible due to several factors, including environmental conditions. Thus, our objective was to identify genomic regions associated with traditional milk production traits, including milk yield, somatic cell score, fat, protein and lactose percentages, and fatty acid composition in a Holstein cattle population producing under tropical conditions. For this, 75,228 phenotypic records from 5,981 cows and genotypic data of 56,256 SNP from 1,067 cows were used in a weighted single-step GWAS. A total of 46 windows of 10 SNP explaining more than 1% of the genetic variance across 10 Bos taurus autosomes (BTA) harbored well-known and novel genes. The MGST1 (BTA5), ABCG2 (BTA6), DGAT1 (BTA14), and PAEP (BTA11) genes were confirmed within some of the regions identified in our study. Potential novel genes involved in tissue damage and repair of the mammary gland (COL18A1), immune response (LTTC19), glucose homeostasis (SLC37A1), synthesis of unsaturated fatty acids (LTBP1), and sugar transport (SLC37A1 and MFSD4A) were found for milk yield, somatic cell score, fat percentage, and fatty acid composition. Our findings may assist genomic selection by using these regions to design a customized SNP array to improve milk production traits on farms with similar environmental conditions.
Collapse
Affiliation(s)
- L H S Iung
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - J Petrini
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - J Ramírez-Díaz
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - M Salvian
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - G A Rovadoscki
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - F Pilonetto
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - B D Dauria
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - P F Machado
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - L L Coutinho
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil
| | - G R Wiggans
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - G B Mourão
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo 13418900, Brazil.
| |
Collapse
|