1
|
Križanac AM, Reimer C, Heise J, Liu Z, Pryce JE, Bennewitz J, Thaller G, Falker-Gieske C, Tetens J. Sequence-based GWAS in 180,000 German Holstein cattle reveals new candidate variants for milk production traits. Genet Sel Evol 2025; 57:3. [PMID: 39905301 PMCID: PMC11796172 DOI: 10.1186/s12711-025-00951-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2024] [Accepted: 01/23/2025] [Indexed: 02/06/2025] Open
Abstract
BACKGROUND Milk production traits are complex and influenced by many genetic and environmental factors. Although extensive research has been performed for these traits, with many associations unveiled thus far, due to their crucial economic importance, complex genetic architecture, and the fact that causal variants in cattle are still scarce, there is a need for a better understanding of their genetic background. In this study, we aimed to identify new candidate loci associated with milk production traits in German Holstein cattle, the most important dairy breed in Germany and worldwide. For that purpose, 180,217 cattle were imputed to the sequence level and large-scale genome-wide association study (GWAS) followed by fine-mapping and evolutionary and functional annotation were carried out to identify and prioritize new association signals. RESULTS Using the imputed sequence data of a large cattle dataset, we identified 50,876 significant variants, confirming many known and identifying previously unreported candidate variants for milk (MY), fat (FY), and protein yield (PY). Genome-wide significant signals were fine-mapped with the Bayesian approach that determines the credible variant sets and generates the probability of causality for each signal. The variants with the highest probabilities of being causal were further classified using external information about the function and evolution, making the prioritization for subsequent validation experiments easier. The top potential causal variants determined with fine-mapping explained a large percentage of genetic variance compared to random ones; 178 variants explained 11.5%, 104 explained 7.7%, and 68 variants explained 3.9% of the variance for MY, FY, and PY, respectively, demonstrating the potential for causality. CONCLUSIONS Our findings proved the power of large samples and sequence-based GWAS in detecting new association signals. In order to fully exploit the power of GWAS, one should aim at very large samples combined with whole-genome sequence data. These can also come with both computational and time burdens, as presented in our study. Although milk production traits in cattle are comprehensively investigated, the genetic background of these traits is still not fully understood, with the potential for many new associations to be revealed, as shown. With constantly growing sample sizes, we expect more insights into the genetic architecture of milk production traits in the future.
Collapse
Affiliation(s)
- Ana-Marija Križanac
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany.
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany.
| | - Christian Reimer
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, 31535, Neustadt, Germany
| | - Johannes Heise
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283, Verden, Germany
| | - Zengting Liu
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283, Verden, Germany
| | - Jennie E Pryce
- Agriculture Victoria Research, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Jörn Bennewitz
- Institute of Animal Science, University of Hohenheim, 70599, Stuttgart, Germany
| | - Georg Thaller
- Institute of Animal Breeding and Husbandry, Christian-Albrechts-University, 24118, Kiel, Germany
| | - Clemens Falker-Gieske
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, University of Goettingen, Burckhardtweg 2, 37077, Göttingen, Germany
- Center for Integrated Breeding Research, Department of Animal Sciences, University of Goettingen, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
| |
Collapse
|
2
|
Teodoro M, Maiorano AM, Campos GS, de Albuquerque LG, de Oliveira HN. Genetic parameters, genomic prediction, and identification of regulatory regions located on chromosome 14 for weight traits in Nellore cattle. J Anim Breed Genet 2024. [PMID: 39189106 DOI: 10.1111/jbg.12895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 08/06/2024] [Accepted: 08/11/2024] [Indexed: 08/28/2024]
Abstract
This study aimed to investigate functional variants in chromosome 14 (BTA14) and its impact in genomic selection for birth weight (BW), weaning weight (WW), and yearling weight (YW) in Nellore cattle. Genetic parameter estimation and the weighted single-step genomic best linear unbiased prediction (WssGBLUP) analyses were performed. Direct additive heritability estimates were high for WW and YW, and moderate for BW. Trait-associated variants distributed across multiple regions on BTA14 were observed in the weighted single-step genome-wide association studies (WssGWAS) results, implying a polygenic genetic architecture for weight in different ages. Several genes have been found in association with the weight traits, including the CUB And Sushi multiple domains 3 (CSMD3), thyroglobulin (TG), and diacylglycerol O-acyltransferase 1 (DGAT1) genes. The variance explained per SNP was higher in six functional classes of gene regulatory regions (5UTR, CpG islands, downstream, upstream, long non-coding RNA, and transcription factor binding sites (TFBS)), highlighting their importance for weight traits in Nellore cattle. A marginal increase in accuracy was observed when the selected functional variants (SV) information was considered in the WssGBLUP method, probably because of the small number of SV available on BTA14. The identified genes, pathways, and functions contribute to a better understanding of the genetic and physiological mechanisms regulating weight traits in the Nellore breed.
Collapse
Affiliation(s)
- Miller Teodoro
- Department of Animal Science, São Paulo State University, Jaboticabal, Brazil
| | | | | | | | | |
Collapse
|
3
|
Yuan C, Gualdrón Duarte JL, Takeda H, Georges M, Druet T. Evaluation of heritability partitioning approaches in livestock populations. BMC Genomics 2024; 25:690. [PMID: 39003468 PMCID: PMC11246585 DOI: 10.1186/s12864-024-10600-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 07/08/2024] [Indexed: 07/15/2024] Open
Abstract
BACKGROUND Heritability partitioning approaches estimate the contribution of different functional classes, such as coding or regulatory variants, to the genetic variance. This information allows a better understanding of the genetic architecture of complex traits, including complex diseases, but can also help improve the accuracy of genomic selection in livestock species. However, methods have mainly been tested on human genomic data, whereas livestock populations have specific characteristics, such as high levels of relatedness, small effective population size or long-range levels of linkage disequilibrium. RESULTS Here, we used data from 14,762 cows, imputed at the whole-genome sequence level for 11,537,240 variants, to simulate traits in a typical livestock population and evaluate the accuracy of two state-of-the-art heritability partitioning methods, GREML and a Bayesian mixture model. In simulations where a single functional class had increased contribution to heritability, we observed that the estimators were unbiased but had low precision. When causal variants were enriched in variants with low (< 0.05) or high (> 0.20) minor allele frequency or low (below 1st quartile) or high (above 3rd quartile) linkage disequilibrium scores, it was necessary to partition the genetic variance into multiple classes defined on the basis of allele frequencies or LD scores to obtain unbiased results. When multiple functional classes had variable contributions to heritability, estimators showed higher levels of variation and confounding between certain categories was observed. In addition, estimators from small categories were particularly imprecise. However, the estimates and their ranking were still informative about the contribution of the classes. We also demonstrated that using methods that estimate the contribution of a single category at a time, a commonly used approach, results in an overestimation. Finally, we applied the methods to phenotypes for muscular development and height and estimated that, on average, variants in open chromatin regions had a higher contribution to the genetic variance (> 45%), while variants in coding regions had the strongest individual effects (> 25-fold enrichment on average). Conversely, variants in intergenic or intronic regions showed lower levels of enrichment (0.2 and 0.6-fold on average, respectively). CONCLUSIONS Heritability partitioning approaches should be used cautiously in livestock populations, in particular for small categories. Two-component approaches that fit only one functional category at a time lead to biased estimators and should not be used.
Collapse
Affiliation(s)
- Can Yuan
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de L'Hôpital, 1, 4000, Liège, Belgium.
| | | | - Haruko Takeda
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de L'Hôpital, 1, 4000, Liège, Belgium
| | - Michel Georges
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de L'Hôpital, 1, 4000, Liège, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Avenue de L'Hôpital, 1, 4000, Liège, Belgium
| |
Collapse
|
4
|
Lee DJ, Kim Y, Dinh PTN, Chung Y, Lee D, Kim Y, Lee SH, Choi I, Lee SH. Identification of Missense Variants Affecting Carcass Traits for Hanwoo Precision Breeding. Genes (Basel) 2023; 14:1839. [PMID: 37895191 PMCID: PMC10606632 DOI: 10.3390/genes14101839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 09/20/2023] [Accepted: 09/21/2023] [Indexed: 10/29/2023] Open
Abstract
This study aimed to identify causal variants associated with important carcass traits such as weight and meat quality in Hanwoo cattle. We analyzed missense mutations extracted from imputed sequence data (ARS-UCD1.2) and performed an exon-specific association test on the carcass traits of 16,970 commercial Hanwoo. We found 33, 2, 1, and 3 significant SNPs associated with carcass weight (CW), backfat thickness (BFT), eye muscle area (EMA), and marbling score (MS), respectively. In CW and EMA, the most significant missense SNP was identified at 19,524,263 on BTA14 and involved the PRKDC. A missense SNP in the ZFAND2B, located at 107,160,304 on BTA2 was identified as being involved in BFT. For MS, missense SNP in the ACVR2B gene, located at 11,849,704 in BTA22 was identified as the most significant marker. The contribution of the most significant missense SNPs to genetic variance was confirmed to be 8.47%, 2.08%, 1.73%, and 1.19% in CW, BFT, EMA, and MS, respectively. We generated favorable and unfavorable haplotype combinations based on the significant SNPs for CW. Significant differences in GEBV (Genomic Estimated Breeding Values) were observed between groups with each favorable and unfavorable haplotype combination. In particular, the missense SNPs in PRKDC, MRPL9, and ANKFN1 appear to significantly affect the protein's function and structure, making them strong candidates as causal mutations. These missense SNPs have the potential to serve as valuable markers for improving carcass traits in Hanwoo commercial farms.
Collapse
Affiliation(s)
- Dong Jae Lee
- Division of Animal & Dairy Science, Chungnam National University, Daejeon 34134, Republic of Korea; (D.J.L.); (Y.C.); (D.L.); (S.H.L.)
| | - Yoonsik Kim
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea; (Y.K.); (P.T.N.D.)
| | - Phuong Thanh N. Dinh
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea; (Y.K.); (P.T.N.D.)
| | - Yoonji Chung
- Division of Animal & Dairy Science, Chungnam National University, Daejeon 34134, Republic of Korea; (D.J.L.); (Y.C.); (D.L.); (S.H.L.)
| | - Dooho Lee
- Division of Animal & Dairy Science, Chungnam National University, Daejeon 34134, Republic of Korea; (D.J.L.); (Y.C.); (D.L.); (S.H.L.)
| | - Yeongkuk Kim
- Quantomic Research & Solution, Daejeon 34134, Republic of Korea;
| | - Soo Hyun Lee
- Division of Animal & Dairy Science, Chungnam National University, Daejeon 34134, Republic of Korea; (D.J.L.); (Y.C.); (D.L.); (S.H.L.)
| | - Inchul Choi
- Division of Animal & Dairy Science, Chungnam National University, Daejeon 34134, Republic of Korea; (D.J.L.); (Y.C.); (D.L.); (S.H.L.)
| | - Seung Hwan Lee
- Department of Bio-AI Convergence, Chungnam National University, Daejeon 34134, Republic of Korea; (Y.K.); (P.T.N.D.)
| |
Collapse
|
5
|
Pausch H, Mapel XM. Review: Genetic mutations affecting bull fertility. Animal 2023; 17 Suppl 1:100742. [PMID: 37567657 DOI: 10.1016/j.animal.2023.100742] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 08/13/2023] Open
Abstract
Cattle are a well-suited "model organism" to study the genetic underpinnings of variation in male reproductive performance. The adoption of artificial insemination and genomic prediction in many cattle breeds provide access to microarray-derived genotypes and repeated measurements for semen quality and insemination success in several thousand bulls. Similar-sized mapping cohorts with phenotypes for male fertility are not available for most other species precluding powerful association testing. The repeated measurements of the artificial insemination bulls' semen quality enable the differentiation between transient and biologically relevant trait fluctuations, and thus, are an ideal source of phenotypes for variance components estimation and genome-wide association testing. Genome-wide case-control association testing involving bulls with either aberrant sperm quality or low insemination success revealed several causal recessive loss-of-function alleles underpinning monogenic reproductive disorders. These variants are routinely monitored with customised genotyping arrays in the male selection candidates to avoid the use of subfertile or infertile bulls for artificial insemination and natural service. Genome-wide association studies with quantitative measurements of semen quality and insemination success revealed quantitative trait loci for male fertility, but the underlying causal variants remain largely unknown. Moreover, these loci explain only a small part of the heritability of male fertility. Integrating genome-wide association studies with gene expression and other omics data from male reproductive tissues is required for the fine-mapping of candidate causal variants underlying variation in male reproductive performance in cattle.
Collapse
Affiliation(s)
- Hubert Pausch
- Animal Genomics, Department of Environmental Systems Science, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland.
| | - Xena Marie Mapel
- Animal Genomics, Department of Environmental Systems Science, ETH Zurich, Universitaetstrasse 2, 8092 Zurich, Switzerland
| |
Collapse
|
6
|
Comparison of allele-specific expression in Sistani cattle and its crossbreed with Holstein, Simmental, and Montbeliarde breeds. Trop Anim Health Prod 2023; 55:7. [DOI: 10.1007/s11250-022-03422-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 12/06/2022] [Indexed: 12/23/2022]
|
7
|
Tahir MS, Porto-Neto LR, Reverter-Gomez T, Olasege BS, Sajid MR, Wockner KB, Tan AWL, Fortes MRS. Utility of multi-omics data to inform genomic prediction of heifer fertility traits. J Anim Sci 2022; 100:skac340. [PMID: 36239447 PMCID: PMC9733504 DOI: 10.1093/jas/skac340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 10/12/2022] [Indexed: 12/15/2022] Open
Abstract
Biologically informed single nucleotide polymorphisms (SNPs) impact genomic prediction accuracy of the target traits. Our previous genomics, proteomics, and transcriptomics work identified candidate genes related to puberty and fertility in Brahman heifers. We aimed to test this biological information for capturing heritability and predicting heifer fertility traits in another breed i.e., Tropical Composite. The SNP from the identified genes including 10 kilobases (kb) region on either side were selected as biologically informed SNP set. The SNP from the rest of the Bos taurus genes including 10-kb region on either side were selected as biologically uninformed SNP set. Bovine high-density (HD) complete SNP set (628,323 SNP) was used as a control. Two populations-Tropical Composites (N = 1331) and Brahman (N = 2310)-had records for three traits: pregnancy after first mating season (PREG1, binary), first conception score (FCS, score 1 to 3), and rebreeding score (REB, score 1 to 3.5). Using the best linear unbiased prediction method, effectiveness of each SNP set to predict the traits was tested in two scenarios: a 5-fold cross-validation within Tropical Composites using biological information from Brahman studies, and application of prediction equations from one breed to the other. The accuracy of prediction was calculated as the correlation between genomic estimated breeding values and adjusted phenotypes. Results show that biologically informed SNP set estimated heritabilities not significantly better than the control HD complete SNP set in Tropical Composites; however, it captured all the observed genetic variance in PREG1 and FCS when modeled together with the biologically uninformed SNP set. In 5-fold cross-validation within Tropical Composites, the biologically informed SNP set performed marginally better (statistically insignificant) in terms of prediction accuracies (PREG1: 0.20, FCS: 0.13, and REB: 0.12) as compared to HD complete SNP set (PREG1: 0.17, FCS: 0.10, and REB: 0.11), and biologically uninformed SNP set (PREG1: 0.16, FCS: 0.10, and REB: 0.11). Across-breed use of prediction equations still remained a challenge: accuracies by all SNP sets dropped to around zero for all traits. The performance of biologically informed SNP was not significantly better than other sets in Tropical Composites. However, results indicate that biological information obtained from Brahman was successful to predict the fertility traits in Tropical Composite population.
Collapse
Affiliation(s)
- Muhammad S Tahir
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Laercio R Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Toni Reverter-Gomez
- Commonwealth Scientific and Industrial Research Organization, St. Lucia, Brisbane 4072, QLD, Australia
| | - Babatunde S Olasege
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Mirza R Sajid
- Department of Statistics, University of Gujrat, 50700 Punjab, Pakistan
| | - Kimberley B Wockner
- Queensland Department of Agriculture and Fisheries, Brisbane 4072, QLD, Australia
| | - Andre W L Tan
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| | - Marina R S Fortes
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia Campus, Brisbane 4072, QLD, Australia
| |
Collapse
|
8
|
Dizaj RF, Amin-Afshar M, Esmaeilkhanian S, Emamjomeh-Kashan N, Banabazi MH. Comparing allele-specific expression in Sistani cattle and its crossbreds with Holstein, Simmental, and Montbeliarde. Onderstepoort J Vet Res 2022. [DOI: 10.4102/ojvr.v89i1.2041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
9
|
Niu Q, Zhang T, Xu L, Wang T, Wang Z, Zhu B, Zhang L, Gao H, Song J, Li J, Xu L. Integration of selection signatures and multi-trait GWAS reveals polygenic genetic architecture of carcass traits in beef cattle. Genomics 2021; 113:3325-3336. [PMID: 34314829 DOI: 10.1016/j.ygeno.2021.07.025] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/05/2021] [Accepted: 07/22/2021] [Indexed: 11/18/2022]
Abstract
Carcass merits are widely considered as economically important traits affecting beef production in the beef cattle industry. However, the genetic basis of carcass traits remains to be well understood. Here, we applied multiple methods, including the Composite of Likelihood Ratio (CLR) and Genome-wide Association Study (GWAS), to explore the selection signatures and candidate variants affecting carcass traits. We identified 11,600 selected regions overlapping with 2214 candidate genes, and most of those were enriched in binding and gene regulation. Notably, we identified 66 and 110 potential variants significantly associated with carcass traits using single-trait and multi-traits analyses, respectively. By integrating selection signatures with single and multi-traits associations, we identified 12 and 27 putative genes, respectively. Several highly conserved missense variants were identified in OR5M13D, NCAPG, and TEX2. Our study supported polygenic genetic architecture of carcass traits and provided novel insights into the genetic basis of complex traits in beef cattle.
Collapse
Affiliation(s)
- Qunhao Niu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Tianliu Zhang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Ling Xu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Tianzhen Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Zezhao Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Bo Zhu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Lupei Zhang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Huijiang Gao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Jiuzhou Song
- Department of Animal and Avian Science, University of Maryland, College Park, USA
| | - Junya Li
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China.
| | - Lingyang Xu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China.
| |
Collapse
|
10
|
Prowse-Wilkins CP, Wang J, Xiang R, Garner JB, Goddard ME, Chamberlain AJ. Putative Causal Variants Are Enriched in Annotated Functional Regions From Six Bovine Tissues. Front Genet 2021; 12:664379. [PMID: 34249087 PMCID: PMC8260860 DOI: 10.3389/fgene.2021.664379] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 05/24/2021] [Indexed: 12/13/2022] Open
Abstract
Genetic variants which affect complex traits (causal variants) are thought to be found in functional regions of the genome. Identifying causal variants would be useful for predicting complex trait phenotypes in dairy cows, however, functional regions are poorly annotated in the bovine genome. Functional regions can be identified on a genome-wide scale by assaying for post-translational modifications to histone proteins (histone modifications) and proteins interacting with the genome (e.g., transcription factors) using a method called Chromatin immunoprecipitation followed by sequencing (ChIP-seq). In this study ChIP-seq was performed to find functional regions in the bovine genome by assaying for four histone modifications (H3K4Me1, H3K4Me3, H3K27ac, and H3K27Me3) and one transcription factor (CTCF) in 6 tissues (heart, kidney, liver, lung, mammary and spleen) from 2 to 3 lactating dairy cows. Eighty-six ChIP-seq samples were generated in this study, identifying millions of functional regions in the bovine genome. Combinations of histone modifications and CTCF were found using ChromHMM and annotated by comparing with active and inactive genes across the genome. Functional marks differed between tissues highlighting areas which might be particularly important to tissue-specific regulation. Supporting the cis-regulatory role of functional regions, the read counts in some ChIP peaks correlated with nearby gene expression. The functional regions identified in this study were enriched for putative causal variants as seen in other species. Interestingly, regions which correlated with gene expression were particularly enriched for potential causal variants. This supports the hypothesis that complex traits are regulated by variants that alter gene expression. This study provides one of the largest ChIP-seq annotation resources in cattle including, for the first time, in the mammary gland of lactating cows. By linking regulatory regions to expression QTL and trait QTL we demonstrate a new strategy for identifying causal variants in cattle.
Collapse
Affiliation(s)
- Claire P Prowse-Wilkins
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Jianghui Wang
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Ruidong Xiang
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Josie B Garner
- Agriculture Victoria, Ellinbank Dairy Centre, Ellinbank, VIC, Australia
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia.,Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
| |
Collapse
|
11
|
Lai E, Danner AL, Famula TR, Oberbauer AM. Genome-Wide Association Studies Reveal Susceptibility Loci for Noninfectious Claw Lesions in Holstein Dairy Cattle. Front Genet 2021; 12:657375. [PMID: 34122511 PMCID: PMC8194352 DOI: 10.3389/fgene.2021.657375] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 04/15/2021] [Indexed: 01/10/2023] Open
Abstract
Sole ulcers (SUs) and white line disease (WLD) are two common noninfectious claw lesions (NICL) that arise due to a compromised horn production and are frequent causes of lameness in dairy cattle, imposing welfare and profitability concerns. Low to moderate heritability estimates of SU and WLD susceptibility indicate that genetic selection could reduce their prevalence. To identify the susceptibility loci for SU, WLD, SU and/or WLD, and any type of noninfectious claw lesion, genome-wide association studies (GWAS) were performed using generalized linear mixed model (GLMM) regression, chunk-based association testing (CBAT), and a random forest (RF) approach. Cows from five commercial dairies in California were classified as controls having no lameness records and ≥6 years old (n = 102) or cases having SU (n = 152), WLD (n = 117), SU and/or WLD (SU + WLD, n = 198), or any type of noninfectious claw lesion (n = 217). The top single nucleotide polymorphisms (SNPs) were defined as those passing the Bonferroni-corrected suggestive and significance thresholds in the GLMM analysis or those that a validated RF model considered important. Effects of the top SNPs were quantified using Bayesian estimation. Linkage disequilibrium (LD) blocks defined by the top SNPs were explored for candidate genes and previously identified, functionally relevant quantitative trait loci. The GLMM and CBAT approaches revealed the same regions of association on BTA8 for SU and BTA13 common to WLD, SU + WLD, and NICL. These SNPs had effects significantly different from zero, and the LD blocks they defined explained a significant amount of phenotypic variance for each dataset (6.1-8.1%, p < 0.05), indicating the small but notable contribution of these regions to susceptibility. These regions contained candidate genes involved in wound healing, skin lesions, bone growth and mineralization, adipose tissue, and keratinization. The LD block defined by the most significant SNP on BTA8 for SU included a SNP previously associated with SU. The RF models were overfitted, indicating that the SNP effects were very small, thereby preventing meaningful interpretation of SNPs and any downstream analyses. These findings suggested that variants associated with various physiological systems may contribute to susceptibility for NICL, demonstrating the complexity of genetic predisposition.
Collapse
Affiliation(s)
- Ellen Lai
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Alexa L Danner
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Thomas R Famula
- Animal Science Department, University of California, Davis, Davis, CA, United States
| | - Anita M Oberbauer
- Animal Science Department, University of California, Davis, Davis, CA, United States
| |
Collapse
|
12
|
Marete A, Ariel O, Ibeagha-Awemu E, Bissonnette N. Identification of Long Non-coding RNA Isolated From Naturally Infected Macrophages and Associated With Bovine Johne's Disease in Canadian Holstein Using a Combination of Neural Networks and Logistic Regression. Front Vet Sci 2021; 8:639053. [PMID: 33969037 PMCID: PMC8100051 DOI: 10.3389/fvets.2021.639053] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 02/15/2021] [Indexed: 01/15/2023] Open
Abstract
Mycobacterium avium ssp. paratuberculosis (MAP) causes chronic enteritis in most ruminants. The pathogen MAP causes Johne's disease (JD), a chronic, incurable, wasting disease. Weight loss, diarrhea, and a gradual drop in milk production characterize the disease's clinical phase, culminating in death. Several studies have characterized long non-coding RNA (lncRNA) in bovine tissues, and a previous study characterizes (lncRNA) in macrophages infected with MAP in vitro. In this study, we aim to characterize the lncRNA in macrophages from cows naturally infected with MAP. From 15 herds, feces and blood samples were collected for each cow older than 24 months, twice yearly over 3–5 years. Paired samples were analyzed by fecal PCR and blood ELISA. We used RNA-seq data to study lncRNA in macrophages from 33 JD(+) and 33 JD(–) dairy cows. We performed RNA-seq analysis using the “new Tuxedo” suite. We characterized lncRNA using logistic regression and multilayered neural networks and used DESeq2 for differential expression analysis and Panther and Reactome classification systems for gene ontology (GO) analysis. The study identified 13,301 lncRNA, 605 of which were novel lncRNA. We found seven genes close to differentially expressed lncRNA, including CCDC174, ERI1, FZD1, TWSG1, ZBTB38, ZNF814, and ZSCAN4. None of the genes associated with susceptibility to JD have been cited in the literature. LncRNA target genes were significantly enriched for biological process GO terms involved in immunity and nucleic acid regulation. These include the MyD88 pathway (TLR5), GO:0043312 (neutrophil degranulation), GO:0002446 (neutrophil-mediated immunity), and GO:0042119 (neutrophil activation). These results identified lncRNA with potential roles in host immunity and potential candidate genes and pathways through which lncRNA might function in response to MAP infection.
Collapse
Affiliation(s)
- Andrew Marete
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| | - Olivier Ariel
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada.,Faculty of Science, Sherbrooke University, Sherbrooke, QC, Canada
| | - Eveline Ibeagha-Awemu
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| | - Nathalie Bissonnette
- Agriculture and Agri-Food Canada, Sherbrooke Research and Development Centre, Sherbrooke, QC, Canada
| |
Collapse
|
13
|
Fernandes Júnior GA, Carvalheiro R, de Oliveira HN, Sargolzaei M, Costilla R, Ventura RV, Fonseca LFS, Neves HHR, Hayes BJ, de Albuquerque LG. Imputation accuracy to whole-genome sequence in Nellore cattle. Genet Sel Evol 2021; 53:27. [PMID: 33711929 PMCID: PMC7953568 DOI: 10.1186/s12711-021-00622-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 03/05/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A cost-effective strategy to explore the complete DNA sequence in animals for genetic evaluation purposes is to sequence key ancestors of a population, followed by imputation mechanisms to infer marker genotypes that were not originally reported in a target population of animals genotyped with single nucleotide polymorphism (SNP) panels. The feasibility of this process relies on the accuracy of the genotype imputation in that population, particularly for potential causal mutations which may be at low frequency and either within genes or regulatory regions. The objective of the present study was to investigate the imputation accuracy to the sequence level in a Nellore beef cattle population, including that for variants in annotation classes which are more likely to be functional. METHODS Information of 151 key sequenced Nellore sires were used to assess the imputation accuracy from bovine HD BeadChip SNP (~ 777 k) to whole-genome sequence. The choice of the sires aimed at optimizing the imputation accuracy of a genotypic database, comprised of about 10,000 genotyped Nellore animals. Genotype imputation was performed using two computational approaches: FImpute3 and Minimac4 (after using Eagle for phasing). The accuracy of the imputation was evaluated using a fivefold cross-validation scheme and measured by the squared correlation between observed and imputed genotypes, calculated by individual and by SNP. SNPs were classified into a range of annotations, and the accuracy of imputation within each annotation classification was also evaluated. RESULTS High average imputation accuracies per animal were achieved using both FImpute3 (0.94) and Minimac4 (0.95). On average, common variants (minor allele frequency (MAF) > 0.03) were more accurately imputed by Minimac4 and low-frequency variants (MAF ≤ 0.03) were more accurately imputed by FImpute3. The inherent Minimac4 Rsq imputation quality statistic appears to be a good indicator of the empirical Minimac4 imputation accuracy. Both software provided high average SNP-wise imputation accuracy for all classes of biological annotations. CONCLUSIONS Our results indicate that imputation to whole-genome sequence is feasible in Nellore beef cattle since high imputation accuracies per individual are expected. SNP-wise imputation accuracy is software-dependent, especially for rare variants. The accuracy of imputation appears to be relatively independent of annotation classification.
Collapse
Affiliation(s)
| | - Roberto Carvalheiro
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Henrique N de Oliveira
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Mehdi Sargolzaei
- Ontario Veterinary College, UG, Guelph, Canada.,Select Sires Inc., Plain City, OH, USA
| | - Roy Costilla
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Ricardo V Ventura
- School of Veterinary Medicine and Animal Science, USP, Pirassununga, SP, 13635-900, Brazil
| | - Larissa F S Fonseca
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil
| | | | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Lucia G de Albuquerque
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil. .,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil.
| |
Collapse
|
14
|
Lopez BIM, An N, Srikanth K, Lee S, Oh JD, Shin DH, Park W, Chai HH, Park JE, Lim D. Genomic Prediction Based on SNP Functional Annotation Using Imputed Whole-Genome Sequence Data in Korean Hanwoo Cattle. Front Genet 2021; 11:603822. [PMID: 33552124 PMCID: PMC7859490 DOI: 10.3389/fgene.2020.603822] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 11/09/2020] [Indexed: 12/12/2022] Open
Abstract
Whole-genome sequence (WGS) data are increasingly being applied into genomic predictions, offering a higher predictive ability by including causal mutations or single-nucleotide polymorphisms (SNPs) putatively in strong linkage disequilibrium with causal mutations affecting the trait. This study aimed to improve the predictive performance of the customized Hanwoo 50 k SNP panel for four carcass traits in commercial Hanwoo population by adding highly predictive variants from sequence data. A total of 16,892 Hanwoo cattle with phenotypes (i.e., backfat thickness, carcass weight, longissimus muscle area, and marbling score), 50 k genotypes, and WGS imputed genotypes were used. We partitioned imputed WGS data according to functional annotation [intergenic (IGR), intron (ITR), regulatory (REG), synonymous (SYN), and non-synonymous (NSY)] to characterize the genomic regions that will deliver higher predictive power for the traits investigated. Animals were assigned into two groups, the discovery set (7324 animals) used for predictive variant detection and the cross-validation set for genomic prediction. Genome-wide association studies were performed by trait to every genomic region and entire WGS data for the pre-selection of variants. Each set of pre-selected SNPs with different density (1000, 3000, 5000, or 10,000) were added to the 50 k genotypes separately and the predictive performance of each set of genotypes was assessed using the genomic best linear unbiased prediction (GBLUP). Results showed that the predictive performance of the customized Hanwoo 50 k SNP panel can be improved by the addition of pre-selected variants from the WGS data, particularly 3000 variants from each trait, which is then sufficient to improve the prediction accuracy for all traits. When 12,000 pre-selected variants (3000 variants from each trait) were added to the 50 k genotypes, the prediction accuracies increased by 9.9, 9.2, 6.4, and 4.7% for backfat thickness, carcass weight, longissimus muscle area, and marbling score compared to the regular 50 k SNP panel, respectively. In terms of prediction bias, regression coefficients for all sets of genotypes in all traits were close to 1, indicating an unbiased prediction. The strategy used to select variants based on functional annotation did not show a clear advantage compared to using whole-genome. Nonetheless, such pre-selected SNPs from the IGR region gave the highest improvement in prediction accuracy among genomic regions and the values were close to those obtained using the WGS data for all traits. We concluded that additional gain in prediction accuracy when using pre-selected variants appears to be trait-dependent, and using WGS data remained more accurate compared to using a specific genomic region.
Collapse
Affiliation(s)
- Bryan Irvine M Lopez
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Narae An
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Krishnamoorthy Srikanth
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Seunghwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Jae-Don Oh
- Department of Animal Biotechnology, Chonbuk National University, Jeonju, South Korea
| | - Dong-Hyun Shin
- Department of Agricultural Convergence Technology, Chonbuk National University, Jeonju, South Korea
| | - Woncheoul Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Han-Ha Chai
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Jong-Eun Park
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| |
Collapse
|
15
|
van den Berg I, Xiang R, Jenko J, Pausch H, Boussaha M, Schrooten C, Tribout T, Gjuvsland AB, Boichard D, Nordbø Ø, Sanchez MP, Goddard ME. Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94,321 cattle from eight cattle breeds. Genet Sel Evol 2020; 52:37. [PMID: 32635893 PMCID: PMC7339598 DOI: 10.1186/s12711-020-00556-4] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 06/26/2020] [Indexed: 12/14/2022] Open
Abstract
Background Sequence-based genome-wide association studies (GWAS) provide high statistical power to identify candidate causal mutations when a large number of individuals with both sequence variant genotypes and phenotypes is available. A meta-analysis combines summary statistics from multiple GWAS and increases the power to detect trait-associated variants without requiring access to data at the individual level of the GWAS mapping cohorts. Because linkage disequilibrium between adjacent markers is conserved only over short distances across breeds, a multi-breed meta-analysis can improve mapping precision. Results To maximise the power to identify quantitative trait loci (QTL), we combined the results of nine within-population GWAS that used imputed sequence variant genotypes of 94,321 cattle from eight breeds, to perform a large-scale meta-analysis for fat and protein percentage in cattle. The meta-analysis detected (p ≤ 10−8) 138 QTL for fat percentage and 176 QTL for protein percentage. This was more than the number of QTL detected in all within-population GWAS together (124 QTL for fat percentage and 104 QTL for protein percentage). Among all the lead variants, 100 QTL for fat percentage and 114 QTL for protein percentage had the same direction of effect in all within-population GWAS. This indicates either persistence of the linkage phase between the causal variant and the lead variant across breeds or that some of the lead variants might indeed be causal or tightly linked with causal variants. The percentage of intergenic variants was substantially lower for significant variants than for non-significant variants, and significant variants had mostly moderate to high minor allele frequencies. Significant variants were also clustered in genes that are known to be relevant for fat and protein percentages in milk. Conclusions Our study identified a large number of QTL associated with fat and protein percentage in dairy cattle. We demonstrated that large-scale multi-breed meta-analysis reveals more QTL at the nucleotide resolution than within-population GWAS. Significant variants were more often located in genic regions than non-significant variants and a large part of them was located in potentially regulatory regions.
Collapse
Affiliation(s)
- Irene van den Berg
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.
| | - Ruidong Xiang
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Janez Jenko
- GENO SA, Storhamargata 44, 2317, Hamar, Norway
| | | | - Mekki Boussaha
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Thierry Tribout
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Didier Boichard
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | | | - Marie-Pierre Sanchez
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, 78350, Jouy-en-Josas, France
| | - Mike E Goddard
- Agriculture Victoria Research, AgriBio, 5 Ring Road, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
16
|
Xu L, Gao N, Wang Z, Xu L, Liu Y, Chen Y, Xu L, Gao X, Zhang L, Gao H, Zhu B, Li J. Incorporating Genome Annotation Into Genomic Prediction for Carcass Traits in Chinese Simmental Beef Cattle. Front Genet 2020; 11:481. [PMID: 32499816 PMCID: PMC7243208 DOI: 10.3389/fgene.2020.00481] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 04/17/2020] [Indexed: 01/08/2023] Open
Abstract
Various methods have been proposed for genomic prediction (GP) in livestock. These methods have mainly focused on statistical considerations and did not include genome annotation information. In this study, to improve the predictive performance of carcass traits in Chinese Simmental beef cattle, we incorporated the genome annotation information into GP. Single nucleotide polymorphisms (SNPs) were annotated to five genomic classes: intergenic, gene, exon, protein coding sequences, and 3'/5' untranslated region. Haploblocks were constructed for all markers and these five genomic classes by defining a biologically functional unit, and haplotype effects were modeled in both numerical dosage and categorical coding strategies. The first-order epistatic effects among SNPs and haplotypes were modeled using a categorical epistasis model. For all makers, the extension from the SNP-based model to a haplotype-based model improved the accuracy by 5.4-9.8% for carcass weight (CW), live weight (LW), and striploin (SI). For the five genomic classes using the haplotype-based prediction model, the incorporation of gene class information into the model improved the accuracies by an average of 1.4, 2.1, and 1.3% for CW, LW, and SI, respectively, compared with their corresponding results for all markers. Including the first-order epistatic effects into the prediction models improved the accuracies in some traits and genomic classes. Therefore, for traits with moderate-to-high heritability, incorporating genome annotation information of gene class into haplotype-based prediction models could be considered as a promising tool for GP in Chinese Simmental beef cattle, and modeling epistasis in prediction can further increase the accuracy to some degree.
Collapse
Affiliation(s)
- Ling Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ning Gao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ying Liu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| |
Collapse
|
17
|
Srikanth K, Lee SH, Chung KY, Park JE, Jang GW, Park MR, Kim NY, Kim TH, Chai HH, Park WC, Lim D. A Gene-Set Enrichment and Protein-Protein Interaction Network-Based GWAS with Regulatory SNPs Identifies Candidate Genes and Pathways Associated with Carcass Traits in Hanwoo Cattle. Genes (Basel) 2020; 11:E316. [PMID: 32188084 PMCID: PMC7140899 DOI: 10.3390/genes11030316] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 03/06/2020] [Accepted: 03/12/2020] [Indexed: 02/06/2023] Open
Abstract
Non-synonymous SNPs and protein coding SNPs within the promoter region of genes (regulatory SNPs) might have a significant effect on carcass traits. Imputed sequence level data of 10,215 Hanwoo bulls, annotated and filtered to include only regulatory SNPs (450,062 SNPs), were used in a genome-wide association study (GWAS) to identify loci associated with backfat thickness (BFT), carcass weight (CWT), eye muscle area (EMA), and marbling score (MS). A total of 15, 176, and 1 SNPs were found to be significantly associated (p < 1.11 × 10-7) with BFT, CWT, and EMA, respectively. The significant loci were BTA4 (CWT), BTA6 (CWT), BTA14 (CWT and EMA), and BTA19 (BFT). BayesR estimated that 1.1%~1.9% of the SNPs contributed to more than 0.01% of the phenotypic variance. So, the GWAS was complemented by a gene-set enrichment (GSEA) and protein-protein interaction network (PPIN) analysis in identifying the pathways affecting carcass traits. At p < 0.005 (~2,261 SNPs), 25 GO and 18 KEGG categories, including calcium signaling, cell proliferation, and folate biosynthesis, were found to be enriched through GSEA. The PPIN analysis showed enrichment for 81 candidate genes involved in various pathways, including the PI3K-AKT, calcium, and FoxO signaling pathways. Our finding provides insight into the effects of regulatory SNPs on carcass traits.
Collapse
Affiliation(s)
- Krishnamoorthy Srikanth
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Seung-Hwan Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea;
| | - Ki-Yong Chung
- Department of Beef Science, Korea National College of Agriculture and Fisheries, Jeonju 54874, Korea;
| | - Jong-Eun Park
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Gul-Won Jang
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Mi-Rim Park
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Na Yeon Kim
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Tae-Hun Kim
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Han-Ha Chai
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Won Cheoul Park
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| | - Dajeong Lim
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea (J.-E.P.); (G.-W.J.); (M.-R.P.); (N.Y.K.); (T.-H.K.); (H.-H.C.); (W.C.P.)
| |
Collapse
|
18
|
Doyle JL, Berry DP, Veerkamp RF, Carthy TR, Evans RD, Walsh SW, Purfield DC. Genomic regions associated with muscularity in beef cattle differ in five contrasting cattle breeds. Genet Sel Evol 2020; 52:2. [PMID: 32000665 PMCID: PMC6993462 DOI: 10.1186/s12711-020-0523-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2019] [Accepted: 01/17/2020] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Linear type traits, which reflect the muscular characteristics of an animal, could provide insight into how, in some cases, morphologically very different animals can yield the same carcass weight. Such variability may contribute to differences in the overall value of the carcass since primal cuts vary greatly in price; such variability may also hinder successful genome-based association studies. Therefore, the objective of our study was to identify genomic regions that are associated with five muscularity linear type traits and to determine if these significant regions are common across five different breeds. Analyses were carried out using linear mixed models on imputed whole-genome sequence data in each of the five breeds, separately. Then, the results of the within-breed analyses were used to conduct an across-breed meta-analysis per trait. RESULTS We identified many quantitative trait loci (QTL) that are located across the whole genome and associated with each trait in each breed. The only commonality among the breeds and traits was a large-effect pleiotropic QTL on BTA2 that contained the MSTN gene, which was associated with all traits in the Charolais and Limousin breeds. Other plausible candidate genes were identified for muscularity traits including PDE1A, PPP1R1C and multiple collagen and HOXD genes. In addition, associated (gene ontology) GO terms and KEGG pathways tended to differ between breeds and between traits especially in the numerically smaller populations of Angus, Hereford, and Simmental breeds. Most of the SNPs that were associated with any of the traits were intergenic or intronic SNPs located within regulatory regions of the genome. CONCLUSIONS The commonality between the Charolais and Limousin breeds indicates that the genetic architecture of the muscularity traits may be similar in these breeds due to their similar origins. Conversely, there were vast differences in the QTL associated with muscularity in Angus, Hereford, and Simmental. Knowledge of these differences in genetic architecture between breeds is useful to develop accurate genomic prediction equations that can operate effectively across breeds. Overall, the associated QTL differed according to trait, which suggests that breeding for a morphologically different (e.g. longer and wider versus shorter and smaller) more efficient animal may become possible in the future.
Collapse
Affiliation(s)
- Jennifer L. Doyle
- Teagasc, Animal and Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork Ireland
- Department of Science, Waterford Institute of Technology, Cork Road, Waterford, Co. Waterford Ireland
| | - Donagh P. Berry
- Teagasc, Animal and Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork Ireland
| | - Roel F. Veerkamp
- Animal Breeding and Genomics Centre, Wageningen University and Research Centre, Livestock Research, Wageningen, The Netherlands
| | - Tara R. Carthy
- Teagasc, Animal and Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork Ireland
| | - Ross D. Evans
- Irish Cattle Breeding Federation, Bandon, Co. Cork Ireland
| | - Siobhán W. Walsh
- Department of Science, Waterford Institute of Technology, Cork Road, Waterford, Co. Waterford Ireland
| | - Deirdre C. Purfield
- Teagasc, Animal and Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork Ireland
| |
Collapse
|
19
|
Zhang F, Wang Y, Mukiibi R, Chen L, Vinsky M, Plastow G, Basarab J, Stothard P, Li C. Genetic architecture of quantitative traits in beef cattle revealed by genome wide association studies of imputed whole genome sequence variants: I: feed efficiency and component traits. BMC Genomics 2020; 21:36. [PMID: 31931702 PMCID: PMC6956504 DOI: 10.1186/s12864-019-6362-1] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 12/02/2019] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Genome wide association studies (GWAS) on residual feed intake (RFI) and its component traits including daily dry matter intake (DMI), average daily gain (ADG), and metabolic body weight (MWT) were conducted in a population of 7573 animals from multiple beef cattle breeds based on 7,853,211 imputed whole genome sequence variants. The GWAS results were used to elucidate genetic architectures of the feed efficiency related traits in beef cattle. RESULTS The DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants followed a scaled inverse chi-squared distribution to a greater extent. With a threshold of P-value < 1.00E-05, 16, 72, 88, and 116 lead DNA variants on multiple chromosomes were significantly associated with RFI, DMI, ADG, and MWT, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on DMI, ADG, and MWT were found on chromosomes 6, 14 and 20. On average, missense, 3'UTR, 5'UTR, and other regulatory region variants exhibited larger allele substitution effects in comparison to other functional classes. Intergenic and intron variants captured smaller proportions of additive genetic variance per DNA variant. Instead 3'UTR and synonymous variants explained a greater amount of genetic variance per DNA variant for all the traits examined while missense, 5'UTR and other regulatory region variants accounted for relatively more additive genetic variance per sequence variant for RFI and ADG, respectively. In total, 25 to 27 enriched cellular and molecular functions were identified with lipid metabolism and carbohydrate metabolism being the most significant for the feed efficiency traits. CONCLUSIONS RFI is controlled by many DNA variants with relatively small effects whereas DMI, ADG, and MWT are influenced by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory region and synonymous functional classes play a more important role per sequence variant in determining variation of the feed efficiency traits. The genetic architecture as revealed by the GWAS of the imputed 7,853,211 DNA variants will improve our understanding on the genetic control of feed efficiency traits in beef cattle.
Collapse
Affiliation(s)
- Feng Zhang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.,State Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, Jiangxi, China.,Present Address: Institute of Translational Medicine, Nanchang University, Nanchang, Jiangxi, China
| | - Yining Wang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Robert Mukiibi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Liuhong Chen
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Michael Vinsky
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada
| | - Graham Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - John Basarab
- Alberta Agriculture and Forestry, Lacombe Research and Development Centre, 6000 C&E Trail, Lacombe, AB, Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Changxi Li
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada. .,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
20
|
Wang Y, Zhang F, Mukiibi R, Chen L, Vinsky M, Plastow G, Basarab J, Stothard P, Li C. Genetic architecture of quantitative traits in beef cattle revealed by genome wide association studies of imputed whole genome sequence variants: II: carcass merit traits. BMC Genomics 2020; 21:38. [PMID: 31931697 PMCID: PMC6958779 DOI: 10.1186/s12864-019-6273-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 11/12/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Genome wide association studies (GWAS) were conducted on 7,853,211 imputed whole genome sequence variants in a population of 3354 to 3984 animals from multiple beef cattle breeds for five carcass merit traits including hot carcass weight (HCW), average backfat thickness (AFAT), rib eye area (REA), lean meat yield (LMY) and carcass marbling score (CMAR). Based on the GWAS results, genetic architectures of the carcass merit traits in beef cattle were elucidated. RESULTS The distributions of DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants conformed to a scaled inverse chi-squared distribution to a greater extent. At a threshold of P-value < 10-5, 51, 33, 46, 40, and 38 lead DNA variants on multiple chromosomes were significantly associated with HCW, AFAT, REA, LMY, and CMAR, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on HCW, AFAT, REA, and LMY were found on chromosome 6. On average, missense variants, 3'UTR variants, 5'UTR variants, and other regulatory region variants exhibited larger allele substitution effects on the traits in comparison to other functional classes. The amounts of additive genetic variance explained per DNA variant were smaller for intergenic and intron variants on all the traits whereas synonymous variants, missense variants, 3'UTR variants, 5'UTR variants, downstream and upstream gene variants, and other regulatory region variants captured a greater amount of additive genetic variance per sequence variant for one or more carcass merit traits investigated. In total, 26 enriched cellular and molecular functions were identified with lipid metabolisms, small molecular biochemistry, and carbohydrate metabolism being the most significant for the carcass merit traits. CONCLUSIONS The GWAS results have shown that the carcass merit traits are controlled by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory, synonymous, and missense functional classes have relatively larger impacts per sequence variant on the variation of carcass merit traits. The genetic architecture as revealed by the GWAS will improve our understanding on genetic controls of carcass merit traits in beef cattle.
Collapse
Affiliation(s)
- Yining Wang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Feng Zhang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
- State Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, Jiangxi China
- Present Address: Institute of Translational Medicine, Nanchang University, Nanchang, Jiangxi China
| | - Robert Mukiibi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Liuhong Chen
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Michael Vinsky
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
| | - Graham Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - John Basarab
- Alberta Agriculture and Forestry, Lacombe Research and Development Centre, 6000 C&E Trail, Lacombe, AB Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| | - Changxi Li
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB Canada
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB Canada
| |
Collapse
|
21
|
Functionally Annotating Regulatory Elements in the Equine Genome Using Histone Mark ChIP-Seq. Genes (Basel) 2019; 11:genes11010003. [PMID: 31861495 PMCID: PMC7017286 DOI: 10.3390/genes11010003] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 12/10/2019] [Accepted: 12/16/2019] [Indexed: 01/02/2023] Open
Abstract
One of the primary aims of the Functional Annotation of ANimal Genomes (FAANG) initiative is to characterize tissue-specific regulation within animal genomes. To this end, we used chromatin immunoprecipitation followed by sequencing (ChIP-Seq) to map four histone modifications (H3K4me1, H3K4me3, H3K27ac, and H3K27me3) in eight prioritized tissues collected as part of the FAANG equine biobank from two thoroughbred mares. Data were generated according to optimized experimental parameters developed during quality control testing. To ensure that we obtained sufficient ChIP and successful peak-calling, data and peak-calls were assessed using six quality metrics, replicate comparisons, and site-specific evaluations. Tissue specificity was explored by identifying binding motifs within unique active regions, and motifs were further characterized by gene ontology (GO) and protein–protein interaction analyses. The histone marks identified in this study represent some of the first resources for tissue-specific regulation within the equine genome. As such, these publicly available annotation data can be used to advance equine studies investigating health, performance, reproduction, and other traits of economic interest in the horse.
Collapse
|
22
|
Abstract
The increasing amount of available biological information on the markers can be used to inform the models applied for genomic selection to improve predictions. The objective of this study was to propose a general model for genomic selection using a link function approach within the hierarchical generalized linear model framework (hglm) that can include external information on the markers. These models can be fitted using the well-established hglm package in R. We also present an R package (CodataGS) to fit these models, which is significantly faster than the hglm package. Simulated data were used to validate the proposed model. We tested categorical, continuous and combination models where the external information on the markers was related to 1) the location of the QTL on the genome with varying degree of uncertainty, 2) the relationship of the markers with the QTL calculated as the LD between them, and 3) a combination of both. The proposed models showed improved accuracies from 3.8% up to 23.2% compared to the SNP-BLUP method in a simulated population derived from a base population with 100 individuals. Moreover, the proposed categorical model was tested on a dairy cattle dataset for two traits (Milk Yield and Fat Percentage). These results also showed improved accuracy compared to SNP-BLUP, especially for the Fat% trait. The performance of the proposed models depended on the genetic architecture of the trait, as traits that deviate from the infinitesimal model benefited more from the external information. Also, the gain in accuracy depended on the degree of uncertainty of the external information provided to the model. The usefulness of these type of models is expected to increase with time as more accurate information on the markers becomes available.
Collapse
|
23
|
Nani JP, Rezende FM, Peñagaricano F. Predicting male fertility in dairy cattle using markers with large effect and functional annotation data. BMC Genomics 2019; 20:258. [PMID: 30940077 PMCID: PMC6444482 DOI: 10.1186/s12864-019-5644-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2018] [Accepted: 03/25/2019] [Indexed: 11/22/2022] Open
Abstract
Background Fertility is among the most important economic traits in dairy cattle. Genomic prediction for cow fertility has received much attention in the last decade, while bull fertility has been largely overlooked. The goal of this study was to assess genomic prediction of dairy bull fertility using markers with large effect and functional annotation data. Sire conception rate (SCR) was used as a measure of service sire fertility. Dataset consisted of 11.5 k U.S. Holstein bulls with SCR records and about 300 k single nucleotide polymorphism (SNP) markers. The analyses included the use of both single-kernel and multi-kernel predictive models fitting either all SNPs, markers with large effect, or markers with presumed functional roles, such as non-synonymous, synonymous, or non-coding regulatory variants. Results The entire set of SNPs yielded predictive correlations of 0.340. Five markers located on chromosomes BTA8, BTA9, BTA13, BTA17, and BTA27 showed marked dominance effects. Interestingly, the inclusion of these five major markers as fixed effects in the predictive models increased predictive correlations to 0.403, representing an increase in accuracy of about 19% compared with the standard model. Single-kernel models fitting functional SNP classes outperformed their counterparts using random sets of SNPs, suggesting that the predictive power of these functional variants is driven in part by their biological roles. Multi-kernel models fitting all the functional SNP classes together with the five major markers exhibited predictive correlations around 0.405. Conclusions The inclusion of markers with large effect markedly improved the prediction of dairy sire fertility. Functional variants exhibited higher predictive ability than random variants, but did not outperform the standard whole-genome approach. This research is the foundation for the development of novel strategies that could help the dairy industry make accurate genome-guided selection decisions on service sire fertility.
Collapse
Affiliation(s)
- Juan Pablo Nani
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA.,Estación Experimental Agropecuaria Rafaela, Instituto Nacional de Tecnología Agropecuaria, 22-2300, Rafaela, SF, Argentina
| | - Fernanda M Rezende
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA.,Faculdade de Medicina Veterinária, Universidade Federal de Uberlândia, Uberlândia, MG, 38410-337, Brazil
| | - Francisco Peñagaricano
- Department of Animal Sciences, University of Florida, 2250 Shealy Drive, Gainesville, FL, 32611, USA. .,University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32610, USA.
| |
Collapse
|
24
|
Zwane AA, Schnabel RD, Hoff J, Choudhury A, Makgahlela ML, Maiwashe A, Van Marle-Koster E, Taylor JF. Genome-Wide SNP Discovery in Indigenous Cattle Breeds of South Africa. Front Genet 2019; 10:273. [PMID: 30988672 PMCID: PMC6452414 DOI: 10.3389/fgene.2019.00273] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 03/12/2019] [Indexed: 01/30/2023] Open
Abstract
Single nucleotide polymorphism arrays have created new possibilities for performing genome-wide studies to detect genomic regions harboring sequence variants that affect complex traits. However, the majority of validated SNPs for which allele frequencies have been estimated are limited primarily to European breeds. The objective of this study was to perform SNP discovery in three South African indigenous breeds (Afrikaner, Drakensberger, and Nguni) using whole genome sequencing. DNA was extracted from blood and hair samples, quantified and prepared at 50 ng/μl concentration for sequencing at the Agricultural Research Council Biotechnology Platform using an Illumina HiSeq 2500. The fastq files were used to call the variants using the Genome Analysis Tool Kit. A total of 1,678,360 were identified as novel using Run 6 of 1000 Bull Genomes Project. Annotation of the identified variants classified them into functional categories. Within the coding regions, about 30% of the SNPs were non-synonymous substitutions that encode for alternate amino acids. The study of distribution of SNP across the genome identified regions showing notable differences in the densities of SNPs among the breeds and highlighted many regions of functional significance. Gene ontology terms identified genes such as MLANA, SYT10, and CDC42EP5 that have been associated with coat color in mouse, and ADAMS3, DNAJC3, and PAG5 genes have been associated with fertility in cattle. Further analysis of the variants detected 688 candidate selective sweeps (ZHp Z-scores ≤ -4) across all three breeds, of which 223 regions were assigned as being putative selective sweeps (ZHp scores ≤-5). We also identified 96 regions with extremely low ZHp Z-scores (≤-6) in Afrikaner and Nguni. Genes such as KIT and MITF that have been associated with skin pigmentation in cattle and CACNA1C, which has been associated with biopolar disorder in human, were identified in these regions. This study provides the first analysis of sequence data to discover SNPs in indigenous South African cattle breeds. The information will play an important role in our efforts to understand the genetic history of our cattle and in designing appropriate breed improvement programmes.
Collapse
Affiliation(s)
- Avhashoni A. Zwane
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal and Wildlife Sciences, University of Pretoria, Pretoria, South Africa
| | - Robert D. Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
- Informatics Institute, University of Missouri, Columbia, MO, United States
| | - Jesse Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
| | - Ananyo Choudhury
- Sydney Brenner Institute of Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
| | - Mahlako Linah Makgahlela
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - Azwihangwisi Maiwashe
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - Este Van Marle-Koster
- Department of Animal and Wildlife Sciences, University of Pretoria, Pretoria, South Africa
| | - Jeremy F. Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
| |
Collapse
|
25
|
Rezende FM, Nani JP, Peñagaricano F. Genomic prediction of bull fertility in US Jersey dairy cattle. J Dairy Sci 2019; 102:3230-3240. [PMID: 30712930 DOI: 10.3168/jds.2018-15810] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Accepted: 11/29/2018] [Indexed: 01/02/2023]
Abstract
Service sire has a major effect on reproductive success in dairy cattle. Recent studies have reported accurate predictions for Holstein bull fertility using genomic data. The objective of this study was to assess the feasibility of genomic prediction of sire conception rate (SCR) in US Jersey cattle using alternative predictive models. Data set consisted of 1.5k Jersey bulls with SCR records and 95k SNP covering the entire genome. The analyses included the use of linear and Gaussian kernel-based models fitting either all the SNP or subsets of markers with presumed functional roles, such as SNP significantly associated with SCR or SNP located within or close to annotated genes. Model predictive ability was evaluated using 5-fold cross-validation with 10 replicates. The entire SNP set exhibited predictive correlations around 0.30. Interestingly, either SNP marginally associated with SCR or genic SNP achieved higher predictive abilities than their counterparts using random sets of SNP. Among alternative SNP subsets, Gaussian kernel models fitting significant SNP achieved the best performance with increases in predictive correlation up to 7% compared with the standard whole-genome approach. Notably, the use of a multi-breed reference population including the entire US Holstein SCR data set (11.5k bulls) allowed us to achieve predictive correlations up to 0.315, gaining 8% in accuracy compared with the standard model fitting a pure Jersey reference set. Overall, our findings indicate that genomic prediction of Jersey bull fertility is feasible. The use of Gaussian kernels fitting markers with relevant roles and the inclusion of Holstein records in the training set seem to be promising alternatives to the standard whole-genome approach. These results have the potential to help the dairy industry improve US Jersey sire fertility through accurate genome-guided decisions.
Collapse
Affiliation(s)
- Fernanda M Rezende
- Department of Animal Sciences, University of Florida, Gainesville 32611; Faculdade de Medicina Veterinária, Universidade Federal de Uberlândia, Uberlândia MG 38410-337, Brazil
| | - Juan Pablo Nani
- Department of Animal Sciences, University of Florida, Gainesville 32611; Estación Experimental Agropecuaria Rafaela, Instituto Nacional de Tecnología Agropecuaria, Rafaela SF 22-2300, Argentina
| | - Francisco Peñagaricano
- Department of Animal Sciences, University of Florida, Gainesville 32611; University of Florida Genetics Institute, University of Florida, Gainesville 32610.
| |
Collapse
|
26
|
Bhuiyan MSA, Lim D, Park M, Lee S, Kim Y, Gondro C, Park B, Lee S. Functional Partitioning of Genomic Variance and Genome-Wide Association Study for Carcass Traits in Korean Hanwoo Cattle Using Imputed Sequence Level SNP Data. Front Genet 2018; 9:217. [PMID: 29988410 PMCID: PMC6024024 DOI: 10.3389/fgene.2018.00217] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 05/28/2018] [Indexed: 11/25/2022] Open
Abstract
Quantitative traits are usually controlled by numerous genomic variants with small individual effects, and variances associated with those traits are explained in a continuous manner. However, the relative contributions of genomic regions to observed genetic variations have not been well explored using sequence level single nucleotide polymorphism (SNP) information. Here, imputed sequence level SNP data (11,278,153 SNPs) of 2109 Hanwoo steers (Korean native cattle) were partitioned according to functional annotation, chromosome, and minor allele frequency (MAF). Genomic relationship matrices (GRMs) were constructed for each classified region and fitted in the model both separately and together for carcass weight (CWT), eye muscle area (EMA), backfat thickness (BFT), and marbling score (MS) traits. A genome-wide association study (GWAS) was performed to identify significantly associated variants in genic and exon regions using a linear mixed model, and the genetic contribution of each exonic SNP was determined using a Bayesian mixture model. Considering all SNPs together, the heritability estimates for CWT, EMA, BFT, and MS were 0.57 ± 0.05, 0.46 ± 0.05, 0.45 ± 0.05, and 0.49 ± 0.05, respectively, which reflected substantial genomic contributions. Joint analysis revealed that the variance explained by each chromosome was proportional to its physical length with weak linear relationships for all traits. Moreover, genomic variances explained by functional category and MAF class differed greatly among the traits studied in joint analysis. For example, exon regions had larger contributions for BFT (0.13 ± 0.08) and MS (0.22 ± 0.08), whereas intron and intergenic regions explained most of the total genomic variances for CWT and EMA (0.22 ± 0.09–0.32 ± 0.11). Considering different functional classes of exon regions and the per SNP contribution revealed the largest proportion of genetic variance was attributable to synonymous variants. GWAS detected 206 and 27 SNPs in genic and exon regions, respectively, on BTA4, BTA6, and BTA14 that were significantly associated with CWT and EMA. These SNPs were harbored by 31 candidate genes, among which TOX, FAM184B, PPARGC1A, PRKDC, LCORL, and COL1A2 were noteworthy. BayesR analysis found that most SNPs (>93%) had very small effects and the 4.02–6.92% that had larger effects (10-4 × σA2, 10-3 × σA2, and 10-2 × σA2) explained most of the total genetic variance, confirming polygenic components of the traits studied.
Collapse
Affiliation(s)
- Mohammad S A Bhuiyan
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea.,Department of Animal Breeding and Genetics, Bangladesh Agricultural University, Mymensingh, Bangladesh
| | - Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Mina Park
- Animal Genetic Improvement Division, National Institute of Animal Science, Rural Development Administration, Seonghwan, South Korea
| | - Soohyun Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Yeongkuk Kim
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| | - Cedric Gondro
- College of Agriculture and Natural Resources, Michigan State University, East Lansing, MI, United States
| | - Byoungho Park
- Animal Genetic Improvement Division, National Institute of Animal Science, Rural Development Administration, Seonghwan, South Korea
| | - Seunghwan Lee
- Department of Animal Science and Biotechnology, Chungnam National University, Daejeon, South Korea
| |
Collapse
|
27
|
Koufariotis LT, Chen YPP, Stothard P, Hayes BJ. Variance explained by whole genome sequence variants in coding and regulatory genome annotations for six dairy traits. BMC Genomics 2018; 19:237. [PMID: 29618315 PMCID: PMC5885354 DOI: 10.1186/s12864-018-4617-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Accepted: 03/22/2018] [Indexed: 02/03/2023] Open
Abstract
Background There are an exceedingly large number of sequence variants discovered through whole genome sequencing in most populations, including cattle. Deciphering which of these affect complex traits is a major challenge. In this study we hypothesize that variants in some functional classes, such as splice site regions, coding regions, DNA methylated regions and long noncoding RNA will explain more variance in complex traits than others. Two variance component approaches were used to test this hypothesis – the first determines if variants in a functional class capture a greater proportion of the variance, than expected by chance, the second uses the proportion of variance explained when variants in all annotations are fitted simultaneously. Results Our data set consisted of 28.3 million imputed whole genome sequence variants in 16,581 dairy cattle with records for 6 complex trait phenotypes, including production and fertility. We found that sequence variants in splice site regions and synonymous classes captured the greatest proportion of the variance, explaining up to 50% of the variance across all traits. We also found sequence variants in target sites for DNA methylation (genomic regions that are found be highly methylated in bovine placentas), captured a significant proportion of the variance. Per sequence variant, splice site variants explain the highest proportion of variance in this study. The proportion of variance captured by the missense predicted deleterious (from SIFT) and missense tolerated classes was relatively small. Conclusion The results demonstrate using functional annotations to filter whole genome sequence variants into more informative subsets could be useful for prioritization of the variants that are more likely to be associated with complex traits. In addition to variants found in splice sites and protein coding genes regulatory variants and those found in DNA methylated regions, explained considerable variation in milk production and fertility traits. In our analysis synonymous variants captured a significant proportion of the variance, which raises the possible explanation that synonymous mutations might have some effects, or more likely that these variants are miss-annotated, or alternatively the results reflect imperfect imputation of the actual causative variants. Electronic supplementary material The online version of this article (10.1186/s12864-018-4617-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lambros T Koufariotis
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, Building 80, 306 Carmody Road, Brisbane, St Lucia, QLD, 4072, Australia. .,Collage of Science, Health and Engineering, La Trobe University, Melbourne, VIC, 3086, Australia. .,Department of Economic Development, Jobs, Transport and Resources, AgriBio Building, 5 Ring Road, Bundoora, VIC, 3086, Australia. .,Dairy Bio, 5 Ring Road, Bundoora, VIC, 3086, Australia.
| | - Yi-Ping Phoebe Chen
- Collage of Science, Health and Engineering, La Trobe University, Melbourne, VIC, 3086, Australia
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, T6G 2C8, Canada
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, Building 80, 306 Carmody Road, Brisbane, St Lucia, QLD, 4072, Australia.,Department of Economic Development, Jobs, Transport and Resources, AgriBio Building, 5 Ring Road, Bundoora, VIC, 3086, Australia.,Dairy Bio, 5 Ring Road, Bundoora, VIC, 3086, Australia
| |
Collapse
|
28
|
Sheep genome functional annotation reveals proximal regulatory elements contributed to the evolution of modern breeds. Nat Commun 2018; 9:859. [PMID: 29491421 PMCID: PMC5830443 DOI: 10.1038/s41467-017-02809-1] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 12/03/2017] [Indexed: 12/30/2022] Open
Abstract
Domestication fundamentally reshaped animal morphology, physiology and behaviour, offering the opportunity to investigate the molecular processes driving evolutionary change. Here we assess sheep domestication and artificial selection by comparing genome sequence from 43 modern breeds (Ovis aries) and their Asian mouflon ancestor (O. orientalis) to identify selection sweeps. Next, we provide a comparative functional annotation of the sheep genome, validated using experimental ChIP-Seq of sheep tissue. Using these annotations, we evaluate the impact of selection and domestication on regulatory sequences and find that sweeps are significantly enriched for protein coding genes, proximal regulatory elements of genes and genome features associated with active transcription. Finally, we find individual sites displaying strong allele frequency divergence are enriched for the same regulatory features. Our data demonstrate that remodelling of gene expression is likely to have been one of the evolutionary forces that drove phenotypic diversification of this common livestock species. The domestication of plants and animals causes genomic changes underlying various morphologic, physiologic and behavioral changes. Here, Naval-Sanchez et al. provide a ChIP-Seq validated comparative functional annotation of the sheep genome, and show widespread evolution of proximal regulatory elements.
Collapse
|
29
|
Kasarapu P, Porto-Neto LR, Fortes MRS, Lehnert SA, Mudadu MA, Coutinho L, Regitano L, George A, Reverter A. The Bos taurus-Bos indicus balance in fertility and milk related genes. PLoS One 2017; 12:e0181930. [PMID: 28763475 PMCID: PMC5538644 DOI: 10.1371/journal.pone.0181930] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 07/10/2017] [Indexed: 12/16/2022] Open
Abstract
Numerical approaches to high-density single nucleotide polymorphism (SNP) data are often employed independently to address individual questions. We linked independent approaches in a bioinformatics pipeline for further insight. The pipeline driven by heterozygosity and Hardy-Weinberg equilibrium (HWE) analyses was applied to characterize Bos taurus and Bos indicus ancestry. We infer a gene co-heterozygosity network that regulates bovine fertility, from data on 18,363 cattle with genotypes for 729,068 SNP. Hierarchical clustering separated populations according to Bos taurus and Bos indicus ancestry. The weights of the first principal component were subjected to Normal mixture modelling allowing the estimation of a gene’s contribution to the Bos taurus-Bos indicus axis. We used deviation from HWE, contribution to Bos indicus content and association to fertility traits to select 1,284 genes. With this set, we developed a co-heterozygosity network where the group of genes annotated as fertility-related had significantly higher Bos indicus content compared to other functional classes of genes, while the group of genes associated with milk production had significantly higher Bos taurus content. The network analysis resulted in capturing novel gene associations of relevance to bovine domestication events. We report transcription factors that are likely to regulate genes associated with cattle domestication and tropical adaptation. Our pipeline can be generalized to any scenarios where population structure requires scrutiny at the molecular level, particularly in the presence of a priori set of genes known to impact a phenotype of evolutionary interest such as fertility.
Collapse
Affiliation(s)
- Parthan Kasarapu
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, St. Lucia, Brisbane, Queensland, Australia
| | - Laercio R. Porto-Neto
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, St. Lucia, Brisbane, Queensland, Australia
| | - Marina R. S. Fortes
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, Australia
| | - Sigrid A. Lehnert
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, St. Lucia, Brisbane, Queensland, Australia
| | | | - Luiz Coutinho
- Centro de Genomica Funcional ESALQ, University of São Paulo, Piracicaba, Sao Paulo, Brazil
| | - Luciana Regitano
- Embrapa Southeast Livestock, Rodovia Washington Luiz, São Carlos, Sao Paulo, Brazil
| | - Andrew George
- CSIRO, DATA61, Ecosciences Precinct Brisbane, Brisbane, Queensland, Australia
| | - Antonio Reverter
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, St. Lucia, Brisbane, Queensland, Australia
- * E-mail:
| |
Collapse
|
30
|
Kiser JN, White SN, Johnson KA, Hoff JL, Taylor JF, Neibergs HL. Identification of loci associated with susceptibility to subspecies () tissue infection in cattle. J Anim Sci 2017; 95:1080-1091. [PMID: 28380509 DOI: 10.2527/jas.2016.1152] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Johne's disease is a contagious bacterial infection of cattle caused by ssp. (). A previous genome-wide association analysis (GWAA) in Holstein cattle identified QTL on BTA3 and BTA9 that were highly associated (P < 5 × 10) and on BTA1, BTA16, and BTA21 that were moderately associated (P < 5 × 10) with Map tissue infection. The objectives of this study were to validate previous GWAA results in Jersey cattle ( = 57), Holstein cattle from the Pacific Northwest (PNW, = 205) and a combined Holstein population from the PNW and the Northeast (PNW + NE, = 423), and also identify new loci associated with tissue infection. DNA was genotyped using the Illumina BovineSNP50 BeadChip, and the PNW + NE data was also imputed to whole genome sequence level using Run4 of the 1000 Bull Genomes project with Beagle v 4.1 and FImpute. Cases were ileocecal node positive and controls were negative for by quantitative PCR (qPCR). Individuals were removed for SNP call rate < 90%, and SNP were removed for genotype call rate < 90% or minor allele frequency < 1%. For the Jersey, PNW, and PNW + NE, GWAA were conducted using an allelic dosage model. For the PNW and the PNW + NE, an additional efficient mixed-model association eXpedited (EMMAX) analysis was performed using additive, dominance and recessive models. Seven QTL on BTA22 were identified in the Jersey population with the most significant ( = 4.45 × 10) located at 21.7 megabases (Mb). Six QTL were associated in the PNW and the PNW + NE analyses, including a QTL previously identified on BTA16 in the NE population. The most significant locus for the PNW was located on BTA21 at 61 Mb ( = 8.61 × 10) while the most significant locus for the PNW + NE was on BTA12 at 90 Mb ( = 2.33 × 10). No additional QTL were identified with the imputed GWAA. Putative positional candidate genes were identified within 50 kb 5' and 3' of each QTL. Two positional candidate genes were identified in Jersey cattle, 1 identified in the PNW and 8 in the PNW + NE populations. Many identified positional candidate genes are involved in signal transduction, have immunological functions, or have putative functional relevance in entry into host cells. This study supported 2 previously identified SNP within a QTL on BTA16 and identified 16 new QTL, including 2 found in the PNW and the PNW+NE, associated with tissue infection.
Collapse
|
31
|
Ni G, Cavero D, Fangmann A, Erbe M, Simianer H. Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture. Genet Sel Evol 2017; 49:8. [PMID: 28093063 PMCID: PMC5238523 DOI: 10.1186/s12711-016-0277-y] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 12/05/2016] [Indexed: 11/10/2022] Open
Abstract
Background With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is now feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs). Methods A total of 892 chickens from a commercial brown layer line were genotyped with 336 K segregating SNPs (array data) that included 157 K genic SNPs (i.e. SNPs in or around a gene). For these individuals, genome-wide sequence information was imputed based on data from re-sequencing runs of 25 individuals, leading to 5.2 million (M) imputed SNPs (WGS data), including 2.6 M genic SNPs. De-regressed proofs (DRP) for eggshell strength, feed intake and laying rate were used as quasi-phenotypic data in genomic prediction analyses. Four weighting factors for building a trait-specific genomic relationship matrix were investigated: identical weights, −(log10P) from genome-wide association study results, squares of SNP effects from random regression BLUP, and variable selection based weights (known as BLUP|GA). Predictive ability was measured as the correlation between DRP and direct genomic breeding values in five replications of a fivefold cross-validation. Results Averaged over the three traits, the highest predictive ability (0.366 ± 0.075) was obtained when only genic SNPs from WGS data were used. Predictive abilities with genic SNPs and all SNPs from HD array data were 0.361 ± 0.072 and 0.353 ± 0.074, respectively. Prediction with −(log10P) or squares of SNP effects as weighting factors for building a genomic relationship matrix or BLUP|GA did not increase accuracy, compared to that with identical weights, regardless of the SNP set used. Conclusions Our results show that little or no benefit was gained when using all imputed WGS data to perform genomic prediction compared to using HD array data regardless of the weighting factors tested. However, using only genic SNPs from WGS data had a positive effect on prediction ability. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0277-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Guiyan Ni
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany.
| | | | - Anna Fangmann
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany
| | - Malena Erbe
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany.,Institute for Animal Breeding, Bavarian State Research Centre for Agriculture, Grub, Germany
| | - Henner Simianer
- Animal Breeding and Genetics Group, Georg-August-Universität, Göttingen, Germany
| |
Collapse
|
32
|
Liao R, Zhang X, Chen Q, Wang Z, Wang Q, Yang C, Pan Y. Genome-wide association study reveals novel variants for growth and egg traits in Dongxiang blue-shelled and White Leghorn chickens. Anim Genet 2016; 47:588-96. [PMID: 27166871 DOI: 10.1111/age.12456] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/12/2016] [Indexed: 12/12/2022]
Abstract
This study was designed to investigate the genetic basis of growth and egg traits in Dongxiang blue-shelled chickens and White Leghorn chickens. In this study, we employed a reduced representation sequencing approach called genotyping by genome reducing and sequencing to detect genome-wide SNPs in 252 Dongxiang blue-shelled chickens and 252 White Leghorn chickens. The Dongxiang blue-shelled chicken breed has many specific traits and is characterized by blue-shelled eggs, black plumage, black skin, black bone and black organs. The White Leghorn chicken is an egg-type breed with high productivity. As multibreed genome-wide association studies (GWASs) can improve precision due to less linkage disequilibrium across breeds, a multibreed GWAS was performed with 156 575 SNPs to identify the associated variants underlying growth and egg traits within the two chicken breeds. The analysis revealed 32 SNPs exhibiting a significant genome-wide association with growth and egg traits. Some of the significant SNPs are located in genes that are known to impact growth and egg traits, but nearly half of the significant SNPs are located in genes with unclear functions in chickens. To our knowledge, this is the first multibreed genome-wide report for the genetics of growth and egg traits in the Dongxiang blue-shelled and White Leghorn chickens.
Collapse
Affiliation(s)
- R Liao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China.,Institute of Animal Husbandry and Veterinary Research, Shanghai Academy of Agricultural Sciences, Shanghai, 201106, China
| | - X Zhang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China.,Shanghai Key Laboratory of Veterinary Biotechnology, Shanghai, 200240, China
| | - Q Chen
- Faculty of Life Science and Technology, Kunming University of Science and Technology, Kunming, 650093, Yunnan, China
| | - Z Wang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Q Wang
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China.,Shanghai Key Laboratory of Veterinary Biotechnology, Shanghai, 200240, China
| | - C Yang
- Institute of Animal Husbandry and Veterinary Research, Shanghai Academy of Agricultural Sciences, Shanghai, 201106, China.
| | - Y Pan
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China. .,Shanghai Key Laboratory of Veterinary Biotechnology, Shanghai, 200240, China.
| |
Collapse
|
33
|
Pausch H, Emmerling R, Schwarzenbacher H, Fries R. A multi-trait meta-analysis with imputed sequence variants reveals twelve QTL for mammary gland morphology in Fleckvieh cattle. Genet Sel Evol 2016; 48:14. [PMID: 26883850 PMCID: PMC4756527 DOI: 10.1186/s12711-016-0190-4] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2015] [Accepted: 01/26/2016] [Indexed: 12/16/2022] Open
Abstract
Background The availability of whole-genome sequence data from key ancestors in bovine populations provides an exhaustive catalogue of polymorphic sites that segregate within and across cattle breeds. Sequence variants identified from the sequenced genome of key ancestors can be imputed into animals that have been genotyped using medium- and high-density genotyping arrays. Association analysis with imputed sequences, particularly when applied to multiple traits simultaneously, is a very powerful approach to detect candidate causal variants that underlie complex phenotypes. Results We used whole-genome sequence data from 157 key ancestors of the German Fleckvieh cattle population to impute 20,561,798 sequence variants into 10,363 animals that had (partly imputed) genotypes based on 634,109 single nucleotide polymorphisms (SNPs). Rare variants were more frequent among the sequence-derived than the array-derived genotypes. Association studies with imputed sequence variants were performed using seven correlated udder conformation traits as response variables. The calculation of an approximate multi-trait test statistic enabled us to detect 12 quantitative trait loci (QTL) (P < 2.97 × 10−9) that affect different morphological features of the mammary gland. Among the tested variants, the most significant associations were found for imputed sequence variants at 11 QTL, whereas the top association signal was observed for an array-derived variant at a QTL on bovine chromosome 14. Seven QTL were associated with multiple phenotypes. Most QTL were located in non-coding regions of the genome but in close proximity of candidate genes that could be involved in mammary gland morphology (SP5, GC, NPFFR2, CRIM1, RXFP2, TBX5, RBM19 and ADAM12). Conclusions Using imputed sequence variants in association analyses allows the detection of QTL at maximum resolution. Multi-trait approaches can reveal QTL that are not detected in single-trait association studies. Most QTL for udder conformation traits were located in non-coding regions of the genome, which suggests that mutations in regulatory sequences are the major determinants of variation in mammary gland morphology in cattle. Electronic supplementary material The online version of this article (doi:10.1186/s12711-016-0190-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hubert Pausch
- Lehrstuhl fuer Tierzucht, Technische Universitaet Muenchen, 85354, Freising, Germany.
| | - Reiner Emmerling
- Institut fuer Tierzucht, Bayerische Landesanstalt fuer Landwirtschaft, 85586, Poing, Germany.
| | | | - Ruedi Fries
- Lehrstuhl fuer Tierzucht, Technische Universitaet Muenchen, 85354, Freising, Germany.
| |
Collapse
|
34
|
Do DN, Janss LLG, Jensen J, Kadarmideen HN. SNP annotation-based whole genomic prediction and selection: an application to feed efficiency and its component traits in pigs. J Anim Sci 2016; 93:2056-63. [PMID: 26020301 DOI: 10.2527/jas.2014-8640] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The study investigated genetic architecture and predictive ability using genomic annotation of residual feed intake (RFI) and its component traits (daily feed intake [DFI], ADG, and back fat [BF]). A total of 1,272 Duroc pigs had both genotypic and phenotypic records, and the records were split into a training (968 pigs) and a validation dataset (304 pigs) by assigning records as before and after January 1, 2012, respectively. SNP were annotated by 14 different classes using Ensembl variant effect prediction. Predictive accuracy and prediction bias were calculated using Bayesian Power LASSO, Bayesian A, B, and Cπ, and genomic BLUP (GBLUP) methods. Predictive accuracy ranged from 0.508 to 0.531, 0.506 to 0.532, 0.276 to 0.357, and 0.308 to 0.362 for DFI, RFI, ADG, and BF, respectively. BayesCπ100.1 increased accuracy slightly compared to the GBLUP model and other methods. The contribution per SNP to total genomic variance was similar among annotated classes across different traits. Predictive performance of SNP classes did not significantly differ from randomized SNP groups. Genomic prediction has accuracy comparable to observed phenotype, and use of genomic prediction can be cost effective by replacing feed intake measurement. Genomic annotation had less impact on predictive accuracy traits considered here but may be different for other traits. It is the first study to provide useful insights into biological classes of SNP driving the whole genomic prediction for complex traits in pigs.
Collapse
|
35
|
Abdollahi-Arpanahi R, Morota G, Valente BD, Kranis A, Rosa GJM, Gianola D. Differential contribution of genomic regions to marked genetic variation and prediction of quantitative traits in broiler chickens. Genet Sel Evol 2016; 48:10. [PMID: 26842494 PMCID: PMC4739338 DOI: 10.1186/s12711-016-0187-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 01/15/2016] [Indexed: 11/15/2022] Open
Abstract
Background Genome-wide association studies in humans have found enrichment of trait-associated single nucleotide polymorphisms (SNPs) in coding regions of the genome and depletion of these in intergenic regions. However, a recent release of the ENCyclopedia of DNA elements showed that ~80 % of the human genome has a biochemical function. Similar studies on the chicken genome are lacking, thus assessing the relative contribution of its genic and non-genic regions to variation is relevant for biological studies and genetic improvement of chicken populations. Methods A dataset including 1351 birds that were genotyped with the 600K Affymetrix platform was used. We partitioned SNPs according to genome annotation data into six classes to characterize the relative contribution of genic and non-genic regions to genetic variation as well as their predictive power using all available quality-filtered SNPs. Target traits were body weight, ultrasound measurement of breast muscle and hen house egg production in broiler chickens. Six genomic regions were considered: intergenic regions, introns, missense, synonymous, 5′ and 3′ untranslated regions, and regions that are located 5 kb upstream and downstream of coding genes. Genomic relationship matrices were constructed for each genomic region and fitted in the models, separately or simultaneously. Kernel-based ridge regression was used to estimate variance components and assess predictive ability. Contribution of each class of genomic regions to dominance variance was also considered. Results Variance component estimates indicated that all genomic regions contributed to marked additive genetic variation and that the class of synonymous regions tended to have the greatest contribution. The marked dominance genetic variation explained by each class of genomic regions was similar and negligible (~0.05). In terms of prediction mean-square error, the whole-genome approach showed the best predictive ability. Conclusions All genic and non-genic regions contributed to phenotypic variation for the three traits studied. Overall, the contribution of additive genetic variance to the total genetic variance was much greater than that of dominance variance. Our results show that all genomic regions are important for the prediction of the targeted traits, and the whole-genome approach was reaffirmed as the best tool for genome-enabled prediction of quantitative traits.
Collapse
Affiliation(s)
- Rostam Abdollahi-Arpanahi
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA. .,Department of Animal and Poultry Science, College of Aburaihan, University of Tehran, Pakdasht, Iran.
| | - Gota Morota
- Department of Animal Science, University of Nebraska, Lincoln, NE, USA.
| | - Bruno D Valente
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA. .,Department of Dairy Science, University of Wisconsin, Madison, WI, USA.
| | - Andreas Kranis
- Aviagen Ltd, Midlothian, UK. .,The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, UK.
| | - Guilherme J M Rosa
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA. .,Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA.
| | - Daniel Gianola
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA. .,Department of Dairy Science, University of Wisconsin, Madison, WI, USA. .,Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
36
|
Heidaritabar M, Calus MPL, Megens HJ, Vereijken A, Groenen MAM, Bastiaansen JWM. Accuracy of genomic prediction using imputed whole-genome sequence data in white layers. J Anim Breed Genet 2016; 133:167-79. [PMID: 26776363 DOI: 10.1111/jbg.12199] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 11/26/2015] [Indexed: 01/17/2023]
Abstract
There is an increasing interest in using whole-genome sequence data in genomic selection breeding programmes. Prediction of breeding values is expected to be more accurate when whole-genome sequence is used, because the causal mutations are assumed to be in the data. We performed genomic prediction for the number of eggs in white layers using imputed whole-genome resequence data including ~4.6 million SNPs. The prediction accuracies based on sequence data were compared with the accuracies from the 60 K SNP panel. Predictions were based on genomic best linear unbiased prediction (GBLUP) as well as a Bayesian variable selection model (BayesC). Moreover, the prediction accuracy from using different types of variants (synonymous, non-synonymous and non-coding SNPs) was evaluated. Genomic prediction using the 60 K SNP panel resulted in a prediction accuracy of 0.74 when GBLUP was applied. With sequence data, there was a small increase (~1%) in prediction accuracy over the 60 K genotypes. With both 60 K SNP panel and sequence data, GBLUP slightly outperformed BayesC in predicting the breeding values. Selection of SNPs more likely to affect the phenotype (i.e. non-synonymous SNPs) did not improve the accuracy of genomic prediction. The fact that sequence data were based on imputation from a small number of sequenced animals may have limited the potential to improve the prediction accuracy. A small reference population (n = 1004) and possible exclusion of many causal SNPs during quality control can be other possible reasons for limited benefit of sequence data. We expect, however, that the limited improvement is because the 60 K SNP panel was already sufficiently dense to accurately determine the relationships between animals in our data.
Collapse
Affiliation(s)
- M Heidaritabar
- Animal Breeding and Genomics Centre, Wageningen University, Wageningen, the Netherlands
| | - M P L Calus
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, Wageningen, the Netherlands
| | - H-J Megens
- Animal Breeding and Genomics Centre, Wageningen University, Wageningen, the Netherlands
| | - A Vereijken
- Hendrix Genetics Research, Technology and Services B.V., Boxmeer, the Netherlands
| | - M A M Groenen
- Animal Breeding and Genomics Centre, Wageningen University, Wageningen, the Netherlands
| | - J W M Bastiaansen
- Animal Breeding and Genomics Centre, Wageningen University, Wageningen, the Netherlands
| |
Collapse
|
37
|
Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle. PLoS One 2015; 10:e0143945. [PMID: 26642058 PMCID: PMC4671594 DOI: 10.1371/journal.pone.0143945] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 11/11/2015] [Indexed: 11/19/2022] Open
Abstract
The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
Collapse
|
38
|
Genomic correlation: harnessing the benefit of combining two unrelated populations for genomic selection. Genet Sel Evol 2015; 47:84. [PMID: 26525050 PMCID: PMC4630892 DOI: 10.1186/s12711-015-0162-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Accepted: 10/16/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The success of genomic selection in animal breeding hinges on the availability of a large reference population on which genomic-based predictions of additive genetic or breeding values are built. Here, we explore the benefit of combining two unrelated populations into a single reference population. METHODS The datasets consisted of 1829 Brahman and 1973 Tropical Composite cattle with measurements on five phenotypes relevant to tropical adaptation and genotypes for 71,726 genome-wide single nucleotide polymorphisms (SNPs). The underlying genomic correlation for the same phenotype across the two breeds was explored on the basis of consistent linkage disequilibrium (LD) phase and marker effects in both breeds. RESULTS The proportion of genetic variance explained by the entire set of SNPs ranged from 37.5 to 57.6 %. Estimated genomic correlations were drastically affected by the process used to select SNPs and went from near 0 to more than 0.80 for most traits when using the set of SNPs with significant effects and the same LD phase in the two breeds. We found that, by carefully selecting the subset of SNPs, the missing heritability can be largely recovered and accuracies in genomic predictions can be improved six-fold. However, the increases in accuracy might come at the expense of large biases. CONCLUSIONS Our results offer hope for the effective implementation of genomic selection schemes in situations where the number of breeds is large, the sample size within any single breed is small and the breeding objective includes many phenotypes.
Collapse
|
39
|
Koufariotis LT, Chen YPP, Chamberlain A, Vander Jagt C, Hayes BJ. A catalogue of novel bovine long noncoding RNA across 18 tissues. PLoS One 2015; 10:e0141225. [PMID: 26496443 PMCID: PMC4619662 DOI: 10.1371/journal.pone.0141225] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 10/05/2015] [Indexed: 11/19/2022] Open
Abstract
Long non-coding RNA (lncRNA) have been implicated in diverse biological roles including gene regulation and genomic imprinting. Identifying lncRNA in bovine across many differing tissue would contribute to the current repertoire of bovine lncRNA, and help further improve our understanding of the evolutionary importance and constraints of these transcripts. Additionally, it could aid in identifying sites in the genome outside of protein coding genes where mutations could contribute to variation in complex traits. This is particularly important in bovine as genomic predictions are increasingly used in genetic improvement for milk and meat production. Our aim was to identify and annotate novel long non coding RNA transcripts in the bovine genome captured from RNA Sequencing (RNA-Seq) data across 18 tissues, sampled in triplicate from a single cow. To address the main challenge in identifying lncRNA, namely distinguishing lncRNA transcripts from unannotated genes and protein coding genes, a lncRNA identification pipeline with a number of filtering steps was developed. A total of 9,778 transcripts passed the filtering pipeline. The bovine lncRNA catalogue includes MALAT1 and HOTAIR, both of which have been well described in human and mouse genomes. We attempted to validate the lncRNA in libraries from three additional cows. 726 (87.47%) liver and 1,668 (55.27%) blood class 3 lncRNA were validated with stranded liver and blood libraries respectively. Additionally, this study identified a large number of novel unknown transcripts in the bovine genome with high protein coding potential, illustrating a clear need for better annotations of protein coding genes.
Collapse
Affiliation(s)
- Lambros T. Koufariotis
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
- * E-mail:
| | - Yi-Ping Phoebe Chen
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
| | - Amanda Chamberlain
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Christy Vander Jagt
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| | - Ben J. Hayes
- College of Science, Health and Engineering, La Trobe University Bundoora, Melbourne, Victoria, Australia
- Department of Environment and Primary Industries, AgriBio Bundoora, Melbourne, Victoria, Australia
- Dairy Futures Co-operative Research Centre, Melbourne, Victoria, Australia
| |
Collapse
|
40
|
de Camargo GMF, Porto-Neto LR, Kelly MJ, Bunch RJ, McWilliam SM, Tonhati H, Lehnert SA, Fortes MRS, Moore SS. Non-synonymous mutations mapped to chromosome X associated with andrological and growth traits in beef cattle. BMC Genomics 2015; 16:384. [PMID: 25975716 PMCID: PMC4432507 DOI: 10.1186/s12864-015-1595-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 04/28/2015] [Indexed: 12/31/2022] Open
Abstract
Background Previous genome-wide association analyses identified QTL regions in the X chromosome for percentage of normal sperm and scrotal circumference in Brahman and Tropical Composite cattle. These traits are important to be studied because they are indicators of male fertility and are correlated with female sexual precocity and reproductive longevity. The aim was to investigate candidate genes in these regions and to identify putative causative mutations that influence these traits. In addition, we tested the identified mutations for female fertility and growth traits. Results Using a combination of bioinformatics and molecular assay technology, twelve non-synonymous SNPs in eleven genes were genotyped in a cattle population. Three and nine SNPs explained more than 1% of the additive genetic variance for percentage of normal sperm and scrotal circumference, respectively. The SNPs that had a major influence in percentage of normal sperm were mapped to LOC100138021 and TAF7L genes; and in TEX11 and AR genes for scrotal circumference. One SNP in TEX11 was explained ~13% of the additive genetic variance for scrotal circumference at 12 months. The tested SNP were also associated with weight measurements, but not with female fertility traits. Conclusions The strong association of SNPs located in X chromosome genes with male fertility traits validates the QTL. The implicated genes became good candidates to be used for genetic evaluation, without detrimentally influencing female fertility traits. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1595-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gregório Miguel Ferreira de Camargo
- Departamento de Zootecnia, Universidade Estadual Paulista (Unesp), Jaboticabal, SP, 14884-900, Brazil. .,Commonwealth Scientific and Industrial Research Organization, Agriculture Flagship, St Lucia, QLD, 4067, Australia. .,School of Chemistry and Molecular Bioscience, The University of Queensland, St Lucia Brisbane, QLD, 4072, Australia.
| | - Laercio R Porto-Neto
- Commonwealth Scientific and Industrial Research Organization, Agriculture Flagship, St Lucia, QLD, 4067, Australia.
| | - Matthew J Kelly
- School of Chemistry and Molecular Bioscience, The University of Queensland, St Lucia Brisbane, QLD, 4072, Australia.
| | - Rowan J Bunch
- Commonwealth Scientific and Industrial Research Organization, Agriculture Flagship, St Lucia, QLD, 4067, Australia.
| | - Sean M McWilliam
- Commonwealth Scientific and Industrial Research Organization, Agriculture Flagship, St Lucia, QLD, 4067, Australia.
| | - Humberto Tonhati
- Departamento de Zootecnia, Universidade Estadual Paulista (Unesp), Jaboticabal, SP, 14884-900, Brazil.
| | - Sigrid A Lehnert
- Commonwealth Scientific and Industrial Research Organization, Agriculture Flagship, St Lucia, QLD, 4067, Australia.
| | - Marina R S Fortes
- School of Chemistry and Molecular Bioscience, The University of Queensland, St Lucia Brisbane, QLD, 4072, Australia.
| | - Stephen S Moore
- Queensland Alliance for Agriculture and Food Innovation, Centre for Animal Science, The University of Queensland, Brisbane, QLD, 4067, Australia.
| |
Collapse
|
41
|
Pausch H, Wurmser C, Reinhardt F, Emmerling R, Fries R. Short communication: Validation of 4 candidate causative trait variants in 2 cattle breeds using targeted sequence imputation. J Dairy Sci 2015; 98:4162-7. [PMID: 25892690 DOI: 10.3168/jds.2015-9402] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 03/13/2015] [Indexed: 12/15/2022]
Abstract
Most association studies for pinpointing trait-associated variants are performed within breed. The availability of sequence data from key ancestors of several cattle breeds now enables immediate assessment of the frequency of trait-associated variants in populations different from the mapping population and their imputation into large validation populations. The objective of this study was to validate the effects of 4 putatively causative variants on milk production traits, male fertility, and stature in German Fleckvieh and Holstein-Friesian animals using targeted sequence imputation. We used whole-genome sequence data of 456 animals to impute 4 missense mutations in DGAT1, GHR, PRLR, and PROP1 into 10,363 Fleckvieh and 8,812 Holstein animals. The accuracy of the imputed genotypes exceeded 95% for all variants. Association testing with imputed variants revealed consistent antagonistic effects of the DGAT1 p.A232K and GHR p.F279Y variants on milk yield and protein and fat contents, respectively, in both breeds. The allele frequency of both polymorphisms has changed considerably in the past 20 yr, indicating that they were targets of recent selection for milk production traits. The PRLR p.S18N variant was associated with yield traits in Fleckvieh but not in Holstein, suggesting that it may be in linkage disequilibrium with a mutation affecting yield traits rather than being causal. The reported effects of the PROP1 p.H173R variant on milk production, male fertility, and stature could not be confirmed. Our results demonstrate that population-wide imputation of candidate causal variants from sequence data is feasible, enabling their rapid validation in large independent populations.
Collapse
Affiliation(s)
- Hubert Pausch
- Chair of Animal Breeding, Technische Universität München, 85354 Freising, Germany.
| | - Christine Wurmser
- Chair of Animal Breeding, Technische Universität München, 85354 Freising, Germany
| | - Friedrich Reinhardt
- Vereinigte Informationssysteme Tierhaltung w.V. (VIT), 27283 Verden, Germany
| | - Reiner Emmerling
- Institute of Animal Breeding, Bavarian State Research Centre for Agriculture, 85586 Poing, Germany
| | - Ruedi Fries
- Chair of Animal Breeding, Technische Universität München, 85354 Freising, Germany
| |
Collapse
|
42
|
Andersson L, Archibald AL, Bottema CD, Brauning R, Burgess SC, Burt DW, Casas E, Cheng HH, Clarke L, Couldrey C, Dalrymple BP, Elsik CG, Foissac S, Giuffra E, Groenen MA, Hayes BJ, Huang LS, Khatib H, Kijas JW, Kim H, Lunney JK, McCarthy FM, McEwan JC, Moore S, Nanduri B, Notredame C, Palti Y, Plastow GS, Reecy JM, Rohrer GA, Sarropoulou E, Schmidt CJ, Silverstein J, Tellam RL, Tixier-Boichard M, Tosser-Klopp G, Tuggle CK, Vilkki J, White SN, Zhao S, Zhou H. Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project. Genome Biol 2015; 16:57. [PMID: 25854118 PMCID: PMC4373242 DOI: 10.1186/s13059-015-0622-4] [Citation(s) in RCA: 212] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
We describe the organization of a nascent international effort, the Functional Annotation of Animal Genomes (FAANG) project, whose aim is to produce comprehensive maps of functional elements in the genomes of domesticated animal species.
Collapse
|