1
|
Kamprasert N, Aliloo H, van der Werf JHJ, Clark SA. Short communication: Accuracy of whole-genome sequence imputation in Angus cattle using within-breed and multi breed reference populations. Animal 2024; 18:101087. [PMID: 38364656 DOI: 10.1016/j.animal.2024.101087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 01/16/2024] [Accepted: 01/19/2024] [Indexed: 02/18/2024] Open
Abstract
Genotype imputation is a standard approach used in the field of genetics. It can be used to fill in missing genotypes or to increase genotype density. Accurate imputed genotypes are required for downstream analyses. In this study, the accuracy of whole-genome sequence imputation for Angus beef cattle was examined using two different ways to form the reference panel, a within-breed reference population and a multi breed reference population. A stepwise imputation was conducted by imputing medium-density (50k) genotypes to high-density, and then to the whole genome sequence (WGS). The reference population consisted of animals with WGS information from the 1 000 Bull Genomes project. The within-breed reference panel comprised 396 Angus cattle, while an additional 2 380 Taurine cattle were added to the reference population for the multi breed reference scenario. Imputation accuracies were variant-wise average accuracies from a 10-fold cross-validation and expressed as concordance rates (CR) and Pearson's correlations (PR). The two imputation scenarios achieved moderate to high imputation accuracies ranging from 0.896 to 0.966 for CR and from 0.779 to 0.834 for PR. The accuracies from two different scenarios were similar, except for PR from WGS imputation, where the within-breed scenario outperformed the multi breed scenario. The result indicated that including a large number of animals from other breeds in the reference panel to impute purebred Angus did not improve the accuracy and may negatively impact the results. In conclusion, the imputed WGS in Angus cattle can be obtained with high accuracy using a within-breed reference panel.
Collapse
Affiliation(s)
- N Kamprasert
- School of Environmental and Rural Science, University of New England, 2351, Armidale, NSW, Australia.
| | - H Aliloo
- School of Environmental and Rural Science, University of New England, 2351, Armidale, NSW, Australia
| | - J H J van der Werf
- School of Environmental and Rural Science, University of New England, 2351, Armidale, NSW, Australia
| | - S A Clark
- School of Environmental and Rural Science, University of New England, 2351, Armidale, NSW, Australia
| |
Collapse
|
2
|
Nawaz MY, Bernardes PA, Savegnago RP, Lim D, Lee SH, Gondro C. Evaluation of Whole-Genome Sequence Imputation Strategies in Korean Hanwoo Cattle. Animals (Basel) 2022; 12:ani12172265. [PMID: 36077985 PMCID: PMC9454883 DOI: 10.3390/ani12172265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 08/25/2022] [Accepted: 08/30/2022] [Indexed: 11/29/2022] Open
Abstract
Simple Summary In this study, we evaluated various imputation strategies for the Korean Hanwoo cattle. We observed that a large reference panel consisting of many cattle breeds did not improve the imputation accuracy when compared to a proportionally small purebred Hanwoo reference. This was because the multi-breed reference did not contain animals sufficiently related to the Hanwoo to improve the accuracies and, although not detrimental, in effect, only added to the computational burden of the imputation. Despite the large multi-breed reference, when the Hanwoo were removed from the reference, the imputation accuracies were low. These results suggest additional sequencing efforts are needed for underrepresented breeds, particularly those less genetically related to the main European breeds. Abstract This study evaluated the accuracy of sequence imputation in Hanwoo beef cattle using different reference panels: a large multi-breed reference with no Hanwoo (n = 6269), a much smaller Hanwoo purebred reference (n = 88), and both datasets combined (n = 6357). The target animals were 136 cattle both sequenced and genotyped with the Illumina BovineSNP50 v2 (50K). The average imputation accuracy measured by the Pearson correlation (R) was 0.695 with the multi-breed reference, 0.876 with the purebred Hanwoo, and 0.887 with the combined data; the average concordance rates (CR) were 88.16%, 94.49%, and 94.84%, respectively. The accuracy gains from adding a large multi-breed reference of 6269 samples to only 88 Hanwoo was marginal; however, the concordance rate for the heterozygotes decreased from 85% to 82%, and the concordance rate for fixed SNPs in Hanwoo also decreased from 99.98% to 98.73%. Although the multi-breed panel was large, it was not sufficiently representative of the breed for accurate imputation without the Hanwoo animals. Additionally, we evaluated the value of high-density 700K genotypes (n = 991) as an intermediary step in the imputation process. The imputation accuracy differences were negligible between a single-step imputation strategy from 50K directly to sequence and a two-step imputation approach (50K-700K-sequence). We also observed that imputed sequence data can be used as a reference panel for imputation (mean R = 0.9650, mean CR = 98.35%). Finally, we identified 31 poorly imputed genomic regions in the Hanwoo genome and demonstrated that imputation accuracies were particularly lower at the chromosomal ends.
Collapse
Affiliation(s)
- Muhammad Yasir Nawaz
- Genetics and Genome Sciences Graduate Program, Michigan State University, East Lansing, MI 48824, USA
- Correspondence: (M.Y.N.); (C.G.)
| | - Priscila Arrigucci Bernardes
- Department of Animal Science and Rural Development, Federal University of Santa Catarina, Florianopolis 88034-000, SC, Brazil
| | | | - Dajeong Lim
- Animal Genome & Bioinformatics Division, National Institute of Animal Science, RDA, Wanju 55365, Korea
| | - Seung Hwan Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 305764, Korea
| | - Cedric Gondro
- Department of Animal Science, Michigan State University, East Lansing, MI 48824, USA
- Correspondence: (M.Y.N.); (C.G.)
| |
Collapse
|
3
|
Jiang Y, Song H, Gao H, Zhang Q, Ding X. Exploring the optimal strategy of imputation from SNP array to whole-genome sequencing data in farm animals. Front Genet 2022; 13:963654. [PMID: 36092888 PMCID: PMC9459117 DOI: 10.3389/fgene.2022.963654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 08/01/2022] [Indexed: 11/13/2022] Open
Abstract
Genotype imputation from BeadChip to whole-genome sequencing (WGS) data is a cost-effective method of obtaining genotypes of WGS variants. Beagle, one of the most popular imputation software programs, has been widely used for genotype inference in humans and non-human species. A few studies have systematically and comprehensively compared the performance of beagle versions and parameter settings of farm animals. Here, we investigated the imputation performance of three representative versions of Beagle (Beagle 4.1, Beagle 5.0, and Beagle 5.4), and the effective population size (Ne) parameter setting for three species (cattle, pig, and chicken). Six scenarios were investigated to explore the impact of certain key factors on imputation performance. The results showed that the default Ne (1,000,000) is not suitable for livestock and poultry in small reference or low-density arrays of target panels, with 2.47%–10.45% drops in accuracy. Beagle 5 significantly reduced the computation time (4.66-fold–13.24-fold) without an accuracy loss. In addition, using a large combined-reference panel or high-density chip provides greater imputation accuracy, especially for low minor allele frequency (MAF) variants. Finally, a highly significant correlation in the measures of imputation accuracy can be obtained with an MAF equal to or greater than 0.05.
Collapse
Affiliation(s)
- Yifan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hailiang Song
- Beijing Key Laboratory of Fisheries Biotechnology, Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Hongding Gao
- Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
- *Correspondence: Xiangdong Ding,
| |
Collapse
|
4
|
Genome-Wide Associative Study of Phenotypic Parameters of the 3D Body Model of Aberdeen Angus Cattle with Multiple Depth Cameras. Animals (Basel) 2022; 12:ani12162128. [PMID: 36009718 PMCID: PMC9405194 DOI: 10.3390/ani12162128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 08/14/2022] [Accepted: 08/17/2022] [Indexed: 12/21/2022] Open
Abstract
Simple Summary This article aims to develop a new approach to the lifetime evaluation of cattle by 3-D visualization of economic-biological and genetic features. The following indicators were selected as phenotypic features: chest width and chest girth retrieved by 3-D model and meat output on the bones. Correlation analysis showed a reliable positive relationship between chest width and meat output on the bones, which can potentially be used for lifetime evaluation of meat productivity of animals. Genome-wide associations analysis revealed the following potential loci of quantitative traits on cattle chromosomes for chest width, chest girth, and meat output on bones. Abstract In beef cattle breeding, genome-wide association studies (GWAS) using single nucleotide polymorphisms (SNPs) arrays can reveal many loci of various production traits, such as growth, productivity, and meat quality. With the development of genome sequencing technologies, new opportunities are opening up for more accurate identification of areas associated with these traits. This article aims to develop a novel approach to the lifetime evaluation of cattle by 3-D visualization of economic-biological and genetic features. The purpose of this study was to identify significant variants underlying differences in the qualitative characteristics of meat, using imputed data on the sequence of the entire genome. Samples of biomaterial of young Aberdeen-Angus breed cattle (n = 96) were the material for carrying out genome-wide SNP genotyping. Genotyping was performed using a high-density DNA chip Bovine GPU HD BeadChip (Illumina Inc., San Diego, CA, USA), containing ~150 thousand SNPs. The following indicators were selected as phenotypic features: chest width and chest girth retrieved by 3-D model and meat output on the bones. Correlation analysis showed a reliable positive relationship between chest width and meat output on the bones, which can potentially be used for lifetime evaluation of meat productivity of animals.
Collapse
|
5
|
Pedrosa VB, Schenkel FS, Chen SY, Oliveira HR, Casey TM, Melka MG, Brito LF. Genomewide Association Analyses of Lactation Persistency and Milk Production Traits in Holstein Cattle Based on Imputed Whole-Genome Sequence Data. Genes (Basel) 2021; 12:genes12111830. [PMID: 34828436 PMCID: PMC8624223 DOI: 10.3390/genes12111830] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 11/13/2021] [Accepted: 11/17/2021] [Indexed: 12/22/2022] Open
Abstract
Lactation persistency and milk production are among the most economically important traits in the dairy industry. In this study, we explored the association of over 6.1 million imputed whole-genome sequence variants with lactation persistency (LP), milk yield (MILK), fat yield (FAT), fat percentage (FAT%), protein yield (PROT), and protein percentage (PROT%) in North American Holstein cattle. We identified 49, 3991, 2607, 4459, 805, and 5519 SNPs significantly associated with LP, MILK, FAT, FAT%, PROT, and PROT%, respectively. Various known associations were confirmed while several novel candidate genes were also revealed, including ARHGAP35, NPAS1, TMEM160, ZC3H4, SAE1, ZMIZ1, PPIF, LDB2, ABI3, SERPINB6, and SERPINB9 for LP; NIM1K, ZNF131, GABRG1, GABRA2, DCHS1, and SPIDR for MILK; NR6A1, OLFML2A, EXT2, POLD1, GOT1, and ETV6 for FAT; DPP6, LRRC26, and the KCN gene family for FAT%; CDC14A, RTCA, HSTN, and ODAM for PROT; and HERC3, HERC5, LALBA, CCL28, and NEURL1 for PROT%. Most of these genes are involved in relevant gene ontology (GO) terms such as fatty acid homeostasis, transporter regulator activity, response to progesterone and estradiol, response to steroid hormones, and lactation. The significant genomic regions found contribute to a better understanding of the molecular mechanisms related to LP and milk production in North American Holstein cattle.
Collapse
Affiliation(s)
- Victor B. Pedrosa
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907, USA; (V.B.P.); (S.-Y.C.); (H.R.O.); (T.M.C.)
- Department of Animal Sciences, State University of Ponta Grossa, Ponta Grossa 84030-900, Brazil
| | - Flavio S. Schenkel
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G2W1, Canada;
| | - Shi-Yi Chen
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907, USA; (V.B.P.); (S.-Y.C.); (H.R.O.); (T.M.C.)
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, College of Animal Science & Technology, Sichuan Agricultural University, Chengdu 611130, China
| | - Hinayah R. Oliveira
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907, USA; (V.B.P.); (S.-Y.C.); (H.R.O.); (T.M.C.)
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph, ON N1G2W1, Canada;
| | - Theresa M. Casey
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907, USA; (V.B.P.); (S.-Y.C.); (H.R.O.); (T.M.C.)
| | - Melkaye G. Melka
- Department of Animal and Food Science, University of Wisconsin River Falls, River Falls, WI 54022, USA;
| | - Luiz F. Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907, USA; (V.B.P.); (S.-Y.C.); (H.R.O.); (T.M.C.)
- Correspondence:
| |
Collapse
|
6
|
Fernandes Júnior GA, Carvalheiro R, de Oliveira HN, Sargolzaei M, Costilla R, Ventura RV, Fonseca LFS, Neves HHR, Hayes BJ, de Albuquerque LG. Imputation accuracy to whole-genome sequence in Nellore cattle. Genet Sel Evol 2021; 53:27. [PMID: 33711929 PMCID: PMC7953568 DOI: 10.1186/s12711-021-00622-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 03/05/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A cost-effective strategy to explore the complete DNA sequence in animals for genetic evaluation purposes is to sequence key ancestors of a population, followed by imputation mechanisms to infer marker genotypes that were not originally reported in a target population of animals genotyped with single nucleotide polymorphism (SNP) panels. The feasibility of this process relies on the accuracy of the genotype imputation in that population, particularly for potential causal mutations which may be at low frequency and either within genes or regulatory regions. The objective of the present study was to investigate the imputation accuracy to the sequence level in a Nellore beef cattle population, including that for variants in annotation classes which are more likely to be functional. METHODS Information of 151 key sequenced Nellore sires were used to assess the imputation accuracy from bovine HD BeadChip SNP (~ 777 k) to whole-genome sequence. The choice of the sires aimed at optimizing the imputation accuracy of a genotypic database, comprised of about 10,000 genotyped Nellore animals. Genotype imputation was performed using two computational approaches: FImpute3 and Minimac4 (after using Eagle for phasing). The accuracy of the imputation was evaluated using a fivefold cross-validation scheme and measured by the squared correlation between observed and imputed genotypes, calculated by individual and by SNP. SNPs were classified into a range of annotations, and the accuracy of imputation within each annotation classification was also evaluated. RESULTS High average imputation accuracies per animal were achieved using both FImpute3 (0.94) and Minimac4 (0.95). On average, common variants (minor allele frequency (MAF) > 0.03) were more accurately imputed by Minimac4 and low-frequency variants (MAF ≤ 0.03) were more accurately imputed by FImpute3. The inherent Minimac4 Rsq imputation quality statistic appears to be a good indicator of the empirical Minimac4 imputation accuracy. Both software provided high average SNP-wise imputation accuracy for all classes of biological annotations. CONCLUSIONS Our results indicate that imputation to whole-genome sequence is feasible in Nellore beef cattle since high imputation accuracies per individual are expected. SNP-wise imputation accuracy is software-dependent, especially for rare variants. The accuracy of imputation appears to be relatively independent of annotation classification.
Collapse
Affiliation(s)
| | - Roberto Carvalheiro
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Henrique N de Oliveira
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Mehdi Sargolzaei
- Ontario Veterinary College, UG, Guelph, Canada.,Select Sires Inc., Plain City, OH, USA
| | - Roy Costilla
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Ricardo V Ventura
- School of Veterinary Medicine and Animal Science, USP, Pirassununga, SP, 13635-900, Brazil
| | - Larissa F S Fonseca
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil
| | | | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Lucia G de Albuquerque
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil. .,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil.
| |
Collapse
|
7
|
Gebrehiwot NZ, Strucken EM, Aliloo H, Marshall K, Gibson JP. The patterns of admixture, divergence, and ancestry of African cattle populations determined from genome-wide SNP data. BMC Genomics 2020; 21:869. [PMID: 33287702 PMCID: PMC7720612 DOI: 10.1186/s12864-020-07270-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 11/23/2020] [Indexed: 12/22/2022] Open
Abstract
Background Humpless Bos taurus cattle are one of the earliest domestic cattle in Africa, followed by the arrival of humped Bos indicus cattle. The diverse indigenous cattle breeds of Africa are derived from these migrations, with most appearing to be hybrids between Bos taurus and Bos indicus. The present study examines the patterns of admixture, diversity, and relationships among African cattle breeds. Methods Data for ~ 40 k SNPs was obtained from previous projects for 4089 animals representing 35 African indigenous, 6 European Bos taurus, 4 Bos indicus, and 5 African crossbred cattle populations. Genetic diversity and population structure were assessed using principal component analyses (PCA), admixture analyses, and Wright’s F statistic. The linkage disequilibrium and effective population size (Ne) were estimated for the pure cattle populations. Results The first two principal components differentiated Bos indicus from European Bos taurus, and African Bos taurus from other breeds. PCA and admixture analyses showed that, except for recently admixed cattle, all indigenous breeds are either pure African Bos taurus or admixtures of African Bos taurus and Bos indicus. The African zebu breeds had highest proportions of Bos indicus ancestry ranging from 70 to 90% or 60 to 75%, depending on the admixture model. Other indigenous breeds that were not 100% African Bos taurus, ranged from 42 to 70% or 23 to 61% Bos indicus ancestry. The African Bos taurus populations showed substantial genetic diversity, and other indigenous breeds show evidence of having more than one African taurine ancestor. Ne estimates based on r2 and r2adj showed a decline in Ne from a large population at 2000 generations ago, which is surprising for the indigenous breeds given the expected increase in cattle populations over that period and the lack of structured breeding programs. Conclusion African indigenous cattle breeds have a large genetic diversity and are either pure African Bos taurus or admixtures of African Bos taurus and Bos indicus. This provides a rich resource of potentially valuable genetic variation, particularly for adaptation traits, and to support conservation programs. It also provides challenges for the development of genomic assays and tools for use in African populations. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-020-07270-x.
Collapse
Affiliation(s)
- N Z Gebrehiwot
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| | - E M Strucken
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - H Aliloo
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - K Marshall
- International Livestock Research Institute and Centre for Tropical Livestock Genetics and Health, Nairobi, Kenya
| | - J P Gibson
- Centre for Genetic Analysis and Applications, School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| |
Collapse
|
8
|
Talouarn E, Bardou P, Palhière I, Oget C, Clément V, Tosser-Klopp G, Rupp R, Robert-Granié C. Genome wide association analysis on semen volume and milk yield using different strategies of imputation to whole genome sequence in French dairy goats. BMC Genet 2020; 21:19. [PMID: 32085723 PMCID: PMC7035711 DOI: 10.1186/s12863-020-0826-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 02/13/2020] [Indexed: 01/17/2023] Open
Abstract
Background Goats were domesticated 10,500 years ago to supply humans with useful resources. Since then, specialized breeds that are adapted to their local environment have been developed and display specific genetic profiles. The VarGoats project is a 1000 genomes resequencing program designed to cover the genetic diversity of the Capra genus. In this study, our main objective was to assess the use of sequence data to detect genomic regions associated with traits of interest in French Alpine and Saanen breeds. Results Direct imputation from the GoatSNP50 BeadChip genotypes to sequence level was investigated in these breeds using FImpute and different reference panels: within-breed, all Capra hircus sequenced individuals, European goats and French mainland goats. The best results were obtained with the French goat panel with allele and genotype concordance rates reaching 0.86 and 0.75 in the Alpine and 0.86 and 0.73 in the Saanen breed respectively. Mean correlations tended to be low in both breeds due to the high proportion of variants with low frequencies. For association analysis, imputation was performed using FImpute for 1129 French Alpine and Saanen males using within-breed and French panels on 23,338,436 filtered variants. The association results of both imputation scenarios were then compared. In Saanen goats, a large region on chromosome 19 was significantly linked to semen volume and milk yield in both scenarios. Significant variants for milk yield were annotated for 91 genes on chromosome 19 in Saanen goats. For semen volume, the annotated genes include YBOX2 which is related to azoospermia or oligospermia in other species. New signals for milk yield were detected on chromosome 2 in Alpine goats and on chromosome 5 in Saanen goats when using a multi-breed panel. Conclusion Even with very small reference populations, an acceptable imputation quality can be achieved in French dairy goats. GWAS on imputed sequences confirmed the existence of QTLs and identified new regions of interest in dairy goats. Adding identified candidates to a genotyping array and sequencing more individuals might corroborate the involvement of identified regions while removing potential imputation errors.
Collapse
Affiliation(s)
- Estelle Talouarn
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.
| | - Philippe Bardou
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.,Sigenae, INRAE, 31326, Castanet-Tolosan, France
| | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Claire Oget
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | | | | - Gwenola Tosser-Klopp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | |
Collapse
|
9
|
Rowan TN, Hoff JL, Crum TE, Taylor JF, Schnabel RD, Decker JE. A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle. Genet Sel Evol 2019; 51:77. [PMID: 31878893 PMCID: PMC6933688 DOI: 10.1186/s12711-019-0519-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 12/16/2019] [Indexed: 01/08/2023] Open
Abstract
Background During the last decade, the use of common-variant array-based single nucleotide polymorphism (SNP) genotyping in the beef and dairy industries has produced an astounding amount of medium-to-low density genomic data. Although low-density assays work well in the context of genomic prediction, they are less useful for detecting and mapping causal variants and the effects of rare variants are not captured. The objective of this project was to maximize the accuracies of genotype imputation from medium- and low-density assays to the marker set obtained by combining two high-density research assays (~ 850,000 SNPs), the Illumina BovineHD and the GGP-F250 assays, which contains a large proportion of rare and potentially functional variants and for which the assay design is described here. This 850 K SNP set is useful for both imputation to sequence-level genotypes and direct downstream analysis. Results We found that a large multi-breed composite imputation reference panel that includes 36,131 samples with either BovineHD and/or GGP-F250 genotypes significantly increased imputation accuracy compared with a within-breed reference panel, particularly at variants with low minor allele frequencies. Individual animal imputation accuracies were maximized when more genetically similar animals were represented in the composite reference panel, particularly with complete 850 K genotypes. The addition of rare variants from the GGP-F250 assay to our composite reference panel significantly increased the imputation accuracy of rare variants that are exclusively present on the BovineHD assay. In addition, we show that an assay marker density of 50 K SNPs balances cost and accuracy for imputation to 850 K. Conclusions Using high-density genotypes on all available individuals in a multi-breed reference panel maximized imputation accuracy for tested cattle populations. Admixed animals or those from breeds with a limited representation in the composite reference panel were still imputed at high accuracy, which is expected to further increase as the reference panel expands. We anticipate that the addition of rare variants from the GGP-F250 assay will increase the accuracy of imputation to sequence level.
Collapse
Affiliation(s)
- Troy N Rowan
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jesse L Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Tamar E Crum
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA. .,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| | - Jared E Decker
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA. .,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
10
|
Hayes BJ, Daetwyler HD. 1000 Bull Genomes Project to Map Simple and Complex Genetic Traits in Cattle: Applications and Outcomes. Annu Rev Anim Biosci 2019; 7:89-102. [PMID: 30508490 DOI: 10.1146/annurev-animal-020518-115024] [Citation(s) in RCA: 187] [Impact Index Per Article: 37.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The 1000 Bull Genomes Project is a collection of whole-genome sequences from 2,703 individuals capturing a significant proportion of the world's cattle diversity. So far, 84 million single-nucleotide polymorphisms (SNPs) and 2.5 million small insertion deletions have been identified in the collection, a very high level of genetic diversity. The project has greatly accelerated the identification of deleterious mutations for a range of genetic diseases, as well as for embryonic lethals. The rate of identification of causal mutations for complex traits has been slower, reflecting the typically small effect size of these mutations and the fact that many are likely in as-yet-unannotated regulatory regions. Both the deleterious mutations that have been identified and the mutations associated with complex trait variation have been included in low-cost SNP array designs, and these arrays are being genotyped in tens of thousands of dairy and beef cattle, enabling management of deleterious mutations in these populations as well as genomic selection.
Collapse
Affiliation(s)
- Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland 4067, Australia; .,Agriculture Victoria Research, AgriBio, Bundoora, Victoria 3083, Australia
| | - Hans D Daetwyler
- Agriculture Victoria Research, AgriBio, Bundoora, Victoria 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria 3083, Australia
| |
Collapse
|
11
|
Bedhane M, van der Werf J, Gondro C, Duijvesteijn N, Lim D, Park B, Park MN, Hee RS, Clark S. Genome-Wide Association Study of Meat Quality Traits in Hanwoo Beef Cattle Using Imputed Whole-Genome Sequence Data. Front Genet 2019; 10:1235. [PMID: 31850078 PMCID: PMC6895209 DOI: 10.3389/fgene.2019.01235] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Accepted: 11/06/2019] [Indexed: 01/28/2023] Open
Abstract
The discovery of single nucleotide polymorphisms (SNP) and the subsequent genotyping of large numbers of animals have enabled large-scale analyses to begin to understand the biological processes that underpin variation in animal populations. In beef cattle, genome-wide association studies using genotype arrays have revealed many quantitative trait loci (QTL) for various production traits such as growth, efficiency and meat quality. Most studies regarding meat quality have focused on marbling, which is a key trait associated with meat eating quality. However, other important traits like meat color, texture and fat color have not commonly been studied. Developments in genome sequencing technologies provide new opportunities to identify regions associated with these traits more precisely. The objective of this study was to estimate variance components and identify significant variants underpinning variation in meat quality traits using imputed whole genome sequence data. Phenotypic and genomic data from 2,110 Hanwoo cattle were used. The estimated heritabilities for the studied traits were 0.01, 0.16, 0.31, and 0.49 for fat color, meat color, meat texture and marbling score, respectively. Marbling score and meat texture were highly correlated. The genome-wide association study revealed 107 significant SNPs located on 14 selected chromosomes (one QTL region per selected chromosome). Four QTL regions were identified on BTA2, 12, 16, and 24 for marbling score and two QTL regions were found for meat texture trait on BTA12 and 29. Similarly, three QTL regions were identified for meat color on BTA2, 14 and 24 and five QTL regions for fat color on BTA7, 10, 12, 16, and 21. Candidate genes were identified for all traits, and their potential influence on the given trait was discussed. The significant SNP will be an important inclusion into commercial genotyping arrays to select new breeding animals more accurately.
Collapse
Affiliation(s)
- Mohammed Bedhane
- School of Environmental and Rural Science, University of New England, Armidale, Australia
| | - Julius van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, Australia
| | - Cedric Gondro
- College of Agriculture & Natural Resources, Michigan State University, East Lansing, MI, United States
| | - Naomi Duijvesteijn
- School of Environmental and Rural Science, University of New England, Armidale, Australia
| | - Dajeong Lim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Rural Development Administration, Wanju, South Korea
| | - Byoungho Park
- Animal Genetic Improvement Division, National Institute of Animal Science, Rural Development Administration, Seonghwan, South Korea
| | - Mi Na Park
- Animal Genetic Improvement Division, National Institute of Animal Science, Rural Development Administration, Seonghwan, South Korea
| | - Roh Seung Hee
- Animal Genetic Improvement Division, National Institute of Animal Science, Rural Development Administration, Seonghwan, South Korea
| | - Samuel Clark
- School of Environmental and Rural Science, University of New England, Armidale, Australia
| |
Collapse
|
12
|
Fang ZH, Pausch H. Multi-trait meta-analyses reveal 25 quantitative trait loci for economically important traits in Brown Swiss cattle. BMC Genomics 2019; 20:695. [PMID: 31481029 PMCID: PMC6724290 DOI: 10.1186/s12864-019-6066-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 08/27/2019] [Indexed: 01/02/2023] Open
Abstract
Background Little is known about the genetic architecture of economically important traits in Brown Swiss cattle because only few genome-wide association studies (GWAS) have been carried out in this breed. Moreover, most GWAS have been performed for single traits, thus not providing detailed insights into potentially existing pleiotropic effects of trait-associated loci. Results To compile a comprehensive catalogue of large-effect quantitative trait loci (QTL) segregating in Brown Swiss cattle, we carried out association tests between partially imputed genotypes at 598,016 SNPs and daughter-derived phenotypes for more than 50 economically important traits, including milk production, growth and carcass quality, body conformation, reproduction and calving traits in 4578 artificial insemination bulls from two cohorts of Brown Swiss cattle (Austrian-German and Swiss populations). Across-cohort multi-trait meta-analyses of the results from the single-trait GWAS revealed 25 quantitative trait loci (QTL; P < 8.36 × 10− 8) for economically relevant traits on 17 Bos taurus autosomes (BTA). Evidence of pleiotropy was detected at five QTL located on BTA5, 6, 17, 21 and 25. Of these, two QTL at BTA6:90,486,780 and BTA25:1,455,150 affect a diverse range of economically important traits, including traits related to body conformation, calving, longevity and milking speed. Furthermore, the QTL at BTA6:90,486,780 seems to be a target of ongoing selection as evidenced by an integrated haplotype score of 2.49 and significant changes in allele frequency over the past 25 years, whereas either no or only weak evidence of selection was detected at all other QTL. Conclusions Our findings provide a comprehensive overview of QTL segregating in Brown Swiss cattle. Detected QTL explain between 2 and 10% of the variation in the estimated breeding values and thus may be considered as the most important QTL segregating in the Brown Swiss cattle breed. Multi-trait association testing boosts the power to detect pleiotropic QTL and assesses the full spectrum of phenotypes that are affected by trait-associated variants. Electronic supplementary material The online version of this article (10.1186/s12864-019-6066-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zih-Hua Fang
- Animal Genomics, Institute of Agricultural Science, ETH Zürich, 8092, Zürich, Switzerland.
| | - Hubert Pausch
- Animal Genomics, Institute of Agricultural Science, ETH Zürich, 8092, Zürich, Switzerland
| |
Collapse
|
13
|
Zwane AA, Schnabel RD, Hoff J, Choudhury A, Makgahlela ML, Maiwashe A, Van Marle-Koster E, Taylor JF. Genome-Wide SNP Discovery in Indigenous Cattle Breeds of South Africa. Front Genet 2019; 10:273. [PMID: 30988672 PMCID: PMC6452414 DOI: 10.3389/fgene.2019.00273] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 03/12/2019] [Indexed: 01/30/2023] Open
Abstract
Single nucleotide polymorphism arrays have created new possibilities for performing genome-wide studies to detect genomic regions harboring sequence variants that affect complex traits. However, the majority of validated SNPs for which allele frequencies have been estimated are limited primarily to European breeds. The objective of this study was to perform SNP discovery in three South African indigenous breeds (Afrikaner, Drakensberger, and Nguni) using whole genome sequencing. DNA was extracted from blood and hair samples, quantified and prepared at 50 ng/μl concentration for sequencing at the Agricultural Research Council Biotechnology Platform using an Illumina HiSeq 2500. The fastq files were used to call the variants using the Genome Analysis Tool Kit. A total of 1,678,360 were identified as novel using Run 6 of 1000 Bull Genomes Project. Annotation of the identified variants classified them into functional categories. Within the coding regions, about 30% of the SNPs were non-synonymous substitutions that encode for alternate amino acids. The study of distribution of SNP across the genome identified regions showing notable differences in the densities of SNPs among the breeds and highlighted many regions of functional significance. Gene ontology terms identified genes such as MLANA, SYT10, and CDC42EP5 that have been associated with coat color in mouse, and ADAMS3, DNAJC3, and PAG5 genes have been associated with fertility in cattle. Further analysis of the variants detected 688 candidate selective sweeps (ZHp Z-scores ≤ -4) across all three breeds, of which 223 regions were assigned as being putative selective sweeps (ZHp scores ≤-5). We also identified 96 regions with extremely low ZHp Z-scores (≤-6) in Afrikaner and Nguni. Genes such as KIT and MITF that have been associated with skin pigmentation in cattle and CACNA1C, which has been associated with biopolar disorder in human, were identified in these regions. This study provides the first analysis of sequence data to discover SNPs in indigenous South African cattle breeds. The information will play an important role in our efforts to understand the genetic history of our cattle and in designing appropriate breed improvement programmes.
Collapse
Affiliation(s)
- Avhashoni A. Zwane
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal and Wildlife Sciences, University of Pretoria, Pretoria, South Africa
| | - Robert D. Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
- Informatics Institute, University of Missouri, Columbia, MO, United States
| | - Jesse Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
| | - Ananyo Choudhury
- Sydney Brenner Institute of Molecular Bioscience, University of the Witwatersrand, Johannesburg, South Africa
| | - Mahlako Linah Makgahlela
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - Azwihangwisi Maiwashe
- Department of Animal Breeding and Genetics, Agricultural Research Council-Animal Production, Irene, South Africa
- Department of Animal, Wildlife and Grassland Sciences, University of the Free State, Bloemfontein, South Africa
| | - Este Van Marle-Koster
- Department of Animal and Wildlife Sciences, University of Pretoria, Pretoria, South Africa
| | - Jeremy F. Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States
| |
Collapse
|
14
|
Pausch H, Emmerling R, Gredler-Grandl B, Fries R, Daetwyler HD, Goddard ME. Meta-analysis of sequence-based association studies across three cattle breeds reveals 25 QTL for fat and protein percentages in milk at nucleotide resolution. BMC Genomics 2017; 18:853. [PMID: 29121857 PMCID: PMC5680815 DOI: 10.1186/s12864-017-4263-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 11/02/2017] [Indexed: 11/25/2022] Open
Abstract
Background Genotyping and whole-genome sequencing data have been generated for hundreds of thousands of cattle. International consortia used these data to compile imputation reference panels that facilitate the imputation of sequence variant genotypes for animals that have been genotyped using dense microarrays. Association studies with imputed sequence variant genotypes allow for the characterization of quantitative trait loci (QTL) at nucleotide resolution particularly when individuals from several breeds are included in the mapping populations. Results We imputed genotypes for 28 million sequence variants in 17,229 cattle of the Braunvieh, Fleckvieh and Holstein breeds in order to compile large mapping populations that provide high power to identify QTL for milk production traits. Association tests between imputed sequence variant genotypes and fat and protein percentages in milk uncovered between six and thirteen QTL (P < 1e-8) per breed. Eight of the detected QTL were significant in more than one breed. We combined the results across breeds using meta-analysis and identified a total of 25 QTL including six that were not significant in the within-breed association studies. Two missense mutations in the ABCG2 (p.Y581S, rs43702337, P = 4.3e-34) and GHR (p.F279Y, rs385640152, P = 1.6e-74) genes were the top variants at QTL on chromosomes 6 and 20. Another known causal missense mutation in the DGAT1 gene (p.A232K, rs109326954, P = 8.4e-1436) was the second top variant at a QTL on chromosome 14 but its allelic substitution effects were inconsistent across breeds. It turned out that the conflicting allelic substitution effects resulted from flaws in the imputed genotypes due to the use of a multi-breed reference population for genotype imputation. Conclusions Many QTL for milk production traits segregate across breeds and across-breed meta-analysis has greater power to detect such QTL than within-breed association testing. Association testing between imputed sequence variant genotypes and phenotypes of interest facilitates identifying causal mutations provided the accuracy of imputation is high. However, true causal mutations may remain undetected when the imputed sequence variant genotypes contain flaws. It is highly recommended to validate the effect of known causal variants in order to assess the ability to detect true causal mutations in association studies with imputed sequence variants. Electronic supplementary material The online version of this article (10.1186/s12864-017-4263-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hubert Pausch
- Animal Genomics, Institute of Agricultural Sciences, ETH Zurich, 8092, Zurich, Switzerland. .,Agriculture Research Division, Agriculture Victoria, Department of Economic Development, Jobs, Transport and Resources, AgriBio, VIC, 3083, Australia.
| | - Reiner Emmerling
- Institute of Animal Breeding, Bavarian State Research Center for Agriculture, 85586, Grub, Germany
| | | | - Ruedi Fries
- Animal Breeding, Technische Universitaet Muenchen, 85354, Freising, Germany
| | - Hans D Daetwyler
- Agriculture Research Division, Agriculture Victoria, Department of Economic Development, Jobs, Transport and Resources, AgriBio, VIC, 3083, Australia.,School of Applied Systems Biology, LaTrobe University, Bundoora, VIC, 3083, Australia
| | - Michael E Goddard
- Agriculture Research Division, Agriculture Victoria, Department of Economic Development, Jobs, Transport and Resources, AgriBio, VIC, 3083, Australia.,Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, 3010, Australia
| |
Collapse
|