1
|
de Oliveira LF, Brito LF, Marques DBD, da Silva DA, Lopes PS, Dos Santos CG, Johnson JS, Veroneze R. Investigating the impact of non-additive genetic effects in the estimation of variance components and genomic predictions for heat tolerance and performance traits in crossbred and purebred pig populations. BMC Genom Data 2023; 24:76. [PMID: 38093199 PMCID: PMC10717470 DOI: 10.1186/s12863-023-01174-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/13/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Non-additive genetic effects are often ignored in livestock genetic evaluations. However, fitting them in the models could improve the accuracy of genomic breeding values. Furthermore, non-additive genetic effects contribute to heterosis, which could be optimized through mating designs. Traits related to fitness and adaptation, such as heat tolerance, tend to be more influenced by non-additive genetic effects. In this context, the primary objectives of this study were to estimate variance components and assess the predictive performance of genomic prediction of breeding values based on alternative models and two independent datasets, including performance records from a purebred pig population and heat tolerance indicators recorded in crossbred lactating sows. RESULTS Including non-additive genetic effects when modelling performance traits in purebred pigs had no effect on the residual variance estimates for most of the traits, but lower additive genetic variances were observed, especially when additive-by-additive epistasis was included in the models. Furthermore, including non-additive genetic effects did not improve the prediction accuracy of genomic breeding values, but there was animal re-ranking across the models. For the heat tolerance indicators recorded in a crossbred population, most traits had small non-additive genetic variance with large standard error estimates. Nevertheless, panting score and hair density presented substantial additive-by-additive epistatic variance. Panting score had an epistatic variance estimate of 0.1379, which accounted for 82.22% of the total genetic variance. For hair density, the epistatic variance estimates ranged from 0.1745 to 0.1845, which represent 64.95-69.59% of the total genetic variance. CONCLUSIONS Including non-additive genetic effects in the models did not improve the accuracy of genomic breeding values for performance traits in purebred pigs, but there was substantial re-ranking of selection candidates depending on the model fitted. Except for panting score and hair density, low non-additive genetic variance estimates were observed for heat tolerance indicators in crossbred pigs.
Collapse
Affiliation(s)
- Letícia Fernanda de Oliveira
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil.
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA.
| | - Luiz F Brito
- Department of Animal Sciences, Purdue University, West Lafayette, IN, USA
| | | | | | - Paulo Sávio Lopes
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| | | | - Jay S Johnson
- USDA-ARS Livestock Behavior Research Unit, West Lafayette, IN, USA
| | - Renata Veroneze
- Department of Animal Science, Federal University of Viçosa, Viçosa, MG, Brazil
| |
Collapse
|
2
|
Wientjes YCJ, Bijma P, van den Heuvel J, Zwaan BJ, Vitezica ZG, Calus MPL. The long-term effects of genomic selection: 2. Changes in allele frequencies of causal loci and new mutations. Genetics 2023; 225:iyad141. [PMID: 37506255 PMCID: PMC10471209 DOI: 10.1093/genetics/iyad141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 05/17/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Genetic selection has been applied for many generations in animal, plant, and experimental populations. Selection changes the allelic architecture of traits to create genetic gain. It remains unknown whether the changes in allelic architecture are different for the recently introduced technique of genomic selection compared to traditional selection methods and whether they depend on the genetic architectures of traits. Here, we investigate the allele frequency changes of old and new causal loci under 50 generations of phenotypic, pedigree, and genomic selection, for a trait controlled by either additive, additive and dominance, or additive, dominance, and epistatic effects. Genomic selection resulted in slightly larger and faster changes in allele frequencies of causal loci than pedigree selection. For each locus, allele frequency change per generation was not only influenced by its statistical additive effect but also to a large extent by the linkage phase with other loci and its allele frequency. Selection fixed a large number of loci, and 5 times more unfavorable alleles became fixed with genomic and pedigree selection than with phenotypic selection. For pedigree selection, this was mainly a result of increased genetic drift, while genetic hitchhiking had a larger effect on genomic selection. When epistasis was present, the average allele frequency change was smaller (∼15% lower), and a lower number of loci became fixed for all selection methods. We conclude that for long-term genetic improvement using genomic selection, it is important to consider hitchhiking and to limit the loss of favorable alleles.
Collapse
Affiliation(s)
- Yvonne C J Wientjes
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Piter Bijma
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Joost van den Heuvel
- Laboratory of Genetics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | - Bas J Zwaan
- Laboratory of Genetics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| | | | - Mario P L Calus
- Animal Breeding and Genomics, Wageningen University & Research, 6700 AH Wageningen, The Netherlands
| |
Collapse
|
3
|
Discovering novel clues of natural selection on four worldwide goat breeds. Sci Rep 2023; 13:2110. [PMID: 36747064 PMCID: PMC9902602 DOI: 10.1038/s41598-023-27490-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 01/03/2023] [Indexed: 02/08/2023] Open
Abstract
In goat breeds, the domestication followed by artificial selection for economically important traits have shaped genetic variation within populations, leading to the fixation of specific alleles for specific traits. This led to the formation and evolution of many different breeds specialised and raised for a particular purpose. However, and despite the intensity of artificial selection, natural selection continues acting, possibly leaving a more diluted contribution over time, whose traces may be more difficult to capture. In order to explore selection footprints as response of environmental adaptation, we analysed a total of 993 goats from four transboundary goats breeds (Angora, Boer, Nubian and Saanen) genotyped with the SNP chip 50 K using outlier detection, runs of homozygosity and haplotype-based detection methods. Our results showed that all methods identified footprints on chromosome 6 (from 30 to 49 Mb) for two specific populations of Nubian goats sampled in Egypt. In Angora and Saanen breeds, we detected two selective sweeps using HapFLK, on chromosome 21 (from 52 to 55 Mb) and chromosome 25 (from 1 to 5 Mb) respectively. The analysis of runs of homozygosity showed some hotspots in all breeds. The overall investigation of the selected regions detected combining the different approaches and the gene ontology exploration revealed both novel and well-known loci related to adaptation, especially for heat stress. Our findings can help to better understand the balance between the two selective pressures in commercial goat breeds providing new insights on the molecular mechanisms of adaptation.
Collapse
|
4
|
Ablondi M, Summer A, Stocco G, Finocchiaro R, van Kaam JT, Cassandro M, Dadousis C, Sabbioni A, Cipolat-Gotet C. The role of inbreeding depression on productive performance in the Italian Holstein breed. J Anim Sci 2023; 101:skad382. [PMID: 37983004 PMCID: PMC10693289 DOI: 10.1093/jas/skad382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 11/18/2023] [Indexed: 11/21/2023] Open
Abstract
Inbreeding depression has become an urgent issue in cosmopolitan breeds where the massive genetic progress achieved in the latest generations is counterbalanced by a dramatic loss of genetic diversity causing increased health issues. Thus, the aim of this study was to estimate inbreeding depression on productive traits in Holstein dairy cattle. More precisely, we aimed to i) determine the level of inbreeding in 27,735 Italian Holstein dairy cows using pedigree and genotype data, ii) quantify the effect of inbreeding on 305-d in milk yield (MY; kg), fat yield (FY; kg), and protein yield (PY; kg) based on different statistical approaches, iii) determine if recent inbreeding has a more harmful impact than ancestral ones, and iv) quantify chromosomal homozygosity effect on productive traits. Quality control was performed on the autosomal chromosomes resulting in a final dataset of 84,443 single nucleotide polymorphisms. Four statistical models were used to evaluate the presence of inbreeding depression, which included linear regression analysis and division of FPED and FROH into percentile classes. Moreover, FROH was partitioned into i) length classes to assess the role of recent and ancestral inbreeding and ii) chromosome-specific contributions (FROH-CHR). Results evidenced that inbreeding negatively impacted the productive performance of Italian Holstein Friesian cows. However, differences between the estimated FPED and FROH coefficients resulted in different estimates of inbreeding depression. For instance, a 1% increase in FPED and FROH was associated with a decrease in MY of about 44 and 61 kg (P < 0.01). Further, when considering the extreme inbreeding percentile classes moving from the 5th lowest to the 95th highest, there was a reduction of -263 kg and -561 kg per lactation for FPED and FROH. Increased inbreeding, estimated by FPED and FROH, had also a negative effect on PY and FY, either fit as a regressor or percentile classes. When evaluating the impact of inbreeding based on runs of homozygosity (ROH) length classes, longer ROH (over 8 Mb) had a negative effect in all traits, indicating that recent inbreeding might be more harmful than the ancestral one. Finally, results within chromosome homozygosity highlighted specific chromosomes with a more deleterious effect on productive traits.
Collapse
Affiliation(s)
- Michela Ablondi
- Department of Veterinary Science, University of Parma, 43126 Parma, Italy
| | - Andrea Summer
- Department of Veterinary Science, University of Parma, 43126 Parma, Italy
| | - Giorgia Stocco
- Department of Veterinary Science, University of Parma, 43126 Parma, Italy
| | - Raffaella Finocchiaro
- Associazione Nazionale Allevatori della Razza Frisona Bruna e Jersey Italiana (ANAFIBJ), 26100 Cremona, Italy
| | - Jan-Thijs van Kaam
- Associazione Nazionale Allevatori della Razza Frisona Bruna e Jersey Italiana (ANAFIBJ), 26100 Cremona, Italy
| | - Martino Cassandro
- Associazione Nazionale Allevatori della Razza Frisona Bruna e Jersey Italiana (ANAFIBJ), 26100 Cremona, Italy
- Department of Agronomy, Food, Natural resources, Animals and Environment, University of Padova, 35020 Legnaro, Italy
| | - Christos Dadousis
- Department of Veterinary Science, University of Parma, 43126 Parma, Italy
| | - Alberto Sabbioni
- Department of Veterinary Science, University of Parma, 43126 Parma, Italy
| | | |
Collapse
|
5
|
El-Ouazizi El-Kahia L, Formoso-Rafferty N, Cervantes I, Gutiérrez JP. Differential sensitivity of climate conditions on birth weight genetic values in mice divergently selected for birth weight residual variance. J Anim Sci 2023; 101:skad350. [PMID: 37850884 PMCID: PMC10630028 DOI: 10.1093/jas/skad350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 10/17/2023] [Indexed: 10/19/2023] Open
Abstract
After 32 generations of a divergent selection experiment for residual variance of birth weight in mice, two divergent lines were thus obtained: the heterogeneous line (H-line) and the homogeneous line (L-line). Throughout the generations, differences were observed between the two lines in traits such as litter size, survival at weaning, and birth weight variability caused by unidentified environmental conditions. The L-line exhibited advantages in terms of higher survival rates, larger litter sizes, and less sensitivity to changes in food intake. The study is an examination of the effects of climate as an environmental factor on the performance of these animals. Climate factors including maximum, minimum, and mean temperature (T), humidity (H), and TH index; at three stages (the fecundation, a week before the parturition and the parturition), were linked to a birth weight dataset consisting of 22,614 records distributed as follows: 8,853 corresponding to the H-line, 12,649 to the L-line, and 1,112 to the initial population. Out of the 27 analyzed climatic variables, the maximum temperature 1 wk before parturition (MXTW) was identified as the most influential when comparing heteroscedastic models with the deviance information criterion. The order of Legendre polynomial to apply in the following random regression model was tested by a cross-validation using homoscedastic models. Finally, MXTW was compared on how it affected the two divergent lines by analyzing predicted breeding values (PBV) obtained from a random regression heteroscedastic model. The mean PBV of the H-line in the first generation showed a range of 0.070 g with a negative slope, which was 35 times higher than the range obtained for the L-line, which varied within 0.002 g. In the last generation of selection, the H-line exhibited greater instability of PBV across temperatures, with a difference of 0.101 g between the maximum and minimum mean PBV, compared to 0.017 g for the L-line. The standard deviations of the slopes in the H-line were more dispersed than in the L-line. Unlike the H-line, the L-line had slopes that were not significantly different from 0 throughout the generations of selection, indicating greater stability in response to MXTW variations. The H-line exhibited a higher sensitivity to changes in MXTW, particularly in birth weight, with the L-line being more stable. The selection for uniformity of birth weight could lead to less sensitive animals under environmental changes.
Collapse
Affiliation(s)
- Laila El-Ouazizi El-Kahia
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, 28040 Madrid, Spain
| | - Nora Formoso-Rafferty
- Departamento de Producción Agraria, E.T.S. Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, 28040 Madrid, Spain
| | - Isabel Cervantes
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, 28040 Madrid, Spain
| | - Juan Pablo Gutiérrez
- Departamento de Producción Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, 28040 Madrid, Spain
| |
Collapse
|
6
|
Heidaritabar M, Huisman A, Krivushin K, Stothard P, Dervishi E, Charagu P, Bink MCAM, Plastow GS. Imputation to whole-genome sequence and its use in genome-wide association studies for pork colour traits in crossbred and purebred pigs. Front Genet 2022; 13:1022681. [PMID: 36303553 PMCID: PMC9593086 DOI: 10.3389/fgene.2022.1022681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 09/22/2022] [Indexed: 11/16/2022] Open
Abstract
Imputed whole-genome sequence (WGS) has been proposed to improve genome-wide association studies (GWAS), since all causative mutations responsible for phenotypic variation are expected to be present in the data. This approach was applied on a large number of purebred (PB) and crossbred (CB) pigs for 18 pork color traits to evaluate the impact of using imputed WGS relative to medium-density marker panels. The traits included Minolta A*, B*, and L* for fat (FCOL), quadriceps femoris muscle (QFCOL), thawed loin muscle (TMCOL), fresh ham gluteus medius (GMCOL), ham iliopsoas muscle (ICOL), and longissimus dorsi muscle on the fresh loin (FMCOL). Sequence variants were imputed from a medium-density marker panel (61K for CBs and 50K for PBs) in all genotyped pigs using BeagleV5.0. We obtained high imputation accuracy (average of 0.97 for PBs and 0.91 for CBs). GWAS were conducted for three datasets: 954 CBs and 891 PBs, and the combined CBs and PBs. For most traits, no significant associations were detected, regardless of panel density or population type. However, quantitative trait loci (QTL) regions were only found for a few traits including TMCOL Minolta A* and GMCOL Minolta B* (CBs), FMCOL Minolta B*, FMCOL Minolta L*, and ICOL Minolta B* (PBs) and FMCOL Minolta A*, FMCOL Minolta B*, GMCOL Minolta B*, and ICOL Minolta B* (Combined dataset). More QTL regions were identified with WGS (n = 58) relative to medium-density marker panels (n = 22). Most of the QTL were linked to previously reported QTLs or candidate genes that have been previously reported to be associated with meat quality, pH and pork color; e.g., VIL1, PRKAG3, TTLL4, and SLC11A1, USP37. CTDSP1 gene on SSC15 has not been previously associated with meat color traits in pigs. The findings suggest any added value of WGS was only for detecting novel QTL regions when the sample size is sufficiently large as with the Combined dataset in this study. The percentage of phenotypic variance explained by the most significant SNPs also increased with WGS compared with medium-density panels. The results provide additional insights into identification of a number of candidate regions and genes for pork color traits in different pig populations.
Collapse
Affiliation(s)
- Marzieh Heidaritabar
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
- *Correspondence: Marzieh Heidaritabar,
| | - Abe Huisman
- Hendrix Genetics Research, Boxmeer, Netherlands
| | - Kirill Krivushin
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Elda Dervishi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | | | | | - Graham S. Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
7
|
Bolormaa S, MacLeod IM, Khansefid M, Marett LC, Wales WJ, Miglior F, Baes CF, Schenkel FS, Connor EE, Manzanilla-Pech CIV, Stothard P, Herman E, Nieuwhof GJ, Goddard ME, Pryce JE. Sharing of either phenotypes or genetic variants can increase the accuracy of genomic prediction of feed efficiency. Genet Sel Evol 2022; 54:60. [PMID: 36068488 PMCID: PMC9450441 DOI: 10.1186/s12711-022-00749-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 08/17/2022] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Sharing individual phenotype and genotype data between countries is complex and fraught with potential errors, while sharing summary statistics of genome-wide association studies (GWAS) is relatively straightforward, and thus would be especially useful for traits that are expensive or difficult-to-measure, such as feed efficiency. Here we examined: (1) the sharing of individual cow data from international partners; and (2) the use of sequence variants selected from GWAS of international cow data to evaluate the accuracy of genomic estimated breeding values (GEBV) for residual feed intake (RFI) in Australian cows. RESULTS GEBV for RFI were estimated using genomic best linear unbiased prediction (GBLUP) with 50k or high-density single nucleotide polymorphisms (SNPs), from a training population of 3797 individuals in univariate to trivariate analyses where the three traits were RFI phenotypes calculated using 584 Australian lactating cows (AUSc), 824 growing heifers (AUSh), and 2526 international lactating cows (OVE). Accuracies of GEBV in AUSc were evaluated by either cohort-by-birth-year or fourfold random cross-validations. GEBV of AUSc were also predicted using only the AUS training population with a weighted genomic relationship matrix constructed with SNPs from the 50k array and sequence variants selected from a meta-GWAS that included only international datasets. The genomic heritabilities estimated using the AUSc, OVE and AUSh datasets were moderate, ranging from 0.20 to 0.36. The genetic correlations (rg) of traits between heifers and cows ranged from 0.30 to 0.95 but were associated with large standard errors. The mean accuracies of GEBV in Australian cows were up to 0.32 and almost doubled when either overseas cows, or both overseas cows and AUS heifers were included in the training population. They also increased when selected sequence variants were combined with 50k SNPs, but with a smaller relative increase. CONCLUSIONS The accuracy of RFI GEBV increased when international data were used or when selected sequence variants were combined with 50k SNP array data. This suggests that if direct sharing of data is not feasible, a meta-analysis of summary GWAS statistics could provide selected SNPs for custom panels to use in genomic selection programs. However, since this finding is based on a small cross-validation study, confirmation through a larger study is recommended.
Collapse
Affiliation(s)
| | - Iona M. MacLeod
- Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
| | - Majid Khansefid
- Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
| | - Leah C. Marett
- Agriculture Victoria Research, Ellinbank Centre, Ellinbank, Gippsland, VIC 3821 Australia
- School of Agriculture and Food, University of Melbourne, Parkville, VIC 3010 Australia
| | - William J. Wales
- Agriculture Victoria Research, Ellinbank Centre, Ellinbank, Gippsland, VIC 3821 Australia
- School of Agriculture and Food, University of Melbourne, Parkville, VIC 3010 Australia
| | - Filippo Miglior
- LACTANET, Sainte-Anne-de-Bellevue, QC H9X 3R4 Canada
- CGIL, University of Guelph, Guelph, ON N1G 2W1 Canada
| | - Christine F. Baes
- CGIL, University of Guelph, Guelph, ON N1G 2W1 Canada
- Institute of Genetics, Vetsuisse Faculty, University of Bern, 3002 Bern, Switzerland
| | | | - Erin E. Connor
- Animal Genomics and Improvement Laboratory, USDA, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD 20705 USA
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716 USA
| | | | - Paul Stothard
- Faculty of Agricultural, Life & Environmental Sciences, University of Alberta, Edmonton, AB T6G 2R3 Canada
| | - Emily Herman
- Faculty of Agricultural, Life & Environmental Sciences, University of Alberta, Edmonton, AB T6G 2R3 Canada
| | - Gert J. Nieuwhof
- Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
- DataGene Ltd, Agribio, Bundoora, VIC 3083 Australia
| | - Michael E. Goddard
- Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
- School of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, VIC 3052 Australia
| | - Jennie E. Pryce
- Agriculture Victoria Research, Agribio, Bundoora, VIC 3083 Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3083 Australia
| |
Collapse
|
8
|
Wang Z, Zhang Z, Chen Z, Sun J, Cao C, Wu F, Xu Z, Zhao W, Sun H, Guo L, Zhang Z, Wang Q, Pan Y. PHARP: a pig haplotype reference panel for genotype imputation. Sci Rep 2022; 12:12645. [PMID: 35879321 PMCID: PMC9314402 DOI: 10.1038/s41598-022-15851-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 06/30/2022] [Indexed: 11/18/2022] Open
Abstract
Pigs not only function as a major meat source worldwide but also are commonly used as an animal model for studying human complex traits. A large haplotype reference panel has been used to facilitate efficient phasing and imputation of relatively sparse genome-wide microarray chips and low-coverage sequencing data. Using the imputed genotypes in the downstream analysis, such as GWASs, TWASs, eQTL mapping and genomic prediction (GS), is beneficial for obtaining novel findings. However, currently, there is still a lack of publicly available and high-quality pig reference panels with large sample sizes and high diversity, which greatly limits the application of genotype imputation in pigs. In response, we built the pig Haplotype Reference Panel (PHARP) database. PHARP provides a reference panel of 2012 pig haplotypes at 34 million SNPs constructed using whole-genome sequence data from more than 49 studies of 71 pig breeds. It also provides Web-based analytical tools that allow researchers to carry out phasing and imputation consistently and efficiently. PHARP is freely accessible at http://alphaindex.zju.edu.cn/PHARP/index.php . We demonstrate its applicability for pig commercial 50 K SNP arrays, by accurately imputing 2.6 billion genotypes at a concordance rate value of 0.971 in 81 Large White pigs (~ 17 × sequencing coverage). We also applied our reference panel to impute the low-density SNP chip into the high-density data for three GWASs and found novel significantly associated SNPs that might be casual variants.
Collapse
Affiliation(s)
- Zhen Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhenyang Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zitao Chen
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Jiabao Sun
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Caiyun Cao
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Fen Wu
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhong Xu
- Hubei Key Laboratory of Animal Embryo and Molecular Breeding, Institute of Animal Husbandry and Veterinary, Hubei Provincial Academy of Agricultural Sciences, Wuhan, 430064, China
| | - Wei Zhao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hao Sun
- Department of Animal Science, School of Animal Science, Jilin University, Changchun, 130062, China
| | - Longyu Guo
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhe Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Qishan Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Yuchun Pan
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| |
Collapse
|
9
|
Reich P, Falker-Gieske C, Pook T, Tetens J. Development and validation of a horse reference panel for genotype imputation. Genet Sel Evol 2022; 54:49. [PMID: 35787788 PMCID: PMC9252005 DOI: 10.1186/s12711-022-00740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open
Abstract
Background Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. Results Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. Conclusions The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00740-8.
Collapse
Affiliation(s)
- Paula Reich
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.
| | - Clemens Falker-Gieske
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Torsten Pook
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| |
Collapse
|
10
|
Sepulveda BJ, Muir SK, Bolormaa S, Knight MI, Behrendt R, MacLeod IM, Pryce JE, Daetwyler HD. Eating Time as a Genetic Indicator of Methane Emissions and Feed Efficiency in Australian Maternal Composite Sheep. Front Genet 2022; 13:883520. [PMID: 35646089 PMCID: PMC9130857 DOI: 10.3389/fgene.2022.883520] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022] Open
Abstract
Previous studies have shown reduced enteric methane emissions (ME) and residual feed intake (RFI) through the application of genomic selection in ruminants. The objective of this study was to evaluate feeding behaviour traits as genetic indicators for ME and RFI in Australian Maternal Composite ewes using data from an automated feed intake facility. The feeding behaviour traits evaluated were the amount of time spent eating per day (eating time; ETD; min/day) and per visit (eating time per event; ETE; min/event), daily number of events (DNE), event feed intake (EFI; g/event) and eating rate (ER; g/min). Genotypes and phenotypes of 445 ewes at three different ages (post-weaning, hogget, and adult) were used to estimate the heritability of ME, RFI, and the feeding behaviour traits using univariate genomic best linear unbiased prediction models. Multivariate models were used to estimate the correlations between these traits and within each trait at different ages. The response to selection was evaluated for ME and RFI with direct selection models and indirect models with ETE as an indicator trait, as this behaviour trait was a promising indicator based on heritability and genetic correlations. Heritabilities were between 0.12 and 0.18 for ME and RFI, and between 0.29 and 0.47 for the eating behaviour traits. In our data, selecting for more efficient animals (low RFI) would lead to higher methane emissions per day and per kg of dry matter intake. Selecting for more ETE also improves feed efficiency but results in more methane per day and per kg dry matter intake. Based on our results, ETE could be evaluated as an indicator trait for ME and RFI under an index approach that allows simultaneous selection for improvement in emissions and feed efficiency. Selecting for ETE may have a tremendous impact on the industry, as it may be easier and cheaper to obtain than feed intake and ME data. As the data were collected using individual feeding units, the findings on this research should be validated under grazing conditions.
Collapse
Affiliation(s)
- Boris J Sepulveda
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | | | - Sunduimijid Bolormaa
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | | | - Ralph Behrendt
- Agriculture Victoria, Hamilton Centre, Hamilton, VIC, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Jennie E Pryce
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| |
Collapse
|
11
|
Chen L, Yang S, Araya S, Quigley C, Taliercio E, Mian R, Specht JE, Diers BW, Song Q. Genotype imputation for soybean nested association mapping population to improve precision of QTL detection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:1797-1810. [PMID: 35275252 PMCID: PMC9110473 DOI: 10.1007/s00122-022-04070-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 02/25/2022] [Indexed: 06/14/2023]
Abstract
KEY MESSAGE Software for high imputation accuracy in soybean was identified. Imputed dataset could significantly reduce the interval of genomic regions controlling traits, thus greatly improve the efficiency of candidate gene identification. Genotype imputation is a strategy to increase marker density of existing datasets without additional genotyping. We compared imputation performance of software BEAGLE 5.0, IMPUTE 5 and AlphaPlantImpute and tested software parameters that may help to improve imputation accuracy in soybean populations. Several factors including marker density, extent of linkage disequilibrium (LD), minor allele frequency (MAF), etc., were examined for their effects on imputation accuracy across different software. Our results showed that AlphaPlantImpute had a higher imputation accuracy than BEAGLE 5.0 or IMPUTE 5 tested in each soybean family, especially if the study progeny were genotyped with an extremely low number of markers. LD extent, MAF and reference panel size were positively correlated with imputation accuracy, a minimum number of 50 markers per chromosome and MAF of SNPs > 0.2 in soybean line were required to avoid a significant loss of imputation accuracy. Using the software, we imputed 5176 soybean lines in the soybean nested mapping population (NAM) with high-density markers of the 40 parents. The dataset containing 423,419 markers for 5176 lines and 40 parents was deposited at the Soybase. The imputed NAM dataset was further examined for the improvement of mapping quantitative trait loci (QTL) controlling soybean seed protein content. Most of the QTL identified were at identical or at similar position based on initial and imputed datasets; however, QTL intervals were greatly narrowed. The resulting genotypic dataset of NAM population will facilitate QTL mapping of traits and downstream applications. The information will also help to improve genotyping imputation accuracy in self-pollinated crops.
Collapse
Affiliation(s)
- Linfeng Chen
- Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD, 20705, USA
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Jiangsu Collaborative Innovation Center for Modern Crop Production, College of Agriculture, Soybean Research Institute, Nanjing Agricultural University, Nanjing, 210095, China
| | - Shouping Yang
- National Center for Soybean Improvement, Key Laboratory of Biology and Genetic Improvement of Soybean, Ministry of Agriculture, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Jiangsu Collaborative Innovation Center for Modern Crop Production, College of Agriculture, Soybean Research Institute, Nanjing Agricultural University, Nanjing, 210095, China.
| | - Susan Araya
- Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD, 20705, USA
| | - Charles Quigley
- Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD, 20705, USA
| | - Earl Taliercio
- Soybean and Nitrogen Fixation Research, USDA-ARS, Raleigh, NC, 27607, USA
| | - Rouf Mian
- Soybean and Nitrogen Fixation Research, USDA-ARS, Raleigh, NC, 27607, USA
| | - James E Specht
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, 68583, USA
| | - Brian W Diers
- Department of Crop Sciences, National Soybean Research Center, University of Illinois, 1101 West Peabody Drive, Urbana, IL, 61801, USA
| | - Qijian Song
- Soybean Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville Agricultural Research Center, Beltsville, MD, 20705, USA.
| |
Collapse
|
12
|
Wang F, Liu J, Zeng Q, Zhuoga D. Comparative analysis of long noncoding RNA and mRNA expression provides insights into adaptation to hypoxia in Tibetan sheep. Sci Rep 2022; 12:6597. [PMID: 35449433 PMCID: PMC9023463 DOI: 10.1038/s41598-022-08625-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 03/03/2022] [Indexed: 01/21/2023] Open
Abstract
Tibetan sheep have lived on the Qinghai-Tibetan Plateau for thousands of years and have good adaptability to the hypoxic environment and strong disease resistance. However, the molecular mechanism by which Tibetan sheep adapt to this extreme environment, especially the role of genetic regulation, is still unknown. Emerging evidence suggests that long noncoding RNAs (lncRNAs) participate in the regulation of a diverse range of biological processes. To explore the potential lncRNAs involved in the adaptation to high-altitude hypoxia of Tibetan sheep, we analysed the expression profile of lncRNAs and mRNAs in the liver and lung tissues of sheep using comparative transcriptome analysis between four Tibetan sheep populations (high altitude) and one Hu sheep population (low altitude). The results showed a total of 7848 differentially expressed (DE) lncRNA transcripts, and 22,971 DE mRNA transcripts were detected by pairwise comparison. The expression patterns of selected mRNAs and lncRNAs were validated by qRT-PCR, and the results correlated well with the transcriptome data. Moreover, the functional annotation analysis based on the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases showed that DE mRNAs and the target genes of the lncRNAs were significantly enriched in organ morphogenesis, response to stimulus, haem binding, the immune system, arginine and proline metabolism, and fatty acid biosynthesis. The prediction of mRNA–mRNA and lncRNA–mRNA interaction networks further revealed transcripts potentially involved in adaptation to high-altitude hypoxia, and the hub genes DDX24, PDCD11, EIF4A3, NDUFA11, SART1, PRPF8 and TCONS_00306477, TCONS_00306029, TCONS_00139593, TCONS_00293272, and TCONS_00313398 were selected. Additionally, a set of target genes, PIK3R1, IGF1R, FZD6, IFNB2, ATF3, MB, CYP2B4, PSMD13, and TGFB1, were also identified as candidate genes associated with high-altitude hypoxia adaptation. In conclusion, a collection of novel expressed lncRNAs, a set of target genes and biological pathways known to be relevant for altitude adaptation were identified by comparative transcriptome analysis between Tibetan sheep and Hu sheep. Our results are the first to identify the characterization and expression profile of lncRNAs between Tibetan sheep and Hu sheep and provide insights into the genetic regulation mechanisms by which Tibetan sheep adapt to high-altitude hypoxic environments.
Collapse
Affiliation(s)
- Fan Wang
- College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, 730070, China.,China Agricultural Veterinary Biological Science and Technology Co., Ltd., Lanzhou, 730046, China
| | - Jianbin Liu
- Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China.,Sheep Breeding Engineering Technology Research Center, Chinese Academy of Agricultural Sciences, Lanzhou, 730050, China
| | - Qiaoying Zeng
- College of Veterinary Medicine, Gansu Agricultural University, Lanzhou, 730070, China.
| | - Deqing Zhuoga
- Institute of Livestock Research, Tibet Academy of Agriculture and Animal Science, Lhasa, 850000, China.
| |
Collapse
|
13
|
Wientjes YCJ, Bijma P, Calus MPL, Zwaan BJ, Vitezica ZG, van den Heuvel J. The long-term effects of genomic selection: 1. Response to selection, additive genetic variance, and genetic architecture. Genet Sel Evol 2022; 54:19. [PMID: 35255802 PMCID: PMC8900405 DOI: 10.1186/s12711-022-00709-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 02/10/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Genomic selection has revolutionized genetic improvement in animals and plants, but little is known about its long-term effects. Here, we investigated the long-term effects of genomic selection on response to selection, genetic variance, and the genetic architecture of traits using stochastic simulations. We defined the genetic architecture as the set of causal loci underlying each trait, their allele frequencies, and their statistical additive effects. We simulated a livestock population under 50 generations of phenotypic, pedigree, or genomic selection for a single trait, controlled by either only additive, additive and dominance, or additive, dominance, and epistatic effects. The simulated epistasis was based on yeast data.
Results
Short-term response was always greatest with genomic selection, while response after 50 generations was greater with phenotypic selection than with genomic selection when epistasis was present, and was always greater than with pedigree selection. This was mainly because loss of genetic variance and of segregating loci was much greater with genomic and pedigree selection than with phenotypic selection. Compared to pedigree selection, selection response was always greater with genomic selection. Pedigree and genomic selection lost a similar amount of genetic variance after 50 generations of selection, but genomic selection maintained more segregating loci, which on average had lower minor allele frequencies than with pedigree selection. Based on this result, genomic selection is expected to better maintain genetic gain after 50 generations than pedigree selection. The amount of change in the genetic architecture of traits was considerable across generations and was similar for genomic and pedigree selection, but slightly less for phenotypic selection. Presence of epistasis resulted in smaller changes in allele frequencies and less fixation of causal loci, but resulted in substantial changes in statistical additive effects across generations.
Conclusions
Our results show that genomic selection outperforms pedigree selection in terms of long-term genetic gain, but results in a similar reduction of genetic variance. The genetic architecture of traits changed considerably across generations, especially under selection and when non-additive effects were present. In conclusion, non-additive effects had a substantial impact on the accuracy of selection and long-term response to selection, especially when selection was accurate.
Collapse
|
14
|
Cai Z, Christensen OF, Lund MS, Ostersen T, Sahana G. Large-scale association study on daily weight gain in pigs reveals overlap of genetic factors for growth in humans. BMC Genomics 2022; 23:133. [PMID: 35168569 PMCID: PMC8845347 DOI: 10.1186/s12864-022-08373-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 02/08/2022] [Indexed: 01/10/2023] Open
Abstract
Background Imputation from genotyping array to whole-genome sequence variants using resequencing of representative reference populations enhances our ability to map genetic factors affecting complex phenotypes in livestock species. The accumulation of knowledge about gene function in human and laboratory animals can provide substantial advantage for genomic research in livestock species. Results In this study, 201,388 pigs from three commercial Danish breeds genotyped with low to medium (8.5k to 70k) SNP arrays were imputed to whole genome sequence variants using a two-step approach. Both imputation steps achieved high accuracies, and in total this yielded 26,447,434 markers on 18 autosomes. The average estimated imputation accuracy of markers with minor allele frequency ≥ 0.05 was 0.94. To overcome the memory consumption of running genome-wide association study (GWAS) for each breed, we performed within-breed subpopulation GWAS then within-breed meta-analysis for average daily weight gain (ADG), followed by a multi-breed meta-analysis of GWAS summary statistics. We identified 15 quantitative trait loci (QTL). Our post-GWAS analysis strategy to prioritize of candidate genes including information like gene ontology, mammalian phenotype database, differential expression gene analysis of high and low feed efficiency pig and human GWAS catalog for height, obesity, and body mass index, we proposed MRAP2, LEPROT, PMAIP1, ENSSSCG00000036234, BMP2, ELFN1, LIG4 and FAM155A as the candidate genes with biological support for ADG in pigs. Conclusion Our post-GWAS analysis strategy helped to identify candidate genes not just by distance to the lead SNP but also by multiple sources of biological evidence. Besides, the identified QTL overlap with genes which are known for their association with human growth-related traits. The GWAS with this large data set showed the power to map the genetic factors associated with ADG in pigs and have added to our understanding of the genetics of growth across mammalian species. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08373-3.
Collapse
Affiliation(s)
- Zexi Cai
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| | | | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Tage Ostersen
- SEGES Danish Pig Research Centre, Agro Food Park 15, 8200, Aarhus N, Denmark
| | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| |
Collapse
|
15
|
Bedhane M, van der Werf J, de las Heras-Saldana S, Lim D, Park B, Na Park M, Seung Hee R, Clark S. The accuracy of genomic prediction for meat quality traits in Hanwoo cattle when using genotypes from different SNP densities and preselected variants from imputed whole genome sequence. ANIMAL PRODUCTION SCIENCE 2022. [DOI: 10.1071/an20659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Context
Genomic prediction is the use of genomic data in the estimation of genomic breeding values (GEBV) in animal breeding. In beef cattle breeding programs, genomic prediction increases the rates of genetic gain by increasing the accuracy of selection at earlier ages.
Aims
The objectives of the study were to examine the effect of single-nucleotide polymorphism (SNP) density and to evaluate the effect of using SNPs preselected from imputed whole-genome sequence for genomic prediction.
Methods
Genomic and phenotypic data from 2110 Hanwoo steers were used to predict GEBV for marbling score (MS), meat texture (MT), and meat colour (MC) traits. Three types of SNP densities including 50k, high-density (HD), and whole-genome sequence data and preselected SNPs from genome-wide association study (GWAS) were used for genomic prediction analyses. Two scenarios (independent and dependent discovery populations) were used to select top significant SNPs. The accuracy of GEBV was assessed using random cross-validation. Genomic best linear unbiased prediction (GBLUP) was used to predict the breeding values for each trait.
Key results
Our result showed that very similar prediction accuracies were observed across all SNP densities used in the study. The prediction accuracy among traits ranged from 0.29±0.05 for MC to 0.46±0.04 for MS. Depending on the studied traits, up to 5% of prediction accuracy improvement was obtained when the preselected SNPs from GWAS analysis were included in the prediction analysis.
Conclusions
High SNP density such as HD and the whole-genome sequence data yielded a similar prediction accuracy in Hanwoo beef cattle. Therefore, the 50K SNP chip panel is sufficient to capture the relationships in a breed with a small effective population size such as the Hanwoo cattle population. Preselected variants improved prediction accuracy when they were included in the genomic prediction model.
Implications
The estimated genomic prediction accuracies are moderately accurate in Hanwoo cattle and for searching for SNPs that are more productive could increase the accuracy of estimated breeding values for the studied traits.
Collapse
|
16
|
A major quantitative trait locus affecting resistance to Tilapia lake virus in farmed Nile tilapia (Oreochromis niloticus). Heredity (Edinb) 2021; 127:334-343. [PMID: 34262170 PMCID: PMC8405827 DOI: 10.1038/s41437-021-00447-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 05/31/2021] [Accepted: 05/31/2021] [Indexed: 02/06/2023] Open
Abstract
Enhancing host resistance to infectious disease has received increasing attention in recent years as a major goal of farm animal breeding programs. Combining field data with genomic tools can provide opportunities to understand the genetic architecture of disease resistance, leading to new opportunities for disease control. In the current study, a genome-wide association study was performed to assess resistance to the Tilapia lake virus (TiLV), one of the biggest threats affecting Nile tilapia (Oreochromis niloticus); a key aquaculture species globally. A pond outbreak of TiLV in a pedigreed population of the GIFT strain was observed, with 950 fish classified as either survivor or mortality, and genotyped using a 65 K SNP array. A significant QTL of large effect was identified on chromosome Oni22. The average mortality rate of tilapia homozygous for the resistance allele at the most significant SNP (P value = 4.51E-10) was 11%, compared to 43% for tilapia homozygous for the susceptibility allele. Several candidate genes related to host response to viral infection were identified within this QTL, including lgals17, vps52, and trim29. These results provide a rare example of a major QTL affecting a trait of major importance to a farmed animal. Genetic markers from the QTL region have potential in marker-assisted selection to improve host resistance, providing a genetic solution to an infectious disease where few other control or mitigation options currently exist.
Collapse
|
17
|
Moghaddar N, Brown DJ, Swan AA, Gurman PM, Li L, van der Werf JH. Genomic prediction in a numerically small breed population using prioritized genetic markers from whole-genome sequence data. J Anim Breed Genet 2021; 139:71-83. [PMID: 34374454 DOI: 10.1111/jbg.12638] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 06/08/2021] [Accepted: 07/19/2021] [Indexed: 11/30/2022]
Abstract
The objective of this study was to investigate the accuracy of genomic prediction of body weight and eating quality traits in a numerically small sheep population (Dorper sheep). Prediction was based on a large multi-breed/admixed reference population and using (a) 50k or 500k single nucleotide polymorphism (SNP) genotypes, (b) imputed whole-genome sequencing data (~31 million), (c) selected SNPs from whole genome sequence data and (d) 50k SNP genotypes plus selected SNPs from whole-genome sequence data. Furthermore, the impact of using a breed-adjusted genomic relationship matrix on accuracy of genomic breeding value was assessed. The selection of genetic variants was based on an association study performed on imputed whole-genome sequence data in an independent population, which was chosen either randomly from the base population or according to higher genetic proximity to the target population. Genomic prediction was based on genomic best linear unbiased prediction (GBLUP), and the accuracy of genomic prediction was assessed according to the correlation between genomic breeding value and corrected phenotypes divided by the square root of trait heritability. The accuracy of genomic prediction was between 0.20 and 0.30 across different traits based on common 50k SNP genotypes, which improved on average by 0.06 (absolute value) on average based on using prioritized genetic markers from whole-genome sequence data. Using prioritized genetic markers from a genetically more related GWAS population resulted in slightly higher prediction accuracy (0.02 absolute value) compared to genetic markers derived from a random GWAS population. Using high-density SNP genotypes or imputed whole-genome sequence data in GBLUP showed almost no improvement in genomic prediction accuracy however, accounting for different marker allele frequencies in reference population according to a breed-adjusted GRM resulted to on average 0.024 (absolute value) increase in accuracy of genomic prediction.
Collapse
Affiliation(s)
- Nasir Moghaddar
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Daniel J Brown
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Andrew A Swan
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Phillip M Gurman
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Li Li
- Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, Australia
| | - Julius H van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| |
Collapse
|
18
|
Wu P, Wang K, Zhou J, Chen D, Jiang A, Jiang Y, Zhu L, Qiu X, Li X, Tang G. A combined GWAS approach reveals key loci for socially-affected traits in Yorkshire pigs. Commun Biol 2021; 4:891. [PMID: 34285319 PMCID: PMC8292486 DOI: 10.1038/s42003-021-02416-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 06/29/2021] [Indexed: 02/06/2023] Open
Abstract
Socially affected traits in pigs are controlled by direct genetic effects and social genetic effects, which can make elucidation of their genetic architecture challenging. We evaluated the genetic basis of direct genetic effects and social genetic effects by combining single-locus and haplotype-based GWAS on imputed whole-genome sequences. Nineteen SNPs and 25 haplotype loci are identified for direct genetic effects on four traits: average daily feed intake, average daily gain, days to 100 kg and time in feeder per day. Nineteen SNPs and 11 haplotype loci are identified for social genetic effects on average daily feed intake, average daily gain, days to 100 kg and feeding speed. Two significant SNPs from single-locus GWAS (SSC6:18,635,874 and SSC6:18,635,895) are shared by a significant haplotype locus with haplotype alleles 'GGG' for both direct genetic effects and social genetic effects in average daily feed intake. A candidate gene, MT3, which is involved in growth, nervous, and immune processes, is identified. We demonstrate the genetic differences between direct genetic effects and social genetic effects and provide an anchor for investigating the genetic architecture underlying direct genetic effects and social genetic effects on socially affected traits in pigs.
Collapse
Affiliation(s)
- Pingxian Wu
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Kai Wang
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Jie Zhou
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Dejuan Chen
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Anan Jiang
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Yanzhi Jiang
- grid.80510.3c0000 0001 0185 3134College of Life Science, Sichuan Agricultural University, Yaan, Sichuan China
| | - Li Zhu
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Xiaotian Qiu
- grid.410634.4National Animal Husbandry Service, Beijing, Beijing, China
| | - Xuewei Li
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| | - Guoqing Tang
- grid.80510.3c0000 0001 0185 3134Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, Sichuan China
| |
Collapse
|
19
|
Jenkins CA, Schofield EC, Mellersh CS, De Risio L, Ricketts SL. Improving the resolution of canine genome-wide association studies using genotype imputation: A study of two breeds. Anim Genet 2021; 52:703-713. [PMID: 34252218 PMCID: PMC8514152 DOI: 10.1111/age.13117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 05/07/2021] [Accepted: 06/24/2021] [Indexed: 01/08/2023]
Abstract
Genotype imputation using a reference panel that combines high-density array data and publicly available whole genome sequence consortium variant data is potentially a cost-effective method to increase the density of extant lower-density array datasets. In this study, three datasets (two Border Collie; one Italian Spinone) generated using a legacy array (Illumina CanineHD, 173 662 SNPs) were utilised to assess the feasibility and accuracy of this approach and to gather additional evidence for the efficacy of canine genotype imputation. The cosmopolitan reference panels used to impute genotypes comprised dogs of 158 breeds, mixed breed dogs, wolves and Chinese indigenous dogs, as well as breed-specific individuals genotyped using the Axiom Canine HD array. The two Border Collie reference panels comprised 808 individuals including 79 Border Collies and 426 326 or 426 332 SNPs; and the Italian Spinone reference panel comprised 807 individuals including 38 Italian Spinoni and 476 313 SNPs. A high accuracy for imputation was observed, with the lowest accuracy observed for one of the Border Collie datasets (mean R2 = 0.94) and the highest for the Italian Spinone dataset (mean R2 = 0.97). This study’s findings demonstrate that imputation of a legacy array study set using a reference panel comprising both breed-specific array data and multi-breed variant data derived from whole genomes is effective and accurate. The process of canine genotype imputation, using the valuable growing resource of publicly available canine genome variant datasets alongside breed-specific data, is described in detail to facilitate and encourage use of this technique in canine genetics.
Collapse
Affiliation(s)
- Christopher A Jenkins
- Department of Veterinary Medicine, Kennel Club Genetics Centre1, University of Cambridge, Cambridge, UK.,Division of Population Health, Health Services Research & Primary Care, University of Manchester, Manchester, UK
| | | | - Ellen C Schofield
- Department of Veterinary Medicine, Kennel Club Genetics Centre1, University of Cambridge, Cambridge, UK
| | - Cathryn S Mellersh
- Department of Veterinary Medicine, Kennel Club Genetics Centre1, University of Cambridge, Cambridge, UK
| | - Luisa De Risio
- Neurology/Neurosurgery Service, Centre for Small Animal Studies, Animal Health Trust, Newmarket, Suffolk, UK
| | - Sally L Ricketts
- Department of Veterinary Medicine, Kennel Club Genetics Centre1, University of Cambridge, Cambridge, UK.,Division of Population Health, Health Services Research & Primary Care, University of Manchester, Manchester, UK
| |
Collapse
|
20
|
Bolormaa S, Swan AA, Stothard P, Khansefid M, Moghaddar N, Duijvesteijn N, van der Werf JHJ, Daetwyler HD, MacLeod IM. A conditional multi-trait sequence GWAS discovers pleiotropic candidate genes and variants for sheep wool, skin wrinkle and breech cover traits. Genet Sel Evol 2021; 53:58. [PMID: 34238208 PMCID: PMC8268212 DOI: 10.1186/s12711-021-00651-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 06/29/2021] [Indexed: 12/01/2022] Open
Abstract
Background Imputation to whole-genome sequence is now possible in large sheep populations. It is therefore of interest to use this data in genome-wide association studies (GWAS) to investigate putative causal variants and genes that underpin economically important traits. Merino wool is globally sought after for luxury fabrics, but some key wool quality attributes are unfavourably correlated with the characteristic skin wrinkle of Merinos. In turn, skin wrinkle is strongly linked to susceptibility to “fly strike” (Cutaneous myiasis), which is a major welfare issue. Here, we use whole-genome sequence data in a multi-trait GWAS to identify pleiotropic putative causal variants and genes associated with changes in key wool traits and skin wrinkle. Results A stepwise conditional multi-trait GWAS (CM-GWAS) identified putative causal variants and related genes from 178 independent quantitative trait loci (QTL) of 16 wool and skin wrinkle traits, measured on up to 7218 Merino sheep with 31 million imputed whole-genome sequence (WGS) genotypes. Novel candidate gene findings included the MAT1A gene that encodes an enzyme involved in the sulphur metabolism pathway critical to production of wool proteins, and the ESRP1 gene. We also discovered a significant wrinkle variant upstream of the HAS2 gene, which in dogs is associated with the exaggerated skin folds in the Shar-Pei breed. Conclusions The wool and skin wrinkle traits studied here appear to be highly polygenic with many putative candidate variants showing considerable pleiotropy. Our CM-GWAS identified many highly plausible candidate genes for wool traits as well as breech wrinkle and breech area wool cover. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00651-0.
Collapse
Affiliation(s)
- Sunduimijid Bolormaa
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia. .,Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.
| | - Andrew A Swan
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Animal Genetics and Breeding Unit, University of New England, Armidale, NSW, 2351, Australia
| | - Paul Stothard
- Faculty of Agricultural, Life & Environmental Sciences, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Majid Khansefid
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia
| | - Nasir Moghaddar
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Naomi Duijvesteijn
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Hendrix Genetics, Boxmeer, The Netherlands
| | - Julius H J van der Werf
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia
| |
Collapse
|
21
|
Cesarani A, Biffani S, Garcia A, Lourenco D, Bertolini G, Neglia G, Misztal I, Macciotta NPP. Genomic investigation of milk production in Italian buffalo. ITALIAN JOURNAL OF ANIMAL SCIENCE 2021. [DOI: 10.1080/1828051x.2021.1902404] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Affiliation(s)
- Alberto Cesarani
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Stefano Biffani
- Consiglio Nazionale delle Ricerche (CNR), Istituto di biologia e biotecnologia agraria (IBBA), Milano, Italy
| | - Andre Garcia
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | - Giacomo Bertolini
- Associazione Nazionale Allevatori Specie Bufalina (ANASB), Caserta, Italy
| | - Gianluca Neglia
- Dipartimento di Medicina Veterinaria e Produzioni Animali, II University of Naples, Napoli, Italy
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, USA
| | | |
Collapse
|
22
|
Genetic basis and identification of candidate genes for wooden breast and white striping in commercial broiler chickens. Sci Rep 2021; 11:6785. [PMID: 33762630 PMCID: PMC7990949 DOI: 10.1038/s41598-021-86176-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 03/08/2021] [Indexed: 02/07/2023] Open
Abstract
Wooden breast (WB) and white striping (WS) are highly prevalent and economically damaging muscle disorders of modern commercial broiler chickens characterized respectively by palpable firmness and fatty white striations running parallel to the muscle fiber. High feed efficiency and rapid growth, especially of the breast muscle, are believed to contribute to development of such muscle defects; however, their etiology remains poorly understood. To gain insight into the genetic basis of these myopathies, a genome-wide association study was conducted using a commercial crossbred broiler population (n = 1193). Heritability was estimated at 0.5 for WB and WS with high genetic correlation between them (0.88). GWAS revealed 28 quantitative trait loci (QTL) on five chromosomes for WB and 6 QTL on one chromosome for WS, with the majority of QTL for both myopathies located in a ~ 8 Mb region of chromosome 5. This region has highly conserved synteny with a portion of human chromosome 11 containing a cluster of imprinted genes associated with growth and metabolic disorders such as type 2 diabetes and Beckwith-Wiedemann syndrome. Candidate genes include potassium voltage-gated channel subfamily Q member 1 (KCNQ1), involved in insulin secretion and cardiac electrical activity, lymphocyte-specific protein 1 (LSP1), involved in inflammation and immune response.
Collapse
|
23
|
Fernandes Júnior GA, Carvalheiro R, de Oliveira HN, Sargolzaei M, Costilla R, Ventura RV, Fonseca LFS, Neves HHR, Hayes BJ, de Albuquerque LG. Imputation accuracy to whole-genome sequence in Nellore cattle. Genet Sel Evol 2021; 53:27. [PMID: 33711929 PMCID: PMC7953568 DOI: 10.1186/s12711-021-00622-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 03/05/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A cost-effective strategy to explore the complete DNA sequence in animals for genetic evaluation purposes is to sequence key ancestors of a population, followed by imputation mechanisms to infer marker genotypes that were not originally reported in a target population of animals genotyped with single nucleotide polymorphism (SNP) panels. The feasibility of this process relies on the accuracy of the genotype imputation in that population, particularly for potential causal mutations which may be at low frequency and either within genes or regulatory regions. The objective of the present study was to investigate the imputation accuracy to the sequence level in a Nellore beef cattle population, including that for variants in annotation classes which are more likely to be functional. METHODS Information of 151 key sequenced Nellore sires were used to assess the imputation accuracy from bovine HD BeadChip SNP (~ 777 k) to whole-genome sequence. The choice of the sires aimed at optimizing the imputation accuracy of a genotypic database, comprised of about 10,000 genotyped Nellore animals. Genotype imputation was performed using two computational approaches: FImpute3 and Minimac4 (after using Eagle for phasing). The accuracy of the imputation was evaluated using a fivefold cross-validation scheme and measured by the squared correlation between observed and imputed genotypes, calculated by individual and by SNP. SNPs were classified into a range of annotations, and the accuracy of imputation within each annotation classification was also evaluated. RESULTS High average imputation accuracies per animal were achieved using both FImpute3 (0.94) and Minimac4 (0.95). On average, common variants (minor allele frequency (MAF) > 0.03) were more accurately imputed by Minimac4 and low-frequency variants (MAF ≤ 0.03) were more accurately imputed by FImpute3. The inherent Minimac4 Rsq imputation quality statistic appears to be a good indicator of the empirical Minimac4 imputation accuracy. Both software provided high average SNP-wise imputation accuracy for all classes of biological annotations. CONCLUSIONS Our results indicate that imputation to whole-genome sequence is feasible in Nellore beef cattle since high imputation accuracies per individual are expected. SNP-wise imputation accuracy is software-dependent, especially for rare variants. The accuracy of imputation appears to be relatively independent of annotation classification.
Collapse
Affiliation(s)
| | - Roberto Carvalheiro
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Henrique N de Oliveira
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil.,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil
| | - Mehdi Sargolzaei
- Ontario Veterinary College, UG, Guelph, Canada.,Select Sires Inc., Plain City, OH, USA
| | - Roy Costilla
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Ricardo V Ventura
- School of Veterinary Medicine and Animal Science, USP, Pirassununga, SP, 13635-900, Brazil
| | - Larissa F S Fonseca
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil
| | | | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, UQ, Brisbane, QLD, 4072, Australia
| | - Lucia G de Albuquerque
- School of Agricultural and Veterinarian Sciences, UNESP, Jaboticabal, SP, 14884-900, Brazil. .,National Council for Scientific and Technological Development, CNPq, Brasília, DF, 71605-001, Brazil.
| |
Collapse
|
24
|
Yuan Z, Sunduimijid B, Xiang R, Behrendt R, Knight MI, Mason BA, Reich CM, Prowse-Wilkins C, Vander Jagt CJ, Chamberlain AJ, MacLeod IM, Li F, Yue X, Daetwyler HD. Expression quantitative trait loci in sheep liver and muscle contribute to variations in meat traits. Genet Sel Evol 2021; 53:8. [PMID: 33461502 PMCID: PMC7812657 DOI: 10.1186/s12711-021-00602-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 01/08/2021] [Indexed: 11/10/2022] Open
Abstract
Background Variants that regulate transcription, such as expression quantitative trait loci (eQTL), have shown enrichment in genome-wide association studies (GWAS) for mammalian complex traits. However, no study has reported eQTL in sheep, although it is an important agricultural species for which many GWAS of complex meat traits have been conducted. Using RNA sequence data produced from liver and muscle from 149 sheep and imputed whole-genome single nucleotide polymorphisms (SNPs), our aim was to dissect the genetic architecture of the transcriptome by associating sheep genotypes with three major molecular phenotypes including gene expression (geQTL), exon expression (eeQTL) and RNA splicing (sQTL). We also examined these three types of eQTL for their enrichment in GWAS of multi-meat traits and fatty acid profiles. Results Whereas a relatively small number of molecular phenotypes were significantly heritable (h2 > 0, P < 0.05), their mean heritability ranged from 0.67 to 0.73 for liver and from 0.71 to 0.77 for muscle. Association analysis between molecular phenotypes and SNPs within ± 1 Mb identified many significant cis-eQTL (false discovery rate, FDR < 0.01). The median distance between the eQTL and transcription start sites (TSS) ranged from 68 to 153 kb across the three eQTL types. The number of common variants between geQTL, eeQTL and sQTL within each tissue, and the number of common variants between liver and muscle within each eQTL type were all significantly (P < 0.05) larger than expected by chance. The identified eQTL were significantly (P < 0.05) enriched in GWAS hits associated with 56 carcass traits and fatty acid profiles. For example, several geQTL in muscle mapped to the FAM184B gene, hundreds of sQTL in liver and muscle mapped to the CAST gene, and hundreds of sQTL in liver mapped to the C6 gene. These three genes are associated with body composition or fatty acid profiles. Conclusions We detected a large number of significant eQTL and found that the overlap of variants between eQTL types and tissues was prevalent. Many eQTL were also QTL for meat traits. Our study fills a gap in the knowledge on the regulatory variants and their role in complex traits for the sheep model.
Collapse
Affiliation(s)
- Zehu Yuan
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China.,Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Institutes of Agricultural Science and Technology Development (Joint International Research Laboratory of Agriculture & Agri-Product Safety), Yangzhou University, Yangzhou, 225000, People's Republic of China
| | - Bolormaa Sunduimijid
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Ruidong Xiang
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia.,Faculty of Veterinary & Agricultural Science, University of Melbourne, Parkville, VIC, 3010, Australia
| | - Ralph Behrendt
- Agriculture Victoria, Hamilton Centre, Hamilton, VIC, 3300, Australia
| | - Matthew I Knight
- Agriculture Victoria, Hamilton Centre, Hamilton, VIC, 3300, Australia
| | - Brett A Mason
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Coralie M Reich
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Claire Prowse-Wilkins
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Christy J Vander Jagt
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Amanda J Chamberlain
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Iona M MacLeod
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia
| | - Fadi Li
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China
| | - Xiangpeng Yue
- State Key Laboratory of Grassland Agro-ecosystems; Key Laboratory of Grassland Livestock Industry Innovation, Ministry of Agriculture and Rural Affairs; Grassland Agriculture Engineering Center, Ministry of Education; College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730020, People's Republic of China.
| | - Hans D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, 3083, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia.
| |
Collapse
|
25
|
Yoshida GM, Yáñez JM. Multi-trait GWAS using imputed high-density genotypes from whole-genome sequencing identifies genes associated with body traits in Nile tilapia. BMC Genomics 2021; 22:57. [PMID: 33451291 PMCID: PMC7811220 DOI: 10.1186/s12864-020-07341-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 12/22/2020] [Indexed: 12/16/2022] Open
Abstract
Background Body traits are generally controlled by several genes in vertebrates (i.e. polygenes), which in turn make them difficult to identify through association mapping. Increasing the power of association studies by combining approaches such as genotype imputation and multi-trait analysis improves the ability to detect quantitative trait loci associated with polygenic traits, such as body traits. Results A multi-trait genome-wide association study (mtGWAS) was performed to identify quantitative trait loci (QTL) and genes associated with body traits in Nile tilapia (Oreochromis niloticus) using genotypes imputed to whole-genome sequences (WGS). To increase the statistical power of mtGWAS for the detection of genetic associations, summary statistics from single-trait genome-wide association studies (stGWAS) for eight different body traits recorded in 1309 animals were used. The mtGWAS increased the statistical power from the original sample size from 13 to 44%, depending on the trait analyzed. The better resolution of the WGS data, combined with the increased power of the mtGWAS approach, allowed the detection of significant markers which were not previously found in the stGWAS. Some of the lead single nucleotide polymorphisms (SNPs) were found within important functional candidate genes previously associated with growth-related traits in other terrestrial species. For instance, we identified SNP within the α1,6-fucosyltransferase (FUT8), solute carrier family 4 member 2 (SLC4A2), A disintegrin and metalloproteinase with thrombospondin motifs 9 (ADAMTS9) and heart development protein with EGF like domains 1 (HEG1) genes, which have been associated with average daily gain in sheep, osteopetrosis in cattle, chest size in goats, and growth and meat quality in sheep, respectively. Conclusions The high-resolution mtGWAS presented here allowed the identification of significant SNPs, linked to strong functional candidate genes, associated with body traits in Nile tilapia. These results provide further insights about the genetic variants and genes underlying body trait variation in cichlid fish with high accuracy and strong statistical support. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-020-07341-z.
Collapse
Affiliation(s)
- Grazyella M Yoshida
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile
| | - José M Yáñez
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Santiago, Chile. .,Núcleo Milenio INVASAL, Concepción, Chile.
| |
Collapse
|
26
|
McEwin RA, Hebart ML, Oakey H, Pitchford WS. Within-breed selection is sufficient to improve terminal crossbred beef marbling: a review of reciprocal recurrent genomic selection. ANIMAL PRODUCTION SCIENCE 2021. [DOI: 10.1071/an21085] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Reciprocal recurrent selection is the selection of purebreds for crossbred performance and takes advantage of additive and non-additive variance by using pedigreed progeny performance records. Developed in maize, the adoption of this approach in livestock breeding has been limited to the pork and poultry industries; genomic selection may facilitate its extension into the beef industry by replacing pedigree. The literature regarding the relative importance of additive versus non-additive variance and reciprocal recurrent genomic selection models was reviewed. The potential for using reciprocal recurrent genomic selection in a terminal Wagyu × Angus cross scenario was examined. Non-additive variance is more important for fitness traits and accounts for a small proportion of variance related to production traits such as marbling. In general, reciprocal recurrent selection was not significantly better at improving performance of crossbreds than was traditional selection within parental breeds using only additive variance in the studies examined. Simulation studies showed benefits of including dominance or breed-specific allele effects in prediction models but advantages were small as more realistic simulations were examined. On the basis of the evidence, it is likely that in a terminal two-way cross-beef scenario utilising Wagyu sires and Angus dams, where selection emphasis is on marbling, selection of purebreds on the basis of additive variance will allow substantial progress to be realised.
Collapse
|
27
|
Hermisdorff IDC, Costa RB, de Albuquerque LG, Pausch H, Kadri NK. Investigating the accuracy of imputing autosomal variants in Nellore cattle using the ARS-UCD1.2 assembly of the bovine genome. BMC Genomics 2020; 21:772. [PMID: 33167856 PMCID: PMC7654006 DOI: 10.1186/s12864-020-07184-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Accepted: 10/26/2020] [Indexed: 11/22/2022] Open
Abstract
Background Imputation accuracy among other things depends on the size of the reference panel, the marker’s minor allele frequency (MAF), and the correct placement of single nucleotide polymorphism (SNP) on the reference genome assembly. Using high-density genotypes of 3938 Nellore cattle from Brazil, we investigated the accuracy of imputation from 50 K to 777 K SNP density using Minimac3, when map positions were determined according to the bovine genome assemblies UMD3.1 and ARS-UCD1.2. We assessed the effect of reference and target panel sizes on the pre-phasing based imputation quality using ten-fold cross-validation. Further, we compared the reliability of the model-based imputation quality score (Rsq) from Minimac3 to the empirical imputation accuracy. Results The overall accuracy of imputation measured as the squared correlation between true and imputed allele dosages (R2dose) was almost identical using either the UMD3.1 or ARS-UCD1.2 genome assembly. When the size of the reference panel increased from 250 to 2000, R2dose increased from 0.845 to 0.917, and the number of polymorphic markers in the imputed data set increased from 586,701 to 618,660. Advantages in both accuracy and marker density were also observed when larger target panels were imputed, likely resulting from more accurate haplotype inference. Imputation accuracy increased from 0.903 to 0.913, and the marker density in the imputed data increased from 593,239 to 595,570 when haplotypes were inferred in 500 and 2900 target animals. The model-based imputation quality scores from Minimac3 (Rsq) were systematically higher than empirically estimated accuracies. However, both metrics were positively correlated and the correlation increased with the size of the reference panel and MAF of imputed variants. Conclusions Accurate imputation of BovineHD BeadChip markers is possible in Nellore cattle using the new bovine reference genome assembly ARS-UCD1.2. The use of large reference and target panels improves the accuracy of the imputed genotypes and provides genotypes for more markers segregating at low frequency for downstream genomic analyses. The model-based imputation quality score from Minimac3 (Rsq) can be used to detect poorly imputed variants but its reliability depends on the size of the reference panel and MAF of the imputed variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-020-07184-8.
Collapse
Affiliation(s)
- Isis da Costa Hermisdorff
- School of Veterinary Medicine and Animal Science, Federal University of Bahia (UFBA), Salvador, Brazil.,Animal Genomics, ETH Zurich, Zurich, Switzerland
| | - Raphael Bermal Costa
- School of Veterinary Medicine and Animal Science, Federal University of Bahia (UFBA), Salvador, Brazil
| | - Lucia Galvão de Albuquerque
- Animal Science Department, School of Agricultural and Veterinary Sciences, São Paulo State University (Unesp), Jaboticabal, São Paulo, Brazil
| | | | | |
Collapse
|
28
|
Talouarn E, Teissier M, Bardou P, Larroque H, Clément V, Palhière I, Tosser-Klopp G, Rupp R, Robert-Granié C. Using sequence variants of a QTL region improves the accuracy of genomic evaluation in French Saanen goats. J Dairy Sci 2020; 104:588-601. [PMID: 33131807 DOI: 10.3168/jds.2020-18837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 08/11/2020] [Indexed: 11/19/2022]
Abstract
The enhanced availability of sequence data in livestock provides an opportunity for more accurate predictions in routine genomic evaluations. Such evaluations would therefore no longer rely only on the linkage disequilibrium between a chip marker and the causal mutation. The objective of this study was to assess the usefulness of sequence data in Saanen goats (n = 33) to better capture a quantitative trait locus (QTL) on chromosome 19 (CHI19) and improve the accuracy of predictions for 3 milk production traits, 5 type traits, and somatic cell scores. All 1,207 50K genotypes were imputed to the sequence level. Four scenarios, each using a subset of CHI19 imputed variants, were then tested. Sequence-derived information included all CHI19 variants (529,576), all variants in the QTL region (22,269), 178 variants selected in the QTL region and added to an updated chip, or 178 randomly selected variants on CHI19. Two genomic evaluation models were applied: single-step genomic BLUP and weighted single-step genomic BLUP. All scenarios were compared with single-step genomic BLUP using 50K genotypes. Best overall results were obtained using single-step genomic BLUP on 50K genotypes completed with all variants in the QTL region of chromosome 19 (6.2% average increase in accuracy for 9 traits) with the highest accuracy gain for fat yield (17.9%), significant increases for milk (13.7%) and protein yields (12.5%), and type traits associated with CHI19. Despite its association with the QTL region of chromosome 19, the somatic cell score showed decreased accuracy in every alternative scenario. Using all CHI19 variants led to an overall decrease of 4.8% in prediction accuracy. The updated chip was efficient and improved genomic evaluations by 3.1 to 6.4% on average, depending on the scenario. Indeed, information from only a few carefully selected variants increased accuracies for traits of interest when used in a single-step genomic BLUP model. In conclusion, using QTL region variants imputed from sequence data in single-step genomic evaluations represents a promising perspective for such evaluations in dairy goats. Furthermore, using only a limited number of selected variants in QTL regions, as available on SNP chip updates, significantly increases the accuracy for QTL-associated traits without deteriorating the evaluation accuracy for other traits. The latter approach is interesting, as it avoids time-consuming imputation and data formatting processes and provides reliable genotypes.
Collapse
Affiliation(s)
- Estelle Talouarn
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France.
| | - Marc Teissier
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Hélène Larroque
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | |
Collapse
|
29
|
de Las Heras-Saldana S, Lopez BI, Moghaddar N, Park W, Park JE, Chung KY, Lim D, Lee SH, Shin D, van der Werf JHJ. Use of gene expression and whole-genome sequence information to improve the accuracy of genomic prediction for carcass traits in Hanwoo cattle. Genet Sel Evol 2020; 52:54. [PMID: 32993481 PMCID: PMC7525992 DOI: 10.1186/s12711-020-00574-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 09/18/2020] [Indexed: 12/21/2022] Open
Abstract
Background In this study, we assessed the accuracy of genomic prediction for carcass weight (CWT), marbling score (MS), eye muscle area (EMA) and back fat thickness (BFT) in Hanwoo cattle when using genomic best linear unbiased prediction (GBLUP), weighted GBLUP (wGBLUP), and a BayesR model. For these models, we investigated the potential gain from using pre-selected single nucleotide polymorphisms (SNPs) from a genome-wide association study (GWAS) on imputed sequence data and from gene expression information. We used data on 13,717 animals with carcass phenotypes and imputed sequence genotypes that were split in an independent GWAS discovery set of varying size and a remaining set for validation of prediction. Expression data were used from a Hanwoo gene expression experiment based on 45 animals. Results Using a larger number of animals in the reference set increased the accuracy of genomic prediction whereas a larger independent GWAS discovery dataset improved identification of predictive SNPs. Using pre-selected SNPs from GWAS in GBLUP improved accuracy of prediction by 0.02 for EMA and up to 0.05 for BFT, CWT, and MS, compared to a 50 k standard SNP array that gave accuracies of 0.50, 0.47, 0.58, and 0.47, respectively. Accuracy of prediction of BFT and CWT increased when BayesR was applied with the 50 k SNP array (0.02 and 0.03, respectively) and was further improved by combining the 50 k array with the top-SNPs (0.06 and 0.04, respectively). By contrast, using BayesR resulted in limited improvement for EMA and MS. wGBLUP did not improve accuracy but increased prediction bias. Based on the RNA-seq experiment, we identified informative expression quantitative trait loci, which, when used in GBLUP, improved the accuracy of prediction slightly, i.e. between 0.01 and 0.02. SNPs that were located in genes, the expression of which was associated with differences in trait phenotype, did not contribute to a higher prediction accuracy. Conclusions Our results show that, in Hanwoo beef cattle, when SNPs are pre-selected from GWAS on imputed sequence data, the accuracy of prediction improves only slightly whereas the contribution of SNPs that are selected based on gene expression is not significant. The benefit of statistical models to prioritize selected SNPs for estimating genomic breeding values is trait-specific and depends on the genetic architecture of each trait.
Collapse
Affiliation(s)
| | - Bryan Irvine Lopez
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, Rural Development Administration, Wanju, 55365, Republic of Korea
| | - Nasir Moghaddar
- School of Environmental and Rural Science, University of New England, Armidale, NSW 2351, Australia
| | - Woncheoul Park
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, Rural Development Administration, Wanju, 55365, Republic of Korea
| | - Jong-Eun Park
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, Rural Development Administration, Wanju, 55365, Republic of Korea
| | - Ki Y Chung
- Department of Beef Science, Korea National College of Agriculture and Fisheries, Jeonju, Republic of Korea
| | - Dajeong Lim
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, Rural Development Administration, Wanju, 55365, Republic of Korea.
| | - Seung H Lee
- Division of Animal and Dairy Science, Chungnam National University, Deajeon, 34148, Republic of Korea
| | - Donghyun Shin
- The Animal Molecular Genetics and Breeding Centre, Jeonbuk National University, Jeonju, 54896, Republic of Korea
| | - Julius H J van der Werf
- School of Environmental and Rural Science, University of New England, Armidale, NSW 2351, Australia.
| |
Collapse
|
30
|
Liu A, Lund MS, Boichard D, Mao X, Karaman E, Fritz S, Aamand GP, Wang Y, Su G. Imputation for sequencing variants preselected to a customized low-density chip. Sci Rep 2020; 10:9524. [PMID: 32533087 PMCID: PMC7293337 DOI: 10.1038/s41598-020-66523-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 05/19/2020] [Indexed: 12/27/2022] Open
Abstract
The sequencing variants preselected from association analyses and bioinformatics analyses could improve genomic prediction. In this study, the imputation of sequencing SNPs preselected from major dairy breeds in Denmark-Finland-Sweden (DFS) and France (FRA) was investigated for both contemporary animals and old bulls in Danish Jersey. For contemporary animals, a two-step imputation which first imputed to 54 K and then to 54 K + DFS + FRA SNPs achieved highest accuracy. Correlations between observed and imputed genotypes were 91.6% for DFS SNPs and 87.6% for FRA SNPs, while concordance rates were 96.6% for DFS SNPs and 93.5% for FRA SNPs. The SNPs with lower minor allele frequency (MAF) tended to have lower correlations but higher concordance rates. For old bulls, imputation for DFS and FRA SNPs were relatively accurate even for bulls without progenies (correlations higher than 97.2% and concordance rates higher than 98.4%). For contemporary animals, given limited imputation accuracy of preselected sequencing SNPs especially for SNPs with low MAF, it would be a good strategy to directly genotype preselected sequencing SNPs with a customized SNP chip. For old bulls, given high imputation accuracy for preselected sequencing SNPs with all MAF ranges, it would be unnecessary to re-genotype preselected sequencing SNPs.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France
| | - Xiaowei Mao
- Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 100044, Beijing, P.R. China.,CAS Center for Excellence in Life and Paleoenvironment, 100044, Beijing, P.R. China
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| |
Collapse
|
31
|
Ros-Freixedes R, Whalen A, Chen CY, Gorjanc G, Herring WO, Mileham AJ, Hickey JM. Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations. Genet Sel Evol 2020; 52:17. [PMID: 32248811 PMCID: PMC7132992 DOI: 10.1186/s12711-020-00536-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 03/27/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND The coupling of appropriate sequencing strategies and imputation methods is critical for assembling large whole-genome sequence datasets from livestock populations for research and breeding. In this paper, we describe and validate the coupling of a sequencing strategy with the imputation method hybrid peeling in real animal breeding settings. METHODS We used data from four pig populations of different size (18,349 to 107,815 individuals) that were widely genotyped at densities between 15,000 and 75,000 markers genome-wide. Around 2% of the individuals in each population were sequenced (most of them at 1× or 2× and 37-92 individuals per population, totalling 284, at 15-30×). We imputed whole-genome sequence data with hybrid peeling. We evaluated the imputation accuracy by removing the sequence data of the 284 individuals with high coverage, using a leave-one-out design. We simulated data that mimicked the sequencing strategy used in the real populations to quantify the factors that affected the individual-wise and variant-wise imputation accuracies using regression trees. RESULTS Imputation accuracy was high for the majority of individuals in all four populations (median individual-wise dosage correlation: 0.97). Imputation accuracy was lower for individuals in the earliest generations of each population than for the rest, due to the lack of marker array data for themselves and their ancestors. The main factors that determined the individual-wise imputation accuracy were the genotyping status, the availability of marker array data for immediate ancestors, and the degree of connectedness to the rest of the population, but sequencing coverage of the relatives had no effect. The main factors that determined variant-wise imputation accuracy were the minor allele frequency and the number of individuals with sequencing coverage at each variant site. Results were validated with the empirical observations. CONCLUSIONS We demonstrate that the coupling of an appropriate sequencing strategy and hybrid peeling is a powerful strategy for generating whole-genome sequence data with high accuracy in large pedigreed populations where only a small fraction of individuals (2%) had been sequenced, mostly at low coverage. This is a critical step for the successful implementation of whole-genome sequence data for genomic prediction and fine-mapping of causal variants.
Collapse
Affiliation(s)
- Roger Ros-Freixedes
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
- Departament de Ciència Animal, Universitat de Lleida-Agrotecnio Center, Lleida, Spain
| | - Andrew Whalen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Ching-Yi Chen
- The Pig Improvement Company, Genus plc, 100 Bluegrass Commons Blvd Ste 2200, Hendersonville, TN 37075 USA
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - William O. Herring
- The Pig Improvement Company, Genus plc, 100 Bluegrass Commons Blvd Ste 2200, Hendersonville, TN 37075 USA
| | | | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| |
Collapse
|
32
|
Talouarn E, Bardou P, Palhière I, Oget C, Clément V, Tosser-Klopp G, Rupp R, Robert-Granié C. Genome wide association analysis on semen volume and milk yield using different strategies of imputation to whole genome sequence in French dairy goats. BMC Genet 2020; 21:19. [PMID: 32085723 PMCID: PMC7035711 DOI: 10.1186/s12863-020-0826-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 02/13/2020] [Indexed: 01/17/2023] Open
Abstract
Background Goats were domesticated 10,500 years ago to supply humans with useful resources. Since then, specialized breeds that are adapted to their local environment have been developed and display specific genetic profiles. The VarGoats project is a 1000 genomes resequencing program designed to cover the genetic diversity of the Capra genus. In this study, our main objective was to assess the use of sequence data to detect genomic regions associated with traits of interest in French Alpine and Saanen breeds. Results Direct imputation from the GoatSNP50 BeadChip genotypes to sequence level was investigated in these breeds using FImpute and different reference panels: within-breed, all Capra hircus sequenced individuals, European goats and French mainland goats. The best results were obtained with the French goat panel with allele and genotype concordance rates reaching 0.86 and 0.75 in the Alpine and 0.86 and 0.73 in the Saanen breed respectively. Mean correlations tended to be low in both breeds due to the high proportion of variants with low frequencies. For association analysis, imputation was performed using FImpute for 1129 French Alpine and Saanen males using within-breed and French panels on 23,338,436 filtered variants. The association results of both imputation scenarios were then compared. In Saanen goats, a large region on chromosome 19 was significantly linked to semen volume and milk yield in both scenarios. Significant variants for milk yield were annotated for 91 genes on chromosome 19 in Saanen goats. For semen volume, the annotated genes include YBOX2 which is related to azoospermia or oligospermia in other species. New signals for milk yield were detected on chromosome 2 in Alpine goats and on chromosome 5 in Saanen goats when using a multi-breed panel. Conclusion Even with very small reference populations, an acceptable imputation quality can be achieved in French dairy goats. GWAS on imputed sequences confirmed the existence of QTLs and identified new regions of interest in dairy goats. Adding identified candidates to a genotyping array and sequencing more individuals might corroborate the involvement of identified regions while removing potential imputation errors.
Collapse
Affiliation(s)
- Estelle Talouarn
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.
| | - Philippe Bardou
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France.,Sigenae, INRAE, 31326, Castanet-Tolosan, France
| | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Claire Oget
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | | | | - Gwenola Tosser-Klopp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326, Castanet Tolosan, France
| | | |
Collapse
|
33
|
Genomic prediction based on selected variants from imputed whole-genome sequence data in Australian sheep populations. Genet Sel Evol 2019; 51:72. [PMID: 31805849 PMCID: PMC6896509 DOI: 10.1186/s12711-019-0514-2] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 11/25/2019] [Indexed: 12/13/2022] Open
Abstract
Background Whole-genome sequence (WGS) data could contain information on genetic variants at or in high linkage disequilibrium with causative mutations that underlie the genetic variation of polygenic traits. Thus far, genomic prediction accuracy has shown limited increase when using such information in dairy cattle studies, in which one or few breeds with limited diversity predominate. The objective of our study was to evaluate the accuracy of genomic prediction in a multi-breed Australian sheep population of relatively less related target individuals, when using information on imputed WGS genotypes. Methods Between 9626 and 26,657 animals with phenotypes were available for nine economically important sheep production traits and all had WGS imputed genotypes. About 30% of the data were used to discover predictive single nucleotide polymorphism (SNPs) based on a genome-wide association study (GWAS) and the remaining data were used for training and validation of genomic prediction. Prediction accuracy using selected variants from imputed sequence data was compared to that using a standard array of 50k SNP genotypes, thereby comparing genomic best linear prediction (GBLUP) and Bayesian methods (BayesR/BayesRC). Accuracy of genomic prediction was evaluated in two independent populations that were each lowly related to the training set, one being purebred Merino and the other crossbred Border Leicester x Merino sheep. Results A substantial improvement in prediction accuracy was observed when selected sequence variants were fitted alongside 50k genotypes as a separate variance component in GBLUP (2GBLUP) or in Bayesian analysis as a separate category of SNPs (BayesRC). From an average accuracy of 0.27 in both validation sets for the 50k array, the average absolute increase in accuracy across traits with 2GBLUP was 0.083 and 0.073 for purebred and crossbred animals, respectively, whereas with BayesRC it was 0.102 and 0.087. The average gain in accuracy was smaller when selected sequence variants were treated in the same category as 50k SNPs. Very little improvement over 50k prediction was observed when using all WGS variants. Conclusions Accuracy of genomic prediction in diverse sheep populations increased substantially by using variants selected from whole-genome sequence data based on an independent multi-breed GWAS, when compared to genomic prediction using standard 50K genotypes.
Collapse
|
34
|
Wu P, Wang K, Zhou J, Chen D, Yang Q, Yang X, Liu Y, Feng B, Jiang A, Shen L, Xiao W, Jiang Y, Zhu L, Zeng Y, Xu X, Li X, Tang G. GWAS on Imputed Whole-Genome Resequencing From Genotyping-by-Sequencing Data for Farrowing Interval of Different Parities in Pigs. Front Genet 2019; 10:1012. [PMID: 31681435 PMCID: PMC6813215 DOI: 10.3389/fgene.2019.01012] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2019] [Accepted: 09/23/2019] [Indexed: 12/30/2022] Open
Abstract
The whole-genome sequencing (WGS) data can potentially discover all genetic variants. Studies have shown the power of WGS for genome-wide association study (GWAS) lies in the ability to identify quantitative trait loci and nucleotides (QTNs). However, the resequencing of thousands of target individuals is expensive. Genotype imputation is a powerful approach for WGS and to identify causal mutations. This study aimed to evaluate the imputation accuracy from genotyping-by-sequencing (GBS) to WGS in two pig breeds using a resequencing reference population and to detect single-nucleotide polymorphisms (SNPs) and candidate genes for farrowing interval (FI) of different parities using the data before and after imputation for GWAS. Six hundred target pigs, 300 Landrace and 300 Large White pigs, were genotyped by GBS, and 60 reference pigs, 20 Landrace and 40 Large White pigs, were sequenced by whole-genome resequencing. Imputation for pigs was conducted using Beagle software. The average imputation accuracy (allelic R 2) from GBS to WGS was 0.42 for Landrace pigs and 0.45 for Large White pigs. For Landrace pigs (Large White pigs), 4,514,934 (5,533,290) SNPs had an accuracy >0.3, resulting an average accuracy of 0.73 (0.72), and 2,093,778 (2,468,645) SNPs had an accuracy >0.8, resulting an average accuracy of 0.94 (0.93). Association studies with data before and after imputation were performed for FI of different parities in two populations. Before imputation, 18 and 128 significant SNPs were detected for FI in Landrace and Large White pigs, respectively. After imputation, 125 and 27 significant SNPs were identified for dataset with an accuracy >0.3 and 0.8 in Large White pigs, and 113 and 18 SNPs were found among imputed sequence variants. Among these significant SNPs, six top SNPs were detected in both GBS data and imputed WGS data, namely, SSC2: 136127645, SSC5: 103426443, SSC6: 27811226, SSC10: 3609429, SSC14: 15199253, and SSC15: 150297519. Overall, many candidate genes could be involved in FI of different parities in pigs. Although imputation from GBS to WGS data resulted in a low imputation accuracy, association analyses with imputed WGS data were optimized to detect QTNs for complex trait. The obtained results provide new insight into genotype imputation, genetic architecture, and candidate genes for FI of different parities in Landrace and Large White pigs.
Collapse
Affiliation(s)
- Pingxian Wu
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Kai Wang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Jie Zhou
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Dejuan Chen
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Qiang Yang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Xidi Yang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yihui Liu
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Bo Feng
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Anan Jiang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Linyuan Shen
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Weihang Xiao
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yanzhi Jiang
- College of Life Science, Sichuan Agricultural University, Yaan, China
| | - Li Zhu
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yangshuang Zeng
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Xu Xu
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Xuewei Li
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Guoqing Tang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| |
Collapse
|
35
|
Li C, Li M, Li X, Ni W, Xu Y, Yao R, Wei B, Zhang M, Li H, Zhao Y, Liu L, Ullah Y, Jiang Y, Hu S. Whole-Genome Resequencing Reveals Loci Associated With Thoracic Vertebrae Number in Sheep. Front Genet 2019; 10:674. [PMID: 31379930 PMCID: PMC6657399 DOI: 10.3389/fgene.2019.00674] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 06/27/2019] [Indexed: 12/31/2022] Open
Abstract
The number of vertebrae, especially thoracic vertebrae, is an important economic trait that may influence carcass length and meat production in animals. However, the genetic basis of vertebrae number in sheep is still poorly understood. To detect the candidate genes, 400 increased number of thoracic vertebrae (T14L6) and 200 normal (T13L6) Kazakh sheep were collected. We generated and sequenced 60 pools of genomic DNA (each pool prepared by mixing genomic DNA from 10 sheep with the same thoracic traits), with an average depth of coverage of 25.65×. We identified a total of 42,075,402 SNPs and 11 putatively selected genomic regions, including the VRTN gene and the HoxA gene family that regulate vertebral development. The most prominent areas of selective elimination were located in a region of chromosome 7, including VRTN, which regulates spinal development and morphology. Further investigation indicated that the expression level of the VRTN gene during fetal development was significantly higher in sheep with more thoracic vertebrae than in those with a normal number of thoracic vertebrae. A genome-wide comparison between sheep with increased and normal numbers of thoracic vertebrae showed that the VRTN gene is the major selection locus for the number of thoracic vertebrae in sheep and has the potential to be utilized in sheep breeding in the future.
Collapse
Affiliation(s)
- Cunyuan Li
- College of Life Sciences, Shihezi University, Shihezi, China.,College of Animal Science and Technology, Shihezi University, Shihezi, China
| | - Ming Li
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Xiaoyue Li
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Wei Ni
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yueren Xu
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Rui Yao
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Bin Wei
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Mengdan Zhang
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Huixiang Li
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yue Zhao
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Li Liu
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yaseen Ullah
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yu Jiang
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Shengwei Hu
- College of Life Sciences, Shihezi University, Shihezi, China
| |
Collapse
|
36
|
Al Kalaldeh M, Gibson J, Duijvesteijn N, Daetwyler HD, MacLeod I, Moghaddar N, Lee SH, van der Werf JHJ. Using imputed whole-genome sequence data to improve the accuracy of genomic prediction for parasite resistance in Australian sheep. Genet Sel Evol 2019; 51:32. [PMID: 31242855 PMCID: PMC6595562 DOI: 10.1186/s12711-019-0476-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 06/18/2019] [Indexed: 01/16/2023] Open
Abstract
Background This study aimed at (1) comparing the accuracies of genomic prediction for parasite resistance in sheep based on whole-genome sequence (WGS) data to those based on 50k and high-density (HD) single nucleotide polymorphism (SNP) panels; (2) investigating whether the use of variants within quantitative trait loci (QTL) regions that were selected from regional heritability mapping (RHM) in an independent dataset improved the accuracy more than variants selected from genome-wide association studies (GWAS); and (3) comparing the prediction accuracies between variants selected from WGS data to variants selected from the HD SNP panel. Results The accuracy of genomic prediction improved marginally from 0.16 ± 0.02 and 0.18 ± 0.01 when using all the variants from 50k and HD genotypes, respectively, to 0.19 ± 0.01 when using all the variants from WGS data. Fitting a GRM from the selected variants alongside a GRM from the 50k SNP genotypes improved the prediction accuracy substantially compared to fitting the 50k SNP genotypes alone. The gain in prediction accuracy was slightly more pronounced when variants were selected from WGS data compared to when variants were selected from the HD panel. When sequence variants that passed the GWAS \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$- log_{10} (p\,value)$$\end{document}-log10(pvalue) threshold of 3 across the entire genome were selected, the prediction accuracy improved by 5% (up to 0.21 ± 0.01), whereas when selection was limited to sequence variants that passed the same GWAS \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$- log_{10} (p\,value)$$\end{document}-log10(pvalue) threshold of 3 in regions identified by RHM, the accuracy improved by 9% (up to 0.25 ± 0.01). Conclusions Our results show that through careful selection of sequence variants from the QTL regions, the accuracy of genomic prediction for parasite resistance in sheep can be improved. These findings have important implications for genomic prediction in sheep.
Collapse
Affiliation(s)
- Mohammad Al Kalaldeh
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| | - John Gibson
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Naomi Duijvesteijn
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Hans D Daetwyler
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Centre for AgriBioscience, Agriculture Victoria, Bundoora, VIC, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Iona MacLeod
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Centre for AgriBioscience, Agriculture Victoria, Bundoora, VIC, 3083, Australia
| | - Nasir Moghaddar
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Sang Hong Lee
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
| | - Julius H J van der Werf
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| |
Collapse
|