1
|
Zhou C, Zhang Y, Ma T, Wu D, Yang Y, Wang D, Li X, Guo S, Yang S, Song Y, Zhang Y, Zuo Y, Cao G. Whole-Genome Resequencing of Ujimqin Sheep Identifies Genes Associated with Vertebral Number. Animals (Basel) 2024; 14:677. [PMID: 38473062 DOI: 10.3390/ani14050677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 02/07/2024] [Accepted: 02/19/2024] [Indexed: 03/14/2024] Open
Abstract
The number of vertebrae is a crucial economic trait that can significantly impact the carcass length and meat production in animals. However, our understanding of the quantitative trait loci (QTLs) and candidate genes associated with the vertebral number in sheep (Ovis aries) remains limited. To identify these candidate genes and QTLs, we collected 73 Ujimqin sheep with increased numbers of vertebrae (T13L7, T14L6, and T14L7) and 23 sheep with normal numbers of vertebrae (T13L6). Through high-throughput genome resequencing, we obtained a total of 24,130,801 effective single-nucleotide polymorphisms (SNPs). By conducting a selective-sweep analysis, we discovered that the most significantly selective region was located on chromosome 7. Within this region, we identified several genes, including VRTN, SYNDIG1L, LTBP2, and ABCD4, known to regulate the spinal development and morphology. Further, a genome-wide association study (GWAS) performed on sheep with increased and normal vertebral numbers confirmed that ABCD4 is a candidate gene for determining the number of vertebrae in sheep. Additionally, the most significant SNP on chromosome 7 was identified as a candidate QTL. Moreover, we detected two missense mutations in the ABCD4 gene; one of these mutations (Chr7: 89393414, C > T) at position 22 leads to the conversion of arginine (Arg) to glutamine (Gln), which is expected to negatively affect the protein's function. Notably, a transcriptome expression profile in mouse embryonic development revealed that ABCD4 is highly expressed during the critical period of vertebral formation (4.5-7.5 days). Our study highlights ABCD4 as a potential major gene influencing the number of vertebrae in Ujimqin sheep, with promising prospects for future genome-assisted breeding improvements in sheep.
Collapse
Affiliation(s)
- Chuanqing Zhou
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot 010010, China
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010020, China
| | - Yue Zhang
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot 010010, China
| | - Teng Ma
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot 010010, China
| | - Dabala Wu
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot 010070, China
| | - Yanyan Yang
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot 010070, China
| | - Daqing Wang
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot 010070, China
| | - Xiunan Li
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot 010070, China
| | - Shuchun Guo
- Inner Mongolia Academy of Agricultural and Animal Husbandry Sciences, Hohhot 010070, China
| | - Siqi Yang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010020, China
| | - Yongli Song
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010020, China
| | - Yong Zhang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010020, China
- Key Laboratory of Animal Biotechnology of the Ministry of Agriculture, College of Veterinary Medicine, Northwest A&F University, Yangling, Xianyang 712100, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010020, China
| | - Guifang Cao
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot 010010, China
| |
Collapse
|
2
|
Khan MZ, Chen W, Huang B, Liu X, Wang X, Liu Y, Chai W, Wang C. Advancements in Genetic Marker Exploration for Livestock Vertebral Traits with a Focus on China. Animals (Basel) 2024; 14:594. [PMID: 38396562 PMCID: PMC10885964 DOI: 10.3390/ani14040594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 01/29/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024] Open
Abstract
In livestock breeding, the number of vertebrae has gained significant attention due to its impact on carcass quality and quantity. Variations in vertebral traits have been observed across different animal species and breeds, with a strong correlation to growth and meat production. Furthermore, vertebral traits are classified as quantitative characteristics. Molecular marker techniques, such as marker-assisted selection (MAS), have emerged as efficient tools to identify genetic markers associated with vertebral traits. In the current review, we highlight some key potential genes and their polymorphisms that play pivotal roles in controlling vertebral traits (development, length, and number) in various livestock species, including pigs, donkeys, and sheep. Specific genetic variants within these genes have been linked to vertebral development, number, and length, offering valuable insights into the genetic mechanisms governing vertebral traits. This knowledge has significant implications for selective breeding strategies to enhance structural characteristics and meat quantity and quality in livestock, ultimately improving the efficiency and quality of the animal husbandry industry.
Collapse
Affiliation(s)
- Muhammad Zahoor Khan
- Liaocheng Research Institute of Donkey High-Efficiency Breeding and Ecological Feeding, Liaocheng University, Liaocheng 522000, China
| | | | | | | | | | | | | | - Changfa Wang
- Liaocheng Research Institute of Donkey High-Efficiency Breeding and Ecological Feeding, Liaocheng University, Liaocheng 522000, China
| |
Collapse
|
3
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
4
|
Teng J, Wang D, Zhao C, Zhang X, Chen Z, Liu J, Sun D, Tang H, Wang W, Li J, Mei C, Yang Z, Ning C, Zhang Q. Longitudinal genome-wide association studies of milk production traits in Holstein cattle using whole-genome sequence data imputed from medium-density chip data. J Dairy Sci 2023; 106:2535-2550. [PMID: 36797187 DOI: 10.3168/jds.2022-22277] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 10/20/2022] [Indexed: 02/16/2023]
Abstract
Longitudinal traits, such as milk production traits in dairy cattle, are featured by having phenotypic values at multiple time points, which change dynamically over time. In this study, we first imputed SNP chip (50-100K) data to whole-genome sequence (WGS) data in a Chinese Holstein population consisting of 6,470 cows. The imputation accuracies were 0.88 to 0.97 on average after quality control. We then performed longitudinal GWAS in this population based on a random regression test-day model using the imputed WGS data. The longitudinal GWAS revealed 16, 39, and 75 quantitative trait locus regions associated with milk yield, fat percentage, and protein percentage, respectively. We estimated the 95% confidence intervals (CI) for these quantitative trait locus regions using the logP drop method and identified 581 genes involved in these CI. Further, we focused on the CI that covered or overlapped with only 1 gene or the CI that contained an extremely significant top SNP. Twenty-eight candidate genes were identified in these CI. Most of them have been reported in the literature to be associated with milk production traits, such as DGAT1, HSF1, MGST1, GHR, ABCG2, ADCK5, and CSN1S1. Among the unreported novel genes, some also showed good potential as candidate genes, such as CCSER1, CUX2, SNTB1, RGS7, OSR2, and STK3, and are worth being further investigated. Our study provided not only new insights into the candidate genes for milk production traits, but also a general framework for longitudinal GWAS based on random regression test-day model using WGS data.
Collapse
Affiliation(s)
- Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Zhi Chen
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Jianfeng Liu
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Dongxiao Sun
- College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Hui Tang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Wenwen Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China
| | - Jianbin Li
- Institute of Animal Science and Veterinary Medicine, Shandong Academy of Agricultural Sciences, Jinan 250100, China
| | - Cheng Mei
- Dongying Shenzhou AustAsia Modern Dairy Farm Co. Ltd., Dongying 257200, China
| | - Zhangping Yang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Technology, Shandong Agricultural University, Tai'an 271018, China.
| |
Collapse
|
5
|
Heidaritabar M, Huisman A, Krivushin K, Stothard P, Dervishi E, Charagu P, Bink MCAM, Plastow GS. Imputation to whole-genome sequence and its use in genome-wide association studies for pork colour traits in crossbred and purebred pigs. Front Genet 2022; 13:1022681. [PMID: 36303553 PMCID: PMC9593086 DOI: 10.3389/fgene.2022.1022681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 09/22/2022] [Indexed: 11/16/2022] Open
Abstract
Imputed whole-genome sequence (WGS) has been proposed to improve genome-wide association studies (GWAS), since all causative mutations responsible for phenotypic variation are expected to be present in the data. This approach was applied on a large number of purebred (PB) and crossbred (CB) pigs for 18 pork color traits to evaluate the impact of using imputed WGS relative to medium-density marker panels. The traits included Minolta A*, B*, and L* for fat (FCOL), quadriceps femoris muscle (QFCOL), thawed loin muscle (TMCOL), fresh ham gluteus medius (GMCOL), ham iliopsoas muscle (ICOL), and longissimus dorsi muscle on the fresh loin (FMCOL). Sequence variants were imputed from a medium-density marker panel (61K for CBs and 50K for PBs) in all genotyped pigs using BeagleV5.0. We obtained high imputation accuracy (average of 0.97 for PBs and 0.91 for CBs). GWAS were conducted for three datasets: 954 CBs and 891 PBs, and the combined CBs and PBs. For most traits, no significant associations were detected, regardless of panel density or population type. However, quantitative trait loci (QTL) regions were only found for a few traits including TMCOL Minolta A* and GMCOL Minolta B* (CBs), FMCOL Minolta B*, FMCOL Minolta L*, and ICOL Minolta B* (PBs) and FMCOL Minolta A*, FMCOL Minolta B*, GMCOL Minolta B*, and ICOL Minolta B* (Combined dataset). More QTL regions were identified with WGS (n = 58) relative to medium-density marker panels (n = 22). Most of the QTL were linked to previously reported QTLs or candidate genes that have been previously reported to be associated with meat quality, pH and pork color; e.g., VIL1, PRKAG3, TTLL4, and SLC11A1, USP37. CTDSP1 gene on SSC15 has not been previously associated with meat color traits in pigs. The findings suggest any added value of WGS was only for detecting novel QTL regions when the sample size is sufficiently large as with the Combined dataset in this study. The percentage of phenotypic variance explained by the most significant SNPs also increased with WGS compared with medium-density panels. The results provide additional insights into identification of a number of candidate regions and genes for pork color traits in different pig populations.
Collapse
Affiliation(s)
- Marzieh Heidaritabar
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
- *Correspondence: Marzieh Heidaritabar,
| | - Abe Huisman
- Hendrix Genetics Research, Boxmeer, Netherlands
| | - Kirill Krivushin
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Elda Dervishi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | | | | | - Graham S. Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
6
|
Jiang Y, Song H, Gao H, Zhang Q, Ding X. Exploring the optimal strategy of imputation from SNP array to whole-genome sequencing data in farm animals. Front Genet 2022; 13:963654. [PMID: 36092888 PMCID: PMC9459117 DOI: 10.3389/fgene.2022.963654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 08/01/2022] [Indexed: 11/13/2022] Open
Abstract
Genotype imputation from BeadChip to whole-genome sequencing (WGS) data is a cost-effective method of obtaining genotypes of WGS variants. Beagle, one of the most popular imputation software programs, has been widely used for genotype inference in humans and non-human species. A few studies have systematically and comprehensively compared the performance of beagle versions and parameter settings of farm animals. Here, we investigated the imputation performance of three representative versions of Beagle (Beagle 4.1, Beagle 5.0, and Beagle 5.4), and the effective population size (Ne) parameter setting for three species (cattle, pig, and chicken). Six scenarios were investigated to explore the impact of certain key factors on imputation performance. The results showed that the default Ne (1,000,000) is not suitable for livestock and poultry in small reference or low-density arrays of target panels, with 2.47%–10.45% drops in accuracy. Beagle 5 significantly reduced the computation time (4.66-fold–13.24-fold) without an accuracy loss. In addition, using a large combined-reference panel or high-density chip provides greater imputation accuracy, especially for low minor allele frequency (MAF) variants. Finally, a highly significant correlation in the measures of imputation accuracy can be obtained with an MAF equal to or greater than 0.05.
Collapse
Affiliation(s)
- Yifan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hailiang Song
- Beijing Key Laboratory of Fisheries Biotechnology, Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Hongding Gao
- Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
- *Correspondence: Xiangdong Ding,
| |
Collapse
|
7
|
Wang Z, Zhang Z, Chen Z, Sun J, Cao C, Wu F, Xu Z, Zhao W, Sun H, Guo L, Zhang Z, Wang Q, Pan Y. PHARP: a pig haplotype reference panel for genotype imputation. Sci Rep 2022; 12:12645. [PMID: 35879321 PMCID: PMC9314402 DOI: 10.1038/s41598-022-15851-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 06/30/2022] [Indexed: 11/18/2022] Open
Abstract
Pigs not only function as a major meat source worldwide but also are commonly used as an animal model for studying human complex traits. A large haplotype reference panel has been used to facilitate efficient phasing and imputation of relatively sparse genome-wide microarray chips and low-coverage sequencing data. Using the imputed genotypes in the downstream analysis, such as GWASs, TWASs, eQTL mapping and genomic prediction (GS), is beneficial for obtaining novel findings. However, currently, there is still a lack of publicly available and high-quality pig reference panels with large sample sizes and high diversity, which greatly limits the application of genotype imputation in pigs. In response, we built the pig Haplotype Reference Panel (PHARP) database. PHARP provides a reference panel of 2012 pig haplotypes at 34 million SNPs constructed using whole-genome sequence data from more than 49 studies of 71 pig breeds. It also provides Web-based analytical tools that allow researchers to carry out phasing and imputation consistently and efficiently. PHARP is freely accessible at http://alphaindex.zju.edu.cn/PHARP/index.php . We demonstrate its applicability for pig commercial 50 K SNP arrays, by accurately imputing 2.6 billion genotypes at a concordance rate value of 0.971 in 81 Large White pigs (~ 17 × sequencing coverage). We also applied our reference panel to impute the low-density SNP chip into the high-density data for three GWASs and found novel significantly associated SNPs that might be casual variants.
Collapse
Affiliation(s)
- Zhen Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhenyang Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zitao Chen
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Jiabao Sun
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Caiyun Cao
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Fen Wu
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China
| | - Zhong Xu
- Hubei Key Laboratory of Animal Embryo and Molecular Breeding, Institute of Animal Husbandry and Veterinary, Hubei Provincial Academy of Agricultural Sciences, Wuhan, 430064, China
| | - Wei Zhao
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hao Sun
- Department of Animal Science, School of Animal Science, Jilin University, Changchun, 130062, China
| | - Longyu Guo
- Department of Animal Science, School of Agriculture and Biology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhe Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Qishan Wang
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| | - Yuchun Pan
- College of Animal Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.
| |
Collapse
|
8
|
Xu P, Li D, Wu Z, Ni L, Liu J, Tang Y, Yu T, Ren J, Zhao X, Huang M. An imputation-based genome-wide association study for growth and fatness traits in Sujiang pigs. Animal 2022; 16:100591. [PMID: 35872387 DOI: 10.1016/j.animal.2022.100591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 06/15/2022] [Accepted: 06/16/2022] [Indexed: 11/01/2022] Open
Abstract
Sujiang pigs are a synthetic breed derived from Jiangquhai, Fengjing, and Duroc pigs. In this study, we sequenced the genome of 62 pigs with a coverage depth of 10× to 20×, including 27 Sujiang and 35 founder breed pigs, and we collected 360 global pigs' genome sequence data from public databases including 39 Duroc pigs. We obtained a high-quality variant dataset of 365 Sujiang pigs by imputing the porcine 80 K single nucleotide polymorphism (SNP) Beadchip to the whole-genome scale with a total of 422 pigs as a reference panel. A dataset of 365 imputated Sujiang pigs was used to perform single-trait genome-wide association study (GWAS) and meta-analyses for growth and fatness traits. Single-trait GWAS identified 1 907, 18, and 14 SNPs surpassing the suggestively significant threshold for backfat thickness, chest circumference, and chest width, respectively. Meta-analyses identified 2 400 genome-wide significant SNPs and 520 suggestively significant SNPs for backfat thickness and chest circumference, and 719 genome-wide significant SNPs and 1 225 suggestively significant SNPs for all seven traits. According to the meta-analysis of backfat thickness and chest circumference, a remarkable region of 2.69 Mb on Sus scrofa chromosome 4 containing FAM110B, IMPAD1, LYN, MOS, PENK, PLAG1, SDR16C5 and XKR4 was identified as a candidate region. The haplotype heat map of the 2.69 Mb region verified that Sujiang pigs were derived from Duroc and Chinese indigenous pigs, especially Jiangquhai pigs. The Kruskal-Wallis test showed that haplotypes of the 2.69 Mb region significantly affected backfat thickness and chest circumference traits. We then focused on PLAG1, an important growth-related gene, and identified two synonymous SNPs with obvious differences among different breeds in the PLAG1 gene. We then performed genotyping of 365 Sujiang, 150 Duroc, 95 Jiangquhai, and 100 Fengjing pigs to confirm the above result and verified that the two variants significantly affected phenotypes of growth and fatness traits. Our findings not only provide insights into the genetic architecture of porcine growth and fatness traits but also provide potential markers for selective breeding of these traits in Sujiang pigs.
Collapse
Affiliation(s)
- Pan Xu
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Desen Li
- College of Animal Science, South China Agricultural University, Guangzhou, PR China
| | - Zhongping Wu
- Zhongkai University of Agriculture and Engineering, Guangzhou, PR China
| | - Ligang Ni
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Jiaxing Liu
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Ying Tang
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Tongshun Yu
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Jun Ren
- College of Animal Science, South China Agricultural University, Guangzhou, PR China
| | - Xuting Zhao
- School of Animal Science and Technology, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, PR China
| | - Min Huang
- College of Animal Science, South China Agricultural University, Guangzhou, PR China.
| |
Collapse
|
9
|
Reich P, Falker-Gieske C, Pook T, Tetens J. Development and validation of a horse reference panel for genotype imputation. Genet Sel Evol 2022; 54:49. [PMID: 35787788 PMCID: PMC9252005 DOI: 10.1186/s12711-022-00740-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/23/2022] [Indexed: 11/10/2022] Open
Abstract
Background Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. Results Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. Conclusions The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00740-8.
Collapse
Affiliation(s)
- Paula Reich
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.
| | - Clemens Falker-Gieske
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Torsten Pook
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| | - Jens Tetens
- Department of Animal Sciences, Georg-August-University Göttingen, 37077, Göttingen, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, 37075, Göttingen, Germany
| |
Collapse
|
10
|
Ye H, Zhang Z, Ren D, Cai X, Zhu Q, Ding X, Zhang H, Zhang Z, Li J. Genomic Prediction Using LD-Based Haplotypes in Combined Pig Populations. Front Genet 2022; 13:843300. [PMID: 35754827 PMCID: PMC9218795 DOI: 10.3389/fgene.2022.843300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2021] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
The size of reference population is an important factor affecting genomic prediction. Thus, combining different populations in genomic prediction is an attractive way to improve prediction ability. However, combining multireference population roughly cannot increase the prediction accuracy as well as expected in pig. This may be due to different linkage disequilibrium (LD) pattern differences between population. In this study, we used the imputed whole-genome sequencing (WGS) data to construct LD-based haplotypes for genomic prediction in combined population to explore the impact of different single-nucleotide polymorphism (SNP) densities, variant representation (SNPs or haplotype alleles), and reference population size on the prediction accuracy for reproduction traits. Our results showed that genomic best linear unbiased prediction (GBLUP) using the WGS data can improve prediction accuracy in multi-population but not within-population. Not only the genomic prediction accuracy of the haplotype method using 80 K chip data in multi-population but also GBLUP for the multi-population (3.4–5.9%) was higher than that within-population (1.2–4.3%). More importantly, we have found that using the haplotype method based on the WGS data in multi-population has better genomic prediction performance, and our results showed that building haploblock in this scenario based on low LD threshold (r2 = 0.2–0.3) produced an optimal set of variables for reproduction traits in Yorkshire pig population. Our results suggested that whether the use of the haplotype method based on the chip data or GBLUP (individual SNP method) based on the WGS data were beneficial for genomic prediction in multi-population, while simultaneously combining the haplotype method and WGS data was a better strategy for multi-population genomic evaluation.
Collapse
Affiliation(s)
- Haoqiang Ye
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zipeng Zhang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Duanyang Ren
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiaodian Cai
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Qianghui Zhu
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture and Rural Affairs, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hao Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zhe Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
11
|
Wang J, Yu J, Lipka AE, Zhang Z. Interpretation of Manhattan Plots and Other Outputs of Genome-Wide Association Studies. Methods Mol Biol 2022; 2481:63-80. [PMID: 35641759 DOI: 10.1007/978-1-0716-2237-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
With increasing marker density, estimation of recombination rate between a marker and a causal mutation using linkage analysis becomes less important. Instead, linkage disequilibrium (LD) becomes the major indicator for gene mapping through genome-wide association studies (GWAS). In addition to the linkage between the marker and the causal mutation, many other factors may contribute to the LD, including population structure and cryptic relationships among individuals. As statistical methods and software evolve to improve statistical power and computing speed in GWAS, the corresponding outputs must also evolve to facilitate the interpretation of input data, the analytical process, and final association results. In this chapter, our descriptions focus on (1) considerations in creating a Manhattan plot displaying the strength of LD and locations of markers across a genome; (2) criteria for genome-wide significance threshold and the different appearance of Manhattan plots in single-locus and multiple-locus models; (3) exploration of population structure and kinship among individuals; (4) quantile-quantile (QQ) plot; (5) LD decay across the genome and LD between the associated markers and their neighbors; (6) exploration of individual and marker information on Manhattan and QQ plots via interactive visualization using HTML. The ultimate objective of this chapter is to help users to connect input data to GWAS outputs to balance power and false positives, and connect GWAS outputs to the selection of candidate genes using LD extent.
Collapse
Affiliation(s)
- Jiabo Wang
- Key Laboratory of Qinghai-Tibetan Plateau Animal Genetic Resource Reservation and Utilization, Sichuan Province and Ministry of Education, Southwest Minzu University, Chengdu, Sichuan, China.
| | - Jianming Yu
- Department of Agronomy, Iowa State University, Ames, IA, USA
| | - Alexander E Lipka
- Department of Crop Sciences, University of Illinois, Urbana, IL, USA
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA
| |
Collapse
|
12
|
Pook T, Nemri A, Gonzalez Segovia EG, Valle Torres D, Simianer H, Schoen CC. Increasing calling accuracy, coverage, and read-depth in sequence data by the use of haplotype blocks. PLoS Genet 2021; 17:e1009944. [PMID: 34941872 PMCID: PMC8699914 DOI: 10.1371/journal.pgen.1009944] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 11/13/2021] [Indexed: 01/16/2023] Open
Abstract
High-throughput genotyping of large numbers of lines remains a key challenge in plant genetics, requiring geneticists and breeders to find a balance between data quality and the number of genotyped lines under a variety of different existing genotyping technologies when resources are limited. In this work, we are proposing a new imputation pipeline (“HBimpute”) that can be used to generate high-quality genomic data from low read-depth whole-genome-sequence data. The key idea of the pipeline is the use of haplotype blocks from the software HaploBlocker to identify locally similar lines and subsequently use the reads of all locally similar lines in the variant calling for a specific line. The effectiveness of the pipeline is showcased on a dataset of 321 doubled haploid lines of a European maize landrace, which were sequenced at 0.5X read-depth. The overall imputing error rates are cut in half compared to state-of-the-art software like BEAGLE and STITCH, while the average read-depth is increased to 83X, thus enabling the calling of copy number variation. The usefulness of the obtained imputed data panel is further evaluated by comparing the performance of sequence data in common breeding applications to that of genomic data generated with a genotyping array. For both genome-wide association studies and genomic prediction, results are on par or even slightly better than results obtained with high-density array data (600k). In particular for genomic prediction, we observe slightly higher data quality for the sequence data compared to the 600k array in the form of higher prediction accuracies. This occurred specifically when reducing the data panel to the set of overlapping markers between sequence and array, indicating that sequencing data can benefit from the same marker ascertainment as used in the array process to increase the quality and usability of genomic data. High-throughput genotyping of large numbers of lines remains a key challenge in plant genetics and breeding. Cost, precision, and throughput must be balanced to achieve optimal efficiency given available technologies and finite resources. Although genotyping arrays are still considered the gold standard in high-throughput quantitative genetics, recent advances in sequencing provide new opportunities. Both the quality and cost of genomic data generated based on sequencing are highly dependent on the used read-depth. In this work, we propose a new imputation pipeline (“HBimpute”) that uses haplotype blocks to detect individuals of the same genetic origin and subsequently uses all reads of those individuals in the variant calling. Thus, the obtained virtual read-depth is artificially increased, leading to higher calling accuracy, coverage, and the ability to call copy number variation based on low read-depth sequencing data. To conclude, our approach makes sequencing a cost-competitive alternative to genotyping arrays with the added benefit of allowing the calling of structural variation.
Collapse
Affiliation(s)
- Torsten Pook
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
- * E-mail:
| | | | | | - Daniel Valle Torres
- Plant Breeding, Technical University of Munich, TUM School of Life Sciences Weihenstephan, Freising, Germany
| | - Henner Simianer
- Center for Integrated Breeding Research, Animal Breeding and Genetics Group, University of Goettingen, Goettingen, Germany
| | - Chris-Carolin Schoen
- Plant Breeding, Technical University of Munich, TUM School of Life Sciences Weihenstephan, Freising, Germany
| |
Collapse
|
13
|
Yan G, Liu X, Xiao S, Xin W, Xu W, Li Y, Huang T, Qin J, Xie L, Ma J, Zhang Z, Huang L. An imputed whole-genome sequence-based GWAS approach pinpoints causal mutations for complex traits in a specific swine population. SCIENCE CHINA-LIFE SCIENCES 2021; 65:781-794. [PMID: 34387836 DOI: 10.1007/s11427-020-1960-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Accepted: 05/19/2021] [Indexed: 01/08/2023]
Abstract
Sequencing-based genome-wide association studies (GWAS) have facilitated the identification of causal associations between genetic variants and traits in diverse species. However, it is cost-prohibitive for the majority of research groups to sequence a large number of samples. Here, we carried out genotype imputation to increase the density of single nucleotide polymorphisms in a large-scale Swine F2 population using a reference panel including 117 individuals, followed by a series of GWAS analyses. The imputation accuracies reached 0.89 and 0.86 for allelic concordance and correlation, respectively. A quantitative trait nucleotide (QTN) affecting the chest vertebrate was detected directly, while the investigation of another QTN affecting the residual glucose failed due to the presence of similar haplotypes carrying wild-type and mutant allelesin the reference panel used in this study. A high imputation accuracy was confirmed by Sanger sequencing technology for the most significant loci. Two candidate genes, CPNE5 and MYH3, affecting meat-related traits were proposed. Collectively, we illustrated four scenarios in imputation-based GWAS that may be encountered by researchers, and our results will provide an extensive reference for future genotype imputation-based GWAS analyses in the future.
Collapse
Affiliation(s)
- Guorong Yan
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
- Institute of Photomedicine, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, 200092, China
| | - Xianxian Liu
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Shijun Xiao
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Wenshui Xin
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Wenwu Xu
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Yiping Li
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Tao Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Jiangtao Qin
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Lei Xie
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Junwu Ma
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China.
| | - Zhiyan Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China.
| | - Lusheng Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, 330045, China
| |
Collapse
|
14
|
Yang R, Guo X, Zhu D, Tan C, Bian C, Ren J, Huang Z, Zhao Y, Cai G, Liu D, Wu Z, Wang Y, Li N, Hu X. Accelerated deciphering of the genetic architecture of agricultural economic traits in pigs using a low-coverage whole-genome sequencing strategy. Gigascience 2021; 10:giab048. [PMID: 34282453 PMCID: PMC8290195 DOI: 10.1093/gigascience/giab048] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 03/21/2021] [Accepted: 06/15/2021] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Uncovering the genetic architecture of economic traits in pigs is important for agricultural breeding. However, high-density haplotype reference panels are unavailable in most agricultural species, limiting accurate genotype imputation in large populations. Moreover, the infinitesimal model of quantitative traits implies that weak association signals tend to be spread across most of the genome, further complicating the genetic analysis. Hence, there is a need to develop new methods for sequencing large cohorts without large reference panels. RESULTS We describe a Tn5-based highly accurate, cost- and time-efficient, low-coverage sequencing method to obtain 11.3 million whole-genome single-nucleotide polymorphisms in 2,869 Duroc boars at a mean depth of 0.73×. On the basis of these single-nucleotide polymorphisms, a genome-wide association study was performed, resulting in 14 quantitative trait loci (QTLs) for 7 of 21 important agricultural traits in pigs. These QTLs harbour genes, such as ABCD4 for total teat number and HMGA1 for back fat thickness, and provided a starting point for further investigation. The inheritance models of the different traits varied greatly. Most follow the minor-polygene model, but this can be attributed to different reasons, such as the shaping of genetic architecture by artificial selection for this population and sufficiently interconnected minor gene regulatory networks. CONCLUSIONS Genome-wide association study results for 21 important agricultural traits identified 14 QTLs/genes and showed their genetic architectures, providing guidance for genetic improvement harnessing genomic features. The Tn5-based low-coverage sequencing method can be applied to large-scale genome studies for any species without a good reference panel and can be used for agricultural breeding.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Xiaoli Guo
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Cheng Tan
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Dewu Liu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, No. 483 Wushan road, Tianhe district, Guangdong 510640, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, No. 2 Yuanmingyuan west road, Haidian district, Beijing 100193, China
| |
Collapse
|
15
|
Liu H, Song H, Jiang Y, Jiang Y, Zhang F, Liu Y, Shi Y, Ding X, Wang C. A Single-Step Genome Wide Association Study on Body Size Traits Using Imputation-Based Whole-Genome Sequence Data in Yorkshire Pigs. Front Genet 2021; 12:629049. [PMID: 34276758 PMCID: PMC8283822 DOI: 10.3389/fgene.2021.629049] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 05/10/2021] [Indexed: 11/13/2022] Open
Abstract
The body shape of a pig is the most direct production index, which can fully reflect the pig’s growth status and is closely related to important economic traits. In this study, a genome-wide association study on seven body size traits, the body length (BL), height (BH), chest circumference (CC), abdominal circumference (AC), cannon bone circumference (CBC), rump width (RW), and chest width (CW), were conducted in Yorkshire pigs. Illumina Porcine 80K SNP chips were used to genotype 589 of 5,572 Yorkshire pigs with body size records, and then the chip data was imputed to sequencing data. After quality control of imputed sequencing data, 784,267 SNPs were obtained, and the averaged linkage disequilibrium (r2) was 0.191. We used the single-trait model and the two-trait model to conduct single-step genome wide association study (ssGWAS) on seven body size traits; a total of 198 significant SNPS were finally identified according to the P-value and the contribution to the genetic variance of individual SNP. 11 candidate genes (CDH13, SIL1, CDC14A, TMRPSS15, TRAPPC9, CTNND2, KDM6B, CHD3, MUC13, MAPK4, and HMGA1) were found to be associated with body size traits in pigs; KDM6B and CHD3 jointly affect AC and CC, and MUC13 jointly affect RW and CW. These genes are involved in the regulation of bone growth and development as well as the absorption of nutrients and are associated with obesity. HMGA1 is proposed as a strong candidate gene for body size traits because of its important function and high consistency with other studies regarding the regulation of body size traits. Our results could provide valuable information for pig breeding based on molecular breeding.
Collapse
Affiliation(s)
- Huatao Liu
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hailiang Song
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yifan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yao Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Fengxia Zhang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yibing Liu
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Yong Shi
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Chuduan Wang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
16
|
Deng Z, Huang T, Yan G, Yang B, Zhang Z, Xiao S, Ai H, Huang L. A further look at quantitative trait loci for growth and fatness traits in a White Duroc × Erhualian F 3 intercross population. Anim Biotechnol 2021; 33:1205-1216. [PMID: 34010090 DOI: 10.1080/10495398.2021.1884087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Genetic analysis of porcine growth and fatness traits is beneficial to the swine industry and provides a reference to understand human obesity. Here, we obtained 29 growth and fatness traits for 473 individuals from a White Duroc × Erhualian F3 intercross population. Basic statistical analyses showed that: (1) Positive correlations between different-stage body weights were detected, the shorter the time interval the stronger the correlation. (2) Strong correlations existed in the paired fatness traits. (3) With the growth of age, the correlation between fatness and body weight was increasing. All pigs were genotyped by Illumina 50 K SNP chips and their whole-genome genotypes were imputed referred to 109 re-sequencing data. We performed common and imputation-based GWASs for these traits. Two genome-wide significant loci on swine chromosome (SSC) 4 and 7 were repeatedly detected. The strongest association (P = 3.24 × 10-19) was detected at 31.96 Mb on SSC7 for leaf fat weight. On this locus, seven major haplotypes were identified, of which two were novel and had an increasing-fatness effect. In the imputation-based GWAS, three new loci were identified. Our findings provide further insights into and enhance our understanding of genetic mechanism of porcine growth and fat deposition.
Collapse
Affiliation(s)
- Zheng Deng
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Tao Huang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Guorong Yan
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Bin Yang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Zhiyan Zhang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Shijun Xiao
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Huashui Ai
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Lusheng Huang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| |
Collapse
|
17
|
Torkamaneh D, Laroche J, Valliyodan B, O'Donoughue L, Cober E, Rajcan I, Vilela Abdelnoor R, Sreedasyam A, Schmutz J, Nguyen HT, Belzile F. Soybean (Glycine max) Haplotype Map (GmHapMap): a universal resource for soybean translational and functional genomics. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:324-334. [PMID: 32794321 DOI: 10.1101/534578] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 07/24/2020] [Accepted: 08/07/2020] [Indexed: 05/27/2023]
Abstract
Here, we describe a worldwide haplotype map for soybean (GmHapMap) constructed using whole-genome sequence data for 1007 Glycine max accessions and yielding 14.9 million variants as well as 4.3 M tag single-nucleotide polymorphisms (SNPs). When sampling random subsets of these accessions, the number of variants and tag SNPs plateaued beyond approximately 800 and 600 accessions, respectively. This suggests extensive coverage of diversity within the cultivated soybean. GmHapMap variants were imputed onto 21 618 previously genotyped accessions with up to 96% success for common alleles. A local association analysis was performed with the imputed data using markers located in a 1-Mb region known to contribute to seed oil content and enabled us to identify a candidate causal SNP residing in the NPC1 gene. We determined gene-centric haplotypes (407 867 GCHs) for the 55 589 genes and showed that such haplotypes can help to identify alleles that differ in the resulting phenotype. Finally, we predicted 18 031 putative loss-of-function (LOF) mutations in 10 662 genes and illustrated how such a resource can be used to explore gene function. The GmHapMap provides a unique worldwide resource for applied soybean genomics and breeding.
Collapse
Affiliation(s)
- Davoud Torkamaneh
- Département de Phytologie, Université Laval, Québec City, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
- Department of Plant Agriculture, University of Guelph, Guelph, ON, Canada
| | - Jérôme Laroche
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| | - Babu Valliyodan
- National Center for Soybean Biotechnology and Division of Plant Sciences, University of Missouri, Columbia, MO, USA
| | - Louise O'Donoughue
- CÉROM, Centre de recherche Sur Les Grains Inc., Saint-Mathieu de Beloeil, QC, Canada
| | - Elroy Cober
- Agriculture and Agri-Food Canada, Ottawa, ON, Canada
| | - Istvan Rajcan
- Department of Plant Agriculture, University of Guelph, Guelph, ON, Canada
| | - Ricardo Vilela Abdelnoor
- Brazilian Corporation of Agricultural Research (Embrapa Soja), Warta County, PR, Brazil
- Londrina State University (UEL), Londrina, PR, Brazil
| | | | - Jeremy Schmutz
- Institute for Biotechnology, HudsonAlpha, Huntsville, AL, USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, USA
| | - Henry T Nguyen
- National Center for Soybean Biotechnology and Division of Plant Sciences, University of Missouri, Columbia, MO, USA
| | - François Belzile
- Département de Phytologie, Université Laval, Québec City, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| |
Collapse
|
18
|
Torkamaneh D, Laroche J, Valliyodan B, O’Donoughue L, Cober E, Rajcan I, Vilela Abdelnoor R, Sreedasyam A, Schmutz J, Nguyen HT, Belzile F. Soybean (Glycine max) Haplotype Map (GmHapMap): a universal resource for soybean translational and functional genomics. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:324-334. [PMID: 32794321 PMCID: PMC7868971 DOI: 10.1111/pbi.13466] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 07/24/2020] [Accepted: 08/07/2020] [Indexed: 05/10/2023]
Abstract
Here, we describe a worldwide haplotype map for soybean (GmHapMap) constructed using whole-genome sequence data for 1007 Glycine max accessions and yielding 14.9 million variants as well as 4.3 M tag single-nucleotide polymorphisms (SNPs). When sampling random subsets of these accessions, the number of variants and tag SNPs plateaued beyond approximately 800 and 600 accessions, respectively. This suggests extensive coverage of diversity within the cultivated soybean. GmHapMap variants were imputed onto 21 618 previously genotyped accessions with up to 96% success for common alleles. A local association analysis was performed with the imputed data using markers located in a 1-Mb region known to contribute to seed oil content and enabled us to identify a candidate causal SNP residing in the NPC1 gene. We determined gene-centric haplotypes (407 867 GCHs) for the 55 589 genes and showed that such haplotypes can help to identify alleles that differ in the resulting phenotype. Finally, we predicted 18 031 putative loss-of-function (LOF) mutations in 10 662 genes and illustrated how such a resource can be used to explore gene function. The GmHapMap provides a unique worldwide resource for applied soybean genomics and breeding.
Collapse
Affiliation(s)
- Davoud Torkamaneh
- Département de PhytologieUniversité LavalQuébec CityQCCanada
- Institut de Biologie Intégrative et des Systèmes (IBIS)Université LavalQuébec CityQCCanada
- Department of Plant AgricultureUniversity of GuelphGuelphONCanada
| | - Jérôme Laroche
- Institut de Biologie Intégrative et des Systèmes (IBIS)Université LavalQuébec CityQCCanada
| | - Babu Valliyodan
- National Center for Soybean Biotechnology and Division of Plant SciencesUniversity of MissouriColumbiaMOUSA
| | - Louise O’Donoughue
- CÉROMCentre de recherche Sur Les Grains Inc.Saint‐Mathieu de BeloeilQCCanada
| | - Elroy Cober
- Agriculture and Agri‐Food CanadaOttawaONCanada
| | - Istvan Rajcan
- Department of Plant AgricultureUniversity of GuelphGuelphONCanada
| | - Ricardo Vilela Abdelnoor
- Brazilian Corporation of Agricultural Research (Embrapa Soja)Warta CountyPRBrazil
- Londrina State University (UEL)LondrinaPRBrazil
| | | | - Jeremy Schmutz
- Institute for BiotechnologyHudsonAlphaHuntsvilleALUSA
- Department of EnergyJoint Genome InstituteWalnut CreekCAUSA
| | - Henry T. Nguyen
- National Center for Soybean Biotechnology and Division of Plant SciencesUniversity of MissouriColumbiaMOUSA
| | - François Belzile
- Département de PhytologieUniversité LavalQuébec CityQCCanada
- Institut de Biologie Intégrative et des Systèmes (IBIS)Université LavalQuébec CityQCCanada
| |
Collapse
|
19
|
Cai Z, Sarup P, Ostersen T, Nielsen B, Fredholm M, Karlskov-Mortensen P, Sørensen P, Jensen J, Guldbrandtsen B, Lund MS, Christensen OF, Sahana G. Genomic diversity revealed by whole-genome sequencing in three Danish commercial pig breeds. J Anim Sci 2020; 98:5873883. [PMID: 32687196 DOI: 10.1093/jas/skaa229] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Accepted: 07/14/2020] [Indexed: 01/04/2023] Open
Abstract
Whole-genome sequencing of 217 animals from three Danish commercial pig breeds (Duroc, Landrace [LL], and Yorkshire [YY]) was performed. Twenty-six million single-nucleotide polymorphisms (SNPs) and 8 million insertions or deletions (indels) were uncovered. Among the SNPs, 493,099 variants were located in coding sequences, and 29,430 were predicted to have a high functional impact such as gain or loss of stop codon. Using the whole-genome sequence dataset as the reference, the imputation accuracy for pigs genotyped with high-density SNP chips was examined. The overall average imputation accuracy for all biallelic variants (SNP and indel) was 0.69, while it was 0.83 for variants with minor allele frequency > 0.1. This study provides whole-genome reference data to impute SNP chip-genotyped animals for further studies to fine map quantitative trait loci as well as improving the prediction accuracy in genomic selection. Signatures of selection were identified both through analyses of fixation and differentiation to reveal selective sweeps that may have had prominent roles during breed development or subsequent divergent selection. However, the fixation indices did not indicate a strong divergence among these three breeds. In LL and YY, the integrated haplotype score identified genomic regions under recent selection. These regions contained genes for olfactory receptors and oxidoreductases. Olfactory receptor genes that might have played a major role in the domestication were previously reported to have been under selection in several species including cattle and swine.
Collapse
Affiliation(s)
- Zexi Cai
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Pernille Sarup
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Tage Ostersen
- SEGES Danish Pig Research Centre, Copenhagen, Denmark
| | | | - Merete Fredholm
- Department of Veterinary and Animal Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | - Peter Sørensen
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Just Jensen
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Bernt Guldbrandtsen
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Ole Fredslund Christensen
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| | - Goutam Sahana
- Center for Quantitative Genetics and Genomics, Faculty of Technical Sciences, Aarhus University, Tjele, Denmark
| |
Collapse
|
20
|
Hermisdorff IDC, Costa RB, de Albuquerque LG, Pausch H, Kadri NK. Investigating the accuracy of imputing autosomal variants in Nellore cattle using the ARS-UCD1.2 assembly of the bovine genome. BMC Genomics 2020; 21:772. [PMID: 33167856 PMCID: PMC7654006 DOI: 10.1186/s12864-020-07184-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Accepted: 10/26/2020] [Indexed: 11/22/2022] Open
Abstract
Background Imputation accuracy among other things depends on the size of the reference panel, the marker’s minor allele frequency (MAF), and the correct placement of single nucleotide polymorphism (SNP) on the reference genome assembly. Using high-density genotypes of 3938 Nellore cattle from Brazil, we investigated the accuracy of imputation from 50 K to 777 K SNP density using Minimac3, when map positions were determined according to the bovine genome assemblies UMD3.1 and ARS-UCD1.2. We assessed the effect of reference and target panel sizes on the pre-phasing based imputation quality using ten-fold cross-validation. Further, we compared the reliability of the model-based imputation quality score (Rsq) from Minimac3 to the empirical imputation accuracy. Results The overall accuracy of imputation measured as the squared correlation between true and imputed allele dosages (R2dose) was almost identical using either the UMD3.1 or ARS-UCD1.2 genome assembly. When the size of the reference panel increased from 250 to 2000, R2dose increased from 0.845 to 0.917, and the number of polymorphic markers in the imputed data set increased from 586,701 to 618,660. Advantages in both accuracy and marker density were also observed when larger target panels were imputed, likely resulting from more accurate haplotype inference. Imputation accuracy increased from 0.903 to 0.913, and the marker density in the imputed data increased from 593,239 to 595,570 when haplotypes were inferred in 500 and 2900 target animals. The model-based imputation quality scores from Minimac3 (Rsq) were systematically higher than empirically estimated accuracies. However, both metrics were positively correlated and the correlation increased with the size of the reference panel and MAF of imputed variants. Conclusions Accurate imputation of BovineHD BeadChip markers is possible in Nellore cattle using the new bovine reference genome assembly ARS-UCD1.2. The use of large reference and target panels improves the accuracy of the imputed genotypes and provides genotypes for more markers segregating at low frequency for downstream genomic analyses. The model-based imputation quality score from Minimac3 (Rsq) can be used to detect poorly imputed variants but its reliability depends on the size of the reference panel and MAF of the imputed variants. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-020-07184-8.
Collapse
Affiliation(s)
- Isis da Costa Hermisdorff
- School of Veterinary Medicine and Animal Science, Federal University of Bahia (UFBA), Salvador, Brazil.,Animal Genomics, ETH Zurich, Zurich, Switzerland
| | - Raphael Bermal Costa
- School of Veterinary Medicine and Animal Science, Federal University of Bahia (UFBA), Salvador, Brazil
| | - Lucia Galvão de Albuquerque
- Animal Science Department, School of Agricultural and Veterinary Sciences, São Paulo State University (Unesp), Jaboticabal, São Paulo, Brazil
| | | | | |
Collapse
|
21
|
Yang W, Yang Y, Zhao C, Yang K, Wang D, Yang J, Niu X, Gong J. Animal-ImputeDB: a comprehensive database with multiple animal reference panels for genotype imputation. Nucleic Acids Res 2020; 48:D659-D667. [PMID: 31584087 PMCID: PMC6943029 DOI: 10.1093/nar/gkz854] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 09/19/2019] [Accepted: 10/01/2019] [Indexed: 12/11/2022] Open
Abstract
Animal-ImputeDB (http://gong_lab.hzau.edu.cn/Animal_ImputeDB/) is a public database with genomic reference panels of 13 animal species for online genotype imputation, genetic variant search, and free download. Genotype imputation is a process of estimating missing genotypes in terms of the haplotypes and genotypes in a reference panel. It can effectively increase the density of single nucleotide polymorphisms (SNPs) and thus can be widely used in large-scale genome-wide association studies (GWASs) using relatively inexpensive and low-density SNP arrays. However, most animals except humans lack high-quality reference panels, which greatly limits the application of genotype imputation in animals. To overcome this limitation, we developed Animal-ImputeDB, which is dedicated to collecting genotype data and whole-genome resequencing data of nonhuman animals from various studies and databases. A computational pipeline was developed to process different types of raw data to construct reference panels. Finally, 13 high-quality reference panels including ∼400 million SNPs from 2265 samples were constructed. In Animal-ImputeDB, an easy-to-use online tool consisting of two popular imputation tools was designed for the purpose of genotype imputation. Collectively, Animal-ImputeDB serves as an important resource for animal genotype imputation and will greatly facilitate research on animal genomic selection and genetic improvement.
Collapse
Affiliation(s)
- Wenqian Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Yanbo Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Cecheng Zhao
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Kun Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Dongyang Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Jiajun Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Xiaohui Niu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China
| | - Jing Gong
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, P. R. China.,College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, P. R. China
| |
Collapse
|
22
|
Ye S, Song H, Ding X, Zhang Z, Li J. Pre-selecting markers based on fixation index scores improved the power of genomic evaluations in a combined Yorkshire pig population. Animal 2020; 14:1555-1564. [PMID: 32209149 DOI: 10.1017/s1751731120000506] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Combining different swine populations in genomic prediction can be an important tool, leading to an increased accuracy of genomic prediction using single nucleotide polymorphism (SNP) chip data compared with within-population genomic. However, the expected higher accuracy of multi-population genomic prediction has not been realized. This may be due to an inconsistent linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTL) across populations, and the weak genetic relationships across populations. In this study, we determined the impact of different genomic relationship matrices, SNP density and pre-selected variants on prediction accuracy using a combined Yorkshire pig population. Our objective was to provide useful strategies for improving the accuracy of genomic prediction within a combined population. Results showed that the accuracy of genomic best linear unbiased prediction (GBLUP) using imputed whole-genome sequencing (WGS) data in the combined population was always higher than that within populations. Furthermore, the use of imputed WGS data always resulted in a higher accuracy of GBLUP than the use of 80K chip data for the combined population. Additionally, the accuracy of GBLUP with a non-linear genomic relationship matrix was markedly increased (0.87% to 15.17% for 80K chip data, and 0.43% to 4.01% for imputed WGS data) compared with that obtained with a linear genomic relationship matrix, except for the prediction of XD population in the combined population using imputed WGS data. More importantly, the application of pre-selected variants based on fixation index (Fst) scores improved the accuracy of multi-population genomic prediction, especially for 80K chip data. For BLUP|GA (BLUP approach given the genetic architecture), the use of a linear method with an appropriate weight to build a weight-relatedness matrix led to a higher prediction accuracy compared with the use of only pre-selected SNPs for genomic evaluations, especially for the total number of piglets born. However, for the non-linear method, BLUP|GA showed only a small increase or even a decrease in prediction accuracy compared with the use of only pre-selected SNPs. Overall, the best genomic evaluation strategy for reproduction-related traits for a combined population was found to be GBLUP performed with a non-linear genomic relationship matrix using variants pre-selected from the 80K chip data based on Fst scores.
Collapse
Affiliation(s)
- S Ye
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| | - H Song
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, No. 2, Yuanmingyuan West Road, Haidian District, 100193Beijing, China
| | - X Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, No. 2, Yuanmingyuan West Road, Haidian District, 100193Beijing, China
| | - Z Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| | - J Li
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| |
Collapse
|
23
|
Zhang F, Wang Y, Mukiibi R, Chen L, Vinsky M, Plastow G, Basarab J, Stothard P, Li C. Genetic architecture of quantitative traits in beef cattle revealed by genome wide association studies of imputed whole genome sequence variants: I: feed efficiency and component traits. BMC Genomics 2020; 21:36. [PMID: 31931702 PMCID: PMC6956504 DOI: 10.1186/s12864-019-6362-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 12/02/2019] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Genome wide association studies (GWAS) on residual feed intake (RFI) and its component traits including daily dry matter intake (DMI), average daily gain (ADG), and metabolic body weight (MWT) were conducted in a population of 7573 animals from multiple beef cattle breeds based on 7,853,211 imputed whole genome sequence variants. The GWAS results were used to elucidate genetic architectures of the feed efficiency related traits in beef cattle. RESULTS The DNA variant allele substitution effects approximated a bell-shaped distribution for all the traits while the distribution of additive genetic variances explained by single DNA variants followed a scaled inverse chi-squared distribution to a greater extent. With a threshold of P-value < 1.00E-05, 16, 72, 88, and 116 lead DNA variants on multiple chromosomes were significantly associated with RFI, DMI, ADG, and MWT, respectively. In addition, lead DNA variants with potentially large pleiotropic effects on DMI, ADG, and MWT were found on chromosomes 6, 14 and 20. On average, missense, 3'UTR, 5'UTR, and other regulatory region variants exhibited larger allele substitution effects in comparison to other functional classes. Intergenic and intron variants captured smaller proportions of additive genetic variance per DNA variant. Instead 3'UTR and synonymous variants explained a greater amount of genetic variance per DNA variant for all the traits examined while missense, 5'UTR and other regulatory region variants accounted for relatively more additive genetic variance per sequence variant for RFI and ADG, respectively. In total, 25 to 27 enriched cellular and molecular functions were identified with lipid metabolism and carbohydrate metabolism being the most significant for the feed efficiency traits. CONCLUSIONS RFI is controlled by many DNA variants with relatively small effects whereas DMI, ADG, and MWT are influenced by a few DNA variants with large effects and many DNA variants with small effects. Nucleotide polymorphisms in regulatory region and synonymous functional classes play a more important role per sequence variant in determining variation of the feed efficiency traits. The genetic architecture as revealed by the GWAS of the imputed 7,853,211 DNA variants will improve our understanding on the genetic control of feed efficiency traits in beef cattle.
Collapse
Affiliation(s)
- Feng Zhang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.,State Key Laboratory for Swine Genetics, Breeding and Production Technology, Jiangxi Agricultural University, Nanchang, Jiangxi, China.,Present Address: Institute of Translational Medicine, Nanchang University, Nanchang, Jiangxi, China
| | - Yining Wang
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Robert Mukiibi
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Liuhong Chen
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada.,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Michael Vinsky
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada
| | - Graham Plastow
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - John Basarab
- Alberta Agriculture and Forestry, Lacombe Research and Development Centre, 6000 C&E Trail, Lacombe, AB, Canada
| | - Paul Stothard
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada
| | - Changxi Li
- Lacombe Research and Development Centre, Agriculture and Agri-Food Canada, Lacombe, AB, Canada. .,Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
24
|
Pook T, Mayer M, Geibel J, Weigend S, Cavero D, Schoen CC, Simianer H. Improving Imputation Quality in BEAGLE for Crop and Livestock Data. G3 (BETHESDA, MD.) 2020; 10:177-188. [PMID: 31676508 PMCID: PMC6945036 DOI: 10.1534/g3.119.400798] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 10/31/2019] [Indexed: 12/14/2022]
Abstract
Imputation is one of the key steps in the preprocessing and quality control protocol of any genetic study. Most imputation algorithms were originally developed for the use in human genetics and thus are optimized for a high level of genetic diversity. Different versions of BEAGLE were evaluated on genetic datasets of doubled haploids of two European maize landraces, a commercial breeding line and a diversity panel in chicken, respectively, with different levels of genetic diversity and structure which can be taken into account in BEAGLE by parameter tuning. Especially for phasing BEAGLE 5.0 outperformed the newest version (5.1) which in turn also lead to improved imputation. Earlier versions were far more dependent on the adaption of parameters in all our tests. For all versions, the parameter ne (effective population size) had a major effect on the error rate for imputation of ungenotyped markers, reducing error rates by up to 98.5%. Further improvement was obtained by tuning of the parameters affecting the structure of the haplotype cluster that is used to initialize the underlying Hidden Markov Model of BEAGLE. The number of markers with extremely high error rates for the maize datasets were more than halved by the use of a flint reference genome (F7, PE0075 etc.) instead of the commonly used B73. On average, error rates for imputation of ungenotyped markers were reduced by 8.5% by excluding genetically distant individuals from the reference panel for the chicken diversity panel. To optimize imputation accuracy one has to find a balance between representing as much of the genetic diversity as possible while avoiding the introduction of noise by including genetically distant individuals.
Collapse
Affiliation(s)
- Torsten Pook
- Department of Animal Sciences, Animal Breeding and Genetics Group,
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
| | - Manfred Mayer
- Technical University of Munich, Plant Breeding, TUM School of Life Sciences Weihenstephan, 85354 Freising, Germany
| | - Johannes Geibel
- Department of Animal Sciences, Animal Breeding and Genetics Group
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
| | - Steffen Weigend
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
- Friedrich-Loeffler-Institut, Institute of Farm Animal Genetics, 31353 Neustadt-Mariensee, Germany, and
| | | | - Chris C Schoen
- Technical University of Munich, Plant Breeding, TUM School of Life Sciences Weihenstephan, 85354 Freising, Germany
| | - Henner Simianer
- Department of Animal Sciences, Animal Breeding and Genetics Group
- Center for Integrated Breeding Research, University of Goettingen, 37075 Goettingen, Germany
| |
Collapse
|
25
|
Zhang J, Zhang Y, Gong H, Cui L, Ma J, Chen C, Ai H, Xiao S, Huang L, Yang B. Landscape of Loci and Candidate Genes for Muscle Fatty Acid Composition in Pigs Revealed by Multiple Population Association Analysis. Front Genet 2019; 10:1067. [PMID: 31708975 PMCID: PMC6824322 DOI: 10.3389/fgene.2019.01067] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2019] [Accepted: 10/04/2019] [Indexed: 01/19/2023] Open
Abstract
Genome wide association analyses in diverse populations can identify complex trait loci that are specifically present in one population or shared across multiple populations, which help to better understand the genetic architecture of complex traits in a broader genetic context. In this study, we conducted genome-wide association studies and meta-analysis for 38 fatty acid composition traits with 12–19 million imputed genome sequence SNPs in 2446 pigs from six populations, encompassing White Duroc × Erhualian F2, Sutai, Duroc-Landrace-Yorkshire (DLY) three-way cross, Laiwu, Erhualian, and Bamaxiang pigs that were originally genotyped with 60 K or 1.4 million single nucleotide polymorphism (SNP) chips. The analyses uncovered 285 lead SNPs (P < 5 × 10-8), among which 78 locate more than 1 Mb to the lead chip SNPs were considered as novel, largely augmented the landscape of loci for porcine muscle fatty acid composition. Meta-analysis enhanced the association significance at loci near FADS2, ABCD2, ELOVL5, ELOVL6, ELOVL7, SCD, and THRSP genes, suggesting possible existence of population shared mutations underlying these loci. Further haplotype analysis at SCD loci identified a shared 3.7 kb haplotype in F2, Sutai and DLY pigs showing consistent effects of decreasing C18:0 contents in the three populations. In contrast, at FASN loci, we found an Erhualian specific haplotype explaining the population specific association signals in Erhualian pigs. This study refines our understanding on landscape of loci and candidate genes for fatty acid composition traits of pigs.
Collapse
Affiliation(s)
- Junjie Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Yifeng Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Huanfa Gong
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Leilei Cui
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Junwu Ma
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Congying Chen
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Huashui Ai
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Shijun Xiao
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Lusheng Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Bin Yang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| |
Collapse
|
26
|
Song H, Ye S, Jiang Y, Zhang Z, Zhang Q, Ding X. Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs. Genet Sel Evol 2019; 51:58. [PMID: 31638889 PMCID: PMC6805481 DOI: 10.1186/s12711-019-0500-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 10/07/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs. RESULTS In total, 2089 and 2043 individuals with production and reproduction phenotypes, respectively, from three Yorkshire populations with different genetic backgrounds were genotyped with the PorcineSNP80 panel. Imputation accuracy from 80K to WGS variants reached 92%. The results showed that use of the WGS data compared to the 80K SNP panel did not increase the accuracy of genomic prediction in a single population, but using WGS data with LD pruning and GFBLUP with prior information did yield higher accuracy than the 80K SNP panel. For the 80K SNP panel genotypes, using the combined population resulted in a slight improvement, no change, or even a slight decrease in accuracy in comparison with the single population for GBLUP and ssGBLUP, while accuracy increased by 1 to 2.4% when using WGS data. Notably, the GFBLUP method did not perform well for both the combined population and the single populations. CONCLUSIONS The use of WGS data was beneficial for combined population genomic prediction. Simply increasing the number of SNPs to the WGS level did not increase accuracy for a single population, while using pruned WGS data based on LD and GFBLUP with prior information could yield higher accuracy than the 80K SNP panel.
Collapse
Affiliation(s)
- Hailiang Song
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Yifan Jiang
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
27
|
Wu P, Wang K, Zhou J, Chen D, Yang Q, Yang X, Liu Y, Feng B, Jiang A, Shen L, Xiao W, Jiang Y, Zhu L, Zeng Y, Xu X, Li X, Tang G. GWAS on Imputed Whole-Genome Resequencing From Genotyping-by-Sequencing Data for Farrowing Interval of Different Parities in Pigs. Front Genet 2019; 10:1012. [PMID: 31681435 PMCID: PMC6813215 DOI: 10.3389/fgene.2019.01012] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2019] [Accepted: 09/23/2019] [Indexed: 12/30/2022] Open
Abstract
The whole-genome sequencing (WGS) data can potentially discover all genetic variants. Studies have shown the power of WGS for genome-wide association study (GWAS) lies in the ability to identify quantitative trait loci and nucleotides (QTNs). However, the resequencing of thousands of target individuals is expensive. Genotype imputation is a powerful approach for WGS and to identify causal mutations. This study aimed to evaluate the imputation accuracy from genotyping-by-sequencing (GBS) to WGS in two pig breeds using a resequencing reference population and to detect single-nucleotide polymorphisms (SNPs) and candidate genes for farrowing interval (FI) of different parities using the data before and after imputation for GWAS. Six hundred target pigs, 300 Landrace and 300 Large White pigs, were genotyped by GBS, and 60 reference pigs, 20 Landrace and 40 Large White pigs, were sequenced by whole-genome resequencing. Imputation for pigs was conducted using Beagle software. The average imputation accuracy (allelic R 2) from GBS to WGS was 0.42 for Landrace pigs and 0.45 for Large White pigs. For Landrace pigs (Large White pigs), 4,514,934 (5,533,290) SNPs had an accuracy >0.3, resulting an average accuracy of 0.73 (0.72), and 2,093,778 (2,468,645) SNPs had an accuracy >0.8, resulting an average accuracy of 0.94 (0.93). Association studies with data before and after imputation were performed for FI of different parities in two populations. Before imputation, 18 and 128 significant SNPs were detected for FI in Landrace and Large White pigs, respectively. After imputation, 125 and 27 significant SNPs were identified for dataset with an accuracy >0.3 and 0.8 in Large White pigs, and 113 and 18 SNPs were found among imputed sequence variants. Among these significant SNPs, six top SNPs were detected in both GBS data and imputed WGS data, namely, SSC2: 136127645, SSC5: 103426443, SSC6: 27811226, SSC10: 3609429, SSC14: 15199253, and SSC15: 150297519. Overall, many candidate genes could be involved in FI of different parities in pigs. Although imputation from GBS to WGS data resulted in a low imputation accuracy, association analyses with imputed WGS data were optimized to detect QTNs for complex trait. The obtained results provide new insight into genotype imputation, genetic architecture, and candidate genes for FI of different parities in Landrace and Large White pigs.
Collapse
Affiliation(s)
- Pingxian Wu
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Kai Wang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Jie Zhou
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Dejuan Chen
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Qiang Yang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Xidi Yang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yihui Liu
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Bo Feng
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Anan Jiang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Linyuan Shen
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Weihang Xiao
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yanzhi Jiang
- College of Life Science, Sichuan Agricultural University, Yaan, China
| | - Li Zhu
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Yangshuang Zeng
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Xu Xu
- Sichuan Province Department of Agriculture and Rural Affairs, Sichuan Animal Husbandry Station, Chengdu, China
| | - Xuewei Li
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| | - Guoqing Tang
- Farm Animal Genetic Resources Exploration and Innovation, Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu, China
| |
Collapse
|
28
|
Xu W, Chen D, Yan G, Xiao S, Huang T, Zhang Z, Huang L. Rediscover and Refine QTLs for Pig Scrotal Hernia by Increasing a Specially Designed F 3 Population and Using Whole-Genome Sequence Imputation Technology. Front Genet 2019; 10:890. [PMID: 31608119 PMCID: PMC6768097 DOI: 10.3389/fgene.2019.00890] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 08/23/2019] [Indexed: 11/14/2022] Open
Abstract
Pig scrotal hernia is one of the most common congenital defects triggered by both genetic and environmental factors, leading to severe economic loss as well as poor animal welfare in the pig industry. Identification and implementation of genomic regions controlling scrotal hernia in breeding is of great appeal to reduce incidences of hernia in pig production. The aim of this study was to identify such regions or molecular markers affecting scrotal hernia in pigs. First of all, we summarized and analyzed the results of some international teams on scrotal hernia and designed a specially population which contains 246 male individuals. We then performed genome-wide association study (GWAS) in this specially designed population using two scenarios, i.e., the target panel data before and after imputation, which contain 42,365 SNPs and 18,756,672 SNPs, respectively. In addition, a series of methods including genetic differentiation analysis, linkage disequilibrium and linkage analysis (LDLA), and haplotype sharing analysis were appropriate to provide for further analysis to identify the potential gene underlying the QTL. The GWAS in this report detected a highly significant region affecting scrotal hernia within a 24.8Mb region (114.1-138.9Mb) on SSC8. And the result of genetic differentiation analysis also showed a strong genetic differentiation signal between 116.1 and 132.7Mb on SSC8. In addition, the QTL interval was refined to 2.99Mb by combining LDLA and genetic differentiation analysis. Finally, two susceptibility haplotypes were identified through haplotype sharing analysis, with one potential causal gene in it. Our study provided deeper insights into the genetic architecture of pig scrotal hernia and contributed to further fine-mapping and characterize haplotype and gene that influence scrotal hernia in pigs.
Collapse
Affiliation(s)
| | | | | | | | | | - Zhiyan Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Lusheng Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| |
Collapse
|
29
|
Fang ZH, Pausch H. Multi-trait meta-analyses reveal 25 quantitative trait loci for economically important traits in Brown Swiss cattle. BMC Genomics 2019; 20:695. [PMID: 31481029 PMCID: PMC6724290 DOI: 10.1186/s12864-019-6066-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Accepted: 08/27/2019] [Indexed: 01/02/2023] Open
Abstract
Background Little is known about the genetic architecture of economically important traits in Brown Swiss cattle because only few genome-wide association studies (GWAS) have been carried out in this breed. Moreover, most GWAS have been performed for single traits, thus not providing detailed insights into potentially existing pleiotropic effects of trait-associated loci. Results To compile a comprehensive catalogue of large-effect quantitative trait loci (QTL) segregating in Brown Swiss cattle, we carried out association tests between partially imputed genotypes at 598,016 SNPs and daughter-derived phenotypes for more than 50 economically important traits, including milk production, growth and carcass quality, body conformation, reproduction and calving traits in 4578 artificial insemination bulls from two cohorts of Brown Swiss cattle (Austrian-German and Swiss populations). Across-cohort multi-trait meta-analyses of the results from the single-trait GWAS revealed 25 quantitative trait loci (QTL; P < 8.36 × 10− 8) for economically relevant traits on 17 Bos taurus autosomes (BTA). Evidence of pleiotropy was detected at five QTL located on BTA5, 6, 17, 21 and 25. Of these, two QTL at BTA6:90,486,780 and BTA25:1,455,150 affect a diverse range of economically important traits, including traits related to body conformation, calving, longevity and milking speed. Furthermore, the QTL at BTA6:90,486,780 seems to be a target of ongoing selection as evidenced by an integrated haplotype score of 2.49 and significant changes in allele frequency over the past 25 years, whereas either no or only weak evidence of selection was detected at all other QTL. Conclusions Our findings provide a comprehensive overview of QTL segregating in Brown Swiss cattle. Detected QTL explain between 2 and 10% of the variation in the estimated breeding values and thus may be considered as the most important QTL segregating in the Brown Swiss cattle breed. Multi-trait association testing boosts the power to detect pleiotropic QTL and assesses the full spectrum of phenotypes that are affected by trait-associated variants. Electronic supplementary material The online version of this article (10.1186/s12864-019-6066-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zih-Hua Fang
- Animal Genomics, Institute of Agricultural Science, ETH Zürich, 8092, Zürich, Switzerland.
| | - Hubert Pausch
- Animal Genomics, Institute of Agricultural Science, ETH Zürich, 8092, Zürich, Switzerland
| |
Collapse
|
30
|
Moreira GCM, Salvian M, Boschiero C, Cesar ASM, Reecy JM, Godoy TF, Ledur MC, Garrick D, Mourão GB, Coutinho LL. Genome-wide association scan for QTL and their positional candidate genes associated with internal organ traits in chickens. BMC Genomics 2019; 20:669. [PMID: 31438838 PMCID: PMC6704653 DOI: 10.1186/s12864-019-6040-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2019] [Accepted: 08/16/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Poultry breeding programs have been focused on improvement of growth and carcass traits, however, this has resulted in correlated changes in internal organ weights and increased incidence of metabolic disorders. These disorders can affect feed efficiency or even cause death. We used a high density SNP array (600 K, Affymetrix) to estimate genomic heritability, perform genome-wide association analysis, and identify genomic regions and positional candidate genes (PCGs) associated with internal organ traits in an F2 chicken population. We integrated knowledge of haplotype blocks, selection signature regions and sequencing data to refine the list of PCGs. RESULTS Estimated genomic heritability for internal organ traits in chickens ranged from low (LUNGWT, 0.06) to high (GIZZWT, 0.45). A total of 20 unique 1 Mb windows identified on GGA1, 2, 4, 7, 12, 15, 18, 19, 21, 27 and 28 were significantly associated with intestine length, and weights or percentages of liver, gizzard or lungs. Within these windows, 14 PCGs were identified based on their biological functions: TNFSF11, GTF2F2, SPERT, KCTD4, HTR2A, RB1, PCDH7, LCORL, LDB2, NR4A2, GPD2, PTPN11, ITGB4 and SLC6A4. From those genes, two were located within haplotype blocks and three overlapped with selection signature regions. A total of 13,748 annotated sequence SNPs were in the 14 PCGs, including 156 SNPs in coding regions (124 synonymous, 26 non-synonymous, and 6 splice variants). Seven deleterious SNPs were identified in TNFSF11, NR4A2 or ITGB4 genes. CONCLUSIONS The results from this study provide novel insights to understand the genetic architecture of internal organ traits in chickens. The QTL detection performed using a high density SNP array covered the whole genome allowing the discovery of novel QTL associated with organ traits. We identified PCGs within the QTL involved in biological processes that may regulate internal organ growth and development. Potential functional genetic variations were identified generating crucial information that, after validation, might be used in poultry breeding programs to reduce the occurrence of metabolic disorders.
Collapse
Affiliation(s)
| | - Mayara Salvian
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| | - Clarissa Boschiero
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| | - Aline Silva Mello Cesar
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| | - James M. Reecy
- Department of Animal Science, Iowa State University (ISU), Ames, Iowa USA
| | - Thaís Fernanda Godoy
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| | | | - Dorian Garrick
- School of Agriculture, Massey University, Ruakura, Hamilton, New Zealand
| | - Gerson Barreto Mourão
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| | - Luiz L. Coutinho
- University of São Paulo (USP), Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, São Paulo, Brazil
| |
Collapse
|
31
|
Li C, Li M, Li X, Ni W, Xu Y, Yao R, Wei B, Zhang M, Li H, Zhao Y, Liu L, Ullah Y, Jiang Y, Hu S. Whole-Genome Resequencing Reveals Loci Associated With Thoracic Vertebrae Number in Sheep. Front Genet 2019; 10:674. [PMID: 31379930 PMCID: PMC6657399 DOI: 10.3389/fgene.2019.00674] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 06/27/2019] [Indexed: 12/31/2022] Open
Abstract
The number of vertebrae, especially thoracic vertebrae, is an important economic trait that may influence carcass length and meat production in animals. However, the genetic basis of vertebrae number in sheep is still poorly understood. To detect the candidate genes, 400 increased number of thoracic vertebrae (T14L6) and 200 normal (T13L6) Kazakh sheep were collected. We generated and sequenced 60 pools of genomic DNA (each pool prepared by mixing genomic DNA from 10 sheep with the same thoracic traits), with an average depth of coverage of 25.65×. We identified a total of 42,075,402 SNPs and 11 putatively selected genomic regions, including the VRTN gene and the HoxA gene family that regulate vertebral development. The most prominent areas of selective elimination were located in a region of chromosome 7, including VRTN, which regulates spinal development and morphology. Further investigation indicated that the expression level of the VRTN gene during fetal development was significantly higher in sheep with more thoracic vertebrae than in those with a normal number of thoracic vertebrae. A genome-wide comparison between sheep with increased and normal numbers of thoracic vertebrae showed that the VRTN gene is the major selection locus for the number of thoracic vertebrae in sheep and has the potential to be utilized in sheep breeding in the future.
Collapse
Affiliation(s)
- Cunyuan Li
- College of Life Sciences, Shihezi University, Shihezi, China.,College of Animal Science and Technology, Shihezi University, Shihezi, China
| | - Ming Li
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Xiaoyue Li
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Wei Ni
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yueren Xu
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Rui Yao
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Bin Wei
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Mengdan Zhang
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Huixiang Li
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yue Zhao
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Li Liu
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yaseen Ullah
- College of Life Sciences, Shihezi University, Shihezi, China
| | - Yu Jiang
- College of Animal Science and Technology, Northwest A&F University, Xianyang, China
| | - Shengwei Hu
- College of Life Sciences, Shihezi University, Shihezi, China
| |
Collapse
|
32
|
Yan G, Guo T, Xiao S, Zhang F, Xin W, Huang T, Xu W, Li Y, Zhang Z, Huang L. Imputation-Based Whole-Genome Sequence Association Study Reveals Constant and Novel Loci for Hematological Traits in a Large-Scale Swine F 2 Resource Population. Front Genet 2018; 9:401. [PMID: 30405681 PMCID: PMC6204663 DOI: 10.3389/fgene.2018.00401] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2018] [Accepted: 09/03/2018] [Indexed: 11/13/2022] Open
Abstract
The whole-genome sequences of progenies with low-density single-nucleotide polymorphism (SNP) genotypes can be imputed with high accuracy based on the deep-coverage sequences of key ancestors. With this imputation technology, a more powerful genome-wide association study (GWAS) can be carried out using imputed whole-genome variants and the phenotypes of interest to overcome the shortcomings of low-power detection and the large confidence interval derived from low-density SNP markers in classic association studies. In this study, 19 ancestors of a large-scale swine F2 White Duroc × Erhualian population were deeply sequenced for their genome with an average coverage of 25×. Considering 98 pigs from 10 different breeds with high-quality deep sequenced genomes, we imputed the whole genomic variants of 1020 F2 pigs genotyped by the PorcineSNP60 BeadChip with high accuracy and obtained 14,851,440 sequence variants after quality control. Based on this, 87 novel quantitative traits loci (QTLs) for 18 hematological traits at three different physiological stages of the F2 pigs were identified, among which most of the novel QTLs have been repeated in two of the three stages. Literature mining pinpointed that the FGF14 and LCLAT1 genes at SSC11 and SSC3 may affect the MCH at day 240 and MCV at day 18, respectively. The present study shows that combining high-quality imputed genomic variants and correlated phenomic traits into GWAS can improve the capability to detect QTL considerably. The large number of different QTLs for hematological traits identified at multiple growth stages implies the complexity and time specificity of these traits.
Collapse
Affiliation(s)
- Guorong Yan
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Tianfu Guo
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Shijun Xiao
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Feng Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Wenshui Xin
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Tao Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Wenwu Xu
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Yiping Li
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Zhiyan Zhang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| | - Lusheng Huang
- State Key Laboratory for Pig Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang, China
| |
Collapse
|