1
|
Liu C, Chen Z, Zhang Z, Wang Z, Guo X, Pan Y, Wang Q. Unveiling the Genetic Mechanism of Meat Color in Pigs through GWAS, Multi-Tissue, and Single-Cell Transcriptome Signatures Exploration. Int J Mol Sci 2024; 25:3682. [PMID: 38612491 PMCID: PMC11012088 DOI: 10.3390/ijms25073682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 03/22/2024] [Accepted: 03/23/2024] [Indexed: 04/14/2024] Open
Abstract
Meat color traits directly influence consumer acceptability and purchasing decisions. Nevertheless, there is a paucity of comprehensive investigation into the genetic mechanisms underlying meat color traits in pigs. Utilizing genome-wide association studies (GWAS) on five meat color traits and the detection of selection signatures in pig breeds exhibiting distinct meat color characteristics, we identified a promising candidate SNP, 6_69103754, exhibiting varying allele frequencies among pigs with different meat color characteristics. This SNP has the potential to affect the redness and chroma index values of pork. Moreover, transcriptome-wide association studies (TWAS) analysis revealed the expression of candidate genes associated with meat color traits in specific tissues. Notably, the largest number of candidate genes were observed from transcripts derived from adipose, liver, lung, spleen tissues, and macrophage cell type, indicating their crucial role in meat color development. Several shared genes associated with redness, yellowness, and chroma indices traits were identified, including RINL in adipose tissue, ENSSSCG00000034844 and ITIH1 in liver tissue, TPX2 and MFAP2 in lung tissue, and ZBTB17, FAM131C, KIFC3, NTPCR, and ENGSSSCG00000045605 in spleen tissue. Furthermore, single-cell enrichment analysis revealed a significant association between the immune system and meat color. This finding underscores the significance of the immune system associated with meat color. Overall, our study provides a comprehensive analysis of the genetic mechanisms underlying meat color traits, offering valuable insights for future breeding efforts aimed at improving meat quality.
Collapse
Affiliation(s)
- Cheng Liu
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
| | - Zitao Chen
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
| | - Zhe Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
| | - Zhen Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
| | - Xiaoling Guo
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
| | - Yuchun Pan
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya 572000, China
| | - Qishan Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, 866# Yuhangtang Road, Hangzhou 310058, China; (C.L.); (Z.C.); (Z.Z.); (Z.W.); (X.G.); (Y.P.)
- Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya 572000, China
| |
Collapse
|
2
|
Chen S, Liu S, Shi S, Yin H, Tang Y, Zhang J, Li W, Liu G, Qu K, Ding X, Wang Y, Liu J, Zhang S, Fang L, Yu Y. Cross-Species Comparative DNA Methylation Reveals Novel Insights into Complex Trait Genetics among Cattle, Sheep, and Goats. Mol Biol Evol 2024; 41:msae003. [PMID: 38266195 PMCID: PMC10834038 DOI: 10.1093/molbev/msae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 12/28/2023] [Accepted: 01/04/2024] [Indexed: 01/26/2024] Open
Abstract
The cross-species characterization of evolutionary changes in the functional genome can facilitate the translation of genetic findings across species and the interpretation of the evolutionary basis underlying complex phenotypes. Yet, this has not been fully explored between cattle, sheep, goats, and other mammals. Here, we systematically characterized the evolutionary dynamics of DNA methylation and gene expression in 3 somatic tissues (i.e. brain, liver, and skeletal muscle) and sperm across 7 mammalian species, including 3 ruminant livestock species (cattle, sheep, and goats), humans, pigs, mice, and dogs, by generating and integrating 160 DNA methylation and transcriptomic data sets. We demonstrate dynamic changes of DNA hypomethylated regions and hypermethylated regions in tissue-type manner across cattle, sheep, and goats. Specifically, based on the phylo-epigenetic model of DNA methylome, we identified a total of 25,074 hypomethylated region extension events specific to cattle, which participated in rewiring tissue-specific regulatory network. Furthermore, by integrating genome-wide association studies of 50 cattle traits, we provided novel insights into the genetic and evolutionary basis of complex phenotypes in cattle. Overall, our study provides a valuable resource for exploring the evolutionary dynamics of the functional genome and highlights the importance of cross-species characterization of multiomics data sets for the evolutionary interpretation of complex phenotypes in cattle livestock.
Collapse
Affiliation(s)
- Siqian Chen
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Shuli Liu
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310024, China
| | - Shaolei Shi
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Hongwei Yin
- Agriculture Genome Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Yongjie Tang
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Jinning Zhang
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Wenlong Li
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Gang Liu
- National Animal Husbandry Service, Beijing 100125, China
| | - Kaixing Qu
- Academy of Science and Technology, Chuxiong Normal University, Chuxiong 675000, China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Yachun Wang
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Jianfeng Liu
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Shengli Zhang
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| | - Lingzhao Fang
- Center for Quantitative Genetics and Genomics (QGG), Aarhus University, Aarhus, Denmark
| | - Ying Yu
- National Engineering Laboratory for Animal Breeding, State Key Laboratory of Animal Biotech Breeding, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing 100193, China
| |
Collapse
|
3
|
Ryan CA, Purfield DC, Naderi S, Berry DP. Associations between polymorphisms in the myostatin gene with calving difficulty and carcass merit in cattle. J Anim Sci 2023; 101:skad371. [PMID: 37935361 PMCID: PMC10684047 DOI: 10.1093/jas/skad371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 11/02/2023] [Indexed: 11/09/2023] Open
Abstract
A fully functional myostatin gene inhibits muscle fiber growth. The objective of the present study was to quantify the association between 21 known myostatin mutations with both calving and carcass traits in 12 cattle breeds. The myostatin genotypes of 32,770 dam-progeny combinations were used in the association analysis of calving dystocia, with the genotypes of 129,803 animals used in the mixed model association analyses of carcass weight, conformation, and fat score. The mixed model included additive genetic, maternal, and permanent environmental effects where appropriate. The mutant genotypes of nt821, Q204X, and F94L were all associated (P < 0.01) with more calving difficulty when present in either the dam or the progeny. The nt821 deletion had the greatest association with calving difficulty when the homozygous deletion was present in either the calf (0.37 points greater calving difficulty score relative to calves carrying no copies of the deletion based on a one to four scale) or the dam (1.30 points greater calving difficulty score relative to dams carrying no copies of the deletion), although the association between the calf's nt821 genotype and calving difficulty differed depending on the nt821 genotype of the dam. With the exception of nt748_78, nt414, and nt374_51, all other seven segregating myostatin variants were associated (range of allele substitution effect size relative to animals with no copies of the mutant allele) with carcass weight (2.36 kg lighter to 15.56 kg heavier), all 10 segregating variants with conformation (0.15 units less conformed to 2.24 units more conformed assessed on a scale of 1 to 15), and all segregating variants other than E226X with carcass fat (0.23 units less carcass fat cover to 3.85 units more carcass fat cover assessed on a scale of 1 to 15). Of these, the F94L, Q204X, and nt821 mutations generally had the greatest association with all three carcass traits, giving rise to heavier and more conformed carcasses. Despite the antagonistic genetic relationship between calving difficulty and carcass traits, the nt374_51, F94L, and E226X mutations were all associated with improved carcass merit while having minimal expected consequences on calving difficulty. Thus, animals carrying these mutation(s) may have favorable genetic merit for calving difficulty and carcass merit. Furthermore, depending on the dam genotype, a bull with two copies of the nt821 mutation can produce progeny with improved carcass merit while minimizing calving problems.
Collapse
Affiliation(s)
- Cliona A Ryan
- Department of Animal Bioscience, Animal and Grassland Research and Innovation Centre, Teagasc, Moorepark, Fermoy, Co. Cork, Ireland
- Department of Biological Sciences, Munster Technological University, Bishopstown, Co. Cork, Ireland
| | - Deirdre C Purfield
- Department of Biological Sciences, Munster Technological University, Bishopstown, Co. Cork, Ireland
| | - Saeid Naderi
- Irish Cattle Breeding Federation, Bandon, Co. Cork, Ireland
| | - Donagh P Berry
- Department of Animal Bioscience, Animal and Grassland Research and Innovation Centre, Teagasc, Moorepark, Fermoy, Co. Cork, Ireland
| |
Collapse
|
4
|
Cai X, Teng J, Ren D, Zhang H, Li J, Zhang Z. Model Comparison of Heritability Enrichment Analysis in Livestock Population. Genes (Basel) 2022; 13:genes13091644. [PMID: 36140810 PMCID: PMC9498849 DOI: 10.3390/genes13091644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 09/03/2022] [Accepted: 09/10/2022] [Indexed: 11/16/2022] Open
Abstract
Heritability enrichment analysis is an important means of exploring the genetic architecture of complex traits in human genetics. Heritability enrichment is typically defined as the proportion of an SNP subset explained heritability, divided by the proportion of SNPs. Heritability enrichment enables better study of underlying complex traits, such as functional variant/gene subsets, biological networks and metabolic pathways detected through integrating explosively increased omics data. This would be beneficial for genomic prediction of disease risk in humans and genetic values estimation of important economical traits in livestock and plant species. However, in livestock, factors affecting the heritability enrichment estimation of complex traits have not been examined. Previous studies on humans reported that the frequencies, effect sizes, and levels of linkage disequilibrium (LD) of underlying causal variants (CVs) would affect the heritability enrichment estimation. Therefore, the distribution of heritability across the genome should be fully considered to obtain the unbiased estimation of heritability enrichment. To explore the performance of different heritability enrichment models in livestock populations, we used the VanRaden, GCTA and α models, assuming different α values, and the LDAK model, considering LD weight. We simulated three types of phenotypes, with CVs from various minor allele frequency (MAF) ranges: genome-wide (0.005 ≤ MAF ≤ 0.5), common (0.05 ≤ MAF ≤ 0.5), and uncommon (0.01 ≤ MAF < 0.05). The performances of the models with two different subsets (one of which contained known CVs and the other consisting of randomly selected markers) were compared to verify the accuracy of heritability enrichment estimation of functional variant sets. Our results showed that models with known CV subsets provided more robust enrichment estimation. Models with different α values tended to provide stable and accurate estimates for common and genome-wide CVs (relative deviation 0.5−2.2%), while tending to underestimate the enrichment of uncommon CVs. As the α value increased, enrichments from 15.73% higher than true value (i.e., 3.00) to 48.93% lower than true value for uncommon CVs were observed. In addition, the long-range LD windows (e.g., 5000 kb) led to large bias of the enrichment estimations for both common and uncommon CVs. Overall, heritability enrichment estimations were sensitive for the α value assumption and LD weight consideration of different models. Accuracy would be greatly improved by using a suitable model. This study would be helpful in understanding the genetic architecture of complex traits and provides a reference for genetic analysis in the livestock population.
Collapse
|
5
|
Chen Z, Zhang Z, Wang Z, Zhang Z, Wang Q, Pan Y. Heterozygosity and homozygosity regions affect reproductive success and the loss of reproduction: a case study with litter traits in pigs. Comput Struct Biotechnol J 2022; 20:4060-4071. [PMID: 35983229 PMCID: PMC9364102 DOI: 10.1016/j.csbj.2022.07.039] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Revised: 07/21/2022] [Accepted: 07/22/2022] [Indexed: 12/23/2022] Open
Abstract
Runs of heterozygosity (ROHet) and homozygosity (ROH) harbor useful information related to traits of interest. There is a lack of investigating the effect of ROHet and ROH on reproductive success and the loss of reproduction in mammals. Here, we detected and characterized the ROHet and ROH patterns in the genomes of Chinese indigenous pigs (i.e., Jinhua, Chun’an, Longyou Black, and Shengxian Spotted pigs), revealing the similar genetic characteristics of indigenous pigs. Later, we highlighted the underlying litter traits-related ROHet and ROH using association analysis with linear model in these four indigenous pig breeds. To pinpoint the promising candidate genes associated with litter traits, we further in-depth explore the selection patterns of other five pig breeds (i.e., Erhualian, Meishan, Minzhu, Rongchang, and Diqing pigs) with different levels of reproduction performance at the underlying litter traits-related ROHet and ROH using FST and genetic diversity ratio. Then, we identified a set of known and novel candidate genes associated with reproductive performance in pigs. For the novel candidate genes (i.e., CCDC91, SASH1, SAMD5, MACF1, MFSD2A, EPC2, and MBD5), we obtained public available datasets and performed multi-omics analyses integrating transcriptome-wide association studies and comparative single-cell RNA-seq analyses to uncover the roles of them in mammalian reproductive performance. The genes have not been widely reported to be fertility-related genes and can be complementally considered as prior biological information to modify genomic selections models that benefits pig genetic improvement of litter traits. Besides, our findings provide new insights into the function of ROHet and ROH in mammals.
Collapse
|
6
|
Ren D, Cai X, Lin Q, Ye H, Teng J, Li J, Ding X, Zhang Z. Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation. Genet Sel Evol 2022; 54:47. [PMID: 35761182 PMCID: PMC9235212 DOI: 10.1186/s12711-022-00737-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 06/15/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Compared to medium-density single nucleotide polymorphism (SNP) data, high-density SNP data contain abundant genetic variants and provide more information for the genetic evaluation of livestock, but it has been shown that they do not confer any advantage for genomic prediction and heritability estimation. One possible reason is the uneven distribution of the linkage disequilibrium (LD) along the genome, i.e., LD heterogeneity among regions. The aim of this study was to effectively use genome-wide SNP data for genomic prediction and heritability estimation by using models that control LD heterogeneity among regions. METHODS The LD-adjusted kinship (LDAK) and LD-stratified multicomponent (LDS) models were used to control LD heterogeneity among regions and were compared with the classical model that has no such control. Simulated and real traits of 2000 dairy cattle individuals with imputed high-density (770K) SNP data were used. Five types of phenotypes were simulated, which were controlled by very strongly, strongly, moderately, weakly and very weakly tagged causal variants, respectively. The performances of the models with high- and medium-density (50K) panels were compared to verify that the models that controlled LD heterogeneity among regions were more effective with high-density data. RESULTS Compared to the medium-density panel, the use of the high-density panel did not improve and even decreased prediction accuracies and heritability estimates from the classical model for both simulated and real traits. Compared to the classical model, LDS effectively improved the accuracy of genomic predictions and unbiasedness of heritability estimates, regardless of the genetic architecture of the trait. LDAK applies only to traits that are mainly controlled by weakly tagged causal variants, but is still less effective than LDS for this type of trait. Compared with the classical model, LDS improved prediction accuracy by about 13% for simulated phenotypes and by 0.3 to ~ 10.7% for real traits with the high-density panel, and by ~ 1% for simulated phenotypes and by - 0.1 to ~ 6.9% for real traits with the medium-density panel. CONCLUSIONS Grouping SNPs based on regional LD to construct the LD-stratified multicomponent model can effectively eliminate the adverse effects of LD heterogeneity among regions, and greatly improve the efficiency of high-density SNP data for genomic prediction and heritability estimation.
Collapse
Affiliation(s)
- Duanyang Ren
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Xiaodian Cai
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Qing Lin
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Haoqiang Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Jinyan Teng
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
7
|
Utilization Strategies of Two Environment Phenotypes in Genomic Prediction. Genes (Basel) 2022; 13:genes13050722. [PMID: 35627107 PMCID: PMC9141986 DOI: 10.3390/genes13050722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/01/2023] Open
Abstract
Multiple environment phenotypes may be utilized to implement genomic prediction in plant breeding, while it is unclear about optimal utilization strategies according to its different availability. It is necessary to assess the utilization strategies of genomic prediction models based on different availability of multiple environment phenotypes. Here, we compared the prediction accuracy of three genomic prediction models (genomic prediction model (genomic best linear unbiased prediction (GBLUP), genomic best linear unbiased prediction (GFBLUP), and multi-trait genomic best linear unbiased prediction (mtGBLUP)) which leveraged diverse information from multiple environment phenotypes using a rice dataset containing 19 agronomic traits in two disparate seasons. We found that the prediction accuracy of genomic prediction models considering multiple environment phenotypes (GFBLUP and mtGBLUP) was better than the classical genomic prediction model (GBLUP model). The deviation of prediction accuracy of between GBLUP and mtGBLUP or GFBLUP was associated with the phenotypic correlation. In summary, the genomic prediction models considering multiple environment phenotypes (GFBLUP and mtGBLUP) demonstrated better prediction accuracy. In addition, we could utilize different genomic prediction strategies according to different availability of multiple environment phenotypes.
Collapse
|
8
|
Xiang R, Fang L, Sanchez MP, Cheng H, Zhang Z. Editorial: Multi-Layered Genome-Wide Association/Prediction in Animals. Front Genet 2022; 13:877748. [PMID: 35464854 PMCID: PMC9023786 DOI: 10.3389/fgene.2022.877748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 03/07/2022] [Indexed: 12/02/2022] Open
Affiliation(s)
- Ruidong Xiang
- Faculty of Veterinary and Agricultural Science, The University of Melbourne, Parkville, VIC, Australia
- Agriculture Victoria, AgriBio, Centre for AgriBiosciences, Bundoora, VIC, Australia
- *Correspondence: Ruidong Xiang,
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, United Kingdom
| | | | - Hao Cheng
- Department of Animal Science, University of California, Davis, Davis, CA, United States
| | - Zhe Zhang
- College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
9
|
Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle. Animals (Basel) 2021; 11:ani11071992. [PMID: 34359120 PMCID: PMC8300388 DOI: 10.3390/ani11071992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/27/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022] Open
Abstract
Simple Summary The usefulness of genomic prediction (GP) has been widely proofed by breeding analysis in livestock, plants and aquatic populations. It is well known that ‘marker density’ is a critical factor that affects the accuracy of GP, however, how to properly measure ‘marker density’ in GP is yet to be determined. With population-level whole-genome sequence data or high-density single nucleotide polymorphism (SNP) data available, this question seems to be answered more convincingly. In this study, we investigated and discussed the impact of four ‘marker density’ measures that reflect genetic or physical distances between SNPs on the accuracy of GP in a Germany Holstein dairy cattle population. Our results showed that the degree of variation of physical distance between adjacent SNPs had significant effects on the accuracy of GP, while the genetic distance between SNPs had no relationship with the accuracy of GP. Therefore, for studies based on high-density SNP data, the default strategy of pruning SNPs based on genetic distance is detrimental to heritability estimation and genomic prediction. The results extended the communities knowledge of ‘marker density’ and provided useful suggestions for the application and research on genome prediction. Abstract With the availability of high-density single-nucleotide polymorphism (SNP) data and the development of genotype imputation methods, high-density panel-based genomic prediction (GP) has become possible in livestock breeding. It is generally considered that the genomic estimated breeding value (GEBV) accuracy increases with the marker density, while studies have shown that the GEBV accuracy does not increase or even decrease when high-density panels were used. Therefore, in addition to the SNP number, other measurements of ‘marker density’ seem to have impacts on the GEBV accuracy, and exploring the relationship between the GEBV accuracy and the measurements of ‘marker density’ based on high-density SNP or whole-genome sequence data is important for the field of GP. In this study, we constructed different SNP panels with certain SNP numbers (e.g., 1 k) by using the physical distance (PhyD), genetic distance (GenD) and random distance (RanD) between SNPs respectively based on the high-density SNP data of a Germany Holstein dairy cattle population. Therefore, there are three different panels at a certain SNP number level. These panels were used to construct GP models to predict fat percentage, milk yield and somatic cell score. Meanwhile, the mean (d¯) and variance (σd2) of the physical distance between SNPs and the mean (r2¯) and variance (σr22) of the genetic distance between SNPs in each panel were used as marker density-related measurements and their influence on the GEBV accuracy was investigated. At the same SNP number level, the d¯ of all panels is basically the same, but the σd2, r2¯ and σr22 are different. Therefore, we only investigated the effects of σd2, r2¯ and σr22 on the GEBV accuracy. The results showed that at a certain SNP number level, the GEBV accuracy was negatively correlated with σd2, but not with r2¯ and σr22. Compared with GenD and RanD, the σd2 of panels constructed by PhyD is smaller. The low and moderate-density panels (< 50 k) constructed by RanD or GenD have large σd2, which is not conducive to genomic prediction. The GEBV accuracy of the low and moderate-density panels constructed by PhyD is 3.8~34.8% higher than that of the low and moderate-density panels constructed by RanD and GenD. Panels with 20–30 k SNPs constructed by PhyD can achieve the same or slightly higher GEBV accuracy than that of high-density SNP panels for all three traits. In summary, the smaller the variation degree of physical distance between adjacent SNPs, the higher the GEBV accuracy. The low and moderate-density panels construct by physical distance are beneficial to genomic prediction, while pruning high-density SNP data based on genetic distance is detrimental to genomic prediction. The results provide suggestions for the development of SNP panels and the research of genome prediction based on whole-genome sequence data.
Collapse
|