1
|
Sun J, Wei J, Pan Y, Cao M, Li X, Xiao J, Yang G, Yu T. Improving genomic prediction accuracy of pig reproductive traits based on genotype imputation using preselected markers with different imputation platforms. Animal 2024; 19:101387. [PMID: 39708733 DOI: 10.1016/j.animal.2024.101387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 11/21/2024] [Accepted: 11/22/2024] [Indexed: 12/23/2024] Open
Abstract
Genomic prediction has been widely applied to the pig industry and has greatly accelerated the progress of genetic improvement in pigs. With the development of sequencing technology and price reduction, more and more genotype imputation panels of pig have been investigated, providing an effective and economical method to further study the genetic variation of pig economic traits. In this study, the imputation from 80 k Single Nucleotide Polymorphism chip data of 832 Large White pigs to whole-genome sequencing genotypes was performed by Swine Imputation Server, Pig Haplotypes Reference Panel (PHARP), Animal Genotype Imputation Database and 1k-pig-genomes four thousand-pig imputation panels. Then, linkage disequilibrium (LD) pruning and genome-wide association study (GWAS) preselected markers strategies were utilised to compare the genomic prediction accuracy of the different imputation data for reproductive traits, respectively. Our results showed that the PHARP panel exhibited the best genomic prediction accuracy among the four imputation panels. Meanwhile, the genomic prediction accuracy of the imputation data can be further improved by utilising the LD pruning and GWAS preselected marker strategies. In conclusion, our study provides insights into imputation data for pig genetic breeding.
Collapse
Affiliation(s)
- J Sun
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China; Key Laboratory of Agroecological Processes in Subtropical Region, Institute of Subtropical Agriculture, Chinese Academy of Sciences, Changsha, Hunan 410125, China
| | - J Wei
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Y Pan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - M Cao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - X Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - J Xiao
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - G Yang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - T Yu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, Laboratory of Animal Fat Deposition & Muscle Development, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China.
| |
Collapse
|
2
|
Novielli P, Romano D, Pavan S, Losciale P, Stellacci AM, Diacono D, Bellotti R, Tangaro S. Explainable artificial intelligence for genotype-to-phenotype prediction in plant breeding: a case study with a dataset from an almond germplasm collection. FRONTIERS IN PLANT SCIENCE 2024; 15:1434229. [PMID: 39319003 PMCID: PMC11420924 DOI: 10.3389/fpls.2024.1434229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Accepted: 08/13/2024] [Indexed: 09/26/2024]
Abstract
Background Advances in DNA sequencing revolutionized plant genomics and significantly contributed to the study of genetic diversity. However, predicting phenotypes from genomic data remains a challenge, particularly in the context of plant breeding. Despite significant progress, accurately predicting phenotypes from high-dimensional genomic data remains a challenge, particularly in identifying the key genetic factors influencing these predictions. This study aims to bridge this gap by integrating explainable artificial intelligence (XAI) techniques with advanced machine learning models. This approach is intended to enhance both the predictive accuracy and interpretability of genotype-to-phenotype models, thereby improving their reliability and supporting more informed breeding decisions. Results This study compares several ML methods for genotype-to-phenotype prediction, using data available from an almond germplasm collection. After preprocessing and feature selection, regression models are employed to predict almond shelling fraction. Best predictions were obtained by the Random Forest method (correlation = 0.727 ± 0.020, an R 2 = 0.511 ± 0.025, and an RMSE = 7.746 ± 0.199). Notably, the application of the SHAP (SHapley Additive exPlanations) values algorithm to explain the results highlighted several genomic regions associated with the trait, including one, having the highest feature importance, located in a gene potentially involved in seed development. Conclusions Employing explainable artificial intelligence algorithms enhances model interpretability, identifying genetic polymorphisms associated with the shelling percentage. These findings underscore XAI's efficacy in predicting phenotypic traits from genomic data, highlighting its significance in optimizing crop production for sustainable agriculture.
Collapse
Affiliation(s)
- Pierfrancesco Novielli
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Bari, Italy
| | - Donato Romano
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Bari, Italy
| | - Stefano Pavan
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Pasquale Losciale
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Anna Maria Stellacci
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Domenico Diacono
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Bari, Italy
| | - Roberto Bellotti
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Bari, Italy
- Dipartimento Interateneo di Fisica “M. Merlin”, Università degli Studi di Bari Aldo Moro, Bari, Italy
| | - Sabina Tangaro
- Dipartimento di Scienze del Suolo, della Pianta e degli Alimenti, Università degli Studi di Bari Aldo Moro, Bari, Italy
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Bari, Italy
| |
Collapse
|
3
|
Song H, Dong T, Wang W, Yan X, Geng C, Bai S, Hu H. GWAS Enhances Genomic Prediction Accuracy of Caviar Yield, Caviar Color and Body Weight Traits in Sturgeons Using Whole-Genome Sequencing Data. Int J Mol Sci 2024; 25:9756. [PMID: 39273703 PMCID: PMC11395957 DOI: 10.3390/ijms25179756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2024] [Revised: 09/05/2024] [Accepted: 09/07/2024] [Indexed: 09/15/2024] Open
Abstract
Caviar yield, caviar color, and body weight are crucial economic traits in sturgeon breeding. Understanding the molecular mechanisms behind these traits is essential for their genetic improvement. In this study, we performed whole-genome sequencing on 673 Russian sturgeons, renowned for their high-quality caviar. With an average sequencing depth of 13.69×, we obtained approximately 10.41 million high-quality single nucleotide polymorphisms (SNPs). Using a genome-wide association study (GWAS) with a single-marker regression model, we identified SNPs and genes associated with these traits. Our findings revealed several candidate genes for each trait: caviar yield: TFAP2A, RPS6KA3, CRB3, TUBB, H2AFX, morc3, BAG1, RANBP2, PLA2G1B, and NYAP1; caviar color: NFX1, OTULIN, SRFBP1, PLEK, INHBA, and NARS; body weight: ACVR1, HTR4, fmnl2, INSIG2, GPD2, ACVR1C, TANC1, KCNH7, SLC16A13, XKR4, GALR2, RPL39, ACVR2A, ADCY10, and ZEB2. Additionally, using the genomic feature BLUP (GFBLUP) method, which combines linkage disequilibrium (LD) pruning markers with GWAS prior information, we improved genomic prediction accuracy by 2%, 1.9%, and 3.1% for caviar yield, caviar color, and body weight traits, respectively, compared to the GBLUP method. In conclusion, this study enhances our understanding of the genetic mechanisms underlying caviar yield, caviar color, and body weight traits in sturgeons, providing opportunities for genetic improvement of these traits through genomic selection.
Collapse
Affiliation(s)
- Hailiang Song
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
- National Innovation Center for Digital Seed Industry, Beijing 100097, China
| | - Tian Dong
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Wei Wang
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Xiaoyu Yan
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Chenfan Geng
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Song Bai
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
| | - Hongxia Hu
- Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China
- Key Laboratory of Sturgeon Genetics and Breeding, Ministry of Agriculture and Rural Affairs, Hangzhou 311799, China
- National Innovation Center for Digital Seed Industry, Beijing 100097, China
| |
Collapse
|
4
|
Cai X, Zhang W, Gao N, Wei C, Wu X, Si J, Gao Y, Li J, Yin T, Zhang Z. Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations. J Anim Breed Genet 2024. [PMID: 39215551 DOI: 10.1111/jbg.12896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 07/18/2024] [Accepted: 08/18/2024] [Indexed: 09/04/2024]
Abstract
The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.
Collapse
Affiliation(s)
- Xiaodian Cai
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Wenjing Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Ning Gao
- College of Animal Science and Technology, Hunan Agricultural University, Changsha, China
| | - Chen Wei
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xibo Wu
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Jinglei Si
- Guangxi State Farmd Yongxin Animal Husbandry Group Co., Ltd, Nanning, China
| | - Yahui Gao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Tong Yin
- Institute of Animal Breeding and Genetics, Justus Liebig University, Giessen, Germany
| | - Zhe Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
5
|
Kebede FG, Derks MFL, Dessie T, Hanotte O, Barros CP, Crooijmans RPMA, Komen H, Bastiaansen JWM. Landscape genomics reveals regions associated with adaptive phenotypic and genetic variation in Ethiopian indigenous chickens. BMC Genomics 2024; 25:284. [PMID: 38500079 PMCID: PMC10946127 DOI: 10.1186/s12864-024-10193-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 03/05/2024] [Indexed: 03/20/2024] Open
Abstract
Climate change is a threat to sustainable livestock production and livelihoods in the tropics. It has adverse impacts on feed and water availability, disease prevalence, production, environmental temperature, and biodiversity. Unravelling the drivers of local adaptation and understanding the underlying genetic variation in random mating indigenous livestock populations informs the design of genetic improvement programmes that aim to increase productivity and resilience. In the present study, we combined environmental, genomic, and phenotypic information of Ethiopian indigenous chickens to investigate their environmental adaptability. Through a hybrid sampling strategy, we captured wide biological and ecological variabilities across the country. Our environmental dataset comprised mean values of 34 climatic, vegetation and soil variables collected over a thirty-year period for 260 geolocations. Our biological dataset included whole genome sequences and quantitative measurements (on eight traits) from 513 individuals, representing 26 chicken populations spread along 4 elevational gradients (6-7 populations per gradient). We performed signatures of selection analyses ([Formula: see text] and XP-EHH) to detect footprints of natural selection, and redundancy analyses (RDA) to determine genotype-environment and genotype-phenotype-associations. RDA identified 1909 outlier SNPs linked with six environmental predictors, which have the highest contributions as ecological drivers of adaptive phenotypic variation. The same method detected 2430 outlier SNPs that are associated with five traits. A large overlap has been observed between signatures of selection identified by[Formula: see text]and XP-EHH showing that both methods target similar selective sweep regions. Average genetic differences measured by [Formula: see text] are low between gradients, but XP-EHH signals are the strongest between agroecologies. Genes in the calcium signalling pathway, those associated with the hypoxia-inducible factor (HIF) transcription factors, and sports performance (GALNTL6) are under selection in high-altitude populations. Our study underscores the relevance of landscape genomics as a powerful interdisciplinary approach to dissect adaptive phenotypic and genetic variation in random mating indigenous livestock populations.
Collapse
Affiliation(s)
- Fasil Getachew Kebede
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands.
- International Livestock Research Institute, P.O. Box 5689, Addis Ababa, Ethiopia.
| | - Martijn F L Derks
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands
| | - Tadelle Dessie
- International Livestock Research Institute, P.O. Box 5689, Addis Ababa, Ethiopia
| | - Olivier Hanotte
- International Livestock Research Institute, P.O. Box 5689, Addis Ababa, Ethiopia
- School of Life Sciences, The University of Nottingham, Nottingham, NG7 2RD, UK
| | - Carolina Pita Barros
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands
| | - Richard P M A Crooijmans
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands
| | - Hans Komen
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands
| | - John W M Bastiaansen
- Animal Breeding and Genomics, Wageningen University & Research, Droevendaalsesteeg 1, Wageningen, PB-6708, The Netherlands
| |
Collapse
|
6
|
Zhang Y, Zhuang Z, Liu Y, Huang J, Luan M, Zhao X, Dong L, Ye J, Yang M, Zheng E, Cai G, Wu Z, Yang J. Genomic prediction based on preselected single-nucleotide polymorphisms from genome-wide association study and imputed whole-genome sequence data annotation for growth traits in Duroc pigs. Evol Appl 2024; 17:e13651. [PMID: 38362509 PMCID: PMC10868536 DOI: 10.1111/eva.13651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 10/31/2023] [Accepted: 01/13/2024] [Indexed: 02/17/2024] Open
Abstract
The use of whole-genome sequence (WGS) data is expected to improve genomic prediction (GP) power of complex traits because it may contain mutations that in strong linkage disequilibrium pattern with causal mutations. However, a few previous studies have shown no or small improvement in prediction accuracy using WGS data. Incorporating prior biological information into GP seems to be an attractive strategy that might improve prediction accuracy. In this study, a total of 6334 pigs were genotyped using 50K chips and subsequently imputed to the WGS level. This cohort includes two prior discovery populations that comprise 294 Landrace pigs and 186 Duroc pigs, as well as two validation populations that consist of 3770 American Duroc pigs and 2084 Canadian Duroc pigs. Then we used annotation information and genome-wide association study (GWAS) from the WGS data to make GP for six growth traits in two Duroc pig populations. Based on variant annotation, we partitioned different genomic classes, such as intron, intergenic, and untranslated regions, for imputed WGS data. Based on GWAS results of WGS data, we obtained trait-associated single-nucleotide polymorphisms (SNPs). We then applied the genomic feature best linear unbiased prediction (GFBLUP) and genomic best linear unbiased prediction (GBLUP) models to estimate the genomic estimated breeding values for growth traits with these different variant panels, including six genomic classes and trait-associated SNPs. Compared with 50K chip data, GBLUP with imputed WGS data had no increase in prediction accuracy. Using only annotations resulted in no increase in prediction accuracy compared to GBLUP with 50K, but adding annotation information into the GFBLUP model with imputed WGS data could improve the prediction accuracy with increases of 0.00%-2.82%. In conclusion, a GFBLUP model that incorporated prior biological information might increase the advantage of using imputed WGS data for GP.
Collapse
Affiliation(s)
- Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Jinyan Huang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Menghao Luan
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Xiang Zhao
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Linsong Dong
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Jian Ye
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Ming Yang
- College of Animal Science and TechnologyZhongkai University of Agriculture and EngineeringGuangzhouChina
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
- Guangdong Zhongxin Breeding Technology Co., LtdGuangzhouChina
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine IndustrySouth China Agricultural UniversityGuangzhouChina
- Guangdong Provincial Key Laboratory of Agro‐animal Genomics and Molecular BreedingSouth China Agricultural UniversityGuangzhouChina
| |
Collapse
|
7
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
8
|
Zheng M, Liao J, Li Z, Xu Z, Jiang Z, Tan L, Fu R, Xu H, Li Z, Zhang X, Nie Q. Evaluation of the selection of key individuals for genotype imputation in Chinese yellow-feathered chicken. Poult Sci 2023; 102:102901. [PMID: 37499612 PMCID: PMC10393784 DOI: 10.1016/j.psj.2023.102901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 06/02/2023] [Accepted: 06/24/2023] [Indexed: 07/29/2023] Open
Abstract
Genotype imputation is a powerful technique employed by next-generation sequencing (NGS) and genotyping arrays, which can significantly enhance the cost-effectiveness and efficiency of genomic selection. The accuracy of imputation is largely determined by the choice of reference panel, with previous studies generally demonstrating that a closely related population as a reference panel leads to greater accuracy than a more distantly related population. Various strategies have been proposed for selecting desirable individuals via targeted resequencing, but their efficiencies need further improvement. In this study, we present a practical broiler selection methodology for a local Chinese chicken line that integrates established methods based on pedigree, genomics, and random sampling, and leverages genotype and pedigree information from the yellow-plumage dwarf chicken line. The efficacy of these selection strategies was assessed by evaluating their ability to accurately impute masked genotypes from data obtained using a 600K chip. Our findings reveal that the pedigree-based method yields superior accuracy in genotype imputation, whereas the haplotype-based method exhibits greater stability. Nonetheless, the impact of these targeted methods for selecting key individuals is slightly different when initiating a new sequencing project in a production context. Overall, this study highlights the advantages of using the pedigree-based approach as the preferred method for optimizing genotype imputation in broiler chickens.
Collapse
Affiliation(s)
- Ming Zheng
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Jiahao Liao
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Zhuohang Li
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Zhenqiang Xu
- Guangdong Wens Nanfang Poultry Breeding Co., Ltd., Xinxing 527439, China
| | - Ziqin Jiang
- Guangdong Wens Nanfang Poultry Breeding Co., Ltd., Xinxing 527439, China
| | - Liangtian Tan
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Rong Fu
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Haiping Xu
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Zhenhui Li
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Xiquan Zhang
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China
| | - Qinghua Nie
- Lingnan Guangdong Laboratory of Modern Agriculture, College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China; Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou 510642, Guangdong, China; State Key Laboratory of Livestock and Poultry Breeding, South China Agricultural University, Guangzhou 510642, Guangdong, China.
| |
Collapse
|
9
|
Weber SE, Frisch M, Snowdon RJ, Voss-Fels KP. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets. FRONTIERS IN PLANT SCIENCE 2023; 14:1217589. [PMID: 37731980 PMCID: PMC10507710 DOI: 10.3389/fpls.2023.1217589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/21/2023] [Indexed: 09/22/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software "Haploview" and "HaploBlocker". The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no "best" method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Kai P. Voss-Fels
- Institute for Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| |
Collapse
|
10
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
11
|
Zhang R, Zhang Y, Liu T, Jiang B, Li Z, Qu Y, Chen Y, Li Z. Utilizing Variants Identified with Multiple Genome-Wide Association Study Methods Optimizes Genomic Selection for Growth Traits in Pigs. Animals (Basel) 2023; 13:ani13040722. [PMID: 36830509 PMCID: PMC9952664 DOI: 10.3390/ani13040722] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Revised: 02/09/2023] [Accepted: 02/15/2023] [Indexed: 02/22/2023] Open
Abstract
Improving the prediction accuracies of economically important traits in genomic selection (GS) is a main objective for researchers and breeders in the livestock industry. This study aims at utilizing potentially functional SNPs and QTLs identified with various genome-wide association study (GWAS) models in GS of pig growth traits. We used three well-established GWAS methods, including the mixed linear model, Bayesian model and meta-analysis, as well as 60K SNP-chip and whole genome sequence (WGS) data from 1734 Yorkshire and 1123 Landrace pigs to detect SNPs related to four growth traits: average daily gain, backfat thickness, body weight and birth weight. A total of 1485 significant loci and 24 candidate genes which are involved in skeletal muscle development, fatty deposition, lipid metabolism and insulin resistance were identified. Compared with using all SNP-chip data, GS with the pre-selected functional SNPs in the standard genomic best linear unbiased prediction (GBLUP), and a two-kernel based GBLUP model yielded average gains in accuracy by 4 to 46% (from 0.19 ± 0.07 to 0.56 ± 0.07) and 5 to 27% (from 0.16 ± 0.06 to 0.57 ± 0.05) for the four traits, respectively, suggesting that the prioritization of preselected functional markers in GS models had the potential to improve prediction accuracies for certain traits in livestock breeding.
Collapse
Affiliation(s)
- Ruifeng Zhang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China
| | - Yi Zhang
- Institute of Neuroscience, Panzhihua University, Panzhihua 617000, China
| | - Tongni Liu
- Genetic Data Center, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Bo Jiang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China
| | - Zhenyang Li
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China
| | - Youping Qu
- Guangdong IPIG Technology Co., Ltd., Guangzhou 510006, China
| | - Yaosheng Chen
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China
| | - Zhengcao Li
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510006, China
- Correspondence:
| |
Collapse
|
12
|
Wang B, Li P, Hou L, Zhou W, Tao W, Liu C, Liu K, Niu P, Zhang Z, Li Q, Su G, Huang R. Genome-wide association study and genomic prediction for intramuscular fat content in Suhuai pigs using imputed whole-genome sequencing data. Evol Appl 2022; 15:2054-2066. [PMID: 36540634 PMCID: PMC9753832 DOI: 10.1111/eva.13496] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 08/22/2022] [Accepted: 10/04/2022] [Indexed: 11/29/2022] Open
Abstract
Integrating the single-nucleotide polymorphisms (SNPs) significantly affecting target traits from imputed whole-genome sequencing (iWGS) data into the genomic prediction (GP) model is an economic, efficient, and feasible strategy to improve prediction accuracy. The objective was to dissect the genetic architecture of intramuscular fat content (IFC) by genome wide association studies (GWAS) and to investigate the accuracy of GP based on pedigree-based BLUP (PBLUP) model, genomic best linear unbiased prediction (GBLUP) models and Bayesian mixture (BayesMix) models under different strategies. A total of 482 Suhuai pigs were genotyped using an 80 K SNP chip. Furthermore, 30 key samples were selected for resequencing and were used as a reference panel to impute the 80 K chip data to the WGS dataset. The 80 K data and iWGS data were used to perform GWAS and test GP accuracies under different scenarios. GWAS results revealed that there were four major regions affecting IFC. Two important functional candidate genes were found in the two most significant regions, including protein kinase C epsilon (PRKCE) and myosin light chain 2 (MYL2). The results of the predictions showed that the PBLUP model had the lowest reliability (0.096 ± 0.032). The reliability (0.229 ± 0.035) was improved by replacing pedigree information with 80 K chip data. Compared with using 80 K SNPs alone, pruning iWGS SNPs with the R-squared cutoff of linkage disequilibrium (0.55) led to a slight improvement (0.006), adding significant iWGS SNPs led to an improvement of reliability by 0.050 when using a one-component GBLUP, a further increase of 0.033 when using a two-component GBLUP model. For BayesMix models, compared with using 80 K SNPs alone, adding additional significant iWGS SNPs into one- or two-component BayesMix models led to improvements of reliabilities for IFC by 0.040 and 0.089, respectively. Our results may facilitate further identification of causal genes for IFC and may be beneficial for the improvement of IFC in pig breeding programs.
Collapse
Affiliation(s)
- Binbin Wang
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Center for Quantitative Genetics and GenomicsAarhus UniversityAarhusDenmark
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Pinghua Li
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Liming Hou
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Wuduo Zhou
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
| | - Wei Tao
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Chenxi Liu
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Kaiyue Liu
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| | - Peipei Niu
- Huaian AcademyNanjing Agricultural UniversityChina
| | | | - Qiang Li
- Huaiyin Xinhuai Pig Breeding Farm of Huaian CityChina
| | - Guosheng Su
- Center for Quantitative Genetics and GenomicsAarhus UniversityAarhusDenmark
| | - Ruihua Huang
- Key Laboratory in Nanjing for Evaluation and Utilization of Pigs ResourcesMinistry of Agriculture and Rural Areas of China, Institute of Swine Science, Nanjing Agricultural UniversityNanjingChina
- Huaian AcademyNanjing Agricultural UniversityChina
| |
Collapse
|
13
|
Genomic Selection in Chinese Holsteins Using Regularized Regression Models for Feature Selection of Whole Genome Sequencing Data. Animals (Basel) 2022; 12:ani12182419. [PMID: 36139283 PMCID: PMC9495168 DOI: 10.3390/ani12182419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/09/2022] [Accepted: 09/12/2022] [Indexed: 11/30/2022] Open
Abstract
Simple Summary Genomic selection (GS) is increasingly widely used in animal breeding, owing to its high efficiency in the genetic improvement of economic traits. In China, GS has been implemented for genetic evaluation of young bulls in dairy cattle breeding programs since 2012. GS is commonly based on single nucleotide polymorphism (SNP) chips. The cost of whole genome sequencing (WGS) has decreased tremendously in recent years, allowing increased studies of WGS-based GS. In this study, based on the imputed WGS data of approximately 8000 Chinese Holsteins, we investigated the performance of GS of milk production traits using the feature selection method of regularized regression. The results showed that WGS-based GS using regularized regression models and the commonly used linear mixed models achieved comparable prediction accuracies. For milk and protein yields, GS using a combination of SNPs selected with a regularized regression model and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with a linear mixed model combined with 50K SNP chip data performed best for fat yield. The proposed method of GS based on WGS data, i.e., feature selection using regularization regression models, provides a valuable novel strategy for genomic selection. Abstract Genomic selection (GS) is an efficient method to improve genetically economic traits. Feature selection is an important method for GS based on whole-genome sequencing (WGS) data. We investigated the prediction performance of GS of milk production traits using imputed WGS data on 7957 Chinese Holsteins. We used two regularized regression models, least absolute shrinkage and selection operator (LASSO) and elastic net (EN) for feature selection. For comparison, we performed genome-wide association studies based on a linear mixed model (LMM), and the N single nucleotide polymorphisms (SNPs) with the lowest p-values were selected (LMMLASSO and LMMEN), where N was the number of non-zero effect SNPs selected by LASSO or EN. GS was conducted using a genomic best linear unbiased prediction (GBLUP) model and several sets of SNPs: (1) selected WGS SNPs; (2) 50K SNP chip data; (3) WGS data; and (4) a combined set of selected WGS SNPs and 50K SNP chip data. The results showed that the prediction accuracies of GS with features selected using LASSO or EN were comparable to those using features selected with LMMLASSO or LMMEN. For milk and protein yields, GS using a combination of SNPs selected with LASSO and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with LMMLASSO combined with 50K SNP chip data performed best for fat yield. The proposed method, feature selection using regularization regression models, provides a valuable novel strategy for WGS-based GS.
Collapse
|
14
|
Jiang Y, Song H, Gao H, Zhang Q, Ding X. Exploring the optimal strategy of imputation from SNP array to whole-genome sequencing data in farm animals. Front Genet 2022; 13:963654. [PMID: 36092888 PMCID: PMC9459117 DOI: 10.3389/fgene.2022.963654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 08/01/2022] [Indexed: 11/13/2022] Open
Abstract
Genotype imputation from BeadChip to whole-genome sequencing (WGS) data is a cost-effective method of obtaining genotypes of WGS variants. Beagle, one of the most popular imputation software programs, has been widely used for genotype inference in humans and non-human species. A few studies have systematically and comprehensively compared the performance of beagle versions and parameter settings of farm animals. Here, we investigated the imputation performance of three representative versions of Beagle (Beagle 4.1, Beagle 5.0, and Beagle 5.4), and the effective population size (Ne) parameter setting for three species (cattle, pig, and chicken). Six scenarios were investigated to explore the impact of certain key factors on imputation performance. The results showed that the default Ne (1,000,000) is not suitable for livestock and poultry in small reference or low-density arrays of target panels, with 2.47%–10.45% drops in accuracy. Beagle 5 significantly reduced the computation time (4.66-fold–13.24-fold) without an accuracy loss. In addition, using a large combined-reference panel or high-density chip provides greater imputation accuracy, especially for low minor allele frequency (MAF) variants. Finally, a highly significant correlation in the measures of imputation accuracy can be obtained with an MAF equal to or greater than 0.05.
Collapse
Affiliation(s)
- Yifan Jiang
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Hailiang Song
- Beijing Key Laboratory of Fisheries Biotechnology, Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences, Beijing, China
| | - Hongding Gao
- Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, Shandong Agricultural University, Taian, China
| | - Xiangdong Ding
- National Engineering Laboratory for Animal Breeding, Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, China Agricultural University, Beijing, China
- *Correspondence: Xiangdong Ding,
| |
Collapse
|
15
|
Erven JAM, Çakirlar C, Bradley DG, Raemaekers DCM, Madsen O. Imputation of Ancient Whole Genome Sus scrofa DNA Introduces Biases Toward Main Population Components in the Reference Panel. Front Genet 2022; 13:872486. [PMID: 35903348 PMCID: PMC9315352 DOI: 10.3389/fgene.2022.872486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 05/20/2022] [Indexed: 11/13/2022] Open
Abstract
Sequencing ancient DNA to high coverage is often limited by sample quality and cost. Imputing missing genotypes can potentially increase information content and quality of ancient data, but requires different computational approaches than modern DNA imputation. Ancient imputation beyond humans has not been investigated. In this study we report results of a systematic evaluation of imputation of three whole genome ancient Sus scrofa samples from the Early and Late Neolithic (∼7,100-4,500 BP), to test the utility of imputation. We show how issues like genetic architecture and, reference panel divergence, composition and size affect imputation accuracy. We evaluate a variety of imputation methods, including Beagle5, GLIMPSE, and Impute5 with varying filters, pipelines, and variant calling methods. We achieved genotype concordance in most cases reaching above 90%; with the highest being 98% with ∼2,000,000 variants recovered using GLIMPSE. Despite this high concordance the sources of diversity present in the genotypes called in the original high coverage genomes were not equally imputed leading to biases in downstream analyses; a trend toward genotypes most common in the reference panel is observed. This demonstrates that the current reference panel does not possess the full diversity needed for accurate imputation of ancient Sus, due to missing variations from Near Eastern and Mesolithic wild boar. Imputation of ancient Sus scrofa holds potential but should be approached with caution due to these biases, and suggests that there is no universal approach for imputation of non-human ancient species.
Collapse
Affiliation(s)
- J. A. M. Erven
- Groningen Institute of Archaeology, University of Groningen, Groningen, Netherlands
| | - C. Çakirlar
- Groningen Institute of Archaeology, University of Groningen, Groningen, Netherlands
| | - D. G. Bradley
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin, Ireland
| | - D. C. M. Raemaekers
- Groningen Institute of Archaeology, University of Groningen, Groningen, Netherlands
| | - O. Madsen
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, Netherlands
| |
Collapse
|
16
|
Ren D, Cai X, Lin Q, Ye H, Teng J, Li J, Ding X, Zhang Z. Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation. Genet Sel Evol 2022; 54:47. [PMID: 35761182 PMCID: PMC9235212 DOI: 10.1186/s12711-022-00737-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 06/15/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Compared to medium-density single nucleotide polymorphism (SNP) data, high-density SNP data contain abundant genetic variants and provide more information for the genetic evaluation of livestock, but it has been shown that they do not confer any advantage for genomic prediction and heritability estimation. One possible reason is the uneven distribution of the linkage disequilibrium (LD) along the genome, i.e., LD heterogeneity among regions. The aim of this study was to effectively use genome-wide SNP data for genomic prediction and heritability estimation by using models that control LD heterogeneity among regions. METHODS The LD-adjusted kinship (LDAK) and LD-stratified multicomponent (LDS) models were used to control LD heterogeneity among regions and were compared with the classical model that has no such control. Simulated and real traits of 2000 dairy cattle individuals with imputed high-density (770K) SNP data were used. Five types of phenotypes were simulated, which were controlled by very strongly, strongly, moderately, weakly and very weakly tagged causal variants, respectively. The performances of the models with high- and medium-density (50K) panels were compared to verify that the models that controlled LD heterogeneity among regions were more effective with high-density data. RESULTS Compared to the medium-density panel, the use of the high-density panel did not improve and even decreased prediction accuracies and heritability estimates from the classical model for both simulated and real traits. Compared to the classical model, LDS effectively improved the accuracy of genomic predictions and unbiasedness of heritability estimates, regardless of the genetic architecture of the trait. LDAK applies only to traits that are mainly controlled by weakly tagged causal variants, but is still less effective than LDS for this type of trait. Compared with the classical model, LDS improved prediction accuracy by about 13% for simulated phenotypes and by 0.3 to ~ 10.7% for real traits with the high-density panel, and by ~ 1% for simulated phenotypes and by - 0.1 to ~ 6.9% for real traits with the medium-density panel. CONCLUSIONS Grouping SNPs based on regional LD to construct the LD-stratified multicomponent model can effectively eliminate the adverse effects of LD heterogeneity among regions, and greatly improve the efficiency of high-density SNP data for genomic prediction and heritability estimation.
Collapse
Affiliation(s)
- Duanyang Ren
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Xiaodian Cai
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Qing Lin
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Haoqiang Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Jinyan Teng
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture and Rural Affairs, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
17
|
Fu W, Wang R, Xu N, Wang J, Li R, Asadollahpour Nanaei H, Nie Q, Zhao X, Han J, Yang N, Jiang Y. Galbase: a comprehensive repository for integrating chicken multi-omics data. BMC Genomics 2022; 23:364. [PMID: 35549894 PMCID: PMC9097087 DOI: 10.1186/s12864-022-08598-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 04/28/2022] [Indexed: 11/10/2022] Open
Abstract
Background Multi-omics data can provide a stereoscopic view to explore potential causal variations and genes, as well as underlying genetic mechanisms of complex traits. However, for many non-mammalian species, including chickens, these resources are poorly integrated and reused, greatly limiting genetic research and breeding processes of the species. Results Here, we constructed Galbase, an easily accessible repository that integrates public chicken multi-omics data from 928 re-sequenced genomes, 429 transcriptomes, 379 epigenomes, 15,275 QTL entries, and 7,526 associations. A total of 21.67 million SNPs, 2.71 million InDels, and 488,583 cis-regulatory elements were included. Galbase allows users to retrieve genomic variations in geographical maps, gene expression profiling in heatmaps, and epigenomic signals in peak patterns. It also provides modules for batch annotation of genes, regions, and loci based on multi-layered omics data. Additionally, a series of convenient tools, including the UCSC Genome Browser, WashU Epigenome Browser, BLAT, BLAST, and LiftOver, were also integrated to facilitate search, visualization, and analysis of sequence features. Conclusion Galbase grants new opportunities to research communities to undertake in-depth functional genomic studies on chicken. All features of Galbase make it a useful resource to identify genetic variations responsible for chicken complex traits. Galbase is publicly available at http://animal.nwsuaf.edu.cn/ChickenVar. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08598-2.
Collapse
Affiliation(s)
- Weiwei Fu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Rui Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Naiyi Xu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Jinxin Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Ran Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Hojjat Asadollahpour Nanaei
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Qinghua Nie
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, 510642, Guangdong, China
| | - Xin Zhao
- Department of Animal Science, McGill University, Montreal, Québec, Canada
| | - Jianlin Han
- CAAS-ILRI Joint Laboratory On Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.,Livestock Genetics Program, International Livestock Research Institute (ILRI), Nairobi, Kenya
| | - Ning Yang
- National Engineering Laboratory for Animal Breeding and Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, China Agricultural University, Beijing, 100193, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China. .,Center for Functional Genomics, Institute of Future Agriculture, Northwest A&F University, Yangling, China.
| |
Collapse
|
18
|
Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle. Animals (Basel) 2021; 11:ani11071992. [PMID: 34359120 PMCID: PMC8300388 DOI: 10.3390/ani11071992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/27/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022] Open
Abstract
Simple Summary The usefulness of genomic prediction (GP) has been widely proofed by breeding analysis in livestock, plants and aquatic populations. It is well known that ‘marker density’ is a critical factor that affects the accuracy of GP, however, how to properly measure ‘marker density’ in GP is yet to be determined. With population-level whole-genome sequence data or high-density single nucleotide polymorphism (SNP) data available, this question seems to be answered more convincingly. In this study, we investigated and discussed the impact of four ‘marker density’ measures that reflect genetic or physical distances between SNPs on the accuracy of GP in a Germany Holstein dairy cattle population. Our results showed that the degree of variation of physical distance between adjacent SNPs had significant effects on the accuracy of GP, while the genetic distance between SNPs had no relationship with the accuracy of GP. Therefore, for studies based on high-density SNP data, the default strategy of pruning SNPs based on genetic distance is detrimental to heritability estimation and genomic prediction. The results extended the communities knowledge of ‘marker density’ and provided useful suggestions for the application and research on genome prediction. Abstract With the availability of high-density single-nucleotide polymorphism (SNP) data and the development of genotype imputation methods, high-density panel-based genomic prediction (GP) has become possible in livestock breeding. It is generally considered that the genomic estimated breeding value (GEBV) accuracy increases with the marker density, while studies have shown that the GEBV accuracy does not increase or even decrease when high-density panels were used. Therefore, in addition to the SNP number, other measurements of ‘marker density’ seem to have impacts on the GEBV accuracy, and exploring the relationship between the GEBV accuracy and the measurements of ‘marker density’ based on high-density SNP or whole-genome sequence data is important for the field of GP. In this study, we constructed different SNP panels with certain SNP numbers (e.g., 1 k) by using the physical distance (PhyD), genetic distance (GenD) and random distance (RanD) between SNPs respectively based on the high-density SNP data of a Germany Holstein dairy cattle population. Therefore, there are three different panels at a certain SNP number level. These panels were used to construct GP models to predict fat percentage, milk yield and somatic cell score. Meanwhile, the mean (d¯) and variance (σd2) of the physical distance between SNPs and the mean (r2¯) and variance (σr22) of the genetic distance between SNPs in each panel were used as marker density-related measurements and their influence on the GEBV accuracy was investigated. At the same SNP number level, the d¯ of all panels is basically the same, but the σd2, r2¯ and σr22 are different. Therefore, we only investigated the effects of σd2, r2¯ and σr22 on the GEBV accuracy. The results showed that at a certain SNP number level, the GEBV accuracy was negatively correlated with σd2, but not with r2¯ and σr22. Compared with GenD and RanD, the σd2 of panels constructed by PhyD is smaller. The low and moderate-density panels (< 50 k) constructed by RanD or GenD have large σd2, which is not conducive to genomic prediction. The GEBV accuracy of the low and moderate-density panels constructed by PhyD is 3.8~34.8% higher than that of the low and moderate-density panels constructed by RanD and GenD. Panels with 20–30 k SNPs constructed by PhyD can achieve the same or slightly higher GEBV accuracy than that of high-density SNP panels for all three traits. In summary, the smaller the variation degree of physical distance between adjacent SNPs, the higher the GEBV accuracy. The low and moderate-density panels construct by physical distance are beneficial to genomic prediction, while pruning high-density SNP data based on genetic distance is detrimental to genomic prediction. The results provide suggestions for the development of SNP panels and the research of genome prediction based on whole-genome sequence data.
Collapse
|
19
|
Al-Khudhair A, VanRaden PM, Null DJ, Li B. Marker selection and genomic prediction of economically important traits using imputed high-density genotypes for 5 breeds of dairy cattle. J Dairy Sci 2021; 104:4478-4485. [PMID: 33612229 DOI: 10.3168/jds.2020-19260] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Accepted: 11/22/2020] [Indexed: 11/19/2022]
Abstract
Marker sets used in US dairy genomic predictions were previously expanded by including high-density (HD) or sequence markers with the largest effects for Holstein breed only. Other non-Holstein breeds lacked enough HD genotyped animals to be used as a reference population at that time, and thus were not included in the genomic prediction. Recently, numbers of non-Holstein breeds genotyped using HD panels reached an acceptable level for imputation and marker selection, allowing HD genomic prediction and HD marker selection for Holstein plus 4 other breeds. Genotypes for 351,461 Holsteins, 347,570 Jerseys, 42,346 Brown Swiss, 9,364 Ayrshires (including Red dairy cattle), and 4,599 Guernseys were imputed to the HD marker list that included 643,059 SNP. The separate HD reference populations included Illumina BovineHD (San Diego, CA) genotypes for 4,012 Holsteins, 407 Jerseys, 181 Brown Swiss, 527 Ayrshires, and 147 Guernseys. The 643,059 variants included the HD SNP and all 79,254 (80K) genetic markers and QTL used in routine national genomic evaluations. Before imputation, approximately 91 to 97% of genotypes were unknown for each breed; after imputation, 1.1% of Holstein, 3.2% of Jersey, 6.7% of Brown Swiss, 4.8% of Ayrshire, and 4.2% of Guernsey alleles remained unknown due to lower density haplotypes that had no matching HD haplotype. The higher remaining missing rates in non-Holstein breeds are mainly due to fewer HD genotyped animals in the imputation reference populations. Allele effects for up to 39 traits were estimated separately within each breed using phenotypic reference populations that included up to 6,157 Jersey males and 110,130 Jersey females. Correlations of HD with 80K genomic predictions for young animals averaged 0.986, 0.989, 0.985, 0.992, and 0.978 for Jersey, Ayrshire, Brown Swiss, Guernsey, and Holstein breeds, respectively. Correlations were highest for yield traits (about 0.991) and lowest for foot angle and rear legs-side view (0.981and 0.982, respectively). Some HD effects were more than twice as large as the largest 80K SNP effect, and HD markers had larger effects than nearby 80K markers for many breed-trait combinations. Previous studies selected and included markers with large effects for Holstein traits; the newly selected HD markers should also improve non-Holstein and crossbred genomic predictions and were added to official US genomic predictions in April 2020.
Collapse
Affiliation(s)
- A Al-Khudhair
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - P M VanRaden
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350.
| | - D J Null
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| | - B Li
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705-2350
| |
Collapse
|
20
|
Ye S, Li J, Zhang Z. Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2020; 11:109. [PMID: 33292577 PMCID: PMC7708144 DOI: 10.1186/s40104-020-00515-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/22/2020] [Indexed: 12/02/2022] Open
Abstract
Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China.
| |
Collapse
|
21
|
Huang X, Otecko NO, Peng M, Weng Z, Li W, Chen J, Zhong M, Zhong F, Jin S, Geng Z, Luo W, He D, Ma C, Han J, Ommeh SC, Zhang Y, Zhang X, Du B. Genome-wide genetic structure and selection signatures for color in 10 traditional Chinese yellow-feathered chicken breeds. BMC Genomics 2020; 21:316. [PMID: 32312230 PMCID: PMC7171827 DOI: 10.1186/s12864-020-6736-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 04/15/2020] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Yellow-feathered chickens (YFCs) have a long history in China. They are well-known for the nutritional and commercial importance attributable to their yellow color phenotype. Currently, there is a huge paucity in knowledge of the genetic determinants responsible for phenotypic and biochemical properties of these iconic chickens. This study aimed to uncover the genetic structure and the molecular underpinnings of the YFCs trademark coloration. RESULTS The whole-genomes of 100 YFCs from 10 major traditional breeds and 10 Huaibei partridge chickens from China were re-sequenced. Comparative population genomics based on autosomal single nucleotide polymorphisms (SNPs) revealed three geographically based clusters among the YFCs. Compared to other Chinese indigenous chicken genomes incorporated from previous studies, a closer genetic proximity within YFC breeds than between YFC breeds and other chicken populations is evident. Through genome-wide scans for selective sweeps, we identified RALY heterogeneous nuclear ribonucleoprotein (RALY), leucine rich repeat containing G protein-coupled receptor 4 (LGR4), solute carrier family 23 member 2 (SLC23A2), and solute carrier family 2 member 14 (SLC2A14), besides the classical beta-carotene dioxygenase 2 (BCDO2), as major candidates pigment determining genes in the YFCs. CONCLUSION We provide the first comprehensive genomic data of the YFCs. Our analyses show phylogeographical patterns among the YFCs and potential candidate genes giving rise to the yellow color trait of the YFCs. This study lays the foundation for further research on the genome-phenotype cross-talks that define important poultry traits and for formulating genetic breeding and conservation strategies for the YFCs.
Collapse
Affiliation(s)
- Xunhe Huang
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China
| | - Newton O Otecko
- State Key Laboratory of Genetic Resources and Evolution and Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China
| | - Minsheng Peng
- State Key Laboratory of Genetic Resources and Evolution and Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China
| | - Zhuoxian Weng
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China.,College of Animal Science and Technology, Hunan Agricultural University, Changsha, 410128, China
| | - Weina Li
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China
| | - Jiebo Chen
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China
| | - Ming Zhong
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China
| | - Fusheng Zhong
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China
| | - Sihua Jin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
| | - Zhaoyu Geng
- College of Animal Science and Technology, Anhui Agricultural University, Hefei, 230036, China
| | - Wei Luo
- College of Animal Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Danlin He
- College of Animal Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Cheng Ma
- State Key Laboratory of Genetic Resources and Evolution and Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China
| | - Jianlin Han
- CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, 100193, China.,International Livestock Research Institute (ILRI), Nairobi, 30709-00100, Kenya
| | - Sheila C Ommeh
- Animal Biotechnology Group, Institute For Biotechnology Research, Jomo Kenyatta University of Agriculture and Technology, Nairobi, 62000-00200, Kenya
| | - Yaping Zhang
- State Key Laboratory of Genetic Resources and Evolution and Yunnan Laboratory of Molecular Biology of Domestic Animals, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China. .,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China. .,State Key Laboratory for Conservation and Utilization of Bio-resources in Yunnan, Yunnan University, Kunming, 650091, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| | - Xiquan Zhang
- College of Animal Sciences, South China Agricultural University, Guangzhou, 510642, China.
| | - Bingwang Du
- Guangdong Provincial Key Laboratory of Conservation and Precision Utilization of Characteristic Agricultural Resources in Mountainous Areas, Guangdong Innovation Centre for Science and Technology of Wuhua Yellow Chicken, School of Life Science of Jiaying University, Meizhou, 514015, China.
| |
Collapse
|
22
|
Ye S, Song H, Ding X, Zhang Z, Li J. Pre-selecting markers based on fixation index scores improved the power of genomic evaluations in a combined Yorkshire pig population. Animal 2020; 14:1555-1564. [PMID: 32209149 DOI: 10.1017/s1751731120000506] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Combining different swine populations in genomic prediction can be an important tool, leading to an increased accuracy of genomic prediction using single nucleotide polymorphism (SNP) chip data compared with within-population genomic. However, the expected higher accuracy of multi-population genomic prediction has not been realized. This may be due to an inconsistent linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTL) across populations, and the weak genetic relationships across populations. In this study, we determined the impact of different genomic relationship matrices, SNP density and pre-selected variants on prediction accuracy using a combined Yorkshire pig population. Our objective was to provide useful strategies for improving the accuracy of genomic prediction within a combined population. Results showed that the accuracy of genomic best linear unbiased prediction (GBLUP) using imputed whole-genome sequencing (WGS) data in the combined population was always higher than that within populations. Furthermore, the use of imputed WGS data always resulted in a higher accuracy of GBLUP than the use of 80K chip data for the combined population. Additionally, the accuracy of GBLUP with a non-linear genomic relationship matrix was markedly increased (0.87% to 15.17% for 80K chip data, and 0.43% to 4.01% for imputed WGS data) compared with that obtained with a linear genomic relationship matrix, except for the prediction of XD population in the combined population using imputed WGS data. More importantly, the application of pre-selected variants based on fixation index (Fst) scores improved the accuracy of multi-population genomic prediction, especially for 80K chip data. For BLUP|GA (BLUP approach given the genetic architecture), the use of a linear method with an appropriate weight to build a weight-relatedness matrix led to a higher prediction accuracy compared with the use of only pre-selected SNPs for genomic evaluations, especially for the total number of piglets born. However, for the non-linear method, BLUP|GA showed only a small increase or even a decrease in prediction accuracy compared with the use of only pre-selected SNPs. Overall, the best genomic evaluation strategy for reproduction-related traits for a combined population was found to be GBLUP performed with a non-linear genomic relationship matrix using variants pre-selected from the 80K chip data based on Fst scores.
Collapse
Affiliation(s)
- S Ye
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| | - H Song
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, No. 2, Yuanmingyuan West Road, Haidian District, 100193Beijing, China
| | - X Ding
- Key Laboratory of Animal Genetics and Breeding of Ministry of Agriculture, National Engineering Laboratory of Animal Breeding, College of Animal Science and Technology, China Agricultural University, No. 2, Yuanmingyuan West Road, Haidian District, 100193Beijing, China
| | - Z Zhang
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| | - J Li
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, No. 483, Wushan Road, Tianhe District, 510642Guangzhou, China
| |
Collapse
|
23
|
Liu T, Luo C, Ma J, Wang Y, Shu D, Su G, Qu H. High-Throughput Sequencing With the Preselection of Markers Is a Good Alternative to SNP Chips for Genomic Prediction in Broilers. Front Genet 2020; 11:108. [PMID: 32174971 PMCID: PMC7056902 DOI: 10.3389/fgene.2020.00108] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Accepted: 01/30/2020] [Indexed: 11/13/2022] Open
Abstract
The choice of a genetic marker genotyping platform is important for genomic prediction in livestock and poultry. High-throughput sequencing can produce more genetic markers, but the genotype quality is lower than that obtained with single nucleotide polymorphism (SNP) chips. The aim of this study was to compare the accuracy of genomic prediction between high-throughput sequencing and SNP chips in broilers. In this study, we developed a new SNP marker screening method, the pre-marker-selection (PMS) method, to determine whether an SNP marker can be used for genomic prediction. We also compared a method which preselection marker based results from genome-wide association studies (GWAS). With the two methods, we analysed body weight at the12th week (BW) and feed conversion ratio (FCR) in a local broiler population. A total of 395 birds were selected from the F2 generation of the population, and 10X specific-locus amplified fragment sequencing (SLAF-seq) and the Illumina Chicken 60K SNP Beadchip were used for genotyping. The genomic best linear unbiased prediction method (GBLUP) was used to predict the genomic breeding values. The accuracy of genomic prediction was validated by the leave-one-out cross-validation method. Without SNP marker screening, the accuracies of the genomic estimated breeding value (GEBV) of BW and FCR were 0.509 and 0.249, respectively, when using SLAF-seq, and the accuracies were 0.516 and 0.232, respectively, when using the SNP chip. With SNP marker screening by the PMS method, the accuracies of GEBV of the two traits were 0.671 and 0.499, respectively, when using SLAF-seq, and 0.605 and 0.422, respectively, when using the SNP chip. Our SNP marker screening method led to an increase of prediction accuracy by 0.089-0.250. With SNP marker screening by the GWAS method, the accuracies of genomic prediction for the two traits were also improved, but the gains of accuracy were less than the gains with PMS method for all traits. The results from this study indicate that our PMS method can improve the accuracy of GEBV, and that more accurate genomic prediction can be obtained from an increased number of genomic markers when using high-throughput sequencing in local broiler populations. Due to its lower genotyping cost, high-throughput sequencing could be a good alternative to SNP chips for genomic prediction in breeding programmes of local broiler populations.
Collapse
Affiliation(s)
- Tianfei Liu
- State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Chenglong Luo
- State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Jie Ma
- Guangdong Provincial Key Laboratory of Animal Breeding and Nutrition, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Yan Wang
- State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Dingming Shu
- State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, Tjele, Denmark
| | - Hao Qu
- State Key Laboratory of Livestock and Poultry Breeding, Institute of Animal Science, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| |
Collapse
|