1
|
Zhao Z, Niu Q, Wu J, Wu T, Xie X, Wang Z, Zhang L, Gao H, Gao X, Xu L, Zhu B, Li J. Integrating multi-layered biological priors to improve genomic prediction accuracy in beef cattle. Biol Direct 2024; 19:147. [PMID: 39741345 DOI: 10.1186/s13062-024-00574-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Accepted: 12/02/2024] [Indexed: 01/02/2025] Open
Abstract
BACKGROUND Integrating multi-layered information can enhance the accuracy of genomic prediction for complex traits. However, the improvement and application of effective strategies for genomic prediction (GP) using multi-omics data remains challenging. METHODS We generated 11 feature sets for sequencing variants from genomics, transcriptomics, metabolomics, and epigenetics data in beef cattle, then we assessed the contribution of functional variants using genomic restricted maximum likelihood (GREML). We next estimated and ranked variant scores for 43 economically important traits, and compared the prediction accuracy of the top and bottom sets using genomic best linear unbiased prediction (GBLUP) and BayesB model. In addition, we annotated the variants from GWAS with functional feature sets and performed enrichment analysis. RESULTS We observed significant enrichments for 32 functional categories in 11 feature sets. The evolutionary related sets (conservation regions and selection signatures) contributed significantly to heritability (31.78-fold and 14.48-fold enrichment), while metabolomics and transcriptomics showed low heritability enrichments. We observed a significant increase in prediction accuracy using the top feature set variants compared to whole-genome sequencing (WGS) data. The prediction accuracy based on the top 10% variant set showed an average increase of 11.6% and 7.54% using BayesB and GBLUP across traits, respectively. Notably, the greatest increase of 31.52% was obtained for spleen weight (SW) using BayesB. Also, we found that the top 10% of variants show strong enrichment with weight related QTLs based on the Cattle QTL database. CONCLUSIONS Our findings suggest that integrating biological prior information from multiple layers can enhance our understanding of the genetic architecture underlying complex traits and further improve genomic prediction in beef cattle.
Collapse
Affiliation(s)
- Zhida Zhao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Qunhao Niu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jiayuan Wu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Tianyi Wu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Xueyuan Xie
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Zezhao Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lupei Zhang
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Huijiang Gao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Xue Gao
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lingyang Xu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| | - Bo Zhu
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
- Northern Agriculture and Livestock Husbandry Technology Innovation Center, Hohhot, 010010, China.
| | - Junya Li
- Key Laboratory of Animal Genetics Breeding and Reproduction, Ministry of Agriculture and Rural Affairs, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| |
Collapse
|
2
|
Li M, Hall T, MacHugh DE, Chen L, Garrick D, Wang L, Zhao F. KPRR: a novel machine learning approach for effectively capturing nonadditive effects in genomic prediction. Brief Bioinform 2024; 26:bbae683. [PMID: 39749663 PMCID: PMC11695904 DOI: 10.1093/bib/bbae683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 12/08/2024] [Accepted: 12/13/2024] [Indexed: 01/04/2025] Open
Abstract
Nonadditive genetic effects pose significant challenges to traditional genomic selection methods for quantitative traits. Machine learning approaches, particularly kernel-based methods, offer promising solutions to overcome these limitations. In this study, we developed a novel machine learning method, KPRR, which integrated a polynomial kernel into ridge regression to effectively capture nonadditive genetic effects. The predictive performance and computational efficiency of KPRR were evaluated using six datasets from various species, encompassing a total of 18 traits. All the traits were known to be influenced by additive, dominance, or epistatic genetic effects. We compared the performance of KPRR against six other genomic prediction methods: SPVR, BayesB, GBLUP, GEBLUP, GDBLUP, and DeepGS. For datasets dominated by additive effects, KPRR achieved superior prediction accuracies in the wheat dataset and comparable performance in the cattle dataset when compared to GBLUP. For datasets influenced by dominance effects, KPRR matched GDBLUP in accuracies in the pig dataset and outperformed GDBLUP in the sheep dataset. For datasets exhibiting epistatic effects, KPRR outperformed other methods in some traits, while BayesB showed superior performance in others. Incorporating nonadditive effects into a GBLUP model led to overall improvements in prediction accuracy. Regarding computational efficiency, KPRR was consistently the fastest, while BayesB was the slowest. Our findings demonstrated that KPRR provided significant advantages over traditional genomic prediction methods in capturing nonadditive effects.
Collapse
Affiliation(s)
- Mianyan Li
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Yuanmingyuan West Road, Beijing, 100193, China
- Animal Genomics Laboratory, UCD School of Agriculture and Food Science, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland
| | - Thomas Hall
- Animal Genomics Laboratory, UCD School of Agriculture and Food Science, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland
| | - David E MacHugh
- Animal Genomics Laboratory, UCD School of Agriculture and Food Science, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland
- UCD Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland
- UCD One Health Centre, University College Dublin, Belfield, Dublin, D04 V1W8, Ireland
| | - Liang Chen
- The Affiliated High School of Peking University, Daniwan Road, Beijing, 100190, China
| | - Dorian Garrick
- Theta Solutions LLC., Hot Springs Road, Katikati, 3178, New Zealand
| | - Lixian Wang
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Yuanmingyuan West Road, Beijing, 100193, China
| | - Fuping Zhao
- State Key Laboratory of Animal Biotech Breeding, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Yuanmingyuan West Road, Beijing, 100193, China
| |
Collapse
|
3
|
Cheng H, Zhang ZY, Han H, Wei R, Zhao W, Sun YC, Xu BB, Hou XL, Wang JL, He YQ, Fu Y, Wang QS, Pan YC, Zhang Z, Wang Z. Cross-ancestry meta-genome-wide association studies provide insights to the understanding of semen traits in pigs. Animal 2024; 18:101331. [PMID: 39405960 DOI: 10.1016/j.animal.2024.101331] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 08/30/2024] [Accepted: 09/03/2024] [Indexed: 11/18/2024] Open
Abstract
Semen traits play a crucial role in pig reproduction and fertility. However, limited data availability hinder a comprehensive understanding of the genetic mechanisms underlying these traits. In this study, we integrated 597 299 ejaculates and 3 596 sequence data to identify genetic variants and candidate genes related to four semen traits, including sperm progressive motility (MOT), semen volume, sperm concentration (CON), and effective sperm count (SUM). A cross-ancestry meta-genome-wide association study was conducted to detect 163 lead single nucleotide polymorphisms (SNPs) associated with MOT, CON, and SUM. Subsequently, transcriptome-wide association studies and colocalisation analyses were integrated to identify 176 candidate genes, many of which have documented roles in spermatogenesis or male mammal semen traits. Our analysis highlighted the potential involvement of CSM5, PDZD9, and LDAF1 in regulating semen traits through multiple methods. Finally, to validate the function of significant SNPs, we performed genomic feature best linear unbiased prediction in 348 independent pigs using identified trait-related SNP subsets as genomic features. We found that integrating the top 0.1, 1, and 5% significant SNPs as genomic features could enhance genomic prediction accuracy for CON and MOT compared to traditional genomic best linear unbiased prediction. This study contributes to a comprehensive understanding of the genetic mechanisms of boar semen traits and provides insight for developing genomic selection models.
Collapse
Affiliation(s)
- H Cheng
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
| | - Z Y Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
| | - H Han
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
| | - R Wei
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
| | - W Zhao
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - Y C Sun
- Haidian Foreign Language Academy, Beijing 100195, China
| | - B B Xu
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - X L Hou
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - J L Wang
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - Y Q He
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - Y Fu
- SciGene Biotechnology Co., Ltd., Hefei 230031, China
| | - Q S Wang
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China; Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya 572000, China
| | - Y C Pan
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China; Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya 572000, China
| | - Z Zhang
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China
| | - Z Wang
- College of Animal Sciences, Zhejiang University, Hangzhou 310058, China; Key Laboratory of Livestock and Poultry Resources Evaluation and Utilization, Ministry of Agriculture and Rural Affairs, Hangzhou 310058, China.
| |
Collapse
|
4
|
Xu D, Chan WH, Haron H. Enhancing infectious disease prediction model selection with multi-objective optimization: an empirical study. PeerJ Comput Sci 2024; 10:e2217. [PMID: 39145229 PMCID: PMC11323180 DOI: 10.7717/peerj-cs.2217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Accepted: 07/04/2024] [Indexed: 08/16/2024]
Abstract
As the pandemic continues to pose challenges to global public health, developing effective predictive models has become an urgent research topic. This study aims to explore the application of multi-objective optimization methods in selecting infectious disease prediction models and evaluate their impact on improving prediction accuracy, generalizability, and computational efficiency. In this study, the NSGA-II algorithm was used to compare models selected by multi-objective optimization with those selected by traditional single-objective optimization. The results indicate that decision tree (DT) and extreme gradient boosting regressor (XGBoost) models selected through multi-objective optimization methods outperform those selected by other methods in terms of accuracy, generalizability, and computational efficiency. Compared to the ridge regression model selected through single-objective optimization methods, the decision tree (DT) and XGBoost models demonstrate significantly lower root mean square error (RMSE) on real datasets. This finding highlights the potential advantages of multi-objective optimization in balancing multiple evaluation metrics. However, this study's limitations suggest future research directions, including algorithm improvements, expanded evaluation metrics, and the use of more diverse datasets. The conclusions of this study emphasize the theoretical and practical significance of multi-objective optimization methods in public health decision support systems, indicating their wide-ranging potential applications in selecting predictive models.
Collapse
Affiliation(s)
- Deren Xu
- Faculty of Computing, Universiti Teknologi Malaysia, Faculty of Computing, Johor, Johor Bahru, Malaysia
| | - Weng Howe Chan
- Universiti Teknologi Malaysia, UTM Big Data Centre, Ibnu Sina Institute For Scientific and Industrial Resarch, Universiti Teknologi Malaysia, Johor, Johor Bahru, Malaysia
| | - Habibollah Haron
- Faculty of Computing, Universiti Teknologi Malaysia, Faculty of Computing, Johor, Johor Bahru, Malaysia
| |
Collapse
|
5
|
Yin C, Shi H, Zhou P, Wang Y, Tao X, Yin Z, Zhang X, Liu Y. Genomic Prediction of Growth Traits in Yorkshire Pigs of Different Reference Group Sizes Using Different Estimated Breeding Value Models. Animals (Basel) 2024; 14:1098. [PMID: 38612337 PMCID: PMC11010886 DOI: 10.3390/ani14071098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 03/31/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
The need for sufficient reference population data poses a significant challenge in breeding programs aimed at improving pig farming on a small to medium scale. To overcome this hurdle, investigating the advantages of combing reference populations of varying sizes is crucial for enhancing the accuracy of the genomic estimated breeding value (GEBV). Genomic selection (GS) in populations with limited reference data can be optimized by combining populations of the same breed or related breeds. This study focused on understanding the effect of combing different reference group sizes on the accuracy of GS for determining the growth effectiveness and percentage of lean meat in Yorkshire pigs. Specifically, our study investigated two important traits: the age at 100 kg live weight (AGE100) and the backfat thickness at 100 kg live weight (BF100). This research assessed the efficiency of genomic prediction (GP) using different GEBV models across three Yorkshire populations with varying genetic backgrounds. The GeneSeek 50K GGP porcine high-density array was used for genotyping. A total of 2295 Yorkshire pigs were included, representing three Yorkshire pig populations with different genetic backgrounds-295 from Danish (small) lines from Huaibei City, Anhui Province, 500 from Canadian (medium) lines from Lixin County, Anhui Province, and 1500 from American (large) lines from Shanghai. To evaluate the impact of different population combination scenarios on the GS accuracy, three approaches were explored: (1) combining all three populations for prediction, (2) combining two populations to predict the third, and (3) predicting each population independently. Five GEBV models, including three Bayesian models (BayesA, BayesB, and BayesC), the genomic best linear unbiased prediction (GBLUP) model, and single-step GBLUP (ssGBLUP) were implemented through 20 repetitions of five-fold cross-validation (CV). The results indicate that predicting one target population using the other two populations yielded the highest accuracy, providing a novel approach for improving the genomic selection accuracy in Yorkshire pigs. In this study, it was found that using different populations of the same breed to predict small- and medium-sized herds might be effective in improving the GEBV. This investigation highlights the significance of incorporating population combinations in genetic models for predicting the breeding value, particularly for pig farmers confronted with resource limitations.
Collapse
Affiliation(s)
- Chang Yin
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Haoran Shi
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Peng Zhou
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Yuwei Wang
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Xuzhe Tao
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| | - Zongjun Yin
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China; (Z.Y.); (X.Z.)
| | - Xiaodong Zhang
- College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, China; (Z.Y.); (X.Z.)
| | - Yang Liu
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (C.Y.); (H.S.); (P.Z.); (Y.W.); (X.T.)
| |
Collapse
|
6
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
7
|
Bharati R, Sen MK, Severová L, Svoboda R, Fernández-Cusimamani E. Polyploidization and genomic selection integration for grapevine breeding: a perspective. FRONTIERS IN PLANT SCIENCE 2023; 14:1248978. [PMID: 38034577 PMCID: PMC10684766 DOI: 10.3389/fpls.2023.1248978] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 10/30/2023] [Indexed: 12/02/2023]
Abstract
Grapevines are economically important woody perennial crops widely cultivated for their fruits that are used for making wine, grape juice, raisins, and table grapes. However, grapevine production is constantly facing challenges due to climate change and the prevalence of pests and diseases, causing yield reduction, lower fruit quality, and financial losses. To ease the burden, continuous crop improvement to develop superior grape genotypes with desirable traits is imperative. Polyploidization has emerged as a promising tool to generate genotypes with novel genetic combinations that can confer desirable traits such as enhanced organ size, improved fruit quality, and increased resistance to both biotic and abiotic stresses. While previous studies have shown high polyploid induction rates in Vitis spp., rigorous screening of genotypes among the produced polyploids to identify those exhibiting desired traits remains a major bottleneck. In this perspective, we propose the integration of the genomic selection approach with omics data to predict genotypes with desirable traits among the vast unique individuals generated through polyploidization. This integrated approach can be a powerful tool for accelerating the breeding of grapevines to develop novel and improved grapevine varieties.
Collapse
Affiliation(s)
- Rohit Bharati
- Department of Crop Sciences and Agroforestry, The Faculty of Tropical AgriSciences, Czech University of Life Sciences Prague, Suchdol, Czechia
| | - Madhab Kumar Sen
- Department of Agroecology and Crop Production, Faculty of Agrobiology, Food and Natural Resources, Czech University of Life Sciences Prague, Suchdol, Czechia
| | - Lucie Severová
- Department of Economic Theories, Faculty of Economics and Management, Czech University of Life Sciences Prague, Prague, Czechia
| | - Roman Svoboda
- Department of Economic Theories, Faculty of Economics and Management, Czech University of Life Sciences Prague, Prague, Czechia
| | - Eloy Fernández-Cusimamani
- Department of Crop Sciences and Agroforestry, The Faculty of Tropical AgriSciences, Czech University of Life Sciences Prague, Suchdol, Czechia
| |
Collapse
|
8
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
9
|
Ma S, Li P, Liu H, Xi Y, Xu Q, Qi J, Wang J, Li L, Wang J, Hu J, He H, Han C, Bai L. Genome-wide association analysis of the primary feather growth traits of duck: identification of potential Loci for growth regulation. Poult Sci 2022; 102:102243. [PMID: 36334470 PMCID: PMC9636485 DOI: 10.1016/j.psj.2022.102243] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 10/05/2022] [Accepted: 10/05/2022] [Indexed: 11/28/2022] Open
Abstract
The feather is an important epidermal appendage, plays an important role in the life activities of avian specie, and has important economic value. Revealing the molecular regulation mechanism of feather growth has a significant meaning in studying adaptive evolution, physiology, and mating of avian species and also provides a theoretical reference for poultry breeding. In this study, the genome-wide association analysis (GWAS) of 358 ducks was based on primary feather length phenotypic data (28-60 d), length growth rates (LGRs), and maturity scores (60 d) to explore the genetic basis affecting feather growth and maturation. The results showed that, among the primary feather 1 to 5 in ducks, the mean LGR of primary feather 2 was the fastest, with the longest length. The primary feathers in males grew and matured slightly faster than in females. The mean maturity scores of primary feather 10∼7 were higher than primary feather 1 to 3 in ducks. GWAS further showed 116 SNPs associated with feather length traits. In addition, 2 candidate regions (Chr1: 127,407,230-127,524,879 bp and Chr21: 182,061,707-183,616,298 bp) were associated with LGR, which contain total 13 candidate genes (The extremely significant SNPs were mainly located in 2 genes: Chr1: REPS2 and Chr21: PTPRT). Four candidate regions (Chr1: 29,113,036-28,675,018 bp, Chr2: 18,253,612-149,111,290 bp, Chr15: 6,489,774 to 12,138,221 bp and Chr21: 6,578,021-8,472,904 bp) were associated with feather maturity, which contain total 24 candidate genes (The extremely significant SNPs were mainly located in 4 genes: Chr1: IMMP2L, DOCK4 and DDX10, Chr2: LDLRAD4). In conclusion, sex factors influence feather growth and maturity, and the genetic basis of the growth /maturity trait between different feathers is similar. REPS2, PTPRT genes, and IMMP2L, DOCK4, DDX10, and LDLRAD4 are important candidate genes that influence feather growth and maturity, respectively.
Collapse
Affiliation(s)
- Shengchao Ma
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China,College of Animal Science, Xinjiang Agricultural University, P. R. China
| | - Pengcheng Li
- Berry Genomics Corporation, Beijing 100015, P. R. China
| | - Hehe Liu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China,Corresponding author:
| | - Yang Xi
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Qian Xu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jingjing Qi
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jianmei Wang
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Liang Li
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jiwen Wang
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Jiwei Hu
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Hua He
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Chunchun Han
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| | - Lili Bai
- Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal and Technology (Institute of Animal Genetics and Breeding), Sichuan Agricultural University, P. R. China,Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, P. R. China
| |
Collapse
|
10
|
Genomic Selection in Chinese Holsteins Using Regularized Regression Models for Feature Selection of Whole Genome Sequencing Data. Animals (Basel) 2022; 12:ani12182419. [PMID: 36139283 PMCID: PMC9495168 DOI: 10.3390/ani12182419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/09/2022] [Accepted: 09/12/2022] [Indexed: 11/30/2022] Open
Abstract
Simple Summary Genomic selection (GS) is increasingly widely used in animal breeding, owing to its high efficiency in the genetic improvement of economic traits. In China, GS has been implemented for genetic evaluation of young bulls in dairy cattle breeding programs since 2012. GS is commonly based on single nucleotide polymorphism (SNP) chips. The cost of whole genome sequencing (WGS) has decreased tremendously in recent years, allowing increased studies of WGS-based GS. In this study, based on the imputed WGS data of approximately 8000 Chinese Holsteins, we investigated the performance of GS of milk production traits using the feature selection method of regularized regression. The results showed that WGS-based GS using regularized regression models and the commonly used linear mixed models achieved comparable prediction accuracies. For milk and protein yields, GS using a combination of SNPs selected with a regularized regression model and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with a linear mixed model combined with 50K SNP chip data performed best for fat yield. The proposed method of GS based on WGS data, i.e., feature selection using regularization regression models, provides a valuable novel strategy for genomic selection. Abstract Genomic selection (GS) is an efficient method to improve genetically economic traits. Feature selection is an important method for GS based on whole-genome sequencing (WGS) data. We investigated the prediction performance of GS of milk production traits using imputed WGS data on 7957 Chinese Holsteins. We used two regularized regression models, least absolute shrinkage and selection operator (LASSO) and elastic net (EN) for feature selection. For comparison, we performed genome-wide association studies based on a linear mixed model (LMM), and the N single nucleotide polymorphisms (SNPs) with the lowest p-values were selected (LMMLASSO and LMMEN), where N was the number of non-zero effect SNPs selected by LASSO or EN. GS was conducted using a genomic best linear unbiased prediction (GBLUP) model and several sets of SNPs: (1) selected WGS SNPs; (2) 50K SNP chip data; (3) WGS data; and (4) a combined set of selected WGS SNPs and 50K SNP chip data. The results showed that the prediction accuracies of GS with features selected using LASSO or EN were comparable to those using features selected with LMMLASSO or LMMEN. For milk and protein yields, GS using a combination of SNPs selected with LASSO and 50K SNP chip data achieved the best prediction performance, and GS using SNPs selected with LMMLASSO combined with 50K SNP chip data performed best for fat yield. The proposed method, feature selection using regularization regression models, provides a valuable novel strategy for WGS-based GS.
Collapse
|
11
|
Wade AR, Duruflé H, Sanchez L, Segura V. eQTLs are key players in the integration of genomic and transcriptomic data for phenotype prediction. BMC Genomics 2022; 23:476. [PMID: 35764918 PMCID: PMC9238188 DOI: 10.1186/s12864-022-08690-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 06/11/2022] [Indexed: 11/10/2022] Open
Abstract
Background Multi-omics represent a promising link between phenotypes and genome variation. Few studies yet address their integration to understand genetic architecture and improve predictability. Results Our study used 241 poplar genotypes, phenotyped in two common gardens, with xylem and cambium RNA sequenced at one site, yielding large phenotypic, genomic (SNP), and transcriptomic datasets. Prediction models for each trait were built separately for SNPs and transcripts, and compared to a third model integrated by concatenation of both omics. The advantage of integration varied across traits and, to understand such differences, an eQTL analysis was performed to characterize the interplay between the genome and transcriptome and classify the predicting features into cis or trans relationships. A strong, significant negative correlation was found between the change in predictability and the change in predictor ranking for trans eQTLs for traits evaluated in the site of transcriptomic sampling. Conclusions Consequently, beneficial integration happens when the redundancy of predictors is decreased, likely leaving the stage to other less prominent but complementary predictors. An additional gene ontology (GO) enrichment analysis appeared to corroborate such statistical output. To our knowledge, this is a novel finding delineating a promising method to explore data integration. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08690-7.
Collapse
|
12
|
He Z, Li S, Li W, Ding J, Zheng M, Li Q, Fahey AG, Wen J, Liu R, Zhao G. Comparison of genomic prediction methods for residual feed intake in broilers. Anim Genet 2022; 53:466-469. [PMID: 35292985 DOI: 10.1111/age.13186] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 02/24/2022] [Accepted: 02/28/2022] [Indexed: 11/30/2022]
Abstract
Residual feed intake (RFI) is a measure of the feed efficiency of animals. Previous studies have identified SNPs associated with RFI. The objective of this study was to compare the GBLUP model with the GA-BLUP model including previously identified associated SNPs. The nine associated SNPs were obtained from the genome-wide association study on a discovery population as preselection information. These models were analysed using ASREML software using a 5-fold cross-validation method on a validation population. With the genetic architecture (GA) matrix used, which was conducted with the nine RFI-associated SNPs, the prediction accuracy of RFI was improved compared with the original GBLUP model. The calculated optimal ω was 0.981 for RFI, which is in line with the optimal range from 0.9 to 1.0 in the gradient test. The prediction accuracy increased by 2% in the GA-BLUP model with ω being 0.981 compared with the GBLUP model. In conclusion, the GA-BLUP with the nine RFI-associated SNPs and an optimal ω can improve the prediction accuracy for a specific trait compared with GBLUP.
Collapse
Affiliation(s)
- Zhengxiao He
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.,School of Agriculture and Food Science, University College Dublin, Dublin, Ireland
| | - Sen Li
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Wei Li
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jiqiang Ding
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Maiqing Zheng
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Qinghe Li
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Alan G Fahey
- School of Agriculture and Food Science, University College Dublin, Dublin, Ireland
| | - Jie Wen
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Ranran Liu
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Guiping Zhao
- State Key Laboratory of Animal Nutrition, Key Laboratory of Animal (Poultry) Genetics Breeding and Reproduction, Ministry of Agriculture, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| |
Collapse
|
13
|
Martini JWR, Gao N, Crossa J. Incorporating Omics Data in Genomic Prediction. Methods Mol Biol 2022; 2467:341-357. [PMID: 35451782 DOI: 10.1007/978-1-0716-2205-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this chapter, we discuss the motivation for integrating other types of omics data into genomic prediction methods. We give an overview of literature investigating the performance of omics-enhanced predictions, and highlight potential pitfalls when applying these methods in breeding. We emphasize that the statistical methods available for genomic data can be transferred to the general omics case. However, when using a framework of omic relationship matrices, the standardization of the variables may be more relevant than it is for a genomic relationship matrix based on single-nucleotide polymorphisms.
Collapse
Affiliation(s)
- Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico.
| | - Ning Gao
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico
| |
Collapse
|
14
|
Shi S, Zhang Z, Li B, Zhang S, Fang L. Incorporation of Trait-Specific Genetic Information into Genomic Prediction Models. Methods Mol Biol 2022; 2467:329-340. [PMID: 35451781 DOI: 10.1007/978-1-0716-2205-6_11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Due to the rapid development of high-throughput sequencing technology, we can easily obtain not only the genetic variants at the whole-genome sequence level (e.g., from 1000 Genomes project and 1000 Bull Genomes project), but also a wide range of functional annotations (e.g., enhancers and promoters from ENCODE, FAANG, and FarmGTEx projects) across a wide range of tissues, cell types, developmental stages, and environmental conditions. This huge amount of information leads to a revolution in studying genetics and genomics of complex traits in humans, livestock, and plant species. In this chapter, we focused on and reviewed the genomic prediction methods that incorporate external biological information into genomic prediction, such as sequence ontology, linkage disequilibrium (LD) of SNPs, quantitative trait loci (QTL), and multi-layer omics data (e.g., transcriptome, epigenome, and microbiome).
Collapse
Affiliation(s)
- Shaolei Shi
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhe Zhang
- Department of Animal Breeding and genetics, College of Animal Science, South China Agricultural University (SCAU), Guangzhou, China
| | - Bingjie Li
- The Roslin Institute Building, Scotland's Rural College, Edinburgh, UK
| | - Shengli Zhang
- College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lingzhao Fang
- MRC Human Genetics Unit at the Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, UK.
| |
Collapse
|
15
|
Hu H, Campbell MT, Yeats TH, Zheng X, Runcie DE, Covarrubias-Pazaran G, Broeckling C, Yao L, Caffe-Treml M, Gutiérrez LA, Smith KP, Tanaka J, Hoekenga OA, Sorrells ME, Gore MA, Jannink JL. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021. [PMID: 34643760 DOI: 10.25739/8p1e-0931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.
Collapse
Affiliation(s)
- Haixiao Hu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA.
| | - Malachy T Campbell
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Trevor H Yeats
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Xuying Zheng
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, USA
| | - Giovanny Covarrubias-Pazaran
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera México-Veracruz, El Batán, 56130, Texcoco, Edo. de México, México
| | - Corey Broeckling
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Linxing Yao
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Melanie Caffe-Treml
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, 57007, USA
| | - Lucı A Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - James Tanaka
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Owen A Hoekenga
- Cayuga Genetics Consulting Group LLC, Ithaca, NY, 14850, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| |
Collapse
|
16
|
Hu H, Campbell MT, Yeats TH, Zheng X, Runcie DE, Covarrubias-Pazaran G, Broeckling C, Yao L, Caffe-Treml M, Gutiérrez LA, Smith KP, Tanaka J, Hoekenga OA, Sorrells ME, Gore MA, Jannink JL. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:4043-4054. [PMID: 34643760 PMCID: PMC8580906 DOI: 10.1007/s00122-021-03946-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 09/05/2021] [Indexed: 05/26/2023]
Abstract
Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.
Collapse
Affiliation(s)
- Haixiao Hu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA.
| | - Malachy T Campbell
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Trevor H Yeats
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Xuying Zheng
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, USA
| | - Giovanny Covarrubias-Pazaran
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera México-Veracruz, El Batán, 56130, Texcoco, Edo. de México, México
| | - Corey Broeckling
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Linxing Yao
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Melanie Caffe-Treml
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, 57007, USA
| | - Lucı A Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - James Tanaka
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Owen A Hoekenga
- Cayuga Genetics Consulting Group LLC, Ithaca, NY, 14850, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| |
Collapse
|
17
|
Yang R, Xu Z, Wang Q, Zhu D, Bian C, Ren J, Huang Z, Zhu X, Tian Z, Wang Y, Jiang Z, Zhao Y, Zhang D, Li N, Hu X. Genome‑wide association study and genomic prediction for growth traits in yellow-plumage chicken using genotyping-by-sequencing. Genet Sel Evol 2021; 53:82. [PMID: 34706641 PMCID: PMC8555081 DOI: 10.1186/s12711-021-00672-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 09/08/2021] [Indexed: 12/25/2022] Open
Abstract
Background Growth traits are of great importance for poultry breeding and production and have been the topic of extensive investigation, with many quantitative trait loci (QTL) detected. However, due to their complex genetic background, few causative genes have been confirmed and the underlying molecular mechanisms remain unclear, thus limiting our understanding of QTL and their potential use for the genetic improvement of poultry. Therefore, deciphering the genetic architecture is a promising avenue for optimising genomic prediction strategies and exploiting genomic information for commercial breeding. The objectives of this study were to: (1) conduct a genome-wide association study to identify key genetic factors and explore the polygenicity of chicken growth traits; (2) investigate the efficiency of genomic prediction in broilers; and (3) evaluate genomic predictions that harness genomic features. Results We identified five significant QTL, including one on chromosome 4 with major effects and four on chromosomes 1, 2, 17, and 27 with minor effects, accounting for 14.5 to 34.1% and 0.2 to 2.6% of the genomic additive genetic variance, respectively, and 23.3 to 46.7% and 0.6 to 4.5% of the observed predictive accuracy of breeding values, respectively. Further analysis showed that the QTL with minor effects collectively had a considerable influence, reflecting the polygenicity of the genetic background. The accuracy of genomic best linear unbiased predictions (BLUP) was improved by 22.0 to 70.3% compared to that of the conventional pedigree-based BLUP model. The genomic feature BLUP model further improved the observed prediction accuracy by 13.8 to 15.2% compared to the genomic BLUP model. Conclusions A major QTL and four minor QTL were identified for growth traits; the remaining variance was due to QTL effects that were too small to be detected. The genomic BLUP and genomic feature BLUP models yielded considerably higher prediction accuracy compared to the pedigree-based BLUP model. This study revealed the polygenicity of growth traits in yellow-plumage chickens and demonstrated that the predictive ability can be greatly improved by using genomic information and related features. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00672-9.
Collapse
Affiliation(s)
- Ruifei Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.,College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Zhenqiang Xu
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Qi Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Di Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Cheng Bian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Jiangli Ren
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhuolin Huang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoning Zhu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Zhixin Tian
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yuzhe Wang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ziqin Jiang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China
| | - Yiqiang Zhao
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Dexiang Zhang
- Wen's Nanfang Poultry Breeding Co. Ltd, Yunfu, 527400, Guangdong Province, China.
| | - Ning Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Xiaoxiang Hu
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
18
|
Vu NT, Phuc TH, Oanh KTP, Sang NV, Trang TT, Nguyen NH. Accuracies of genomic predictions for disease resistance of striped catfish to Edwardsiella ictaluri using artificial intelligence algorithms. G3-GENES GENOMES GENETICS 2021; 12:6408442. [PMID: 34788431 PMCID: PMC8727988 DOI: 10.1093/g3journal/jkab361] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 10/10/2021] [Indexed: 02/04/2023]
Abstract
Assessments of genomic prediction accuracies using artificial intelligent (AI) algorithms (i.e., machine and deep learning methods) are currently not available or very limited in aquaculture species. The principal aim of this study was to examine the predictive performance of these new methods for disease resistance to Edwardsiella ictaluri in a population of striped catfish Pangasianodon hypophthalmus and to make comparisons with four common methods, i.e., pedigree-based best linear unbiased prediction (PBLUP), genomic-based best linear unbiased prediction (GBLUP), single-step GBLUP (ssGBLUP) and a nonlinear Bayesian approach (notably BayesR). Our analyses using machine learning (i.e., ML-KAML) and deep learning (i.e., DL-MLP and DL-CNN) together with the four common methods (PBLUP, GBLUP, ssGBLUP, and BayesR) were conducted for two main disease resistance traits (i.e., survival status coded as 0 and 1 and survival time, i.e., days that the animals were still alive after the challenge test) in a pedigree consisting of 560 individual animals (490 offspring and 70 parents) genotyped for 14,154 single nucleotide polymorphism (SNPs). The results using 6,470 SNPs after quality control showed that machine learning methods outperformed PBLUP, GBLUP, and ssGBLUP, with the increases in the prediction accuracies for both traits by 9.1–15.4%. However, the prediction accuracies obtained from machine learning methods were comparable to those estimated using BayesR. Imputation of missing genotypes using AlphaFamImpute increased the prediction accuracies by 5.3–19.2% in all the methods and data used. On the other hand, there were insignificant decreases (0.3–5.6%) in the prediction accuracies for both survival status and survival time when multivariate models were used in comparison to univariate analyses. Interestingly, the genomic prediction accuracies based on only highly significant SNPs (P < 0.00001, 318–400 SNPs for survival status and 1,362–1,589 SNPs for survival time) were somewhat lower (0.3–15.6%) than those obtained from the whole set of 6,470 SNPs. In most of our analyses, the accuracies of genomic prediction were somewhat higher for survival time than survival status (0/1 data). It is concluded that although there are prospects for the application of genomic selection to increase disease resistance to E. ictaluri in striped catfish breeding programs, further evaluation of these methods should be made in independent families/populations when more data are accumulated in future generations to avoid possible biases in the genetic parameters estimates and prediction accuracies for the disease-resistant traits studied in this population of striped catfish P. hypophthalmus.
Collapse
Affiliation(s)
- Nguyen Thanh Vu
- School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD, Australia.,Genecology Research Center, University of the Sunshine Coast, Sippy Downs, QLD, Australia.,Research Institute for Aquaculture No.2, Ho Chi Minh 710000, Vietnam
| | - Tran Huu Phuc
- Research Institute for Aquaculture No.2, Ho Chi Minh 710000, Vietnam
| | - Kim Thi Phuong Oanh
- Institute of Genome Research, Vietnam Academy of Science and Technology, Hanoi, Vietnam
| | - Nguyen Van Sang
- Research Institute for Aquaculture No.2, Ho Chi Minh 710000, Vietnam
| | - Trinh Thi Trang
- School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD, Australia.,Genecology Research Center, University of the Sunshine Coast, Sippy Downs, QLD, Australia.,Vietnam National University of Agriculture, Gia Lam 131000, Vietnam
| | - Nguyen Hong Nguyen
- School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD, Australia.,Genecology Research Center, University of the Sunshine Coast, Sippy Downs, QLD, Australia
| |
Collapse
|