1
|
Zhang Z, Zhao W, Wang Z, Pan Y, Wang Q, Zhang Z. Integration of ssGWAS and ROH analyses for uncovering genetic variants associated with reproduction traits in Large White pigs. Anim Genet 2024. [PMID: 39129705 DOI: 10.1111/age.13465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 05/26/2024] [Accepted: 07/05/2024] [Indexed: 08/13/2024]
Abstract
The low heritability of reproduction traits such as total number born (TNB), number born alive (NBA) and adjusted litter weight until 21 days at weaning (ALW) poses a challenge for genetic improvement. In this study, we aimed to identify genetic variants that influence these traits and evaluate the accuracy of genomic selection (GS) using these variants as genomic features. We performed single-step genome-wide association studies (ssGWAS) on 17 823 Large White (LW) pigs, of which 2770 were genotyped by 50K single nucleotide polymorphism (SNP) chips. Additionally, we analyzed runs of homozygosity (ROH) in the population and tested their effects on the traits. The genomic feature best linear unbiased prediction (GFBLUP) was then carried out in an independent population of 350 LW pigs using identified trait-related SNP subsets as genomic features. As a result, our findings identified five, one and four SNP windows that explaining more than 1% of genetic variance for ALW, TNB, and NBA, respectively and discovered 358 hotspots and nine ROH islands. The ROH SSC1:21814570-27186456 and SSC11:7220366-14276394 were found to be significantly associated with ALW and NBA, respectively. We assessed the genomic estimated breeding value accuracy through 20 replicates of five-fold cross-validation. Our findings demonstrate that GFBLUP, incorporating SNPs located in effective ROH (p-value < 0.05) as genomic features, might enhance GS accuracy for ALW compared with GBLUP. Additionally, using SNPs explaining more than 0.1% of the genetic variance in ssGWAS for NBA as genomic features might improve the GS accuracy, too. However, it is important to note that the incorporation of inappropriate genomic features can significantly reduce GS accuracy. In conclusion, our findings provide valuable insights into the genetic mechanisms of reproductive traits in pigs and suggest that the ssGWAS and ROH have the potential to enhance the accuracy of GS for reproductive traits in LW pigs.
Collapse
Affiliation(s)
- Zhenyang Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Wei Zhao
- SciGene Biotechnology Co. Ltd, Hefei, China
| | - Zhen Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| | - Yuchun Pan
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
- Hainan Institute, Zhejiang University, Sanya, China
| | - Qishan Wang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
- Hainan Institute, Zhejiang University, Sanya, China
| | - Zhe Zhang
- Department of Animal Science, College of Animal Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
2
|
Vourlaki IT, Ramos-Onsins SE, Pérez-Enciso M, Castanera R. Evaluation of deep learning for predicting rice traits using structural and single-nucleotide genomic variants. PLANT METHODS 2024; 20:121. [PMID: 39127715 DOI: 10.1186/s13007-024-01250-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 07/28/2024] [Indexed: 08/12/2024]
Abstract
BACKGROUND Structural genomic variants (SVs) are prevalent in plant genomes and have played an important role in evolution and domestication, as they constitute a significant source of genomic and phenotypic variability. Nevertheless, most methods in quantitative genetics focusing on crop improvement, such as genomic prediction, consider only Single Nucleotide Polymorphisms (SNPs). Deep Learning (DL) is a promising strategy for genomic prediction, but its performance using SVs and SNPs as genetic markers remains unknown. RESULTS We used rice to investigate whether combining SVs and SNPs can result in better trait prediction over SNPs alone and examine the potential advantage of Deep Learning (DL) networks over Bayesian Linear models. Specifically, the performances of BayesC (considering additive effects) and a Bayesian Reproducible Kernel Hilbert space (RKHS) regression (considering both additive and non-additive effects) were compared to those of two different DL architectures, the Multilayer Perceptron, and the Convolution Neural Network, to explore their prediction ability by using various marker input strategies. We found that exploiting structural and nucleotide variation slightly improved prediction ability on complex traits in 87% of the cases. DL models outperformed Bayesian models in 75% of the studied cases, considering the four traits and the two validation strategies used. Finally, DL systematically improved prediction ability of binary traits against the Bayesian models. CONCLUSIONS Our study reveals that the use of structural genomic variants can improve trait prediction in rice, independently of the methodology used. Also, our results suggest that Deep Learning (DL) networks can perform better than Bayesian models in the prediction of binary traits, and in quantitative traits when the training and target sets are not closely related. This highlights the potential of DL to enhance crop improvement in specific scenarios and the importance to consider SVs in addition to SNPs in genomic selection.
Collapse
Affiliation(s)
- Ioanna-Theoni Vourlaki
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193, Barcelona, Spain.
- IRTA (Institut de Recerca i Tecnologia Agroalimentàries), Caldes de Montbui, 08140, Barcelona, Spain.
| | - Sebastián E Ramos-Onsins
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193, Barcelona, Spain
| | - Miguel Pérez-Enciso
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193, Barcelona, Spain
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain
- Universitat Autónoma de Barcelona, 08193, Barcelona, Spain
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, Edifici CRAG, Bellaterra, 08193, Barcelona, Spain.
- IRTA (Institut de Recerca i Tecnologia Agroalimentàries), Caldes de Montbui, 08140, Barcelona, Spain.
| |
Collapse
|
3
|
Pocrnic I, Lourenco D, Misztal I. Single nucleotide polymorphism profile for quantitative trait nucleotide in populations with small effective size and its impact on mapping and genomic predictions. Genetics 2024; 227:iyae103. [PMID: 38913695 PMCID: PMC11304960 DOI: 10.1093/genetics/iyae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/07/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024] Open
Abstract
Increasing SNP density by incorporating sequence information only marginally increases prediction accuracies of breeding values in livestock. To find out why, we used statistical models and simulations to investigate the shape of distribution of estimated SNP effects (a profile) around quantitative trait nucleotides (QTNs) in populations with a small effective population size (Ne). A QTN profile created by averaging SNP effects around each QTN was similar to the shape of expected pairwise linkage disequilibrium (PLD) based on Ne and genetic distance between SNP, with a distinct peak for the QTN. Populations with smaller Ne showed lower but wider QTN profiles. However, adding more genotyped individuals with phenotypes dragged the profile closer to the QTN. The QTN profile was higher and narrower for populations with larger compared to smaller Ne. Assuming the PLD curve for the QTN profile, 80% of the additive genetic variance explained by each QTN was contained in ± 1/Ne Morgan interval around the QTN, corresponding to 2 Mb in cattle and 5 Mb in pigs and chickens. With such large intervals, identifying QTN is difficult even if all of them are in the data and the assumed genetic architecture is simplistic. Additional complexity in QTN detection arises from confounding of QTN profiles with signals due to relationships, overlapping profiles with closely spaced QTN, and spurious signals. However, small Ne allows for accurate predictions with large data even without QTN identification because QTNs are accounted for by QTN profiles if SNP density is sufficient to saturate the segments.
Collapse
Affiliation(s)
- Ivan Pocrnic
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
4
|
Kasper C. Animal board invited review: Heritability of nitrogen use efficiency in fattening pigs: Current state and possible directions. Animal 2024; 18:101225. [PMID: 39013333 DOI: 10.1016/j.animal.2024.101225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 06/17/2024] [Accepted: 06/17/2024] [Indexed: 07/18/2024] Open
Abstract
Pork, an important component of human nutrition worldwide, contributes considerably to anthropogenic nitrogen and greenhouse gas emissions. Reducing the environmental impact of pig production is therefore essential. This can be achieved through system-level strategies, such as optimising resource use, improving manure management and recycling leftovers from human food production, and at the individual animal level by maintaining pig health and fine-tuning dietary protein levels to individual requirements. Breeding, coupled with nutritional strategies, offers a lasting solution to improve nitrogen use efficiency (NUE) - the ratio of nitrogen retained in the body to nitrogen ingested. With a heritability as high as 0.54, incorporating NUE into breeding programmes appears promising. Nitrogen use efficiency involves multiple tissues and metabolic processes, and is influenced by the environment and individual animal characteristics, including its genetic background. Heritable genetic variation in NUE may therefore occur in many different processes, including the central nervous regulation of feed intake, the endocrine system, the gastrointestinal tract where digestion and absorption take place, and the composition of the gut microbiome. An animal's postabsorptive protein metabolism might also harbour important genetic variation, especially in the maintenance requirements of tissues and organs. Precise phenotyping, although challenging and costly, is essential for successful breeding. Various measurement techniques, such as imaging techniques and mechanistic models, are being explored for their potential in genetic analysis. Despite the difficulties in phenotyping, some studies have estimated the heritability and genetic correlations of NUE. These studies suggest that direct selection for NUE is more effective than indirect methods through feed efficiency. The complexity of NUE indicates a polygenic trait architecture, which has been confirmed by genome-wide association studies that have been unable to identify significant quantitative trait loci. Building sufficiently large reference populations to train genomic prediction models is an important next step. However, this will require the development of truly high-throughput phenotyping methods. In conclusion, breeding pigs with higher NUE is both feasible and necessary but will require increased efforts in high-throughput phenotyping and improved genome annotation.
Collapse
Affiliation(s)
- C Kasper
- Animal GenoPhenomics, Agroscope, Posieux, Switzerland.
| |
Collapse
|
5
|
Rajawat D, Ghildiyal K, Sonejita Nayak S, Sharma A, Parida S, Kumar S, Ghosh AK, Singh U, Sivalingam J, Bhushan B, Dutt T, Panigrahi M. Genome-wide mining of diversity and evolutionary signatures revealed selective hotspots in Indian Sahiwal cattle. Gene 2024; 901:148178. [PMID: 38242377 DOI: 10.1016/j.gene.2024.148178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 01/10/2024] [Accepted: 01/16/2024] [Indexed: 01/21/2024]
Abstract
The Sahiwal cattle breed is the best indigenous dairy cattle breed, and it plays a pivotal role in the Indian dairy industry. This is due to its exceptional milk-producing potential, adaptability to local tropical conditions, and its resilience to ticks and diseases. The study aimed to identify selective sweeps and estimate intrapopulation genetic diversity parameters in Sahiwal cattle using ddRAD sequencing-based genotyping data from 82 individuals. After applying filtering criteria, 78,193 high-quality SNPs remained for further analysis. The population exhibited an average minor allele frequency of 0.221 ± 0.119. Genetic diversity metrics, including observed (0.597 ± 0.196) and expected heterozygosity (0.433 ± 0.096), nucleotide diversity (0.327 ± 0.114), the proportion of polymorphic SNPs (0.726), and allelic richness (1.323 ± 0.134), indicated ample genomic diversity within the breed. Furthermore, an effective population size of 74 was observed in the most recent generation. The overall mean linkage disequilibrium (r2) for pairwise SNPs was 0.269 ± 0.057. Moreover, a greater proportion of short Runs of Homozygosity (ROH) segments were observed suggesting that there may be low levels of recent inbreeding in this population. The genomic inbreeding coefficients, computed using different inbreeding estimates (FHOM, FUNI, FROH, and FGROM), ranged from -0.0289 to 0.0725. Subsequently, we found 146 regions undergoing selective sweeps using five distinct statistical tests: Tajima's D, CLR, |iHS|, |iHH12|, and ROH. These regions, located in non-overlapping 500 kb windows, were mapped and revealed various protein-coding genes associated with enhanced immune systems and disease resistance (IFNL3, IRF8, BLK), as well as production traits (NRXN1, PLCE1, GHR). Notably, we identified interleukin 2 (IL2) on Chr17: 35217075-35223276 as a gene linked to tick resistance and uncovered a cluster of genes (HSPA8, UBASH3B, ADAMTS18, CRTAM) associated with heat stress. These findings indicate the evolutionary impact of natural and artificial selection on the environmental adaptation of the Sahiwal cattle population.
Collapse
Affiliation(s)
- Divya Rajawat
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Kanika Ghildiyal
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Sonali Sonejita Nayak
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Anurodh Sharma
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Subhashree Parida
- Pharmacology & Toxicology Division, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Shive Kumar
- Department of Animal Genetics and Breeding, Govind Ballabh Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India
| | - A K Ghosh
- Department of Animal Genetics and Breeding, Govind Ballabh Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, India
| | - Umesh Singh
- ICAR Central Institute for Research on Cattle, Meerut, UP, India
| | | | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- Livestock Production and Management Section, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| |
Collapse
|
6
|
Liu Y, Zhang Y, Zhou F, Yao Z, Zhan Y, Fan Z, Meng X, Zhang Z, Liu L, Yang J, Wu Z, Cai G, Zheng E. Increased Accuracy of Genomic Prediction Using Preselected SNPs from GWAS with Imputed Whole-Genome Sequence Data in Pigs. Animals (Basel) 2023; 13:3871. [PMID: 38136908 PMCID: PMC10740755 DOI: 10.3390/ani13243871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 12/13/2023] [Accepted: 12/14/2023] [Indexed: 12/24/2023] Open
Abstract
Enhancing the accuracy of genomic prediction is a key goal in genomic selection (GS) research. Integrating prior biological information into GS methods using appropriate models can improve prediction accuracy for complex traits. Genome-wide association study (GWAS) is widely utilized to identify potential candidate loci associated with complex traits in livestock and poultry, offering essential genomic insights. In this study, a GWAS was conducted on 685 Duroc × Landrace × Yorkshire (DLY) pigs to extract significant single-nucleotide polymorphisms (SNPs) as genomic features. We compared two GS models, genomic best linear unbiased prediction (GBLUP) and genomic feature BLUP (GFBLUP), by using imputed whole-genome sequencing (WGS) data on 651 Yorkshire pigs. The results revealed that the GBLUP model achieved prediction accuracies of 0.499 for backfat thickness (BFT) and 0.423 for loin muscle area (LMA). By applying the GFBLUP model with GWAS-based SNP preselection, the average prediction accuracies for BFT and LMA traits reached 0.491 and 0.440, respectively. Specifically, the GFBLUP model displayed a 4.8% enhancement in predicting LMA compared to the GBLUP model. These findings suggest that, in certain scenarios, the GFBLUP model may offer superior genomic prediction accuracy when compared to the GBLUP model, underscoring the potential value of incorporating genomic features to refine GS models.
Collapse
Affiliation(s)
- Yiyi Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuling Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fuchen Zhou
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zekai Yao
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Yuexin Zhan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfei Fan
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Xianglun Meng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zebin Zhang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Langqing Liu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Jie Yang
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
- Guangdong Zhongxin Breeding Technology Co., Ltd., Guangzhou 510642, China
| | - Enqin Zheng
- National Engineering Research Center for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou 510642, China; (Y.L.); (Y.Z.); (F.Z.); (Z.Y.); (Y.Z.); (Z.F.); (X.M.); (Z.Z.); (L.L.); (J.Y.); (Z.W.)
- Guangdong Provincial Key Laboratory of Agro-Animal Genomics and Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
7
|
Rios ACH, Nasner SLC, Londoño-Gil M, Gonzalez-Herrera LG, Lopez-Herrera A, Flórez JCR. Genome-wide association study for reproduction traits in Colombian Creole Blanco Orejinegro cattle. Trop Anim Health Prod 2023; 55:429. [PMID: 38044379 DOI: 10.1007/s11250-023-03847-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 11/27/2023] [Indexed: 12/05/2023]
Abstract
The profitability of the beef cattle production system relies heavily on reproductive traits. Unfortunately, certain traits, such as age at first calving (AFC), calving interval (CI), and gestation length (GL), can pose challenges in traditional breeding programs because of their low heritability (0.01-0.12) and sex-limited characteristics. Another important aspect is the conservation of the genetic resources of animals adapted to the Colombian regions, which implies the preservation and rational use of the creole breeds in the country market. Therefore, this study aimed to identify genomic regions in the creole cattle breed Blanco Orejinegro (BON) that influence the reproductive traits in females. The dataset comprised 439 animals and 118,116 single-nucleotide polymorphisms' (SNPs) markers. The GS3 program was used to identify the SNP effects employing the BAYES Cπ methodology. The number of SNPs with effect for AFC was 25, 1527 for CI, and 23 for GL. Some of the genes found associated with reproductive and growth traits as well as immune response and environmental adaptation ECE1, EPH, EPHB2, SMARCAL1, IGFBP5, IGFBP2, FCGRT, EGFR, MUL1, PINK1, STPG1, CNGB1, TGFB1, OXTR, IL22RA1, MYOM3, OXTR, CNR2, HIVEP3, CTPS1, CXCL8, FCGRT, MREG, TMEM169, PECR, and MC1R. Our results evidenced a high contribution of the genetic architecture of the Colombian creole cattle breed Blanco Orejinegro that may impact should be included in implementing genetic improvement and conservation programs.
Collapse
Affiliation(s)
- Ana Cristina Herrera Rios
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Medellín, Carrera 65 N 59A-110, 050034, Medellín, Colombia.
- Grupo de Investigación Nutri-Solla, SOLLA S.A., Cra 42 #33-80, Itagüí, Antioquia, Colombia.
| | - Sindy Liliana Caivio Nasner
- Grupo de Investigación Biomolecular y Pecuaria BIOPEC, Universidad Tecnológica de Pereira, Cra. 27 N10-02, 660003, Pereira, Risaralda, Colombia
| | - Marisol Londoño-Gil
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Medellín, Carrera 65 N 59A-110, 050034, Medellín, Colombia
| | - Luis Gabriel Gonzalez-Herrera
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Medellín, Carrera 65 N 59A-110, 050034, Medellín, Colombia
| | - Albeiro Lopez-Herrera
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Medellín, Carrera 65 N 59A-110, 050034, Medellín, Colombia
| | - Juan Carlos Rincón Flórez
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Medellín, Carrera 65 N 59A-110, 050034, Medellín, Colombia
- Grupo de Investigación Biodiversidad y Genética Molecular (BIOGEM), Universidad Nacional de Colombia Sede Palmira, Carrera 32 N 12 - 00, PC 763352, Palmira, Colombia
| |
Collapse
|
8
|
Ghildiyal K, Nayak SS, Rajawat D, Sharma A, Chhotaray S, Bhushan B, Dutt T, Panigrahi M. Genomic insights into the conservation of wild and domestic animal diversity: A review. Gene 2023; 886:147719. [PMID: 37597708 DOI: 10.1016/j.gene.2023.147719] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/20/2023] [Accepted: 08/16/2023] [Indexed: 08/21/2023]
Abstract
Due to environmental change and anthropogenic activities, global biodiversity has suffered an unprecedented loss, and the world is now heading toward the sixth mass extinction event. This urges the need to step up our efforts to promote the sustainable use of animal genetic resources and plan effective strategies for their conservation. Although habitat preservation and restoration are the primary means of conserving biodiversity, genomic technologies offer a variety of novel tools for identifying biodiversity hotspots and thus, support conservation efforts. Conservation genomics is a broad area of science that encompasses the application of genomic data from thousands or tens of thousands of genome-wide markers to address important conservation biology concerns. Genomic approaches have revolutionized the way we understand and manage animal populations, providing tools to identify and preserve unique genetic variants and alleles responsible for adaptive genetic variation, reducing the deleterious consequences of inbreeding, and increasing the adaptive potential of threatened species. The advancement of genomic technologies, particularly comparative genomic approaches, and the increased accessibility of genomic resources in the form of genome-enabled taxa for non-model organisms, provides a distinct advantage in defining conservation units over traditional genetics approaches. The objective of this review is to provide an exhaustive overview of the concept of conservation genomics, discuss the rationale behind the transition from conservation genetics to genomic approaches, and emphasize the potential applications of genomic techniques for conservation purposes. We also highlight interesting case studies in both livestock and wildlife species where genomic techniques have been used to accomplish conservation goals. Finally, we address some challenges and future perspectives in this field.
Collapse
Affiliation(s)
- Kanika Ghildiyal
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Sonali Sonejita Nayak
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Divya Rajawat
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Anurodh Sharma
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Supriya Chhotaray
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- Livestock Production and Management Section, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| |
Collapse
|
9
|
Id-Lahoucine S, Cánovas A, Legarra A, Casellas J. Transmission ratio distortion regions in the context of genomic evaluation and their effects on reproductive traits in cattle. J Dairy Sci 2023; 106:7786-7798. [PMID: 37210358 DOI: 10.3168/jds.2022-23062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 04/19/2023] [Indexed: 05/22/2023]
Abstract
Transmission ratio distortion (TRD), which is a deviation from Mendelian expectations, has been associated with basic mechanisms of life such as sperm and ova fertility and viability at developmental stages of the reproductive cycle. In this study different models including TRD regions were tested for different reproductive traits [days from first service to conception (FSTC), number of services, first service nonreturn rate (NRR), and stillbirth (SB)]. Thus, in addition to a basic model with systematic and random effects, including genetic effects modeled through a genomic relationship matrix, we developed 2 additional models, including a second genomic relationship matrix based on TRD regions, and TRD regions as a random effect assuming heterogeneous variances. The analyses were performed with 10,623 cows and 1,520 bulls genotyped for 47,910 SNPs, 590 TRD regions, and several records ranging from 9,587 (FSTC) to 19,667 (SB). The results of this study showed the ability of TRD regions to capture some additional genetic variance for some traits; however, this did not translate into higher accuracy for genomic prediction. This could be explained by the nature of TRD itself, which may arise in different stages of the reproductive cycle. Nevertheless, important effects of TRD regions were found on SB (31 regions) and NRR (18 regions) when comparing at-risk versus control matings, especially for regions with allelic TRD pattern. Particularly for NRR, the probability of observing nonpregnant cow increases by up to 27% for specific TRD regions, and the probability of observing stillbirth increased by up to 254%. These results support the relevance of several TRD regions on some reproductive traits, especially those with allelic patterns that have not received as much attention as recessive TRD patterns.
Collapse
Affiliation(s)
- S Id-Lahoucine
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph N1G 2W1, ON, Canada
| | - A Cánovas
- Centre for Genetic Improvement of Livestock, Department of Animal Biosciences, University of Guelph, Guelph N1G 2W1, ON, Canada.
| | - A Legarra
- INRAE, UR631 SAGA, BP 52627, 32326 Castanet-Tolosan, France
| | - J Casellas
- Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra 08193, Barcelona, Spain
| |
Collapse
|
10
|
Zhu D, Zhao Y, Zhang R, Wu H, Cai G, Wu Z, Wang Y, Hu X. Genomic prediction based on selective linkage disequilibrium pruning of low-coverage whole-genome sequence variants in a pure Duroc population. Genet Sel Evol 2023; 55:72. [PMID: 37853325 PMCID: PMC10583454 DOI: 10.1186/s12711-023-00843-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 09/14/2023] [Indexed: 10/20/2023] Open
Abstract
BACKGROUND Although the accumulation of whole-genome sequencing (WGS) data has accelerated the identification of mutations underlying complex traits, its impact on the accuracy of genomic predictions is limited. Reliable genotyping data and pre-selected beneficial loci can be used to improve prediction accuracy. Previously, we reported a low-coverage sequencing genotyping method that yielded 11.3 million highly accurate single-nucleotide polymorphisms (SNPs) in pigs. Here, we introduce a method termed selective linkage disequilibrium pruning (SLDP), which refines the set of SNPs that show a large gain during prediction of complex traits using whole-genome SNP data. RESULTS We used the SLDP method to identify and select markers among millions of SNPs based on genome-wide association study (GWAS) prior information. We evaluated the performance of SLDP with respect to three real traits and six simulated traits with varying genetic architectures using two representative models (genomic best linear unbiased prediction and BayesR) on samples from 3579 Duroc boars. SLDP was determined by testing 180 combinations of two core parameters (GWAS P-value thresholds and linkage disequilibrium r2). The parameters for each trait were optimized in the training population by five fold cross-validation and then tested in the validation population. Similar to previous GWAS prior-based methods, the performance of SLDP was mainly affected by the genetic architecture of the traits analyzed. Specifically, SLDP performed better for traits controlled by major quantitative trait loci (QTL) or a small number of quantitative trait nucleotides (QTN). Compared with two commercial SNP chips, genotyping-by-sequencing data, and an unselected whole-genome SNP panel, the SLDP strategy led to significant improvements in prediction accuracy, which ranged from 0.84 to 3.22% for real traits controlled by major or moderate QTL and from 1.23 to 11.47% for simulated traits controlled by a small number of QTN. CONCLUSIONS The SLDP marker selection method can be incorporated into mainstream prediction models to yield accuracy improvements for traits with a relatively simple genetic architecture, however, it has no significant advantage for traits not controlled by major QTL. The main factors that affect its performance are the genetic architecture of traits and the reliability of GWAS prior information. Our findings can facilitate the application of WGS-based genomic selection.
Collapse
Affiliation(s)
- Di Zhu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Yiqiang Zhao
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Ran Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Hanyu Wu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China
| | - Gengyuan Cai
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, China
| | - Zhenfang Wu
- National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangdong, China.
| | - Yuzhe Wang
- National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, China.
| | - Xiaoxiang Hu
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, China.
| |
Collapse
|
11
|
Della Coletta R, Fernandes SB, Monnahan PJ, Mikel MA, Bohn MO, Lipka AE, Hirsch CN. Importance of genetic architecture in marker selection decisions for genomic prediction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:220. [PMID: 37819415 DOI: 10.1007/s00122-023-04469-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 09/25/2023] [Indexed: 10/13/2023]
Abstract
KEY MESSAGE We demonstrate potential for improved multi-environment genomic prediction accuracy using structural variant markers. However, the degree of observed improvement is highly dependent on the genetic architecture of the trait. Breeders commonly use genetic markers to predict the performance of untested individuals as a way to improve the efficiency of breeding programs. These genomic prediction models have almost exclusively used single nucleotide polymorphisms (SNPs) as their source of genetic information, even though other types of markers exist, such as structural variants (SVs). Given that SVs are associated with environmental adaptation and not all of them are in linkage disequilibrium to SNPs, SVs have the potential to bring additional information to multi-environment prediction models that are not captured by SNPs alone. Here, we evaluated different marker types (SNPs and/or SVs) on prediction accuracy across a range of genetic architectures for simulated traits across multiple environments. Our results show that SVs can improve prediction accuracy, but it is highly dependent on the genetic architecture of the trait and the relative gain in accuracy is minimal. When SVs are the only causative variant type, 70% of the time SV predictors outperform SNP predictors. However, the improvement in accuracy in these instances is only 1.5% on average. Further simulations with predictors in varying degrees of LD with causative variants of different types (e.g., SNPs, SVs, SNPs and SVs) showed that prediction accuracy increased as linkage disequilibrium between causative variants and predictors increased regardless of the marker type. This study demonstrates that knowing the genetic architecture of a trait in deciding what markers to use in large-scale genomic prediction modeling in a breeding program is more important than what types of markers to use.
Collapse
Affiliation(s)
- Rafael Della Coletta
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Samuel B Fernandes
- Department of Crop, Soil and Environmental Sciences at University of Arkansas, Fayetteville, AR, 72701, USA
| | - Patrick J Monnahan
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Mark A Mikel
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Roy J. Carver Biotechnology Center, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Martin O Bohn
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Alexander E Lipka
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA.
| |
Collapse
|
12
|
Jang S, Tsuruta S, Leite NG, Misztal I, Lourenco D. Dimensionality of genomic information and its impact on genome-wide associations and variant selection for genomic prediction: a simulation study. Genet Sel Evol 2023; 55:49. [PMID: 37460964 DOI: 10.1186/s12711-023-00823-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Identifying true positive variants in genome-wide associations (GWA) depends on several factors, including the number of genotyped individuals. The limited dimensionality of genomic information may give insights into the optimal number of individuals to be used in GWA. This study investigated different discovery set sizes based on the number of largest eigenvalues explaining a certain proportion of variance in the genomic relationship matrix (G). In addition, we investigated the impact on the prediction accuracy by adding variants, which were selected based on different set sizes, to the regular single nucleotide polymorphism (SNP) chips used for genomic prediction. METHODS We simulated sequence data that included 500k SNPs with 200 or 2000 quantitative trait nucleotides (QTN). A regular 50k panel included one in every ten simulated SNPs. Effective population size (Ne) was set to 20 or 200. GWA were performed using a number of genotyped animals equivalent to the number of largest eigenvalues of G (EIG) explaining 50, 60, 70, 80, 90, 95, 98, and 99% of the variance. In addition, the largest discovery set consisted of 30k genotyped animals. Limited or extensive phenotypic information was mimicked by changing the trait heritability. Significant and large-effect size SNPs were added to the 50k panel and used for single-step genomic best linear unbiased prediction (ssGBLUP). RESULTS Using a number of genotyped animals corresponding to at least EIG98 allowed the identification of QTN with the largest effect sizes when Ne was large. Populations with smaller Ne required more than EIG98. Furthermore, including genotyped animals with a higher reliability (i.e., a higher trait heritability) improved the identification of the most informative QTN. Prediction accuracy was highest when the significant or the large-effect SNPs representing twice the number of simulated QTN were added to the 50k panel. CONCLUSIONS Accurately identifying causative variants from sequence data depends on the effective population size and, therefore, on the dimensionality of genomic information. This dimensionality can help identify the most suitable sample size for GWA and could be considered for variant selection, especially when resources are restricted. Even when variants are accurately identified, their inclusion in prediction models has limited benefits.
Collapse
Affiliation(s)
- Sungbong Jang
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA.
| | - Shogo Tsuruta
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Natalia Galoro Leite
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Ignacy Misztal
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| | - Daniela Lourenco
- Department of Animal and Dairy Science, University of Georgia, Athens, GA, 30602, USA
| |
Collapse
|
13
|
Zhuang Z, Wu J, Qiu Y, Ruan D, Ding R, Xu C, Zhou S, Zhang Y, Liu Y, Ma F, Yang J, Sun Y, Zheng E, Yang M, Cai G, Yang J, Wu Z. Improving the accuracy of genomic prediction for meat quality traits using whole genome sequence data in pigs. J Anim Sci Biotechnol 2023; 14:67. [PMID: 37161604 PMCID: PMC10170792 DOI: 10.1186/s40104-023-00863-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/05/2023] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Pork quality can directly affect customer purchase tendency and meat quality traits have become valuable in modern pork production. However, genetic improvement has been slow due to high phenotyping costs. In this study, whole genome sequence (WGS) data was used to evaluate the prediction accuracy of genomic best linear unbiased prediction (GBLUP) for meat quality in large-scale crossbred commercial pigs. RESULTS We produced WGS data (18,695,907 SNPs and 2,106,902 INDELs exceed quality control) from 1,469 sequenced Duroc × (Landrace × Yorkshire) pigs and developed a reference panel for meat quality including meat color score, marbling score, L* (lightness), a* (redness), and b* (yellowness) of genomic prediction. The prediction accuracy was defined as the Pearson correlation coefficient between adjusted phenotypes and genomic estimated breeding values in the validation population. Using different marker density panels derived from WGS data, accuracy differed substantially among meat quality traits, varied from 0.08 to 0.47. Results showed that MultiBLUP outperform GBLUP and yielded accuracy increases ranging from 17.39% to 75%. We optimized the marker density and found medium- and high-density marker panels are beneficial for the estimation of heritability for meat quality. Moreover, we conducted genotype imputation from 50K chip to WGS level in the same population and found average concordance rate to exceed 95% and r2 = 0.81. CONCLUSIONS Overall, estimation of heritability for meat quality traits can benefit from the use of WGS data. This study showed the superiority of using WGS data to genetically improve pork quality in genomic prediction.
Collapse
Affiliation(s)
- Zhanwei Zhuang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yibin Qiu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Donglin Ruan
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Rongrong Ding
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Cineng Xu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Shenping Zhou
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yuling Zhang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Yiyi Liu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Fucai Ma
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jifei Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ying Sun
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Enqin Zheng
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Ming Yang
- College of Animal Science and Technology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, China
| | - Gengyuan Cai
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China
| | - Jie Yang
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
| | - Zhenfang Wu
- College of Animal Science and National Engineering Research Center for Breeding Swine Industry, South China Agricultural University, Guangzhou, 510642, China.
- Guangdong Provincial Key Laboratory of Agro-animal Genomics and Molecular Breeding, Guangzhou, 510642, China.
- Yunfu Subcenter of Guangdong Laboratory for Lingnan Modern Agriculture, Yunfu, 527400, China.
| |
Collapse
|
14
|
Sánchez-Roncancio C, García B, Gallardo-Hidalgo J, Yáñez JM. GWAS on Imputed Whole-Genome Sequence Variants Reveal Genes Associated with Resistance to Piscirickettsia salmonis in Rainbow Trout ( Oncorhynchus mykiss). Genes (Basel) 2022; 14:114. [PMID: 36672855 PMCID: PMC9859203 DOI: 10.3390/genes14010114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 12/31/2022] Open
Abstract
Genome-wide association studies (GWAS) allow the identification of associations between genetic variants and important phenotypes in domestic animals, including disease-resistance traits. Whole Genome Sequencing (WGS) data can help increase the resolution and statistical power of association mapping. Here, we conduced GWAS to asses he facultative intracellular bacterium Piscirickettsia salmonis, which affects farmed rainbow trout, Oncorhynchus mykiss, in Chile using imputed genotypes at the sequence level and searched for candidate genes located in genomic regions associated with the trait. A total of 2130 rainbow trout were intraperitoneally challenged with P. salmonis under controlled conditions and genotyped using a 57K single nucleotide polymorphism (SNP) panel. Genotype imputation was performed in all the genotyped animals using WGS data from 102 individuals. A total of 488,979 imputed WGS variants were available in the 2130 individuals after quality control. GWAS revealed genome-wide significant quantitative trait loci (QTL) in Omy02, Omy03, Omy25, Omy26 and Omy27 for time to death and in Omy26 for binary survival. Twenty-four (24) candidate genes associated with P. salmonis resistance were identified, which were mainly related to phagocytosis, innate immune response, inflammation, oxidative response, lipid metabolism and apoptotic process. Our results provide further knowledge on the genetic variants and genes associated with resistance to intracellular bacterial infection in rainbow trout.
Collapse
Affiliation(s)
- Charles Sánchez-Roncancio
- Doctorado en Acuicultura, Programa Cooperativo: Universidad de Chile. Universidad Católica del Norte. Pontificia Universidad Católica de Valparaíso, Chile
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
| | - Baltasar García
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
| | - Jousepth Gallardo-Hidalgo
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
| | - José M. Yáñez
- Center for Research and Innovation in Aquaculture (CRIA), Universidad de Chile, Santiago 8820808, Chile
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, La Pintana, Santiago 8820808, Chile
- Núcleo Milenio de Salmonidos Invasores Australes (INVASAL), Concepcion 4030000, Chile
| |
Collapse
|
15
|
Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. GENETICS SELECTION EVOLUTION 2022; 54:65. [PMID: 36153511 PMCID: PMC9509613 DOI: 10.1186/s12711-022-00756-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/05/2022] [Indexed: 12/03/2022]
Abstract
Background Early simulations indicated that whole-genome sequence data (WGS) could improve the accuracy of genomic predictions within and across breeds. However, empirical results have been ambiguous so far. Large datasets that capture most of the genomic diversity in a population must be assembled so that allele substitution effects are estimated with high accuracy. The objectives of this study were to use a large pig dataset from seven intensely selected lines to assess the benefits of using WGS for genomic prediction compared to using commercial marker arrays and to identify scenarios in which WGS provides the largest advantage. Methods We sequenced 6931 individuals from seven commercial pig lines with different numerical sizes. Genotypes of 32.8 million variants were imputed for 396,100 individuals (17,224 to 104,661 per line). We used BayesR to perform genomic prediction for eight complex traits. Genomic predictions were performed using either data from a standard marker array or variants preselected from WGS based on association tests. Results The accuracies of genomic predictions based on preselected WGS variants were not robust across traits and lines and the improvements in prediction accuracy that we achieved so far with WGS compared to standard marker arrays were generally small. The most favourable results for WGS were obtained when the largest training sets were available and standard marker arrays were augmented with preselected variants with statistically significant associations to the trait. With this method and training sets of around 80k individuals, the accuracy of within-line genomic predictions was on average improved by 0.025. With multi-line training sets, improvements of 0.04 compared to marker arrays could be expected. Conclusions Our results showed that WGS has limited potential to improve the accuracy of genomic predictions compared to marker arrays in intensely selected pig lines. Thus, although we expect that larger improvements in accuracy from the use of WGS are possible with a combination of larger training sets and optimised pipelines for generating and analysing such datasets, the use of WGS in the current implementations of genomic prediction should be carefully evaluated against the cost of large-scale WGS data on a case-by-case basis. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-022-00756-0.
Collapse
|
16
|
Vourlaki IT, Castanera R, Ramos-Onsins SE, Casacuberta JM, Pérez-Enciso M. Transposable element polymorphisms improve prediction of complex agronomic traits in rice. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:3211-3222. [PMID: 35931838 PMCID: PMC9482605 DOI: 10.1007/s00122-022-04180-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 07/12/2022] [Indexed: 06/15/2023]
Abstract
KEY MESSAGE Transposon insertion polymorphisms can improve prediction of complex agronomic traits in rice compared to using SNPs only, especially when accessions to be predicted are less related to the training set. Transposon insertion polymorphisms (TIPs) are significant sources of genetic variation. Previous work has shown that TIPs can improve detection of causative loci on agronomic traits in rice. Here, we quantify the fraction of variance explained by single nucleotide polymorphisms (SNPs) compared to TIPs, and we explore whether TIPs can improve prediction of traits when compared to using only SNPs. We used eleven traits of agronomic relevance from by five different rice population groups (Aus, Indica, Aromatic, Japonica, and Admixed), 738 accessions in total. We assess prediction by applying data split validation in two scenarios. In the within-population scenario, we predicted performance of improved Indica varieties using the rest of Indica accessions. In the across population scenario, we predicted all Aromatic and Admixed accessions using the rest of populations. In each scenario, Bayes C and a Bayesian reproducible kernel Hilbert space regression were compared. We find that TIPs can explain an important fraction of total genetic variance and that they also improve genomic prediction. In the across population prediction scenario, TIPs outperformed SNPs in nine out of the eleven traits analyzed. In some traits like leaf senescence or grain width, using TIPs increased predictive correlation by 30-50%. Our results evidence, for the first time, that TIPs genotyping can improve prediction on complex agronomic traits in rice, especially when accessions to be predicted are less related to training accessions.
Collapse
Affiliation(s)
- Ioanna-Theoni Vourlaki
- Universitat Autònoma de Barcelona, Department of Animal Production, 08193, Bellaterra, Barcelona, Spain.
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain.
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Sebastián E Ramos-Onsins
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Josep M Casacuberta
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain
| | - Miguel Pérez-Enciso
- Universitat Autònoma de Barcelona, Department of Animal Production, 08193, Bellaterra, Barcelona, Spain.
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, 08193, Bellaterra, Barcelona, Spain.
- Catalan Institute for Research and Advanced studies, ICREA, 08010, Barcelona, Spain.
| |
Collapse
|
17
|
Ye W, Xu L, Li Y, Liu L, Ma Z, Sun D, Han B. Single Nucleotide Polymorphisms of ALDH18A1 and MAT2A Genes and Their Genetic Associations with Milk Production Traits of Chinese Holstein Cows. Genes (Basel) 2022; 13:genes13081437. [PMID: 36011348 PMCID: PMC9407996 DOI: 10.3390/genes13081437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 07/16/2022] [Accepted: 08/10/2022] [Indexed: 11/16/2022] Open
Abstract
Our preliminary work had suggested two genes, aldehyde dehydrogenase 18 family member A1 (ALDH18A1) and methionine adenosyltransferase 2A (MAT2A), related to amino acid synthesis and metabolism as candidates affecting milk traits by analyzing the liver transcriptome and proteome of dairy cows at different lactation stages. In this study, the single nucleotide polymorphisms (SNPs) of ALDH18A1 and MAT2A genes were identified and their genetic effects and underlying causative mechanisms on milk production traits in dairy cattle were analyzed, with the aim of providing effective genetic information for the molecular breeding of dairy cows. By resequencing the entire coding and partial flanking regions of ALDH18A1 and MAT2A, we found eight SNPs located in ALDH18A1 and two in MAT2A. Single-SNP association analysis showed that most of the 10 SNPs of these two genes were significantly associated with the milk yield traits, 305-day milk yield, fat yield, and protein yield in the first and second lactations (corrected p ≤ 0.0488). Using Haploview 4.2, we found that the seven SNPs of ALDH18A1 formed two haplotype blocks; subsequently, the haplotype-based association analysis showed that both haplotypes were significantly associated with 305-day milk yield, fat yield, and protein yield (corrected p ≤ 0.014). Furthermore, by Jaspar and Genomatix software, we found that 26:g.17130318 C>A and 11:g.49472723G>C, respectively, in the 5′ flanking region of ALDH18A1 and MAT2A genes changed the transcription factor binding sites (TFBSs), which might regulate the expression of corresponding genes to affect the phenotypes of milk production traits. Therefore, these two SNPs were considered as potential functional mutations, but they also require further verification. In summary, ALDH18A1 and MAT2A were proved to probably have genetic effects on milk production traits, and their valuable SNPs might be used as candidate genetic markers for dairy cattle’s genomic selection (GS).
Collapse
Affiliation(s)
- Wen Ye
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Engineering Laboratory for Animal Breeding, China Agricultural University, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Lingna Xu
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Engineering Laboratory for Animal Breeding, China Agricultural University, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Yanhua Li
- Beijing Dairy Cattle Center, Beijing 100192, China
| | - Lin Liu
- Beijing Dairy Cattle Center, Beijing 100192, China
| | - Zhu Ma
- Beijing Dairy Cattle Center, Beijing 100192, China
| | - Dongxiao Sun
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Engineering Laboratory for Animal Breeding, China Agricultural University, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, Beijing 100193, China
| | - Bo Han
- Department of Animal Genetics and Breeding, College of Animal Science and Technology, National Engineering Laboratory for Animal Breeding, China Agricultural University, Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, Beijing 100193, China
- Correspondence:
| |
Collapse
|
18
|
Weighted Single-Step Genomic Best Linear Unbiased Prediction Method Application for Assessing Pigs on Meat Productivity and Reproduction Traits. Animals (Basel) 2022; 12:ani12131693. [PMID: 35804591 PMCID: PMC9264777 DOI: 10.3390/ani12131693] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Revised: 06/10/2022] [Accepted: 06/28/2022] [Indexed: 11/16/2022] Open
Abstract
Changes in the accuracy of the genomic estimates obtained by the ssGBLUP and wssGBLUP methods were evaluated using different reference groups. The weighting procedure’s reasonableness of application Pwas considered to improve the accuracy of genomic predictions for meat, fattening and reproduction traits in pigs. Six reference groups were formed to assess the genomic data quantity impact on the accuracy of predicted values (groups of genotyped animals). The datasets included 62,927 records of meat and fattening productivity (fat thickness over 6–7 ribs (BF1, mm)), muscle depth (MD, mm) and precocity up to 100 kg (age, days) and 16,070 observations of reproductive qualities (the number of all born piglets (TNB) and the number of live-born piglets (NBA), according to the results of the first farrowing). The wssGBLUP method has an advantage over ssGBLUP in terms of estimation reliability. When using a small reference group, the difference in the accuracy of ssGBLUP over BLUP AM is from −1.9 to +7.3 percent points, while for wssGBLUP, the change in accuracy varies from +18.2 to +87.3 percent points. Furthermore, the superiority of the wssGBLUP is also maintained for the largest group of genotyped animals: from +4.7 to +15.9 percent points for ssGBLUP and from +21.1 to +90.5 percent points for wssGBLUP. However, for all analyzed traits, the number of markers explaining 5% of genetic variability varied from 71 to 108, and the number of such SNPs varied depending on the size of the reference group (79–88 for BF1, 72–81 for MD, 71–108 for age). The results of the genetic variation distribution have the greatest similarity between groups of about 1000 and about 1500 individuals. Thus, the size of the reference group of more than 1000 individuals gives more stable results for the estimation based on the wssGBLUP method, while using the reference group of 500 individuals can lead to distorted results of GEBV.
Collapse
|
19
|
Rare and population-specific functional variation across pig lines. Genet Sel Evol 2022; 54:39. [PMID: 35659233 PMCID: PMC9164375 DOI: 10.1186/s12711-022-00732-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 05/17/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND It is expected that functional, mainly missense and loss-of-function (LOF), and regulatory variants are responsible for most phenotypic differences between breeds and genetic lines of livestock species that have undergone diverse selection histories. However, there is still limited knowledge about the existing missense and LOF variation in commercial livestock populations, in particular regarding population-specific variation and how it can affect applications such as across-breed genomic prediction. METHODS We re-sequenced the whole genome of 7848 individuals from nine commercial pig lines (average sequencing coverage: 4.1×) and imputed whole-genome genotypes for 440,610 pedigree-related individuals. The called variants were categorized according to predicted functional annotation (from LOF to intergenic) and prevalence level (number of lines in which the variant segregated; from private to widespread). Variants in each category were examined in terms of their distribution along the genome, alternative allele frequency, per-site Wright's fixation index (FST), individual load, and association to production traits. RESULTS Of the 46 million called variants, 28% were private (called in only one line) and 21% were widespread (called in all nine lines). Genomic regions with a low recombination rate were enriched with private variants. Low-prevalence variants (called in one or a few lines only) were enriched for lower allele frequencies, lower FST, and putatively functional and regulatory roles (including LOF and deleterious missense variants). On average, individuals carried fewer private deleterious missense alleles than expected compared to alleles with other predicted consequences. Only a small subset of the low-prevalence variants had intermediate allele frequencies and explained small fractions of phenotypic variance (up to 3.2%) of production traits. The significant low-prevalence variants had higher per-site FST than the non-significant ones. These associated low-prevalence variants were tagged by other more widespread variants in high linkage disequilibrium, including intergenic variants. CONCLUSIONS Most low-prevalence variants have low minor allele frequencies and only a small subset of low-prevalence variants contributed detectable fractions of phenotypic variance of production traits. Accounting for low-prevalence variants is therefore unlikely to noticeably benefit across-breed analyses, such as the prediction of genomic breeding values in a population using reference populations of a different genetic background.
Collapse
|
20
|
Danilevicz MF, Gill M, Anderson R, Batley J, Bennamoun M, Bayer PE, Edwards D. Plant Genotype to Phenotype Prediction Using Machine Learning. Front Genet 2022; 13:822173. [PMID: 35664329 PMCID: PMC9159391 DOI: 10.3389/fgene.2022.822173] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/07/2022] [Indexed: 12/13/2022] Open
Abstract
Genomic prediction tools support crop breeding based on statistical methods, such as the genomic best linear unbiased prediction (GBLUP). However, these tools are not designed to capture non-linear relationships within multi-dimensional datasets, or deal with high dimension datasets such as imagery collected by unmanned aerial vehicles. Machine learning (ML) algorithms have the potential to surpass the prediction accuracy of current tools used for genotype to phenotype prediction, due to their capacity to autonomously extract data features and represent their relationships at multiple levels of abstraction. This review addresses the challenges of applying statistical and machine learning methods for predicting phenotypic traits based on genetic markers, environment data, and imagery for crop breeding. We present the advantages and disadvantages of explainable model structures, discuss the potential of machine learning models for genotype to phenotype prediction in crop breeding, and the challenges, including the scarcity of high-quality datasets, inconsistent metadata annotation and the requirements of ML models.
Collapse
Affiliation(s)
- Monica F. Danilevicz
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Mitchell Gill
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Robyn Anderson
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Mohammed Bennamoun
- School of Physics, Mathematics and Computing, University of Western Australia, Perth, WA, Australia
| | - Philipp E. Bayer
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
- *Correspondence: David Edwards,
| |
Collapse
|
21
|
Yáñez JM, Xu P, Carvalheiro R, Hayes B. Genomics applied to livestock and aquaculture breeding. Evol Appl 2022; 15:517-522. [PMID: 35505887 PMCID: PMC9046759 DOI: 10.1111/eva.13378] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 03/31/2022] [Indexed: 11/29/2022] Open
Affiliation(s)
- José M. Yáñez
- Facultad de Ciencias Veterinarias y Pecuarias Universidad de Chile
| | - Peng Xu
- Fujian Key Laboratory of Genetics and Breeding of Marine Organisms College of Ocean and Earth Sciences Xiamen University Xiamen China
| | - Roberto Carvalheiro
- Departamento de Zootecnia Faculdade de Ciências Agrárias e Veterinárias UNESP – Univ Estadual Paulista Jaboticabal, São Paulo Brazil
- CSIRO Agriculture & Food Hobart Tasmania Australia
| | - Ben Hayes
- Centre for Animal Science Queensland Alliance for Agriculture and Food Innovation The University of Queensland Australia
| |
Collapse
|
22
|
Yoshida GM, Yáñez JM. Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data. Evol Appl 2022; 15:537-552. [PMID: 35505881 PMCID: PMC9046923 DOI: 10.1111/eva.13240] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 04/01/2021] [Accepted: 04/03/2021] [Indexed: 02/07/2023] Open
Abstract
Through imputation of genotypes, genome-wide association study (GWAS) and genomic prediction (GP) using whole-genome sequencing (WGS) data are cost-efficient and feasible in aquaculture breeding schemes. The objective was to dissect the genetic architecture of growth traits under chronic heat stress in rainbow trout (Oncorhynchus mykiss) and to assess the accuracy of GP based on imputed WGS and different preselected single nucleotide polymorphism (SNP) arrays. A total of 192 and 764 fish challenged to a heat stress experiment for 62 days were genotyped using a customized 1 K and 26 K SNP panels, respectively, and then, genotype imputation was performed from a low-density chip to WGS using 102 parents (36 males and 66 females) as the reference population. Imputed WGS data were used to perform GWAS and test GP accuracy under different preselected SNP scenarios. Heritability was estimated for body weight (BW), body length (BL) and average daily gain (ADG). Estimates using imputed WGS data ranged from 0.33 ± 0.05 to 0.55 ± 0.05 for growth traits under chronic heat stress. GWAS revealed that the top five cumulatively SNPs explained a maximum of 0.94%, 0.86% and 0.51% of genetic variance for BW, BL and ADG, respectively. Some important functional candidate genes associated with growth-related traits were found among the most important SNPs, including signal transducer and activator of transcription 5B and 3 (STAT5B and STAT3, respectively) and cytokine-inducible SH2-containing protein (CISH). WGS data resulted in a slight increase in prediction accuracy compared with pedigree-based method, whereas preselected SNPs based on the top GWAS hits improved prediction accuracies, with values ranging from 1.2 to 13.3%. Our results support the evidence of the polygenic nature of growth traits when measured under heat stress. The accuracies of GP can be improved using preselected variants from GWAS, and the use of WGS marginally increases prediction accuracy.
Collapse
Affiliation(s)
| | - José M. Yáñez
- Facultad de Ciencias Veterinarias y PecuariasUniversidad de ChileSantiagoChile
- Núcleo Milenio INVASALConcepciónChile
| |
Collapse
|
23
|
Selection and Drift: A Comparison between Historic and Recent Dutch Friesian Cattle and Recent Holstein Friesian Using WGS Data. Animals (Basel) 2022; 12:ani12030329. [PMID: 35158654 PMCID: PMC8833835 DOI: 10.3390/ani12030329] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 01/21/2022] [Accepted: 01/26/2022] [Indexed: 12/13/2022] Open
Abstract
Simple Summary Over the last century, genetic diversity in the cattle species has been affected by the replacement of many local, dual-purpose breeds with a few specialized, high-output dairy breeds. This replacement caused a sharp decline in the population size of local breeds. In the Netherlands, the local Dutch Friesian breed has gradually been replaced by the Holstein Friesian. This resulted in a rapid decrease in numbers of the Dutch Friesian breed with an associated risk of loss of genetic diversity due to drift. The objective of this study is to investigate genomewide genetic diversity between a group of historic and recent Dutch Friesian bulls and a group of recently used Holstein Friesian bulls. Our findings showed that a large amount of diversity is shared between the three groups, but each of them has some unique genetic identity (12% of the single nucleotide polymorphism were group-specific). The genetic diversity of the Dutch Friesians reduced over time, but this did not lead to higher inbreeding levels—especially, inbreeding due to recent ancestors has not increased. Genetically, the recent Dutch Friesians were slightly more different from Holstein Friesians than the historic Dutch Friesians. Our results also highlighted the presence of several genomic regions that differentiated between the groups. Abstract Over the last century, genetic diversity in many cattle breeds has been affected by the replacement of traditional local breeds with just a few milk-producing breeds. In the Netherlands, the local Dutch Friesian breed (DF) has gradually been replaced by the Holstein Friesian breed (HF). The objective of this study is to investigate genomewide genetic diversity between a group of historically and recently used DF bulls and a group of recently used HF bulls. Genetic material of 12 historic (hDF), 12 recent DF bulls (rDF), and 12 recent HF bulls (rHF) in the Netherlands was sequenced. Based on the genomic information, different parameters—e.g., allele frequencies, inbreeding coefficient, and runs of homozygosity (ROH)—were calculated. Our findings showed that a large amount of diversity is shared between the three groups, but each of them has a unique genetic identity (12% of the single nucleotide polymorphisms were group-specific). The rDF is slightly more diverged from rHF than hDF. The inbreeding coefficient based on runs of homozygosity (Froh) was higher for rDF (0.24) than for hDF (0.17) or rHF (0.13). Our results also displayed the presence of several genomic regions that differentiated between the groups. In addition, thirteen, forty-five, and six ROH islands were identified in hDF, rDF, and rHF, respectively. The genetic diversity of the DF breed reduced over time, but this did not lead to higher inbreeding levels—especially, inbreeding due to recent ancestors was not increased.
Collapse
|
24
|
Guillenea A, Su G, Lund MS, Karaman E. Genomic prediction in Nordic Red dairy cattle considering breed origin of alleles. J Dairy Sci 2022; 105:2426-2438. [PMID: 35033341 DOI: 10.3168/jds.2021-21173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 11/23/2021] [Indexed: 01/02/2023]
Abstract
This study investigated the reliability of genomic prediction (GP) using breed origin of alleles (BOA) approach in the Nordic Red (RDC) population, which has an admixed population structure. The RDC population consists of animals with varying degrees of genetic materials from the Danish Red (RDM), Swedish Red (SRB), Finnish Ayrshire (FAY), and Holstein (HOL) because bulls have been used across the breeds. The BOA approach was tested using 39,550 RDC animals in the reference population and 11,786 in the validation population. Deregressed proofs (DRP) of milk, fat and protein were used as response variable for GP. Direct genomic breeding values (DGV) for animals in the validation population were calculated with (BOA model) or without (joint model) considering breed origin of alleles. The joint model assumed homogeneous marker effects and a single set of marker effects were estimated, whereas BOA model assumed heterogeneous marker effects, and different sets of marker effects were estimated across the breeds. For the BOA approach, we tested scenarios assuming both correlated (BOA_cor) and uncorrelated (BOA_uncor) marker effects between the breeds. Additionally, we investigated GP using a standard Illumina 50K chip and including SNP selected from imputed whole-genome sequencing (50K+WGS). We also studied the effect of estimating (co)variances for genome regions of different sizes to exploit the information of the genome regions contributing to the (co)variance between the breeds. Region sizes were set as 1 SNP, a group of 30 or 100 adjacent SNP, or the whole genome. Reliability of DGV was measured as squared correlations between DGV and DRP divided by the reliability of DRP. Across the 3 traits, in general, RS30 and RS100 SNP yielded the highest reliabilities. Including WGS SNP improved reliabilities in almost all scenarios (0.297 on average for 50K and 0.307 on average for 50K+WGS). The BOA_uncor (0.233 on average) was inferior to the joint model (0.339 on average), but the reliabilities obtained using BOA_cor (0.334 on average) in most cases were not significantly different from those obtained using the joint model. The results indicate that both including additional whole-genome sequencing SNP and dividing the genome into fixed regions improve GP in the RDC. The BOA models have the potential to increase the reliability of GP, but the benefit is limited in populations with a high exchange of genetic material for a long time, as is the case for RDC.
Collapse
Affiliation(s)
- Ana Guillenea
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Mogens Sand Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830 Tjele, Denmark
| |
Collapse
|
25
|
Legarra A, Garcia-Baccino CA, Wientjes YCJ, Vitezica ZG. The correlation of substitution effects across populations and generations in the presence of nonadditive functional gene action. Genetics 2021; 219:iyab138. [PMID: 34718531 PMCID: PMC8664574 DOI: 10.1093/genetics/iyab138] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 08/19/2021] [Indexed: 11/14/2022] Open
Abstract
Allele substitution effects at quantitative trait loci (QTL) are part of the basis of quantitative genetics theory and applications such as association analysis and genomic prediction. In the presence of nonadditive functional gene action, substitution effects are not constant across populations. We develop an original approach to model the difference in substitution effects across populations as a first order Taylor series expansion from a "focal" population. This expansion involves the difference in allele frequencies and second-order statistical effects (additive by additive and dominance). The change in allele frequencies is a function of relationships (or genetic distances) across populations. As a result, it is possible to estimate the correlation of substitution effects across two populations using three elements: magnitudes of additive, dominance, and additive by additive variances; relationships (Nei's minimum distances or Fst indexes); and assumed heterozygosities. Similarly, the theory applies as well to distinct generations in a population, in which case the distance across generations is a function of increase of inbreeding. Simulation results confirmed our derivations. Slight biases were observed, depending on the nonadditive mechanism and the reference allele. Our derivations are useful to understand and forecast the possibility of prediction across populations and the similarity of GWAS effects.
Collapse
Affiliation(s)
- Andres Legarra
- INRAE/INP, UMR 1388 GenPhySE, Castanet-Tolosan 31326, France
| | - Carolina A. Garcia-Baccino
- INRAE/INP, UMR 1388 GenPhySE, Castanet-Tolosan 31326, France
- Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, Buenos Aires C1417DSQ, Argentina
- SAS NUCLEUS, Le Rheu 35650, France
| | - Yvonne C. J. Wientjes
- Wageningen University & Research, Animal Breeding and Genomics, Wageningen 6700 AH, the Netherlands
| | | |
Collapse
|
26
|
Zhao C, Teng J, Zhang X, Wang D, Zhang X, Li S, Jiang X, Li H, Ning C, Zhang Q. Towards a Cost-Effective Implementation of Genomic Prediction Based on Low Coverage Whole Genome Sequencing in Dezhou Donkey. Front Genet 2021; 12:728764. [PMID: 34804115 PMCID: PMC8595392 DOI: 10.3389/fgene.2021.728764] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 09/20/2021] [Indexed: 11/25/2022] Open
Abstract
Low-coverage whole genome sequencing is a low-cost genotyping technology. Combined with genotype imputation approaches, it is likely to become a critical component of cost-effective genomic selection programs in agricultural livestock. Here, we used the low-coverage sequence data of 617 Dezhou donkeys to investigate the performance of genotype imputation for low-coverage whole genome sequence data and genomic prediction based on the imputed genotype data. The specific aims were as follows: 1) to measure the accuracy of genotype imputation under different sequencing depths, sample sizes, minor allele frequency (MAF), and imputation pipelines and 2) to assess the accuracy of genomic prediction under different marker densities derived from the imputed sequence data, different strategies for constructing the genomic relationship matrixes, and single-vs. multi-trait models. We found that a high imputation accuracy (>0.95) can be achieved for sequence data with a sequencing depth as low as 1x and the number of sequenced individuals ≥400. For genomic prediction, the best performance was obtained by using a marker density of 410K and a G matrix constructed using expected marker dosages. Multi-trait genomic best linear unbiased prediction (GBLUP) performed better than single-trait GBLUP. Our study demonstrates that low-coverage whole genome sequencing would be a cost-effective approach for genomic prediction in Dezhou donkey.
Collapse
Affiliation(s)
- Changheng Zhao
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Jun Teng
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinhao Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China.,National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Dan Wang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xinyi Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Shiyin Li
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Xin Jiang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Haijing Li
- National Engineering Research Center for Gelatin-based TCM, Dong-E E-Jiao Co., Ltd., Dong'e County, China
| | - Chao Ning
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| | - Qin Zhang
- Shandong Provincial Key Laboratory of Animal Biotechnology and Disease Control and Prevention, College of Animal Science and Veterinary Medicine, Shandong Agricultural University, Tai'an, China
| |
Collapse
|
27
|
Mollandin F, Rau A, Croiseau P. An evaluation of the predictive performance and mapping power of the BayesR model for genomic prediction. G3 GENES|GENOMES|GENETICS 2021; 11:6317672. [PMID: 34849780 PMCID: PMC8527474 DOI: 10.1093/g3journal/jkab225] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 06/27/2021] [Indexed: 12/02/2022]
Abstract
Technological advances and decreasing costs have led to the rise of increasingly dense genotyping data, making feasible the identification of potential causal markers. Custom genotyping chips, which combine medium-density genotypes with a custom genotype panel, can capitalize on these candidates to potentially yield improved accuracy and interpretability in genomic prediction. A particularly promising model to this end is BayesR, which divides markers into four effect size classes. BayesR has been shown to yield accurate predictions and promise for quantitative trait loci (QTL) mapping in real data applications, but an extensive benchmarking in simulated data is currently lacking. Based on a set of real genotypes, we generated simulated data under a variety of genetic architectures and phenotype heritabilities, and we evaluated the impact of excluding or including causal markers among the genotypes. We define several statistical criteria for QTL mapping, including several based on sliding windows to account for linkage disequilibrium (LD). We compare and contrast these statistics and their ability to accurately prioritize known causal markers. Overall, we confirm the strong predictive performance for BayesR in moderately to highly heritable traits, particularly for 50k custom data. In cases of low heritability or weak LD with the causal marker in 50k genotypes, QTL mapping is a challenge, regardless of the criterion used. BayesR is a promising approach to simultaneously obtain accurate predictions and interpretable classifications of SNPs into effect size classes. We illustrated the performance of BayesR in a variety of simulation scenarios, and compared the advantages and limitations of each.
Collapse
Affiliation(s)
- Fanny Mollandin
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
| | - Andrea Rau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
- BioEcoAgro Joint Research Unit, INRAE, Université de Liège, Université de Lille, Université de Picardie Jules Verne, Peronne 80203, France
| | - Pascal Croiseau
- INRAE, AgroParisTech, GABI, Université Paris-Saclay, Jouy-en-Josas 78350, France
| |
Collapse
|
28
|
Casellas J, Martín de Hijas-Villalba M, Vázquez-Gómez M, Id-Lahoucine S. Low-coverage whole-genome sequencing in livestock species for individual traceability and parentage testing. Livest Sci 2021. [DOI: 10.1016/j.livsci.2021.104629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
29
|
An Overview of the Use of Genotyping Techniques for Assessing Genetic Diversity in Local Farm Animal Breeds. Animals (Basel) 2021; 11:ani11072016. [PMID: 34359144 PMCID: PMC8300386 DOI: 10.3390/ani11072016] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 06/30/2021] [Accepted: 07/01/2021] [Indexed: 12/18/2022] Open
Abstract
Simple Summary The number of local farm animal breeds is declining worldwide. However, these breeds have different degrees of genetic diversity. Measuring genetic diversity is important for the development of conservation strategies and, therefore, various genomic analysis techniques are available. The aim of the present work was to shed light on the use of these techniques in diversity studies of local breeds. In summary, a total of 133 worldwide studies that examined genetic diversity in local cattle, sheep, goat, chicken and pig breeds were reviewed. The results show that over time, almost all available genomic techniques were used and various diversity parameters were calculated. Therefore, the present results provide a comprehensive overview of the application of these techniques in the field of local breeds. This can provide helpful insights into the advancement of the conservation of breeds with high genetic diversity. Abstract Globally, many local farm animal breeds are threatened with extinction. However, these breeds contribute to the high amount of genetic diversity required to combat unforeseen future challenges of livestock production systems. To assess genetic diversity, various genotyping techniques have been developed. Based on the respective genomic information, different parameters, e.g., heterozygosity, allele frequencies and inbreeding coefficient, can be measured in order to reveal genetic diversity between and within breeds. The aim of the present work was to shed light on the use of genotyping techniques in the field of local farm animal breeds. Therefore, a total of 133 studies across the world that examined genetic diversity in local cattle, sheep, goat, chicken and pig breeds were reviewed. The results show that diversity of cattle was most often investigated with microsatellite use as the main technique. Furthermore, a large variety of diversity parameters that were calculated with different programs were identified. For 15% of the included studies, the used genotypes are publicly available, and, in 6%, phenotypes were recorded. In conclusion, the present results provide a comprehensive overview of the application of genotyping techniques in the field of local breeds. This can provide helpful insights to advance the conservation of breeds.
Collapse
|
30
|
Validation of the Prediction Accuracy for 13 Traits in Chinese Simmental Beef Cattle Using a Preselected Low-Density SNP Panel. Animals (Basel) 2021; 11:ani11071890. [PMID: 34202066 PMCID: PMC8300368 DOI: 10.3390/ani11071890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary To reduce the breeding costs and promote the application of genomic selection (GS) in Chinese Simmental beef cattle, we developed a customized low-density single-nucleotide polymorphism (SNP) panel consisting of 30,684 SNPs. When comparing the predictive performance of the low-density SNP panel to that of the BovineHD Beadchip for 13 traits, we found that this ~30 K panel achieved moderate to high prediction accuracies for most traits, while reducing the prediction accuracies of six traits by 0.04–0.09 and decreasing the prediction accuracy of one trait by 0.2. For the remaining six traits, the usage of the low-density SNP panel was associated with a slight increase in prediction accuracy. Our studies suggested that the low-density SNP panel (~30 K) is a feasible and promising tool for cost-effective genomic prediction in Chinese Simmental beef cattle, which may provide breeding organizations with a cheaper option and greater returns on investment. Abstract Chinese Simmental beef cattle play a key role in the Chinese beef industry due to their great adaptability and marketability. To achieve efficient genetic gain at a low breeding cost, it is crucial to develop a customized cost-effective low-density SNP panel for this cattle population. Thirteen growth, carcass, and meat quality traits and a BovineHD Beadchip genotyping of 1346 individuals were used to select trait-associated variants and variants contributing to great genetic variance. In addition, highly informative SNPs with high MAF in each 500 kb sliding window and in each genic region were also included separately. A low-density SNP panel consisting of 30,684 SNPs was developed, with an imputation accuracy of 97.4% when imputed to the 770 K level. Among 13 traits, the average prediction accuracy levels evaluated by genomic best linear unbiased prediction (GBLUP) and BayesA/B/Cπ were 0.22–0.47 and 0.18–0.60 for the ~30 K array and BovineHD Beadchip, respectively. Generally, the predictive performance of the ~30 K array was trait-dependent, with reduced prediction accuracies for seven traits. While differences in terms of prediction accuracy were observed among the 13 traits, the low-density SNP panel achieved moderate to high accuracies for most of the traits and even improved the accuracies for some traits.
Collapse
|
31
|
Geibel J, Reimer C, Weigend S, Weigend A, Pook T, Simianer H. How array design creates SNP ascertainment bias. PLoS One 2021; 16:e0245178. [PMID: 33784304 PMCID: PMC8009414 DOI: 10.1371/journal.pone.0245178] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 12/22/2020] [Indexed: 12/30/2022] Open
Abstract
Single nucleotide polymorphisms (SNPs), genotyped with arrays, have become a widely used marker type in population genetic analyses over the last 10 years. However, compared to whole genome re-sequencing data, arrays are known to lack a substantial proportion of globally rare variants and tend to be biased towards variants present in populations involved in the development process of the respective array. This affects population genetic estimators and is known as SNP ascertainment bias. We investigated factors contributing to ascertainment bias in array development by redesigning the Axiom™ Genome-Wide Chicken Array in silico and evaluating changes in allele frequency spectra and heterozygosity estimates in a stepwise manner. A sequential reduction of rare alleles during the development process was shown. This was mainly caused by the identification of SNPs in a limited set of populations and a within-population selection of common SNPs when aiming for equidistant spacing. These effects were shown to be less severe with a larger discovery panel. Additionally, a generally massive overestimation of expected heterozygosity for the ascertained SNP sets was shown. This overestimation was 24% higher for populations involved in the discovery process than not involved populations in case of the original array. The same was observed after the SNP discovery step in the redesign. However, an unequal contribution of populations during the SNP selection can mask this effect but also adds uncertainty. Finally, we make suggestions for the design of specialized arrays for large scale projects where whole genome re-sequencing techniques are still too expensive.
Collapse
Affiliation(s)
- Johannes Geibel
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, Göttingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, Göttingen, Germany
- * E-mail:
| | - Christian Reimer
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, Göttingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, Göttingen, Germany
| | - Steffen Weigend
- Center for Integrated Breeding Research, University of Goettingen, Göttingen, Germany
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt-Mariensee, Germany
| | - Annett Weigend
- Institute of Farm Animal Genetics, Friedrich-Loeffler-Institut, Neustadt-Mariensee, Germany
| | - Torsten Pook
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, Göttingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, Göttingen, Germany
| | - Henner Simianer
- Department of Animal Sciences, Animal Breeding and Genetics Group, University of Goettingen, Göttingen, Germany
- Center for Integrated Breeding Research, University of Goettingen, Göttingen, Germany
| |
Collapse
|
32
|
Pérez-Enciso M, Steibel JP. Phenomes: the current frontier in animal breeding. Genet Sel Evol 2021; 53:22. [PMID: 33673800 PMCID: PMC7934239 DOI: 10.1186/s12711-021-00618-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 02/22/2021] [Indexed: 12/13/2022] Open
Abstract
Improvements in genomic technologies have outpaced the most optimistic predictions, allowing industry-scale application of genomic selection. However, only marginal gains in genetic prediction accuracy can now be expected by increasing marker density up to sequence, unless causative mutations are identified. We argue that some of the most scientifically disrupting and industry-relevant challenges relate to ‘phenomics’ instead of ‘genomics’. Thanks to developments in sensor technology and artificial intelligence, there is a wide range of analytical tools that are already available and many more will be developed. We can now address some of the pressing societal demands on the industry, such as animal welfare concerns or efficiency in the use of resources. From the statistical and computational point of view, phenomics raises two important issues that require further work: penalization and dimension reduction. This will be complicated by the inherent heterogeneity and ‘missingness’ of the data. Overall, we can expect that precision livestock technologies will make it possible to collect hundreds of traits on a continuous basis from large numbers of animals. Perhaps the main revolution will come from redesigning animal breeding schemes to explicitly allow for high-dimensional phenomics. In the meantime, phenomics data will definitely enlighten our knowledge on the biological basis of phenotypes.
Collapse
Affiliation(s)
- Miguel Pérez-Enciso
- ICREA, Passeig de Lluís Companys 23, 08010, Barcelona, Spain. .,Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Bellaterra, 08193, Barcelona, Spain.
| | - Juan P Steibel
- Department of Animal Science, Michigan State University, East Lansing, MI, 48824, USA.,Department of Fisheries and Wildlife, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
33
|
Londoño-Gil M, Rincón Flórez JC, Lopez-Herrera A, Gonzalez-Herrera LG. GENOME-WIDE ASSOCIATION STUDY FOR GROWTH TRAITS IN BLANCO OREJINERO (BON) CATTLE FROM COLOMBIA. Livest Sci 2021. [DOI: 10.1016/j.livsci.2020.104366] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
34
|
Calleja-Rodriguez A, Pan J, Funda T, Chen Z, Baison J, Isik F, Abrahamsson S, Wu HX. Evaluation of the efficiency of genomic versus pedigree predictions for growth and wood quality traits in Scots pine. BMC Genomics 2020; 21:796. [PMID: 33198692 PMCID: PMC7667760 DOI: 10.1186/s12864-020-07188-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 10/26/2020] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Genomic selection (GS) or genomic prediction is a promising approach for tree breeding to obtain higher genetic gains by shortening time of progeny testing in breeding programs. As proof-of-concept for Scots pine (Pinus sylvestris L.), a genomic prediction study was conducted with 694 individuals representing 183 full-sib families that were genotyped with genotyping-by-sequencing (GBS) and phenotyped for growth and wood quality traits. 8719 SNPs were used to compare different genomic with pedigree prediction models. Additionally, four prediction efficiency methods were used to evaluate the impact of genomic breeding value estimations by assigning diverse ratios of training and validation sets, as well as several subsets of SNP markers. RESULTS Genomic Best Linear Unbiased Prediction (GBLUP) and Bayesian Ridge Regression (BRR) combined with expectation maximization (EM) imputation algorithm showed slightly higher prediction efficiencies than Pedigree Best Linear Unbiased Prediction (PBLUP) and Bayesian LASSO, with some exceptions. A subset of approximately 6000 SNP markers, was enough to provide similar prediction efficiencies as the full set of 8719 markers. Additionally, prediction efficiencies of genomic models were enough to achieve a higher selection response, that varied between 50-143% higher than the traditional pedigree-based selection. CONCLUSIONS Although prediction efficiencies were similar for genomic and pedigree models, the relative selection response was doubled for genomic models by assuming that earlier selections can be done at the seedling stage, reducing the progeny testing time, thus shortening the breeding cycle length roughly by 50%.
Collapse
Affiliation(s)
- Ainhoa Calleja-Rodriguez
- Skogforsk (The Forestry Research Institute of Sweden), Box 3, Sävar, SE 918 21 Sweden
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - Jin Pan
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - Tomas Funda
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- Department of Genetics and Breeding, Faculty of Agrobiology and Natural Resources, Czech University of Life Sciences Prague, Prague, 165 00 Czech Republic
- Key Laboratory of Forest Genetics and Biotechnology, Nanjing Forestry University, Nanjing, 210037 China
| | - Zhiqiang Chen
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
| | - John Baison
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- RAGT Seeds, Essex, CB 101TA United Kingdom
| | - Fikret Isik
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC 27695 USA
| | - Sara Abrahamsson
- Skogforsk (The Forestry Research Institute of Sweden), Box 3, Sävar, SE 918 21 Sweden
| | - Harry X. Wu
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, Umeå, SE - 901 83 Sweden
- Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, 100083 China
- National Research Collection Australia, CSIRO, Canberra, ACT 2601 Australia
| |
Collapse
|
35
|
Accuracy of genomic evaluation using imputed high-density genotypes for carcass traits in commercial Hanwoo population. Livest Sci 2020. [DOI: 10.1016/j.livsci.2020.104256] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
36
|
Lau LY, Nguyen LT, Reverter A, Moore SS, Lynn A, McBride‐Kelly L, Phillips‐Rose L, Plath M, Macfarlane R, Vasudivan V, Morton L, Ardley R, Ye Y, Fortes MRS. Gene regulation could be attributed to TCF3 and other key transcription factors in the muscle of pubertal heifers. Vet Med Sci 2020; 6:695-710. [PMID: 32432381 PMCID: PMC7738712 DOI: 10.1002/vms3.278] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 03/13/2020] [Accepted: 04/09/2020] [Indexed: 01/17/2023] Open
Abstract
Puberty is a whole-body event, driven by the hypothalamic integration of peripheral signals such as leptin or IGF-1. In the process of puberty, reproductive development is simultaneous to growth, including muscle growth. To enhance our understanding of muscle function related to puberty, we performed transcriptome analyses of muscle samples from six pre- and six post-pubertal Brahman heifers (Bos indicus). Our aims were to perform differential expression analyses and co-expression analyses to derive a regulatory gene network associate with puberty. As a result, we identified 431 differentially expressed (DEx) transcripts (genes and non-coding RNAs) when comparing pre- to post-pubertal average gene expression. The DEx transcripts were compared with all expressed transcripts in our samples (over 14,000 transcripts) for functional enrichment analyses. The DEx transcripts were associated with "extracellular region," "inflammatory response" and "hormone activity" (adjusted p < .05). Inflammatory response for muscle regeneration is a necessary aspect of muscle growth, which is accelerated during puberty. The term "hormone activity" may signal genes that respond to progesterone signalling in the muscle, as the presence of this hormone is an important difference between pre- and post-pubertal heifers in our experimental design. The DEx transcript with the highest average expression difference was a mitochondrial gene, ENSBTAG00000043574 that might be another important link between energy metabolism and puberty. In the derived co-expression gene network, we identified six hub genes: CDC5L, MYC, TCF3, RUNX2, ATF2 and CREB1. In the same network, 48 key regulators of DEx transcripts were identified, using a regulatory impact factor metric. The hub gene TCF3 was also a key regulator. The majority of the key regulators (22 genes) are members of the zinc finger family, which has been implicated in bovine puberty in other tissues. In conclusion, we described how puberty may affect muscle gene expression in cattle.
Collapse
Affiliation(s)
- Li Yieng Lau
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Loan T. Nguyen
- Queensland Alliance for Agriculture and Food InnovationThe University of QueenslandBrisbaneQLDAustralia
| | - Antonio Reverter
- CSIRO Agriculture and FoodQueensland Biosciences PrecinctBrisbaneQLDAustralia
| | - Stephen S. Moore
- Queensland Alliance for Agriculture and Food InnovationThe University of QueenslandBrisbaneQLDAustralia
| | - Aaron Lynn
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Liam McBride‐Kelly
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Louis Phillips‐Rose
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Mackenzie Plath
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Rhys Macfarlane
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Vanisha Vasudivan
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Lachlan Morton
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Ryan Ardley
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Yunan Ye
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
| | - Marina R. S. Fortes
- School of Chemistry and Molecular BiologyThe University of QueenslandBrisbaneQLDAustralia
- Queensland Alliance for Agriculture and Food InnovationThe University of QueenslandBrisbaneQLDAustralia
| |
Collapse
|
37
|
Talouarn E, Teissier M, Bardou P, Larroque H, Clément V, Palhière I, Tosser-Klopp G, Rupp R, Robert-Granié C. Using sequence variants of a QTL region improves the accuracy of genomic evaluation in French Saanen goats. J Dairy Sci 2020; 104:588-601. [PMID: 33131807 DOI: 10.3168/jds.2020-18837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 08/11/2020] [Indexed: 11/19/2022]
Abstract
The enhanced availability of sequence data in livestock provides an opportunity for more accurate predictions in routine genomic evaluations. Such evaluations would therefore no longer rely only on the linkage disequilibrium between a chip marker and the causal mutation. The objective of this study was to assess the usefulness of sequence data in Saanen goats (n = 33) to better capture a quantitative trait locus (QTL) on chromosome 19 (CHI19) and improve the accuracy of predictions for 3 milk production traits, 5 type traits, and somatic cell scores. All 1,207 50K genotypes were imputed to the sequence level. Four scenarios, each using a subset of CHI19 imputed variants, were then tested. Sequence-derived information included all CHI19 variants (529,576), all variants in the QTL region (22,269), 178 variants selected in the QTL region and added to an updated chip, or 178 randomly selected variants on CHI19. Two genomic evaluation models were applied: single-step genomic BLUP and weighted single-step genomic BLUP. All scenarios were compared with single-step genomic BLUP using 50K genotypes. Best overall results were obtained using single-step genomic BLUP on 50K genotypes completed with all variants in the QTL region of chromosome 19 (6.2% average increase in accuracy for 9 traits) with the highest accuracy gain for fat yield (17.9%), significant increases for milk (13.7%) and protein yields (12.5%), and type traits associated with CHI19. Despite its association with the QTL region of chromosome 19, the somatic cell score showed decreased accuracy in every alternative scenario. Using all CHI19 variants led to an overall decrease of 4.8% in prediction accuracy. The updated chip was efficient and improved genomic evaluations by 3.1 to 6.4% on average, depending on the scenario. Indeed, information from only a few carefully selected variants increased accuracies for traits of interest when used in a single-step genomic BLUP model. In conclusion, using QTL region variants imputed from sequence data in single-step genomic evaluations represents a promising perspective for such evaluations in dairy goats. Furthermore, using only a limited number of selected variants in QTL regions, as available on SNP chip updates, significantly increases the accuracy for QTL-associated traits without deteriorating the evaluation accuracy for other traits. The latter approach is interesting, as it avoids time-consuming imputation and data formatting processes and provides reliable genotypes.
Collapse
Affiliation(s)
- Estelle Talouarn
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France.
| | - Marc Teissier
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Hélène Larroque
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Isabelle Palhière
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | | - Rachel Rupp
- GenPhySE, Université de Toulouse, INRAE, ENVT, F-31326 Castanet-Tolosan, France
| | | |
Collapse
|
38
|
Yáñez JM, Joshi R, Yoshida GM. Genomics to accelerate genetic improvement in tilapia. Anim Genet 2020; 51:658-674. [PMID: 32761644 DOI: 10.1111/age.12989] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 06/25/2020] [Accepted: 06/26/2020] [Indexed: 12/13/2022]
Abstract
Selective breeding of tilapia populations started in the early 1990s and over the past three decades tilapia has become one of the most important farmed freshwater species, being produced in more than 125 countries around the globe. Although genome assemblies have been available since 2011, most of the tilapia industry still depends on classical selection techniques using mass spawning or pedigree information to select for growth traits with reported genetic gains of up to 20% per generation. The involvement of international breeding companies and research institutions has resulted in the rapid development and application of genomic resources in the last few years. GWAS and genomic selection are expected to contribute to uncovering the genetic variants involved in economically relevant traits and increasing the genetic gain in selective breeding programs, respectively. Developments over the next few years will probably focus on achieving a deep understanding of genetic architecture of complex traits, as well as accelerating genetic progress in the selection for growth-, quality- and robustness-related traits. Novel phenotyping technologies (i.e. phenomics), lower-cost whole-genome sequencing approaches, functional genomics and gene editing tools will be crucial in future developments for the improvement of tilapia aquaculture.
Collapse
Affiliation(s)
- J M Yáñez
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Av Santa Rosa 11735, La Pintana, Santiago, 8820808, Chile.,Núcleo Milenio INVASAL, Casilla 160-C, Concepción, Chile
| | - R Joshi
- GenoMar Genetics AS, Bolette Brygge 1, Oslo, 0252, Norway
| | - G M Yoshida
- Facultad de Ciencias Veterinarias y Pecuarias, Universidad de Chile, Av Santa Rosa 11735, La Pintana, Santiago, 8820808, Chile
| |
Collapse
|
39
|
Liu A, Lund MS, Boichard D, Mao X, Karaman E, Fritz S, Aamand GP, Wang Y, Su G. Imputation for sequencing variants preselected to a customized low-density chip. Sci Rep 2020; 10:9524. [PMID: 32533087 PMCID: PMC7293337 DOI: 10.1038/s41598-020-66523-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 05/19/2020] [Indexed: 12/27/2022] Open
Abstract
The sequencing variants preselected from association analyses and bioinformatics analyses could improve genomic prediction. In this study, the imputation of sequencing SNPs preselected from major dairy breeds in Denmark-Finland-Sweden (DFS) and France (FRA) was investigated for both contemporary animals and old bulls in Danish Jersey. For contemporary animals, a two-step imputation which first imputed to 54 K and then to 54 K + DFS + FRA SNPs achieved highest accuracy. Correlations between observed and imputed genotypes were 91.6% for DFS SNPs and 87.6% for FRA SNPs, while concordance rates were 96.6% for DFS SNPs and 93.5% for FRA SNPs. The SNPs with lower minor allele frequency (MAF) tended to have lower correlations but higher concordance rates. For old bulls, imputation for DFS and FRA SNPs were relatively accurate even for bulls without progenies (correlations higher than 97.2% and concordance rates higher than 98.4%). For contemporary animals, given limited imputation accuracy of preselected sequencing SNPs especially for SNPs with low MAF, it would be a good strategy to directly genotype preselected sequencing SNPs with a customized SNP chip. For old bulls, given high imputation accuracy for preselected sequencing SNPs with all MAF ranges, it would be unnecessary to re-genotype preselected sequencing SNPs.
Collapse
Affiliation(s)
- Aoxing Liu
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.,Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China
| | - Mogens Sandø Lund
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Didier Boichard
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France
| | - Xiaowei Mao
- Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 100044, Beijing, P.R. China.,CAS Center for Excellence in Life and Paleoenvironment, 100044, Beijing, P.R. China
| | - Emre Karaman
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark
| | - Sebastien Fritz
- GABI, INRA, AgroParisTech, Université Paris Saclay, 78350, Jouy-en-Josas, France.,ALLICE, 75012, Paris, France
| | | | - Yachun Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, MARA; National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, China Agricultural University, 100193, Beijing, P.R. China.
| | - Guosheng Su
- Center for Quantitative Genetics and Genomics, Aarhus University, 8830, Tjele, Denmark.
| |
Collapse
|
40
|
Manimekalai R, Suresh G, Govinda Kurup H, Athiappan S, Kandalam M. Role of NGS and SNP genotyping methods in sugarcane improvement programs. Crit Rev Biotechnol 2020; 40:865-880. [PMID: 32508157 DOI: 10.1080/07388551.2020.1765730] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Sugarcane (Saccharum spp.) is one of the most economically significant crops because of its high sucrose content and it is a promising biomass feedstock for biofuel production. Sugarcane genome sequencing and analysis is a difficult task due to its heterozygosity and polyploidy. Long sequence read technologies, PacBio Single-Molecule Real-Time (SMRT) sequencing, the Illumina TruSeq, and the Oxford Nanopore sequencing could solve the problem of genome assembly. On the applications side, next generation sequencing (NGS) technologies played a major role in the discovery of single nucleotide polymorphism (SNP) and the development of low to high throughput genotyping platforms. The two mainstream high throughput genotyping platforms are the SNP microarray and genotyping by sequencing (GBS). This paper reviews the NGS in sugarcane genomics, genotyping methodologies, and the choice of these methods. Array-based SNP genotyping is robust, provides consistent SNPs, and relatively easier downstream data analysis. The GBS method identifies large scale SNPs across the germplasm. A combination of targeted GBS and array-based genotyping methods should be used to increase the accuracy of genomic selection and marker-assisted breeding.
Collapse
Affiliation(s)
- Ramaswamy Manimekalai
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Gayathri Suresh
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Hemaprabha Govinda Kurup
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Selvi Athiappan
- Crop Improvement Division, ICAR - Sugarcane Breeding Institute, Indian Council of Agricultural Research (ICAR), Coimbatore, Tamil Nadu, India
| | - Mallikarjuna Kandalam
- Business Development, Asia Pacific Japan region, Thermo Fisher Scientific, Waltham, MA, USA
| |
Collapse
|
41
|
Fikere M, Barbulescu DM, Malmberg MM, Maharjan P, Salisbury PA, Kant S, Panozzo J, Norton S, Spangenberg GC, Cogan NOI, Daetwyler HD. Genomic Prediction and Genetic Correlation of Agronomic, Blackleg Disease, and Seed Quality Traits in Canola ( Brassica napus L.). PLANTS (BASEL, SWITZERLAND) 2020; 9:E719. [PMID: 32517116 PMCID: PMC7356366 DOI: 10.3390/plants9060719] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 05/29/2020] [Accepted: 06/02/2020] [Indexed: 02/02/2023]
Abstract
Genomic selection accelerates genetic progress in crop breeding through the prediction of future phenotypes of selection candidates based on only their genomic information. Here we report genetic correlations and genomic prediction accuracies in 22 agronomic, disease, and seed quality traits measured across multiple years (2015-2017) in replicated trials under rain-fed and irrigated conditions in Victoria, Australia. Two hundred and two spring canola lines were genotyped for 62,082 Single Nucleotide Polymorphisms (SNPs) using transcriptomic genotype-by-sequencing (GBSt). Traits were evaluated in single trait and bivariate genomic best linear unbiased prediction (GBLUP) models and cross-validation. GBLUP were also expanded to include genotype-by-environment G × E interactions. Genomic heritability varied from 0.31to 0.66. Genetic correlations were highly positive within traits across locations and years. Oil content was positively correlated with most agronomic traits. Strong, not previously documented, negative correlations were observed between average internal infection (a measure of blackleg disease) and arachidic and stearic acids. The genetic correlations between fatty acid traits followed the expected patterns based on oil biosynthesis pathways. Genomic prediction accuracy ranged from 0.29 for emergence count to 0.69 for seed yield. The incorporation of G × E translates into improved prediction accuracy by up to 6%. The genomic prediction accuracies achieved indicate that genomic selection is ready for application in canola breeding.
Collapse
Affiliation(s)
- Mulusew Fikere
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia; (M.F.); (M.M.M.); (G.C.S.); (N.O.I.C.)
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
- Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, Brisbane, QLD 4072, Australia
| | - Denise M. Barbulescu
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC 3400, Australia; (D.M.B.); (P.M.); (S.K.); (J.P.); (S.N.)
| | - M. Michelle Malmberg
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia; (M.F.); (M.M.M.); (G.C.S.); (N.O.I.C.)
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
| | - Pankaj Maharjan
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC 3400, Australia; (D.M.B.); (P.M.); (S.K.); (J.P.); (S.N.)
| | - Phillip A. Salisbury
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Surya Kant
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC 3400, Australia; (D.M.B.); (P.M.); (S.K.); (J.P.); (S.N.)
- Centre for Agricultural Innovation, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Joe Panozzo
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC 3400, Australia; (D.M.B.); (P.M.); (S.K.); (J.P.); (S.N.)
- Centre for Agricultural Innovation, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Sally Norton
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC 3400, Australia; (D.M.B.); (P.M.); (S.K.); (J.P.); (S.N.)
| | - German C. Spangenberg
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia; (M.F.); (M.M.M.); (G.C.S.); (N.O.I.C.)
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
| | - Noel O. I. Cogan
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia; (M.F.); (M.M.M.); (G.C.S.); (N.O.I.C.)
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
| | - Hans D. Daetwyler
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC 3086, Australia; (M.F.); (M.M.M.); (G.C.S.); (N.O.I.C.)
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC 3083, Australia;
| |
Collapse
|
42
|
Genomic Analysis Using Bayesian Methods under Different Genotyping Platforms in Korean Duroc Pigs. Animals (Basel) 2020; 10:ani10050752. [PMID: 32344859 PMCID: PMC7277155 DOI: 10.3390/ani10050752] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 04/16/2020] [Accepted: 04/22/2020] [Indexed: 12/03/2022] Open
Abstract
Simple Summary This study investigated the informative regions and the efficiency of genomic predictions for backfat thickness, days to 90 kg body weight, loin muscle area, and lean percentage in Korean Duroc pigs. The several regions of the genome were identified and a significant marker was found near the MC4R gene for growth and production-related traits. No differences in genomic accuracy were identified on the basis of the Bayesian approaches in these four growth and production-related traits. The genomic accuracy is improved by using deregressed estimated breeding values including parental information as a response variable in Korean Duroc pigs. Abstract Genomic evaluation has been widely applied to several species using commercial single nucleotide polymorphism (SNP) genotyping platforms. This study investigated the informative genomic regions and the efficiency of genomic prediction by using two Bayesian approaches (BayesB and BayesC) under two moderate-density SNP genotyping panels in Korean Duroc pigs. Growth and production records of 1026 individuals were genotyped using two medium-density, SNP genotyping platforms: Illumina60K and GeneSeek80K. These platforms consisted of 61,565 and 68,528 SNP markers, respectively. The deregressed estimated breeding values (DEBVs) derived from estimated breeding values (EBVs) and their reliabilities were taken as response variables. Two Bayesian approaches were implemented to perform the genome-wide association study (GWAS) and genomic prediction. Multiple significant regions for days to 90 kg (DAYS), lean muscle area (LMA), and lean percent (PCL) were detected. The most significant SNP marker, located near the MC4R gene, was detected using GeneSeek80K. Accuracy of genomic predictions was higher using the GeneSeek80K SNP panel for DAYS (Δ2%) and LMA (Δ2–3%) with two response variables, with no gains in accuracy by the Bayesian approaches in four growth and production-related traits. Genomic prediction is best derived from DEBVs including parental information as a response variable between two DEBVs regardless of the genotyping platform and the Bayesian method for genomic prediction accuracy in Korean Duroc pig breeding.
Collapse
|
43
|
Groß C, Derks M, Megens HJ, Bosse M, Groenen MAM, Reinders M, de Ridder D. pCADD: SNV prioritisation in Sus scrofa. Genet Sel Evol 2020; 52:4. [PMID: 32033531 PMCID: PMC7006094 DOI: 10.1186/s12711-020-0528-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 01/28/2020] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND In animal breeding, identification of causative genetic variants is of major importance and high economical value. Usually, the number of candidate variants exceeds the number of variants that can be validated. One way of prioritizing probable candidates is by evaluating their potential to have a deleterious effect, e.g. by predicting their consequence. Due to experimental difficulties to evaluate variants that do not cause an amino-acid substitution, other prioritization methods are needed. For human genomes, the prediction of deleterious genomic variants has taken a step forward with the introduction of the combined annotation dependent depletion (CADD) method. In theory, this approach can be applied to any species. Here, we present pCADD (p for pig), a model to score single nucleotide variants (SNVs) in pig genomes. RESULTS To evaluate whether pCADD captures sites with biological meaning, we used transcripts from miRNAs and introns, sequences from genes that are specific for a particular tissue, and the different sites of codons, to test how well pCADD scores differentiate between functional and non-functional elements. Furthermore, we conducted an assessment of examples of non-coding and coding SNVs, which are causal for changes in phenotypes. Our results show that pCADD scores discriminate between functional and non-functional sequences and prioritize functional SNVs, and that pCADD is able to score the different positions in a codon relative to their redundancy. Taken together, these results indicate that based on pCADD scores, regions with biological relevance can be identified and distinguished according to their rate of adaptation. CONCLUSIONS We present the ability of pCADD to prioritize SNVs in the pig genome with respect to their putative deleteriousness, in accordance to the biological significance of the region in which they are located. We created scores for all possible SNVs, coding and non-coding, for all autosomes and the X chromosome of the pig reference sequence Sscrofa11.1, proposing a toolbox to prioritize variants and evaluate sequences to highlight new sites of interest to explain biological functions that are relevant to animal breeding.
Collapse
Affiliation(s)
- Christian Groß
- Delft Bioinformatics Lab, University of Technology Delft, 2600GA, Delft, The Netherlands. .,Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, The Netherlands.
| | - Martijn Derks
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Hendrik-Jan Megens
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Mirte Bosse
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Martien A M Groenen
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Marcel Reinders
- Delft Bioinformatics Lab, University of Technology Delft, 2600GA, Delft, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University & Research, 6708 PB, Wageningen, The Netherlands
| |
Collapse
|
44
|
Hou L, Liang W, Xu G, Huang B, Zhang X, Hu CY, Wang C. Accuracy of genomic prediction using mixed low-density marker panels. ANIMAL PRODUCTION SCIENCE 2020. [DOI: 10.1071/an18503] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Low-density single-nucleotide polymorphism (LD-SNP) panel is one effective way to reduce the cost of genomic selection in animal breeding. The present study proposes a new type of LD-SNP panel called mixed low-density (MLD) panel, which considers SNPs with a substantial effect estimated by Bayes method B (BayesB) from many traits and evenly spaced distribution simultaneously. Simulated and real data were used to compare the imputation accuracy and genomic-selection accuracy of two types of LD-SNP panels. The result of genotyping imputation for simulated data showed that the number of quantitative trait loci (QTL) had limited influence on the imputation accuracy only for MLD panels. Evenly spaced (ELD) panel was not affected by QTL. For real data, ELD performed slightly better than did MLD when panel contained 500 and 1000 SNP. However, this advantage vanished quickly as the density increased. The result of genomic selection for simulated data using BayesB showed that MLD performed much better than did ELD when QTL was 100. For real data, MLD also outperformed ELD in growth and carcass traits when using BayesB. In conclusion, the MLD strategy is superior to ELD in genomic selection under most situations.
Collapse
|
45
|
Ramayo-Caldas Y, Mármol-Sánchez E, Ballester M, Sánchez JP, González-Prendes R, Amills M, Quintanilla R. Integrating genome-wide co-association and gene expression to identify putative regulators and predictors of feed efficiency in pigs. Genet Sel Evol 2019; 51:48. [PMID: 31477014 PMCID: PMC6721172 DOI: 10.1186/s12711-019-0490-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 08/19/2019] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Feed efficiency (FE) has a major impact on the economic sustainability of pig production. We used a systems-based approach that integrates single nucleotide polymorphism (SNP) co-association and gene-expression data to identify candidate genes, biological pathways, and potential predictors of FE in a Duroc pig population. RESULTS We applied an association weight matrix (AWM) approach to analyse the results from genome-wide association studies (GWAS) for nine FE associated and production traits using 31K SNPs by defining residual feed intake (RFI) as the target phenotype. The resulting co-association network was formed by 829 SNPs. Additive effects of this SNP panel explained 61% of the phenotypic variance of RFI, and the resulting phenotype prediction accuracy estimated by cross-validation was 0.65 (vs. 0.20 using pedigree-based best linear unbiased prediction and 0.12 using the 31K SNPs). Sixty-eight transcription factor (TF) genes were identified in the co-association network; based on the lossless approach, the putative main regulators were COPS5, GTF2H5, RUNX1, HDAC4, ESR1, USP16, SMARCA2 and GTF2F2. Furthermore, gene expression data of the gluteus medius muscle was explored through differential expression and multivariate analyses. A list of candidate genes showing functional and/or structural associations with FE was elaborated based on results from both AWM and gene expression analyses, and included the aforementioned TF genes and other ones that have key roles in metabolism, e.g. ESRRG, RXRG, PPARGC1A, TCF7L2, LHX4, MAML2, NFATC3, NFKBIZ, TCEA1, CDCA7L, LZTFL1 or CBFB. The most enriched biological pathways in this list were associated with behaviour, immunity, nervous system, and neurotransmitters, including melatonin, glutamate receptor, and gustation pathways. Finally, an expression GWAS allowed identifying 269 SNPs associated with the candidate genes' expression (eSNPs). Addition of these eSNPs to the AWM panel of 829 SNPs did not improve the accuracy of genomic predictions. CONCLUSIONS Candidate genes that have a direct or indirect effect on FE-related traits belong to various biological processes that are mainly related to immunity, behaviour, energy metabolism, and the nervous system. The pituitary gland, hypothalamus and thyroid axis, and estrogen signalling play fundamental roles in the regulation of FE in pigs. The 829 selected SNPs explained 61% of the phenotypic variance of RFI, which constitutes a promising perspective for applying genetic selection on FE relying on molecular-based prediction.
Collapse
Affiliation(s)
- Yuliaxis Ramayo-Caldas
- 0000 0001 1943 6646grid.8581.4Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
| | - Emilio Mármol-Sánchez
- grid.7080.fDepartment of Animal Genetics, Centre for Research in Agricultural Genomics (CRAG), CSCIC-IRTA-UAB-UB, Campus de LA Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Maria Ballester
- 0000 0001 1943 6646grid.8581.4Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
| | - Juan Pablo Sánchez
- 0000 0001 1943 6646grid.8581.4Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
| | - Rayner González-Prendes
- grid.7080.fDepartment of Animal Genetics, Centre for Research in Agricultural Genomics (CRAG), CSCIC-IRTA-UAB-UB, Campus de LA Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Marcel Amills
- grid.7080.fDepartment of Animal Genetics, Centre for Research in Agricultural Genomics (CRAG), CSCIC-IRTA-UAB-UB, Campus de LA Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
- grid.7080.fDepartament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Raquel Quintanilla
- 0000 0001 1943 6646grid.8581.4Animal Breeding and Genetics Program, Institute for Research and Technology in Food and Agriculture (IRTA), Torre Marimon, 08140 Caldes de Montbui, Spain
| |
Collapse
|
46
|
Pérez-Enciso M, Zingaretti LM. A Guide for Using Deep Learning for Complex Trait Genomic Prediction. Genes (Basel) 2019; 10:E553. [PMID: 31330861 PMCID: PMC6678200 DOI: 10.3390/genes10070553] [Citation(s) in RCA: 70] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 07/06/2019] [Accepted: 07/18/2019] [Indexed: 11/17/2022] Open
Abstract
Deep learning (DL) has emerged as a powerful tool to make accurate predictions from complex data such as image, text, or video. However, its ability to predict phenotypic values from molecular data is less well studied. Here, we describe the theoretical foundations of DL and provide a generic code that can be easily modified to suit specific needs. DL comprises a wide variety of algorithms which depend on numerous hyperparameters. Careful optimization of hyperparameter values is critical to avoid overfitting. Among the DL architectures currently tested in genomic prediction, convolutional neural networks (CNNs) seem more promising than multilayer perceptrons (MLPs). A limitation of DL is in interpreting the results. This may not be relevant for genomic prediction in plant or animal breeding but can be critical when deciding the genetic risk to a disease. Although DL technologies are not "plug-and-play", they are easily implemented using Keras and TensorFlow public software. To illustrate the principles described here, we implemented a Keras-based code in GitHub.
Collapse
Affiliation(s)
- Miguel Pérez-Enciso
- Catalan Institution for Research and Advanced Studies (ICREA), Passeig de Lluís Companys 23, 08010 Barcelona, Spain.
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, 08193 Bellaterra, Barcelona, Spain.
| | - Laura M Zingaretti
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, 08193 Bellaterra, Barcelona, Spain
| |
Collapse
|
47
|
Ye S, Gao N, Zheng R, Chen Z, Teng J, Yuan X, Zhang H, Chen Z, Zhang X, Li J, Zhang Z. Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction. Front Genet 2019; 10:673. [PMID: 31379929 PMCID: PMC6650575 DOI: 10.3389/fgene.2019.00673] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Accepted: 06/27/2019] [Indexed: 11/13/2022] Open
Abstract
Genomic prediction with imputed whole-genome sequencing (WGS) data is an attractive approach to improve predictive ability with low cost. However, high accuracy has not been realized using this method in livestock. In this study, we imputed 435 individuals from 600K single nucleotide polymorphism (SNP) chip data to WGS data using different reference panels. We also investigated the prediction accuracy of genomic best linear unbiased prediction (GBLUP) using imputed WGS data from different reference panels, linkage disequilibrium (LD)-based marker pruning, and pre-selected variants based on Genome-wide association society (GWAS) results. Results showed that the imputation accuracies from 600K to WGS data were 0.873 ± 0.038, 0.906 ± 0.036, and 0.979 ± 0.010 for the internal, external, and combined reference panels, respectively. In most traits of chickens, the prediction accuracy of imputed WGS data obtained from the internal reference panel was greater than or equal to that of the combined reference panel; the external reference panel had the lowest prediction accuracy. Compared with 600K chip data, GBLUP with imputed WGS data had only a small increase (1-3%) in prediction accuracy. Using only variants selected from imputed WGS data based on GWAS results resulted in almost no increase for most traits and even increased the bias of the regression coefficient. The impact of the degree of LD of selected and remaining variants on prediction accuracy was different. For average daily gain (ADG), residual feed intake (RFI), intestine length (IL), and body weight in 91 days (BW91), the accuracy of GBLUP increased as the degree of LD of selected variants decreased, but the opposite relationship occurred for the remaining variants. But for breast muscle weight (BMW) and average daily feed intake (ADFI), the accuracy of GBLUP increased as the degree of LD of selected variants increased, and the degree of LD of remaining variants had a small effect on prediction accuracy. Overall, the optimal imputation strategy to obtain WGS data for genomic prediction should consider the relationship between selected individuals and target population individuals to avoid heterogeneity of imputation. LD-based marker pruning can be used to improve the accuracy of genomic prediction using imputed WGS data.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Ning Gao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Rongrong Zheng
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zitao Chen
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jinyan Teng
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiaolong Yuan
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Hao Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zanmou Chen
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Xiquan Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, China
| |
Collapse
|
48
|
Improvement of genomic prediction by integrating additional single nucleotide polymorphisms selected from imputed whole genome sequencing data. Heredity (Edinb) 2019; 124:37-49. [PMID: 31278370 PMCID: PMC6906477 DOI: 10.1038/s41437-019-0246-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2019] [Revised: 05/11/2019] [Accepted: 06/17/2019] [Indexed: 11/10/2022] Open
Abstract
The availability of whole genome sequencing (WGS) data enables the discovery of causative single nucleotide polymorphisms (SNPs) or SNPs in high linkage disequilibrium with causative SNPs. This study investigated effects of integrating SNPs selected from imputed WGS data into the data of 54K chip on genomic prediction in Danish Jersey. The WGS SNPs, mainly including peaks of quantitative trait loci, structure variants, regulatory regions of genes, and SNPs within genes with strong effects predicted with variant effect predictor, were selected in previous analyses for dairy breeds in Denmark–Finland–Sweden (DFS) and France (FRA). Animals genotyped with 54K chip, standard LD chip, and customized LD chip which covered selected WGS SNPs and SNPs in the standard LD chip, were imputed to 54K together with DFS and FRA SNPs. Genomic best linear unbiased prediction (GBLUP) and Bayesian four-distribution mixture models considering 54K and selected WGS SNPs as one (a one-component model) or two separate genetic components (a two-component model) were used to predict breeding values. For milk production traits and mastitis, both DFS (0.025) and FRA (0.029) sets of additional WGS SNPs improved reliabilities, and inclusions of all selected WGS SNPs generally achieved highest improvements of reliabilities (0.034). A Bayesian four-distribution model yielded higher reliabilities than a GBLUP model for milk and protein, but extra gains in reliabilities from using selected WGS SNPs were smaller for a Bayesian four-distribution model than a GBLUP model. Generally, no significant difference was observed between one-component and two-component models, except for using GBLUP models for milk.
Collapse
|
49
|
Hoff JL, Decker JE, Schnabel RD, Seabury CM, Neibergs HL, Taylor JF. QTL-mapping and genomic prediction for bovine respiratory disease in U.S. Holsteins using sequence imputation and feature selection. BMC Genomics 2019; 20:555. [PMID: 31277567 PMCID: PMC6612181 DOI: 10.1186/s12864-019-5941-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 06/26/2019] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND National genetic evaluations for disease resistance do not exist, precluding the genetic improvement of cattle for these traits. We imputed BovineHD genotypes to whole genome sequence for 2703 Holsteins that were cases or controls for Bovine Respiratory Disease and sampled from either California or New Mexico to construct and compare genomic prediction models. The sequence variation reference dataset comprised variants called for 1578 animals from Run 5 of the 1000 Bull Genomes Project, including 450 Holsteins and 29 animals sequenced from this study population. Genotypes for 9,282,726 variants with minor allele frequencies ≥5% were imputed and used to obtain genomic predictions in GEMMA using a Bayesian Sparse Linear Mixed Model. RESULTS Variation explained by markers increased from 13.6% using BovineHD data to 14.4% using imputed whole genome sequence data and the resolution of genomic regions detected as harbouring QTL substantially increased. Explained variation in the analysis of the combined California and New Mexico data was less than when data for each state were separately analysed and the estimated genetic correlation between risk of Bovine Respiratory Disease in California and New Mexico Holsteins was - 0.36. Consequently, genomic predictions trained using the data from one state did not accurately predict disease risk in the other state. To determine if a prediction model could be developed with utility in both states, we selected variants within genomic regions harbouring: 1) genes involved in the normal immune response to infection by pathogens responsible for Bovine Respiratory Disease detected by RNA-Seq analysis, and/or 2) QTL identified in the association analysis of the imputed sequence variants. The model based on QTL selected variants is biased but when trained in one state generated BRD risk predictions with positive accuracies in the other state. CONCLUSIONS We demonstrate the utility of sequence-based and biology-driven model development for genomic selection. Disease phenotypes cannot be routinely recorded in most livestock species and the observed phenotypes may vary in their genomic architecture due to variation in the pathogen composition across environments. Elucidation of trait biology and genetic architecture may guide the development of prediction models with utility across breeds and environments.
Collapse
Affiliation(s)
- Jesse L Hoff
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA
| | - Jared E Decker
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.,Informatics Institute, University of Missouri, Columbia, MO, 65211, USA
| | - Christopher M Seabury
- Department of Veterinary Pathobiology, Texas A&M University, College Station, TX, 77843, USA
| | - Holly L Neibergs
- Department of Animal Sciences, Washington State University, Pullman, WA, 99163, USA
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO, 65211, USA.
| |
Collapse
|
50
|
Al Kalaldeh M, Gibson J, Duijvesteijn N, Daetwyler HD, MacLeod I, Moghaddar N, Lee SH, van der Werf JHJ. Using imputed whole-genome sequence data to improve the accuracy of genomic prediction for parasite resistance in Australian sheep. Genet Sel Evol 2019; 51:32. [PMID: 31242855 PMCID: PMC6595562 DOI: 10.1186/s12711-019-0476-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 06/18/2019] [Indexed: 01/16/2023] Open
Abstract
Background This study aimed at (1) comparing the accuracies of genomic prediction for parasite resistance in sheep based on whole-genome sequence (WGS) data to those based on 50k and high-density (HD) single nucleotide polymorphism (SNP) panels; (2) investigating whether the use of variants within quantitative trait loci (QTL) regions that were selected from regional heritability mapping (RHM) in an independent dataset improved the accuracy more than variants selected from genome-wide association studies (GWAS); and (3) comparing the prediction accuracies between variants selected from WGS data to variants selected from the HD SNP panel. Results The accuracy of genomic prediction improved marginally from 0.16 ± 0.02 and 0.18 ± 0.01 when using all the variants from 50k and HD genotypes, respectively, to 0.19 ± 0.01 when using all the variants from WGS data. Fitting a GRM from the selected variants alongside a GRM from the 50k SNP genotypes improved the prediction accuracy substantially compared to fitting the 50k SNP genotypes alone. The gain in prediction accuracy was slightly more pronounced when variants were selected from WGS data compared to when variants were selected from the HD panel. When sequence variants that passed the GWAS \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$- log_{10} (p\,value)$$\end{document}-log10(pvalue) threshold of 3 across the entire genome were selected, the prediction accuracy improved by 5% (up to 0.21 ± 0.01), whereas when selection was limited to sequence variants that passed the same GWAS \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$- log_{10} (p\,value)$$\end{document}-log10(pvalue) threshold of 3 in regions identified by RHM, the accuracy improved by 9% (up to 0.25 ± 0.01). Conclusions Our results show that through careful selection of sequence variants from the QTL regions, the accuracy of genomic prediction for parasite resistance in sheep can be improved. These findings have important implications for genomic prediction in sheep.
Collapse
Affiliation(s)
- Mohammad Al Kalaldeh
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| | - John Gibson
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Naomi Duijvesteijn
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Hans D Daetwyler
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Centre for AgriBioscience, Agriculture Victoria, Bundoora, VIC, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3083, Australia
| | - Iona MacLeod
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,Centre for AgriBioscience, Agriculture Victoria, Bundoora, VIC, 3083, Australia
| | - Nasir Moghaddar
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| | - Sang Hong Lee
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, University of South Australia, Adelaide, SA, 5000, Australia
| | - Julius H J van der Werf
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia.,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia
| |
Collapse
|