1
|
Sabadin F, Carvalho HF, Galli G, Fritsche-Neto R. Population-tailored mock genome enables genomic studies in species without a reference genome. Mol Genet Genomics 2021; 297:33-46. [PMID: 34755217 DOI: 10.1007/s00438-021-01831-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 10/28/2021] [Indexed: 11/26/2022]
Abstract
Based on molecular markers, genomic prediction enables us to speed up breeding schemes and increase the response to selection. There are several high-throughput genotyping platforms able to deliver thousands of molecular markers for genomic study purposes. However, even though its widely applied in plant breeding, species without a reference genome cannot fully benefit from genomic tools and modern breeding schemes. We used a method to assemble a population-tailored mock genome to call single-nucleotide polymorphism (SNP) markers without an available reference genome, and for the first time, we compared the results with standard genotyping platforms (array and genotyping-by-sequencing (GBS) using a reference genome) for performance in genomic prediction models. Our results indicate that using a population-tailored mock genome to call SNP delivers reliable estimates for the genomic relationship between genotypes. Furthermore, genomic prediction estimates were comparable to standard approaches, especially when considering only additive effects. However, mock genomes were slightly worse than arrays at predicting traits influenced by dominance effects, but still performed as well as standard GBS methods that use a reference genome. Nevertheless, the array-based SNP markers methods achieved the best predictive ability and reliability to estimate variance components. Overall, the mock genomes can be a worthy alternative for genomic selection studies, especially for those species where the reference genome is not available.
Collapse
Affiliation(s)
- Felipe Sabadin
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil.
| | - Humberto Fanelli Carvalho
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| | - Giovanni Galli
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| | - Roberto Fritsche-Neto
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| |
Collapse
|
2
|
Yoshida GM, Yáñez JM. Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data. Evol Appl 2021; 15:537-552. [PMID: 35505881 PMCID: PMC9046923 DOI: 10.1111/eva.13240] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 04/01/2021] [Accepted: 04/03/2021] [Indexed: 02/07/2023] Open
Abstract
Through imputation of genotypes, genome‐wide association study (GWAS) and genomic prediction (GP) using whole‐genome sequencing (WGS) data are cost‐efficient and feasible in aquaculture breeding schemes. The objective was to dissect the genetic architecture of growth traits under chronic heat stress in rainbow trout (Oncorhynchus mykiss) and to assess the accuracy of GP based on imputed WGS and different preselected single nucleotide polymorphism (SNP) arrays. A total of 192 and 764 fish challenged to a heat stress experiment for 62 days were genotyped using a customized 1 K and 26 K SNP panels, respectively, and then, genotype imputation was performed from a low‐density chip to WGS using 102 parents (36 males and 66 females) as the reference population. Imputed WGS data were used to perform GWAS and test GP accuracy under different preselected SNP scenarios. Heritability was estimated for body weight (BW), body length (BL) and average daily gain (ADG). Estimates using imputed WGS data ranged from 0.33 ± 0.05 to 0.55 ± 0.05 for growth traits under chronic heat stress. GWAS revealed that the top five cumulatively SNPs explained a maximum of 0.94%, 0.86% and 0.51% of genetic variance for BW, BL and ADG, respectively. Some important functional candidate genes associated with growth‐related traits were found among the most important SNPs, including signal transducer and activator of transcription 5B and 3 (STAT5B and STAT3, respectively) and cytokine‐inducible SH2‐containing protein (CISH). WGS data resulted in a slight increase in prediction accuracy compared with pedigree‐based method, whereas preselected SNPs based on the top GWAS hits improved prediction accuracies, with values ranging from 1.2 to 13.3%. Our results support the evidence of the polygenic nature of growth traits when measured under heat stress. The accuracies of GP can be improved using preselected variants from GWAS, and the use of WGS marginally increases prediction accuracy.
Collapse
Affiliation(s)
- Grazyella M. Yoshida
- Facultad de Ciencias Veterinarias y Pecuarias Universidad de Chile Santiago Chile
| | - José M. Yáñez
- Facultad de Ciencias Veterinarias y Pecuarias Universidad de Chile Santiago Chile
- Núcleo Milenio INVASAL Concepción Chile
| |
Collapse
|
3
|
Liu K, Cao H, Dong X, Liu H, Wen Y, Mao H, Lu L, Yin Z. Polymorphisms of pro-opiomelanocortin gene and the association with reproduction traits in chickens. Anim Reprod Sci 2019; 210:106196. [PMID: 31635770 DOI: 10.1016/j.anireprosci.2019.106196] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Revised: 09/17/2019] [Accepted: 09/24/2019] [Indexed: 12/16/2022]
Abstract
Pro-opiomelanocortin (POMC) is a member of prohormone family and has important functions in stress response, skin pigmentation, thermoregulation and reproduction. In this study, the single nucleotide polymorphisms (SNPs) of POMC gene exons were detected by direct sequencing in 317 Zhenning yellow chickens. The sequencing results indicated there were seven mutation sites (g.1140C > T, g.1185 T > C, g.2085 T > C, g.3566A > C, g.3572 G > A, g.3594 G > A and g.3628 G > A) and all of these were synonymous. Furthermore, seven haplotypes were formed and sixteen diplotypes were obtained. The associations between the POMC gene polymorphisms or diplotypes and reproduction traits were also analyzed. The association analysis results indicated that the SNP of g.1140C > T was associated with egg production at 300 d of age (E300), fertilization rate (FR), hatching rate of hatching eggs (HEHR) and hatching rate of fertilized eggs (FEHR; P < 0.05). The SNP of g.3566A>C was associated with FR (P < 0.05), SNP of g.3594G>A was associated with egg weight at 300d of age (EW300; P < 0.05), and SNP of g.3628G>A was associated with HEHR and FEHR (P < 0.01), respectively. Furthermore, chickens with H2H3 diplotype had greater EW300 and FR than those with H1H7 and H3H4 diplotypes (P < 0.05). These results indicate the expression of the POMC gene had significant genotype effects on the reproduction traits of Zhenning yellow chickens, and that the H2H3 diplotype could be used as a potential genetic marker to improve the reproduction traits in chicken breeding.
Collapse
Affiliation(s)
- Ke Liu
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Haiyue Cao
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Xinyang Dong
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Honghua Liu
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Yaya Wen
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Haiguang Mao
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China
| | - Lei Lu
- Ningbo Zhenning Animal Husbandry Co. Ltd, Ningbo 315000, China
| | - Zhaozheng Yin
- College of Animal Sciences, Zhejiang University, Zijingang Campus, Hangzhou 310058, China.
| |
Collapse
|
4
|
Bresolin T, Rosa GJDM, Valente BD, Espigolan R, Gordo DGM, Braz CU, Fernandes Júnior GA, Magalhães AFB, Garcia DA, Frezarim GB, Leão GFC, Carvalheiro R, Baldi F, Nunes de Oliveira H, Galvão de Albuquerque L. Effect of quality control, density and allele frequency of markers on the accuracy of genomic prediction for complex traits in Nellore cattle. ANIMAL PRODUCTION SCIENCE 2019. [DOI: 10.1071/an16821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
This study was designed to test the impact of quality control, density and allele frequency of single nucleotide polymorphisms (SNP) markers on the accuracy of genomic predictions, using three traits with different heritabilities and two methods of prediction in a Nellore cattle population genotyped with the Illumina Bovine HD Assay. A total of 1756; 3150 and 3119 records of age at first calving (AFC); weaning weight (WW) and yearling weight (YW), respectively, were used. Three scenarios with different exclusion thresholds for minor allele frequency (MAF), deviation from Hardy–Weinberg equilibrium (HWE) and correlation between SNP pairs (r2) were constructed for all traits: (1) high rigor (S1): call rate <0.98, MAF <0.05, HWE with P <10−5, and r2 >0.999; (2) Moderate rigor (S2): call rate <0.85 and MAF <0.01; (3) Low rigor (S3): only non-autosomal SNP and those mapped on the same position were excluded. Additionally, to assess the prediction accuracy from different markers density, six panels (10K, 50K, 100K, 300K, 500K and 700K) were customised using the high-density genotyping assay as reference. Finally, from the markers available in high-density genotyping assay, six groups (G) with different minor allele frequency bins were defined to estimate the accuracy of genomic prediction. The range of MAF bins was approximately equal for the traits studied: G1 (0.000–0.009), G2 (0.010–0.064), G3 (0.065–0.174), G4 (0.175–0.325), G5 (0.326–0.500) and G6 (0.000–0.500). The Genomic Best Linear Unbiased Predictor and BayesCπ methods were used to estimate the SNP marker effects. Five-fold cross-validation was used to measure the accuracy of genomic prediction for all scenarios. There were no effects of genotypes quality control criteria on the accuracies of genomic predictions. For all traits, the higher density panel did not provide greater prediction accuracies than the low density one (10K panel). The groups of SNP with low MAF (MAF ≤0.007 for AFC, MAF ≤0.009 for WW and MAF ≤0.008 for YW) provided lower prediction accuracies than the groups with higher allele frequencies.
Collapse
|
5
|
Abdollahi-Arpanahi R, Morota G, Peñagaricano F. Predicting bull fertility using genomic data and biological information. J Dairy Sci 2017; 100:9656-9666. [PMID: 28987577 DOI: 10.3168/jds.2017-13288] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2017] [Accepted: 09/13/2017] [Indexed: 01/04/2023]
Abstract
The genomic prediction of unobserved genetic values or future phenotypes for complex traits has revolutionized agriculture and human medicine. Fertility traits are undoubtedly complex traits of great economic importance to the dairy industry. Although genomic prediction for improved cow fertility has received much attention, bull fertility largely has been ignored. The first aim of this study was to investigate the feasibility of genomic prediction of sire conception rate (SCR) in US Holstein dairy cattle. Standard genomic prediction often ignores any available information about functional features of the genome, although it is believed that such information can yield more accurate and more persistent predictions. Hence, the second objective was to incorporate prior biological information into predictive models and evaluate their performance. The analyses included the use of kernel-based models fitting either all single nucleotide polymorphisms (SNP; 55K) or only markers with presumed functional roles, such as SNP linked to Gene Ontology or Medical Subject Heading terms related to male fertility, or SNP significantly associated with SCR. Both single- and multikernel models were evaluated using linear and Gaussian kernels. Predictive ability was evaluated in 5-fold cross-validation. The entire set of SNP exhibited predictive correlations around 0.35. Neither Gene Ontology nor Medical Subject Heading gene sets achieved predictive abilities higher than their counterparts using random sets of SNP. Notably, kernel models fitting significant SNP achieved the best performance with increases in accuracy up to 5% compared with the standard whole-genome approach. Models fitting Gaussian kernels outperformed their counterparts fitting linear kernels irrespective of the set of SNP. Overall, our findings suggest that genomic prediction of bull fertility is feasible in dairy cattle. This provides potential for accurate genome-guided decisions, such as early culling of bull calves with low SCR predictions. In addition, exploiting nonlinear effects through the use of Gaussian kernels together with the incorporation of relevant markers seems to be a promising alternative to the standard approach. The inclusion of gene set results into prediction models deserves further research.
Collapse
Affiliation(s)
- Rostam Abdollahi-Arpanahi
- Department of Animal Sciences, University of Florida, Gainesville 32611; Department of Animal and Poultry Science, University of Tehran, Pakdasht, Iran 3391653755
| | - Gota Morota
- Department of Animal Science, University of Nebraska, Lincoln 68583
| | - Francisco Peñagaricano
- Department of Animal Sciences, University of Florida, Gainesville 32611; University of Florida Genetics Institute, Gainesville 32611.
| |
Collapse
|
6
|
Zhang Z, Xu ZQ, Luo YY, Zhang HB, Gao N, He JL, Ji CL, Zhang DX, Li JQ, Zhang XQ. Whole genomic prediction of growth and carcass traits in a Chinese quality chicken population. J Anim Sci 2017; 95:72-80. [PMID: 28177394 DOI: 10.2527/jas.2016.0823] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
By incorporating high-density markers into breeding value prediction models, the whole genomic prediction (WGP) method can effectively accelerate genetic improvement in livestock breeding. However, the performance of WGP varies across species and populations and is affected by the underlying genetic architecture. In particular, very little is known about the performance of WGP for many chicken breeds. Here we estimate the genetic parameters and evaluate the performance of WGP for 18 growth and carcass traits in a Chinese quality chicken population. In total, 435 chickens were systematically phenotyped and genotyped using a 600K genotyping array. Two variance component estimation scenarios, 3 breeding value prediction methods, and 2 validation procedures were compared. The results showed that the heritability of these 18 traits was medium to high (ranging from 0.28 to 0.60) and that deviations existed between the heritability estimated from pedigrees and markers. Compared with conventional breeding methods, WGP could potentially increase the selection accuracy by 20% or more depending on the prediction model used, the trait under consideration, and the genetic connectedness between the training and validation individuals. Our results showed the potential of implementing genomic selection in small breeding herds.
Collapse
|
7
|
Optimizing Training Population Data and Validation of Genomic Selection for Economic Traits in Soft Winter Wheat. G3-GENES GENOMES GENETICS 2016; 6:2919-28. [PMID: 27440921 PMCID: PMC5015948 DOI: 10.1534/g3.116.032532] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Genomic selection (GS) is a breeding tool that estimates breeding values (GEBVs) of individuals based solely on marker data by using a model built using phenotypic and marker data from a training population (TP). The effectiveness of GS increases as the correlation of GEBVs and phenotypes (accuracy) increases. Using phenotypic and genotypic data from a TP of 470 soft winter wheat lines, we assessed the accuracy of GS for grain yield, Fusarium Head Blight (FHB) resistance, softness equivalence (SE), and flour yield (FY). Four TP data sampling schemes were tested: (1) use all TP data, (2) use subsets of TP lines with low genotype-by-environment interaction, (3) use subsets of markers significantly associated with quantitative trait loci (QTL), and (4) a combination of 2 and 3. We also correlated the phenotypes of relatives of the TP to their GEBVs calculated from TP data. The GS accuracy within the TP using all TP data ranged from 0.35 (FHB) to 0.62 (FY). On average, the accuracy of GS from using subsets of data increased by 54% relative to using all TP data. Using subsets of markers selected for significant association with the target trait had the greatest impact on GS accuracy. Between-environment prediction accuracy was also increased by using data subsets. The accuracy of GS when predicting the phenotypes of TP relatives ranged from 0.00 to 0.85. These results suggest that GS could be useful for these traits and GS accuracy can be greatly improved by using subsets of TP data.
Collapse
|
8
|
Uemoto Y, Sasaki S, Kojima T, Sugimoto Y, Watanabe T. Impact of QTL minor allele frequency on genomic evaluation using real genotype data and simulated phenotypes in Japanese Black cattle. BMC Genet 2015; 16:134. [PMID: 26586567 PMCID: PMC4653875 DOI: 10.1186/s12863-015-0287-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 10/27/2015] [Indexed: 11/14/2022] Open
Abstract
Background Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on different minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data. Methods In total, 1368 Japanese Black cattle and 592,034 SNPs (Illumina BovineHD BeadChip) were used. We simulated phenotypes using real genotypes under different scenarios, varying the MAF categories, QTL heritability, number of QTLs, and distribution of QTL effect. After generating true breeding values and phenotypes, QTL heritability was estimated and the prediction accuracy of genomic estimated breeding value (GEBV) was assessed under different SNP densities, prediction models, and population size by a reference-test validation design. Results The extent of LD between SNPs and QTLs in this population was higher in the QTLs with high MAF than in those with low MAF. The effect of MAF of QTLs depended on the genetic architecture, evaluation strategy, and population size in genomic evaluation. In genetic architecture, genomic evaluation was affected by the MAF of QTLs combined with the QTL heritability and the distribution of QTL effect. The number of QTL was not affected on genomic evaluation if the number of QTL was more than 50. In the evaluation strategy, we showed that different SNP densities and prediction models affect the heritability estimation and genomic prediction and that this depends on the MAF of QTLs. In addition, accurate QTL heritability and GEBV were obtained using denser SNP information and the prediction model accounted for the SNPs with low and high MAFs. In population size, a large sample size is needed to increase the accuracy of GEBV. Conclusion The MAF of QTL had an impact on heritability estimation and prediction accuracy. Most genetic variance can be captured using denser SNPs and the prediction model accounted for MAF, but a large sample size is needed to increase the accuracy of GEBV under all QTL MAF categories. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0287-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yoshinobu Uemoto
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Shinji Sasaki
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Takatoshi Kojima
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| | - Yoshikazu Sugimoto
- Shirakawa Institute of Animal Genetics, Japan Livestock Technology Association, Nishigo, Fukushima, 961-8511, Japan.
| | - Toshio Watanabe
- National Livestock Breeding Center, Nishigo, Fukushima, 961-8511, Japan.
| |
Collapse
|