1
|
Genotyping, the Usefulness of Imputation to Increase SNP Density, and Imputation Methods and Tools. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:113-138. [PMID: 35451774 DOI: 10.1007/978-1-0716-2205-6_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Imputation has become a standard practice in modern genetic research to increase genome coverage and improve accuracy of genomic selection and genome-wide association study as a large number of samples can be genotyped at lower density (and lower cost) and, imputed up to denser marker panels or to sequence level, using information from a limited reference population. Most genotype imputation algorithms use information from relatives and population linkage disequilibrium. A number of software for imputation have been developed originally for human genetics and, more recently, for animal and plant genetics considering pedigree information and very sparse SNP arrays or genotyping-by-sequencing data. In comparison to human populations, the population structures in farmed species and their limited effective sizes allow to accurately impute high-density genotypes or sequences from very low-density SNP panels and a limited set of reference individuals. Whatever the imputation method, the imputation accuracy, measured by the correct imputation rate or the correlation between true and imputed genotypes, increased with the increasing relatedness of the individual to be imputed with its denser genotyped ancestors and as its own genotype density increased. Increasing the imputation accuracy pushes up the genomic selection accuracy whatever the genomic evaluation method. Given the marker densities, the most important factors affecting imputation accuracy are clearly the size of the reference population and the relationship between individuals in the reference and target populations.
Collapse
|
2
|
Lee D, Kim Y, Chung Y, Lee D, Seo D, Choi TJ, Lim D, Yoon D, Lee SH. Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle. JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY 2021; 63:1232-1246. [PMID: 34957440 PMCID: PMC8672260 DOI: 10.5187/jast.2021.e117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/20/2022]
Abstract
Recently, the cattle genome sequence has been completed, followed by developing a
commercial single nucleotide polymorphism (SNP) chip panel in the animal genome
industry. In order to increase statistical power for detecting quantitative
trait locus (QTL), a number of animals should be genotyped. However, a
high-density chip for many animals would be increasing the genotyping cost.
Therefore, statistical inference of genotype imputation (low-density chip to
high-density) will be useful in the animal industry. The purpose of this study
is to investigate the effect of the reference population size and marker density
on the imputation accuracy and to suggest the appropriate number of reference
population sets for the imputation in Hanwoo cattle. A total of 3,821 Hanwoo
cattle were divided into reference and validation populations. The reference
sets consisted of 50k (38,916) marker data and different population sizes (500,
1,000, 1,500, 2,000, and 3,600). The validation sets consisted of four
validation sets (Total 889) and the different marker density (5k [5,000], 10k
[10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct
comparison of the true genotype and the imputed genotype. In conclusion, when
the lowest marker density (5k) was used in the validation set, according to the
reference population size, the imputation accuracy was 0.793 to 0.929. On the
other hand, when the highest marker density (15k), according to the reference
population size, the imputation accuracy was 0.904 to 0.967. Moreover, the
reference population size should be more than 1,000 to obtain at least 88%
imputation accuracy in Hanwoo cattle.
Collapse
Affiliation(s)
- DooHo Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Yeongkuk Kim
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Yoonji Chung
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Dongjae Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Dongwon Seo
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| | - Tae Jeong Choi
- National Institute of Animal Science, Cheonan 31000, Korea
| | - Dajeong Lim
- Animal Genomics and Bioinformatics Division, National Institute of Animal Science, Wanju 55365, Korea
| | - Duhak Yoon
- Department of Animal Science & Biotechnology, Kyungpook National University, Sangju 37224, Korea
| | - Seung Hwan Lee
- Division of Animal and Dairy Science, Chungnam National University, Daejeon 34134, Korea
| |
Collapse
|
3
|
Cappetta E, Andolfo G, Guadagno A, Di Matteo A, Barone A, Frusciante L, Ercolano MR. Tomato genomic prediction for good performance under high-temperature and identification of loci involved in thermotolerance response. HORTICULTURE RESEARCH 2021; 8:212. [PMID: 34593775 PMCID: PMC8484564 DOI: 10.1038/s41438-021-00647-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Revised: 07/05/2021] [Accepted: 07/14/2021] [Indexed: 06/13/2023]
Abstract
Many studies showed that few degrees above tomato optimum growth temperature threshold can lead to serious loss in production. Therefore, the development of innovative strategies to obtain tomato cultivars with improved yield under high temperature conditions is a main goal both for basic genetic studies and breeding activities. In this paper, a F4 segregating population was phenotypically evaluated for quantitative and qualitative traits under heat stress conditions. Moreover, a genotyping by sequencing (GBS) approach has been employed for building up genomic selection (GS) models both for yield and soluble solid content (SCC). Several parameters, including training population size, composition and marker quality were tested to predict genotype performance under heat stress conditions. A good prediction accuracy for the two analyzed traits (0.729 for yield production and 0.715 for SCC) was obtained. The predicted models improved the genetic gain of selection in the next breeding cycles, suggesting that GS approach is a promising strategy to accelerate breeding for heat tolerance in tomato. Finally, the annotation of SNPs located in gene body regions combined with QTL analysis allowed the identification of five candidates putatively involved in high temperatures response, and the building up of a GS model based on calibrated panel of SNP markers.
Collapse
Affiliation(s)
- Elisa Cappetta
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
- Institute of Bioscience and BioResources, National Research Council, Via Università 100, 80055, Portici, Italy
| | - Giuseppe Andolfo
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
| | - Anna Guadagno
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
| | - Antonio Di Matteo
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
| | - Amalia Barone
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
| | - Luigi Frusciante
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy
| | - Maria Raffaella Ercolano
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055, Portici, Naples, Italy.
| |
Collapse
|
4
|
Impact of Marker Pruning Strategies Based on Different Measurements of Marker Distance on Genomic Prediction in Dairy Cattle. Animals (Basel) 2021; 11:ani11071992. [PMID: 34359120 PMCID: PMC8300388 DOI: 10.3390/ani11071992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/27/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022] Open
Abstract
Simple Summary The usefulness of genomic prediction (GP) has been widely proofed by breeding analysis in livestock, plants and aquatic populations. It is well known that ‘marker density’ is a critical factor that affects the accuracy of GP, however, how to properly measure ‘marker density’ in GP is yet to be determined. With population-level whole-genome sequence data or high-density single nucleotide polymorphism (SNP) data available, this question seems to be answered more convincingly. In this study, we investigated and discussed the impact of four ‘marker density’ measures that reflect genetic or physical distances between SNPs on the accuracy of GP in a Germany Holstein dairy cattle population. Our results showed that the degree of variation of physical distance between adjacent SNPs had significant effects on the accuracy of GP, while the genetic distance between SNPs had no relationship with the accuracy of GP. Therefore, for studies based on high-density SNP data, the default strategy of pruning SNPs based on genetic distance is detrimental to heritability estimation and genomic prediction. The results extended the communities knowledge of ‘marker density’ and provided useful suggestions for the application and research on genome prediction. Abstract With the availability of high-density single-nucleotide polymorphism (SNP) data and the development of genotype imputation methods, high-density panel-based genomic prediction (GP) has become possible in livestock breeding. It is generally considered that the genomic estimated breeding value (GEBV) accuracy increases with the marker density, while studies have shown that the GEBV accuracy does not increase or even decrease when high-density panels were used. Therefore, in addition to the SNP number, other measurements of ‘marker density’ seem to have impacts on the GEBV accuracy, and exploring the relationship between the GEBV accuracy and the measurements of ‘marker density’ based on high-density SNP or whole-genome sequence data is important for the field of GP. In this study, we constructed different SNP panels with certain SNP numbers (e.g., 1 k) by using the physical distance (PhyD), genetic distance (GenD) and random distance (RanD) between SNPs respectively based on the high-density SNP data of a Germany Holstein dairy cattle population. Therefore, there are three different panels at a certain SNP number level. These panels were used to construct GP models to predict fat percentage, milk yield and somatic cell score. Meanwhile, the mean (d¯) and variance (σd2) of the physical distance between SNPs and the mean (r2¯) and variance (σr22) of the genetic distance between SNPs in each panel were used as marker density-related measurements and their influence on the GEBV accuracy was investigated. At the same SNP number level, the d¯ of all panels is basically the same, but the σd2, r2¯ and σr22 are different. Therefore, we only investigated the effects of σd2, r2¯ and σr22 on the GEBV accuracy. The results showed that at a certain SNP number level, the GEBV accuracy was negatively correlated with σd2, but not with r2¯ and σr22. Compared with GenD and RanD, the σd2 of panels constructed by PhyD is smaller. The low and moderate-density panels (< 50 k) constructed by RanD or GenD have large σd2, which is not conducive to genomic prediction. The GEBV accuracy of the low and moderate-density panels constructed by PhyD is 3.8~34.8% higher than that of the low and moderate-density panels constructed by RanD and GenD. Panels with 20–30 k SNPs constructed by PhyD can achieve the same or slightly higher GEBV accuracy than that of high-density SNP panels for all three traits. In summary, the smaller the variation degree of physical distance between adjacent SNPs, the higher the GEBV accuracy. The low and moderate-density panels construct by physical distance are beneficial to genomic prediction, while pruning high-density SNP data based on genetic distance is detrimental to genomic prediction. The results provide suggestions for the development of SNP panels and the research of genome prediction based on whole-genome sequence data.
Collapse
|
5
|
Validation of the Prediction Accuracy for 13 Traits in Chinese Simmental Beef Cattle Using a Preselected Low-Density SNP Panel. Animals (Basel) 2021; 11:ani11071890. [PMID: 34202066 PMCID: PMC8300368 DOI: 10.3390/ani11071890] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 11/17/2022] Open
Abstract
Simple Summary To reduce the breeding costs and promote the application of genomic selection (GS) in Chinese Simmental beef cattle, we developed a customized low-density single-nucleotide polymorphism (SNP) panel consisting of 30,684 SNPs. When comparing the predictive performance of the low-density SNP panel to that of the BovineHD Beadchip for 13 traits, we found that this ~30 K panel achieved moderate to high prediction accuracies for most traits, while reducing the prediction accuracies of six traits by 0.04–0.09 and decreasing the prediction accuracy of one trait by 0.2. For the remaining six traits, the usage of the low-density SNP panel was associated with a slight increase in prediction accuracy. Our studies suggested that the low-density SNP panel (~30 K) is a feasible and promising tool for cost-effective genomic prediction in Chinese Simmental beef cattle, which may provide breeding organizations with a cheaper option and greater returns on investment. Abstract Chinese Simmental beef cattle play a key role in the Chinese beef industry due to their great adaptability and marketability. To achieve efficient genetic gain at a low breeding cost, it is crucial to develop a customized cost-effective low-density SNP panel for this cattle population. Thirteen growth, carcass, and meat quality traits and a BovineHD Beadchip genotyping of 1346 individuals were used to select trait-associated variants and variants contributing to great genetic variance. In addition, highly informative SNPs with high MAF in each 500 kb sliding window and in each genic region were also included separately. A low-density SNP panel consisting of 30,684 SNPs was developed, with an imputation accuracy of 97.4% when imputed to the 770 K level. Among 13 traits, the average prediction accuracy levels evaluated by genomic best linear unbiased prediction (GBLUP) and BayesA/B/Cπ were 0.22–0.47 and 0.18–0.60 for the ~30 K array and BovineHD Beadchip, respectively. Generally, the predictive performance of the ~30 K array was trait-dependent, with reduced prediction accuracies for seven traits. While differences in terms of prediction accuracy were observed among the 13 traits, the low-density SNP panel achieved moderate to high accuracies for most of the traits and even improved the accuracies for some traits.
Collapse
|
6
|
Joshi R, Skaarud A, Alvarez AT, Moen T, Ødegård J. Bayesian genomic models boost prediction accuracy for survival to Streptococcus agalactiae infection in Nile tilapia (Oreochromus nilioticus). Genet Sel Evol 2021; 53:37. [PMID: 33882834 PMCID: PMC8058985 DOI: 10.1186/s12711-021-00629-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 04/06/2021] [Indexed: 11/10/2022] Open
Abstract
Background Streptococcosis is a major bacterial disease in Nile tilapia that is caused by Streptococcus agalactiae infection, and development of resistant strains of Nile tilapia represents a sustainable approach towards combating this disease. In this study, we performed a controlled disease trial on 120 full-sib families to (i) quantify and characterize the potential of genomic selection for survival to S. agalactiae infection in Nile tilapia, and (ii) identify the best genomic model and the optimal density of single nucleotide polymorphisms (SNPs) for this trait. Methods In total, 40 fish per family (15 fish intraperitoneally injected and 25 fish as cohabitants) were used in the challenge test. Mortalities were recorded every 3 h for 35 days. After quality control, genotypes (50,690 SNPs) and phenotypes (0 for dead and 1 for alive) for 2472 cohabitant fish were available. Genetic parameters were obtained using various genomic selection models (genomic best linear unbiased prediction (GBLUP), BayesB, BayesC, BayesR and BayesS) and a traditional pedigree-based model (PBLUP). The pedigree-based analysis used a deep 17-generation pedigree. Prediction accuracy and bias were evaluated using five replicates of tenfold cross-validation. The genomic models were further analyzed using 10 subsets of SNPs at different densities to explore the effect of pruning and SNP density on predictive accuracy. Results Moderate estimates of heritabilities ranging from 0.15 ± 0.03 to 0.26 ± 0.05 were obtained with the different models. Compared to a pedigree-based model, GBLUP (using all the SNPs) increased prediction accuracy by 15.4%. Furthermore, use of the most appropriate Bayesian genomic selection model and SNP density increased the prediction accuracy up to 71%. The 40 to 50 SNPs with non-zero effects were consistent for all BayesB, BayesC and BayesS models with respect to marker id and/or marker locations. Conclusions These results demonstrate the potential of genomic selection for survival to S. agalactiae infection in Nile tilapia. Compared to the PBLUP and GBLUP models, Bayesian genomic models were found to boost the prediction accuracy significantly. Supplementary Information The online version contains supplementary material available at 10.1186/s12711-021-00629-y.
Collapse
Affiliation(s)
- Rajesh Joshi
- GenoMar Genetics AS, Tjuvholmen allé 11, 0252, Oslo, Norway.
| | - Anders Skaarud
- GenoMar Genetics AS, Tjuvholmen allé 11, 0252, Oslo, Norway
| | | | - Thomas Moen
- AquaGen AS, Sluppen, P.O. Box 1240, 7462, Trondheim, Norway
| | - Jørgen Ødegård
- AquaGen AS, Sluppen, P.O. Box 1240, 7462, Trondheim, Norway
| |
Collapse
|
7
|
Linkage disequilibrium vs. pedigree: Genomic selection prediction accuracy in conifer species. PLoS One 2020; 15:e0232201. [PMID: 32520936 PMCID: PMC7286500 DOI: 10.1371/journal.pone.0232201] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 04/08/2020] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The presupposition of genomic selection (GS) is that predictive accuracies should be based on population-wide linkage disequilibrium (LD). However, in species with large, highly complex genomes the limitation of marker density may preclude the ability to resolve LD accurately enough for GS. Here we investigate such an effect in two conifer species with ~ 20 Gbp genomes, Douglas-fir (Pseudotsuga menziesii Mirb. (Franco)) and Interior spruce (Picea glauca (Moench) Voss x Picea engelmannii Parry ex Engelm.). Random sampling of markers was performed to obtain SNP sets with totals in the range of 200-50,000, this was replicated 10 times. Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) was deployed as the GS method to test these SNP sets, and 10-fold cross-validation was performed on 1,321 Douglas-fir trees, representing 37 full-sib F1 families and on 1,126 Interior spruce trees, representing 25 open-pollinated (half-sib) families. Both trials are located on 3 sites in British Columbia, Canada. RESULTS As marker number increased, so did GS predictive accuracy for both conifer species. However, a plateau in the gain of accuracy became apparent around 10,000-15,000 markers for both Douglas-fir and Interior spruce. Despite random marker selection, little variation in predictive accuracy was observed across replications. On average, Douglas-fir prediction accuracies were higher than those of Interior spruce, reflecting the difference between full- and half-sib families for Douglas-fir and Interior spruce populations, respectively, as well as their respective effective population size. CONCLUSIONS Although possibly advantageous within an advanced breeding population, reducing marker density cannot be recommended for carrying out GS in conifers. Significant LD between markers and putative causal variants was not detected using 50,000 SNPS, and GS was enabled only through the tracking of relatedness in the populations studied. Dramatically increasing marker density would enable said markers to better track LD with causal variants in these large, genetically diverse genomes; as well as providing a model that could be used across populations, breeding programs, and traits.
Collapse
|
8
|
Xu L, Gao N, Wang Z, Xu L, Liu Y, Chen Y, Xu L, Gao X, Zhang L, Gao H, Zhu B, Li J. Incorporating Genome Annotation Into Genomic Prediction for Carcass Traits in Chinese Simmental Beef Cattle. Front Genet 2020; 11:481. [PMID: 32499816 PMCID: PMC7243208 DOI: 10.3389/fgene.2020.00481] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 04/17/2020] [Indexed: 01/08/2023] Open
Abstract
Various methods have been proposed for genomic prediction (GP) in livestock. These methods have mainly focused on statistical considerations and did not include genome annotation information. In this study, to improve the predictive performance of carcass traits in Chinese Simmental beef cattle, we incorporated the genome annotation information into GP. Single nucleotide polymorphisms (SNPs) were annotated to five genomic classes: intergenic, gene, exon, protein coding sequences, and 3'/5' untranslated region. Haploblocks were constructed for all markers and these five genomic classes by defining a biologically functional unit, and haplotype effects were modeled in both numerical dosage and categorical coding strategies. The first-order epistatic effects among SNPs and haplotypes were modeled using a categorical epistasis model. For all makers, the extension from the SNP-based model to a haplotype-based model improved the accuracy by 5.4-9.8% for carcass weight (CW), live weight (LW), and striploin (SI). For the five genomic classes using the haplotype-based prediction model, the incorporation of gene class information into the model improved the accuracies by an average of 1.4, 2.1, and 1.3% for CW, LW, and SI, respectively, compared with their corresponding results for all markers. Including the first-order epistatic effects into the prediction models improved the accuracies in some traits and genomic classes. Therefore, for traits with moderate-to-high heritability, incorporating genome annotation information of gene class into haplotype-based prediction models could be considered as a promising tool for GP in Chinese Simmental beef cattle, and modeling epistasis in prediction can further increase the accuracy to some degree.
Collapse
Affiliation(s)
- Ling Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ning Gao
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Zezhao Wang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lei Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ying Liu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yan Chen
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lingyang Xu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xue Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lupei Zhang
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Huijiang Gao
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Bo Zhu
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| | - Junya Li
- Laboratory of Molecular Biology and Bovine Breeding, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Centre of Beef Cattle Genetic Evaluation, Beijing, China
| |
Collapse
|
9
|
Interest of using imputation for genomic evaluation in layer chicken. Poult Sci 2020; 99:2324-2336. [PMID: 32359567 PMCID: PMC7597443 DOI: 10.1016/j.psj.2020.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 12/27/2019] [Accepted: 01/01/2020] [Indexed: 11/21/2022] Open
Abstract
With the availability of the 600K Affymetrix Axiom high-density (HD) single nucleotide polymorphism (SNP) chip, genomic selection has been implemented in broiler and layer chicken. However, the cost of this SNP chip is too high to genotype all selection candidates. A solution is to develop a low-density SNP chip, at a lower price, and to impute all missing markers. But to routinely implement this solution, the impact of imputation on genomic evaluation accuracy must be studied. It is also interesting to study the consequences of the use of low-density SNP chips in genomic evaluation accuracy. In this perspective, the interest of using imputation in genomic selection was studied in a pure layer line. Two low-density SNP chip designs were compared: an equidistant methodology and a methodology based on linkage disequilibrium. Egg weight, egg shell color, egg shell strength, and albumen height were evaluated with single-step genomic best linear unbiased prediction methodology. The impact of imputation errors or the absence of imputation on the ranking of the male selection candidates was assessed with a genomic evaluation based on ancestry. Thus, genomic estimated breeding values (GEBV) obtained with imputed HD genotypes or low-density genotypes were compared with GEBV obtained with the HD SNP chip. The relative accuracy of GEBV was also investigated by considering as reference GEBV estimated on the offspring. A limited reordering of the breeders, selected on a multitrait index, was observed. Spearman correlations between GEBV on HD genotypes and GEBV on low-density genotypes (with or without imputation) were always higher than 0.94 with more than 3K SNP. For the genetically closer, top 150 individuals for a specific trait, with imputation, the reordering was reduced with correlation higher than 0.94 with more than 3K SNP. Without imputation, the correlations remained lower than 0.85 with less than 3K and 16K SNP for equidistant and linkage disequilibrium methodology, respectively. The differences in GEBV correlations between both methodologies were never significant. The conclusions were the same for all studied traits.
Collapse
|
10
|
Hou L, Liang W, Xu G, Huang B, Zhang X, Hu CY, Wang C. Accuracy of genomic prediction using mixed low-density marker panels. ANIMAL PRODUCTION SCIENCE 2020. [DOI: 10.1071/an18503] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Low-density single-nucleotide polymorphism (LD-SNP) panel is one effective way to reduce the cost of genomic selection in animal breeding. The present study proposes a new type of LD-SNP panel called mixed low-density (MLD) panel, which considers SNPs with a substantial effect estimated by Bayes method B (BayesB) from many traits and evenly spaced distribution simultaneously. Simulated and real data were used to compare the imputation accuracy and genomic-selection accuracy of two types of LD-SNP panels. The result of genotyping imputation for simulated data showed that the number of quantitative trait loci (QTL) had limited influence on the imputation accuracy only for MLD panels. Evenly spaced (ELD) panel was not affected by QTL. For real data, ELD performed slightly better than did MLD when panel contained 500 and 1000 SNP. However, this advantage vanished quickly as the density increased. The result of genomic selection for simulated data using BayesB showed that MLD performed much better than did ELD when QTL was 100. For real data, MLD also outperformed ELD in growth and carcass traits when using BayesB. In conclusion, the MLD strategy is superior to ELD in genomic selection under most situations.
Collapse
|
11
|
Bresolin T, Rosa GJDM, Valente BD, Espigolan R, Gordo DGM, Braz CU, Fernandes Júnior GA, Magalhães AFB, Garcia DA, Frezarim GB, Leão GFC, Carvalheiro R, Baldi F, Nunes de Oliveira H, Galvão de Albuquerque L. Effect of quality control, density and allele frequency of markers on the accuracy of genomic prediction for complex traits in Nellore cattle. ANIMAL PRODUCTION SCIENCE 2019. [DOI: 10.1071/an16821] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
This study was designed to test the impact of quality control, density and allele frequency of single nucleotide polymorphisms (SNP) markers on the accuracy of genomic predictions, using three traits with different heritabilities and two methods of prediction in a Nellore cattle population genotyped with the Illumina Bovine HD Assay. A total of 1756; 3150 and 3119 records of age at first calving (AFC); weaning weight (WW) and yearling weight (YW), respectively, were used. Three scenarios with different exclusion thresholds for minor allele frequency (MAF), deviation from Hardy–Weinberg equilibrium (HWE) and correlation between SNP pairs (r2) were constructed for all traits: (1) high rigor (S1): call rate <0.98, MAF <0.05, HWE with P <10−5, and r2 >0.999; (2) Moderate rigor (S2): call rate <0.85 and MAF <0.01; (3) Low rigor (S3): only non-autosomal SNP and those mapped on the same position were excluded. Additionally, to assess the prediction accuracy from different markers density, six panels (10K, 50K, 100K, 300K, 500K and 700K) were customised using the high-density genotyping assay as reference. Finally, from the markers available in high-density genotyping assay, six groups (G) with different minor allele frequency bins were defined to estimate the accuracy of genomic prediction. The range of MAF bins was approximately equal for the traits studied: G1 (0.000–0.009), G2 (0.010–0.064), G3 (0.065–0.174), G4 (0.175–0.325), G5 (0.326–0.500) and G6 (0.000–0.500). The Genomic Best Linear Unbiased Predictor and BayesCπ methods were used to estimate the SNP marker effects. Five-fold cross-validation was used to measure the accuracy of genomic prediction for all scenarios. There were no effects of genotypes quality control criteria on the accuracies of genomic predictions. For all traits, the higher density panel did not provide greater prediction accuracies than the low density one (10K panel). The groups of SNP with low MAF (MAF ≤0.007 for AFC, MAF ≤0.009 for WW and MAF ≤0.008 for YW) provided lower prediction accuracies than the groups with higher allele frequencies.
Collapse
|
12
|
Abo-Ismail MK, Lansink N, Akanno E, Karisa BK, Crowley JJ, Moore SS, Bork E, Stothard P, Basarab JA, Plastow GS. Development and validation of a small SNP panel for feed efficiency in beef cattle. J Anim Sci 2018; 96:375-397. [PMID: 29390120 DOI: 10.1093/jas/sky020] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 01/17/2018] [Indexed: 12/11/2022] Open
Abstract
The objective of this study was to develop and validate a customized cost-effective single nucleotide polymorphism (SNP) panel for genetic improvement of feed efficiency in beef cattle. The SNPs identified in previous association studies and through extensive analysis of candidate genomic regions and genes, were screened for their functional impact and allele frequency in Angus and Hereford breeds used as validation candidates for the panel. Association analyses were performed on genotypes of 159 SNPs from new samples of Angus (n = 160), Hereford (n = 329), and Angus-Hereford crossbred (n = 382) cattle using allele substitution and genotypic models in ASReml. Genomic heritabilities were estimated for feed efficiency traits using the full set of SNPs, SNPs associated with at least one of the traits (at P ≤ 0.05 and P < 0.10), as well as the Illumina bovine 50K representing a widely used commercial genotyping panel. A total of 63 SNPs within 43 genes showed association (P ≤ 0.05) with at least one trait. The minor alleles of SNPs located in the GHR and CAST genes were associated with decreasing effects on residual feed intake (RFI) and/or RFI adjusted for backfat (RFIf), whereas minor alleles of SNPs within MKI67 gene were associated with increasing effects on RFI and RFIf. Additionally, the minor allele of rs137400016 SNP within CNTFR was associated with increasing average daily gain (ADG). The SNPs genotypes within UMPS, SMARCAL, CCSER1, and LMCD1 genes showed significant over-dominance effects whereas other SNPs located in SMARCAL1, ANXA2, CACNA1G, and PHYHIPL genes showed additive effects on RFI and RFIf. Gene enrichment analysis indicated that gland development, as well as ion and cation transport are important physiological mechanisms contributing to variation in feed efficiency traits. The study revealed the effect of the Jak-STAT signaling pathway on feed efficiency through the CNTFR, OSMR, and GHR genes. Genomic heritability using the 63 significant (P ≤ 0.05) SNPs was 0.09, 0.09, 0.13, 0.05, 0.05, and 0.07 for ADG, dry matter intake, midpoint metabolic weight, RFI, RFIf, and backfat, respectively. These SNPs contributed to genetic variation in the studied traits and thus can potentially be used or tested to generate cost-effective molecular breeding values for feed efficiency in beef cattle.
Collapse
Affiliation(s)
- M K Abo-Ismail
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
- Animal and Poultry Production Department, Damanhour University, Damanhour, Egypt
| | - N Lansink
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
| | - E Akanno
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
| | - B K Karisa
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
| | - J J Crowley
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
- Canadian Beef Breeds Council, Calgary, AB, Canada
| | - S S Moore
- Centre for Animal Science, University of Queensland, St Lucia, Australia
| | - E Bork
- Rangeland Research Institute, Agriculture/Forestry Center, University of Alberta, Edmonton, AB, Canada
| | - P Stothard
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
| | - J A Basarab
- Alberta Agriculture and Forestry, Lacombe Research Centre, Lacombe, AB, Canada
| | - G S Plastow
- Livestock Gentec at University of Alberta, Edmonton, AB, Canada
| |
Collapse
|
13
|
Arojju SK, Conaghan P, Barth S, Milbourne D, Casler MD, Hodkinson TR, Michel T, Byrne SL. Genomic prediction of crown rust resistance in Lolium perenne. BMC Genet 2018; 19:35. [PMID: 29843601 PMCID: PMC5975627 DOI: 10.1186/s12863-018-0613-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 04/18/2018] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Genomic selection (GS) can accelerate genetic gains in breeding programmes by reducing the time it takes to complete a cycle of selection. Puccinia coronata f. sp lolli (crown rust) is one of the most widespread diseases of perennial ryegrass and can lead to reductions in yield, persistency and nutritional value. Here, we used a large perennial ryegrass population to assess the accuracy of using genome wide markers to predict crown rust resistance and to investigate the factors affecting predictive ability. RESULTS Using these data, predictive ability for crown rust resistance in the complete population reached a maximum of 0.52. Much of the predictive ability resulted from the ability of markers to capture genetic relationships among families within the training set, and reducing the marker density had little impact on predictive ability. Using permutation based variable importance measure and genome wide association studies (GWAS) to identify and rank markers enabled the identification of a small subset of SNPs that could achieve predictive abilities close to those achieved using the complete marker set. CONCLUSION Using a GWAS to identify and rank markers enabled a small panel of markers to be identified that could achieve higher predictive ability than the same number of randomly selected markers, and predictive abilities close to those achieved with the entire marker set. This was particularly evident in a sub-population characterised by having on-average higher genome-wide linkage disequilibirum (LD). Higher predictive abilities with selected markers over random markers suggests they are in LD with QTL. Accuracy due to genetic relationships will decay rapidly over generations whereas accuracy due to LD will persist, which is advantageous for practical breeding applications.
Collapse
Affiliation(s)
- Sai Krishna Arojju
- Teagasc, Crop Science Department, Oak Park, Carlow, R93 XE12 Ireland
- Department of Botany, School of Natural Sciences, Trinity College Dublin, Dublin 2, Ireland
| | - Patrick Conaghan
- Teagasc, Grassland Science Research Department, Animal and Grassland Research and Innovation Centre, Oak Park, Carlow, R93 XE12 Ireland
| | - Susanne Barth
- Teagasc, Crop Science Department, Oak Park, Carlow, R93 XE12 Ireland
| | - Dan Milbourne
- Teagasc, Crop Science Department, Oak Park, Carlow, R93 XE12 Ireland
| | - Michael D. Casler
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI53706 USA
- Agricultural Research Service, United State Department of Agriculture, Madison, WI53706 USA
| | - Trevor R. Hodkinson
- Department of Botany, School of Natural Sciences, Trinity College Dublin, Dublin 2, Ireland
| | - Thibauld Michel
- Teagasc, Crop Science Department, Oak Park, Carlow, R93 XE12 Ireland
| | - Stephen L. Byrne
- Teagasc, Crop Science Department, Oak Park, Carlow, R93 XE12 Ireland
| |
Collapse
|
14
|
Tan B, Grattapaglia D, Martins GS, Ferreira KZ, Sundberg B, Ingvarsson PK. Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F 1 hybrids. BMC PLANT BIOLOGY 2017; 17:110. [PMID: 28662679 PMCID: PMC5492818 DOI: 10.1186/s12870-017-1059-6] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 06/15/2017] [Indexed: 05/18/2023]
Abstract
BACKGROUND Genomic prediction is a genomics assisted breeding methodology that can increase genetic gains by accelerating the breeding cycle and potentially improving the accuracy of breeding values. In this study, we use 41,304 informative SNPs genotyped in a Eucalyptus breeding population involving 90 E.grandis and 78 E.urophylla parents and their 949 F1 hybrids to develop genomic prediction models for eight phenotypic traits - basic density and pulp yield, circumference at breast height and height and tree volume scored at age three and six years. We assessed the impact of different genomic prediction methods, the composition and size of the training and validation set and the number and genomic location of SNPs on the predictive ability (PA). RESULTS Heritabilities estimated using the realized genomic relationship matrix (GRM) were considerably higher than estimates based on the expected pedigree, mainly due to inconsistencies in the expected pedigree that were readily corrected by the GRM. Moreover, the GRM more precisely capture Mendelian sampling among related individuals, such that the genetic covariance was based on the true proportion of the genome shared between individuals. PA improved considerably when increasing the size of the training set and by enhancing relatedness to the validation set. Prediction models trained on pure species parents could not predict well in F1 hybrids, indicating that model training has to be carried out in hybrid populations if one is to predict in hybrid selection candidates. The different genomic prediction methods provided similar results for all traits, therefore either GBLUP or rrBLUP represents better compromises between computational time and prediction efficiency. Only slight improvement was observed in PA when more than 5000 SNPs were used for all traits. Using SNPs in intergenic regions provided slightly better PA than using SNPs sampled exclusively in genic regions. CONCLUSIONS The size and composition of the training set and number of SNPs used are the two most important factors for model prediction, compared to the statistical methods and the genomic location of SNPs. Furthermore, training the prediction model based on pure parental species only provide limited ability to predict traits in interspecific hybrids. Our results provide additional promising perspectives for the implementation of genomic prediction in Eucalyptus breeding programs by the selection of interspecific hybrids.
Collapse
Affiliation(s)
- Biyue Tan
- Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, Umeå, SE-90187 Sweden
- Biomaterials Division, Stora Enso AB, Nacka, SE-13104 Sweden
| | - Dario Grattapaglia
- EMBRAPA Genetic Resources and Biotechnology – EPqB, Brasilia, DF 70770-910 Brazil
- Universidade Católica de Brasília- SGAN, 916 modulo B, Brasilia, DF 70790-160 Brazil
| | | | | | - Björn Sundberg
- Biomaterials Division, Stora Enso AB, Nacka, SE-13104 Sweden
| | - Pär K. Ingvarsson
- Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, Umeå, SE-90187 Sweden
- Present address: Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Sciences, Uppsala, SE-75007 Sweden
| |
Collapse
|
15
|
Zhang Z, Xu ZQ, Luo YY, Zhang HB, Gao N, He JL, Ji CL, Zhang DX, Li JQ, Zhang XQ. Whole genomic prediction of growth and carcass traits in a Chinese quality chicken population. J Anim Sci 2017; 95:72-80. [PMID: 28177394 DOI: 10.2527/jas.2016.0823] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
By incorporating high-density markers into breeding value prediction models, the whole genomic prediction (WGP) method can effectively accelerate genetic improvement in livestock breeding. However, the performance of WGP varies across species and populations and is affected by the underlying genetic architecture. In particular, very little is known about the performance of WGP for many chicken breeds. Here we estimate the genetic parameters and evaluate the performance of WGP for 18 growth and carcass traits in a Chinese quality chicken population. In total, 435 chickens were systematically phenotyped and genotyped using a 600K genotyping array. Two variance component estimation scenarios, 3 breeding value prediction methods, and 2 validation procedures were compared. The results showed that the heritability of these 18 traits was medium to high (ranging from 0.28 to 0.60) and that deviations existed between the heritability estimated from pedigrees and markers. Compared with conventional breeding methods, WGP could potentially increase the selection accuracy by 20% or more depending on the prediction model used, the trait under consideration, and the genetic connectedness between the training and validation individuals. Our results showed the potential of implementing genomic selection in small breeding herds.
Collapse
|
16
|
Xia J, Fan H, Chang T, Xu L, Zhang W, Song Y, Zhu B, Zhang L, Gao X, Chen Y, Li J, Gao H. Searching for new loci and candidate genes for economically important traits through gene-based association analysis of Simmental cattle. Sci Rep 2017; 7:42048. [PMID: 28169328 PMCID: PMC5294460 DOI: 10.1038/srep42048] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 01/06/2017] [Indexed: 12/26/2022] Open
Abstract
Single-marker genome-wide association study (GWAS) is a convenient strategy of genetic analysis that has been successful in detecting the association of a number of single-nucleotide polymorphisms (SNPs) with quantitative traits. However, analysis of individual SNPs can only account for a small proportion of genetic variation and offers only limited knowledge of complex traits. This inadequacy may be overcome by employing a gene-based GWAS analytic approach, which can be considered complementary to the single-SNP association analysis. Here we performed an initial single-SNP GWAS for bone weight (BW) and meat pH value with a total of 770,000 SNPs in 1141 Simmental cattle. Additionally, 21836 cattle genes collected from the Ensembl Genes 83 database were analyzed to find supplementary evidence to support the importance of gene-based association study. Results of the single SNP-based association study showed that there were 11 SNPs significantly associated with bone weight (BW) and two SNPs associated with meat pH value. Interestingly, all of these SNPs were located in genes detected by the gene-based association study.
Collapse
Affiliation(s)
- Jiangwei Xia
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Huizhong Fan
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Tianpeng Chang
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Lingyang Xu
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Wengang Zhang
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Yuxin Song
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Bo Zhu
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Lupei Zhang
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Xue Gao
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Yan Chen
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Junya Li
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| | - Huijiang Gao
- Institute of Animal Science, Chinese Academy of Agricultural Science, Beijing, China
| |
Collapse
|
17
|
Moghaddar N, Gore KP, Daetwyler HD, Hayes BJ, van der Werf JHJ. Accuracy of genotype imputation based on random and selected reference sets in purebred and crossbred sheep populations and its effect on accuracy of genomic prediction. Genet Sel Evol 2015; 47:97. [PMID: 26694131 PMCID: PMC4688977 DOI: 10.1186/s12711-015-0175-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 11/30/2015] [Indexed: 02/02/2023] Open
Abstract
Background The objectives of this study were to investigate the accuracy of genotype imputation from low (12k) to medium (50k Illumina-Ovine) SNP (single nucleotide polymorphism) densities in purebred and crossbred Merino sheep based on a random or selected reference set and to evaluate the impact of using imputed genotypes on accuracy of genomic prediction. Methods Imputation validation sets were composed of random purebred or crossbred Merinos, while imputation reference sets were of variable sizes and included random purebred or crossbred Merinos or a group of animals that were selected based on high genetic relatedness to animals in the validation set. The Beagle software program was used for imputation and accuracy of imputation was assessed based on the Pearson correlation coefficient between observed and imputed genotypes. Genomic evaluation was performed based on genomic best linear unbiased prediction and its accuracy was evaluated as the Pearson correlation coefficient between genomic estimated breeding values using either observed (12k/50k) or imputed genotypes with varying levels of imputation accuracy and accurate estimated breeding values based on progeny-tests. Results Imputation accuracy increased as the size of the reference set increased. However, accuracy was higher for purebred Merinos that were imputed from other purebred Merinos (on average 0.90 to 0.95 based on 1000 to 3000 animals) than from crossbred Merinos (0.78 to 0.87 based on 1000 to 3000 animals) or from non-Merino purebreds (on average 0.50). The imputation accuracy for crossbred Merinos based on 1000 to 3000 other crossbred Merino ranged from 0.86 to 0.88. Considerably higher imputation accuracy was observed when a selected reference set with a high genetic relationship to target animals was used vs. a random reference set of the same size (0.96 vs. 0.88, respectively). Accuracy of genomic prediction based on 50k genotypes imputed with high accuracy (0.88 to 0.99) decreased only slightly (0.0 to 0.67 % across traits) compared to using observed 50k genotypes. Accuracy of genomic prediction based on observed 12k genotypes was higher than accuracy based on lowly accurate (0.62 to 0.86) imputed 50k genotypes.
Collapse
Affiliation(s)
- Nasir Moghaddar
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| | - Klint P Gore
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,Animal Genetics & Breeding Unit (AGBU), University of New England, Armidale, NSW, 2351, Australia.
| | - Hans D Daetwyler
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, VIC, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia.
| | - Ben J Hayes
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, VIC, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia.
| | - Julius H J van der Werf
- Cooperative Research Centre for Sheep Industry Innovation, Armidale, NSW, 2351, Australia. .,School of Environmental and Rural Science, University of New England, Armidale, NSW, 2351, Australia.
| |
Collapse
|
18
|
Ogawa S, Matsuda H, Taniguchi Y, Watanabe T, Sugimoto Y, Iwaisaki H. Estimation of variance and genomic prediction using genotypes imputed from low-density marker subsets for carcass traits in Japanese black cattle. Anim Sci J 2015; 87:1106-13. [DOI: 10.1111/asj.12570] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Revised: 09/15/2015] [Accepted: 10/07/2015] [Indexed: 12/31/2022]
Affiliation(s)
| | | | - Yukio Taniguchi
- Graduate School of Agriculture; Kyoto University; Kyoto Japan
| | | | - Yoshikazu Sugimoto
- Shirakawa Institute of Animal Genetics; Japan Livestock Technology Association; Nishigo Fukushima Japan
| | | |
Collapse
|
19
|
Gao N, Li J, He J, Xiao G, Luo Y, Zhang H, Chen Z, Zhang Z. Improving accuracy of genomic prediction by genetic architecture based priors in a Bayesian model. BMC Genet 2015; 16:120. [PMID: 26466667 PMCID: PMC4606514 DOI: 10.1186/s12863-015-0278-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2015] [Accepted: 10/08/2015] [Indexed: 11/15/2022] Open
Abstract
Background In recent years, with the development of high-throughput sequencing technology and the commercial availability of genotyping bead chips, more attention is being directed towards the utilization of abundant genetic markers in animal and plant breeding programs, human disease risk prediction and personal medicine. Several useful approaches to accomplish genomic prediction have been developed and used widely, but still have room for improvement to gain more accuracy. In this study, an improved Bayesian approach, termed BayesBπ, which differs from the original BayesB in priors assigning, is proposed. An effective method for calculating the locus-specific π by converting p-values from association between SNPs and traits’ phenotypes is given and systemically validated using a German Holstein dairy cattle population. Furthermore, the new method is applied to a loblolly pine (Pinus taeda) dataset. Results Compared with the original BayesB, BayesBπ can improve the accuracy of genomic prediction up to 7.62 % for milk fat percentage, a trait which shows a large effect of quantitative trait loci (QTL). For milk yield, which is controlled by small to moderate effect genes, the accuracy of genomic prediction can be improved up to 4.94 %. For somatic cell score, of which no large effect QTL has been reported, GBLUP performs better than Bayesian methods. BayesBπ outperforms BayesCπ in 10 out of 12 scenarios in the dairy cattle population, especially in small to moderate population sizes where accuracy of BayesCπ are dramatically low. Results of the loblolly pine dataset show that BayesBπ outperforms BayesB in 14 out of 17 traits and BayesCπ in 8 out of 17 traits, respectively. Conclusions For traits controlled by large effect genes, BayesBπ can improve the accuracy of genomic prediction and unbiasedness of BayesB in moderate size populations. Knowledge of traits’ genetic architectures can be integrated into practices of genomic prediction by assigning locus-specific priors to markers, which will help Bayesian approaches perform better in variable selection and marker effects shrinkage. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0278-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ning Gao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China. .,Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen, 37075, Germany.
| | - Jiaqi Li
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Jinlong He
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Guang Xiao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Yuanyu Luo
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Hao Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Zanmou Chen
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Zhe Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China. .,Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen, 37075, Germany.
| |
Collapse
|
20
|
Zhang Z, Erbe M, He J, Ober U, Gao N, Zhang H, Simianer H, Li J. Accuracy of whole-genome prediction using a genetic architecture-enhanced variance-covariance matrix. G3 (BETHESDA, MD.) 2015; 5:615-27. [PMID: 25670771 PMCID: PMC4390577 DOI: 10.1534/g3.114.016261] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Accepted: 02/05/2015] [Indexed: 01/22/2023]
Abstract
Obtaining accurate predictions of unobserved genetic or phenotypic values for complex traits in animal, plant, and human populations is possible through whole-genome prediction (WGP), a combined analysis of genotypic and phenotypic data. Because the underlying genetic architecture of the trait of interest is an important factor affecting model selection, we propose a new strategy, termed BLUP|GA (BLUP-given genetic architecture), which can use genetic architecture information within the dataset at hand rather than from public sources. This is achieved by using a trait-specific covariance matrix ( T: ), which is a weighted sum of a genetic architecture part ( S: matrix) and the realized relationship matrix ( G: ). The algorithm of BLUP|GA (BLUP-given genetic architecture) is provided and illustrated with real and simulated datasets. Predictive ability of BLUP|GA was validated with three model traits in a dairy cattle dataset and 11 traits in three public datasets with a variety of genetic architectures and compared with GBLUP and other approaches. Results show that BLUP|GA outperformed GBLUP in 20 of 21 scenarios in the dairy cattle dataset and outperformed GBLUP, BayesA, and BayesB in 12 of 13 traits in the analyzed public datasets. Further analyses showed that the difference of accuracies for BLUP|GA and GBLUP significantly correlate with the distance between the T: and G: matrices. The new strategy applied in BLUP|GA is a favorable and flexible alternative to the standard GBLUP model, allowing to account for the genetic architecture of the quantitative trait under consideration when necessary. This feature is mainly due to the increased similarity between the trait-specific relationship matrix ( T: matrix) and the genetic relationship matrix at unobserved causal loci. Applying BLUP|GA in WGP would ease the burden of model selection.
Collapse
Affiliation(s)
- Zhe Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen 37075, Germany
| | - Malena Erbe
- Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen 37075, Germany
| | - Jinlong He
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - Ulrike Ober
- Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen 37075, Germany
| | - Ning Gao
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - Hao Zhang
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| | - Henner Simianer
- Department of Animal Sciences, Animal Breeding and Genetics Group, Georg-August-Universität Göttingen, Göttingen 37075, Germany
| | - Jiaqi Li
- National Engineering Research Center for Breeding Swine Industry, Guangdong Provincial Key Lab of Agro-animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
21
|
Zhang Z, Li X, Ding X, Li J, Zhang Q. GPOPSIM: a simulation tool for whole-genome genetic data. BMC Genet 2015; 16:10. [PMID: 25652552 PMCID: PMC4328615 DOI: 10.1186/s12863-015-0173-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2014] [Accepted: 01/22/2015] [Indexed: 01/16/2023] Open
Abstract
Background Population-wide genotypic and phenotypic data is frequently used to predict the disease risk or genetic/phenotypic values, or to localize genetic variations responsible for complex traits. GPOPSIM is a simulation tool for pedigree, phenotypes, and genomic data, with a variety of population and genome structures and trait genetic architectures. It provides flexible parameter settings for a wide discipline of users, especially can simulate multiple genetically correlated traits with desired genetic parameters and underlying genetic architectures. Results The model implemented in GPOPSIM is presented, and the code has been made freely available to the community. Data simulated by GPOPSIM is a good mimic to the real data in terms of genome structure and trait underlying genetic architecture. Conclusions GPOPSIM would be a useful tool for the methodological and theoretical studies in the population and quantitative genetics and breeding. Electronic supplementary material The online version of this article (doi:10.1186/s12863-015-0173-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Xiujin Li
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, College of Animal Science, South China Agricultural University, Guangzhou, 510642, China.
| | - Qin Zhang
- Key Laboratory of Animal Genetics and Breeding of the Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College of Animal Science and Technology, College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
22
|
Isidro J, Jannink JL, Akdemir D, Poland J, Heslot N, Sorrells ME. Training set optimization under population structure in genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2015; 128:145-58. [PMID: 25367380 PMCID: PMC4282691 DOI: 10.1007/s00122-014-2418-4] [Citation(s) in RCA: 164] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Accepted: 10/12/2014] [Indexed: 05/17/2023]
Abstract
Population structure must be evaluated before optimization of the training set population. Maximizing the phenotypic variance captured by the training set is important for optimal performance. The optimization of the training set (TRS) in genomic selection has received much interest in both animal and plant breeding, because it is critical to the accuracy of the prediction models. In this study, five different TRS sampling algorithms, stratified sampling, mean of the coefficient of determination (CDmean), mean of predictor error variance (PEVmean), stratified CDmean (StratCDmean) and random sampling, were evaluated for prediction accuracy in the presence of different levels of population structure. In the presence of population structure, the most phenotypic variation captured by a sampling method in the TRS is desirable. The wheat dataset showed mild population structure, and CDmean and stratified CDmean methods showed the highest accuracies for all the traits except for test weight and heading date. The rice dataset had strong population structure and the approach based on stratified sampling showed the highest accuracies for all traits. In general, CDmean minimized the relationship between genotypes in the TRS, maximizing the relationship between TRS and the test set. This makes it suitable as an optimization criterion for long-term selection. Our results indicated that the best selection criterion used to optimize the TRS seems to depend on the interaction of trait architecture and population structure.
Collapse
|
23
|
Ryu J, Lee C. Genomic heritability of bovine growth using a mixed model. ASIAN-AUSTRALASIAN JOURNAL OF ANIMAL SCIENCES 2014; 27:1521-5. [PMID: 25358309 PMCID: PMC4213694 DOI: 10.5713/ajas.2014.14287] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2014] [Revised: 06/24/2014] [Accepted: 07/08/2014] [Indexed: 11/27/2022]
Abstract
This study investigated heritability for bovine growth estimated with genomewide single nucleotide polymorphism (SNP) information obtained from a DNA microarray chip. Three hundred sixty seven Korean cattle were genotyped with the Illumina BovineSNP50 BeadChip, and 39,112 SNPs of 364 animals filtered by quality assurance were analyzed to estimate heritability of body weights at 6, 9, 12, 15, 18, 21, and 24 months of age. Restricted maximum likelihood estimate of heritability was obtained using covariance structure of genomic relationships among animals in a mixed model framework. Heritability estimates ranged from 0.58 to 0.76 for body weights at different ages. The heritability estimates using genomic information in this study were larger than those which had been estimated previously using pedigree information. The results revealed a trend that the heritability for body weight increased at a younger age (6 months). This suggests an early genetic evaluation for bovine growth using genomic information to increase genetic merits of animals.
Collapse
|
24
|
Impact of genotype imputation on the performance of GBLUP and Bayesian methods for genomic prediction. PLoS One 2014; 9:e101544. [PMID: 25025158 PMCID: PMC4099124 DOI: 10.1371/journal.pone.0101544] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2014] [Accepted: 06/09/2014] [Indexed: 01/25/2023] Open
Abstract
The aim of this study was to evaluate the impact of genotype imputation on the performance of the GBLUP and Bayesian methods for genomic prediction. A total of 10,309 Holstein bulls were genotyped on the BovineSNP50 BeadChip (50 k). Five low density single nucleotide polymorphism (SNP) panels, containing 6,177, 2,480, 1,536, 768 and 384 SNPs, were simulated from the 50 k panel. A fraction of 0%, 33% and 66% of the animals were randomly selected from the training sets to have low density genotypes which were then imputed into 50 k genotypes. A GBLUP and a Bayesian method were used to predict direct genomic values (DGV) for validation animals using imputed or their actual 50 k genotypes. Traits studied included milk yield, fat percentage, protein percentage and somatic cell score (SCS). Results showed that performance of both GBLUP and Bayesian methods was influenced by imputation errors. For traits affected by a few large QTL, the Bayesian method resulted in greater reductions of accuracy due to imputation errors than GBLUP. Including SNPs with largest effects in the low density panel substantially improved the accuracy of genomic prediction for the Bayesian method. Including genotypes imputed from the 6 k panel achieved almost the same accuracy of genomic prediction as that of using the 50 k panel even when 66% of the training population was genotyped on the 6 k panel. These results justified the application of the 6 k panel for genomic prediction. Imputations from lower density panels were more prone to errors and resulted in lower accuracy of genomic prediction. But for animals that have close relationship to the reference set, genotype imputation may still achieve a relatively high accuracy.
Collapse
|
25
|
Buzanskas ME, Grossi DA, Ventura RV, Schenkel FS, Sargolzaei M, Meirelles SLC, Mokry FB, Higa RH, Mudadu MA, da Silva MVGB, Niciura SCM, Júnior RAAT, Alencar MM, Regitano LCA, Munari DP. Genome-wide association for growth traits in Canchim beef cattle. PLoS One 2014; 9:e94802. [PMID: 24733441 PMCID: PMC3986245 DOI: 10.1371/journal.pone.0094802] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2013] [Accepted: 03/20/2014] [Indexed: 12/01/2022] Open
Abstract
Studies are being conducted on the applicability of genomic data to improve the accuracy of the selection process in livestock, and genome-wide association studies (GWAS) provide valuable information to enhance the understanding on the genetics of complex traits. The aim of this study was to identify genomic regions and genes that play roles in birth weight (BW), weaning weight adjusted for 210 days of age (WW), and long-yearling weight adjusted for 420 days of age (LYW) in Canchim cattle. GWAS were performed by means of the Generalized Quasi-Likelihood Score (GQLS) method using genotypes from the BovineHD BeadChip and estimated breeding values for BW, WW, and LYW. Data consisted of 285 animals from the Canchim breed and 114 from the MA genetic group (derived from crossings between Charolais sires and ½ Canchim + ½ Zebu dams). After applying a false discovery rate correction at a 10% significance level, a total of 4, 12, and 10 SNPs were significantly associated with BW, WW, and LYW, respectively. These SNPs were surveyed to their corresponding genes or to surrounding genes within a distance of 250 kb. The genes DPP6 (dipeptidyl-peptidase 6) and CLEC3B (C-type lectin domain family 3 member B) were highlighted, considering its functions on the development of the brain and skeletal system, respectively. The GQLS method identified regions on chromosome associated with birth weight, weaning weight, and long-yearling weight in Canchim and MA animals. New candidate regions for body weight traits were detected and some of them have interesting biological functions, of which most have not been previously reported. The observation of QTL reports for body weight traits, covering areas surrounding the genes (SNPs) herein identified provides more evidence for these associations. Future studies targeting these areas could provide further knowledge to uncover the genetic architecture underlying growth traits in Canchim cattle.
Collapse
Affiliation(s)
- Marcos E. Buzanskas
- Departamento de Ciências Exatas, UNESP - Univ Estadual Paulista, Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal, São Paulo, Brazil
| | - Daniela A. Grossi
- Department of Animal and Poultry Science, University of Guelph, Centre for Genetic Improvement of Livestock (CGIL), Guelph, Ontario, Canada
| | - Ricardo V. Ventura
- Department of Animal and Poultry Science, University of Guelph, Centre for Genetic Improvement of Livestock (CGIL), Guelph, Ontario, Canada
- Beef Improvement Opportunities (BIO), Guelph, Ontario, Canada
| | - Flávio S. Schenkel
- Department of Animal and Poultry Science, University of Guelph, Centre for Genetic Improvement of Livestock (CGIL), Guelph, Ontario, Canada
| | - Mehdi Sargolzaei
- Department of Animal and Poultry Science, University of Guelph, Centre for Genetic Improvement of Livestock (CGIL), Guelph, Ontario, Canada
- The Semex Alliance, Guelph, Ontario, Canada
| | - Sarah L. C. Meirelles
- Department of Animal Science, Federal University of Lavras (UFLA), Lavras, Minas Gerais, Brazil
| | - Fabiana B. Mokry
- Department of Genetics and Evolution, Federal University of São Carlos (UFSCar), São Carlos, São Paulo, Brazil
| | - Roberto H. Higa
- Embrapa Agricultural Informatics, Campinas, São Paulo, Brazil
| | | | | | | | | | | | | | - Danísio P. Munari
- Departamento de Ciências Exatas, UNESP - Univ Estadual Paulista, Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal, São Paulo, Brazil
- * E-mail:
| |
Collapse
|
26
|
Ogawa S, Matsuda H, Taniguchi Y, Watanabe T, Nishimura S, Sugimoto Y, Iwaisaki H. Effects of single nucleotide polymorphism marker density on degree of genetic variance explained and genomic evaluation for carcass traits in Japanese Black beef cattle. BMC Genet 2014; 15:15. [PMID: 24491120 PMCID: PMC3913948 DOI: 10.1186/1471-2156-15-15] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2013] [Accepted: 01/31/2014] [Indexed: 01/25/2023] Open
Abstract
Background Japanese Black cattle are a beef breed whose meat is well known to excel in meat quality, especially in marbling, and whose effective population size is relatively low in Japan. Unlike dairy cattle, the accuracy of genomic evaluation (GE) for carcass traits in beef cattle, including this breed, has been poorly studied. For carcass weight and marbling score in the breed, as well as the extent of whole genome linkage disequilibrium (LD), the effects of equally-spaced single nucleotide polymorphisms (SNPs) density on genomic relationship matrix (G matrix), genetic variance explained and GE were investigated using the genotype data of about 40,000 SNPs and two statistical models. Results Using all pairs of two adjacent SNPs in the whole SNP set, the means of LD (r2) at ranges 0–0.1, 0.1–0.2, 0.2–0.5 and 0.5–1 Mb were 0.22, 0.13, 0.10 and 0.08, respectively, and 25.7, 13.9, 10.4 and 6.4% of the r2 values exceeded 0.3, respectively. While about 90% of the genetic variance for carcass weight estimated using all available SNPs was explained using 4,000–6,000 SNPs, the corresponding percentage for marbling score was consistently lower. With the conventional linear model incorporating the G matrix, correlation between the genomic estimated breeding values (GEBVs) obtained using 4,000 SNPs and all available SNPs was 0.99 for carcass weight and 0.98 for marbling score, with an underestimation of the former GEBVs, especially for marbling score. Conclusions The Japanese Black is likely to be in a breed group with a relatively high extent of whole genome LD. The results indicated that the degree of marbling is controlled by only QTLs with relatively small effects, compared with carcass weight, and that using at least 4,000 equally-spaced SNPs, there is a possibility of ranking animals genetically for these carcass traits in this breed.
Collapse
Affiliation(s)
- Shinichiro Ogawa
- Graduate School of Agriculture, Kyoto University, Sakyo-ku, Kyoto 606-8502, Japan.
| | | | | | | | | | | | | |
Collapse
|
27
|
Predictive ability of selected subsets of single nucleotide polymorphisms (SNPs) in a moderately sized dairy cattle population. Animal 2014; 8:208-16. [DOI: 10.1017/s1751731113002188] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
28
|
Ding X, Zhang Z, Li X, Wang S, Wu X, Sun D, Yu Y, Liu J, Wang Y, Zhang Y, Zhang S, Zhang Y, Zhang Q. Accuracy of genomic prediction for milk production traits in the Chinese Holstein population using a reference population consisting of cows. J Dairy Sci 2013; 96:5315-23. [DOI: 10.3168/jds.2012-6194] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2012] [Accepted: 03/25/2013] [Indexed: 11/19/2022]
|
29
|
Mokry FB, Higa RH, de Alvarenga Mudadu M, Oliveira de Lima A, Meirelles SLC, Barbosa da Silva MVG, Cardoso FF, Morgado de Oliveira M, Urbinati I, Méo Niciura SC, Tullio RR, Mello de Alencar M, Correia de Almeida Regitano L. Genome-wide association study for backfat thickness in Canchim beef cattle using Random Forest approach. BMC Genet 2013; 14:47. [PMID: 23738659 PMCID: PMC3680339 DOI: 10.1186/1471-2156-14-47] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Accepted: 05/23/2013] [Indexed: 12/17/2022] Open
Abstract
Background Meat quality involves many traits, such as marbling, tenderness, juiciness, and backfat thickness, all of which require attention from livestock producers. Backfat thickness improvement by means of traditional selection techniques in Canchim beef cattle has been challenging due to its low heritability, and it is measured late in an animal’s life. Therefore, the implementation of new methodologies for identification of single nucleotide polymorphisms (SNPs) linked to backfat thickness are an important strategy for genetic improvement of carcass and meat quality. Results The set of SNPs identified by the random forest approach explained as much as 50% of the deregressed estimated breeding value (dEBV) variance associated with backfat thickness, and a small set of 5 SNPs were able to explain 34% of the dEBV for backfat thickness. Several quantitative trait loci (QTL) for fat-related traits were found in the surrounding areas of the SNPs, as well as many genes with roles in lipid metabolism. Conclusions These results provided a better understanding of the backfat deposition and regulation pathways, and can be considered a starting point for future implementation of a genomic selection program for backfat thickness in Canchim beef cattle.
Collapse
Affiliation(s)
- Fabiana Barichello Mokry
- Department of Genetics and Evolution, Federal University of São Carlos, Rodovia Washington Luiz, km 235, PO BOX 676, 13565-905, São Carlos, Brazil.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Watanabe T, Matsuda H, Arakawa A, Yamada T, Iwaisaki H, Nishimura S, Sugimoto Y. Estimation of variance components for carcass traits in Japanese Black cattle using 50K SNP genotype data. Anim Sci J 2013; 85:1-7. [DOI: 10.1111/asj.12074] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2012] [Accepted: 02/26/2013] [Indexed: 02/03/2023]
Affiliation(s)
- Toshio Watanabe
- National Livestock Breeding Center; Nishigo Japan
- Shirakawa Institute of Animal Genetics; Nishigo Japan
| | | | - Aisaku Arakawa
- Graduate School of Agriculture; Kyoto University; Kyoto Japan
| | - Takahisa Yamada
- Graduate School of Agriculture; Kyoto University; Kyoto Japan
| | | | | | | |
Collapse
|
31
|
Khatkar MS, Moser G, Hayes BJ, Raadsma HW. Strategies and utility of imputed SNP genotypes for genomic analysis in dairy cattle. BMC Genomics 2012; 13:538. [PMID: 23043356 PMCID: PMC3531262 DOI: 10.1186/1471-2164-13-538] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2012] [Accepted: 10/06/2012] [Indexed: 12/21/2022] Open
Abstract
Background We investigated strategies and factors affecting accuracy of imputing genotypes from lower-density SNP panels (Illumina 3K, 7K, Affymetrix 15K and 25K, and evenly spaced subsets) up to one medium (Illumina 50K) and one high-density (Illumina 800K) SNP panel. We also evaluated the utility of imputed genotypes on the accuracy of genomic selection using Australian Holstein-Friesian cattle data from 2727 and 845 animals genotyped with 50K and 800K SNP chip, respectively. Animals were divided into reference and test sets (genotyped with higher and lower density SNP panels, respectively) for evaluating the accuracies of imputation. For the accuracy of genomic selection, a comparison of direct genetic values (DGV) was made by dividing the data into training and validation sets under a range of imputation scenarios. Results Of the three methods compared for imputation, IMPUTE2 outperformed Beagle and fastPhase for almost all scenarios. Higher SNP densities in the test animals, larger reference sets and higher relatedness between test and reference animals increased the accuracy of imputation. 50K specific genotypes were imputed with moderate allelic error rates from 15K (2.85%) and 25K (2.75%) genotypes. Using IMPUTE2, SNP genotypes up to 800K were imputed with low allelic error rate (0.79% genome-wide) from 50K genotypes, and with moderate error rate from 3K (4.78%) and 7K (2.00%) genotypes. The error rate of imputing up to 800K from 3K or 7K was further reduced when an additional middle tier of 50K genotypes was incorporated in a 3-tiered framework. Accuracies of DGV for five production traits using imputed 50K genotypes were close to those obtained with the actual 50K genotypes and higher compared to using 3K or 7K genotypes. The loss in accuracy of DGV was small when most of the training animals also had imputed (50K) genotypes. Additional gains in DGV accuracies were small when SNP densities increased from 50K to imputed 800K. Conclusion Population-based genotype imputation can be used to predict and combine genotypes from different low, medium and high-density SNP chips with a high level of accuracy. Imputing genotypes from low-density SNP panels to at least 50K SNP density increases the accuracy of genomic selection.
Collapse
Affiliation(s)
- Mehar S Khatkar
- Reprogen - Animal Bioscience, Faculty of Veterinary Science, University of Sydney, 425 Werombi Road, Camden, NSW 2570, Australia.
| | | | | | | |
Collapse
|
32
|
Abstract
Genomic selection can increase genetic gain per generation through early selection. Genomic selection is expected to be particularly valuable for traits that are costly to phenotype and expressed late in the life cycle of long-lived species. Alternative approaches to genomic selection prediction models may perform differently for traits with distinct genetic properties. Here the performance of four different original methods of genomic selection that differ with respect to assumptions regarding distribution of marker effects, including (i) ridge regression–best linear unbiased prediction (RR–BLUP), (ii) Bayes A, (iii) Bayes Cπ, and (iv) Bayesian LASSO are presented. In addition, a modified RR–BLUP (RR–BLUP B) that utilizes a selected subset of markers was evaluated. The accuracy of these methods was compared across 17 traits with distinct heritabilities and genetic architectures, including growth, development, and disease-resistance properties, measured in a Pinus taeda (loblolly pine) training population of 951 individuals genotyped with 4853 SNPs. The predictive ability of the methods was evaluated using a 10-fold, cross-validation approach, and differed only marginally for most method/trait combinations. Interestingly, for fusiform rust disease-resistance traits, Bayes Cπ, Bayes A, and RR–BLUB B had higher predictive ability than RR–BLUP and Bayesian LASSO. Fusiform rust is controlled by few genes of large effect. A limitation of RR–BLUP is the assumption of equal contribution of all markers to the observed variation. However, RR-BLUP B performed equally well as the Bayesian approaches.The genotypic and phenotypic data used in this study are publically available for comparative analysis of genomic selection prediction models.
Collapse
|