1
|
Azevedo CF, Ferrão LFV, Benevenuto J, de Resende MDV, Nascimento M, Nascimento ACC, Munoz PR. Using visual scores for genomic prediction of complex traits in breeding programs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 137:9. [PMID: 38102495 DOI: 10.1007/s00122-023-04512-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 11/21/2023] [Indexed: 12/17/2023]
Abstract
KEY MESSAGE An approach for handling visual scores with potential errors and subjectivity in scores was evaluated in simulated and blueberry recurrent selection breeding schemes to assist breeders in their decision-making. Most genomic prediction methods are based on assumptions of normality due to their simplicity and ease of implementation. However, in plant and animal breeding, continuous traits are often visually scored as categorical traits and analyzed as a Gaussian variable, thus violating the normality assumption, which could affect the prediction of breeding values and the estimation of genetic parameters. In this study, we examined the main challenges of visual scores for genomic prediction and genetic parameter estimation using mixed models, Bayesian, and machine learning methods. We evaluated these approaches using simulated and real breeding data sets. Our contribution in this study is a five-fold demonstration: (i) collecting data using an intermediate number of categories (1-3 and 1-5) is the best strategy, even considering errors associated with visual scores; (ii) Linear Mixed Models and Bayesian Linear Regression are robust to the normality violation, but marginal gains can be achieved when using Bayesian Ordinal Regression Models (BORM) and Random Forest Classification; (iii) genetic parameters are better estimated using BORM; (iv) our conclusions using simulated data are also applicable to real data in autotetraploid blueberry; and (v) a comparison of continuous and categorical phenotypes found that investing in the evaluation of 600-1000 categorical data points with low error, when it is not feasible to collect continuous phenotypes, is a strategy for improving predictive abilities. Our findings suggest the best approaches for effectively using visual scores traits to explore genetic information in breeding programs and highlight the importance of investing in the training of evaluator teams and in high-quality phenotyping.
Collapse
Affiliation(s)
- Camila Ferreira Azevedo
- Statistics Department, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
- Horticultural Sciences Department, Blueberry Breeding and Genomics Lab, University of Florida, Gainesville, FL, USA
| | - Luis Felipe Ventorim Ferrão
- Horticultural Sciences Department, Blueberry Breeding and Genomics Lab, University of Florida, Gainesville, FL, USA
| | - Juliana Benevenuto
- Horticultural Sciences Department, Blueberry Breeding and Genomics Lab, University of Florida, Gainesville, FL, USA
| | - Marcos Deon Vilela de Resende
- Statistics Department, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
- Department of Forestry Engineering, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
- Embrapa Café, Brasília, Distrito Federal, Brazil
| | - Moyses Nascimento
- Statistics Department, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | - Patricio R Munoz
- Horticultural Sciences Department, Blueberry Breeding and Genomics Lab, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
2
|
Fritsche-Neto R, Ali J, De Asis EJ, Allahgholipour M, Labroo MR. Improving hybrid rice breeding programs via stochastic simulations: number of parents, number of hybrids, tester update, and genomic prediction of hybrid performance. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 137:3. [PMID: 38085288 PMCID: PMC10716074 DOI: 10.1007/s00122-023-04508-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 11/18/2023] [Indexed: 12/18/2023]
Abstract
KEY MESSAGE Schemes that use genomic prediction outperform others, updating testers increases hybrid genetic gain, and larger population sizes tend to have higher genetic gain and less depletion of genetic variance One of the most common methods to improve hybrid performance is reciprocal recurrent selection (RRS). Genomic prediction (GP) can be used to increase genetic gain in RRS by reducing cycle length, but it is also possible to use GP to predict single-cross hybrid performance. The impact of the latter method on genetic gain has yet to be previously reported. Therefore, we compared via stochastic simulations various phenotypic and genomics-assisted RRS breeding schemes which used GP to predict hybrid performance rather than reducing cycle length, which allows minimal changes to traditional breeding schemes. We also compared three breeding sizes scenarios that varied the number of genotypes crossed within heterotic pools, the number of genotypes crossed between heterotic pools, the number of hybrids evaluated, and the number of genomic predicted hybrids. Our results demonstrated that schemes that used genomic prediction of hybrid performance outperformed the others for the average interpopulation hybrid population and the best hybrid performance. Furthermore, updating the testers increased hybrid genetic gain with phenotypic RRS. As expected, the largest breeding size tested had the highest rates of genetic improvement and the lowest decrease in additive genetic variance due to the drift. Therefore, this study demonstrates the usefulness of single-cross prediction, which may be easier to implement than rapid-cycling RRS and cyclical updating of testers. We also reiterate that larger population sizes tend to have higher genetic gain and less depletion of genetic variance.
Collapse
Affiliation(s)
- Roberto Fritsche-Neto
- International Rice Research Institute (IRRI), Los Banos, Philippines.
- H. Rouse Caffey Rice Research Station, LSU AgCenter, Rayne, USA.
| | - Jauhar Ali
- International Rice Research Institute (IRRI), Los Banos, Philippines.
| | - Erik Jon De Asis
- International Rice Research Institute (IRRI), Los Banos, Philippines
| | | | - Marlee Rose Labroo
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Lisbon, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| |
Collapse
|
3
|
Canal GB, Oliveira GF, de Almeida FAN, Péres MZ, Moro GLJ, Dos Santos Oliveira WB, Azevedo CF, Nascimento M, da Silva Ferreira MF, Ferreira A. Genomic studies of the additive and dominant genetic control on production traits of Euterpe edulis fruits. Sci Rep 2023; 13:9795. [PMID: 37328527 PMCID: PMC10276026 DOI: 10.1038/s41598-023-36970-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 06/13/2023] [Indexed: 06/18/2023] Open
Abstract
In forest genetic improvement programs for non-domesticated species, limited knowledge of kinship can compromise or make the estimation of variance components and genetic parameters of traits of interest unfeasible. We used mixed models and genomics (in the latter, considering additive and non-additive effects) to evaluate the genetic architecture of 12 traits in juçaizeiro for fruit production. A population of 275 genotypes without genetic relationship knowledge was phenotyped over three years and genotyped by whole genome SNP markers. We have verified superiority in the quality of the fits, the prediction accuracy for unbalanced data, and the possibility of unfolding the genetic effects into their additive and non-additive terms in the genomic models. Estimates of the variance components and genetic parameters obtained by the additive models may be overestimated since, when considering the dominance effect in the model, there are substantial reductions in them. The number of bunches, fresh fruit mass of bunch, rachis length, fresh mass of 25 fruits, and amount of pulp were strongly influenced by the dominance effect, showing that genomic models with such effect should be considered for these traits, which may result in selective improvements by being able to return more accurate genomic breeding values. The present study reveals the additive and non-additive genetic control of the evaluated traits and highlights the importance of genomic information-based approaches for populations without knowledge of kinship and experimental design. Our findings underscore the critical role of genomic data in elucidating the genetic control architecture of quantitative traits, thereby providing crucial insights for driving species' genetic improvement.
Collapse
Affiliation(s)
- Guilherme Bravim Canal
- Department of Agronomy, Federal University of Espírito Santo, Alegre, Espírito Santo, 29500-000, Brazil
| | | | | | - Marcello Zatta Péres
- Department of Agronomy, Federal University of Espírito Santo, Alegre, Espírito Santo, 29500-000, Brazil
| | | | | | | | - Moysés Nascimento
- Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | - Adésio Ferreira
- Department of Agronomy, Federal University of Espírito Santo, Alegre, Espírito Santo, 29500-000, Brazil
| |
Collapse
|
4
|
de Andrade LRB, Sousa MBE, Wolfe M, Jannink JL, de Resende MDV, Azevedo CF, de Oliveira EJ. Increasing cassava root yield: Additive-dominant genetic models for selection of parents and clones. FRONTIERS IN PLANT SCIENCE 2022; 13:1071156. [PMID: 36589120 PMCID: PMC9800927 DOI: 10.3389/fpls.2022.1071156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Accepted: 12/02/2022] [Indexed: 06/17/2023]
Abstract
Genomic selection has been promising in situations where phenotypic assessments are expensive, laborious, and/or inefficient. This work evaluated the efficiency of genomic prediction methods combined with genetic models in clone and parent selection with the goal of increasing fresh root yield, dry root yield, as well as dry matter content in cassava roots. The bias and predictive ability of the combinations of prediction methods Genomic Best Linear Unbiased Prediction (G-BLUP), Bayes B, Bayes Cπ, and Reproducing Kernel Hilbert Spaces with additive and additive-dominant genetic models were estimated. Fresh and dry root yield exhibited predominantly dominant heritability, while dry matter content exhibited predominantly additive heritability. The combination of prediction methods and genetic models did not show significant differences in the predictive ability for dry matter content. On the other hand, the prediction methods with additive-dominant genetic models had significantly higher predictive ability than the additive genetic models for fresh and dry root yield, allowing higher genetic gains in clone selection. However, higher predictive ability for genotypic values did not result in differences in breeding value predictions between additive and additive-dominant genetic models. G-BLUP with the classical additive-dominant genetic model had the best predictive ability and bias estimates for fresh and dry root yield. For dry matter content, the highest predictive ability was obtained by G-BLUP with the additive genetic model. Dry matter content exhibited the highest heritability, predictive ability, and bias estimates compared with other traits. The prediction methods showed similar selection gains with approximately 67% of the phenotypic selection gain. By shortening the breeding cycle time by 40%, genomic selection may overcome phenotypic selection by 10%, 13%, and 18% for fresh root yield, dry root yield, and dry matter content, respectively, with a selection proportion of 15%. The most suitable genetic model for each trait allows for genomic selection optimization in cassava with high selection gains, thereby accelerating the release of new varieties.
Collapse
Affiliation(s)
| | | | - Marnin Wolfe
- Department of Crop, Soil and Environment Sciences, Auburn University, Auburn, AL, United States
| | - Jean-Luc Jannink
- Section on Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell University, Ithaca, NY, United States
- United States Department of Agriculture – Agriculture Research Service, Plant, Soil and Nutrition Research, Ithaca, NY, United States
| | - Marcos Deon Vilela de Resende
- Department of Forestry Engineering, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
- Embrapa Florestas, Colombo, Paraná, Brazil
- Department of Statistics, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
| | | | | |
Collapse
|
5
|
DoVale JC, Carvalho HF, Sabadin F, Fritsche-Neto R. Genotyping marker density and prediction models effects in long-term breeding schemes of cross-pollinated crops. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:4523-4539. [PMID: 36261658 DOI: 10.1007/s00122-022-04236-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 10/09/2022] [Indexed: 06/16/2023]
Abstract
In genomic recurrent selection, the more markers, the better because they buffer the linkage disequilibrium losses caused by recombination over cycles, and consequently, provide higher responses to selection. Reductions of genotyping marker density have been extensively evaluated as potential strategies to reduce the genotyping costs of genomic selection (GS). Low-density marker panels are appealing in GS because they entail lower multicollinearity and computing time and allow more individuals to be genotyped for the same cost. However, statistical models used in GS are usually evaluated with empirical data, using "static" training sets and populations. This may be adequate for making predictions during a breeding program's initial cycles but not for the long-term. Moreover, studies that focus on long selective breeding cycles generally do not consider GS models with the effect of dominance, which is particularly important for breeding outcomes in cross-pollinated crops. Hence, dominance effects are important and unexplored in GS for long-term programs involving allogamous species. To address it, we employed two approaches: analysis of empirical maize datasets and simulations of long-term breeding applying phenotypic and genomic recurrent selection (intrapopulation and reciprocal schemes). In both schemes, we simulated twenty breeding cycles and assessed the effect of marker density reduction on the population mean, the best crosses, additive variance, selective accuracy, and response to selection with models [additive, additive-dominant, general (GCA), and this plus specific combining ability (GCA + SCA)]. Our results indicate that marker reduction based on linkage disequilibrium levels provides useful predictions only within a cycle, as accuracy significantly decreases over cycles. In the long-term, without training set updating, high-marker density provides the best responses to selection. The model to be used depends on the breeding scheme: additive for intrapopulation and additive-dominant or GCA + SCA for reciprocal.
Collapse
Affiliation(s)
- Júlio César DoVale
- Department of Crop Science, Federal University of Ceará, Fortaleza, CE, Brazil.
| | | | - Felipe Sabadin
- Virginia Tech: Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | | |
Collapse
|
6
|
Simeão RM, Resende MDV, Alves RS, Pessoa-Filho M, Azevedo ALS, Jones CS, Pereira JF, Machado JC. Genomic Selection in Tropical Forage Grasses: Current Status and Future Applications. FRONTIERS IN PLANT SCIENCE 2021; 12:665195. [PMID: 33995461 PMCID: PMC8120112 DOI: 10.3389/fpls.2021.665195] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 04/06/2021] [Indexed: 05/06/2023]
Abstract
The world population is expected to be larger and wealthier over the next few decades and will require more animal products, such as milk and beef. Tropical regions have great potential to meet this growing global demand, where pasturelands play a major role in supporting increased animal production. Better forage is required in consonance with improved sustainability as the planted area should not increase and larger areas cultivated with one or a few forage species should be avoided. Although, conventional tropical forage breeding has successfully released well-adapted and high-yielding cultivars over the last few decades, genetic gains from these programs have been low in view of the growing food demand worldwide. To guarantee their future impact on livestock production, breeding programs should leverage genotyping, phenotyping, and envirotyping strategies to increase genetic gains. Genomic selection (GS) and genome-wide association studies play a primary role in this process, with the advantage of increasing genetic gain due to greater selection accuracy, reduced cycle time, and increased number of individuals that can be evaluated. This strategy provides solutions to bottlenecks faced by conventional breeding methods, including long breeding cycles and difficulties to evaluate complex traits. Initial results from implementing GS in tropical forage grasses (TFGs) are promising with notable improvements over phenotypic selection alone. However, the practical impact of GS in TFG breeding programs remains unclear. The development of appropriately sized training populations is essential for the evaluation and validation of selection markers based on estimated breeding values. Large panels of single-nucleotide polymorphism markers in different tropical forage species are required for multiple application targets at a reduced cost. In this context, this review highlights the current challenges, achievements, availability, and development of genomic resources and statistical methods for the implementation of GS in TFGs. Additionally, the prediction accuracies from recent experiments and the potential to harness diversity from genebanks are discussed. Although, GS in TFGs is still incipient, the advances in genomic tools and statistical models will speed up its implementation in the foreseeable future. All TFG breeding programs should be prepared for these changes.
Collapse
Affiliation(s)
| | | | - Rodrigo S. Alves
- Instituto Nacional de Ciência e Tecnologia do Café, Universidade Federal de Viçosa, Viçosa, Brazil
| | | | | | - Chris S. Jones
- International Livestock Research Institute, Nairobi, Kenya
| | | | | |
Collapse
|
7
|
Frizzarin M, Gormley IC, Berry DP, Murphy TB, Casa A, Lynch A, McParland S. Predicting cow milk quality traits from routinely available milk spectra using statistical machine learning methods. J Dairy Sci 2021; 104:7438-7447. [PMID: 33865578 DOI: 10.3168/jds.2020-19576] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 03/09/2021] [Indexed: 11/19/2022]
Abstract
Numerous statistical machine learning methods suitable for application to highly correlated features, as those that exist for spectral data, could potentially improve prediction performance over the commonly used partial least squares approach. Milk samples from 622 individual cows with known detailed protein composition and technological trait data accompanied by mid-infrared spectra were available to assess the predictive ability of different regression and classification algorithms. The regression-based approaches were partial least squares regression (PLSR), ridge regression (RR), least absolute shrinkage and selection operator (LASSO), elastic net, principal component regression, projection pursuit regression, spike and slab regression, random forests, boosting decision trees, neural networks (NN), and a post-hoc approach of model averaging (MA). Several classification methods (i.e., partial least squares discriminant analysis (PLSDA), random forests, boosting decision trees, and support vector machines (SVM)) were also used after stratifying the traits of interest into categories. In the regression analyses, MA was the best prediction method for 6 of the 14 traits investigated [curd firmness at 60 min, αS1-casein (CN), αS2-CN, κ-CN, α-lactalbumin, and β-lactoglobulin B], whereas NN and RR were the best algorithms for 3 traits each (rennet coagulation time, curd-firming time, and heat stability, and curd firmness at 30 min, β-CN, and β-lactoglobulin A, respectively), PLSR was best for pH, and LASSO was best for CN micelle size. When traits were divided into 2 classes, SVM had the greatest accuracy for the majority of the traits investigated. Although the well-established PLSR-based method performed competitively, the application of statistical machine learning methods for regression analyses reduced the root mean square error compared with PLSR from between 0.18% (κ-CN) to 3.67% (heat stability). The use of modern statistical machine learning methods for trait prediction from mid-infrared spectroscopy may improve the prediction accuracy for some traits.
Collapse
Affiliation(s)
- M Frizzarin
- School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland; Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork, P61 P302 Ireland
| | - I C Gormley
- School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland
| | - D P Berry
- Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork, P61 P302 Ireland
| | - T B Murphy
- School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland
| | - A Casa
- School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland
| | - A Lynch
- School of Mathematics and Statistics, University College Dublin, Belfield, Dublin 4, Ireland
| | - S McParland
- Teagasc, Animal & Grassland Research and Innovation Centre, Moorepark, Fermoy, Co. Cork, P61 P302 Ireland.
| |
Collapse
|
8
|
Baral K, Coulman B, Biligetu B, Fu YB. Advancing crested wheatgrass [Agropyron cristatum (L.) Gaertn.] breeding through genotyping-by-sequencing and genomic selection. PLoS One 2020; 15:e0239609. [PMID: 33031422 PMCID: PMC7544028 DOI: 10.1371/journal.pone.0239609] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 09/09/2020] [Indexed: 11/18/2022] Open
Abstract
Crested wheatgrass [Agropyron cristatum (L.) Gaertn.] provides high quality, highly palatable forage for early season grazing. Genetic improvement of crested wheatgrass has been challenged by its complex genome, outcrossing nature, long breeding cycle, and lack of informative molecular markers. Genomic selection (GS) has potential for improving traits of perennial forage species, and genotyping-by-sequencing (GBS) has enabled the development of genome-wide markers in non-model polyploid plants. An attempt was made to explore the utility of GBS and GS in crested wheatgrass breeding. Sequencing and phenotyping 325 genotypes representing 10 diverse breeding lines were performed. Bioinformatics analysis identified 827, 3,616, 14,090 and 46,136 single nucleotide polymorphism markers at 20%, 30%, 40% and 50% missing marker levels, respectively. Four GS models (BayesA, BayesB, BayesCπ, and rrBLUP) were examined for the accuracy of predicting nine agro-morphological and three nutritive value traits. Moderate accuracy (0.20 to 0.32) was obtained for the prediction of heading days, leaf width, plant height, clump diameter, tillers per plant and early spring vigor for genotypes evaluated at Saskatoon, Canada. Similar accuracy (0.29 to 0.35) was obtained for predicting fall regrowth and plant height for genotypes evaluated at Swift Current, Canada. The Bayesian models displayed similar or higher accuracy than rrBLUP. These findings show the feasibility of GS application for a non-model species to advance plant breeding.
Collapse
Affiliation(s)
- Kiran Baral
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Bruce Coulman
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Bill Biligetu
- Department of Plant Sciences, College of Agriculture and Bioresources, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Yong-Bi Fu
- Plant Gene Resources of Canada, Saskatoon Research and Development Centre, Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
9
|
Costa-Neto G, Fritsche-Neto R, Crossa J. Nonlinear kernels, dominance, and envirotyping data increase the accuracy of genome-based prediction in multi-environment trials. Heredity (Edinb) 2020; 126:92-106. [PMID: 32855544 PMCID: PMC7852533 DOI: 10.1038/s41437-020-00353-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Revised: 07/29/2020] [Accepted: 07/30/2020] [Indexed: 01/15/2023] Open
Abstract
Modern whole-genome prediction (WGP) frameworks that focus on multi-environment trials (MET) integrate large-scale genomics, phenomics, and envirotyping data. However, the more complex the statistical model, the longer the computational processing times, which do not always result in accuracy gains. We investigated the use of new kernel methods and modeling structures involving genomics and nongenomic sources of variation in two MET maize data sets. Five WGP models were considered, advancing in complexity from a main-effect additive model (A) to more complex structures, including dominance deviations (D), genotype × environment interaction (AE and DE), and the reaction-norm model using environmental covariables (W) and their interaction with A and D (AW + DW). A combination of those models built with three different kernel methods, Gaussian kernel (GK), Deep kernel (DK), and the benchmark genomic best linear-unbiased predictor (GBLUP/GB), was tested under three prediction scenarios: newly developed hybrids (CV1), sparse MET conditions (CV2), and new environments (CV0). GK and DK outperformed GB in prediction accuracy and reduction of computation time (~up to 20%) under all model-kernel scenarios. GK was more efficient in capturing the variation due to A + AE and D + DE effects and translated it into accuracy gains (~up to 85% compared with GB). DK provided more consistent predictions, even for more complex structures such as W + AW + DW. Our results suggest that DK and GK are more efficient in translating model complexity into accuracy, and more suitable for including dominance and reaction-norm effects in a biologically accurate and faster way.
Collapse
Affiliation(s)
- Germano Costa-Neto
- Department of Genetics, "Luiz de Queiroz" Agriculture College, University of São Paulo, São Paulo, Brazil
| | - Roberto Fritsche-Neto
- Department of Genetics, "Luiz de Queiroz" Agriculture College, University of São Paulo, São Paulo, Brazil
| | - José Crossa
- Biometrics and Statistics Unit, Genetic Resources Program, and Global Wheat Program, International Maize and Wheat Improvement Center (CIMMYT), Mexico, Mexico.
| |
Collapse
|
10
|
Stock J, Bennewitz J, Hinrichs D, Wellmann R. A Review of Genomic Models for the Analysis of Livestock Crossbred Data. Front Genet 2020; 11:568. [PMID: 32670349 PMCID: PMC7332767 DOI: 10.3389/fgene.2020.00568] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 05/11/2020] [Indexed: 12/11/2022] Open
Abstract
Livestock breeding has shifted during the past decade toward genomic selection. For the estimation of breeding values in purebred breeding schemes, genomic best linear unbiased prediction has become the method of choice. Systematic crossbreeding with the aim to utilize heterosis and breed complementary effects is widely used in livestock breeding, especially in pig and poultry breeding. The goal is to improve the performance of the crossbred animals. Due to genotype-by-environment interactions, imperfect linkage disequilibrium, and the existence of dominance and imprinting, purebred and crossbred performances are not perfectly correlated. Hence, more complex genomic models are required for crossbred populations. This study reviews and compares such models. Compared to purebred genomic models, the reviewed models were of much higher complexity due to the inclusion of dominance effects, breed-specific effects, imprinting effects, and the joint evaluation of purebred and crossbred performance data. With the model assessment work conducted until now, it is not possible to come to a clear recommendation as to which existing method is most suitable for a specific breeding program and a specific genetic trait architecture. Since it is expected that a superior method includes all the different genetic effects in a single model, a dominance model with imprinting and breed-specific SNP effects is proposed. Further progress could be made by assuming realistic covariance structures between the genetic effects of the different breeding lines, and by using larger marker panels and mixture distributions for the SNP effects.
Collapse
Affiliation(s)
- Joana Stock
- Department of Animal Breeding and Genetics, Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
| | - Jörn Bennewitz
- Department of Animal Breeding and Genetics, Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
| | - Dirk Hinrichs
- Department of Animal Breeding, University of Kassel, Witzenhausen, Germany
| | - Robin Wellmann
- Department of Animal Breeding and Genetics, Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
| |
Collapse
|
11
|
de Andrade LRB, Sousa MBE, Oliveira EJ, de Resende MDV, Azevedo CF. Cassava yield traits predicted by genomic selection methods. PLoS One 2019; 14:e0224920. [PMID: 31725759 PMCID: PMC6855463 DOI: 10.1371/journal.pone.0224920] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/24/2019] [Indexed: 01/01/2023] Open
Abstract
Genomic selection (GS) has been used to optimize genetic gains when phenotypic selection is considered costly and difficult to measure. The objective of this work was to evaluate the efficiency and consistency of GS prediction for cassava yield traits (Manihot esculenta Crantz) using different methods, taking into account the effect of population structure. BLUPs and deregressed BLUPs were obtained for 888 cassava accessions and evaluated for fresh root yield, dry root yield and dry matter content in roots in 21 trials conducted from 2011 to 2016. The deregressed BLUPs obtained for the accessions from a 48K single nucleotide polymorphism dataset were used for genomic predictions based on the BayesB, BLASSO, RR-BLUP, G-BLUP and RKHS methods. The accessions’ BLUPs were used in the validation step using four cross-validation strategies, taking into account population structure and different GS methods. Similar estimates of predictive ability and bias were identified for the different genomic selection methods in the first cross-validation strategy. Lower predictive ability was observed for fresh root yield (0.4569 –RR-BLUP to 0.4756—RKHS) and dry root yield (0.4689 –G-BLUP to 0.4818—RKHS) in comparison with dry matter content (0.5655 –BLASSO to 0.5670 –RKHS). However, the RKHS method exhibited higher efficiency and consistency in most of the validation scenarios in terms of prediction ability for fresh root yield and dry root yield. The correlations of the genomic estimated breeding values between the genomic selection methods were quite high (0.99–1.00), resulting in high coincidence of clone selection regardless of the genomic selection method. The deviance analyses within and between the validation clusters formed by the discriminant analysis of principal components were significant for all traits. Therefore, this study indicated that i) the prediction of dry matter content was more accurate compared to that of yield traits, possibly as a result of the smaller influence of non-additive genetic effects; ii) the RKHS method resulted in high and stable prediction ability in most of the validation scenarios; and iii) some kinship between the validation and training populations is desirable in order for genomic selection to succeed due to the significant effect of population structure on genomic selection predictions.
Collapse
Affiliation(s)
| | - Massaine Bandeira e Sousa
- Center of Agrarian, Environmental and Biological Sciences, Universidade Federal do Recôncavo da Bahia, Cruz das Almas, Bahia, Brazil
| | | | - Marcos Deon Vilela de Resende
- Department of Forestry Engineering, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
- Embrapa Florestas, Colombo, Paraná, Brazil
- Department of Statistics, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
| | | |
Collapse
|
12
|
Ballesta P, Maldonado C, Pérez-Rodríguez P, Mora F. SNP and Haplotype-Based Genomic Selection of Quantitative Traits in Eucalyptus globulus. PLANTS 2019; 8:plants8090331. [PMID: 31492041 PMCID: PMC6783840 DOI: 10.3390/plants8090331] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2019] [Revised: 09/02/2019] [Accepted: 09/03/2019] [Indexed: 01/02/2023]
Abstract
Eucalyptus globulus (Labill.) is one of the most important cultivated eucalypts in temperate and subtropical regions and has been successfully subjected to intensive breeding. In this study, Bayesian genomic models that include the effects of haplotype and single nucleotide polymorphisms (SNP) were assessed to predict quantitative traits related to wood quality and tree growth in a 6-year-old breeding population. To this end, the following markers were considered: (a) ~14 K SNP markers (SNP), (b) ~3 K haplotypes (HAP), and (c) haplotypes and SNPs that were not assigned to a haplotype (HAP-SNP). Predictive ability values (PA) were dependent on the genomic prediction models and markers. On average, Bayesian ridge regression (BRR) and Bayes C had the highest PA for the majority of traits. Notably, genomic models that included the haplotype effect (either HAP or HAP-SNP) significantly increased the PA of low-heritability traits. For instance, BRR based on HAP had the highest PA (0.58) for stem straightness. Consistently, the heritability estimates from genomic models were higher than the pedigree-based estimates for these traits. The results provide additional perspectives for the implementation of genomic selection in Eucalyptus breeding programs, which could be especially beneficial for improving traits with low heritability.
Collapse
Affiliation(s)
- Paulina Ballesta
- Institute of Biological Sciences, University of Talca, 2 Norte 685, Talca 3460000, Chile.
| | - Carlos Maldonado
- Institute of Biological Sciences, University of Talca, 2 Norte 685, Talca 3460000, Chile.
| | - Paulino Pérez-Rodríguez
- Colegio de Postgraduados, Statistics and Computer Sciences, Montecillos, Edo. de México 56230, Mexico.
| | - Freddy Mora
- Institute of Biological Sciences, University of Talca, 2 Norte 685, Talca 3460000, Chile.
| |
Collapse
|
13
|
Zhu H, Zhang Y, Bai Y, Yang H, Yan H, Liu J, Shi L, Song X, Li L, Dong S, Pan C, Lan X, Qu L. Relationship between SNPs of POU1F1 Gene and Litter Size and Growth Traits in Shaanbei White Cashmere Goats. Animals (Basel) 2019; 9:ani9030114. [PMID: 30934610 PMCID: PMC6466355 DOI: 10.3390/ani9030114] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 03/09/2019] [Accepted: 03/20/2019] [Indexed: 02/07/2023] Open
Abstract
POU (Pit-Oct-Unc) class 1 homeobox 1 (POU1F1, or Pit-1) is a transcription factor that directly regulates pituitary hormone-related genes, as well as affects the reproduction and growth in mammals. Thus, POU1F1 gene was investigated as a candidate gene for litter size and growth performance in goats. In the current study, using direct DNA sequencing, c.682G > T, c.723T > G and c.837T > C loci were genotyped in Shaanbei white cashmere (SBWC) goats (n = 609), but c.876 + 110T > C was monomorphic. Besides, the c.682G > T locus was first identified by HinfI (Haemophilus influenzae Rf) restriction endonuclease. Association analysis results showed that the c.682G > T, c.837T > C loci and diplotypes were significantly associated with goat litter size (p < 0.05). The positive genotypes were GT and TT for the two SNPs, respectively, and the optimal diplotype was H3H7 (GTTT-TTTT). On the other hand, the c.682G > T, c.723T > G and c.837T > C strongly affected growth traits and body measurement indexes in SBWC goats (p < 0.05). The positive genotypes or allele of these SNPs were GT, G and TT, respectively. Additionally, the goats with H3H7 diplotype also had a greater growth status than others (p < 0.05). Here, individuals with same genotype had both a better litter size and growth traits, showing a positive correlation between these economic traits. Meanwhile, the positive genotypes of four SNPs were combined to obtain the optimal diplotype, which was also H3H7. These SNPs, especially the diplotype, could be used for the genomic selection of excellent individuals with a greater litter size and better growth status in goat breeding.
Collapse
Affiliation(s)
- Haijing Zhu
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Yanghai Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China.
| | - Yangyang Bai
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Han Yang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China.
| | - Hailong Yan
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Jinwang Liu
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Lei Shi
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Xiaoyue Song
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Longping Li
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Shuwei Dong
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| | - Chuanying Pan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China.
| | - Xianyong Lan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China.
| | - Lei Qu
- Shaanxi Provincial Engineering and Technology Research Center of Cashmere Goats, Yulin University, Yulin 719000, China.
- Life Science Research Center, Yulin University, Yulin 719000, China.
| |
Collapse
|
14
|
Liu X, Wang H, Hu X, Li K, Liu Z, Wu Y, Huang C. Improving Genomic Selection With Quantitative Trait Loci and Nonadditive Effects Revealed by Empirical Evidence in Maize. FRONTIERS IN PLANT SCIENCE 2019; 10:1129. [PMID: 31620155 PMCID: PMC6759780 DOI: 10.3389/fpls.2019.01129] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 08/15/2019] [Indexed: 05/20/2023]
Abstract
Genomic selection (GS), a tool developed for molecular breeding, is used by plant breeders to improve breeding efficacy by shortening the breeding cycle and to facilitate the selection of candidate lines for creating hybrids without phenotyping in various environments. Association and linkage mapping have been widely used to explore and detect candidate genes in order to understand the genetic mechanisms of quantitative traits. In the current study, phenotypic and genotypic data from three experimental populations, including data on six agronomic traits (e.g., plant height, ear height, ear length, ear diameter, grain yield per plant, and hundred-kernel weight), were used to evaluate the effect of trait-relevant markers (TRMs) on prediction accuracy estimation. Integrating information from mapping into a statistical model can efficiently improve prediction performance compared with using stochastically selected markers to perform GS. The prediction accuracy can reach plateau when a total of 500-1,000 TRMs are utilized in GS. The prediction accuracy can be significantly enhanced by including nonadditive effects and TRMs in the GS model when genotypic data with high proportions of heterozygous alleles and complex agronomic traits with high proportion of nonadditive variancein phenotypic variance are used to perform GS. In addition, taking information on population structure into account can slightly improve prediction performance when the genetic relationship between the training and testing sets is influenced by population stratification due to different allele frequencies. In conclusion, GS is a useful approach for prescreening candidate lines, and the empirical evidence provided by the current study for TRMs and nonadditive effects can inform plant breeding and in turn contribute to the improvement of selection efficiency in practical GS-assisted breeding programs.
Collapse
|
15
|
Granato I, Cuevas J, Luna-Vázquez F, Crossa J, Montesinos-López O, Burgueño J, Fritsche-Neto R. BGGE: A New Package for Genomic-Enabled Prediction Incorporating Genotype × Environment Interaction Models. G3 (BETHESDA, MD.) 2018; 8:3039-3047. [PMID: 30049744 PMCID: PMC6118304 DOI: 10.1534/g3.118.200435] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 07/23/2018] [Indexed: 12/14/2022]
Abstract
One of the major issues in plant breeding is the occurrence of genotype × environment (GE) interaction. Several models have been created to understand this phenomenon and explore it. In the genomic era, several models were employed to improve selection by using markers and account for GE interaction simultaneously. Some of these models use special genetic covariance matrices. In addition, the scale of multi-environment trials is getting larger, and this increases the computational challenges. In this context, we propose an R package that, in general, allows building GE genomic covariance matrices and fitting linear mixed models, in particular, to a few genomic GE models. Here we propose two functions: one to prepare the genomic kernels accounting for the genomic GE and another to perform genomic prediction using a Bayesian linear mixed model. A specific treatment is given for sparse covariance matrices, in particular, to block diagonal matrices that are present in some GE models in order to decrease the computational demand. In empirical comparisons with Bayesian Genomic Linear Regression (BGLR), accuracies and the mean squared error were similar; however, the computational time was up to five times lower than when using the classic approach. Bayesian Genomic Genotype × Environment Interaction (BGGE) is a fast, efficient option for creating genomic GE kernels and making genomic predictions.
Collapse
Affiliation(s)
- Italo Granato
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| | - Jaime Cuevas
- Universidad de Quintana Roo, Chetumal, Quintana Roo, México
| | | | - Jose Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, México DF, México
| | | | - Juan Burgueño
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, México DF, México
| | - Roberto Fritsche-Neto
- Department of Genetics, "Luiz de Queiroz" College of Agriculture, University of São Paulo, Piracicaba, São Paulo, Brazil
| |
Collapse
|
16
|
Varona L, Legarra A, Toro MA, Vitezica ZG. Non-additive Effects in Genomic Selection. Front Genet 2018; 9:78. [PMID: 29559995 PMCID: PMC5845743 DOI: 10.3389/fgene.2018.00078] [Citation(s) in RCA: 97] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2017] [Accepted: 02/19/2018] [Indexed: 12/02/2022] Open
Abstract
In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection.
Collapse
Affiliation(s)
- Luis Varona
- Departamento de Anatomía, Embriología y Genética Animal, Universidad de Zaragoza, Zaragoza, Spain.,Instituto Agroalimentario de Aragón (IA2), Zaragoza, Spain
| | - Andres Legarra
- Génétique Physiologie et Systèmes d'Elevage (GenPhySE), Institut National de la Recherche Agronomique de Toulouse, Castanet-Tolosan, France
| | - Miguel A Toro
- Departamento Producción Agraria, ETS Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
| | - Zulma G Vitezica
- Génétique Physiologie et Systèmes d'Elevage (GenPhySE), Université de Toulouse, Castanet-Tolosan, France
| |
Collapse
|
17
|
Dias KODG, Gezan SA, Guimarães CT, Nazarian A, da Costa E Silva L, Parentoni SN, de Oliveira Guimarães PE, de Oliveira Anoni C, Pádua JMV, de Oliveira Pinto M, Noda RW, Ribeiro CAG, de Magalhães JV, Garcia AAF, de Souza JC, Guimarães LJM, Pastina MM. Improving accuracies of genomic predictions for drought tolerance in maize by joint modeling of additive and dominance effects in multi-environment trials. Heredity (Edinb) 2018; 121:24-37. [PMID: 29472694 DOI: 10.1038/s41437-018-0053-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Revised: 11/04/2017] [Accepted: 12/15/2017] [Indexed: 01/16/2023] Open
Abstract
Breeding for drought tolerance is a challenging task that requires costly, extensive, and precise phenotyping. Genomic selection (GS) can be used to maximize selection efficiency and the genetic gains in maize (Zea mays L.) breeding programs for drought tolerance. Here, we evaluated the accuracy of genomic selection (GS) using additive (A) and additive + dominance (AD) models to predict the performance of untested maize single-cross hybrids for drought tolerance in multi-environment trials. Phenotypic data of five drought tolerance traits were measured in 308 hybrids along eight trials under water-stressed (WS) and well-watered (WW) conditions over two years and two locations in Brazil. Hybrids' genotypes were inferred based on their parents' genotypes (inbred lines) using single-nucleotide polymorphism markers obtained via genotyping-by-sequencing. GS analyses were performed using genomic best linear unbiased prediction by fitting a factor analytic (FA) multiplicative mixed model. Two cross-validation (CV) schemes were tested: CV1 and CV2. The FA framework allowed for investigating the stability of additive and dominance effects across environments, as well as the additive-by-environment and the dominance-by-environment interactions, with interesting applications for parental and hybrid selection. Results showed differences in the predictive accuracy between A and AD models, using both CV1 and CV2, for the five traits in both water conditions. For grain yield (GY) under WS and using CV1, the AD model doubled the predictive accuracy in comparison to the A model. Through CV2, GS models benefit from borrowing information of correlated trials, resulting in an increase of 40% and 9% in the predictive accuracy of GY under WS for A and AD models, respectively. These results highlight the importance of multi-environment trial analyses using GS models that incorporate additive and dominance effects for genomic predictions of GY under drought in maize single-cross hybrids.
Collapse
Affiliation(s)
- Kaio Olímpio Das Graças Dias
- Departamento de Biologia, Universidade Federal de Lavras, Lavras, MG, Brazil.,Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz", Universidade de São Paulo, Piracicaba, SP, Brazil
| | | | | | - Alireza Nazarian
- School of Forest Resources & Conservation, University of Florida, Gainesville, FL, USA
| | | | | | | | - Carina de Oliveira Anoni
- Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz", Universidade de São Paulo, Piracicaba, SP, Brazil
| | | | | | | | | | | | - Antonio Augusto Franco Garcia
- Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz", Universidade de São Paulo, Piracicaba, SP, Brazil
| | | | | | | |
Collapse
|
18
|
El-Dien OG, Ratcliffe B, Klápště J, Porth I, Chen C, El-Kassaby YA. Multienvironment genomic variance decomposition analysis of open-pollinated Interior spruce ( Picea glauca x engelmannii). MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2018; 38:26. [PMID: 29491726 PMCID: PMC5814545 DOI: 10.1007/s11032-018-0784-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2017] [Accepted: 01/29/2018] [Indexed: 05/29/2023]
Abstract
The advantages of open-pollinated (OP) family testing over controlled crossing (i.e., structured pedigree) are the potential to screen and rank a large number of parents and offspring with minimal cost and efforts; however, the method produces inflated genetic parameters as the actual sibling relatedness within OP families rarely meets the half-sib relatedness assumption. Here, we demonstrate the unsurpassed utility of OP testing after shifting the analytical mode from pedigree- (ABLUP) to genomic-based (GBLUP) relationship using phenotypic tree height (HT) and wood density (WD) and genotypic (30k SNPs) data for 1126 38-year-old Interior spruce (Picea glauca (Moench) Voss x P. engelmannii Parry ex Engelm.) trees, representing 25 OP families, growing on three sites in Interior British Columbia, Canada. The use of the genomic realized relationship permitted genetic variance decomposition to additive, dominance, and epistatic genetic variances, and their interactions with the environment, producing more accurate narrow-sense heritability and breeding value estimates as compared to the pedigree-based counterpart. The impact of retaining (random folding) vs. removing (family folding) genetic similarity between the training and validation populations on the predictive accuracy of genomic selection was illustrated and highlighted the former caveats and latter advantages. Moreover, GBLUP models allowed breeding value prediction for individuals from families that were not included in the developed models, which was not possible with the ABLUP. Response to selection differences between the ABLUP and GBLUP models indicated the presence of systematic genetic gain overestimation of 35 and 63% for HT and WD, respectively, mainly caused by the inflated estimates of additive genetic variance and individuals' breeding values given by the ABLUP models. Extending the OP genomic-based models from single to multisite made the analysis applicable to existing OP testing programs.
Collapse
Affiliation(s)
- Omnia Gamal El-Dien
- Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Pharmacognosy Department, Faculty of Pharmacy, Alexandria University, Alexandria, Egypt
| | - Blaise Ratcliffe
- Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| | - Jaroslav Klápště
- Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Department of Genetics and Physiology of Forest Trees, Faculty of Forestry and Wood Sciences, Czech University of Life Sciences Prague, Kamycka 129, 165 21 Prague 6, Czech Republic
- Present Address: Scion (New Zealand Forest Research Institute Ltd.), 49 Sala Street, Whakarewarewa, Rotorua, 3046 New Zealand
| | - Ilga Porth
- Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
- Present Address: Départment des Sciences du Bois et de la Forêt, Faculté de Foresterie, de Géographie et Géomatique, Université Laval, Quebec City, QC G1V 0A6 Canada
| | - Charles Chen
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK 74078-3035 USA
| | - Yousry A. El-Kassaby
- Department of Forest and Conservation Sciences, Faculty of Forestry, University of British Columbia, Vancouver, BC V6T 1Z4 Canada
| |
Collapse
|
19
|
Lopes FB, Wu XL, Li H, Xu J, Perkins T, Genho J, Ferretti R, Tait RG, Bauck S, Rosa GJM. Improving accuracy of genomic prediction in Brangus cattle by adding animals with imputed low-density SNP genotypes. J Anim Breed Genet 2018; 135:14-27. [PMID: 29345073 DOI: 10.1111/jbg.12312] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Accepted: 12/04/2017] [Indexed: 11/27/2022]
Abstract
Reliable genomic prediction of breeding values for quantitative traits requires the availability of sufficient number of animals with genotypes and phenotypes in the training set. As of 31 October 2016, there were 3,797 Brangus animals with genotypes and phenotypes. These Brangus animals were genotyped using different commercial SNP chips. Of them, the largest group consisted of 1,535 animals genotyped by the GGP-LDV4 SNP chip. The remaining 2,262 genotypes were imputed to the SNP content of the GGP-LDV4 chip, so that the number of animals available for training the genomic prediction models was more than doubled. The present study showed that the pooling of animals with both original or imputed 40K SNP genotypes substantially increased genomic prediction accuracies on the ten traits. By supplementing imputed genotypes, the relative gains in genomic prediction accuracies on estimated breeding values (EBV) were from 12.60% to 31.27%, and the relative gain in genomic prediction accuracies on de-regressed EBV was slightly small (i.e. 0.87%-18.75%). The present study also compared the performance of five genomic prediction models and two cross-validation methods. The five genomic models predicted EBV and de-regressed EBV of the ten traits similarly well. Of the two cross-validation methods, leave-one-out cross-validation maximized the number of animals at the stage of training for genomic prediction. Genomic prediction accuracy (GPA) on the ten quantitative traits was validated in 1,106 newly genotyped Brangus animals based on the SNP effects estimated in the previous set of 3,797 Brangus animals, and they were slightly lower than GPA in the original data. The present study was the first to leverage currently available genotype and phenotype resources in order to harness genomic prediction in Brangus beef cattle.
Collapse
Affiliation(s)
- F B Lopes
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA.,Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - X-L Wu
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA.,Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - H Li
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA.,Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - J Xu
- Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA.,Department of Statistics, University of Nebraska, Lincoln, NE, USA
| | - T Perkins
- International Brangus Breeders Association, San Antonio, TX, USA
| | - J Genho
- Livestock Genetic Services LLC, Woodville, VA, USA
| | - R Ferretti
- Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - R G Tait
- Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - S Bauck
- Biostatistics and Bioinformatics, GeneSeek (A Neogen Company), Lincoln, NE, USA
| | - G J M Rosa
- Department of Animal Sciences, University of Wisconsin, Madison, WI, USA
| |
Collapse
|
20
|
Viana JMS, Mundim GB, Pereira HD, Andrade ACB, e Silva FF. Efficiency of genome-wide association studies in random cross populations. MOLECULAR BREEDING 2017; 37:102. [PMID: 0 DOI: 10.1007/s11032-017-0703-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
|
21
|
de Azevedo Peixoto L, Laviola BG, Alves AA, Rosado TB, Bhering LL. Breeding Jatropha curcas by genomic selection: A pilot assessment of the accuracy of predictive models. PLoS One 2017; 12:e0173368. [PMID: 28296913 PMCID: PMC5351973 DOI: 10.1371/journal.pone.0173368] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 02/20/2017] [Indexed: 11/19/2022] Open
Abstract
Genomic wide selection is a promising approach for improving the selection accuracy in plant breeding, particularly in species with long life cycles, such as Jatropha. Therefore, the objectives of this study were to estimate the genetic parameters for grain yield (GY) and the weight of 100 seeds (W100S) using restricted maximum likelihood (REML); to compare the performance of GWS methods to predict GY and W100S; and to estimate how many markers are needed to train the GWS model to obtain the maximum accuracy. Eight GWS models were compared in terms of predictive ability. The impact that the marker density had on the predictive ability was investigated using a varying number of markers, from 2 to 1,248. Because the genetic variance between evaluated genotypes was significant, it was possible to obtain selection gain. All of the GWS methods tested in this study can be used to predict GY and W100S in Jatropha. A training model fitted using 1,000 and 800 markers is sufficient to capture the maximum genetic variance and, consequently, maximum prediction ability of GY and W100S, respectively. This study demonstrated the applicability of genome-wide prediction to identify useful genetic sources of GY and W100S for Jatropha breeding. Further research is needed to confirm the applicability of the proposed approach to other complex traits.
Collapse
Affiliation(s)
| | - Bruno Galvêas Laviola
- Empresa Brasileira de Pesquisa Agropecuária, Embrapa Agroenergia, Parque Estação Biológica–PqEB s/n, Asa Norte, Brasília, Brazil
| | - Alexandre Alonso Alves
- Empresa Brasileira de Pesquisa Agropecuária, Embrapa Agroenergia, Parque Estação Biológica–PqEB s/n, Asa Norte, Brasília, Brazil
| | - Tatiana Barbosa Rosado
- Empresa Brasileira de Pesquisa Agropecuária, Embrapa Agroenergia, Parque Estação Biológica–PqEB s/n, Asa Norte, Brasília, Brazil
| | | |
Collapse
|
22
|
Li Y, Hawken R, Sapp R, George A, Lehnert S, Henshall J, Reverter A. Evaluation of non-additive genetic variation in feed-related traits of broiler chickens. Poult Sci 2017; 96:754-763. [DOI: 10.3382/ps/pew333] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Accepted: 08/11/2016] [Indexed: 12/29/2022] Open
|
23
|
|