1
Alemu A, Åstrand J, Montesinos-López OA, Isidro Y Sánchez J, Fernández-González J, Tadesse W, Vetukuri RR, Carlsson AS, Ceplitis A, Crossa J, Ortiz R, Chawade A. Genomic selection in plant breeding: Key factors shaping two decades of progress. MOLECULAR PLANT 2024; 17:552-578. [PMID: 38475993 DOI: 10.1016/j.molp.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2023] [Revised: 01/22/2024] [Accepted: 03/08/2024] [Indexed: 03/14/2024]
Abstract
Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP (theoretically reaching one when Pearson's correlation is used as the metric) remains an active research area, and accuracy is still far from optimal for various traits. We hypothesize that ultra-large genotypic and phenotypic datasets, effective training population optimization methods, and support from other omics approaches (transcriptomics, metabolomics and proteomics), coupled with deep-learning algorithms, could overcome current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.
Affiliation(s)
- Admas Alemu
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
- Johanna Åstrand
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden; Lantmännen Lantbruk, Svalöv, Sweden
- Julio Isidro Y Sánchez
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
- Javier Fernández-González
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
- Wuletaw Tadesse
  - International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
- Ramesh R Vetukuri
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
- Anders S Carlsson
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
- José Crossa
  - International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco, México 52640, Mexico
- Rodomiro Ortiz
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
- Aakash Chawade
  - Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden

2
Fernández-González J, Haquin B, Combes E, Bernard K, Allard A, Isidro Y Sánchez J. Maximizing efficiency in sunflower breeding through historical data optimization. PLANT METHODS 2024; 20:42. [PMID: 38493115 PMCID: PMC10943787 DOI: 10.1186/s13007-024-01151-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Accepted: 01/30/2024] [Indexed: 03/18/2024]
Abstract
Genomic selection (GS) has become an increasingly popular tool in plant breeding programs, propelled by declining genotyping costs, an increase in computational power, and rediscovery of the best linear unbiased prediction methodology over the past two decades. This development has led to an accumulation of extensive historical datasets with genotypic and phenotypic information, triggering the question of how to best utilize these datasets. Here, we investigate whether all available data or a subset should be used to calibrate GS models for across-year predictions in a 7-year dataset of a commercial hybrid sunflower breeding program. We employed a multi-objective optimization approach to determine the ideal years to include in the training set (TRS). Next, for a given combination of TRS years, we further optimized the TRS size and its genetic composition. We developed the Min_GRM size optimization method which consistently found the optimal TRS size, reducing dimensionality by 20% with an approximately 1% loss in predictive ability. Additionally, the Tails_GEGVs algorithm displayed potential, outperforming the use of all data by using just 60% of it for grain yield, a high-complexity, low-heritability trait. Moreover, maximizing the genetic diversity of the TRS resulted in a consistent predictive ability across the entire range of genotypic values in the test set. Interestingly, the Tails_GEGVs algorithm, due to its ability to leverage heterogeneity, enhanced predictive performance for key hybrids with extreme genotypic values. Our study provides new insights into the optimal utilization of historical data in plant breeding programs, resulting in improved GS model predictive ability.
Affiliation(s)
- Javier Fernández-González
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Campus de Montegancedo-UPM, Pozuelo de Alarcón, Madrid, 28223, Spain
- Julio Isidro Y Sánchez
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Universidad Politécnica de Madrid (UPM), Campus de Montegancedo-UPM, Pozuelo de Alarcón, Madrid, 28223, Spain

3
Lorenzi A, Bauland C, Pin S, Madur D, Combes V, Palaffre C, Guillaume C, Touzy G, Mary-Huard T, Charcosset A, Moreau L. Portability of genomic predictions trained on sparse factorial designs across two maize silage breeding cycles. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:75. [PMID: 38453705 DOI: 10.1007/s00122-024-04566-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2023] [Accepted: 01/30/2024] [Indexed: 03/09/2024]
Abstract
KEY MESSAGE: We validated the efficiency of genomic predictions calibrated on sparse factorial training sets to predict the next generation of hybrids and tested different strategies for updating predictions along generations. Genomic selection offers new prospects for revisiting hybrid breeding schemes by replacing extensive phenotyping of individuals with genomic predictions. Finding the ideal design for training genomic prediction models is still an open question. Previous studies have shown promising predictive abilities using sparse factorial instead of tester-based training sets to predict single-cross hybrids from the same generation. This study aims to further investigate the use of factorials and their optimization to predict line general combining abilities (GCAs) and hybrid values across breeding cycles. It relies on two breeding cycles of a maize reciprocal genomic selection scheme involving multiparental connected reciprocal populations from flint and dent complementary heterotic groups selected for silage performances. Selection based on genomic predictions trained on a factorial design resulted in a significant genetic gain for dry matter yield in the new generation. Results confirmed the efficiency of sparse factorial training sets to predict candidate line GCAs and hybrid values across breeding cycles. Compared to a previous study based on the first generation, the advantage of factorial over tester training sets appeared lower across generations. Updating factorial training sets by adding single-cross hybrids between selected lines from the previous generation or a random subset of hybrids from the new generation both improved predictive abilities. The CDmean criterion helped determine the set of single-crosses to phenotype to update the training set efficiently. Our results validated the efficiency of sparse factorial designs for calibrating hybrid genomic prediction experimentally and showed the benefit of updating it along generations.
Affiliation(s)
- Alizarine Lorenzi
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
  - RAGT2n, Genetics and Analytics Unit, 12510, Druelle, France
- Cyril Bauland
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Sophie Pin
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Delphine Madur
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Valérie Combes
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Carine Palaffre
  - UE 0394 SMH, INRAE, 2297 Route de l'INRA, 40390, Saint-Martin-de-Hinx, France
- Gaëtan Touzy
  - RAGT2n, Genetics and Analytics Unit, 12510, Druelle, France
- Tristan Mary-Huard
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
  - Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France
- Alain Charcosset
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Laurence Moreau
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France

4
Hoque A, Anderson JV, Rahman M. Genomic prediction for agronomic traits in a diverse Flax (Linum usitatissimum L.) germplasm collection. Sci Rep 2024; 14:3196. [PMID: 38326469 PMCID: PMC10850546 DOI: 10.1038/s41598-024-53462-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2023] [Accepted: 01/31/2024] [Indexed: 02/09/2024]
Abstract
Breeding programs require exhaustive phenotyping of germplasm, which is time-consuming and expensive. Genomic prediction helps breeders harness the diversity of any collection to bypass phenotyping. Here, we examined the potential of genomic prediction for seed yield and nine agronomic traits using 26,171 single nucleotide polymorphism (SNP) markers in a set of 337 flax (Linum usitatissimum L.) accessions, phenotyped in five environments. We evaluated 14 prediction models and several factors affecting predictive ability based on cross-validation schemes. With the whole marker set, predictive ability varied significantly among models and across traits. The ridge regression (RR) model covering additive gene action yielded better predictive ability for most of the traits, whereas models capturing epistatic gene action yielded higher predictive ability for low-heritability traits. Marker subsets based on linkage disequilibrium decay distance gave significantly higher predictive abilities than the whole marker set, but for randomly selected markers, predictive ability reached a plateau above 3000 markers. Markers significantly associated with traits improved predictive abilities compared to the whole marker set when marker selection was made on the whole population instead of the training set, indicating clear overfitting. The correction for population structure did not increase predictive abilities compared to the whole collection. However, stratified sampling by picking representative genotypes from each cluster improved predictive abilities. The indirect predictive ability for a trait was proportionate to its correlation with other traits. These results will help breeders to select the best models, optimum marker set, and suitable genotype set to perform an indirect selection for quantitative traits in this diverse flax germplasm collection.
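As a rough illustration of the ridge regression model and the cross-validated predictive ability described in this abstract, the following Python sketch fits ridge regression on simulated marker data and reports predictive ability as Pearson's correlation between predicted and observed values. The simulated genotypes, the penalty `lam`, the number of folds, and the function names are illustrative assumptions, not the authors' settings or code.

```python
import numpy as np

def ridge_marker_effects(X, y, lam):
    """Ridge solution b = X'(XX' + lam*I)^-1 y (dual form, cheap when markers outnumber lines)."""
    n = X.shape[0]
    return X.T @ np.linalg.solve(X @ X.T + lam * np.eye(n), y)

def cv_predictive_ability(X, y, lam=1.0, k=5, seed=0):
    """k-fold cross-validation; returns Pearson's r between predictions and phenotypes."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    preds = np.empty(len(y))
    for test in folds:
        train = np.setdiff1d(np.arange(len(y)), test)
        # Center markers and phenotypes using the training fold only
        mu_x, mu_y = X[train].mean(axis=0), y[train].mean()
        b = ridge_marker_effects(X[train] - mu_x, y[train] - mu_y, lam)
        preds[test] = mu_y + (X[test] - mu_x) @ b
    return np.corrcoef(preds, y)[0, 1]

# Simulated example (hypothetical data): 300 lines, 50 markers coded 0/1/2
rng = np.random.default_rng(1)
X = rng.integers(0, 3, size=(300, 50)).astype(float)
y = X @ rng.normal(size=50) + rng.normal(scale=2.0, size=300)
print(round(cv_predictive_ability(X, y), 2))
```

Centering within each training fold keeps information from the test lines out of the model, which is the same concern the abstract raises about selecting markers on the whole population.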
Affiliation(s)
- Ahasanul Hoque
  - Department of Plant Sciences, North Dakota State University, Fargo, ND, USA
  - Department of Genetics and Plant Breeding, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
- James V Anderson
  - USDA-ARS, Edward T. Schafer Agricultural Research Center, Fargo, ND, USA
- Mukhlesur Rahman
  - Department of Plant Sciences, North Dakota State University, Fargo, ND, USA

5
Danguy des Déserts A, Durand N, Servin B, Goudemand-Dugué E, Alliot JM, Ruiz D, Charmet G, Elsen JM, Bouchet S. Comparison of genomic-enabled cross selection criteria for the improvement of inbred line breeding populations. G3 (BETHESDA, MD.) 2023; 13:jkad195. [PMID: 37625792 PMCID: PMC10627264 DOI: 10.1093/g3journal/jkad195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2023] [Revised: 03/15/2023] [Accepted: 08/22/2023] [Indexed: 08/27/2023]
Abstract
A crucial step in inbred plant breeding is the choice of mating design to derive high-performing inbred varieties while also maintaining a competitive breeding population to secure sufficient genetic gain in future generations. In practice, the mating design usually relies on crosses involving the best parental inbred lines to ensure high mean progeny performance. This excludes crosses involving lower performing but more complementary parents in terms of favorable alleles. We predicted the ability of crosses to produce putative outstanding progenies (high mean and high variance progeny distribution) using genomic prediction models. This study compared the benefits and drawbacks of 7 genomic cross selection criteria (CSC) in terms of genetic gain for 1 trait and genetic diversity in the next generation. Six CSC were already published, and we propose an improved CSC that can estimate the proportion of progeny above a threshold defined for the whole mating plan. We simulated mating designs optimized using different CSC. The 835 elite parents came from a real breeding program and were evaluated between 2000 and 2016. We applied constraints on parental contributions and genetic similarities between selected parents according to usual breeder practices. Our results showed that CSC based on progeny variance estimation increased the genetic value of superior progenies by up to 5% in the next generation compared to CSC based on the progeny mean estimation (i.e. parental genetic values) alone. It also increased the genetic gain (up to 4%) and/or maintained more genetic diversity at QTLs (up to 4% more genic variance when the marker effects were perfectly estimated).
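The idea of ranking crosses by the proportion of progeny above a threshold can be sketched as follows. This is a minimal illustration that assumes progeny genetic values are normally distributed with a predicted mean and standard deviation per cross; the cross names, numbers, and function names are hypothetical, and this is not the authors' criterion as implemented.

```python
from math import erf, sqrt

def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def proportion_above(mean, sd, threshold):
    """Expected proportion of progeny above the threshold under a normal assumption."""
    if sd <= 0.0:
        return 1.0 if mean > threshold else 0.0
    return 1.0 - normal_cdf((threshold - mean) / sd)

# Hypothetical predicted progeny mean and standard deviation for two crosses
crosses = {
    "elite x elite": (10.0, 1.0),
    "elite x complementary": (9.5, 2.0),
}
threshold = 12.0  # one threshold shared by the whole mating plan (assumed value)

for name, (mean, sd) in crosses.items():
    print(name, round(proportion_above(mean, sd, threshold), 4))
```

With these made-up numbers, the cross with the lower mean but larger progeny variance has the higher expected proportion of progeny above the threshold, which is the situation the abstract describes for complementary parents.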
Affiliation(s)
- Alice Danguy des Déserts
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 63000 Clermont-Ferrand, Puy de Dôme, Auvergne, France
  - INRAE-Université de Toulouse, UMR1388, GenPhySE, 31320 Castanet-Tolosan, Haute-Garonne, Occitanie, France
- Nicolas Durand
  - ENAC-Ecole Nationale de l'Aviation Civile, 31000 Toulouse, Haute-Garonne, Occitanie, France
- Bertrand Servin
  - INRAE-Université de Toulouse, UMR1388, GenPhySE, 31320 Castanet-Tolosan, Haute-Garonne, Occitanie, France
- Ellen Goudemand-Dugué
  - Florimond-Desprez Veuve & Fils SAS, 59242 Cappelle-en-Pévèle, Nord, Hauts-de-France, France
- Jean-Marc Alliot
  - IRIT-APO, Institut de recherche en informatique de Toulouse - Algorithmes Parallèles et Optimisation, 31000 Toulouse, Haute-Garonne, Occitanie, France
- Daniel Ruiz
  - INPT-ENSEEIHT, Institut National Polytechnique de Toulouse, École Nationale Supérieure d'Électrotechnique, d'Électronique, d'Informatique, d'Hydraulique et des Télécommunications, 31000 Toulouse, Haute-Garonne, Occitanie, France
- Gilles Charmet
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 63000 Clermont-Ferrand, Puy de Dôme, Auvergne, France
- Jean-Michel Elsen
  - INRAE-Université de Toulouse, UMR1388, GenPhySE, 31320 Castanet-Tolosan, Haute-Garonne, Occitanie, France
- Sophie Bouchet
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 63000 Clermont-Ferrand, Puy de Dôme, Auvergne, France

6
Fernández-González J, Akdemir D, Isidro Y Sánchez J. A comparison of methods for training population optimization in genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:30. [PMID: 36892603 PMCID: PMC9998580 DOI: 10.1007/s00122-023-04265-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/30/2022] [Accepted: 11/21/2022] [Indexed: 06/18/2023]
Abstract
Maximizing CDmean and Avg_GRM_self were the best criteria for training set optimization. A training set size of 50-55% (targeted) or 65-85% (untargeted) is needed to obtain 95% of the accuracy. With the advent of genomic selection (GS) as a widespread breeding tool, mechanisms to efficiently design an optimal training set for GS models became more relevant, since they allow maximizing the accuracy while minimizing the phenotyping costs. The literature describes many training set optimization methods, but a comprehensive comparison among them is lacking. This work aimed to provide an extensive benchmark of optimization methods and optimal training set sizes by testing a wide range of them in seven datasets covering six species, different genetic architectures, population structures, and heritabilities, and with several GS models, in order to provide guidelines for their application in breeding programs. Our results showed that targeted optimization (which uses information from the test set) performed better than untargeted optimization (which does not use test set data), especially when heritability was low. The mean coefficient of determination was the best targeted method, although it was computationally intensive. Minimizing the average relationship within the training set was the best strategy for untargeted optimization. Regarding the optimal training set size, maximum accuracy was obtained when the training set was the entire candidate set. Nevertheless, 50-55% of the candidate set was enough to reach 95-100% of the maximum accuracy in the targeted scenario, while 65-85% was needed for untargeted optimization. Our results also suggested that a diverse training set makes GS robust against population structure, while including clustering information was less effective. The choice of the GS model did not have a significant influence on the prediction accuracies.
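The untargeted strategy of minimizing the average relationship within the training set can be illustrated with a small Python sketch. The greedy search below is a simple heuristic chosen here for brevity and is not the optimizer used in the paper; the relationship matrix, the function names, and the training set size are hypothetical.

```python
import numpy as np

def mean_offdiag(G, idx):
    """Average relationship between distinct members of the set idx."""
    sub = G[np.ix_(idx, idx)]
    k = len(idx)
    return (sub.sum() - np.trace(sub)) / (k * (k - 1))

def greedy_diverse_training_set(G, size):
    """Greedily add the candidate least related, on average, to those already chosen."""
    selected = [int(np.argmin(G.mean(axis=1)))]
    remaining = [i for i in range(G.shape[0]) if i != selected[0]]
    while len(selected) < size:
        best = min(remaining, key=lambda i: G[i, selected].mean())
        selected.append(best)
        remaining.remove(best)
    return selected

# Hypothetical relationship matrix: two unrelated families of five lines each
G = np.kron(np.eye(2), np.full((5, 5), 0.8))
np.fill_diagonal(G, 1.0)

train = greedy_diverse_training_set(G, size=4)
print(sorted(train), round(mean_offdiag(G, train), 3))
```

In this toy case the selected set draws from both families rather than filling up from one, which is the kind of diverse training set the abstract reports as robust against population structure.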
Affiliation(s)
- Javier Fernández-González
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain
- Deniz Akdemir
  - CIBMTR (Center for International Blood and Marrow Transplant Research), National Marrow Donor Program/Be The Match, Minneapolis, USA
- Julio Isidro Y Sánchez
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain

7
Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:77-112. [PMID: 35451773 DOI: 10.1007/978-1-0716-2205-6_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 10/25/2022]
Abstract
The efficiency of genomic selection strongly depends on the prediction accuracy of the genetic merit of candidates. Numerous papers have shown that the composition of the calibration set is a key contributor to prediction accuracy. A poorly defined calibration set can result in low accuracies, whereas an optimized one can considerably increase accuracy compared to random sampling at the same size. Alternatively, optimizing the calibration set can be a way of decreasing the costs of phenotyping by enabling similar levels of accuracy compared to random sampling but with fewer phenotypic units. We present here the different factors that have to be considered when designing a calibration set, and review the different criteria proposed in the literature. We classified these criteria into two groups: model-free criteria based on relatedness, and criteria derived from the linear mixed model. We introduce criteria targeting specific prediction objectives including the prediction of highly diverse panels, biparental families, or hybrids. We also review different ways of updating the calibration set, and different procedures for optimizing phenotyping experimental designs.
8
Robert P, Auzanneau J, Goudemand E, Oury FX, Rolland B, Heumez E, Bouchet S, Le Gouis J, Rincent R. Phenomic selection in wheat breeding: identification and optimisation of factors influencing prediction accuracy and comparison to genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:895-914. [PMID: 34988629 DOI: 10.1007/s00122-021-04005-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 07/23/2021] [Accepted: 11/23/2021] [Indexed: 05/15/2023]
Abstract
Phenomic selection is a promising alternative or complement to genomic selection in wheat breeding. Models combining spectra from different environments maximise the predictive ability of grain yield and heading date of wheat breeding lines. Phenomic selection (PS) is a recent breeding approach similar to genomic selection (GS) except that genotyping is replaced by near-infrared (NIR) spectroscopy. PS can potentially account for non-additive effects and has the major advantage of being low cost and high throughput. Factors influencing GS predictive abilities have been intensively studied, but little is known about PS. We tested and compared the abilities of PS and GS to predict grain yield and heading date from several datasets of bread wheat lines corresponding to the first or second years of trial evaluation from two breeding companies and one research institute in France. We evaluated several factors affecting PS predictive abilities including the possibility of combining spectra collected in different environments. A simple H-BLUP model predicted both traits with prediction ability from 0.26 to 0.62 and with an efficient computation time. Our results showed that the environments in which lines are grown had a crucial impact on predictive ability based on the spectra acquired and was specific to the trait considered. Models combining NIR spectra from different environments were the best PS models and were at least as accurate as GS in most of the datasets. Furthermore, a GH-BLUP model combining genotyping and NIR spectra was the best model of all (prediction ability from 0.31 to 0.73). We demonstrated also that as for GS, the size and the composition of the training set have a crucial impact on predictive ability. PS could therefore replace or complement GS for efficient wheat breeding programs.
Affiliation(s)
- Pauline Robert
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
  - Agri-Obtentions, Ferme de Gauvilliers, 78660, Orsonville, France
  - Florimond-Desprez Veuve & Fils SAS, 3 rue Florimond-Desprez, BP 41, 59242, Cappelle-en-Pévèle, France
- Jérôme Auzanneau
  - Agri-Obtentions, Ferme de Gauvilliers, 78660, Orsonville, France
- Ellen Goudemand
  - Florimond-Desprez Veuve & Fils SAS, 3 rue Florimond-Desprez, BP 41, 59242, Cappelle-en-Pévèle, France
- François-Xavier Oury
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
- Bernard Rolland
  - INRAE-Agrocampus Ouest-Université Rennes 1, UMR1349, IGEPP, Domaine de la Motte, 35653, Le Rheu, France
- Emmanuel Heumez
  - INRAE, UE 972, Grandes Cultures Innovation Environnement, 2 Chaussée Brunehaut, 80200, Estrées-Mons, France
- Sophie Bouchet
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
- Jacques Le Gouis
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
- Renaud Rincent
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
  - INRAE-Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France

9
Rio S, Akdemir D, Carvalho T, Sánchez JIY. Assessment of genomic prediction reliability and optimization of experimental designs in multi-environment trials. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:405-419. [PMID: 34807267 PMCID: PMC8866390 DOI: 10.1007/s00122-021-03972-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 04/24/2021] [Accepted: 10/08/2021] [Indexed: 06/13/2023]
Abstract
New forms of the coefficient of determination can help to forecast the accuracy of genomic prediction and optimize experimental designs in multi-environment trials with genotype-by-environment interactions. In multi-environment trials, the relative performance of genotypes may vary depending on the environmental conditions, and this phenomenon is commonly referred to as genotype-by-environment interaction (G×E). With genomic prediction, G×E can be accounted for by modeling the genetic covariance between trials, even when the overall experimental design is highly unbalanced between trials, thanks to the genomic relationship between genotypes. In this study, we propose new forms of the coefficient of determination (CD, i.e., the expected model-based square correlation between a genetic value and its corresponding prediction) that can be used to forecast the genomic prediction reliability of genotypes, both for their trial-specific performance and their mean performance. As the expected prediction reliability based on these new CD criteria is generally a good approximation of the observed reliability, we demonstrate that they can be used to optimize multi-environment trials in the presence of G×E. In addition, this reliability may be highly variable between genotypes, especially in unbalanced designs with complex pedigree relationships between genotypes. Therefore, it can be useful for breeders to assess it before selecting genotypes based on their predicted genetic values. Using a wheat population evaluated both for simulated and phenology traits, and two maize populations evaluated for grain yield, we illustrate this approach and confirm the value of our new CD criteria.
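For orientation, the classical single-trial form of the CD can be sketched in Python. The sketch assumes a GBLUP model with an overall mean as the only fixed effect, one record per phenotyped genotype, and a known variance ratio `lam` (residual over genetic variance); it does not reproduce the new multi-environment forms proposed in the paper, and the matrix and names below are hypothetical.

```python
import numpy as np

def cd_per_genotype(G, phenotyped, lam):
    """CD_i = 1 - lam * C_ii / G_ii, with C = (Z'MZ + lam * G^-1)^-1.

    G          : n x n genomic relationship matrix (must be invertible)
    phenotyped : indices of genotypes that have one phenotypic record each
    lam        : ratio of residual to genetic variance (assumed known)
    """
    n, n_obs = G.shape[0], len(phenotyped)
    Z = np.zeros((n_obs, n))
    Z[np.arange(n_obs), phenotyped] = 1.0          # links records to genotypes
    X = np.ones((n_obs, 1))                        # overall mean only
    M = np.eye(n_obs) - X @ np.linalg.solve(X.T @ X, X.T)  # projects out the mean
    C = np.linalg.inv(Z.T @ M @ Z + lam * np.linalg.inv(G))
    return 1.0 - lam * np.diag(C) / np.diag(G)

# Hypothetical example: genotypes 0 and 1 are related, genotype 2 is unrelated
G = np.array([[1.0, 0.5, 0.0],
              [0.5, 1.0, 0.0],
              [0.0, 0.0, 1.0]])
print(np.round(cd_per_genotype(G, phenotyped=[0, 2], lam=1.0), 3))
```

Genotype 1 has no record in this example, so any expected reliability it receives comes only through its relationship with genotype 0, which is why the CD can differ widely between genotypes in unbalanced designs.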
Affiliation(s)
- Simon Rio
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
- Deniz Akdemir
  - CIBMTR (Center for International Blood and Marrow Transplant Research), National Marrow Donor Program/Be The Match, Minneapolis, MN, USA
- Tiago Carvalho
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain
- Julio Isidro y Sánchez
  - Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Pozuelo de Alarcón, Madrid, Spain

10
Sandhu KS, Merrick LF, Sankaran S, Zhang Z, Carter AH. Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs. Front Genet 2022. [PMCID: PMC8814369 DOI: 10.3389/fgene.2021.829131] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Indexed: 11/13/2022]
Abstract
The last decade witnessed an unprecedented increase in the adoption of genomic selection (GS) and phenomics tools in plant breeding programs, especially in major cereal crops. GS has demonstrated the potential for selecting superior genotypes with high precision and accelerating the breeding cycle. Phenomics is a rapidly advancing domain that aims to alleviate phenotyping bottlenecks and explores new large-scale phenotyping and data acquisition methods. In this review, we discuss the lessons learned from GS and phenomics in six self-pollinated crops, primarily rice, wheat, soybean, common bean, chickpea, and groundnut, and discuss their implementation schemes after assessing their impact on breeding programs. The status of the adoption of genomics and phenomics is provided for those crops, with a complete GS overview. GS's progress until 2020 is discussed in detail, and relevant information and links to the source codes are provided for implementing this technology in plant breeding programs, with most of the examples drawn from wheat breeding programs. Detailed information about various phenotyping tools is provided to strengthen the field of phenomics for plant breeders in the coming years. Finally, we highlight the benefits of merging genomic selection, phenomics, and machine and deep learning, which have produced extraordinary results in recent years in wheat, rice, and soybean. Hence, there is potential for adopting these technologies in crops like the common bean, chickpea, and groundnut. The adoption of phenomics and GS in different breeding programs will accelerate genetic gain, which would have an impact on food security given the need to feed an ever-growing population.
Affiliation(s)
- Karansher S. Sandhu
  - Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
  - *Correspondence: Karansher S. Sandhu
- Lance F. Merrick
  - Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
- Sindhuja Sankaran
  - Department of Biological System Engineering, Washington State University, Pullman, WA, United States
- Zhiwu Zhang
  - Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
- Arron H. Carter
  - Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
11
Paux E, Lafarge S, Balfourier F, Derory J, Charmet G, Alaux M, Perchet G, Bondoux M, Baret F, Barillot R, Ravel C, Sourdille P, Le Gouis J. Breeding for Economically and Environmentally Sustainable Wheat Varieties: An Integrated Approach from Genomics to Selection. BIOLOGY 2022; 11:149. [PMID: 35053148 PMCID: PMC8773325 DOI: 10.3390/biology11010149] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 12/17/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 12/21/2022]
Abstract
There is currently a strong societal demand for sustainability, quality, and safety in bread wheat production. To address these challenges, new and innovative knowledge, resources, tools, and methods to facilitate breeding are needed. This starts with the development of high throughput genomic tools including single nucleotide polymorphism (SNP) arrays, high density molecular marker maps, and full genome sequences. Such powerful tools are essential to perform genome-wide association studies (GWAS), to implement genomic and phenomic selection, and to characterize the worldwide diversity. This is also useful to breeders to broaden the genetic basis of elite varieties through the introduction of novel sources of genetic diversity. Improvement in varieties particularly relies on the detection of genomic regions involved in agronomical traits including tolerance to biotic (diseases and pests) and abiotic (drought, nutrient deficiency, high temperature) stresses. When enough resolution is achieved, this can result in the identification of candidate genes that could further be characterized to identify relevant alleles. Breeding must also now be approached through in silico modeling to simulate plant development, investigate genotype × environment interactions, and introduce marker-trait linkage information in the models to better implement genomic selection. Breeders must be aware of new developments and the information must be made available to the world wheat community to develop new high-yielding varieties that can meet the challenge of higher wheat production in a sustainable and fluctuating agricultural context. In this review, we compiled all knowledge and tools produced during the BREEDWHEAT project to show how they may contribute to face this challenge in the coming years.
Affiliation(s)
- Etienne Paux
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Stéphane Lafarge
  - Limagrain, Chappes Research Center, Route d’Ennezat, 63720 Chappes, France
- François Balfourier
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Jérémy Derory
  - Limagrain, Chappes Research Center, Route d’Ennezat, 63720 Chappes, France
- Gilles Charmet
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Michael Alaux
  - Université Paris-Saclay—INRAE, URGI, 78026 Versailles, France
  - Université Paris-Saclay—INRAE, BioinfOmics, Plant Bioinformatics Facility, 78026 Versailles, France
- Geoffrey Perchet
  - Vegepolys Valley, Maison du Végétal, 26 Rue Jean Dixmeras, 49066 Angers, France
- Marion Bondoux
  - INRAE—Transfert, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Frédéric Baret
  - UMR EMMAH, INRAE—Université d’Avignon et des Pays de Vaucluse, 84914 Avignon, France
- Catherine Ravel
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Pierre Sourdille
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Jacques Le Gouis
  - UMR GDEC Genetics, Diversity & Ecophysiology of Cereals, INRAE—Université Clermont-Auvergne, 5, Chemin de Beaulieu, 63000 Clermont-Ferrand, France
12
Bartholomé J, Prakash PT, Cobb JN. Genomic Prediction: Progress and Perspectives for Rice Improvement. Methods Mol Biol 2022; 2467:569-617. [PMID: 35451791 DOI: 10.1007/978-1-0716-2205-6_21] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Indexed: 06/14/2023]
Abstract
Genomic prediction can be a powerful tool to achieve greater rates of genetic gain for quantitative traits if thoroughly integrated into a breeding strategy. In rice as in other crops, the interest in genomic prediction is very strong with a number of studies addressing multiple aspects of its use, ranging from the more conceptual to the more practical. In this chapter, we review the literature on rice (Oryza sativa) and summarize important considerations for the integration of genomic prediction in breeding programs. The irrigated breeding program at the International Rice Research Institute is used as a concrete example on which we provide data and R scripts to reproduce the analysis but also to highlight practical challenges regarding the use of predictions. The adage "To someone with a hammer, everything looks like a nail" describes a common psychological pitfall that sometimes plagues the integration and application of new technologies to a discipline. We have designed this chapter to help rice breeders avoid that pitfall and appreciate the benefits and limitations of applying genomic prediction, as it is not always the best approach nor the first step to increasing the rate of genetic gain in every context.
Affiliation(s)
- Jérôme Bartholomé
  - CIRAD, UMR AGAP Institut, Montpellier, France.
  - AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France.
  - Rice Breeding Platform, International Rice Research Institute, Manila, Philippines.
13
Crossa J, Montesinos-López OA, Pérez-Rodríguez P, Costa-Neto G, Fritsche-Neto R, Ortiz R, Martini JWR, Lillemo M, Montesinos-López A, Jarquin D, Breseghello F, Cuevas J, Rincent R. Genome and Environment Based Prediction Models and Methods of Complex Traits Incorporating Genotype × Environment Interaction. Methods Mol Biol 2022; 2467:245-283. [PMID: 35451779 DOI: 10.1007/978-1-0716-2205-6_9] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Indexed: 06/14/2023]
Abstract
Genomic-enabled prediction models are of paramount importance for the successful implementation of genomic selection (GS) based on breeding values. As opposed to animal breeding, plant breeding includes extensive multienvironment and multiyear field trial data. Hence, genomic-enabled prediction models should include genotype × environment (G × E) interaction, which most of the time increases the prediction performance when the responses of lines differ from environment to environment. In this chapter, we describe a historical timeline since 2012 of advances in GS models that take G × E interaction into account. We describe theoretical and practical aspects of those GS models, including the gains in prediction performance when including G × E structures for both complex continuous and categorical scale traits. We then detail and explain the main G × E genomic prediction models for complex traits measured on continuous and noncontinuous (categorical) scales. In relation to G × E interaction models, this review also examines the analysis of information generated from high-throughput phenotype data (phenomics) and the joint analysis of multitrait and multienvironment field trial data, which is also employed in the general assessment of multitrait G × E interaction. The inclusion of nongenomic data to increase the accuracy and biological reliability of the G × E approach is also outlined. We show the recent advances in large-scale envirotyping (enviromics), and how mechanistic computational modeling can derive the crop growth and development aspects useful for predicting phenotypes and explaining G × E.
Affiliation(s)
- José Crossa
  - International Maize and Wheat Improvement Center (CIMMYT), Carretera México-Veracruz, Mexico
  - Colegio de Postgraduados, Montecillos, Mexico
- Germano Costa-Neto
  - Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz" (ESALQ/USP), São Paulo, Brazil
- Roberto Fritsche-Neto
  - Departamento de Genética, Escola Superior de Agricultura "Luiz de Queiroz" (ESALQ/USP), São Paulo, Brazil
- Rodomiro Ortiz
  - Department of Plant Breeding, Swedish University of Agricultural Sciences (SLU), Alnarp, Sweden
- Johannes W R Martini
  - International Maize and Wheat Improvement Center (CIMMYT), Carretera México-Veracruz, Mexico
- Morten Lillemo
  - Department of Plant Sciences, Norwegian University of Life Sciences, IHA/CIGENE, Ås, Norway
- Abelardo Montesinos-López
  - Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, Guadalajara, Jalisco, Mexico
- Jaime Cuevas
  - Universidad de Quintana Roo, Chetumal, Quintana Roo, Mexico.
- Renaud Rincent
  - Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution - Le Moulon, Gif-sur-Yvette, France.
14
Michel S, Löschenberger F, Ametz C, Bürstmayr H. Genomic selection of parents and crosses beyond the native gene pool of a breeding program. THE PLANT GENOME 2021; 14:e20153. [PMID: 34651462 DOI: 10.1002/tpg2.20153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/15/2021] [Accepted: 08/03/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection has become a valuable tool for selecting cultivar candidates in many plant breeding programs. Genomic selection of elite parents and crossing combinations with germplasm developed outside a breeding program has, however, hardly been explored until now. The aim of this study was to assess the potential of this method for commonly ranking and selecting elite germplasm developed within and beyond a given breeding program. A winter wheat (Triticum aestivum L.) population consisting of 611 in-house and 87 externally developed lines was used to compare training population compositions and statistical models for genomically predicting baking quality in this framework. Augmenting training populations with lines from other breeding programs had a larger influence on the prediction ability than adding in-house generated lines when aiming to commonly rank both germplasm sets. Exploiting preexisting information on secondary correlated traits likewise resulted in more accurate predictions, both in empirical analyses and in simulations. Genotyping germplasm developed beyond a given breeding program is moreover a convenient way to clarify its relationships with a breeder's own germplasm, because pedigree information is oftentimes not available for this purpose. Genomic predictions can thus support more informed diversity management, especially when integrating simple-to-phenotype correlated traits to partly circumvent resource reallocations for the costly phenotyping of germplasm from other programs.
Affiliation(s)
- Sebastian Michel
  - Dep. of Agrobiotechnology, IFA-Tulln, Univ. of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430 Tulln, Austria
- Christian Ametz
  - Saatzucht Donau GesmbH & CoKG, Saatzuchtstrasse 11, 2301 Probstdorf, Austria
- Hermann Bürstmayr
  - Dep. of Agrobiotechnology, IFA-Tulln, Univ. of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430 Tulln, Austria
15
Development of a highly efficient ion-ozone cavitation technology for accelerated bread production. Sci Rep 2021; 11:19129. [PMID: 34580334 PMCID: PMC8476621 DOI: 10.1038/s41598-021-98341-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/17/2021] [Accepted: 09/06/2021] [Indexed: 11/17/2022]
Abstract
The bakery market is one of the largest in Kazakhstan. Manufacturers of bread products are in dire need of intensive technologies for improving product quality and safety. This article presents the results of research to develop a technology for accelerated production of bread with ion-ozone cavitation treatment. The influence of various modes of exposure to ion-ozone cavitation has been investigated. After baking, bread samples were examined for organoleptic, physicochemical, rheological and microbiological indicators. The optimal method is treatment with ion-ozone at a concentration of 0.0025 units/mg, at a pressure of 1.0 atm for 1 min. This mode was shown to accelerate dough preparation and shorten the fermentation time, and the baked bread showed improved qualitative and quantitative indicators relative to the control method. The results showed that the ion-ozone technology reduces the length of the process of making dough and bread by three times compared to traditional technologies. The developed products will differ from existing analogues in the Kazakhstan market in their high taste and consumer properties, product safety, long shelf life and low cost.
16
Isidro y Sánchez J, Akdemir D. Training Set Optimization for Sparse Phenotyping in Genomic Selection: A Conceptual Overview. FRONTIERS IN PLANT SCIENCE 2021; 12:715910. [PMID: 34589099 PMCID: PMC8475495 DOI: 10.3389/fpls.2021.715910] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 05/27/2021] [Accepted: 08/10/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection (GS) is becoming an essential tool in breeding programs due to its role in increasing genetic gain per unit time. The design of the training set (TRS) in GS is one of the key steps in the implementation of GS in plant and animal breeding programs mainly because (i) TRS optimization is critical for the efficiency and effectiveness of GS, (ii) breeders test genotypes in multi-year and multi-location trials to select the best-performing ones. In this framework, TRS optimization can help to decrease the number of genotypes to be tested and, therefore, reduce phenotyping cost and time, and (iii) we can obtain better prediction accuracies from optimally selected TRS than an arbitrary TRS. Here, we concentrate the efforts on reviewing the lessons learned from TRS optimization studies and their impact on crop breeding and discuss important features for the success of TRS optimization under different scenarios. In this article, we review the lessons learned from training population optimization in plants and the major challenges associated with the optimization of GS including population size, the relationship between training and test set (TS), update of TRS, and the use of different packages and algorithms for TRS implementation in GS. Finally, we describe general guidelines to improving the rate of genetic improvement by maximizing the use of the TRS optimization in the GS framework.
Affiliation(s)
- Julio Isidro y Sánchez
  - Centro de Biotecnologia y Genómica de Plantas, Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria, Universidad Politécnica de Madrid, Campus de Montegancedo, Madrid, Spain
- Deniz Akdemir
  - Animal and Crop Science Division, Agriculture and Food Science Centre, University College Dublin, Dublin, Ireland
17
Michel S, Löschenberger F, Ametz C, Bürstmayr H. Genotyping crossing parents and family bulks can facilitate cost-efficient genomic prediction strategies in small-scale line breeding programs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:1575-1586. [PMID: 33638651 PMCID: PMC8081688 DOI: 10.1007/s00122-021-03794-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/26/2020] [Accepted: 02/16/2021] [Indexed: 06/12/2023]
Abstract
Genomic relationship matrices based on mid-parent and family bulk genotypes represent cost-efficient alternatives to full genomic prediction approaches with individually genotyped early generation selection candidates. The routine usage of genomic selection for improving line varieties has gained an increasing popularity in recent years. Harnessing the benefits of this approach can, however, be too costly for many small-scale breeding programs, as in most genomic breeding strategies several hundred or even thousands of lines have to be genotyped each year. The aim of this study was thus to compare a full genomic prediction strategy using individually genotyped selection candidates with genomic predictions based on genotypes obtained from pooled DNA of progeny families as well as genotypes inferred from crossing parents. A population of 722 wheat lines representing 63 families tested in more than 100 multi-environment trials during 2010-2019 was for this purpose employed to conduct an empirical study, which was supplemented by a simulation with genotypic data from further 3855 lines. A similar or higher prediction ability was achieved for grain yield, protein yield, and the protein content when using mid-parent or family bulk genotypes in comparison with pedigree selection in the empirical across family prediction scenario. The difference of these methods with a full genomic prediction strategy became furthermore marginal if pre-existing phenotypic data of the selection candidates was already available. Similar observations were made in the simulation, where the usage of individually genotyped lines or family bulks was generally preferable with smaller family sizes. The proposed methods can thus be regarded as alternatives to full genomic or pedigree selection strategies, especially when pedigree information is limited like in the exchange of germplasm between breeding programs.
Affiliation(s)
- Sebastian Michel
  - Department of Agrobiotechnology, IFA-Tulln, University of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430, Tulln, Austria.
- Christian Ametz
  - Saatzucht Donau GesmbH. & CoKG, Saatzuchtstrasse 11, 2301, Probstdorf, Austria
- Hermann Bürstmayr
  - Department of Agrobiotechnology, IFA-Tulln, University of Natural Resources and Life Sciences Vienna, Konrad-Lorenz-Str. 20, 3430, Tulln, Austria
18
Pégard M, Segura V, Muñoz F, Bastien C, Jorge V, Sanchez L. Favorable Conditions for Genomic Evaluation to Outperform Classical Pedigree Evaluation Highlighted by a Proof-of-Concept Study in Poplar. FRONTIERS IN PLANT SCIENCE 2020; 11:581954. [PMID: 33193528 PMCID: PMC7655903 DOI: 10.3389/fpls.2020.581954] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 07/10/2020] [Accepted: 09/22/2020] [Indexed: 06/11/2023]
Abstract
Forest trees like poplar are particular in many ways compared to other domesticated species. They have long juvenile phases, ongoing crop-wild gene flow, extensive outcrossing, and slow growth. All these particularities tend to make the conduct of breeding programs and evaluation stages costly in both time and resources. Perennials like trees are therefore good candidates for the implementation of genomic selection (GS), which is a good way to accelerate the breeding process by unchaining selection from phenotypic evaluation without affecting precision. In this study, we compared GS to traditional pedigree-based evaluation and assessed the conditions under which genomic evaluation outperforms classical pedigree evaluation. Several conditions were evaluated, such as the constitution of the training population by cross-validation and the implementation of multi-trait, single-trait, additive and non-additive models with different estimation methods (G-BLUP or weighted G-BLUP). Finally, the impact of marker densification was tested through four marker density sets. The population under study corresponds to a pedigree of 24 parents and 1,011 offspring, structured into 35 full-sib families. Four evaluation batches were planted in the same location and seven traits were evaluated on 1- and 2-year-old trees. The quality of prediction was reported by the accuracy, the Spearman rank correlation and the prediction bias, and tested with a cross-validation and an independent individual test set. Our results show that genomic evaluation performance could be comparable to the already well-optimized pedigree-based evaluation under certain conditions. Genomic evaluation appeared to be advantageous when using an independent test set and a set of less precise phenotypes. Genome-based methods showed advantages over pedigree counterparts when ranking candidates at the within-family level, for most of the families. Our study also showed that looking at ranking criteria such as the Spearman rank correlation can reveal benefits of genomic selection hidden by biased predictions.
Affiliation(s)
- Vincent Segura
  - BioForA, INRA, ONF, Orléans, France
  - AGAP, Univ Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France