1
|
Rincent R, Solin J, Lorenzi A, Nunes L, Griveau Y, Pirus L, Kermarrec D, Bauland C, Reymond M, Moreau L. Using phenomic selection to predict hybrid values with NIR spectra measured on the parental lines: proof of concept on maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2025; 138:28. [PMID: 39797978 PMCID: PMC11724800 DOI: 10.1007/s00122-024-04809-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Accepted: 12/21/2024] [Indexed: 01/13/2025]
Abstract
KEY MESSAGE Phenomic selection based on parental spectra can be used to predict GCA and SCA in a sparse factorial design. Prediction approaches such as genomic selection can be game changers in hybrid breeding. They allow predicting the genetic values of hybrids without the need for their physical production. This leads to significant reductions in breeding cycle length, and so to the increase in genetic progress. However, these methods are often underutilized in breeding programs due to the substantial cost involved in genotyping thousands of candidate parental lines annually. To address this limitation, we propose a cost-effective alternative based on phenomic selection, where genotyping of parental lines is replaced by NIR spectroscopy. Standard prediction models are then applied for genomic and phenomic selection, using similarity matrices derived from either genotyping data (genomic selection) or NIR spectral data (phenomic selection). Our hypothesis is that the chemical composition of parental tissues captured by NIRS reflects the genetic similarity between parental lines. We evaluated both strategies using a sparse factorial design, whose hybrids have been phenotyped in a multi-environment trial network, and with NIR spectra acquired on the parental lines on two independent environments. Both genomic and phenomic prediction approaches demonstrated moderate-to-high predictive abilities across various cross-validation scenarios. Our results also showcase the capability of phenomic selection to predict Mendelian sampling. This study serves as a proof of concept that low-cost high-throughput phenomics of parental lines can effectively be used to predict maize hybrids in independent trials. This paves the way for widespread adoption of prediction approaches at the very first stages of hybrid breeding, benefiting both major and orphan species.
Collapse
Affiliation(s)
- Renaud Rincent
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France.
| | - Junita Solin
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| | | | - Laura Nunes
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| | - Yves Griveau
- Institut Jean-Pierre Bourgin (IJPB), Université Paris-Saclay, INRAE, AgroParisTech, Versailles, France
| | | | - Dominique Kermarrec
- INRAE, Unité Expérimentale Ressources Génétiques Végétales en Conditions Océaniques (UERGCO), Kéraïber, 29260, Ploudaniel, France
| | - Cyril Bauland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| | - Matthieu Reymond
- Institut Jean-Pierre Bourgin (IJPB), Université Paris-Saclay, INRAE, AgroParisTech, Versailles, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| |
Collapse
|
2
|
Roscher-Ehrig L, Weber SE, Abbadi A, Malenica M, Abel S, Hemker R, Snowdon RJ, Wittkop B, Stahl A. Phenomic Selection for Hybrid Rapeseed Breeding. PLANT PHENOMICS (WASHINGTON, D.C.) 2024; 6:0215. [PMID: 39049840 PMCID: PMC11268845 DOI: 10.34133/plantphenomics.0215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 06/19/2024] [Indexed: 07/27/2024]
Abstract
Phenomic selection is a recent approach suggested as a low-cost, high-throughput alternative to genomic selection. Instead of using genetic markers, it employs spectral data to predict complex traits using equivalent statistical models. Phenomic selection has been shown to outperform genomic selection when using spectral data that was obtained within the same generation as the traits that were predicted. However, for hybrid breeding, the key question is whether spectral data from parental genotypes can be used to effectively predict traits in the hybrid generation. Here, we aimed to evaluate the potential of phenomic selection for hybrid rapeseed breeding. We performed predictions for various traits in a structured population of 410 test hybrids, grown in multiple environments, using near-infrared spectroscopy data obtained from harvested seeds of both the hybrids and their parental lines with different linear and nonlinear models. We found that phenomic selection within the hybrid generation outperformed genomic selection for seed yield and plant height, even when spectral data was collected at single locations, while being less affected by population structure. Furthermore, we demonstrate that phenomic prediction across generations is feasible, and selecting hybrids based on spectral data obtained from parental genotypes is competitive with genomic selection. We conclude that phenomic selection is a promising approach for rapeseed breeding that can be easily implemented without any additional costs or efforts as near-infrared spectroscopy is routinely assessed in rapeseed breeding.
Collapse
Affiliation(s)
| | - Sven E. Weber
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | | | | | | | | | - Rod J. Snowdon
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Benjamin Wittkop
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants,
Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| |
Collapse
|
3
|
Melchinger AE, Fernando R, Melchinger AJ, Schön CC. Optimizing selection based on BLUPs or BLUEs in multiple sets of genotypes differing in their population parameters. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:104. [PMID: 38622324 PMCID: PMC11018695 DOI: 10.1007/s00122-024-04592-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 03/05/2024] [Indexed: 04/17/2024]
Abstract
KEY MESSAGE Selection response in truncation selection across multiple sets of candidates hinges on their post-selection proportions, which can deviate grossly from their initial proportions. For BLUPs, using a uniform threshold for all candidates maximizes the selection response, irrespective of differences in population parameters. Plant breeding programs typically involve multiple families from either the same or different populations, varying in means, genetic variances and prediction accuracy of BLUPs or BLUEs for true genetic values (TGVs) of candidates. We extend the classical breeder's equation for truncation selection from single to multiple sets of genotypes, indicating that the expected overall selection response ( Δ G Tot ) for TGVs depends on the selection response within individual sets and their post-selection proportions. For BLUEs, we show that maximizingΔ G Tot requires thresholds optimally tailored for each set, contingent on their population parameters. For BLUPs, we prove thatΔ G Tot is maximized by applying a uniform threshold across all candidates from all sets. We provide explicit formulas for the origin of the selected candidates from different sets and show that their proportions before and after selection can differ substantially, especially for sets with inferior properties and low proportion. We discuss implications of these results for (a) optimum allocation of resources to training and prediction sets and (b) the need to counteract narrowing the genetic variation under genomic selection. For genomic selection of hybrids based on BLUPs of GCA of their parent lines, selecting distinct proportions in the two parent populations can be advantageous, if these differ substantially in the variance and/or prediction accuracy of GCA. Our study sheds light on the complex interplay of selection thresholds and population parameters for the selection response in plant breeding programs, offering insights into the effective resource management and prudent application of genomic selection for improved crop development.
Collapse
Affiliation(s)
- Albrecht E Melchinger
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany.
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany.
| | - Rohan Fernando
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | | | - Chris-Carolin Schön
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| |
Collapse
|
4
|
Lorenzi A, Bauland C, Pin S, Madur D, Combes V, Palaffre C, Guillaume C, Touzy G, Mary-Huard T, Charcosset A, Moreau L. Portability of genomic predictions trained on sparse factorial designs across two maize silage breeding cycles. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:75. [PMID: 38453705 PMCID: PMC11341662 DOI: 10.1007/s00122-024-04566-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 01/30/2024] [Indexed: 03/09/2024]
Abstract
KEY MESSAGE We validated the efficiency of genomic predictions calibrated on sparse factorial training sets to predict the next generation of hybrids and tested different strategies for updating predictions along generations. Genomic selection offers new prospects for revisiting hybrid breeding schemes by replacing extensive phenotyping of individuals with genomic predictions. Finding the ideal design for training genomic prediction models is still an open question. Previous studies have shown promising predictive abilities using sparse factorial instead of tester-based training sets to predict single-cross hybrids from the same generation. This study aims to further investigate the use of factorials and their optimization to predict line general combining abilities (GCAs) and hybrid values across breeding cycles. It relies on two breeding cycles of a maize reciprocal genomic selection scheme involving multiparental connected reciprocal populations from flint and dent complementary heterotic groups selected for silage performances. Selection based on genomic predictions trained on a factorial design resulted in a significant genetic gain for dry matter yield in the new generation. Results confirmed the efficiency of sparse factorial training sets to predict candidate line GCAs and hybrid values across breeding cycles. Compared to a previous study based on the first generation, the advantage of factorial over tester training sets appeared lower across generations. Updating factorial training sets by adding single-cross hybrids between selected lines from the previous generation or a random subset of hybrids from the new generation both improved predictive abilities. The CDmean criterion helped determine the set of single-crosses to phenotype to update the training set efficiently. Our results validated the efficiency of sparse factorial designs for calibrating hybrid genomic prediction experimentally and showed the benefit of updating it along generations.
Collapse
Affiliation(s)
- Alizarine Lorenzi
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- RAGT2n, Genetics and Analytics Unit, 12510, Druelle, France
| | - Cyril Bauland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Sophie Pin
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Delphine Madur
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Valérie Combes
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Carine Palaffre
- UE 0394 SMH, INRAE, 2297 Route de l'INRA, 40390, Saint-Martin-de-Hinx, France
| | | | - Gaëtan Touzy
- RAGT2n, Genetics and Analytics Unit, 12510, Druelle, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
- Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France
| | - Alain Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution (GQE) - Le Moulon, 91190, Gif-Sur-Yvette, France.
| |
Collapse
|
5
|
Fritsche-Neto R, Ali J, De Asis EJ, Allahgholipour M, Labroo MR. Improving hybrid rice breeding programs via stochastic simulations: number of parents, number of hybrids, tester update, and genomic prediction of hybrid performance. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 137:3. [PMID: 38085288 PMCID: PMC10716074 DOI: 10.1007/s00122-023-04508-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 11/18/2023] [Indexed: 12/18/2023]
Abstract
KEY MESSAGE Schemes that use genomic prediction outperform others, updating testers increases hybrid genetic gain, and larger population sizes tend to have higher genetic gain and less depletion of genetic variance One of the most common methods to improve hybrid performance is reciprocal recurrent selection (RRS). Genomic prediction (GP) can be used to increase genetic gain in RRS by reducing cycle length, but it is also possible to use GP to predict single-cross hybrid performance. The impact of the latter method on genetic gain has yet to be previously reported. Therefore, we compared via stochastic simulations various phenotypic and genomics-assisted RRS breeding schemes which used GP to predict hybrid performance rather than reducing cycle length, which allows minimal changes to traditional breeding schemes. We also compared three breeding sizes scenarios that varied the number of genotypes crossed within heterotic pools, the number of genotypes crossed between heterotic pools, the number of hybrids evaluated, and the number of genomic predicted hybrids. Our results demonstrated that schemes that used genomic prediction of hybrid performance outperformed the others for the average interpopulation hybrid population and the best hybrid performance. Furthermore, updating the testers increased hybrid genetic gain with phenotypic RRS. As expected, the largest breeding size tested had the highest rates of genetic improvement and the lowest decrease in additive genetic variance due to the drift. Therefore, this study demonstrates the usefulness of single-cross prediction, which may be easier to implement than rapid-cycling RRS and cyclical updating of testers. We also reiterate that larger population sizes tend to have higher genetic gain and less depletion of genetic variance.
Collapse
Affiliation(s)
- Roberto Fritsche-Neto
- International Rice Research Institute (IRRI), Los Banos, Philippines.
- H. Rouse Caffey Rice Research Station, LSU AgCenter, Rayne, USA.
| | - Jauhar Ali
- International Rice Research Institute (IRRI), Los Banos, Philippines.
| | - Erik Jon De Asis
- International Rice Research Institute (IRRI), Los Banos, Philippines
| | | | - Marlee Rose Labroo
- Excellence in Breeding Platform, Consultative Group of International Agricultural Research, Lisbon, Mexico
- International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
| |
Collapse
|
6
|
Sweet PK, Bernardo R. Reciprocal testcross design for genome-wide prediction of maize single-cross performance. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:184. [PMID: 37555961 DOI: 10.1007/s00122-023-04435-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 07/25/2023] [Indexed: 08/10/2023]
Abstract
KEY MESSAGE A reciprocal testcross design increases the relatedness among single crosses and testcrosses, thereby increasing the effectiveness of genome-wide prediction in maize. A reciprocal testcross design uses parental inbreds in an opposite heterotic group as testers in maize (Zea mays L.) inbred development. In particular, doubled haploids from the A × B cross are testcrossed with inbreds Y and Z, and doubled haploids from Y × Z are testcrossed with inbreds A and B. Our objective was to determine if a reciprocal testcross design is superior to a traditional, non-reciprocal testcross design. A total of 700 Iowa Stiff Stalk Synthetic (BSSS) doubled haploids and 231 non-BSSS doubled haploids were developed from 10 breeding populations and had data on 11,032 single nucleotide polymorphism markers. Each doubled haploid was testcrossed to one to five testers from the opposite heterotic group, and the resulting 1642 testcrosses were evaluated in multilocation yield trials in 2019. Divergent selection for yield/moisture, on the basis of genome-wide predictions according to a reciprocal testcross design, led to significant responses (in 2020) in all 10 populations for yield/moisture and moisture and in three populations for yield. Predictive ability for yield/moisture and moisture was 0.11 to 0.26 higher with a reciprocal testcross design than with a testcross design. This higher predictive ability was attributed to a stronger relatedness between the training and test populations. No significant difference in predictive ability was found for yield, for which predictive ability was lower. Differences among genetic models that included and excluded specific combining ability were small. Overall, the empirical results supported the usefulness of a reciprocal testcross design in maize breeding.
Collapse
Affiliation(s)
| | - Rex Bernardo
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, USA.
| |
Collapse
|
7
|
Melchinger AE, Fernando R, Stricker C, Schön CC, Auinger HJ. Genomic prediction in hybrid breeding: I. Optimizing the training set design. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:176. [PMID: 37532821 PMCID: PMC10397156 DOI: 10.1007/s00122-023-04413-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 06/23/2023] [Indexed: 08/04/2023]
Abstract
KEY MESSAGE Training sets produced by maximizing the number of parent lines, each involved in one cross, had the highest prediction accuracy for H0 hybrids, but lowest for H1 and H2 hybrids. Genomic prediction holds great promise for hybrid breeding but optimum composition of the training set (TS) as determined by the number of parents (nTS) and crosses per parent (c) has received little attention. Our objective was to examine prediction accuracy ([Formula: see text]) of GCA for lines used as parents of the TS (I1 lines) or not (I0 lines), and H0, H1 and H2 hybrids, comprising crosses of type I0 × I0, I1 × I0 and I1 × I1, respectively, as function of nTS and c. In the theory, we developed estimates for [Formula: see text] of GBLUPs for hybrids: (i)[Formula: see text] based on the expected prediction accuracy, and (ii) [Formula: see text] based on [Formula: see text] of GBLUPs of GCA and SCA effects. In the simulation part, hybrid populations were generated using molecular data from two experimental maize data sets. Additive and dominance effects of QTL borrowed from literature were used to simulate six scenarios of traits differing in the proportion (τSCA = 1%, 6%, 22%) of SCA variance in σG2 and heritability (h2 = 0.4, 0.8). Values of [Formula: see text] and [Formula: see text] closely agreed with [Formula: see text] for hybrids. For given size NTS = nTS × c of TS, [Formula: see text] of H0 hybrids and GCA of I0 lines was highest for c = 1. Conversely, for GCA of I1 lines and H1 and H2 hybrids, c = 1 yielded lowest [Formula: see text] with concordant results across all scenarios for both data sets. In view of these opposite trends, the optimum choice of c for maximizing selection response across all types of hybrids depends on the size and resources of the breeding program.
Collapse
Affiliation(s)
- Albrecht E Melchinger
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany.
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70599, Stuttgart, Germany.
| | - Rohan Fernando
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - Christian Stricker
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| | - Chris-Carolin Schön
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| | - Hans-Jürgen Auinger
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| |
Collapse
|
8
|
DoVale JC, Carvalho HF, Sabadin F, Fritsche-Neto R. Genotyping marker density and prediction models effects in long-term breeding schemes of cross-pollinated crops. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:4523-4539. [PMID: 36261658 DOI: 10.1007/s00122-022-04236-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 10/09/2022] [Indexed: 06/16/2023]
Abstract
In genomic recurrent selection, the more markers, the better because they buffer the linkage disequilibrium losses caused by recombination over cycles, and consequently, provide higher responses to selection. Reductions of genotyping marker density have been extensively evaluated as potential strategies to reduce the genotyping costs of genomic selection (GS). Low-density marker panels are appealing in GS because they entail lower multicollinearity and computing time and allow more individuals to be genotyped for the same cost. However, statistical models used in GS are usually evaluated with empirical data, using "static" training sets and populations. This may be adequate for making predictions during a breeding program's initial cycles but not for the long-term. Moreover, studies that focus on long selective breeding cycles generally do not consider GS models with the effect of dominance, which is particularly important for breeding outcomes in cross-pollinated crops. Hence, dominance effects are important and unexplored in GS for long-term programs involving allogamous species. To address it, we employed two approaches: analysis of empirical maize datasets and simulations of long-term breeding applying phenotypic and genomic recurrent selection (intrapopulation and reciprocal schemes). In both schemes, we simulated twenty breeding cycles and assessed the effect of marker density reduction on the population mean, the best crosses, additive variance, selective accuracy, and response to selection with models [additive, additive-dominant, general (GCA), and this plus specific combining ability (GCA + SCA)]. Our results indicate that marker reduction based on linkage disequilibrium levels provides useful predictions only within a cycle, as accuracy significantly decreases over cycles. In the long-term, without training set updating, high-marker density provides the best responses to selection. The model to be used depends on the breeding scheme: additive for intrapopulation and additive-dominant or GCA + SCA for reciprocal.
Collapse
Affiliation(s)
- Júlio César DoVale
- Department of Crop Science, Federal University of Ceará, Fortaleza, CE, Brazil.
| | | | - Felipe Sabadin
- Virginia Tech: Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | | |
Collapse
|
9
|
Lorenzi A, Bauland C, Mary-Huard T, Pin S, Palaffre C, Guillaume C, Lehermeier C, Charcosset A, Moreau L. Genomic prediction of hybrid performance: comparison of the efficiency of factorial and tester designs used as training sets in a multiparental connected reciprocal design for maize silage. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:3143-3160. [PMID: 35918515 DOI: 10.1007/s00122-022-04176-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 07/06/2022] [Indexed: 06/15/2023]
Abstract
Calibrating a genomic selection model on a sparse factorial design rather than on tester designs is advantageous for some traits, and equivalent for others. In maize breeding, the selection of the candidate inbred lines is based on topcross evaluations using a limited number of testers. Then, a subset of single-crosses between these selected lines is evaluated to identify the best hybrid combinations. Genomic selection enables the prediction of all possible single-crosses between candidate lines but raises the question of defining the best training set design. Previous simulation results have shown the potential of using a sparse factorial design instead of tester designs as the training set. To validate this result, a 363 hybrid factorial design was obtained by crossing 90 dent and flint inbred lines from six segregating families. Two tester designs were also obtained by crossing the same inbred lines to two testers of the opposite group. These designs were evaluated for silage in eight environments and used to predict independent performances of a 951 hybrid factorial design. At a same number of hybrids and lines, the factorial design was as efficient as the tester designs, and, for some traits, outperformed them. All available designs were used as both training and validation set to evaluate their efficiency. When the objective was to predict single-crosses between untested lines, we showed an advantage of increasing the number of lines involved in the training set, by (1) allocating each of them to a different tester for the tester design, or (2) reducing the number of hybrids per line for the factorial design. Our results confirm the potential of sparse factorial designs for genomic hybrid breeding.
Collapse
Affiliation(s)
- Alizarine Lorenzi
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Cyril Bauland
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
- MIA, INRAE, AgroParisTech, Université Paris-Saclay, 75005, Paris, France
| | - Sophie Pin
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Carine Palaffre
- UE 0394 SMH, INRAE, 2297 Route de l'INRA, 40390, Saint-Martin-de-Hinx, France
| | | | | | - Alain Charcosset
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Laurence Moreau
- Génétique Quantitative et Evolution - Le Moulon, INRAE, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
10
|
Building a Calibration Set for Genomic Prediction, Characteristics to Be Considered, and Optimization Approaches. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:77-112. [PMID: 35451773 DOI: 10.1007/978-1-0716-2205-6_3] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The efficiency of genomic selection strongly depends on the prediction accuracy of the genetic merit of candidates. Numerous papers have shown that the composition of the calibration set is a key contributor to prediction accuracy. A poorly defined calibration set can result in low accuracies, whereas an optimized one can considerably increase accuracy compared to random sampling, for a same size. Alternatively, optimizing the calibration set can be a way of decreasing the costs of phenotyping by enabling similar levels of accuracy compared to random sampling but with fewer phenotypic units. We present here the different factors that have to be considered when designing a calibration set, and review the different criteria proposed in the literature. We classified these criteria into two groups: model-free criteria based on relatedness, and criteria derived from the linear mixed model. We introduce criteria targeting specific prediction objectives including the prediction of highly diverse panels, biparental families, or hybrids. We also review different ways of updating the calibration set, and different procedures for optimizing phenotyping experimental designs.
Collapse
|
11
|
Martins Oliveira IC, Bernardeli A, Soler Guilhen JH, Pastina MM. Genomic Prediction of Complex Traits in an Allogamous Annual Crop: The Case of Maize Single-Cross Hybrids. Methods Mol Biol 2022; 2467:543-567. [PMID: 35451790 DOI: 10.1007/978-1-0716-2205-6_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
For many plant and animal species, commercial products are hybrids between individuals from different genetic groups. For allogamous plant species such as maize, the breeding objective is to produce single-cross hybrid varieties from two inbred lines each selected in complementary groups. Efficient hybrid breeding requires methods that (1) quickly generate homozygous and homogeneous parental lines with high combining abilities, (2) efficiently choose among the large number of available parental lines the most promising ones, and (3) predict the performances of sets of non-phenotyped single-cross hybrids, or hybrids phenotyped in a limited number of environments, based on their relationship with another set of hybrids with known performances. The maize breeding community has been developing model-based prediction of hybrid performances well before the genomic era. This chapter (1) provides a reminder of the maize breeding scheme before the genomic era; (2) describes how genomic data were incorporated in the prediction models involved in different steps of genomic-based single-cross maize hybrid breeding; and (3) reviews factors affecting the accuracy of genomic prediction, approaches for optimizing GP-based single-cross maize hybrid breeding schemes, and ensuring the long-term sustainability of genomic selection.
Collapse
Affiliation(s)
| | - Arthur Bernardeli
- Department of Agronomy, Universidade Federal de Viçosa, Viçosa-MG, Brazil
| | | | | |
Collapse
|
12
|
Lell M, Reif J, Zhao Y. Optimizing the setup of multienvironmental hybrid wheat yield trials for boosting the selection capability. THE PLANT GENOME 2021; 14:e20150. [PMID: 34541826 DOI: 10.1002/tpg2.20150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]
Abstract
The accuracy of genomic prediction increases with increasing heritability, and thus the challenge of optimizing the design of multienvironment yield trials under a limited budget arises. With this in mind, we aimed to find the best of several options to sparsely distribute a fixed number of plots across different environments to increase the accuracy of hybrid performance prediction. We used a comprehensive published genomic and phenotypic data set of 1,604 winter wheat (Triticum aestivum L.) hybrids and compared several commonly used biometric models for phenotypic data analysis in a resampling study to identify the one that most accurately estimated the hybrid performance in different imbalanced trials. Our results showed that when using information about genotypic relationships, genotypic values were more strongly associated with the reference values than when this information was ignored. In addition, a balanced environmental sampling resulted in an adequate characterization of each environment and increased the accuracy for estimating the hybrid performance. One promising design involved dividing the genotypes into equally sized subgroups that were tested in a subset of environments, with the constraint that the subgroups overlapped with respect to the environments. This scenario appears to be particularly appropriate, as it provided both high accuracies in the estimates of genotypic values and had low variability resulting from the data sample used. Thus, we were able to clearly demonstrate the utility for optimizing the design of multienvironment hybrid wheat yield trials in times of genomic selection.
Collapse
Affiliation(s)
- Moritz Lell
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, D-06466, Germany
| | - Jochen Reif
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, D-06466, Germany
| | - Yusheng Zhao
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, D-06466, Germany
| |
Collapse
|
13
|
Isidro y Sánchez J, Akdemir D. Training Set Optimization for Sparse Phenotyping in Genomic Selection: A Conceptual Overview. FRONTIERS IN PLANT SCIENCE 2021; 12:715910. [PMID: 34589099 PMCID: PMC8475495 DOI: 10.3389/fpls.2021.715910] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/10/2021] [Indexed: 06/13/2023]
Abstract
Genomic selection (GS) is becoming an essential tool in breeding programs due to its role in increasing genetic gain per unit time. The design of the training set (TRS) in GS is one of the key steps in the implementation of GS in plant and animal breeding programs mainly because (i) TRS optimization is critical for the efficiency and effectiveness of GS, (ii) breeders test genotypes in multi-year and multi-location trials to select the best-performing ones. In this framework, TRS optimization can help to decrease the number of genotypes to be tested and, therefore, reduce phenotyping cost and time, and (iii) we can obtain better prediction accuracies from optimally selected TRS than an arbitrary TRS. Here, we concentrate the efforts on reviewing the lessons learned from TRS optimization studies and their impact on crop breeding and discuss important features for the success of TRS optimization under different scenarios. In this article, we review the lessons learned from training population optimization in plants and the major challenges associated with the optimization of GS including population size, the relationship between training and test set (TS), update of TRS, and the use of different packages and algorithms for TRS implementation in GS. Finally, we describe general guidelines to improving the rate of genetic improvement by maximizing the use of the TRS optimization in the GS framework.
Collapse
Affiliation(s)
- Julio Isidro y Sánchez
- Centro de Biotecnologia y Genómica de Plantas, Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria, Universidad Politécnica de Madrid, Campus de Montegancedo, Madrid, Spain
| | - Deniz Akdemir
- Animal and Crop Science Division, Agriculture and Food Science Centre, University College Dublin, Dublin, Ireland
| |
Collapse
|
14
|
Auinger HJ, Lehermeier C, Gianola D, Mayer M, Melchinger AE, da Silva S, Knaak C, Ouzunova M, Schön CC. Calibration and validation of predicted genomic breeding values in an advanced cycle maize population. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:3069-3081. [PMID: 34117908 PMCID: PMC8354938 DOI: 10.1007/s00122-021-03880-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 05/31/2021] [Indexed: 05/26/2023]
Abstract
Model training on data from all selection cycles yielded the highest prediction accuracy by attenuating specific effects of individual cycles. Expected reliability was a robust predictor of accuracies obtained with different calibration sets. The transition from phenotypic to genome-based selection requires a profound understanding of factors that determine genomic prediction accuracy. We analysed experimental data from a commercial maize breeding programme to investigate if genomic measures can assist in identifying optimal calibration sets for model training. The data set consisted of six contiguous selection cycles comprising testcrosses of 5968 doubled haploid lines genotyped with a minimum of 12,000 SNP markers. We evaluated genomic prediction accuracies in two independent prediction sets in combination with calibration sets differing in sample size and genomic measures (effective sample size, average maximum kinship, expected reliability, number of common polymorphic SNPs and linkage phase similarity). Our results indicate that across selection cycles prediction accuracies were as high as 0.57 for grain dry matter yield and 0.76 for grain dry matter content. Including data from all selection cycles in model training yielded the best results because interactions between calibration and prediction sets as well as the effects of different testers and specific years were attenuated. Among genomic measures, the expected reliability of genomic breeding values was the best predictor of empirical accuracies obtained with different calibration sets. For grain yield, a large difference between expected and empirical reliability was observed in one prediction set. We propose to use this difference as guidance for determining the weight phenotypic data of a given selection cycle should receive in model retraining and for selection when both genomic breeding values and phenotypes are available.
Collapse
Affiliation(s)
- Hans-Jürgen Auinger
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| | | | - Daniel Gianola
- Department of Animal and Dairy Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Manfred Mayer
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany
| | - Albrecht E Melchinger
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, 70593, Stuttgart, Germany
| | | | | | | | - Chris-Carolin Schön
- Plant Breeding, TUM School of Life Sciences, Technical University of Munich, 85354, Freising, Germany.
| |
Collapse
|
15
|
Fugeray-Scarbel A, Bastien C, Dupont-Nivet M, Lemarié S. Why and How to Switch to Genomic Selection: Lessons From Plant and Animal Breeding Experience. Front Genet 2021; 12:629737. [PMID: 34305998 PMCID: PMC8301370 DOI: 10.3389/fgene.2021.629737] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Accepted: 06/11/2021] [Indexed: 11/25/2022] Open
Abstract
The present study is a transversal analysis of the interest in genomic selection for plant and animal species. It focuses on the arguments that may convince breeders to switch to genomic selection. The arguments are classified into three different “bricks.” The first brick considers the addition of genotyping to improve the accuracy of the prediction of breeding values. The second consists of saving costs and/or shortening the breeding cycle by replacing all or a portion of the phenotyping effort with genotyping. The third concerns population management to improve the choice of parents to either optimize crossbreeding or maintain genetic diversity. We analyse the relevance of these different bricks for a wide range of animal and plant species and sought to explain the differences between species according to their biological specificities and the organization of breeding programs.
Collapse
Affiliation(s)
| | | | | | | | - Stéphane Lemarié
- Université Grenoble Alpes, INRAE, CNRS, Grenoble INP, GAEL, Grenoble, France
| |
Collapse
|
16
|
Technow F, Podlich D, Cooper M. Back to the future: Implications of genetic complexity for the structure of hybrid breeding programs. G3 (BETHESDA, MD.) 2021; 11:6265599. [PMID: 33950172 PMCID: PMC8495936 DOI: 10.1093/g3journal/jkab153] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 04/28/2021] [Indexed: 11/14/2022]
Abstract
Commercial hybrid breeding operations can be described as decentralized networks of smaller, more or less isolated breeding programs. There is further a tendency for the disproportionate use of successful inbred lines for generating the next generation of recombinants, which has led to a series of significant bottlenecks, particularly in the history of the North American and European maize germplasm. Both the decentralization and the disproportionate contribution of inbred lines reduce effective population size and constrain the accessible genetic space. Under these conditions, long-term response to selection is not expected to be optimal under the classical infinitesimal model of quantitative genetics. In this study, we therefore aim to propose a rationale for the success of large breeding operations in the context of genetic complexity arising from the structure and properties of interactive genetic networks. For this, we use simulations based on the NK model of genetic architecture. We indeed found that constraining genetic space through program decentralization and disproportionate contribution of parental inbred lines, is required to expose additive genetic variation and thus facilitate heritable genetic gains under high levels of genetic complexity. These results introduce new insights into why the historically grown structure of hybrid breeding programs was successful in improving the yield potential of hybrid crops over the last century. We also hope that a renewed appreciation for “why things worked” in the past can guide the adoption of novel technologies and the design of future breeding strategies for navigating biological complexity.
Collapse
Affiliation(s)
- Frank Technow
- Plant Breeding, Corteva Agriscience, Tavistock, ON, N0B 2R0, Canada
| | - Dean Podlich
- Systems and Innovation for Breeding and Seed Products, Corteva Agriscience, Johnston, IA, 50131, USA
| | - Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4067, Australia
| |
Collapse
|
17
|
Kadam DC, Rodriguez OR, Lorenz AJ. Optimization of training sets for genomic prediction of early-stage single crosses in maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:687-699. [PMID: 33398385 DOI: 10.1007/s00122-020-03722-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 11/03/2020] [Indexed: 06/12/2023]
Abstract
Training population optimization algorithms are useful for efficiently training genomic prediction models for single-cross performance, especially if the population is extended beyond only realized crosses to all possible single crosses. Genomic prediction of single-cross performance could allow effective evaluation of all possible single crosses between all inbreds developed in a hybrid breeding program. The objectives of the present study were to investigate the effect of different levels of relatedness on genomic predictive ability of single crosses, evaluate the usefulness of deterministic formula to forecast prediction accuracy in advance, and determine the potential for TRS optimization based on prediction error variance (PEVmean) and coefficient of determination (CDmean) criteria. We used 481 single crosses made by crossing 89 random recombinant inbred lines (RILs) belonging to the Iowa stiff stalk synthetic group with 103 random RILs belonging to the non-stiff stalk synthetic heterotic group. As expected, predictive ability was enhanced by ensuring close relationships between TRSs and target sets, even when TRS sizes were smaller. We found that designing a TRS based on PEVmean or CDmean criteria is useful for increasing the efficiency of genomic prediction of maize single crosses. We went further and extended the sampling space from that of all observed single crosses to all possible single crosses, providing a much larger genetic space within which to design a training population. Using all possible single crosses increased the advantage of the PEVmean and CDmean methods based on expected prediction accuracy. This finding suggests that it may be worthwhile using an optimization algorithm to select a training population from all possible single crosses to maximize efficiency in training accurate models for hybrid genomic prediction.
Collapse
Affiliation(s)
- Dnyaneshwar C Kadam
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Oscar R Rodriguez
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, 68583, USA
| | - Aaron J Lorenz
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA.
| |
Collapse
|
18
|
Haug B, Messmer MM, Enjalbert J, Goldringer I, Forst E, Flutre T, Mary-Huard T, Hohmann P. Advances in Breeding for Mixed Cropping - Incomplete Factorials and the Producer/Associate Concept. FRONTIERS IN PLANT SCIENCE 2021; 11:620400. [PMID: 33505418 PMCID: PMC7829252 DOI: 10.3389/fpls.2020.620400] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 12/11/2020] [Indexed: 05/27/2023]
Abstract
Mixed cropping has been suggested as a resource-efficient approach to meet high produce demands while maintaining biodiversity and minimizing environmental impact. Current breeding programs do not select for enhanced general mixing ability (GMA) and neglect biological interactions within species mixtures. Clear concepts and efficient experimental designs, adapted to breeding for mixed cropping and encoded into appropriate statistical models, are lacking. Thus, a model framework for GMA and SMA (specific mixing ability) was established. Results of a simulation study showed that an incomplete factorial design combines advantages of two commonly used full factorials, and enables to estimate GMA, SMA, and their variances in a resource-efficient way. This model was extended to the Producer (Pr) and Associate (As) concept to exploit additional information based on fraction yields. It was shown that the Pr/As concept allows to characterize genotypes for their contribution to total mixture yield, and, when relating to plant traits, allows to describe biological interaction functions (BIF) in a mixed crop. Incomplete factorial designs show the potential to drastically improve genetic gain by testing an increased number of genotypes using the same amount of resources. The Pr/As concept can further be employed to maximize GMA in an informed and efficient way. The BIF of a trait can be used to optimize species ratios at harvest as well as to extend our understanding of competitive and facilitative interactions in a mixed plant community. This study provides an integrative methodological framework to promote breeding for mixed cropping.
Collapse
Affiliation(s)
- Benedikt Haug
- Department of Crop Sciences, Research Institute of Organic Agriculture FiBL, Frick, Switzerland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
| | - Monika M. Messmer
- Department of Crop Sciences, Research Institute of Organic Agriculture FiBL, Frick, Switzerland
| | - Jérôme Enjalbert
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
| | - Isabelle Goldringer
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
| | - Emma Forst
- Dipartimento di Scienze Agrarie, Alimentari ed Ambientali (D3A), Università Politecnica delle Marche, Ancona, Italy
| | - Timothée Flutre
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE – Le Moulon, Gif-sur-Yvette, France
- UMR MIA-Paris, AgroParisTech, INRA, Université Paris-Saclay, Paris, France
| | - Pierre Hohmann
- Department of Crop Sciences, Research Institute of Organic Agriculture FiBL, Frick, Switzerland
| |
Collapse
|