1
|
Roscher-Ehrig L, Weber SE, Abbadi A, Malenica M, Abel S, Hemker R, Snowdon RJ, Wittkop B, Stahl A. Phenomic Selection for Hybrid Rapeseed Breeding. PLANT PHENOMICS (WASHINGTON, D.C.) 2024; 6:0215. [PMID: 39049840 PMCID: PMC11268845 DOI: 10.34133/plantphenomics.0215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 06/19/2024] [Indexed: 07/27/2024]
Abstract
Phenomic selection is a recent approach suggested as a low-cost, high-throughput alternative to genomic selection. Instead of using genetic markers, it employs spectral data to predict complex traits using equivalent statistical models. Phenomic selection has been shown to outperform genomic selection when using spectral data that was obtained within the same generation as the traits that were predicted. However, for hybrid breeding, the key question is whether spectral data from parental genotypes can be used to effectively predict traits in the hybrid generation. Here, we aimed to evaluate the potential of phenomic selection for hybrid rapeseed breeding. We performed predictions for various traits in a structured population of 410 test hybrids, grown in multiple environments, using near-infrared spectroscopy data obtained from harvested seeds of both the hybrids and their parental lines with different linear and nonlinear models. We found that phenomic selection within the hybrid generation outperformed genomic selection for seed yield and plant height, even when spectral data was collected at single locations, while being less affected by population structure. Furthermore, we demonstrate that phenomic prediction across generations is feasible, and selecting hybrids based on spectral data obtained from parental genotypes is competitive with genomic selection. We conclude that phenomic selection is a promising approach for rapeseed breeding that can be easily implemented without any additional costs or efforts as near-infrared spectroscopy is routinely assessed in rapeseed breeding.
Collapse
Affiliation(s)
| | - Sven E. Weber
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | | | | | | | | | - Rod J. Snowdon
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Benjamin Wittkop
- Department of Plant Breeding,
Justus Liebig University, Giessen, Germany
| | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants,
Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| |
Collapse
|
2
|
Zhang H, Tang Y, Yue Y, Chen Y. Advances in the evolution research and genetic breeding of peanut. Gene 2024; 916:148425. [PMID: 38575102 DOI: 10.1016/j.gene.2024.148425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/15/2024] [Accepted: 04/01/2024] [Indexed: 04/06/2024]
Abstract
Peanut is an important cash crop used in oil, food and feed in our country. The rapid development of sequencing technology has promoted the research on the related aspects of peanut genetic breeding. This paper reviews the research progress of peanut origin and evolution, genetic breeding, molecular markers and their applications, genomics, QTL mapping and genome selection techniques. The main problems of molecular genetic breeding in peanut research worldwide include: the narrow genetic resources of cultivated species, unstable genetic transformation and unclear molecular mechanism of important agronomic traits. Considering the severe challenges regarding the supply of edible oil, and the main problems in peanut production, the urgent research directions of peanut are put forward: The de novo domestication and the exploitation of excellent genes from wild resources to improve modern cultivars; Integration of multi-omics data to enhance the importance of big data in peanut genetics and breeding; Cloning the important genes related to peanut agronomic traits and analyzing their fine regulation mechanisms; Precision molecular design breeding and using gene editing technology to accurately improve the key traits of peanut.
Collapse
Affiliation(s)
- Hui Zhang
- College of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| | - Yueyi Tang
- Shandong Peanut Research Institute, Qingdao 266100, China
| | - Yunlai Yue
- College of Agriculture, South China Agricultural University, Guangzhou 510642, China
| | - Yong Chen
- College of Agriculture, South China Agricultural University, Guangzhou 510642, China.
| |
Collapse
|
3
|
Weber SE, Roscher-Ehrig L, Kox T, Abbadi A, Stahl A, Snowdon RJ. Genomic prediction in Brassica napus: evaluating the benefit of imputed whole-genome sequencing data. Genome 2024; 67:210-222. [PMID: 38708850 DOI: 10.1139/gen-2023-0126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2024]
Abstract
Advances in sequencing technology allow whole plant genomes to be sequenced with high quality. Combining genotypic and phenotypic data in genomic prediction helps breeders to select crossing partners in partially phenotyped populations. In plant breeding programs, the cost of sequencing entire breeding populations still exceeds available genotyping budgets. Hence, the method for genotyping is still mainly single nucleotide polymorphism (SNP) arrays; however, arrays are unable to assess the entire genome- and population-wide diversity. A compromise involves genotyping the entire population using an SNP array and a subset of the population with whole-genome sequencing. Both datasets can then be used to impute markers from whole-genome sequencing onto the entire population. Here, we evaluate whether imputation of whole-genome sequencing data enhances genomic predictions, using data from a nested association mapping population of rapeseed (Brassica napus). Employing two cross-validation schemes that mimic scenarios for the prediction of close and distant relatives, we show that imputed marker data do not significantly improve prediction accuracy, likely due to redundancy in relationship estimates and imputation errors. In simulation studies, only small improvements were observed, further corroborating the findings. We conclude that SNP arrays are already equipped with the information that is added by imputation through relationship and linkage disequilibrium.
Collapse
Affiliation(s)
- Sven E Weber
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Lennard Roscher-Ehrig
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | | | | | - Andreas Stahl
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| |
Collapse
|
4
|
Laurençon M, Legrix J, Wagner MH, Demilly D, Baron C, Rolland S, Ducournau S, Laperche A, Nesi N. Genomic and phenomic predictions help capture low-effect alleles promoting seed germination in oilseed rape in addition to QTL analyses. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:156. [PMID: 38858297 PMCID: PMC11164772 DOI: 10.1007/s00122-024-04659-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 05/25/2024] [Indexed: 06/12/2024]
Abstract
KEY MESSAGE Phenomic prediction implemented on a large diversity set can efficiently predict seed germination, capture low-effect favorable alleles that are not revealed by GWAS and identify promising genetic resources. Oilseed rape faces many challenges, especially at the beginning of its developmental cycle. Achieving rapid and uniform seed germination could help to ensure a successful establishment and therefore enabling the crop to compete with weeds and tolerate stresses during the earliest developmental stages. The polygenic nature of seed germination was highlighted in several studies, and more knowledge is needed about low- to moderate-effect underlying loci in order to enhance seed germination effectively by improving the genetic background and incorporating favorable alleles. A total of 17 QTL were detected for seed germination-related traits, for which the favorable alleles often corresponded to the most frequent alleles in the panel. Genomic and phenomic predictions methods provided moderate-to-high predictive abilities, demonstrating the ability to capture small additive and non-additive effects for seed germination. This study also showed that phenomic prediction estimated phenotypic values closer to phenotypic values than GEBV. Finally, as the predictive ability of phenomic prediction was less influenced by the genetic structure of the panel, it is worth using this prediction method to characterize genetic resources, particularly with a view to design prebreeding populations.
Collapse
Affiliation(s)
- Marianne Laurençon
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Julie Legrix
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Marie-Hélène Wagner
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Didier Demilly
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Cécile Baron
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Sophie Rolland
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| | - Sylvie Ducournau
- Groupe d'Etude et de Contrôle des Variétés Et des Semences (GEVES), 49070, Beaucouzé, France
| | - Anne Laperche
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France.
| | - Nathalie Nesi
- Institute of Genetics, Environment and Plant Protection (IGEPP), INRAE - Institut Agro Rennes-Angers - Université de Rennes, 35650, Le Rheu, France
| |
Collapse
|
5
|
Kristensen PS, Sarup P, Fé D, Orabi J, Snell P, Ripa L, Mohlfeld M, Chu TT, Herrström J, Jahoor A, Jensen J. Prediction of additive, epistatic, and dominance effects using models accounting for incomplete inbreeding in parental lines of hybrid rye and sugar beet. FRONTIERS IN PLANT SCIENCE 2023; 14:1193433. [PMID: 38162304 PMCID: PMC10756082 DOI: 10.3389/fpls.2023.1193433] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 10/16/2023] [Indexed: 01/03/2024]
Abstract
Genomic models for prediction of additive and non-additive effects within and across different heterotic groups are lacking for breeding of hybrid crops. In this study, genomic prediction models accounting for incomplete inbreeding in parental lines from two different heterotic groups were developed and evaluated. The models can be used for prediction of general combining ability (GCA) of parental lines from each heterotic group as well as specific combining ability (SCA) of all realized and potential crosses. Here, GCA was estimated as the sum of additive genetic effects and within-group epistasis due to high degree of inbreeding in parental lines. SCA was estimated as the sum of across-group epistasis and dominance effects. Three models were compared. In model 1, it was assumed that each hybrid was produced from two completely inbred parental lines. Model 1 was extended to include three-way hybrids from parental lines with arbitrary levels of inbreeding: In model 2, parents of the three-way hybrids could have any levels of inbreeding, while the grandparents of the maternal parent were assumed completely inbred. In model 3, all parental components could have any levels of inbreeding. Data from commercial breeding programs for hybrid rye and sugar beet was used to evaluate the models. The traits grain yield and root yield were analyzed for rye and sugar beet, respectively. Additive genetic variances were larger than epistatic and dominance variances. The models' predictive abilities for total genetic value, for GCA of each parental line and for SCA were evaluated based on different cross-validation strategies. Predictive abilities were highest for total genetic values and lowest for SCA. Predictive abilities for SCA and for GCA of maternal lines were higher for model 2 and model 3 than for model 1. The implementation of the genomic prediction models in hybrid breeding programs can potentially lead to increased genetic gain in two different ways: I) by facilitating the selection of crossing parents with high GCA within heterotic groups and II) by prediction of SCA of all realized and potential combinations of parental lines to produce hybrids with high total genetic values.
Collapse
Affiliation(s)
| | - Pernille Sarup
- Research and Development, Nordic Seed A/S, Odder, Denmark
| | - Dario Fé
- Research Division, DLF Seeds A/S, Store Heddinge, Denmark
| | - Jihad Orabi
- Research and Development, Nordic Seed A/S, Odder, Denmark
| | - Per Snell
- Research and Development, DLF Beet Seed AB, Landskrona, Sweden
| | - Linda Ripa
- Research and Development, DLF Beet Seed AB, Landskrona, Sweden
| | | | - Thinh Tuan Chu
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
| | | | - Ahmed Jahoor
- Research and Development, Nordic Seed A/S, Odder, Denmark
- Breeding, Nordic Seed Germany GmbH, Nienstädt, Germany
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Just Jensen
- Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark
| |
Collapse
|
6
|
Weber SE, Chawla HS, Ehrig L, Hickey LT, Frisch M, Snowdon RJ. Accurate prediction of quantitative traits with failed SNP calls in canola and maize. FRONTIERS IN PLANT SCIENCE 2023; 14:1221750. [PMID: 37936929 PMCID: PMC10627008 DOI: 10.3389/fpls.2023.1221750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 10/05/2023] [Indexed: 11/09/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard to select superior genotypes in large breeding populations that are only partially phenotyped. Many breeding programs commonly rely on single-nucleotide polymorphism (SNP) markers to capture genome-wide data for selection candidates. For this purpose, SNP arrays with moderate to high marker density represent a robust and cost-effective tool to generate reproducible, easy-to-handle, high-throughput genotype data from large-scale breeding populations. However, SNP arrays are prone to technical errors that lead to failed allele calls. To overcome this problem, failed calls are often imputed, based on the assumption that failed SNP calls are purely technical. However, this ignores the biological causes for failed calls-for example: deletions-and there is increasing evidence that gene presence-absence and other kinds of genome structural variants can play a role in phenotypic expression. Because deletions are frequently not in linkage disequilibrium with their flanking SNPs, permutation of missing SNP calls can potentially obscure valuable marker-trait associations. In this study, we analyze published datasets for canola and maize using four parametric and two machine learning models and demonstrate that failed allele calls in genomic prediction are highly predictive for important agronomic traits. We present two statistical pipelines, based on population structure and linkage disequilibrium, that enable the filtering of failed SNP calls that are likely caused by biological reasons. For the population and trait examined, prediction accuracy based on these filtered failed allele calls was competitive to standard SNP-based prediction, underlying the potential value of missing data in genomic prediction approaches. The combination of SNPs with all failed allele calls or the filtered allele calls did not outperform predictions with only SNP-based prediction due to redundancy in genomic relationship estimates.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | | | - Lennard Ehrig
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Lee T. Hickey
- Centre for Crop Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| |
Collapse
|
7
|
Weber SE, Frisch M, Snowdon RJ, Voss-Fels KP. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets. FRONTIERS IN PLANT SCIENCE 2023; 14:1217589. [PMID: 37731980 PMCID: PMC10507710 DOI: 10.3389/fpls.2023.1217589] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/21/2023] [Indexed: 09/22/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software "Haploview" and "HaploBlocker". The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no "best" method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Kai P. Voss-Fels
- Institute for Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| |
Collapse
|
8
|
Chen SP, Tung CW, Wang PH, Liao CT. A statistical package for evaluation of hybrid performance in plant breeding via genomic selection. Sci Rep 2023; 13:12204. [PMID: 37500737 PMCID: PMC10374541 DOI: 10.1038/s41598-023-39434-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 07/25/2023] [Indexed: 07/29/2023] Open
Abstract
Hybrid breeding employs heterosis, which could potentially improve the yield and quality of a crop. Genomic selection (GS) is a promising approach for the selection of quantitative traits in plant breeding. The main objectives of this study are to (i) propose a GS-based approach to identify potential parental lines and superior hybrid combinations from a breeding population, which is composed of hybrids produced by a half diallel mating design; (ii) develop a software package for users to carry out the proposed approach. An R package, designated EHPGS, was generated to facilitate the employment of the genomic best linear unbiased model considering additive plus dominance marker effects for the hybrid performance evaluation. The R package contains a Bayesian statistical algorithm for calculating genomic estimated breeding value (GEBVs), GEBV-based specific combining ability, general combining ability, mid-parent heterosis, and better-parent heterosis. Three datasets that have been published in literature, including pumpkin (Cucurbita maxima), maize (Zea mays), and wheat (Triticum aestivum L.), were reanalyzed to illustrate the use of EHPGS.
Collapse
Affiliation(s)
- Szu-Ping Chen
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
| | - Chih-Wei Tung
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
| | - Pei-Hsien Wang
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
| | - Chen-Tuo Liao
- Department of Agronomy, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
9
|
DoVale JC, Carvalho HF, Sabadin F, Fritsche-Neto R. Genotyping marker density and prediction models effects in long-term breeding schemes of cross-pollinated crops. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:4523-4539. [PMID: 36261658 DOI: 10.1007/s00122-022-04236-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Accepted: 10/09/2022] [Indexed: 06/16/2023]
Abstract
In genomic recurrent selection, the more markers, the better because they buffer the linkage disequilibrium losses caused by recombination over cycles, and consequently, provide higher responses to selection. Reductions of genotyping marker density have been extensively evaluated as potential strategies to reduce the genotyping costs of genomic selection (GS). Low-density marker panels are appealing in GS because they entail lower multicollinearity and computing time and allow more individuals to be genotyped for the same cost. However, statistical models used in GS are usually evaluated with empirical data, using "static" training sets and populations. This may be adequate for making predictions during a breeding program's initial cycles but not for the long-term. Moreover, studies that focus on long selective breeding cycles generally do not consider GS models with the effect of dominance, which is particularly important for breeding outcomes in cross-pollinated crops. Hence, dominance effects are important and unexplored in GS for long-term programs involving allogamous species. To address it, we employed two approaches: analysis of empirical maize datasets and simulations of long-term breeding applying phenotypic and genomic recurrent selection (intrapopulation and reciprocal schemes). In both schemes, we simulated twenty breeding cycles and assessed the effect of marker density reduction on the population mean, the best crosses, additive variance, selective accuracy, and response to selection with models [additive, additive-dominant, general (GCA), and this plus specific combining ability (GCA + SCA)]. Our results indicate that marker reduction based on linkage disequilibrium levels provides useful predictions only within a cycle, as accuracy significantly decreases over cycles. In the long-term, without training set updating, high-marker density provides the best responses to selection. The model to be used depends on the breeding scheme: additive for intrapopulation and additive-dominant or GCA + SCA for reciprocal.
Collapse
Affiliation(s)
- Júlio César DoVale
- Department of Crop Science, Federal University of Ceará, Fortaleza, CE, Brazil.
| | | | - Felipe Sabadin
- Virginia Tech: Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | | |
Collapse
|
10
|
Gabur I, Simioniuc DP, Snowdon RJ, Cristea D. Machine Learning Applied to the Search for Nonlinear Features in Breeding Populations. Front Artif Intell 2022; 5:876578. [PMID: 35669178 PMCID: PMC9164111 DOI: 10.3389/frai.2022.876578] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 04/19/2022] [Indexed: 11/13/2022] Open
Abstract
Large plant breeding populations are traditionally a source of novel allelic diversity and are at the core of selection efforts for elite material. Finding rare diversity requires a deep understanding of biological interactions between the genetic makeup of one genotype and its environmental conditions. Most modern breeding programs still rely on linear regression models to solve this problem, generalizing the complex genotype by phenotype interactions through manually constructed linear features. However, the identification of positive alleles vs. background can be addressed using deep learning approaches that have the capacity to learn complex nonlinear functions for the inputs. Machine learning (ML) is an artificial intelligence (AI) approach involving a range of algorithms to learn from input data sets and predict outcomes in other related samples. This paper describes a variety of techniques that include supervised and unsupervised ML algorithms to improve our understanding of nonlinear interactions from plant breeding data sets. Feature selection (FS) methods are combined with linear and nonlinear predictors and compared to traditional prediction methods used in plant breeding. Recent advances in ML allowed the construction of complex models that have the capacity to better differentiate between positive alleles and the genetic background. Using real plant breeding program data, we show that ML methods have the ability to outperform current approaches, increase prediction accuracies, decrease the computing time drastically, and improve the detection of important alleles involved in qualitative or quantitative traits.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant Breeding, Justus-Liebig-University, Giessen, Germany
- Department of Plant Sciences, Iasi University of Life Sciences, Iasi, Romania
- *Correspondence: Iulian Gabur
| | | | - Rod J. Snowdon
- Department of Plant Breeding, Justus-Liebig-University, Giessen, Germany
| | - Dan Cristea
- Institute of Computer Science, Romanian Academy, Iasi Branch, Iasi, Romania
| |
Collapse
|
11
|
Terraillon J, Frisch M, Falke KC, Jaiser H, Spiller M, Cselényi L, Krumnacker K, Boxberger S, Habekuß A, Kopahnke D, Serfling A, Ordon F, Zenke-Philippi C. Genomic Prediction Can Provide Precise Estimates of the Genotypic Value of Barley Lines Evaluated in Unreplicated Trials. FRONTIERS IN PLANT SCIENCE 2022; 13:735256. [PMID: 35528936 PMCID: PMC9072862 DOI: 10.3389/fpls.2022.735256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 03/14/2022] [Indexed: 06/14/2023]
Abstract
Genomic prediction has been established in breeding programs to predict the genotypic values of selection candidates without phenotypic data. First results in wheat showed that genomic predictions can also prove useful to select among material for which phenotypic data are available. In such a scenario, the selection candidates are evaluated with low intensity in the field. Genome-wide effects are estimated from the field data and are then used to predict the genotypic values of the selection candidates. The objectives of our simulation study were to investigate the correlations r(y, g) between genomic predictions y and genotypic values g and to compare these with the correlations r(p, g) between phenotypic values p and genotypic values g. We used data from a yield trial of 250 barley lines to estimate variance components and genome-wide effects. These parameters were used as basis for simulations. The simulations included multiple crossing schemes, population sizes, and varying sizes of the components of the masking variance. The genotypic values g of the selection candidates were obtained by genetic simulations, the phenotypic values p by simulating evaluation in the field, and the genomic predictions y by RR-BLUP effect estimation from the phenotypic values. The correlations r(y, g) were greater than the correlations r(p, g) for all investigated scenarios. We conclude that using genomic predictions for selection among candidates tested with low intensity in the field can proof useful for increasing the efficiency of barley breeding programs.
Collapse
Affiliation(s)
- Jérôme Terraillon
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Matthias Frisch
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - K. Christin Falke
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Heidi Jaiser
- Saatzucht Josef Breun GmbH & Co. KG, Herzogenaurach, Germany
| | | | - László Cselényi
- W. von Borries-Eckendorf GmbH & Co. KG, Leopoldshöhe, Germany
| | | | | | - Antje Habekuß
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Doris Kopahnke
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Albrecht Serfling
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Frank Ordon
- Institute for Resistance Research and Stress Tolerance, Julius Kühn Institute, Quedlinburg, Germany
| | - Carola Zenke-Philippi
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| |
Collapse
|
12
|
Hu D, Jing J, Snowdon RJ, Mason AS, Shen J, Meng J, Zou J. Exploring the gene pool of Brassica napus by genomics-based approaches. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:1693-1712. [PMID: 34031989 PMCID: PMC8428838 DOI: 10.1111/pbi.13636] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2020] [Revised: 05/13/2021] [Accepted: 05/14/2021] [Indexed: 05/08/2023]
Abstract
De novo allopolyploidization in Brassica provides a very successful model for reconstructing polyploid genomes using progenitor species and relatives to broaden crop gene pools and understand genome evolution after polyploidy, interspecific hybridization and exotic introgression. B. napus (AACC), the major cultivated rapeseed species and the third largest oilseed crop in the world, is a young Brassica species with a limited genetic base resulting from its short history of domestication, cultivation, and intensive selection during breeding for target economic traits. However, the gene pool of B. napus has been significantly enriched in recent decades that has been benefit from worldwide effects by the successful introduction of abundant subgenomic variation and novel genomic variation via intraspecific, interspecific and intergeneric crosses. An important question in this respect is how to utilize such variation to breed crops adapted to the changing global climate. Here, we review the genetic diversity, genome structure, and population-level differentiation of the B. napus gene pool in relation to known exotic introgressions from various species of the Brassicaceae, especially those elucidated by recent genome-sequencing projects. We also summarize progress in gene cloning, trait-marker associations, gene editing, molecular marker-assisted selection and genome-wide prediction, and describe the challenges and opportunities of these techniques as molecular platforms to exploit novel genomic variation and their value in the rapeseed gene pool. Future progress will accelerate the creation and manipulation of genetic diversity with genomic-based improvement, as well as provide novel insights into the neo-domestication of polyploid crops with novel genetic diversity from reconstructed genomes.
Collapse
Affiliation(s)
- Dandan Hu
- National Key Laboratory of Crop Genetic ImprovementCollege of Plant Science & TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jinjie Jing
- National Key Laboratory of Crop Genetic ImprovementCollege of Plant Science & TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Rod J. Snowdon
- Department of Plant BreedingIFZ Research Centre for Biosystems, Land Use and NutritionJustus Liebig UniversityGiessenGermany
| | - Annaliese S. Mason
- Department of Plant BreedingIFZ Research Centre for Biosystems, Land Use and NutritionJustus Liebig UniversityGiessenGermany
- Plant Breeding DepartmentINRESThe University of BonnBonnGermany
| | - Jinxiong Shen
- National Key Laboratory of Crop Genetic ImprovementCollege of Plant Science & TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jinling Meng
- National Key Laboratory of Crop Genetic ImprovementCollege of Plant Science & TechnologyHuazhong Agricultural UniversityWuhanChina
| | - Jun Zou
- National Key Laboratory of Crop Genetic ImprovementCollege of Plant Science & TechnologyHuazhong Agricultural UniversityWuhanChina
| |
Collapse
|
13
|
Sinha P, Singh VK, Bohra A, Kumar A, Reif JC, Varshney RK. Genomics and breeding innovations for enhancing genetic gain for climate resilience and nutrition traits. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:1829-1843. [PMID: 34014373 PMCID: PMC8205890 DOI: 10.1007/s00122-021-03847-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Accepted: 04/29/2021] [Indexed: 05/03/2023]
Abstract
KEY MESSAGE Integrating genomics technologies and breeding methods to tweak core parameters of the breeder's equation could accelerate delivery of climate-resilient and nutrient rich crops for future food security. Accelerating genetic gain in crop improvement programs with respect to climate resilience and nutrition traits, and the realization of the improved gain in farmers' fields require integration of several approaches. This article focuses on innovative approaches to address core components of the breeder's equation. A prerequisite to enhancing genetic variance (σ2g) is the identification or creation of favorable alleles/haplotypes and their deployment for improving key traits. Novel alleles for new and existing target traits need to be accessed and added to the breeding population while maintaining genetic diversity. Selection intensity (i) in the breeding program can be improved by testing a larger population size, enabled by the statistical designs with minimal replications and high-throughput phenotyping. Selection priorities and criteria to select appropriate portion of the population too assume an important role. The most important component of breeder's equation is heritability (h2). Heritability estimates depend on several factors including the size and the type of population and the statistical methods. The present article starts with a brief discussion on the potential ways to enhance σ2g in the population. We highlight statistical methods and experimental designs that could improve trait heritability estimation. We also offer a perspective on reducing the breeding cycle time (t), which could be achieved through the selection of appropriate parents, optimizing the breeding scheme, rapid fixation of target alleles, and combining speed breeding with breeding programs to optimize trials for release. Finally, we summarize knowledge from multiple disciplines for enhancing genetic gains for climate resilience and nutritional traits.
Collapse
Affiliation(s)
- Pallavi Sinha
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
- International Rice Research Institute (IRRI), IRRI South Asia Hub, ICRISAT, Hyderabad, India
| | - Vikas K Singh
- International Rice Research Institute (IRRI), IRRI South Asia Hub, ICRISAT, Hyderabad, India
| | - Abhishek Bohra
- ICAR- Indian Institute of Pulses Research (IIPR), Kanpur, India
| | - Arvind Kumar
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Jochen C Reif
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, WA, Australia.
| |
Collapse
|
14
|
González-Diéguez D, Legarra A, Charcosset A, Moreau L, Lehermeier C, Teyssèdre S, Vitezica ZG. Genomic prediction of hybrid crops allows disentangling dominance and epistasis. Genetics 2021; 218:iyab026. [PMID: 33864072 PMCID: PMC8128411 DOI: 10.1093/genetics/iyab026] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 02/06/2021] [Indexed: 12/28/2022] Open
Abstract
We revisited, in a genomic context, the theory of hybrid genetic evaluation models of hybrid crosses of pure lines, as the current practice is largely based on infinitesimal model assumptions. Expressions for covariances between hybrids due to additive substitution effects and dominance and epistatic deviations were analytically derived. Using dense markers in a GBLUP analysis, it is possible to split specific combining ability into dominance and across-groups epistatic deviations, and to split general combining ability (GCA) into within-line additive effects and within-line additive by additive (and higher order) epistatic deviations. We analyzed a publicly available maize data set of Dent × Flint hybrids using our new model (called GCA-model) up to additive by additive epistasis. To model higher order interactions within GCAs, we also fitted "residual genetic" line effects. Our new GCA-model was compared with another genomic model which assumes a uniquely defined effect of genes across origins. Most variation in hybrids is accounted by GCA. Variances due to dominance and epistasis have similar magnitudes. Models based on defining effects either differently or identically across heterotic groups resulted in similar predictive abilities for hybrids. The currently used model inflates the estimated additive genetic variance. This is not important for hybrid predictions but has consequences for the breeding scheme-e.g. overestimation of the genetic gain within heterotic group. Therefore, we recommend using GCA-model, which is appropriate for genomic prediction and variance component estimation in hybrid crops using genomic data, and whose results can be practically interpreted and used for breeding purposes.
Collapse
Affiliation(s)
| | - Andrés Legarra
- INRAE, INP, UMR 1388 GenPhySE, F-31326 Castanet-Tolosan, France
| | - Alain Charcosset
- GQE-Le Moulon, INRAE, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Laurence Moreau
- GQE-Le Moulon, INRAE, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | | | | |
Collapse
|
15
|
Voss-Fels KP, Wei X, Ross EM, Frisch M, Aitken KS, Cooper M, Hayes BJ. Strategies and considerations for implementing genomic selection to improve traits with additive and non-additive genetic architectures in sugarcane breeding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:1493-1511. [PMID: 33587151 DOI: 10.1007/s00122-021-03785-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Accepted: 01/27/2021] [Indexed: 05/14/2023]
Abstract
Simulations highlight the potential of genomic selection to substantially increase genetic gain for complex traits in sugarcane. The success rate depends on the trait genetic architecture and the implementation strategy. Genomic selection (GS) has the potential to increase the rate of genetic gain in sugarcane beyond the levels achieved by conventional phenotypic selection (PS). To assess different implementation strategies, we simulated two different GS-based breeding strategies and compared genetic gain and genetic variance over five breeding cycles to standard PS. GS scheme 1 followed similar routines like conventional PS but included three rapid recurrent genomic selection (RRGS) steps. GS scheme 2 also included three RRGS steps but did not include a progeny assessment stage and therefore differed more fundamentally from PS. Under an additive trait model, both simulated GS schemes achieved annual genetic gains of 2.6-2.7% which were 1.9 times higher compared to standard phenotypic selection (1.4%). For a complex non-additive trait model, the expected annual rates of genetic gain were lower for all breeding schemes; however, the rates for the GS schemes (1.5-1.6%) were still greater than PS (1.1%). Investigating cost-benefit ratios with regard to numbers of genotyped clones showed that substantial benefits could be achieved when only 1500 clones were genotyped per 10-year breeding cycle for the additive genetic model. Our results show that under a complex non-additive genetic model, the success rate of GS depends on the implementation strategy, the number of genotyped clones and the stage of the breeding program, likely reflecting how changes in QTL allele frequencies change additive genetic variance and therefore the efficiency of selection. These results are encouraging and motivate further work to facilitate the adoption of GS in sugarcane breeding.
Collapse
Affiliation(s)
- Kai P Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4072, Australia
| | - Xianming Wei
- Sugar Research Australia, Mackay, QLD, 4741, Australia
| | - Elizabeth M Ross
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4072, Australia
| | - Matthias Frisch
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Giessen, Germany
| | - Karen S Aitken
- Agriculture and Food, CSIRO, QBP, St. Lucia, QLD, 4067, Australia
| | - Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4072, Australia
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, QLD, 4072, Australia.
| |
Collapse
|
16
|
Knoch D, Werner CR, Meyer RC, Riewe D, Abbadi A, Lücke S, Snowdon RJ, Altmann T. Multi-omics-based prediction of hybrid performance in canola. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:1147-1165. [PMID: 33523261 PMCID: PMC7973648 DOI: 10.1007/s00122-020-03759-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 12/19/2020] [Indexed: 05/05/2023]
Abstract
Complementing or replacing genetic markers with transcriptomic data and use of reproducing kernel Hilbert space regression based on Gaussian kernels increases hybrid prediction accuracies for complex agronomic traits in canola. In plant breeding, hybrids gained particular importance due to heterosis, the superior performance of offspring compared to their inbred parents. Since the development of new top performing hybrids requires labour-intensive and costly breeding programmes, including testing of large numbers of experimental hybrids, the prediction of hybrid performance is of utmost interest to plant breeders. In this study, we tested the effectiveness of hybrid prediction models in spring-type oilseed rape (Brassica napus L./canola) employing different omics profiles, individually and in combination. To this end, a population of 950 F1 hybrids was evaluated for seed yield and six other agronomically relevant traits in commercial field trials at several locations throughout Europe. A subset of these hybrids was also evaluated in a climatized glasshouse regarding early biomass production. For each of the 477 parental rapeseed lines, 13,201 single nucleotide polymorphisms (SNPs), 154 primary metabolites, and 19,479 transcripts were determined and used as predictive variables. Both, SNP markers and transcripts, effectively predict hybrid performance using (genomic) best linear unbiased prediction models (gBLUP). Compared to models using pure genetic markers, models incorporating transcriptome data resulted in significantly higher prediction accuracies for five out of seven agronomic traits, indicating that transcripts carry important information beyond genomic data. Notably, reproducing kernel Hilbert space regression based on Gaussian kernels significantly exceeded the predictive abilities of gBLUP models for six of the seven agronomic traits, demonstrating its potential for implementation in future canola breeding programmes.
Collapse
Affiliation(s)
- Dominic Knoch
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| | - Christian R. Werner
- The Roslin Institute, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG Scotland, UK
| | - Rhonda C. Meyer
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| | - David Riewe
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
- Institute for Ecological Chemistry, Plant Analysis and Stored Product Protection, Julius Kühn Institute (JKI)—Federal Research Centre for Cultivated Plants, 14195 Berlin, Germany
| | - Amine Abbadi
- NPZ Innovation GmbH, Hohenlieth, 24363 Holtsee, Germany
- Norddeutsche Pflanzenzucht Hans-Georg Lembke KG, Hohenlieth, 24363 Holtsee, Germany
| | - Sophie Lücke
- Norddeutsche Pflanzenzucht Hans-Georg Lembke KG, Hohenlieth, 24363 Holtsee, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Thomas Altmann
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| |
Collapse
|
17
|
Labroo MR, Studer AJ, Rutkoski JE. Heterosis and Hybrid Crop Breeding: A Multidisciplinary Review. Front Genet 2021; 12:643761. [PMID: 33719351 PMCID: PMC7943638 DOI: 10.3389/fgene.2021.643761] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 02/08/2021] [Indexed: 11/24/2022] Open
Abstract
Although hybrid crop varieties are among the most popular agricultural innovations, the rationale for hybrid crop breeding is sometimes misunderstood. Hybrid breeding is slower and more resource-intensive than inbred breeding, but it allows systematic improvement of a population by recurrent selection and exploitation of heterosis simultaneously. Inbred parental lines can identically reproduce both themselves and their F1 progeny indefinitely, whereas outbred lines cannot, so uniform outbred lines must be bred indirectly through their inbred parents to harness heterosis. Heterosis is an expected consequence of whole-genome non-additive effects at the population level over evolutionary time. Understanding heterosis from the perspective of molecular genetic mechanisms alone may be elusive, because heterosis is likely an emergent property of populations. Hybrid breeding is a process of recurrent population improvement to maximize hybrid performance. Hybrid breeding is not maximization of heterosis per se, nor testing random combinations of individuals to find an exceptional hybrid, nor using heterosis in place of population improvement. Though there are methods to harness heterosis other than hybrid breeding, such as use of open-pollinated varieties or clonal propagation, they are not currently suitable for all crops or production environments. The use of genomic selection can decrease cycle time and costs in hybrid breeding, particularly by rapidly establishing heterotic pools, reducing testcrossing, and limiting the loss of genetic variance. Open questions in optimal use of genomic selection in hybrid crop breeding programs remain, such as how to choose founders of heterotic pools, the importance of dominance effects in genomic prediction, the necessary frequency of updating the training set with phenotypic information, and how to maintain genetic variance and prevent fixation of deleterious alleles.
Collapse
Affiliation(s)
| | | | - Jessica E. Rutkoski
- Department of Crop Sciences, University of Illinois at Urbana–Champaign, Urbana, IL, United States
| |
Collapse
|
18
|
Mohd Saad NS, Severn-Ellis AA, Pradhan A, Edwards D, Batley J. Genomics Armed With Diversity Leads the Way in Brassica Improvement in a Changing Global Environment. Front Genet 2021; 12:600789. [PMID: 33679880 PMCID: PMC7930750 DOI: 10.3389/fgene.2021.600789] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 01/15/2021] [Indexed: 12/14/2022] Open
Abstract
Meeting the needs of a growing world population in the face of imminent climate change is a challenge; breeding of vegetable and oilseed Brassica crops is part of the race in meeting these demands. Available genetic diversity constituting the foundation of breeding is essential in plant improvement. Elite varieties, land races, and crop wild species are important resources of useful variation and are available from existing genepools or genebanks. Conservation of diversity in genepools, genebanks, and even the wild is crucial in preventing the loss of variation for future breeding efforts. In addition, the identification of suitable parental lines and alleles is critical in ensuring the development of resilient Brassica crops. During the past two decades, an increasing number of high-quality nuclear and organellar Brassica genomes have been assembled. Whole-genome re-sequencing and the development of pan-genomes are overcoming the limitations of the single reference genome and provide the basis for further exploration. Genomic and complementary omic tools such as microarrays, transcriptomics, epigenetics, and reverse genetics facilitate the study of crop evolution, breeding histories, and the discovery of loci associated with highly sought-after agronomic traits. Furthermore, in genomic selection, predicted breeding values based on phenotype and genome-wide marker scores allow the preselection of promising genotypes, enhancing genetic gains and substantially quickening the breeding cycle. It is clear that genomics, armed with diversity, is set to lead the way in Brassica improvement; however, a multidisciplinary plant breeding approach that includes phenotype = genotype × environment × management interaction will ultimately ensure the selection of resilient Brassica varieties ready for climate change.
Collapse
Affiliation(s)
| | | | | | | | - Jacqueline Batley
- School of Biological Sciences Western Australia and UWA Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| |
Collapse
|
19
|
Werner CR, Gaynor RC, Gorjanc G, Hickey JM, Kox T, Abbadi A, Leckband G, Snowdon RJ, Stahl A. How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding. FRONTIERS IN PLANT SCIENCE 2020; 11:592977. [PMID: 33391305 PMCID: PMC7772221 DOI: 10.3389/fpls.2020.592977] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 11/24/2020] [Indexed: 05/27/2023]
Abstract
Over the last two decades, the application of genomic selection has been extensively studied in various crop species, and it has become a common practice to report prediction accuracies using cross validation. However, genomic prediction accuracies obtained from random cross validation can be strongly inflated due to population or family structure, a characteristic shared by many breeding populations. An understanding of the effect of population and family structure on prediction accuracy is essential for the successful application of genomic selection in plant breeding programs. The objective of this study was to make this effect and its implications for practical breeding programs comprehensible for breeders and scientists with a limited background in quantitative genetics and genomic selection theory. We, therefore, compared genomic prediction accuracies obtained from different random cross validation approaches and within-family prediction in three different prediction scenarios. We used a highly structured population of 940 Brassica napus hybrids coming from 46 testcross families and two subpopulations. Our demonstrations show how genomic prediction accuracies obtained from among-family predictions in random cross validation and within-family predictions capture different measures of prediction accuracy. While among-family prediction accuracy measures prediction accuracy of both the parent average component and the Mendelian sampling term, within-family prediction only measures how accurately the Mendelian sampling term can be predicted. With this paper we aim to foster a critical approach to different measures of genomic prediction accuracy and a careful analysis of values observed in genomic selection experiments and reported in literature.
Collapse
Affiliation(s)
- Christian R. Werner
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | | | | | | | - Rod J. Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Andreas Stahl
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| |
Collapse
|
20
|
Geng X, Sun G, Qu Y, Sarfraz Z, Jia Y, He S, Pan Z, Sun J, Iqbal MS, Wang Q, Qin H, Liu J, Liu H, Yang J, Ma Z, Xu D, Yang J, Zhang J, Li Z, Cai Z, Zhang X, Zhang X, Zhou G, Li L, Zhu H, Wang L, Pang B, Du X. Genome-wide dissection of hybridization for fiber quality- and yield-related traits in upland cotton. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 104:1285-1300. [PMID: 32996179 PMCID: PMC7756405 DOI: 10.1111/tpj.14999] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Revised: 07/14/2020] [Accepted: 09/03/2020] [Indexed: 06/11/2023]
Abstract
An evaluation of combining ability can facilitate the selection of suitable parents and superior F1 hybrids for hybrid cotton breeding, although the molecular genetic basis of combining ability has not been fully characterized. In the present study, 282 female parents were crossed with four male parents in accordance with the North Carolina II mating scheme to generate 1128 hybrids. The parental lines were genotyped based on restriction site-associated DNA sequencing and 306 814 filtered single nucleotide polymorphisms were used for genome-wide association analysis involving the phenotypes, general combining ability (GCA) values, and specific combining ability values of eight fiber quality- and yield-related traits. The main results were: (i) all parents could be clustered into five subgroups based on population structure analyses and the GCA performance of the female parents had significant differences between subgroups; (ii) 20 accessions with a top 5% GCA value for more than one trait were identified as elite parents for hybrid cotton breeding; (iii) 120 significant single nucleotide polymorphisms, clustered into 66 quantitative trait loci, such as the previously reported Gh_A07G1769 and GhHOX3 genes, were found to be significantly associated with GCA; and (iv) identified quantitative trait loci for GCA had a cumulative effect on GCA of the accessions. Overall, our results suggest that pyramiding the favorable loci for GCA may improve the efficiency of hybrid cotton breeding.
Collapse
Affiliation(s)
- Xiaoli Geng
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
- Zhengzhou Research BaseState Key Laboratory of Cotton BiologyZhengzhou UniversityZhengzhou455001China
| | - Gaofei Sun
- Anyang Institute of TechnologyAnyang455000China
| | - Yujie Qu
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Zareen Sarfraz
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Yinhua Jia
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
- Zhengzhou Research BaseState Key Laboratory of Cotton BiologyZhengzhou UniversityZhengzhou455001China
| | - Shoupu He
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
- Zhengzhou Research BaseState Key Laboratory of Cotton BiologyZhengzhou UniversityZhengzhou455001China
| | - Zhaoe Pan
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Junling Sun
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Muhammad S. Iqbal
- Cotton Research StationAyub Agricultural Research InstituteFaisalabad38000Pakistan
| | - Qinglian Wang
- Henan Institute of Science and TechnologyXinxiang453003China
| | - Hongde Qin
- Cash Crop InstituteHubei Academy of Agricultural SciencesWuhan430000China
| | - Jinhai Liu
- Zhongmian Cotton Seed Industry Technology Co., LtdZhengzhou455001China
| | - Hui Liu
- Jing Hua Seed Industry Technologies IncJingzhou434000China
| | - Jun Yang
- Cotton Research Institute of Jiangxi ProvinceJiujiang332000China
| | - Zhiying Ma
- Key Laboratory of Crop Germplasm Resources of HebeiHebei Agricultural UniversityBaoding071000China
| | - Dongyong Xu
- Guoxin Rural Technical Service AssociationHejian062450China
| | - Jinlong Yang
- Zhongmian Cotton Seed Industry Technology Co., LtdZhengzhou455001China
| | | | - Zhikun Li
- Key Laboratory of Crop Germplasm Resources of HebeiHebei Agricultural UniversityBaoding071000China
| | - Zhongmin Cai
- Zhongmian Cotton Seed Industry Technology Co., LtdZhengzhou455001China
| | - Xuelin Zhang
- Hunan Cotton Research InstituteChangde415000China
| | - Xin Zhang
- Henan Institute of Science and TechnologyXinxiang453003China
| | - Guanyin Zhou
- Zhongmian Cotton Seed Industry Technology Co., LtdZhengzhou455001China
| | - Lin Li
- Zhongli Company of ShandongDongying257000China
| | - Haiyong Zhu
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Liru Wang
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Baoyin Pang
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
| | - Xiongming Du
- State Key Laboratory of Cotton BiologyInstitute of Cotton ResearchChinese Academy of Agricultural SciencesAnyang455000China
- Zhengzhou Research BaseState Key Laboratory of Cotton BiologyZhengzhou UniversityZhengzhou455001China
| |
Collapse
|
21
|
Seye AI, Bauland C, Charcosset A, Moreau L. Revisiting hybrid breeding designs using genomic predictions: simulations highlight the superiority of incomplete factorials between segregating families over topcross designs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:1995-2010. [PMID: 32185420 DOI: 10.1007/s00122-020-03573-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Accepted: 02/28/2020] [Indexed: 06/10/2023]
Abstract
Simulations showed that hybrid performances issued from an incomplete factorial between segregating families of two heterotic groups enable to calibrate genomic predictions of hybrid value more efficiently than tester-based designs. Genomic selection offers new opportunities to revisit hybrid breeding by replacing extensive phenotyping of hybrid combinations by genomic predictions. A key question remains to identify the best design to calibrate genomic prediction models. We proposed to use single-cross hybrids issued from an incomplete factorial design between segregating populations and compared this strategy with a conventional approach based on topcross evaluation. Two multiparental segregating populations of lines, each specific of one heterotic group, were simulated. Hybrids considered as training sets were generated using either (1) a parental line from the opposite group as tester or (2) following an incomplete factorial design. Different specific combining ability (SCA) proportions were simulated by considering different levels of group divergence and dominance effects for the simulated QTL. For the incomplete factorial design, for a same number of hybrids, we considered different numbers of parental lines and different contributions of lines (one to four) to calibration hybrids. We evaluated for different training set sizes prediction accuracies of new hybrids and genetic gains along three generations. At a given training set size, factorial design was as efficient (considering accuracy) as tester design in additive scenarios, but significantly outperformed tester design when SCA was present. The contribution number of each parental line to the incomplete factorial design had a small impact on accuracies. Our simulations confirmed experimental results and showed that calibrating models on hybrids between two multiparental populations is a cost-efficient way to perform genomic predictions in both groups, opening prospects for revisiting reciprocal recurrent selection schemes.
Collapse
Affiliation(s)
- A I Seye
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution - Le Moulon, 91190, Gif-sur-Yvette, France
| | - C Bauland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution - Le Moulon, 91190, Gif-sur-Yvette, France
| | - A Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution - Le Moulon, 91190, Gif-sur-Yvette, France
| | - L Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution - Le Moulon, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
22
|
Vasseur F, Fouqueau L, de Vienne D, Nidelet T, Violle C, Weigel D. Nonlinear phenotypic variation uncovers the emergence of heterosis in Arabidopsis thaliana. PLoS Biol 2019; 17:e3000214. [PMID: 31017902 PMCID: PMC6481775 DOI: 10.1371/journal.pbio.3000214] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 03/21/2019] [Indexed: 12/22/2022] Open
Abstract
Heterosis describes the phenotypic superiority of hybrids over their parents in traits related to agronomic performance and fitness. Understanding and predicting nonadditive inheritance such as heterosis is crucial for evolutionary biology as well as for plant and animal breeding. However, the physiological bases of heterosis remain debated. Moreover, empirical data in various species have shown that diverse genetic and molecular mechanisms are likely to explain heterosis, making it difficult to predict its emergence and amplitude from parental genotypes alone. In this study, we examined a model of physiological dominance initially proposed by Sewall Wright to explain the nonadditive inheritance of traits like metabolic fluxes at the cellular level. We evaluated Wright's model for two fitness-related traits at the whole-plant level, growth rate and fruit number, using 450 hybrids derived from crosses among natural accessions of A. thaliana. We found that allometric relationships between traits constrain phenotypic variation in a nonlinear and similar manner in hybrids and accessions. These allometric relationships behave predictably, explaining up to 75% of heterosis amplitude, while genetic distance among parents at best explains 7%. Thus, our findings are consistent with Wright's model of physiological dominance and suggest that the emergence of heterosis on plant performance is an intrinsic property of nonlinear relationships between traits. Furthermore, our study highlights the potential of a geometric approach of phenotypic relationships for predicting heterosis of major components of crop productivity and yield.
Collapse
Affiliation(s)
- François Vasseur
- Max Planck Institute for Developmental Biology, Tübingen, Germany
- CEFE, CNRS, Univ Montpellier, Univ Paul Valéry Montpellier, EPHE, IRD, Montpellier, France
- Laboratoire d’Ecophysiologie des Plantes sous Stress Environnementaux (LEPSE), INRA, Montpellier SupAgro, UMR759, Montpellier, France
| | - Louise Fouqueau
- CEFE, CNRS, Univ Montpellier, Univ Paul Valéry Montpellier, EPHE, IRD, Montpellier, France
| | - Dominique de Vienne
- GQE–Le Moulon, INRA, Univ Paris-Sud, CNRS, AgroParisTech, Univ Paris-Saclay, Gif-sur-Yvette, France
| | - Thibault Nidelet
- SPO, INRA, Montpellier SupAgro, Univ Montpellier, Montpellier, France
| | - Cyrille Violle
- CEFE, CNRS, Univ Montpellier, Univ Paul Valéry Montpellier, EPHE, IRD, Montpellier, France
| | - Detlef Weigel
- Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
23
|
Stahl A, Vollrath P, Samans B, Frisch M, Wittkop B, Snowdon RJ. Effect of breeding on nitrogen use efficiency-associated traits in oilseed rape. JOURNAL OF EXPERIMENTAL BOTANY 2019; 70:1969-1986. [PMID: 30753580 PMCID: PMC6436158 DOI: 10.1093/jxb/erz044] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 01/06/2019] [Indexed: 05/21/2023]
Abstract
Oilseed rape is one of the most important dicotyledonous field crops in the world, where it plays a key role in productive cereal crop rotations. However, its production requires high nitrogen fertilization and its nitrogen footprint exceeds that of most other globally important crops. Hence, increased nitrogen use efficiency (NUE) in this crop is of high priority for sustainable agriculture. We report a comprehensive study of macrophysiological characteristics associated with breeding progress, conducted under contrasting nitrogen fertilization levels in a large panel of elite oilseed rape varieties representing breeding progress over the past 20 years. The results indicate that increased plant biomass at flowering, along with increases in primary yield components, have increased NUE in modern varieties. Nitrogen uptake efficiency has improved through breeding, particularly at high nitrogen. Despite low heritability, the number of seeds per silique is associated positively with increased yield in modern varieties. Seed weight remains unaffected by breeding progress; however, recent selection for high seed oil content and for high seed yields appears to have promoted a negative correlation (r= -0.39 at high and r= -0.49 at low nitrogen) between seed weight and seed oil concentration. Overall, our results reveal valuable breeding targets to improve NUE in oilseed rape.
Collapse
Affiliation(s)
- Andreas Stahl
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
- Correspondence:
| | - Paul Vollrath
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Birgit Samans
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Matthias Frisch
- Department of Biometry and Population Genetics, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Benjamin Wittkop
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| |
Collapse
|
24
|
Voss-Fels KP, Cooper M, Hayes BJ. Accelerating crop genetic gains with genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:669-686. [PMID: 30569365 DOI: 10.1007/s00122-018-3270-8] [Citation(s) in RCA: 131] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2018] [Accepted: 12/12/2018] [Indexed: 05/05/2023]
Abstract
Genomic prediction based on additive genetic effects can accelerate genetic gain. There are opportunities for further improvement by including non-additive effects that access untapped sources of genetic diversity. Several studies have reported a worrying gap between the projected global future demand for plant-based products and the current annual rates of production increase, indicating that enhancing the rate of genetic gain might be critical for future food security. Therefore, new breeding technologies and strategies are required to significantly boost genetic improvement of future crop cultivars. Genomic selection (GS) has delivered considerable genetic gain in animal breeding and is becoming an essential component of many modern plant breeding programmes as well. In this paper, we review the lessons learned from implementing GS in livestock and the impact of GS on crop breeding, and discuss important features for the success of GS under different breeding scenarios. We highlight major challenges associated with GS including rapid genotyping, phenotyping, genotype-by-environment interaction and non-additivity and give examples for opportunities to overcome these issues. Finally, the potential of combining GS with other modern technologies in order to maximise the rate of crop genetic improvement is discussed, including the potential of increasing prediction accuracy by integration of crop growth models in GS frameworks.
Collapse
Affiliation(s)
- Kai Peter Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Mark Cooper
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Ben John Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, 4072, Australia.
| |
Collapse
|
25
|
Liu X, Wang H, Hu X, Li K, Liu Z, Wu Y, Huang C. Improving Genomic Selection With Quantitative Trait Loci and Nonadditive Effects Revealed by Empirical Evidence in Maize. FRONTIERS IN PLANT SCIENCE 2019; 10:1129. [PMID: 31620155 PMCID: PMC6759780 DOI: 10.3389/fpls.2019.01129] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 08/15/2019] [Indexed: 05/20/2023]
Abstract
Genomic selection (GS), a tool developed for molecular breeding, is used by plant breeders to improve breeding efficacy by shortening the breeding cycle and to facilitate the selection of candidate lines for creating hybrids without phenotyping in various environments. Association and linkage mapping have been widely used to explore and detect candidate genes in order to understand the genetic mechanisms of quantitative traits. In the current study, phenotypic and genotypic data from three experimental populations, including data on six agronomic traits (e.g., plant height, ear height, ear length, ear diameter, grain yield per plant, and hundred-kernel weight), were used to evaluate the effect of trait-relevant markers (TRMs) on prediction accuracy estimation. Integrating information from mapping into a statistical model can efficiently improve prediction performance compared with using stochastically selected markers to perform GS. The prediction accuracy can reach plateau when a total of 500-1,000 TRMs are utilized in GS. The prediction accuracy can be significantly enhanced by including nonadditive effects and TRMs in the GS model when genotypic data with high proportions of heterozygous alleles and complex agronomic traits with high proportion of nonadditive variancein phenotypic variance are used to perform GS. In addition, taking information on population structure into account can slightly improve prediction performance when the genetic relationship between the training and testing sets is influenced by population stratification due to different allele frequencies. In conclusion, GS is a useful approach for prescreening candidate lines, and the empirical evidence provided by the current study for TRMs and nonadditive effects can inform plant breeding and in turn contribute to the improvement of selection efficiency in practical GS-assisted breeding programs.
Collapse
|