1
|
Ahadi P, Balasundaram B, Borrero JS, Chen C. Development and optimization of expected cross value for mate selection problems. Heredity (Edinb) 2024; 133:113-125. [PMID: 38956397 PMCID: PMC11286873 DOI: 10.1038/s41437-024-00697-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/04/2024] Open
Abstract
In this study, we address the mate selection problem in the hybridization stage of a breeding pipeline, which constitutes the multi-objective breeding goal key to the performance of a variety development program. The solution framework we formulate seeks to ensure that individuals with the most desirable genomic characteristics are selected to cross in order to maximize the likelihood of the inheritance of desirable genetic materials to the progeny. Unlike approaches that use phenotypic values for parental selection and evaluate individuals separately, we use a criterion that relies on the genetic architecture of traits and evaluates combinations of genomic information of the pairs of individuals. We introduce the expected cross value (ECV) criterion that measures the expected number of desirable alleles for gametes produced by pairs of individuals sampled from a population of potential parents. We use the ECV criterion to develop an integer linear programming formulation for the parental selection problem. The formulation is capable of controlling the inbreeding level between selected mates. We evaluate the approach or two applications: (i) improving multiple target traits simultaneously, and (ii) finding a multi-parental solution to design crossing blocks. We evaluate the performance of the ECV criterion using a simulation study. Finally, we discuss how the ECV criterion and the proposed integer linear programming techniques can be applied to improve breeding efficiency while maintaining genetic diversity in a breeding program.
Collapse
Affiliation(s)
- Pouya Ahadi
- H. Milton Stewart School of Industrial and Systems Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| | | | - Juan S Borrero
- School of Industrial Engineering and Management, Oklahoma State University, Stillwater, OK, USA
| | - Charles Chen
- Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, USA.
| |
Collapse
|
2
|
Villiers K, Voss-Fels KP, Dinglasan E, Jacobs B, Hickey L, Hayes BJ. Evolutionary computing to assemble standing genetic diversity and achieve long-term genetic gain. THE PLANT GENOME 2024; 17:e20467. [PMID: 38816340 DOI: 10.1002/tpg2.20467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/08/2024] [Accepted: 04/27/2024] [Indexed: 06/01/2024]
Abstract
Loss of genetic diversity in elite crop breeding pools can severely limit long-term genetic gains and limit ability to make gains in new traits, like heat tolerance, that are becoming important as the climate changes. Here, we investigate and propose potential breeding program applications of optimal haplotype stacking (OHS), a selection method that retains useful diversity in the population. OHS selects sets of candidates containing, between them, haplotype segments with very high segment breeding values for the target trait. We compared the performance of OHS, a similar method called optimal population value (OPV), truncation selection on genomic estimated breeding values (GEBVs), and optimal contribution selection (OCS) in stochastic simulations of recurrent selection on founder wheat genotypes. After 100 generations of intercrossing and selection, OCS and truncation selection had exhausted the genetic diversity, while considerable diversity remained in the OHS population. Gain under OHS in these simulations ultimately exceeded that from truncation selection or OCS. OHS achieved faster gains when the population size was small, with many progeny per cross. A promising hybrid strategy, involving a single cycle of OHS in the first generation followed by recurrent truncation selection, substantially improved long-term gain compared with truncation selection and performed similarly to OCS. The results of this study provide initial insights into where OHS could be incorporated into breeding programs.
Collapse
Affiliation(s)
- Kira Villiers
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
| | - Kai P Voss-Fels
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
- Department of Grapevine Breeding, Hochschule Geisenheim University, Geisenheim, Germany
| | - Eric Dinglasan
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
| | - Bertus Jacobs
- LongReach Plant Breeders Management Pty Ltd, Lonsdale, South Australia, Australia
| | - Lee Hickey
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
| | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, Queensland, Australia
| |
Collapse
|
3
|
Sanchez D, Allier A, Ben Sadoun S, Mary-Huard T, Bauland C, Palaffre C, Lagardère B, Madur D, Combes V, Melkior S, Bettinger L, Murigneux A, Moreau L, Charcosset A. Assessing the potential of genetic resource introduction into elite germplasm: a collaborative multiparental population for flint maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:19. [PMID: 38214870 PMCID: PMC10786986 DOI: 10.1007/s00122-023-04509-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 11/18/2023] [Indexed: 01/13/2024]
Abstract
KEY MESSAGE Implementing a collaborative pre-breeding multi-parental population efficiently identifies promising donor x elite pairs to enrich the flint maize elite germplasm. Genetic diversity is crucial for maintaining genetic gains and ensuring breeding programs' long-term success. In a closed breeding program, selection inevitably leads to a loss of genetic diversity. While managing diversity can delay this loss, introducing external sources of diversity is necessary to bring back favorable genetic variation. Genetic resources exhibit greater diversity than elite materials, but their lower performance levels hinder their use. This is the case for European flint maize, for which elite germplasm has incorporated only a limited portion of the diversity available in landraces. To enrich the diversity of this elite genetic pool, we established an original cooperative maize bridging population that involves crosses between private elite materials and diversity donors to create improved genotypes that will facilitate the incorporation of original favorable variations. Twenty donor × elite BC1S2 families were created and phenotyped for hybrid value for yield related traits. Crosses showed contrasted means and variances and therefore contrasted potential in terms of selection as measured by their usefulness criterion (UC). Average expected mean performance gain over the initial elite material was 5%. The most promising donor for each elite line was identified. Results also suggest that one more generation, i.e., 3 in total, of crossing to the elite is required to fully exploit the potential of a donor. Altogether, our results support the usefulness of incorporating genetic resources into elite flint maize. They call for further effort to create fixed diversity donors and identify those most suitable for each elite program.
Collapse
Affiliation(s)
- Dimitri Sanchez
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Antoine Allier
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
- Syngenta, 12 Chemin de L'Hobit, 31790, Saint-Sauveur, France
| | - Sarah Ben Sadoun
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Tristan Mary-Huard
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
- Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA-Paris Saclay, 91120, Palaiseau, France
| | - Cyril Bauland
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Carine Palaffre
- UE 0394 SMH, INRAE, 2297 Route de l'INRA, 40390, Saint-Martin-de-Hinx, France
| | - Bernard Lagardère
- UE 0394 SMH, INRAE, 2297 Route de l'INRA, 40390, Saint-Martin-de-Hinx, France
| | - Delphine Madur
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Valérie Combes
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | | | | | - Alain Murigneux
- Limagrain Europe, 28 Route d'Ennezat, 63720, Chappes, France
| | - Laurence Moreau
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France
| | - Alain Charcosset
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, Génétique Quantitative et Evolution-Le Moulon, 91190, Gif-Sur-Yvette, France.
| |
Collapse
|
4
|
Biová J, Kaňovská I, Chan YO, Immadi MS, Joshi T, Bilyeu K, Škrabišová M. Natural and artificial selection of multiple alleles revealed through genomic analyses. Front Genet 2024; 14:1320652. [PMID: 38259621 PMCID: PMC10801239 DOI: 10.3389/fgene.2023.1320652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 11/17/2023] [Indexed: 01/24/2024] Open
Abstract
Genome-to-phenome research in agriculture aims to improve crops through in silico predictions. Genome-wide association study (GWAS) is potent in identifying genomic loci that underlie important traits. As a statistical method, increasing the sample quantity, data quality, or diversity of the GWAS dataset positively impacts GWAS power. For more precise breeding, concrete candidate genes with exact functional variants must be discovered. Many post-GWAS methods have been developed to narrow down the associated genomic regions and, ideally, to predict candidate genes and causative mutations (CMs). Historical natural selection and breeding-related artificial selection both act to change the frequencies of different alleles of genes that control phenotypes. With higher diversity and more extensive GWAS datasets, there is an increased chance of multiple alleles with independent CMs in a single causal gene. This can be caused by the presence of samples from geographically isolated regions that arose during natural or artificial selection. This simple fact is a complicating factor in GWAS-driven discoveries. Currently, none of the existing association methods address this issue and need to identify multiple alleles and, more specifically, the actual CMs. Therefore, we developed a tool that computes a score for a combination of variant positions in a single candidate gene and, based on the highest score, identifies the best number and combination of CMs. The tool is publicly available as a Python package on GitHub, and we further created a web-based Multiple Alleles discovery (MADis) tool that supports soybean and is hosted in SoyKB (https://soykb.org/SoybeanMADisTool/). We tested and validated the algorithm and presented the utilization of MADis in a pod pigmentation L1 gene case study with multiple CMs from natural or artificial selection. Finally, we identified a candidate gene for the pod color L2 locus and predicted the existence of multiple alleles that potentially cause loss of pod pigmentation. In this work, we show how a genomic analysis can be employed to explore the natural and artificial selection of multiple alleles and, thus, improve and accelerate crop breeding in agriculture.
Collapse
Affiliation(s)
- Jana Biová
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Olomouc, Czechia
| | - Ivana Kaňovská
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Olomouc, Czechia
| | - Yen On Chan
- MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, United States
- Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, United States
| | - Manish Sridhar Immadi
- Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, United States
| | - Trupti Joshi
- MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, United States
- Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, United States
- Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, United States
- Department of Biomedical Informatics, Biostatistics and Medical Epidemiology, University of Missouri-Columbia, Columbia, MO, United States
| | - Kristin Bilyeu
- United States Department of Agriculture-Agricultural Research Service, Plant Genetics Research Unit, Columbia, MO, United States
| | - Mária Škrabišová
- Department of Biochemistry, Faculty of Science, Palacký University in Olomouc, Olomouc, Czechia
| |
Collapse
|
5
|
Trentin HU, Krause MD, Zunjare RU, Almeida VC, Peterlini E, Rotarenco V, Frei UK, Beavis WD, Lübberstedt T. Genetic basis of maize maternal haploid induction beyond MATRILINEAL and ZmDMP. FRONTIERS IN PLANT SCIENCE 2023; 14:1218042. [PMID: 37860246 PMCID: PMC10582762 DOI: 10.3389/fpls.2023.1218042] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 09/05/2023] [Indexed: 10/21/2023]
Abstract
In maize, doubled haploid (DH) lines are created in vivo through crosses with maternal haploid inducers. Their induction ability, usually expressed as haploid induction rate (HIR), is known to be under polygenic control. Although two major genes (MTL and ZmDMP) affecting this trait were recently described, many others remain unknown. To identify them, we designed and performed a SNP based (~9007) genome-wide association study using a large and diverse panel of 159 maternal haploid inducers. Our analyses identified a major gene near MTL, which is present in all inducers and necessary to disrupt haploid induction. We also found a significant quantitative trait loci (QTL) on chromosome 10 using a case-control mapping approach, in which 793 noninducers were used as controls. This QTL harbors a kokopelli ortholog, whose role in maternal haploid induction was recently described in Arabidopsis. QTL with smaller effects were identified on six of the ten maize chromosomes, confirming the polygenic nature of this trait. These QTL could be incorporated into inducer breeding programs through marker-assisted selection approaches. Further improving HIR is important to reduce the cost of DH line production.
Collapse
Affiliation(s)
- Henrique Uliana Trentin
- Bayer Crop Science, Coxilha, RS, Brazil
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | | | - Rajkumar Uttamrao Zunjare
- Department of Agronomy, Iowa State University, Ames, IA, United States
- Division of Genetics, ICAR-Indian Agricultural Research Institute, New Delhi, India
| | - Vinícius Costa Almeida
- Department of Agronomy, Iowa State University, Ames, IA, United States
- Federal University of Viçosa, Viçosa, MG, Brazil
| | - Edicarlos Peterlini
- Department of Agronomy, Iowa State University, Ames, IA, United States
- Department of Agronomy, State University of Maringá, Maringá, PR, Brazil
| | | | | | | | | |
Collapse
|
6
|
Zhang Z, Wang L. A simulation framework for reciprocal recurrent selection-based hybrid breeding under transparent and opaque simulators. FRONTIERS IN PLANT SCIENCE 2023; 14:1174168. [PMID: 37441181 PMCID: PMC10333587 DOI: 10.3389/fpls.2023.1174168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 06/02/2023] [Indexed: 07/15/2023]
Abstract
Hybrid breeding is an established and effective process to improve offspring performance, while it is resource-intensive and time-consuming for the recurrent process in reality. To enable breeders and researchers to evaluate the effectiveness of competing decision-making strategies, we present a modular simulation framework for reciprocal recurrent selection-based hybrid breeding. Consisting of multiple modules such as heterotic separation, genomic prediction, and genomic selection, this simulation framework allows breeders to efficiently simulate the hybrid breeding process with multiple options of simulators and decision-making strategies. We also integrate the recently proposed concepts of transparent and opaque simulators into the framework in order to reflect the breeding process more realistically. Simulation results show the performance comparison among different breeding strategies under the two simulators.
Collapse
Affiliation(s)
- Zerui Zhang
- Program of Bioinformatics and Computational Biology, Iowa State University, Ames, IA, United States
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States
- Department of Statistics, Iowa State University, Ames, IA, United States
| | - Lizhi Wang
- Program of Bioinformatics and Computational Biology, Iowa State University, Ames, IA, United States
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States
| |
Collapse
|
7
|
Iqbal M, Semagn K, Jarquin D, Randhawa H, McCallum BD, Howard R, Aboukhaddour R, Ciechanowska I, Strenzke K, Crossa J, Céron-Rojas JJ, N’Diaye A, Pozniak C, Spaner D. Identification of Disease Resistance Parents and Genome-Wide Association Mapping of Resistance in Spring Wheat. PLANTS (BASEL, SWITZERLAND) 2022; 11:2905. [PMID: 36365358 PMCID: PMC9658635 DOI: 10.3390/plants11212905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 10/03/2022] [Accepted: 10/25/2022] [Indexed: 06/16/2023]
Abstract
The likelihood of success in developing modern cultivars depend on multiple factors, including the identification of suitable parents to initiate new crosses, and characterizations of genomic regions associated with target traits. The objectives of the present study were to (a) determine the best economic weights of four major wheat diseases (leaf spot, common bunt, leaf rust, and stripe rust) and grain yield for multi-trait restrictive linear phenotypic selection index (RLPSI), (b) select the top 10% cultivars and lines (hereafter referred as genotypes) with better resistance to combinations of the four diseases and acceptable grain yield as potential parents, and (c) map genomic regions associated with resistance to each disease using genome-wide association study (GWAS). A diversity panel of 196 spring wheat genotypes was evaluated for their reaction to stripe rust at eight environments, leaf rust at four environments, leaf spot at three environments, common bunt at two environments, and grain yield at five environments. The panel was genotyped with the Wheat 90K SNP array and a few KASP SNPs of which we used 23,342 markers for statistical analyses. The RLPSI analysis performed by restricting the expected genetic gain for yield displayed significant (p < 0.05) differences among the 3125 economic weights. Using the best four economic weights, a subset of 22 of the 196 genotypes were selected as potential parents with resistance to the four diseases and acceptable grain yield. GWAS identified 37 genomic regions, which included 12 for common bunt, 13 for leaf rust, 5 for stripe rust, and 7 for leaf spot. Each genomic region explained from 6.6 to 16.9% and together accounted for 39.4% of the stripe rust, 49.1% of the leaf spot, 94.0% of the leaf rust, and 97.9% of the common bunt phenotypic variance combined across all environments. Results from this study provide valuable information for wheat breeders selecting parental combinations for new crosses to develop improved germplasm with enhanced resistance to the four diseases as well as the physical positions of genomic regions that confer resistance, which facilitates direct comparisons for independent mapping studies in the future.
Collapse
Affiliation(s)
- Muhammad Iqbal
- Department of Agricultural, Food and Nutritional Science, University of Alberta, 4–10 Agriculture-Forestry Centre, Edmonton, AB T6G 2P5, Canada
| | - Kassa Semagn
- Department of Agricultural, Food and Nutritional Science, University of Alberta, 4–10 Agriculture-Forestry Centre, Edmonton, AB T6G 2P5, Canada
| | - Diego Jarquin
- Agronomy Department, University of Florida, Gainesville, FL 32611, USA
| | - Harpinder Randhawa
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403 1st Avenue South, Lethbridge, AB T1J 4B1, Canada
| | - Brent D. McCallum
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, MB R6M 1Y5, Canada
| | - Reka Howard
- Department of Statistics, University of Nebraska—Lincoln, Lincoln, NE 68583, USA
| | - Reem Aboukhaddour
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403 1st Avenue South, Lethbridge, AB T1J 4B1, Canada
| | - Izabela Ciechanowska
- Department of Agricultural, Food and Nutritional Science, University of Alberta, 4–10 Agriculture-Forestry Centre, Edmonton, AB T6G 2P5, Canada
| | - Klaus Strenzke
- Department of Agricultural, Food and Nutritional Science, University of Alberta, 4–10 Agriculture-Forestry Centre, Edmonton, AB T6G 2P5, Canada
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera, Veracruz 52640, Mexico
| | - J. Jesus Céron-Rojas
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Km 45 Carretera, Veracruz 52640, Mexico
| | - Amidou N’Diaye
- Crop Development Centre and Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada
| | - Curtis Pozniak
- Crop Development Centre and Department of Plant Sciences, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK S7N 5A8, Canada
| | - Dean Spaner
- Department of Agricultural, Food and Nutritional Science, University of Alberta, 4–10 Agriculture-Forestry Centre, Edmonton, AB T6G 2P5, Canada
| |
Collapse
|
8
|
Li Y, Shi F, Lin Z, Robinson H, Moody D, Rattey A, Godoy J, Mullan D, Keeble-Gagnere G, Hayden MJ, Tibbits JFG, Daetwyler HD. Benefit of Introgression Depends on Level of Genetic Trait Variation in Cereal Breeding Programmes. FRONTIERS IN PLANT SCIENCE 2022; 13:786452. [PMID: 35783964 PMCID: PMC9240786 DOI: 10.3389/fpls.2022.786452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 05/19/2022] [Indexed: 06/15/2023]
Abstract
We investigated the benefit from introgression of external lines into a cereal breeding programme and strategies that accelerated introgression of the favourable alleles while minimising linkage drag using stochastic computer simulation. We simulated genomic selection for disease resistance and grain yield in two environments with a high level of genotype-by-environment interaction (G × E) for the latter trait, using genomic data of a historical barley breeding programme as the base generation. Two populations (existing and external) were created from this base population with different allele frequencies for few (N = 10) major and many (N ~ 990) minor simulated disease quantitative trait loci (QTL). The major disease QTL only existed in the external population and lines from the external population were introgressed into the existing population which had minor disease QTL with low, medium and high allele frequencies. The study revealed that the benefit of introgression depended on the level of genetic variation for the target trait in the existing cereal breeding programme. Introgression of external resources into the existing population was beneficial only when the existing population lacked variation in disease resistance or when minor disease QTL were already at medium or high frequency. When minor disease QTL were at low frequencies, no extra genetic gain was achieved from introgression. More benefit in the disease trait was obtained from the introgression if the major disease QTL had larger effect sizes, more selection emphasis was applied on disease resistance, or more external lines were introgressed. While our strategies to increase introgression of major disease QTL were generally successful, most were not able to completely avoid negative impacts on selection for grain yield with the only exception being when major introgression QTL effects were very large. Breeding programmes are advised to carefully consider the level of genetic variation in a trait available in their breeding programme before deciding to introgress germplasms.
Collapse
Affiliation(s)
- Yongjun Li
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Fan Shi
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | - Zibei Lin
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
| | | | | | | | | | | | | | - Matthew J. Hayden
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| | | | - Hans D. Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, Bundoora, VIC, Australia
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, Australia
| |
Collapse
|
9
|
Vanavermaete D, Fostier J, Maenhout S, De Baets B. Deep scoping: a breeding strategy to preserve, reintroduce and exploit genetic variation. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:3845-3861. [PMID: 34387711 PMCID: PMC8580937 DOI: 10.1007/s00122-021-03932-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 07/30/2021] [Indexed: 06/13/2023]
Abstract
The deep scoping method incorporates the use of a gene bank together with different population layers to reintroduce genetic variation into the breeding population, thus maximizing the long-term genetic gain without reducing the short-term genetic gain or increasing the total financial cost. Genomic prediction is often combined with truncation selection to identify superior parental individuals that can pass on favorable quantitative trait locus (QTL) alleles to their offspring. However, truncation selection reduces genetic variation within the breeding population, causing a premature convergence to a sub-optimal genetic value. In order to also increase genetic gain in the long term, different methods have been proposed that better preserve genetic variation. However, when the genetic variation of the breeding population has already been reduced as a result of prior intensive selection, even those methods will not be able to avert such premature convergence. Pre-breeding provides a solution for this problem by reintroducing genetic variation into the breeding population. Unfortunately, as pre-breeding often relies on a separate breeding population to increase the genetic value of wild specimens before introducing them in the elite population, it comes with an increased financial cost. In this paper, on the basis of a simulation study, we propose a new method that reintroduces genetic variation in the breeding population on a continuous basis without the need for a separate pre-breeding program or a larger population size. This way, we are able to introduce favorable QTL alleles into an elite population and maximize the genetic gain in the short as well as in the long term without increasing the financial cost.
Collapse
Affiliation(s)
- David Vanavermaete
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, B-9000, Ghent, Belgium.
| | - Jan Fostier
- IDLab, Department of Information Technology, Ghent University - imec, B-9052, Ghent, Belgium
| | | | - Bernard De Baets
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, B-9000, Ghent, Belgium
| |
Collapse
|
10
|
Han Y, Cameron JN, Wang L, Pham H, Beavis WD. Dynamic Programming for Resource Allocation in Multi-Allelic Trait Introgression. FRONTIERS IN PLANT SCIENCE 2021; 12:544854. [PMID: 34220873 PMCID: PMC8253225 DOI: 10.3389/fpls.2021.544854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 05/21/2021] [Indexed: 06/13/2023]
Abstract
Trait introgression is a complex process that plant breeders use to introduce desirable alleles from one variety or species to another. Two of the major types of decisions that must be made during this sophisticated and uncertain workflow are: parental selection and resource allocation. We formulated the trait introgression problem as an engineering process and proposed a Markov Decision Processes (MDP) model to optimize the resource allocation procedure. The efficiency of the MDP model was compared with static resource allocation strategies and their trade-offs among budget, deadline, and probability of success are demonstrated. Simulation results suggest that dynamic resource allocation strategies from the MDP model significantly improve the efficiency of the trait introgression by allocating the right amount of resources according to the genetic outcome of previous generations.
Collapse
Affiliation(s)
- Ye Han
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States
| | - John N. Cameron
- Department of Agronomy, Iowa State University, Ames, IA, United States
| | - Lizhi Wang
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States
| | - Hieu Pham
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, United States
| | - William D. Beavis
- Department of Agronomy, Iowa State University, Ames, IA, United States
| |
Collapse
|
11
|
Amini F, Franco FR, Hu G, Wang L. The look ahead trace back optimizer for genomic selection under transparent and opaque simulators. Sci Rep 2021; 11:4124. [PMID: 33602979 PMCID: PMC7893003 DOI: 10.1038/s41598-021-83567-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/02/2021] [Indexed: 11/29/2022] Open
Abstract
Recent advances in genomic selection (GS) have demonstrated the importance of not only the accuracy of genomic prediction but also the intelligence of selection strategies. The look ahead selection algorithm, for example, has been found to significantly outperform the widely used truncation selection approach in terms of genetic gain, thanks to its strategy of selecting breeding parents that may not necessarily be elite themselves but have the best chance of producing elite progeny in the future. This paper presents the look ahead trace back algorithm as a new variant of the look ahead approach, which introduces several improvements to further accelerate genetic gain especially under imperfect genomic prediction. Perhaps an even more significant contribution of this paper is the design of opaque simulators for evaluating the performance of GS algorithms. These simulators are partially observable, explicitly capture both additive and non-additive genetic effects, and simulate uncertain recombination events more realistically. In contrast, most existing GS simulation settings are transparent, either explicitly or implicitly allowing the GS algorithm to exploit certain critical information that may not be possible in actual breeding programs. Comprehensive computational experiments were carried out using a maize data set to compare a variety of GS algorithms under four simulators with different levels of opacity. These results reveal how differently a same GS algorithm would interact with different simulators, suggesting the need for continued research in the design of more realistic simulators. As long as GS algorithms continue to be trained in silico rather than in planta, the best way to avoid disappointing discrepancy between their simulated and actual performances may be to make the simulator as akin to the complex and opaque nature as possible.
Collapse
Affiliation(s)
- Fatemeh Amini
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, 50011, USA
| | - Felipe Restrepo Franco
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, 50011, USA
| | - Guiping Hu
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, 50011, USA
| | - Lizhi Wang
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
12
|
A look-ahead Monte Carlo simulation method for improving parental selection in trait introgression. Sci Rep 2021; 11:3918. [PMID: 33594238 PMCID: PMC7887201 DOI: 10.1038/s41598-021-83634-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 01/14/2021] [Indexed: 11/08/2022] Open
Abstract
Multiple trait introgression is the process by which multiple desirable traits are converted from a donor to a recipient cultivar through backcrossing and selfing. The goal of this procedure is to recover all the attributes of the recipient cultivar, with the addition of the specified desirable traits. A crucial step in this process is the selection of parents to form new crosses. In this study, we propose a new selection approach that estimates the genetic distribution of the progeny of backcrosses after multiple generations using information of recombination events. Our objective is to select the most promising individuals for further backcrossing or selfing. To demonstrate the effectiveness of the proposed method, a case study has been conducted using maize data where our method is compared with state-of-the-art approaches. Simulation results suggest that the proposed method, look-ahead Monte Carlo, achieves higher probability of success than existing approaches. Our proposed selection method can assist breeders to efficiently design trait introgression projects.
Collapse
|
13
|
Villar-Hernández BDJ, Pérez-Elizalde S, Martini JWR, Toledo F, Perez-Rodriguez P, Krause M, García-Calvillo ID, Covarrubias-Pazaran G, Crossa J. Application of multi-trait Bayesian decision theory for parental genomic selection. G3-GENES GENOMES GENETICS 2021; 11:6104551. [PMID: 33693601 PMCID: PMC8022966 DOI: 10.1093/g3journal/jkab012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 01/04/2021] [Indexed: 12/01/2022]
Abstract
In all breeding programs, the decision about which individuals to select and intermate to form the next selection cycle is crucial. The improvement of genetic stocks requires considering multiple traits simultaneously, given that economic value and net genetic merits depend on many traits; therefore, with the advance of computational and statistical tools and genomic selection (GS), researchers are focusing on multi-trait selection. Selection of the best individuals is difficult, especially in traits that are antagonistically correlated, where improvement in one trait might imply a reduction in other(s). There are approaches that facilitate multi-trait selection, and recently a Bayesian decision theory (BDT) has been proposed. Parental selection using BDT has the potential to be effective in multi-trait selection given that it summarizes all relevant quantitative genetic concepts such as heritability, response to selection and the structure of dependence between traits (correlation). In this study, we applied BDT to provide a treatment for the complexity of multi-trait parental selection using three multivariate loss functions (LF), Kullback–Leibler (KL), Energy Score, and Multivariate Asymmetric Loss (MALF), to select the best-performing parents for the next breeding cycle in two extensive real wheat data sets. Results show that the high ranking lines in genomic estimated breeding value (GEBV) for certain traits did not always have low values for the posterior expected loss (PEL). For both data sets, the KL LF gave similar importance to all traits including grain yield. In contrast, the Energy Score and MALF gave a better performance in three of four traits that were different than grain yield. The BDT approach should help breeders to decide based not only on the GEBV per se of the parent to be selected, but also on the level of uncertainty according to the Bayesian paradigm.
Collapse
Affiliation(s)
- Bartolo de Jesús Villar-Hernández
- Colegio de Postgraduados, Montecillos, Edo. de Mexico, CP 56264,Mexico.,Universidad Autonoma de Coahuila, Saltillo, CP 25280, Mexico
| | | | - Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT). Km 45 Carretera México-Veracruz, El Batán Km. 45, CP 56237 Mexico
| | - Fernando Toledo
- International Maize and Wheat Improvement Center (CIMMYT). Km 45 Carretera México-Veracruz, El Batán Km. 45, CP 56237 Mexico
| | - P Perez-Rodriguez
- Colegio de Postgraduados, Montecillos, Edo. de Mexico, CP 56264,Mexico
| | - Margaret Krause
- International Maize and Wheat Improvement Center (CIMMYT). Km 45 Carretera México-Veracruz, El Batán Km. 45, CP 56237 Mexico
| | | | - Giovanny Covarrubias-Pazaran
- International Maize and Wheat Improvement Center (CIMMYT). Km 45 Carretera México-Veracruz, El Batán Km. 45, CP 56237 Mexico
| | - José Crossa
- Colegio de Postgraduados, Montecillos, Edo. de Mexico, CP 56264,Mexico.,International Maize and Wheat Improvement Center (CIMMYT). Km 45 Carretera México-Veracruz, El Batán Km. 45, CP 56237 Mexico
| |
Collapse
|
14
|
Origin Specific Genomic Selection: A Simple Process To Optimize the Favorable Contribution of Parents to Progeny. G3-GENES GENOMES GENETICS 2020; 10:2445-2455. [PMID: 32430306 PMCID: PMC7341124 DOI: 10.1534/g3.120.401132] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Modern crop breeding is in constant demand for new genetic diversity as part of the arms race with genetic gain. The elite gene pool has limited genetic variation and breeders are trying to introduce novelty from unadapted germplasm, landraces and wild relatives. For polygenic traits, currently available approaches to introgression are not ideal, as there is a demonstrable bias against exotic alleles during selection. Here, we propose a partitioned form of genomic selection, called Origin Specific Genomic Selection (OSGS), where we identify and target selection on favorable exotic alleles. Briefly, within a population derived from a bi-parental cross, we isolate alleles originating from the elite and exotic parents, which then allows us to separate out the predicted marker effects based on the allele origins. We validated the usefulness of OSGS using two nested association mapping (NAM) datasets: barley NAM (elite-exotic) and maize NAM (elite-elite), as well as by computer simulation. Our results suggest that OSGS works well in its goal to increase the contribution of favorable exotic alleles in bi-parental crosses, and it is possible to extend the approach to broader multi-parental populations.
Collapse
|
15
|
Allier A, Teyssèdre S, Lehermeier C, Moreau L, Charcosset A. Optimized breeding strategies to harness genetic resources with different performance levels. BMC Genomics 2020; 21:349. [PMID: 32393177 PMCID: PMC7216646 DOI: 10.1186/s12864-020-6756-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 04/23/2020] [Indexed: 11/10/2022] Open
Abstract
Background The narrow genetic base of elite germplasm compromises long-term genetic gain and increases the vulnerability to biotic and abiotic stresses in unpredictable environmental conditions. Therefore, an efficient strategy is required to broaden the genetic base of commercial breeding programs while not compromising short-term variety release. Optimal cross selection aims at identifying the optimal set of crosses that balances the expected genetic value and diversity. We propose to consider genomic selection and optimal cross selection to recurrently improve genetic resources (i.e. pre-breeding), to bridge the improved genetic resources with elites (i.e. bridging), and to manage introductions into the elite breeding population. Optimal cross selection is particularly adapted to jointly identify bridging, introduction and elite crosses to ensure an overall consistency of the genetic base broadening strategy. Results We compared simulated breeding programs introducing donors with different performance levels, directly or indirectly after bridging. We also evaluated the effect of the training set composition on the success of introductions. We observed that with recurrent introductions of improved donors, it is possible to maintain the genetic diversity and increase mid- and long-term performances with only limited penalty at short-term. Considering a bridging step yielded significantly higher mid- and long-term genetic gain when introducing low performing donors. The results also suggested to consider marker effects estimated with a broad training population including donor by elite and elite by elite progeny to identify bridging, introduction and elite crosses. Conclusion Results of this study provide guidelines on how to harness polygenic variation present in genetic resources to broaden elite germplasm.
Collapse
Affiliation(s)
- Antoine Allier
- GQE - Le Moulon, INRAE, University Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France. .,RAGT2n, Statistical Genetics Unit, 12510, Druelle, France.
| | | | | | - Laurence Moreau
- GQE - Le Moulon, INRAE, University Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Alain Charcosset
- GQE - Le Moulon, INRAE, University Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
16
|
Allier A, Teyssèdre S, Lehermeier C, Charcosset A, Moreau L. Genomic prediction with a maize collaborative panel: identification of genetic resources to enrich elite breeding programs. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:201-215. [PMID: 31595338 DOI: 10.1007/s00122-019-03451-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Accepted: 09/28/2019] [Indexed: 05/02/2023]
Abstract
Collaborative diversity panels and genomic prediction seem relevant to identify and harness genetic resources for polygenic trait-specific enrichment of elite germplasms. In plant breeding, genetic diversity is important to maintain the pace of genetic gain and the ability to respond to new challenges in a context of climatic and social expectation changes. Many genetic resources are accessible to breeders but cannot all be considered for broadening the genetic diversity of elite germplasm. This study presents the use of genomic predictions trained on a collaborative diversity panel, which assembles genetic resources and elite lines, to identify resources to enrich an elite germplasm. A maize collaborative panel (386 lines) was considered to estimate genome-wide marker effects. Relevant predictive abilities (0.40-0.55) were observed on a large population of private elite materials, which supported the interest of such a collaborative panel for diversity management perspectives. Grain-yield estimated marker effects were used to select a donor that best complements an elite recipient at individual loci or haplotype segments, or that is expected to give the best-performing progeny with the elite. Among existing and new criteria that were compared, some gave more weight to the donor-elite complementarity than to the donor value, and appeared more adapted to long-term objective. We extended this approach to the selection of a set of donors complementing an elite population. We defined a crossing plan between identified donors and elite recipients. Our results illustrated how collaborative projects based on diversity panels including both public resources and elite germplasm can contribute to a better characterization of genetic resources in view of their use to enrich elite germplasm.
Collapse
Affiliation(s)
- Antoine Allier
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
- RAGT2n, Genetics and Analytics Unit, 12510, Druelle, France
| | | | | | - Alain Charcosset
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Laurence Moreau
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
17
|
Optimizing Selection and Mating in Genomic Selection with a Look-Ahead Approach: An Operations Research Framework. G3-GENES GENOMES GENETICS 2019; 9:2123-2133. [PMID: 31109922 PMCID: PMC6643893 DOI: 10.1534/g3.118.200842] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
New genotyping technologies have made large amounts of genotypic data available for plant breeders to use in their efforts to accelerate the rate of genetic gain. Genomic selection (GS) techniques allow breeders to use genotypic data to identify and select, for example, plants predicted to exhibit drought tolerance, thereby saving expensive and limited field-testing resources relative to phenotyping all plants within a population. A major limitation of existing GS approaches is the trade-off between short-term genetic gain and long-term potential. Some approaches focus on achieving short-term genetic gain at the cost of reduced genetic diversity necessary for long-term gains. In contrast, others compromise short-term progress to preserve long-term potential without consideration of the time and resources required to achieve it. Our contribution is to define a new “look-ahead” metric for assessing selection decisions, which evaluates the probability of achieving high genetic gains by a specific time with limited resources. Moreover, we propose a heuristic algorithm to identify optimal selection decisions that maximize the look-ahead metric. Simulation results demonstrate that look-ahead selection outperforms other published selection methods.
Collapse
|
18
|
Usefulness Criterion and Post-selection Parental Contributions in Multi-parental Crosses: Application to Polygenic Trait Introgression. G3-GENES GENOMES GENETICS 2019; 9:1469-1479. [PMID: 30819823 PMCID: PMC6505154 DOI: 10.1534/g3.119.400129] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Predicting the usefulness of crosses in terms of expected genetic gain and genetic diversity is of interest to secure performance in the progeny and to maintain long-term genetic gain in plant breeding. A wide range of crossing schemes are possible including large biparental crosses, backcrosses, four-way crosses, and synthetic populations. In silico progeny simulations together with genome-based prediction of quantitative traits can be used to guide mating decisions. However, the large number of multi-parental combinations can hinder the use of simulations in practice. Analytical solutions have been proposed recently to predict the distribution of a quantitative trait in the progeny of biparental crosses using information of recombination frequency and linkage disequilibrium between loci. Here, we extend this approach to obtain the progeny distribution of more complex crosses including two to four parents. Considering agronomic traits and parental genome contribution as jointly multivariate normally distributed traits, the usefulness criterion parental contribution (UCPC) enables to (i) evaluate the expected genetic gain for agronomic traits, and at the same time (ii) evaluate parental genome contributions to the selected fraction of progeny. We validate and illustrate UCPC in the context of multiple allele introgression from a donor into one or several elite recipients in maize (Zea mays L.). Recommendations regarding the interest of two-way, three-way, and backcrosses were derived depending on the donor performance. We believe that the computationally efficient UCPC approach can be useful for mate selection and allocation in many plant and animal breeding contexts.
Collapse
|
19
|
Andorf C, Beavis WD, Hufford M, Smith S, Suza WP, Wang K, Woodhouse M, Yu J, Lübberstedt T. Technological advances in maize breeding: past, present and future. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:817-849. [PMID: 30798332 DOI: 10.1007/s00122-019-03306-3] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 02/05/2019] [Indexed: 05/18/2023]
Abstract
Maize has for many decades been both one of the most important crops worldwide and one of the primary genetic model organisms. More recently, maize breeding has been impacted by rapid technological advances in sequencing and genotyping technology, transformation including genome editing, doubled haploid technology, parallelled by progress in data sciences and the development of novel breeding approaches utilizing genomic information. Herein, we report on past, current and future developments relevant for maize breeding with regard to (1) genome analysis, (2) germplasm diversity characterization and utilization, (3) manipulation of genetic diversity by transformation and genome editing, (4) inbred line development and hybrid seed production, (5) understanding and prediction of hybrid performance, (6) breeding methodology and (7) synthesis of opportunities and challenges for future maize breeding.
Collapse
Affiliation(s)
| | - William D Beavis
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA
| | - Matthew Hufford
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011-1010, USA
| | - Stephen Smith
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA
| | - Walter P Suza
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA
| | - Kan Wang
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA
| | | | - Jianming Yu
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA
| | - Thomas Lübberstedt
- Department of Agronomy, Iowa State University, Agronomy Hall, Ames, IA, 50011-1010, USA.
| |
Collapse
|
20
|
Abstract
Plant and animal breeders are interested in selecting the best individuals from a candidate set for the next breeding cycle. In this paper, we propose a formal method under the Bayesian decision theory framework to tackle the selection problem based on genomic selection (GS) in single- and multi-trait settings. We proposed and tested three univariate loss functions (Kullback-Leibler, KL; Continuous Ranked Probability Score, CRPS; Linear-Linear loss, LinLin) and their corresponding multivariate generalizations (Kullback-Leibler, KL; Energy Score, EnergyS; and the Multivariate Asymmetric Loss Function, MALF). We derived and expressed all the loss functions in terms of heritability and tested them on a real wheat dataset for one cycle of selection and in a simulated selection program. The performance of each univariate loss function was compared with the standard method of selection (Std) that does not use loss functions. We compared the performance in terms of the selection response and the decrease in the population’s genetic variance during recurrent breeding cycles. Results suggest that it is possible to obtain better performance in a long-term breeding program using the single-trait scheme by selecting 30% of the best individuals in each cycle but not by selecting 10% of the best individuals. For the multi-trait approach, results show that the population mean for all traits under consideration had positive gains, even though two of the traits were negatively correlated. The corresponding population variances were not statistically different from the different loss function during the 10th selection cycle. Using the loss function should be a useful criterion when selecting the candidates for selection for the next breeding cycle.
Collapse
|
21
|
Selection on Expected Maximum Haploid Breeding Values Can Increase Genetic Gain in Recurrent Genomic Selection. G3-GENES GENOMES GENETICS 2018; 8:1173-1181. [PMID: 29434032 PMCID: PMC5873908 DOI: 10.1534/g3.118.200091] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Genomic selection (GS) offers the possibility to estimate the effects of genome-wide molecular markers, which can be used to calculate genomic estimated breeding values (GEBVs) for individuals without phenotypes. GEBVs can serve as a selection criterion in recurrent GS, maximizing single-cycle but not necessarily long-term genetic gain. As simple genome-wide sums, GEBVs do not take into account other genomic information, such as the map positions of loci and linkage phases of alleles. Therefore, we herein propose a novel selection criterion called expected maximum haploid breeding value (EMBV). EMBV predicts the expected performance of the best among a limited number of gametes that a candidate contributes to the next generation, if selected. We used simulations to examine the performance of EMBV in comparison with GEBV as well as the recently proposed criterion optimal haploid value (OHV) and weighted GS. We considered different population sizes, numbers of selected candidates, chromosome numbers and levels of dominant gene action. Criterion EMBV outperformed GEBV after about 5 selection cycles, achieved higher long-term genetic gain and maintained higher diversity in the population. The other selection criteria showed the potential to surpass both GEBV and EMBV in advanced cycles of the breeding program, but yielded substantially lower genetic gain in early to intermediate cycles, which makes them unattractive for practical breeding. Moreover, they were largely inferior in scenarios with dominant gene action. Overall, EMBV shows high potential to be a promising alternative selection criterion to GEBV for recurrent genomic selection.
Collapse
|
22
|
Osthushenrich T, Frisch M, Zenke-Philippi C, Jaiser H, Spiller M, Cselényi L, Krumnacker K, Boxberger S, Kopahnke D, Habekuß A, Ordon F, Herzog E. Prediction of Means and Variances of Crosses With Genome-Wide Marker Effects in Barley. FRONTIERS IN PLANT SCIENCE 2018; 9:1899. [PMID: 30627135 PMCID: PMC6309237 DOI: 10.3389/fpls.2018.01899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 12/07/2018] [Indexed: 05/10/2023]
Abstract
Background: The expected genetic variance is an important criterion for the selection of crossing partners which will produce superior combinations of genotypes in their progeny. The advent of molecular markers has opened up new vistas for obtaining precise predictors for the genetic variance of a cross, but fast prediction methods that allow plant breeders to select crossing partners based on already available data from their breeding programs without complicated calculations or simulation of breeding populations are still lacking. The main objective of the present study was to demonstrate the practical applicability of an analytical approach for the selection of superior cross combinations with experimental data from a barley breeding program. We used genome-wide marker effects to predict the yield means and genetic variances of 14 DH families resulting from crosses of four donor lines with five registered elite varieties with the genotypic information of the parental lines. For the validation of the predicted parameters, the analytical approach was extended by the masking variance as a major component of phenotypic variance. The predicted parameters were used to fit normal distribution curves of the phenotypic values and to conduct an Anderson-Darling goodness-of-fit test for the observed phenotypic data of the 14 DH families from the field trial. Results: There was no evidence that the observed phenotypic values deviated from the predicted phenotypic normal distributions in 13 out of 14 crosses. The correlations between the observed and the predicted means and the observed and predicted variances were r = 0.95 and r = 0.34, respectively. After removing two crosses with downward outliers in the phenotypic data, the correlation between the observed and predicted variances increased to r = 0.76. A ranking of the 14 crosses based on the sum of predicted mean and genetic variance identified the 50% best crosses from the field trial correctly. Conclusions: We conclude that the prediction accuracy of the presented approach is sufficiently high to identify superior crosses even with limited phenotypic data. We therefore expect that the analytical approach based on genome-wide marker effects is applicable in a wide range of breeding programs.
Collapse
Affiliation(s)
- Tanja Osthushenrich
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Matthias Frisch
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Carola Zenke-Philippi
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
| | - Heidi Jaiser
- Saatzucht Josef Breun GmbH & Co. KG, Herzogenaurach, Germany
| | | | - László Cselényi
- W. von Borries-Eckendorf GmbH & Co. KG, Leopoldshöhe, Germany
| | | | | | - Doris Kopahnke
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Quedlinburg, Germany
| | - Antje Habekuß
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Quedlinburg, Germany
| | - Frank Ordon
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Quedlinburg, Germany
| | - Eva Herzog
- Institute of Agronomy and Plant Breeding II, Justus Liebig University, Gießen, Germany
- *Correspondence: Eva Herzog
| |
Collapse
|
23
|
Genetic Gain Increases by Applying the Usefulness Criterion with Improved Variance Prediction in Selection of Crosses. Genetics 2017; 207:1651-1661. [PMID: 29038144 DOI: 10.1534/genetics.117.300403] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 10/10/2017] [Indexed: 11/18/2022] Open
Abstract
A crucial step in plant breeding is the selection and combination of parents to form new crosses. Genome-based prediction guides the selection of high-performing parental lines in many crop breeding programs which ensures a high mean performance of progeny. To warrant maximum selection progress, a new cross should also provide a large progeny variance. The usefulness concept as measure of the gain that can be obtained from a specific cross accounts for variation in progeny variance. Here, it is shown that genetic gain can be considerably increased when crosses are selected based on their genomic usefulness criterion compared to selection based on mean genomic estimated breeding values. An efficient and improved method to predict the genetic variance of a cross based on Markov chain Monte Carlo samples of marker effects from a whole-genome regression model is suggested. In simulations representing selection procedures in crop breeding programs, the performance of this novel approach is compared with existing methods, like selection based on mean genomic estimated breeding values and optimal haploid values. In all cases, higher genetic gain was obtained compared with previously suggested methods. When 1% of progenies per cross were selected, the genetic gain based on the estimated usefulness criterion increased by 0.14 genetic standard deviation compared to a selection based on mean genomic estimated breeding values. Analytical derivations of the progeny genotypic variance-covariance matrix based on parental genotypes and genetic map information make simulations of progeny dispensable, and allow fast implementation in large-scale breeding programs.
Collapse
|
24
|
Improving Response in Genomic Selection with a Population-Based Selection Strategy: Optimal Population Value Selection. Genetics 2017; 206:1675-1682. [PMID: 28526698 DOI: 10.1534/genetics.116.197103] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2016] [Accepted: 05/15/2017] [Indexed: 12/18/2022] Open
Abstract
Genomic selection (GS) identifies individuals for inclusion in breeding programs based on the sum of their estimated marker effects or genomic estimated breeding values (GEBVs). Due to significant correlation between GEBVs and true breeding values, this has resulted in enhanced rates of genetic gain as compared to traditional methods of selection. Three extensions to GS, weighted genomic selection (WGS), optimal haploid value (OHV) selection, and genotype building (GB) selection have been proposed to improve long-term response, and to facilitate the efficient development of doubled haploids. In separate simulation studies, these methods were shown to outperform GS under various assumptions. However, further potential for improvement exists. In this paper, optimal population value (OPV) selection is introduced as selection based on the maximum possible haploid value in a subset of the population. Instead of evaluating the breeding merit of individuals as in GS, WGS, and OHV selection, the proposed method evaluates the breeding merit of a set of individuals as in GB. After testing these selection methods extensively, OPV and GB selection were found to achieve greater responses than GS, WGS, and OHV, with OPV outperforming GB across most percentiles. These results suggest a new paradigm for selection methods in which an individual's value is dependent upon its complementarity with others.
Collapse
|