1
|
Bančič J, Odeny DA, Ojulong HF, Josiah SM, Buntjer J, Gaynor RC, Hoad SP, Gorjanc G, Dawson IK. Genomic and phenotypic characterization of finger millet indicates a complex diversification history. Plant Genome 2024; 17:e20392. [PMID: 37986545 DOI: 10.1002/tpg2.20392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 07/10/2023] [Accepted: 08/16/2023] [Indexed: 11/22/2023]
Abstract
Advances in sequencing technologies mean that insights into crop diversification can now be explored in crops beyond major staples. We use a genome assembly of finger millet, an allotetraploid orphan crop, to analyze DArTseq single nucleotide polymorphisms (SNPs) at the whole and sub-genome level. A set of 8778 SNPs and 13 agronomic traits was used to characterize a diverse panel of 423 landraces from Africa and Asia. Through principal component analysis (PCA) and discriminant analysis of principal components, four distinct groups of accessions were identified that coincided with the primary geographic regions of finger millet cultivation. Notably, East Africa, presumed to be the crop's origin, exhibited the lowest genetic diversity. The PCA of phenotypic data also revealed geographic differentiation, albeit with differing relationships among geographic areas than indicated with genomic data. Further exploration of the sub-genomes A and B using neighbor-joining trees revealed distinct features that provide supporting evidence for the complex evolutionary history of finger millet. Although genome-wide association study found only a limited number of significant marker-trait associations, a clustering approach based on the distribution of marker effects obtained from a ridge regression genomic model was employed to investigate trait complexity. This analysis uncovered two distinct clusters. Overall, the findings suggest that finger millet has undergone complex and context-specific diversification, indicative of a lengthy domestication history. These analyses provide insights for the future development of finger millet.
Collapse
Affiliation(s)
- Jon Bančič
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, UK
- Scotland's Rural College (SRUC), Kings Buildings, Edinburgh, UK
| | - Damaris A Odeny
- International Crops Research Institute for the Semi-Arid Tropics, ICRAF House, Gigiri Nairobi, Kenya
| | - Henry F Ojulong
- International Crops Research Institute for the Semi-Arid Tropics, ICRAF House, Gigiri Nairobi, Kenya
| | - Samuel M Josiah
- Department of Horticulture, University of Georgia, Athens, Georgia, USA
| | - Jaap Buntjer
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, UK
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, UK
| | - Stephen P Hoad
- Scotland's Rural College (SRUC), Kings Buildings, Edinburgh, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, UK
| | - Ian K Dawson
- Scotland's Rural College (SRUC), Kings Buildings, Edinburgh, UK
| |
Collapse
|
2
|
Tolhurst DJ, Gaynor RC, Gardunia B, Hickey JM, Gorjanc G. Correction to: Genomic selection using random regressions on known and latent environmental covariates. Theor Appl Genet 2023; 136:178. [PMID: 37542182 PMCID: PMC10403425 DOI: 10.1007/s00122-023-04417-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/06/2023]
Affiliation(s)
- Daniel J Tolhurst
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, UK.
| | | | | | - John M Hickey
- Corn Product Design, Bayer CropScience, Barcelona, Spain
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, UK
| |
Collapse
|
3
|
Pocrnic I, Obšteter J, Gaynor RC, Wolc A, Gorjanc G. Assessment of long-term trends in genetic mean and variance after the introduction of genomic selection in layers: a simulation study. Front Genet 2023; 14:1168212. [PMID: 37234871 PMCID: PMC10206274 DOI: 10.3389/fgene.2023.1168212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 05/02/2023] [Indexed: 05/28/2023] Open
Abstract
Nucleus-based breeding programs are characterized by intense selection that results in high genetic gain, which inevitably means reduction of genetic variation in the breeding population. Therefore, genetic variation in such breeding systems is typically managed systematically, for example, by avoiding mating the closest relatives to limit progeny inbreeding. However, intense selection requires maximum effort to make such breeding programs sustainable in the long-term. The objective of this study was to use simulation to evaluate the long-term impact of genomic selection on genetic mean and variance in an intense layer chicken breeding program. We developed a large-scale stochastic simulation of an intense layer chicken breeding program to compare conventional truncation selection to genomic truncation selection optimized with either minimization of progeny inbreeding or full-scale optimal contribution selection. We compared the programs in terms of genetic mean, genic variance, conversion efficiency, rate of inbreeding, effective population size, and accuracy of selection. Our results confirmed that genomic truncation selection has immediate benefits compared to conventional truncation selection in all specified metrics. A simple minimization of progeny inbreeding after genomic truncation selection did not provide any significant improvements. Optimal contribution selection was successful in having better conversion efficiency and effective population size compared to genomic truncation selection, but it must be fine-tuned for balance between loss of genetic variance and genetic gain. In our simulation, we measured this balance using trigonometric penalty degrees between truncation selection and a balanced solution and concluded that the best results were between 45° and 65°. This balance is specific to the breeding program and depends on how much immediate genetic gain a breeding program may risk vs. save for the future. Furthermore, our results show that the persistence of accuracy is better with optimal contribution selection compared to truncation selection. In general, our results show that optimal contribution selection can ensure long-term success in intensive breeding programs using genomic selection.
Collapse
Affiliation(s)
- Ivan Pocrnic
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
| | - Jana Obšteter
- Agricultural Institute of Slovenia, Ljubljana, Slovenia
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
| | - Anna Wolc
- Department of Animal Science, Iowa State University, Ames, IA, United States
- Hy-Line International, Dallas Center, IA, United States
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
4
|
Jackson R, Buntjer JB, Bentley AR, Lage J, Byrne E, Burt C, Jack P, Berry S, Flatman E, Poupard B, Smith S, Hayes C, Barber T, Love B, Gaynor RC, Gorjanc G, Howell P, Mackay IJ, Hickey JM, Ober ES. Phenomic and genomic prediction of yield on multiple locations in winter wheat. Front Genet 2023; 14:1164935. [PMID: 37229190 PMCID: PMC10203586 DOI: 10.3389/fgene.2023.1164935] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 04/20/2023] [Indexed: 05/27/2023] Open
Abstract
Genomic selection has recently become an established part of breeding strategies in cereals. However, a limitation of linear genomic prediction models for complex traits such as yield is that these are unable to accommodate Genotype by Environment effects, which are commonly observed over trials on multiple locations. In this study, we investigated how this environmental variation can be captured by the collection of a large number of phenomic markers using high-throughput field phenotyping and whether it can increase GS prediction accuracy. For this purpose, 44 winter wheat (Triticum aestivum L.) elite populations, comprising 2,994 lines, were grown on two sites over 2 years, to approximate the size of trials in a practical breeding programme. At various growth stages, remote sensing data from multi- and hyperspectral cameras, as well as traditional ground-based visual crop assessment scores, were collected with approximately 100 different data variables collected per plot. The predictive power for grain yield was tested for the various data types, with or without genome-wide marker data sets. Models using phenomic traits alone had a greater predictive value (R2 = 0.39-0.47) than genomic data (approximately R2 = 0.1). The average improvement in predictive power by combining trait and marker data was 6%-12% over the best phenomic-only model, and performed best when data from one full location was used to predict the yield on an entire second location. The results suggest that genetic gain in breeding programmes can be increased by utilisation of large numbers of phenotypic variables using remote sensing in field trials, although at what stage of the breeding cycle phenomic selection could be most profitably applied remains to be answered.
Collapse
Affiliation(s)
- Robert Jackson
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| | - Jaap B. Buntjer
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Scotland, United Kingdom
| | | | - Jacob Lage
- KWS UK Ltd, Thriplow, Royston, Cambridgeshire, United Kingdom
| | - Ed Byrne
- KWS UK Ltd, Thriplow, Royston, Cambridgeshire, United Kingdom
| | - Chris Burt
- RAGT UK, Ickleton, Saffron Walden, Cambridgeshire, United Kingdom
| | - Peter Jack
- RAGT UK, Ickleton, Saffron Walden, Cambridgeshire, United Kingdom
| | - Simon Berry
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, United Kingdom
| | - Edward Flatman
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, United Kingdom
| | - Bruno Poupard
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, United Kingdom
| | - Stephen Smith
- Elsoms Wheat Limited, Spalding, Linconshire, United Kingdom
| | | | - Tobias Barber
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| | - Bethany Love
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Scotland, United Kingdom
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Scotland, United Kingdom
| | - Phil Howell
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| | - Ian J. Mackay
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Scotland, United Kingdom
| | - Eric S. Ober
- The John Bingham Laboratory, NIAB, Cambridge, United Kingdom
| |
Collapse
|
5
|
Werner CR, Gaynor RC, Sargent DJ, Lillo A, Gorjanc G, Hickey JM. Genomic selection strategies for clonally propagated crops. Theor Appl Genet 2023; 136:74. [PMID: 36952013 PMCID: PMC10036424 DOI: 10.1007/s00122-023-04300-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 01/14/2023] [Indexed: 05/27/2023]
Abstract
For genomic selection in clonally propagated crops with diploid (-like) meiotic behavior to be effective, crossing parents should be selected based on genomic predicted cross-performance unless dominance is negligible. For genomic selection (GS) in clonal breeding programs to be effective, parents should be selected based on genomic predicted cross-performance unless dominance is negligible. Genomic prediction of cross-performance enables efficient exploitation of the additive and dominance value simultaneously. Here, we compared different GS strategies for clonally propagated crops with diploid (-like) meiotic behavior, using strawberry as an example. We used stochastic simulation to evaluate six combinations of three breeding programs and two parent selection methods. The three breeding programs included (1) a breeding program that introduced GS in the first clonal stage, and (2) two variations of a two-part breeding program with one and three crossing cycles per year, respectively. The two parent selection methods were (1) parent selection based on genomic estimated breeding values (GEBVs) and (2) parent selection based on genomic predicted cross-performance (GPCP). Selection of parents based on GPCP produced faster genetic gain than selection of parents based on GEBVs because it reduced inbreeding when the dominance degree increased. The two-part breeding programs with one and three crossing cycles per year using GPCP always produced the most genetic gain unless dominance was negligible. We conclude that (1) in clonal breeding programs with GS, parents should be selected based on GPCP, and (2) a two-part breeding program with parent selection based on GPCP to rapidly drive population improvement has great potential to improve breeding clonally propagated crops.
Collapse
Affiliation(s)
- Christian R Werner
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - Daniel J Sargent
- NIAB EMR, New Road, East Malling, Kent, ME19 6BJ, UK
- East Malling Enterprise Centre, Driscoll's Genetics Ltd, New Road, East Malling, Kent, ME19 6BJ, UK
| | - Alessandra Lillo
- East Malling Enterprise Centre, Driscoll's Genetics Ltd, New Road, East Malling, Kent, ME19 6BJ, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| |
Collapse
|
6
|
Tolhurst DJ, Gaynor RC, Gardunia B, Hickey JM, Gorjanc G. Genomic selection using random regressions on known and latent environmental covariates. Theor Appl Genet 2022; 135:3393-3415. [PMID: 36066596 PMCID: PMC9519718 DOI: 10.1007/s00122-022-04186-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 06/28/2022] [Indexed: 05/26/2023]
Abstract
The integration of known and latent environmental covariates within a single-stage genomic selection approach provides breeders with an informative and practical framework to utilise genotype by environment interaction for prediction into current and future environments. This paper develops a single-stage genomic selection approach which integrates known and latent environmental covariates within a special factor analytic framework. The factor analytic linear mixed model of Smith et al. (2001) is an effective method for analysing multi-environment trial (MET) datasets, but has limited practicality since the underlying factors are latent so the modelled genotype by environment interaction (GEI) is observable, rather than predictable. The advantage of using random regressions on known environmental covariates, such as soil moisture and daily temperature, is that the modelled GEI becomes predictable. The integrated factor analytic linear mixed model (IFA-LMM) developed in this paper includes a model for predictable and observable GEI in terms of a joint set of known and latent environmental covariates. The IFA-LMM is demonstrated on a late-stage cotton breeding MET dataset from Bayer CropScience. The results show that the known covariates predominately capture crossover GEI and explain 34.4% of the overall genetic variance. The most notable covariates are maximum downward solar radiation (10.1%), average cloud cover (4.5%) and maximum temperature (4.0%). The latent covariates predominately capture non-crossover GEI and explain 40.5% of the overall genetic variance. The results also show that the average prediction accuracy of the IFA-LMM is [Formula: see text] higher than conventional random regression models for current environments and [Formula: see text] higher for future environments. The IFA-LMM is therefore an effective method for analysing MET datasets which also utilises crossover and non-crossover GEI for genomic prediction into current and future environments. This is becoming increasingly important with the emergence of rapidly changing environments and climate change.
Collapse
Affiliation(s)
- Daniel J Tolhurst
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, United Kingdom.
| | | | | | - John M Hickey
- Corn Product Design, Bayer CropScience, Barcelona, Spain
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, United Kingdom
| |
Collapse
|
7
|
Lara LADC, Pocrnic I, Oliveira TDP, Gaynor RC, Gorjanc G. Temporal and genomic analysis of additive genetic variance in breeding programmes. Heredity (Edinb) 2022; 128:21-32. [PMID: 34912044 PMCID: PMC8733024 DOI: 10.1038/s41437-021-00485-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/24/2021] [Accepted: 11/01/2021] [Indexed: 11/13/2022] Open
Abstract
Genetic variance is a central parameter in quantitative genetics and breeding. Assessing changes in genetic variance over time as well as the genome is therefore of high interest. Here, we extend a previously proposed framework for temporal analysis of genetic variance using the pedigree-based model, to a new framework for temporal and genomic analysis of genetic variance using marker-based models. To this end, we describe the theory of partitioning genetic variance into genic variance and within-chromosome and between-chromosome linkage-disequilibrium, and how to estimate these variance components from a marker-based model fitted to observed phenotype and marker data. The new framework involves three steps: (i) fitting a marker-based model to data, (ii) sampling realisations of marker effects from the fitted model and for each sample calculating realisations of genetic values and (iii) calculating the variance of sampled genetic values by time and genome partitions. Analysing time partitions indicates breeding programme sustainability, while analysing genome partitions indicates contributions from chromosomes and chromosome pairs and linkage-disequilibrium. We demonstrate the framework with a simulated breeding programme involving a complex trait. Results show good concordance between simulated and estimated variances, provided that the fitted model is capturing genetic complexity of a trait. We observe a reduction of genetic variance due to selection and drift changing allele frequencies, and due to selection inducing negative linkage-disequilibrium.
Collapse
Affiliation(s)
- Letícia A de C Lara
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, UK.
| | - Ivan Pocrnic
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, UK
| | - Thiago de P Oliveira
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, UK
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, UK
| |
Collapse
|
8
|
Powell O, Mrode R, Gaynor RC, Johnsson M, Gorjanc G, Hickey JM. Genomic evaluations using data recorded on smallholder dairy farms in low- to middle-income countries. JDS Communications 2021; 2:366-370. [PMID: 36337118 PMCID: PMC9623656 DOI: 10.3168/jdsc.2021-0092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 07/14/2021] [Indexed: 12/02/2022]
Abstract
Genomic evaluations outperformed pedigree-based genetic evaluations. Shared haplotypes captured "hidden" genetic relationships to strengthen connectedness in genomic evaluations. Genomic evaluations were possible using LMIC smallholder records from herds with ≤4 cows. . Modelling herd as a random effect produced EBVs with the highest accuracies.
Breeding has increased genetic gain for dairy cattle in advanced economies but has had limited success in improving dairy cattle in low- to middle-income countries (LMIC). Genetic evaluations are a central component of delivering genetic gain, because they separate the genetic and environmental effects of animals' phenotypes. Genetic evaluations have been successful in advanced economies because of large data sets and strong genetic connectedness, provided by the widespread use of artificial insemination (AI) and accurate recording of pedigree information. In smallholder dairy production systems of many LMICs, the limited use of AI and small herd sizes results in a data structure with insufficient genetic connectedness between herds to facilitate genetic evaluations based on pedigree. Genomic information keeps track of shared haplotypes rather than shared relatives captured by pedigree records. Therefore, genomic information could capture “hidden” genetic relationships, that are not captured by pedigree information, to strengthen genetic connectedness in LMIC smallholder dairy data sets. This study's objective was to use simulation to quantify the power of genomic information to enable genetic evaluation using LMIC smallholder dairy data sets. The results from this study show that (1) genetic evaluations using genomic information were more accurate than those using pedigree information in populations with a high effective population size and weak genetic connectedness; and (2) genetic evaluations modeling herd as a random effect had higher or equal accuracy than those modeling herd as a fixed effect. This demonstrates the potential of genomic information to be an enabling technology in LMIC smallholder dairy production systems by facilitating genetic evaluations with in situ records collected from herds of ≤4 cows. The establishment of routine genomic evaluations could allow the development of LMIC breeding programs comprising an informal set of nucleus animals distributed across many small herds within the target environment. These nucleus animals could be used for genetic evaluation, and the best animals could be disseminated to participating smallholder dairy farms. Together, this could increase the productivity, profitability, and sustainability of LMIC smallholder dairy production systems.
Collapse
Affiliation(s)
- Owen Powell
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
- Corresponding author
| | - Raphael Mrode
- Scotland's Rural College (SRUC), Easter Bush, Midlothian, EH25 9RG, United Kingdom
- International Livestock Research Institute (ILRI), Nairobi 00100, Kenya
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
| | - Martin Johnsson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07, Uppsala, Sweden
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG, United Kingdom
| |
Collapse
|
9
|
Abstract
This paper introduces AlphaSimR, an R package for stochastic simulations of plant and animal breeding programs. AlphaSimR is a highly flexible software package able to simulate a wide range of plant and animal breeding programs for diploid and autopolyploid species. AlphaSimR is ideal for testing the overall strategy and detailed design of breeding programs. AlphaSimR utilizes a scripting approach to building simulations that is particularly well suited for modeling highly complex breeding programs, such as commercial breeding programs. The primary benefit of this scripting approach is that it frees users from preset breeding program designs and allows them to model nearly any breeding program design. This paper lists the main features of AlphaSimR and provides a brief example simulation to show how to use the software.
Collapse
Affiliation(s)
- R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian EH25 9RG, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian EH25 9RG, UK
| |
Collapse
|
10
|
Gonen S, Wimmer V, Gaynor RC, Byrne E, Gorjanc G, Hickey JM. Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations. Crop Sci 2021; 61:2243-2253. [PMID: 34413534 PMCID: PMC8362159 DOI: 10.1002/csc2.20409] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 11/07/2020] [Indexed: 06/13/2023]
Abstract
This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low-density (Ld) genotyped descendants to high density (Hd). The extension was implemented as part of the AlphaPlantImpute software and works in three steps. First, it identifies whether a parent has no or Ld genotypes and identifies its relatives that have Hd genotypes. Second, using the Hd genotypes of relatives, it determines whether the parent is homozygous or heterozygous for a given locus. Third, it phases heterozygous positions of the parent by matching haplotypes to its relatives. We measured the accuracy (correlation between true and imputed genotypes) of imputing parent genotypes in simulated biparental populations from different scenarios. We tested the imputation accuracy of the missing parent's descendants using the true genotype of the parent and compared this with using the imputed genotypes of the parent. Across all scenarios, the imputation accuracy of a parent was >0.98 and did not drop below ∼0.96. The imputation accuracy of a parent was always higher when it was inbred than outbred. Including ancestors of the parent at Hd, increasing the number of crosses and the number of Hd descendants increased the imputation accuracy. The high imputation accuracy achieved for the parent translated to little or no impact on the imputation accuracy of its descendants.
Collapse
Affiliation(s)
- Serap Gonen
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEaster Bush Research CentreMidlothianEH25 9RGUK
| | | | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEaster Bush Research CentreMidlothianEH25 9RGUK
| | - Ed Byrne
- KWS‐UK Ltd56 Church StreetThriplowHertfordshireSG8 7REUK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEaster Bush Research CentreMidlothianEH25 9RGUK
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesUniversity of EdinburghEaster Bush Research CentreMidlothianEH25 9RGUK
| |
Collapse
|
11
|
Faber NR, McFarlane GR, Gaynor RC, Pocrnic I, Whitelaw CBA, Gorjanc G. Novel combination of CRISPR-based gene drives eliminates resistance and localises spread. Sci Rep 2021; 11:3719. [PMID: 33664305 PMCID: PMC7933345 DOI: 10.1038/s41598-021-83239-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 01/27/2021] [Indexed: 12/20/2022] Open
Abstract
Invasive species are among the major driving forces behind biodiversity loss. Gene drive technology may offer a humane, efficient and cost-effective method of control. For safe and effective deployment it is vital that a gene drive is both self-limiting and can overcome evolutionary resistance. We present HD-ClvR in this modelling study, a novel combination of CRISPR-based gene drives that eliminates resistance and localises spread. As a case study, we model HD-ClvR in the grey squirrel (Sciurus carolinensis), which is an invasive pest in the UK and responsible for both biodiversity and economic losses. HD-ClvR combats resistance allele formation by combining a homing gene drive with a cleave-and-rescue gene drive. The inclusion of a self-limiting daisyfield gene drive allows for controllable localisation based on animal supplementation. We use both randomly mating and spatial models to simulate this strategy. Our findings show that HD-ClvR could effectively control a targeted grey squirrel population, with little risk to other populations. HD-ClvR offers an efficient, self-limiting and controllable gene drive for managing invasive pests.
Collapse
Affiliation(s)
- Nicky R Faber
- Highlander Lab, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK. .,Laboratory of Genetics, Wageningen University and Research, Droevendaalsesteeg 1, 6708PB, Wageningen, The Netherlands.
| | - Gus R McFarlane
- Whitelaw Group, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - R Chris Gaynor
- AlphaGenes Group, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Ivan Pocrnic
- Highlander Lab, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - C Bruce A Whitelaw
- Whitelaw Group, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| | - Gregor Gorjanc
- Highlander Lab, The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian, EH25 9RG, UK
| |
Collapse
|
12
|
Bančič J, Werner CR, Gaynor RC, Gorjanc G, Odeny DA, Ojulong HF, Dawson IK, Hoad SP, Hickey JM. Modeling Illustrates That Genomic Selection Provides New Opportunities for Intercrop Breeding. Front Plant Sci 2021; 12:605172. [PMID: 33633761 PMCID: PMC7902002 DOI: 10.3389/fpls.2021.605172] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 01/05/2021] [Indexed: 05/14/2023]
Abstract
Intercrop breeding programs using genomic selection can produce faster genetic gain than intercrop breeding programs using phenotypic selection. Intercropping is an agricultural practice in which two or more component crops are grown together. It can lead to enhanced soil structure and fertility, improved weed suppression, and better control of pests and diseases. Especially in subsistence agriculture, intercropping has great potential to optimize farming and increase profitability. However, breeding for intercrop varieties is complex as it requires simultaneous improvement of two or more component crops that combine well in the field. We hypothesize that genomic selection can significantly simplify and accelerate the process of breeding crops for intercropping. Therefore, we used stochastic simulation to compare four different intercrop breeding programs implementing genomic selection and an intercrop breeding program entirely based on phenotypic selection. We assumed three different levels of genetic correlation between monocrop grain yield and intercrop grain yield to investigate how the different breeding strategies are impacted by this factor. We found that all four simulated breeding programs using genomic selection produced significantly more intercrop genetic gain than the phenotypic selection program regardless of the genetic correlation with monocrop yield. We suggest a genomic selection strategy which combines monocrop and intercrop trait information to predict general intercropping ability to increase selection accuracy in the early stages of a breeding program and to minimize the generation interval.
Collapse
Affiliation(s)
- Jon Bančič
- Royal (Dick) School of Veterinary Studies, The Roslin Institute, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
- Scotland’s Rural College (SRUC), Edinburgh, United Kingdom
| | - Christian R. Werner
- Royal (Dick) School of Veterinary Studies, The Roslin Institute, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - R. Chris Gaynor
- Royal (Dick) School of Veterinary Studies, The Roslin Institute, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - Gregor Gorjanc
- Royal (Dick) School of Veterinary Studies, The Roslin Institute, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - Damaris A. Odeny
- International Crops Research Institute for the Semi-Arid Tropics, ICRAF House, Nairobi, Kenya
| | - Henry F. Ojulong
- International Crops Research Institute for the Semi-Arid Tropics, ICRAF House, Nairobi, Kenya
| | - Ian K. Dawson
- Scotland’s Rural College (SRUC), Edinburgh, United Kingdom
| | | | - John M. Hickey
- Royal (Dick) School of Veterinary Studies, The Roslin Institute, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| |
Collapse
|
13
|
Werner CR, Gaynor RC, Gorjanc G, Hickey JM, Kox T, Abbadi A, Leckband G, Snowdon RJ, Stahl A. How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding. Front Plant Sci 2020; 11:592977. [PMID: 33391305 PMCID: PMC7772221 DOI: 10.3389/fpls.2020.592977] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Accepted: 11/24/2020] [Indexed: 05/27/2023]
Abstract
Over the last two decades, the application of genomic selection has been extensively studied in various crop species, and it has become a common practice to report prediction accuracies using cross validation. However, genomic prediction accuracies obtained from random cross validation can be strongly inflated due to population or family structure, a characteristic shared by many breeding populations. An understanding of the effect of population and family structure on prediction accuracy is essential for the successful application of genomic selection in plant breeding programs. The objective of this study was to make this effect and its implications for practical breeding programs comprehensible for breeders and scientists with a limited background in quantitative genetics and genomic selection theory. We, therefore, compared genomic prediction accuracies obtained from different random cross validation approaches and within-family prediction in three different prediction scenarios. We used a highly structured population of 940 Brassica napus hybrids coming from 46 testcross families and two subpopulations. Our demonstrations show how genomic prediction accuracies obtained from among-family predictions in random cross validation and within-family predictions capture different measures of prediction accuracy. While among-family prediction accuracy measures prediction accuracy of both the parent average component and the Mendelian sampling term, within-family prediction only measures how accurately the Mendelian sampling term can be predicted. With this paper we aim to foster a critical approach to different measures of genomic prediction accuracy and a careful analysis of values observed in genomic selection experiments and reported in literature.
Collapse
Affiliation(s)
- Christian R. Werner
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian, United Kingdom
| | | | | | | | - Rod J. Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
| | - Andreas Stahl
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany
- Julius Kuehn Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Resistance Research and Stress Tolerance, Quedlinburg, Germany
| |
Collapse
|
14
|
Edwards SM, Buntjer JB, Jackson R, Bentley AR, Lage J, Byrne E, Burt C, Jack P, Berry S, Flatman E, Poupard B, Smith S, Hayes C, Gaynor RC, Gorjanc G, Howell P, Ober E, Mackay IJ, Hickey JM. The effects of training population design on genomic prediction accuracy in wheat. Theor Appl Genet 2019; 132:1943-1952. [PMID: 30888431 PMCID: PMC6588656 DOI: 10.1007/s00122-019-03327-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Accepted: 03/11/2019] [Indexed: 05/22/2023]
Abstract
Genomic selection offers several routes for increasing the genetic gain or efficiency of plant breeding programmes. In various species of livestock, there is empirical evidence of increased rates of genetic gain from the use of genomic selection to target different aspects of the breeder's equation. Accurate predictions of genomic breeding value are central to this, and the design of training sets is in turn central to achieving sufficient levels of accuracy. In summary, small numbers of close relatives and very large numbers of distant relatives are expected to enable predictions with higher accuracy. To quantify the effect of some of the properties of training sets on the accuracy of genomic selection in crops, we performed an extensive field-based winter wheat trial. In summary, this trial involved the construction of 44 F2:4 bi- and tri-parental populations, from which 2992 lines were grown on four field locations and yield was measured. For each line, genotype data were generated for 25 K segregating SNP markers. The overall heritability of yield was estimated to 0.65, and estimates within individual families ranged between 0.10 and 0.85. Genomic prediction accuracies of yield BLUEs were 0.125-0.127 using two different cross-validation approaches and generally increased with training set size. Using related crosses in training and validation sets generally resulted in higher prediction accuracies than using unrelated crosses. The results of this study emphasise the importance of the training panel design in relation to the genetic material to which the resulting prediction model is to be applied.
Collapse
Affiliation(s)
- Stefan McKinnon Edwards
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Jaap B Buntjer
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Robert Jackson
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge, CB3 0LE, UK
| | - Alison R Bentley
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge, CB3 0LE, UK
| | - Jacob Lage
- KWS UK Ltd, 56 Church Street, Hertfordshire, SG8 7RE, UK
| | - Ed Byrne
- KWS UK Ltd, 56 Church Street, Hertfordshire, SG8 7RE, UK
| | - Chris Burt
- RAGT UK, Grange Rd, Saffron Walden, CB10 1TA, UK
| | - Peter Jack
- RAGT UK, Grange Rd, Saffron Walden, CB10 1TA, UK
| | - Simon Berry
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, LN7 6DT, UK
| | - Edward Flatman
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, LN7 6DT, UK
| | - Bruno Poupard
- Limagrain UK Ltd, Rothwell, Market Rasen, Lincolnshire, LN7 6DT, UK
| | - Stephen Smith
- Elsoms Wheat Limited, Pinchbeck Road, Spalding, Linconshire, PE11 1QG, UK
| | - Charlotte Hayes
- Elsoms Wheat Limited, Pinchbeck Road, Spalding, Linconshire, PE11 1QG, UK
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK
| | - Phil Howell
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge, CB3 0LE, UK
| | - Eric Ober
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge, CB3 0LE, UK
| | | | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush, Midlothian, Scotland, UK.
| |
Collapse
|
15
|
Johnsson M, Gaynor RC, Jenko J, Gorjanc G, de Koning DJ, Hickey JM. Removal of alleles by genome editing (RAGE) against deleterious load. Genet Sel Evol 2019; 51:14. [PMID: 30995904 PMCID: PMC6472060 DOI: 10.1186/s12711-019-0456-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 04/01/2019] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND In this paper, we simulate deleterious load in an animal breeding program, and compare the efficiency of genome editing and selection for decreasing it. Deleterious variants can be identified by bioinformatics screening methods that use sequence conservation and biological prior information about protein function. However, once deleterious variants have been identified, how can they be used in breeding? RESULTS We simulated a closed animal breeding population that is subject to both natural selection against deleterious load and artificial selection for a quantitative trait representing the breeding goal. Deleterious load was polygenic and was due to either codominant or recessive variants. We compared strategies for removal of deleterious alleles by genome editing (RAGE) to selection against carriers. When deleterious variants were codominant, the best strategy for prioritizing variants was to prioritize low-frequency variants. When deleterious variants were recessive, the best strategy was to prioritize variants with an intermediate frequency. Selection against carriers was inefficient when variants were codominant, but comparable to editing one variant per sire when variants were recessive. CONCLUSIONS Genome editing of deleterious alleles reduces deleterious load, but requires the simultaneous editing of multiple deleterious variants in the same sire to be effective when deleterious variants are recessive. In the short term, selection against carriers is a possible alternative to genome editing when variants are recessive. Our results suggest that, in the future, there is the potential to use RAGE against deleterious load in animal breeding.
Collapse
Affiliation(s)
- Martin Johnsson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG Scotland UK
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07 Uppsala, Sweden
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG Scotland UK
| | - Janez Jenko
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG Scotland UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG Scotland UK
| | - Dirk-Jan de Koning
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7023, 750 07 Uppsala, Sweden
| | - John M. Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG Scotland UK
| |
Collapse
|
16
|
Gonen S, Wimmer V, Gaynor RC, Byrne E, Gorjanc G, Hickey JM. A heuristic method for fast and accurate phasing and imputation of single-nucleotide polymorphism data in bi-parental plant populations. Theor Appl Genet 2018; 131:2345-2357. [PMID: 30078163 PMCID: PMC6208939 DOI: 10.1007/s00122-018-3156-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Accepted: 07/31/2018] [Indexed: 05/02/2023]
Abstract
Key message New fast and accurate method for phasing and imputation of SNP chip genotypes within diploid bi-parental plant populations. This paper presents a new heuristic method for phasing and imputation of genomic data in diploid plant species. Our method, called AlphaPlantImpute, explicitly leverages features of plant breeding programmes to maximise the accuracy of imputation. The features are a small number of parents, which can be inbred and usually have high-density genomic data, and few recombinations separating parents and focal individuals genotyped at low density (i.e. descendants that are the imputation targets). AlphaPlantImpute works roughly in three steps. First, it identifies informative low-density genotype markers in parents. Second, it tracks the inheritance of parental alleles and haplotypes to focal individuals at informative markers. Finally, it uses this low-density information as anchor points to impute focal individuals to high density. We tested the imputation accuracy of AlphaPlantImpute in simulated bi-parental populations across different scenarios. We also compared its accuracy to existing software called PlantImpute. In general, AlphaPlantImpute had better or equal imputation accuracy as PlantImpute. The computational time and memory requirements of AlphaPlantImpute were tiny compared to PlantImpute. For example, accuracy of imputation was 0.96 for a scenario where both parents were inbred and genotyped at 25,000 markers per chromosome and a focal F2 individual was genotyped with 50 markers per chromosome. The maximum memory requirement for this scenario was 0.08 GB and took 37 s to complete.
Collapse
Affiliation(s)
- Serap Gonen
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | | | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - Ed Byrne
- KWS-UK Ltd, 56 Church Street, Thriplow, Hertfordshire, SG8 7RE, UK
| | - Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
| |
Collapse
|
17
|
Gorjanc G, Gaynor RC, Hickey JM. Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection. Theor Appl Genet 2018; 131:1953-1966. [PMID: 29876589 PMCID: PMC6096640 DOI: 10.1007/s00122-018-3125-3] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 05/28/2018] [Indexed: 05/18/2023]
Abstract
Key message Optimal cross selection increases long-term genetic gain of two-part programs with rapid recurrent genomic selection. It achieves this by optimising efficiency of converting genetic diversity into genetic gain through reducing the loss of genetic diversity and reducing the drop of genomic prediction accuracy with rapid cycling. This study evaluates optimal cross selection to balance selection and maintenance of genetic diversity in two-part plant breeding programs with rapid recurrent genomic selection. The two-part program reorganises a conventional breeding program into a population improvement component with recurrent genomic selection to increase the mean value of germplasm and a product development component with standard methods to develop new lines. Rapid recurrent genomic selection has a large potential, but is challenging due to genotyping costs or genetic drift. Here we simulate a wheat breeding program for 20 years and compare optimal cross selection against truncation selection in the population improvement component with one to six cycles per year. With truncation selection we crossed a small or a large number of parents. With optimal cross selection we jointly optimised selection, maintenance of genetic diversity, and cross allocation with AlphaMate program. The results show that the two-part program with optimal cross selection delivered the largest genetic gain that increased with the increasing number of cycles. With four cycles per year optimal cross selection had 78% (15%) higher long-term genetic gain than truncation selection with a small (large) number of parents. Higher genetic gain was achieved through higher efficiency of converting genetic diversity into genetic gain; optimal cross selection quadrupled (doubled) efficiency of truncation selection with a small (large) number of parents. Optimal cross selection also reduced the drop of genomic selection accuracy due to the drift between training and prediction populations. In conclusion optimal cross selection enables optimal management and exploitation of population improvement germplasm in two-part programs.
Collapse
Affiliation(s)
- Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| |
Collapse
|
18
|
Gorjanc G, Gaynor RC, Hickey JM. Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection. Theor Appl Genet 2018. [PMID: 29876589 DOI: 10.1101/227215] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Key message Optimal cross selection increases long-term genetic gain of two-part programs with rapid recurrent genomic selection. It achieves this by optimising efficiency of converting genetic diversity into genetic gain through reducing the loss of genetic diversity and reducing the drop of genomic prediction accuracy with rapid cycling. This study evaluates optimal cross selection to balance selection and maintenance of genetic diversity in two-part plant breeding programs with rapid recurrent genomic selection. The two-part program reorganises a conventional breeding program into a population improvement component with recurrent genomic selection to increase the mean value of germplasm and a product development component with standard methods to develop new lines. Rapid recurrent genomic selection has a large potential, but is challenging due to genotyping costs or genetic drift. Here we simulate a wheat breeding program for 20 years and compare optimal cross selection against truncation selection in the population improvement component with one to six cycles per year. With truncation selection we crossed a small or a large number of parents. With optimal cross selection we jointly optimised selection, maintenance of genetic diversity, and cross allocation with AlphaMate program. The results show that the two-part program with optimal cross selection delivered the largest genetic gain that increased with the increasing number of cycles. With four cycles per year optimal cross selection had 78% (15%) higher long-term genetic gain than truncation selection with a small (large) number of parents. Higher genetic gain was achieved through higher efficiency of converting genetic diversity into genetic gain; optimal cross selection quadrupled (doubled) efficiency of truncation selection with a small (large) number of parents. Optimal cross selection also reduced the drop of genomic selection accuracy due to the drift between training and prediction populations. In conclusion optimal cross selection enables optimal management and exploitation of population improvement germplasm in two-part programs.
Collapse
Affiliation(s)
- Gregor Gorjanc
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK.
| | - R Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - John M Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, Easter Bush Research Centre, University of Edinburgh, Midlothian, EH25 9RG, UK
| |
Collapse
|
19
|
Jarquín D, Lemes da Silva C, Gaynor RC, Poland J, Fritz A, Howard R, Battenfield S, Crossa J. Increasing Genomic-Enabled Prediction Accuracy by Modeling Genotype × Environment Interactions in Kansas Wheat. Plant Genome 2017; 10. [PMID: 28724062 DOI: 10.3835/plantgenome2016.12.0130] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Wheat ( L.) breeding programs test experimental lines in multiple locations over multiple years to get an accurate assessment of grain yield and yield stability. Selections in early generations of the breeding pipeline are based on information from only one or few locations and thus materials are advanced with little knowledge of the genotype × environment interaction (G × E) effects. Later, large trials are conducted in several locations to assess the performance of more advanced lines across environments. Genomic selection (GS) models that include G × E covariates allow us to borrow information not only from related materials, but also from historical and correlated environments to better predict performance within and across specific environments. We used reaction norm models with several cross-validation schemes to demonstrate the increased breeding efficiency of Kansas State University's hard red winter wheat breeding program. The GS reaction norm models line effect (L) + environment effect (E), L + E + genotype environment (G), and L + E + G + (G × E) effects) showed high accuracy values (>0.4) when predicting the yield performance in untested environments, sites or both. The GS model L + E + G + (G × E) presented the highest prediction ability ( = 0.54) when predicting yield in incomplete field trials for locations with a moderate number of lines. The difficulty of predicting future years (forward prediction) is indicated by the relatively low accuracy ( = 0.171) seen even when environments with 300+ lines were included.
Collapse
|
20
|
Ladejobi O, Elderfield J, Gardner KA, Gaynor RC, Hickey J, Hibberd JM, Mackay IJ, Bentley AR. Maximizing the potential of multi-parental crop populations. Appl Transl Genom 2016; 11:9-17. [PMID: 28018845 PMCID: PMC5167364 DOI: 10.1016/j.atg.2016.10.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 10/22/2016] [Accepted: 10/24/2016] [Indexed: 11/03/2022]
Abstract
Most agriculturally significant crop traits are quantitatively inherited which limits the ease and efficiency of trait dissection. Multi-parent populations overcome the limitations of traditional trait mapping and offer new potential to accurately define the genetic basis of complex crop traits. The increasing popularity and use of nested association mapping (NAM) and multi-parent advanced generation intercross (MAGIC) populations raises questions about the optimal design and allocation of resources in their creation. In this paper we review strategies for the creation of multi-parent populations and describe two complementary in silico studies addressing the design and construction of NAM and MAGIC populations. The first simulates the selection of diverse founder parents and the second the influence of multi-parent crossing schemes (and number of founders) on haplotype creation and diversity. We present and apply two open software resources to simulate alternate strategies for the development of multi-parent populations.
Collapse
Affiliation(s)
- Olufunmilayo Ladejobi
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge CB3 0LE, United Kingdom
- Department of Plant Sciences, The University of Cambridge, Downing Street, Cambridge CB2 3EA, United Kingdom
| | - James Elderfield
- Department of Plant Sciences, The University of Cambridge, Downing Street, Cambridge CB2 3EA, United Kingdom
| | - Keith A. Gardner
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge CB3 0LE, United Kingdom
| | - R. Chris Gaynor
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian EH25 9RG, United Kingdom
| | - John Hickey
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush Research Centre, Midlothian EH25 9RG, United Kingdom
| | - Julian M. Hibberd
- Department of Plant Sciences, The University of Cambridge, Downing Street, Cambridge CB2 3EA, United Kingdom
| | - Ian J. Mackay
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge CB3 0LE, United Kingdom
| | - Alison R. Bentley
- The John Bingham Laboratory, NIAB, Huntingdon Road, Cambridge CB3 0LE, United Kingdom
| |
Collapse
|
21
|
Faux AM, Gorjanc G, Gaynor RC, Battagin M, Edwards SM, Wilson DL, Hearne SJ, Gonen S, Hickey JM. AlphaSim: Software for Breeding Program Simulation. Plant Genome 2016; 9. [PMID: 27902803 DOI: 10.3835/plantgenome2016.02.0013] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
This paper describes AlphaSim, a software package for simulating plant and animal breeding programs. AlphaSim enables the simulation of multiple aspects of breeding programs with a high degree of flexibility. AlphaSim simulates breeding programs in a series of steps: (i) simulate haplotype sequences and pedigree; (ii) drop haplotypes into the base generation of the pedigree and select single-nucleotide polymorphism (SNP) and quantitative trait nucleotide (QTN); (iii) assign QTN effects, calculate genetic values, and simulate phenotypes; (iv) drop haplotypes into the burn-in generations; and (v) perform selection and simulate new generations. The program is flexible in terms of historical population structure and diversity, recent pedigree structure, trait architecture, and selection strategy. It integrates biotechnologies such as doubled-haploids (DHs) and gene editing and allows the user to simulate multiple traits and multiple environments, specify recombination hot spots and cold spots, specify gene jungles and deserts, perform genomic predictions, and apply optimal contribution selection. AlphaSim also includes restart functionalities, which increase its flexibility by allowing the simulation process to be paused so that the parameters can be changed or to import an externally created pedigree, trial design, or results of an analysis of previously simulated data. By combining the options, a user can simulate simple or complex breeding programs with several generations, variable population structures and variable breeding decisions over time. In conclusion, AlphaSim is a flexible and computationally efficient software package to simulate biotechnology enhanced breeding programs with the aim of performing rapid, low-cost, and objective in silico comparison of breeding technologies.
Collapse
|
22
|
Battenfield SD, Guzmán C, Gaynor RC, Singh RP, Peña RJ, Dreisigacker S, Fritz AK, Poland JA. Genomic Selection for Processing and End-Use Quality Traits in the CIMMYT Spring Bread Wheat Breeding Program. Plant Genome 2016; 9. [PMID: 27898810 DOI: 10.3835/plantgenome2016.01.0005] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Wheat ( L.) cultivars must possess suitable end-use quality for release and consumer acceptability. However, breeding for quality traits is often considered a secondary target relative to yield largely because of amount of seed needed and expense. Without testing and selection, many undesirable materials are advanced, expending additional resources. Here, we develop and validate whole-genome prediction models for end-use quality phenotypes in the CIMMYT bread wheat breeding program. Model accuracy was tested using forward prediction on breeding lines ( = 5520) tested in unbalanced yield trials from 2009 to 2015 at Ciudad Obregon, Sonora, Mexico. Quality parameters included test weight, 1000-kernel weight, hardness, grain and flour protein, flour yield, sodium dodecyl sulfate sedimentation, Mixograph and Alveograph performance, and loaf volume. In general, prediction accuracy substantially increased over time as more data was available to train the model. Reflecting practical implementation of genomic selection (GS) in the breeding program, forward prediction accuracies () for quality parameters were assessed in 2015 and ranged from 0.32 (grain hardness) to 0.62 (mixing time). Increased selection intensity was possible with GS since more entries can be genotyped than phenotyped and expected genetic gain was 1.4 to 2.7 times higher across all traits than phenotypic selection. Given the limitations in measuring many lines for quality, we conclude that GS is a powerful tool to facilitate early generation selection for end-use quality in wheat, leaving larger populations for selection on yield during advanced testing and leading to better gain for both quality and yield in bread wheat breeding programs.
Collapse
|