1
|
Alemu A, Åstrand J, Montesinos-López OA, Isidro Y Sánchez J, Fernández-Gónzalez J, Tadesse W, Vetukuri RR, Carlsson AS, Ceplitis A, Crossa J, Ortiz R, Chawade A. Genomic selection in plant breeding: Key factors shaping two decades of progress. MOLECULAR PLANT 2024; 17:552-578. [PMID: 38475993 DOI: 10.1016/j.molp.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/22/2024] [Accepted: 03/08/2024] [Indexed: 03/14/2024]
Abstract
Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are the other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP-theoretically reaching one when using the Pearson's correlation as a metric-is an active research area as yet far from optimal for various traits. We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets, effective training population optimization methods and support from other omics approaches (transcriptomics, metabolomics and proteomics) coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Johanna Åstrand
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden; Lantmännen Lantbruk, Svalöv, Sweden
| | | | - Julio Isidro Y Sánchez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Javier Fernández-Gónzalez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Wuletaw Tadesse
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
| | - Ramesh R Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Anders S Carlsson
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco, México 52640, Mexico
| | - Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| |
Collapse
|
2
|
Mora-Poblete F, Ballesta P, Lobos GA, Molina-Montenegro M, Gleadow R, Ahmar S, Jiménez-Aspee F. Genome-wide association study of cyanogenic glycosides, proline, sugars, and pigments in Eucalyptus cladocalyx after 18 consecutive dry summers. PHYSIOLOGIA PLANTARUM 2021; 172:1550-1569. [PMID: 33511661 DOI: 10.1111/ppl.13349] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 01/07/2021] [Accepted: 01/20/2021] [Indexed: 06/12/2023]
Abstract
Natural variation of cyanogenic glycosides, soluble sugars, proline, and nondestructive optical sensing of pigments (chlorophyll, flavonols, and anthocyanins) was examined in ex situ natural populations of Eucalyptus cladocalyx F. Muell. grown under dry environmental conditions in the southern Atacama Desert, Chile. After 18 consecutive dry seasons, considerable plant-to-plant phenotypic variation for all the traits was observed in the field. For example, leaf hydrogen cyanide (HCN) concentrations varied from 0 (two acyanogenic individuals) to 1.54 mg cyanide g-1 DW. Subsequent genome-wide association study revealed associations with several genes with a known function in plants. HCN content was associated robustly with genes encoding Cytochrome P450 proteins, and with genes involved in the detoxification mechanism of HCN in cells (β-cyanoalanine synthase and cyanoalanine nitrilase). Another important finding was that sugars, proline, and pigment content were linked to genes involved in transport, biosynthesis, and/or catabolism. Estimates of genomic heritability (based on haplotypes) ranged between 0.46 and 0.84 (HCN and proline content, respectively). Proline and soluble sugars had the highest predictive ability of genomic prediction models (PA = 0.65 and PA = 0.71, respectively). PA values for HCN content and flavonols were relatively moderate, with estimates ranging from 0.44 to 0.50. These findings provide new understanding on the genetic architecture of cyanogenic capacity, and other key complex traits in cyanogenic E. cladocalyx.
Collapse
Affiliation(s)
| | - Paulina Ballesta
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
| | - Gustavo A Lobos
- Plant Breeding and Phenomic Center, Faculty of Agricultural Sciences, Universidad de Talca, Talca, Chile
| | - Marco Molina-Montenegro
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
- Centro de Estudios Avanzados en Zonas Áridas (CEAZA), Facultad de Ciencias del Mar, Universidad Católica del Norte, Coquimbo, Chile
| | - Roslyn Gleadow
- School of Biological Sciences, Monash University, Melbourne, Victoria, Australia
| | - Sunny Ahmar
- Institute of Biological Sciences, Universidad de Talca, Talca, Chile
- College of Plant Sciences and Technology, Huazhong Agricultural University, Wuhan, China
| | - Felipe Jiménez-Aspee
- Department of Food Biofunctionality, Institute of Nutritional Sciences, University of Hohenheim, Stuttgart, Germany
- Departamento de Ciencias Básicas Biomédicas, Facultad de Ciencias de la Salud, Universidad de Talca, Talca, Chile
| |
Collapse
|
3
|
Ballesta P, Bush D, Silva FF, Mora F. Genomic Predictions Using Low-Density SNP Markers, Pedigree and GWAS Information: A Case Study with the Non-Model Species Eucalyptus cladocalyx. PLANTS (BASEL, SWITZERLAND) 2020; 9:E99. [PMID: 31941085 PMCID: PMC7020392 DOI: 10.3390/plants9010099] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Revised: 12/20/2019] [Accepted: 01/09/2020] [Indexed: 11/16/2022]
Abstract
High-throughput genotyping techniques have enabled large-scale genomic analysis to precisely predict complex traits in many plant species. However, not all species can be well represented in commercial SNP (single nucleotide polymorphism) arrays. In this study, a high-density SNP array (60 K) developed for commercial Eucalyptus was used to genotype a breeding population of Eucalyptus cladocalyx, yielding only ~3.9 K informative SNPs. Traditional Bayesian genomic models were investigated to predict flowering, stem quality and growth traits by considering the following effects: (i) polygenic background and all informative markers (GS model) and (ii) polygenic background, QTL-genotype effects (determined by GWAS) and SNP markers that were not associated with any trait (GSq model). The estimates of pedigree-based heritability and genomic heritability varied from 0.08 to 0.34 and 0.002 to 0.5, respectively, whereas the predictive ability varied from 0.19 (GS) and 0.45 (GSq). The GSq approach outperformed GS models in terms of predictive ability when the proportion of the variance explained by the significant marker-trait associations was higher than those explained by the polygenic background and non-significant markers. This approach can be particularly useful for plant/tree species poorly represented in the high-density SNP arrays, developed for economically important species, or when high-density marker panels are not available.
Collapse
Affiliation(s)
- Paulina Ballesta
- Institute of Biological Sciences, University of Talca, 2 Norte 685, Talca 3460000, Chile;
| | - David Bush
- CSIRO–Australian Tree Seed Centre, Acton 2601, Australia;
| | - Fabyano Fonseca Silva
- Department of Animal Science, Universidade Federal de Viçosa, Viçosa 36570-900, Brazil;
| | - Freddy Mora
- Institute of Biological Sciences, University of Talca, 2 Norte 685, Talca 3460000, Chile;
| |
Collapse
|