1
|
Nascimento M, Nascimento ACC, Azevedo CF, de Oliveira ACB, Caixeta ET, Jarquin D. Enhancing genomic prediction with Stacking Ensemble Learning in Arabica Coffee. FRONTIERS IN PLANT SCIENCE 2024; 15:1373318. [PMID: 39086911 PMCID: PMC11288849 DOI: 10.3389/fpls.2024.1373318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 06/12/2024] [Indexed: 08/02/2024]
Abstract
Coffee Breeding programs have traditionally relied on observing plant characteristics over years, a slow and costly process. Genomic selection (GS) offers a DNA-based alternative for faster selection of superior cultivars. Stacking Ensemble Learning (SEL) combines multiple models for potentially even more accurate selection. This study explores SEL potential in coffee breeding, aiming to improve prediction accuracy for important traits [yield (YL), total number of the fruits (NF), leaf miner infestation (LM), and cercosporiosis incidence (Cer)] in Coffea Arabica. We analyzed data from 195 individuals genotyped for 21,211 single-nucleotide polymorphism (SNP) markers. To comprehensively assess model performance, we employed a cross-validation (CV) scheme. Genomic Best Linear Unbiased Prediction (GBLUP), multivariate adaptive regression splines (MARS), Quantile Random Forest (QRF), and Random Forest (RF) served as base learners. For the meta-learner within the SEL framework, various options were explored, including Ridge Regression, RF, GBLUP, and Single Average. The SEL method was able to predict the predictive ability (PA) of important traits in Coffea Arabica. SEL presented higher PA compared with those obtained for all base learner methods. The gains in PA in relation to GBLUP were 87.44% (the ratio between the PA obtained from best Stacking model and the GBLUP), 37.83%, 199.82%, and 14.59% for YL, NF, LM and Cer, respectively. Overall, SEL presents a promising approach for GS. By combining predictions from multiple models, SEL can potentially enhance the PA of GS for complex traits.
Collapse
Affiliation(s)
- Moyses Nascimento
- Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil
- Agronomy Department, University of Florida, Gainesville, FL, United States
| | - Ana Carolina Campana Nascimento
- Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil
- Agronomy Department, University of Florida, Gainesville, FL, United States
| | - Camila Ferreira Azevedo
- Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa, Brazil
| | | | | | - Diego Jarquin
- Agronomy Department, University of Florida, Gainesville, FL, United States
| |
Collapse
|
2
|
da Silva RA, Caixeta ET, Silva LDF, Sousa TV, Barreiros PRRM, Oliveira ACBD, Pereira AA, Barreto CAV, Nascimento M. Identification of SNP Markers and Candidate Genes Associated with Major Agronomic Traits in Coffea arabica. PLANTS (BASEL, SWITZERLAND) 2024; 13:1876. [PMID: 38999716 PMCID: PMC11243787 DOI: 10.3390/plants13131876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 06/30/2024] [Accepted: 07/03/2024] [Indexed: 07/14/2024]
Abstract
Genome-wide association studies (GWASs) allow for inferences about the relationships between genomic variants and phenotypic traits in natural or breeding populations. However, few have used this methodology in Coffea arabica. We aimed to identify chromosomal regions with significant associations between SNP markers and agronomic traits in C. arabica. We used a coffee panel consisting of 195 plants derived from 13 families in F2 generations and backcrosses of crosses between leaf rust-susceptible and -resistant genotypes. The plants were phenotyped for 18 agronomic markers and genotyped for 21,211 SNP markers. A GWAS enabled the identification of 110 SNPs with significant associations (p < 0.05) for several agronomic traits in C. arabica: plant height, plagiotropic branch length, number of vegetative nodes, canopy diameter, fruit size, cercosporiosis incidence, and rust incidence. The effects of each SNP marker associated with the traits were analyzed, such that they can be used for molecular marker-assisted selection. For the first time, a GWAS was used for these important agronomic traits in C. arabica, enabling applications in accelerated coffee breeding through marker-assisted selection and ensuring greater efficiency and time reduction. Furthermore, our findings provide preliminary knowledge to further confirm the genomic loci and potential candidate genes contributing to various structural and disease-related traits of C. arabica.
Collapse
Affiliation(s)
- Ruane Alice da Silva
- Biotechnology Applied to Agriculture Institute (Bioagro), Federal University of Viçosa (UFV), Viçosa 36570-900, Brazil
| | - Eveline Teixeira Caixeta
- Biotechnology Applied to Agriculture Institute (Bioagro), Federal University of Viçosa (UFV), Viçosa 36570-900, Brazil
- Embrapa Coffee, Brazilian Agricultural Research Corporation (Embrapa), Brasília 70770-901, Brazil
| | - Letícia de Faria Silva
- Biotechnology Applied to Agriculture Institute (Bioagro), Federal University of Viçosa (UFV), Viçosa 36570-900, Brazil
| | - Tiago Vieira Sousa
- Biological Sciences Center, Iturama University Campus, Universidade Federal do Triângulo Mineiro (UFTM), Iturama 38025-180, Brazil
| | | | - Antonio Carlos Baião de Oliveira
- Embrapa Coffee, Brazilian Agricultural Research Corporation (Embrapa), Brasília 70770-901, Brazil
- Agricultural Research Company of Minas Gerais (EPAMIG), Viçosa 36571-000, Brazil
| | | | - Cynthia Aparecida Valiati Barreto
- Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa 36570-900, Brazil
| | - Moysés Nascimento
- Laboratory of Intelligence Computational and Statistical Learning (LICAE), Department of Statistics, Federal University of Viçosa, Viçosa 36570-900, Brazil
| |
Collapse
|
3
|
Bose S, Banerjee S, Kumar S, Saha A, Nandy D, Hazra S. Review of applications of artificial intelligence (AI) methods in crop research. J Appl Genet 2024; 65:225-240. [PMID: 38216788 DOI: 10.1007/s13353-023-00826-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 12/23/2023] [Accepted: 12/26/2023] [Indexed: 01/14/2024]
Abstract
Sophisticated and modern crop improvement techniques can bridge the gap for feeding the ever-increasing population. Artificial intelligence (AI) refers to the simulation of human intelligence in machines, which refers to the application of computational algorithms, machine learning (ML) and deep learning (DL) techniques. This is aimed to generalise patterns and relationships from historical data, employing various mathematical optimisation techniques thus making prediction models for facilitating selection of superior genotypes. These techniques are less resource intensive and can solve the problem based on the analysis of large-scale phenotypic datasets. ML for genomic selection (GS) uses high-throughput genotyping technologies to gather genetic information on a large number of markers across the genome. The prediction of GS models is based on the mathematical relation between genotypic and phenotypic data from the training population. ML techniques have emerged as powerful tools for genome editing through analysing large-scale genomic data and facilitating the development of accurate prediction models. Precise phenotyping is a prerequisite to advance crop breeding for solving agricultural production-related issues. ML algorithms can solve this problem through generating predictive models, based on the analysis of large-scale phenotypic datasets. DL models also have the potential reliability of precise phenotyping. This review provides a comprehensive overview on various ML and DL models, their applications, potential to enhance the efficiency, specificity and safety towards advanced crop improvement protocols such as genomic selection, genome editing, along with phenotypic prediction to promote accelerated breeding.
Collapse
Affiliation(s)
- Suvojit Bose
- Department of Vegetables and Spice Crops, Uttar Banga Krishi Viswavidyalaya, Pundibari, Cooch Behar, 736165, West Bengal, India
| | | | - Soumya Kumar
- School of Agricultural Sciences, JIS University, Kolkata, 700109, West Bengal, India
| | - Akash Saha
- School of Agricultural Sciences, JIS University, Kolkata, 700109, West Bengal, India
| | - Debalina Nandy
- School of Agricultural Sciences, JIS University, Kolkata, 700109, West Bengal, India
| | - Soham Hazra
- Department of Agriculture, Brainware University, Barasat, 700125, West Bengal, India.
| |
Collapse
|
4
|
Ferrão MAG, da Fonseca AFA, Volpi PS, de Souza LC, Comério M, Filho ACV, Riva-Souza EM, Munoz PR, Ferrão RG, Ferrão LFV. Genomic-assisted breeding for climate-smart coffee. THE PLANT GENOME 2024; 17:e20321. [PMID: 36946358 DOI: 10.1002/tpg2.20321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 01/25/2023] [Accepted: 02/12/2023] [Indexed: 06/18/2023]
Abstract
Coffee is a universal beverage that drives a multi-industry market on a global basis. Today, the sustainability of coffee production is threatened by accelerated climate changes. In this work, we propose the implementation of genomic-assisted breeding for climate-smart coffee in Coffea canephora. This species is adapted to higher temperatures and is more resilient to biotic and abiotic stresses. After evaluating two populations, over multiple harvests, and under severe drought weather condition, we dissected the genetic architecture of yield, disease resistance, and quality-related traits. By integrating genome-wide association studies and diallel analyses, our contribution is four-fold: (i) we identified a set of molecular markers with major effects associated with disease resistance and post-harvest traits, while yield and plant architecture presented a polygenic background; (ii) we demonstrated the relevance of nonadditive gene actions and projected hybrid vigor when genotypes from different geographically botanical groups are crossed; (iii) we computed medium-to-large heritability values for most of the traits, representing potential for fast genetic progress; and (iv) we provided a first step toward implementing molecular breeding to accelerate improvements in C. canephora. Altogether, this work is a blueprint for how quantitative genetics and genomics can assist coffee breeding and support the supply chain in the face of the current global changes.
Collapse
Affiliation(s)
- Maria Amélia G Ferrão
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
- Empresa Brasileira de Pesquisa Agropecuária-Embrapa Café, Brasília, Brazil
| | - Aymbire F A da Fonseca
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
- Empresa Brasileira de Pesquisa Agropecuária-Embrapa Café, Brasília, Brazil
| | - Paulo S Volpi
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
| | - Lucimara C de Souza
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
| | - Marcone Comério
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
| | - Abraão C Verdin Filho
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
| | - Elaine M Riva-Souza
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
| | - Patricio R Munoz
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, FL, USA
| | - Romário G Ferrão
- Instituto Capixaba de Pesquisa, Assistência Técnica e Extensão Rural-Incaper, ES, Brazil
- Multivix Group, ES, Brazil
| | - Luís Felipe V Ferrão
- Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, FL, USA
| |
Collapse
|
5
|
Manthena V, Jarquín D, Howard R. Integrating and optimizing genomic, weather, and secondary trait data for multiclass classification. Front Genet 2023; 13:1032691. [PMID: 37065625 PMCID: PMC10090538 DOI: 10.3389/fgene.2022.1032691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 12/22/2022] [Indexed: 04/18/2023] Open
Abstract
Modern plant breeding programs collect several data types such as weather, images, and secondary or associated traits besides the main trait (e.g., grain yield). Genomic data is high-dimensional and often over-crowds smaller data types when naively combined to explain the response variable. There is a need to develop methods able to effectively combine different data types of differing sizes to improve predictions. Additionally, in the face of changing climate conditions, there is a need to develop methods able to effectively combine weather information with genotype data to predict the performance of lines better. In this work, we develop a novel three-stage classifier to predict multi-class traits by combining three data types-genomic, weather, and secondary trait. The method addressed various challenges in this problem, such as confounding, differing sizes of data types, and threshold optimization. The method was examined in different settings, including binary and multi-class responses, various penalization schemes, and class balances. Then, our method was compared to standard machine learning methods such as random forests and support vector machines using various classification accuracy metrics and using model size to evaluate the sparsity of the model. The results showed that our method performed similarly to or better than machine learning methods across various settings. More importantly, the classifiers obtained were highly sparse, allowing for a straightforward interpretation of relationships between the response and the selected predictors.
Collapse
Affiliation(s)
- Vamsi Manthena
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Diego Jarquín
- Agronomy Department, University of Florida, Gainesville, FL, United States
| | - Reka Howard
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE, United States
| |
Collapse
|
6
|
Seyum EG, Bille NH, Abtew WG, Munyengwa N, Bell JM, Cros D. Genomic selection in tropical perennial crops and plantation trees: a review. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2022; 42:58. [PMID: 37313015 PMCID: PMC10248687 DOI: 10.1007/s11032-022-01326-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 09/06/2022] [Indexed: 06/15/2023]
Abstract
To overcome the multiple challenges currently faced by agriculture, such as climate change and soil deterioration, more efficient plant breeding strategies are required. Genomic selection (GS) is crucial for the genetic improvement of quantitative traits, as it can increase selection intensity, shorten the generation interval, and improve selection accuracy for traits that are difficult to phenotype. Tropical perennial crops and plantation trees are of major economic importance and have consequently been the subject of many GS articles. In this review, we discuss the factors that affect GS accuracy (statistical models, linkage disequilibrium, information concerning markers, relatedness between training and target populations, the size of the training population, and trait heritability) and the genetic gain expected in these species. The impact of GS will be particularly strong in tropical perennial crops and plantation trees as they have long breeding cycles and constrained selection intensity. Future GS prospects are also discussed. High-throughput phenotyping will allow constructing of large training populations and implementing of phenomic selection. Optimized modeling is needed for longitudinal traits and multi-environment trials. The use of multi-omics, haploblocks, and structural variants will enable going beyond single-locus genotype data. Innovative statistical approaches, like artificial neural networks, are expected to efficiently handle the increasing amounts of heterogeneous multi-scale data. Targeted recombinations on sites identified from profiles of marker effects have the potential to further increase genetic gain. GS can also aid re-domestication and introgression breeding. Finally, GS consortia will play an important role in making the best of these opportunities. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-022-01326-4.
Collapse
Affiliation(s)
- Essubalew Getachew Seyum
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
- Department of Horticulture and Plant Sciences, College of Agriculture and Veterinary Medicine, Jimma University, P.O. Box 307, Jimma, Ethiopia
| | - Ngalle Hermine Bille
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
| | - Wosene Gebreselassie Abtew
- Department of Horticulture and Plant Sciences, College of Agriculture and Veterinary Medicine, Jimma University, P.O. Box 307, Jimma, Ethiopia
| | - Norman Munyengwa
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Brisbane, QLD 4072 Australia
| | - Joseph Martin Bell
- Department of Plant Biology and Physiology, Faculty of Sciences, University of Yaoundé I, Yaoundé, Cameroon
| | - David Cros
- CIRAD, UMR AGAP Institut, 34398 Montpellier, France
- UMR AGAP Institut, CIRAD, INRAE, Univ. Montpellier, Institut Agro, 34398 Montpellier, France
| |
Collapse
|
7
|
Genomic prediction through machine learning and neural networks for traits with epistasis. Comput Struct Biotechnol J 2022; 20:5490-5499. [PMID: 36249559 PMCID: PMC9547190 DOI: 10.1016/j.csbj.2022.09.029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 09/20/2022] [Accepted: 09/20/2022] [Indexed: 11/22/2022] Open
Abstract
Performance of machine learning and neural netowrks in Genomic analysis. Heritability and QTL number impacts on performance machine learning methods. Machine learning models in genomic analyses. Neural networks can present better performance for complex quantitative traits.
Genomic wide selection (GWS) is one contributions of molecular genetics to breeding. Machine learning (ML) and artificial neural networks (ANN) methods are non-parameterized and can develop more accurate and parsimonious models for GWS analysis. Multivariate Adaptive Regression Splines (MARS) is considered one of the most flexible ML methods, automatically modeling nonlinearities and interactions of the predictor variables. This study aimed to evaluate and compare methods based on ANN, ML, including MARS, and G-BLUP through GWS. An F2 population formed by 1000 individuals and genotyped for 4010 SNP markers and twelve traits from a model considering epistatic effect, with QTL numbers ranging from eight to 480 and heritability (h2) of 0.3, 0.5 or 0.8 were simulated. Variation in heritability and number of QTL impacts the performance of methods. About quantitative traits (40, 80, 120, 240, and 480 QTLs) was observed highest R2 to Radial Base Network (RBF) and G-BLUP, followed by Random Forest (RF), Bagging (BA), and Boosting (BO). RF and BA also showed better results for traits to h2 of 0.3 with R2 values 16.51% and 16.30%, respectively, while MARS methods showed better results for oligogenic traits with R2 values ranging from 39,12 % to 43,20 % in h2 of 0.5 and from 59.92% to 78,56% in h2 of 0.8. Non-additive MARS methods also showed high R2 for traits with high heritability and 240 QTLs or more. ANN and ML methods are powerful tools to predict genetic values in traits with epistatic effect, for different degrees of heritability and QTL numbers.
Collapse
|
8
|
Paixão PTM, Nascimento ACC, Nascimento M, Azevedo CF, Oliveira GF, da Silva FL, Caixeta ET. Factor analysis applied in genomic selection studies in the breeding of Coffea canephora. EUPHYTICA: NETHERLANDS JOURNAL OF PLANT BREEDING 2022; 218:42. [PMID: 35310815 PMCID: PMC8918905 DOI: 10.1007/s10681-022-02998-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 02/23/2022] [Indexed: 06/14/2023]
Abstract
Brazil stands out worldwide in the production of coffee. The observed increases in its productivity and morpho agronomic traits are the results of the improvement of several methodologies applied in obtaining improved cultivars, among which the predictive methods of genetic value stand out. These contribute significantly to the selection of higher genotypes, increasing the genetic gain per unit time. In this context, genomic-wide selection (GWS) is a tool that stands out, since it allows predicting the future phenotype of an individual based only on molecular information. Performing joint selection of traits is the interest of most breeding programs, and factor analysis (FA) has been used to assist in this end. The aim of this study was to evaluate the use of FA in the context of GWS, in genotypes of Coffea canephora. It was found that FA was efficient to elucidate the relationships between the traits and generate new variables. The factors formed can assist in the selection, as in addition to allowing joint interpretations, they present good estimates of predictive capacity, heritability and accuracy. Furthermore, high agreement was observed between the individuals selected based on the factors and those selected considering the individual traits. Additionally, it was observed agreement between the top 10% individuals selected based on the "vigor factor" and each variable individually. However, the selection based on "vigor factor" presented individuals with more suitable size from the phytotechnical point of view.
Collapse
Affiliation(s)
| | | | - Moysés Nascimento
- Department of Statistics, Universidade Federal de Viçosa (UFV), Viçosa, MG Brazil
| | | | | | | | | |
Collapse
|
9
|
Rodriguez Neira JD, Peripolli E, de Negreiros MPM, Espigolan R, López-Correa R, Aguilar I, Lobo RB, Baldi F. Prediction ability for growth and maternal traits using SNP arrays based on different marker densities in Nellore cattle using the ssGBLUP. J Appl Genet 2022; 63:389-400. [PMID: 35133621 DOI: 10.1007/s13353-022-00685-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 01/25/2022] [Accepted: 02/02/2022] [Indexed: 11/25/2022]
Abstract
This study aimed to investigate the prediction ability for growth and maternal traits using different low-density customized SNP arrays selected by informativeness and distribution of markers across the genome employing single-step genomic BLUP (ssGBLUP). Phenotypic records for adjusted weight at 210 and 450 days of age were utilized. A total of 945 animals were genotyped with high-density chip, and 267 individuals born after 2008 were selected as validation population. We evaluated 11 scenarios using five customized density arrays (40 k, 20 k, 10 k, 5 k and 2 k) and the HD array was used as desirable scenario. The GEBV predictions and BIF (Beef Improvement Federation) accuracy were obtained with BLUPF90 family programs. Linear regression was used to evaluate the prediction ability, inflation, and bias of GEBV of each customized array. An overestimation of partial GEBVs in contrast with complete GEBVs and increase of BIF accuracy with the density arrays diminished were observed. For all traits, the prediction ability was higher as the array density increased and it was similar with customized arrays higher than 10 k SNPs. Level of inflation was lower as the density array increased of and was higher for MW210 effect. The bias was susceptible to overestimation of GEBVs when the density customized arrays decreased. These results revealed that the BIF accuracy is sensible to overestimation using low-density customized arrays while the prediction ability with least 10,000 informative SNPs obtained from the Illumina BovineHD BeadChip shows accurate and less biased predictions. Low-density customized arrays under ssGBLUP method could be feasible and cost-effective in genomic selection.
Collapse
Affiliation(s)
- Juan Diego Rodriguez Neira
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil.
| | - Elisa Peripolli
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil
| | - Maria Paula Marinho de Negreiros
- Departamento de Medicina Veterinária, Faculdade de Zootecnia e Engenharia de Alimentos, Universidade de São Paulo (Usp), Pirassununga, 13535-900, Brazil
| | - Rafael Espigolan
- Departamento de Medicina Veterinária, Faculdade de Zootecnia e Engenharia de Alimentos, Universidade de São Paulo (Usp), Pirassununga, 13535-900, Brazil
| | - Rodrigo López-Correa
- Departamento de Genética y Mejoramiento Animal, Facultad de Veterinaria, Universidad de La República, Montevideo, Uruguay
| | - Ignacio Aguilar
- Instituto Nacional de Investigación Agropecuaria (INIA), Montevideo, Uruguay
| | - Raysildo B Lobo
- Associação Nacional de Criadores e Pesquisadores (ANCP), Ribeirão Preto, Brazil
| | - Fernando Baldi
- Departamento de Zootecnia, Faculdade de Ciências Agrarias e Veterinárias, Universidade Estadual Paulista (Unesp), Jaboticabal, 14884-900, Brazil
| |
Collapse
|
10
|
Coelho de Sousa I, Nascimento M, de Castro Sant’anna I, Teixeira Caixeta E, Ferreira Azevedo C, Damião Cruz C, Lopes da Silva F, Ruas Alkimim E, Campana Nascimento AC, Vergara Lopes Serão N. Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora. PLoS One 2022; 17:e0262055. [PMID: 35081139 PMCID: PMC8791507 DOI: 10.1371/journal.pone.0262055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 12/15/2021] [Indexed: 11/18/2022] Open
Abstract
Many methodologies are used to predict the genetic merit in animals and plants, but some of them require priori assumptions that may increase the complexity of the model. Artificial neural network (ANN) has advantage to not require priori assumptions about the relationships between inputs and the output allowing great flexibility to handle different types of complex non-additive effects, such as dominance and epistasis. Despite this advantage, the biological interpretability of ANNs is still limited. The aim of this research was to estimate the heritability and markers effects for two traits in Coffea canephora using an additive-dominance architecture ANN and to compare it with genomic best linear unbiased prediction (GBLUP). The data used consists of 51 clones of C. canephora varietal Conilon, 32 of varietal group Robusta and 82 intervarietal hybrids. From this, 165 phenotyped individuals were genotyped for 14,387 SNPs. Due to the high computational cost of ANNs, we used Bagging decision tree to reduce the dimensionality of the data, selecting the markers that accumulated 70% of the total importance. An ANN with three hidden layers was run, each varying from 1 to 40 neurons summing 64,000 neural networks. The network architectures with the best predictive ability were selected. The best architectures were composed by 4, 15, and 33 neurons in the first, second and third hidden layers, respectively, for yield, and by 13, 20, and 24 neurons, respectively for rust resistance. The predictive ability was greater when using ANN with three hidden layers than using one hidden layer and GBLUP, with 0.72 and 0.88 for yield and coffee leaf rust resistance, respectively. The concordance rate (CR) of the 10% larger markers effects among the methods varied between 10% and 13.8%, for additive effects and between 5.4% and 11.9% for dominance effects. The narrow-sense ([Formula: see text]) and dominance-only ([Formula: see text]) heritability estimates were 0.25 and 0.06, respectively, for yield, and 0.67 and 0.03, respectively for rust resistance. The ANN was able to estimate the heritabilities from an additive-dominance genomic architectures and the ANN with three hidden layers obtained best predictive ability when compared with those obtained from GBLUP and ANN with one hidden layer.
Collapse
Affiliation(s)
- Ithalo Coelho de Sousa
- Department of Animal Science, Iowa State University, Ames, Iowa, United States of America
- Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | - Moysés Nascimento
- Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | - Isabela de Castro Sant’anna
- Rubber Tree and Agroforestry Systems Research Center, Campinas Agronomy Institute (IAC), Votuporanga, São Paulo, Brazil
| | | | | | - Cosme Damião Cruz
- Department of General Biology, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | - Felipe Lopes da Silva
- Department of Plant Science, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | | | | |
Collapse
|
11
|
Alkimim ER, Caixeta ET, Sousa TV, Gois IB, Lopes da Silva F, Sakiyama NS, Zambolim L, Alves RS, de Resende MDV. Designing the best breeding strategy for Coffea canephora: Genetic evaluation of pure and hybrid individuals aiming to select for productivity and disease resistance traits. PLoS One 2021; 16:e0260997. [PMID: 34965248 PMCID: PMC8716045 DOI: 10.1371/journal.pone.0260997] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/20/2021] [Indexed: 12/03/2022] Open
Abstract
Breeding programs of the species Coffea canephora rely heavily on the significant genetic variability between and within its two varietal groups (conilon and robusta). The use of hybrid families and individuals has been less common. The objectives of this study were to evaluate parents and families from the populations of conilon, robusta, and its hybrids and to define the best breeding and selection strategies for productivity and disease resistance traits. As such, 71 conilon clones, 56 robusta clones, and 20 hybrid families were evaluated over several years for the following traits: vegetative vigor, incidence of rust and cercosporiosis, fruit ripening time, fruit size, plant height, canopy diameter, and yield per plant. Components of variance and genetic parameters were estimated via residual maximum likelihood (REML) and genotypic values were predicted via best linear unbiased prediction (BLUP). Genetic variability among parents (clones) and hybrid families was detected for most of the evaluated traits. The Mulamba-Rank index suggests potential gains up to 17% for the genotypic aggregate of traits in the hybrid population. An intrapopulation recurrent selection within the hybrid population would be the best breeding strategy because the genetic variability, narrow and broad senses heritabilities and selective accuracies for important traits were maximized in the crossed population. Besides, such strategy is simple, low cost and quicker than the concurrent reciprocal recurrent selection in the two parental populations, and this maximizes the genetic gain for unit of time.
Collapse
Affiliation(s)
| | - Eveline Teixeira Caixeta
- Brazilian Agricultural Research Corporation—Embrapa Café, Viçosa, MG, Brazil
- * E-mail: (ETC); (MDVR)
| | | | | | | | | | | | - Rodrigo Silva Alves
- National Institute of Coffee Science and Technology—INCT Café, Lavras, MG, Brazil
| | | |
Collapse
|
12
|
Strategies to Increase Prediction Accuracy in Genomic Selection of Complex Traits in Alfalfa ( Medicago sativa L.). Cells 2021; 10:cells10123372. [PMID: 34943880 PMCID: PMC8699225 DOI: 10.3390/cells10123372] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 11/19/2021] [Accepted: 11/24/2021] [Indexed: 12/27/2022] Open
Abstract
Agronomic traits such as biomass yield and abiotic stress tolerance are genetically complex and challenging to improve through conventional breeding approaches. Genomic selection (GS) is an alternative approach in which genome-wide markers are used to determine the genomic estimated breeding value (GEBV) of individuals in a population. In alfalfa (Medicago sativa L.), previous results indicated that low to moderate prediction accuracy values (<70%) were obtained in complex traits, such as yield and abiotic stress resistance. There is a need to increase the prediction value in order to employ GS in breeding programs. In this paper we reviewed different statistic models and their applications in polyploid crops, such as alfalfa and potato. Specifically, we used empirical data affiliated with alfalfa yield under salt stress to investigate approaches that use DNA marker importance values derived from machine learning models, and genome-wide association studies (GWAS) of marker-trait association scores based on different GWASpoly models, in weighted GBLUP analyses. This approach increased prediction accuracies from 50% to more than 80% for alfalfa yield under salt stress. Finally, we expended the weighted GBLUP approach to potato and analyzed 13 phenotypic traits and obtained similar results. This is the first report on alfalfa to use variable importance and GWAS-assisted approaches to increase the prediction accuracy of GS, thus helping to select superior alfalfa lines based on their GEBVs.
Collapse
|
13
|
Medeiros AC, Caixeta ET, Oliveira ACBD, Sousa TV, Stock VDM, Cruz CD, Zambolim L, Pereira AA. Combining Ability and Molecular Marker Approach Identified Genetic Resources to Improve Agronomic Performance in Coffea arabica Breeding. FRONTIERS IN SUSTAINABLE FOOD SYSTEMS 2021. [DOI: 10.3389/fsufs.2021.705278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Plant breeding aims to develop cultivars with good agronomic traits through gene recombination and elite genotype selection. To support Coffea arabica breeding programs and assist parent selection, molecular characterization, genetic diversity (GD) analyses, and circulating diallel studies were strategically integrated to develop new cultivars. Molecular markers were used to assess the GD of 76 candidate parents and verify the crossing of potential F1 hybrids. Based on the complementary agronomic traits and genetic distance, eight elite parents were selected for circulating diallel analysis. The parents and 12 hybrids were evaluated based on 10 morpho-agronomic traits. For each trait, the effects of general and specific combining abilities, as well as the averages of the parents, hybrids, and predicted hybrids, were estimated. Crosses that maximize the genetic gains for the main agronomic traits of C. arabica were identified. Joint analysis of phenotypic and molecular data was used to estimate the correlation between molecular GD, phenotypic diversity (PD), phenotypic mean, and combining ability. The selection of parents that optimize the allele combination for the important traits of C. arabica is discussed in detail.
Collapse
|
14
|
Dos Santos A, Rodrigues EV, Laviola BG, Teodoro LPR, Teodoro PE, Bhering LL. Increasing selection gain and accuracy of harvest prediction models in Jatropha through genome-wide selection. Sci Rep 2021; 11:13583. [PMID: 34193953 PMCID: PMC8245479 DOI: 10.1038/s41598-021-93022-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 06/17/2021] [Indexed: 11/09/2022] Open
Abstract
Genome-wide selection (GWS) has been becoming an essential tool in the genetic breeding of long-life species, as it increases the gain per time unit. This study had a hypothesis that GWS is a tool that can decrease the breeding cycle in Jatropha. Our objective was to compare GWS with phenotypic selection in terms of accuracy and efficiency over three harvests. Models were developed throughout the harvests to evaluate their applicability in predicting genetic values in later harvests. For this purpose, 386 individuals of the breeding population obtained from crossings between 42 parents were evaluated. The population was evaluated in random block design, with six replicates over three harvests. The genetic effects of markers were predicted in the population using 811 SNP's markers with call rate = 95% and minor allele frequency (MAF) > 4%. GWS enables gains of 108 to 346% over the phenotypic selection, with a 50% reduction in the selection cycle. This technique has potential for the Jatropha breeding since it allows the accurate obtaining of GEBV and higher efficiency compared to the phenotypic selection by reducing the time necessary to complete the selection cycle. In order to apply GWS in the first harvests, a large number of individuals in the breeding population are needed. In the case of few individuals in the population, it is recommended to perform a larger number of harvests.
Collapse
Affiliation(s)
| | - Erina Vitório Rodrigues
- Life and Earth Sciences, Universidade de Brasília - Campus Planaltina, Brasília, Distrito Federal, Brazil
| | - Bruno Galvêas Laviola
- Genetics and Biotechnology Laboratory, Embrapa Agroenergia, Brasília, Distrito Federal, Brazil
| | | | - Paulo Eduardo Teodoro
- Department of Agronomy, Universidade Federal Do Mato Grosso Do Sul, Chapadão Do Sul, Mato Grosso Do Sul, Brazil.
| | - Leonardo Lopes Bhering
- Department of General Biology, Universidade Federal de Viçosa, Viçosa, Minas Gerais, Brazil
| |
Collapse
|
15
|
Zhang D, Vega FE, Solano W, Su F, Infante F, Meinhardt LW. Selecting a core set of nuclear SNP markers for molecular characterization of Arabica coffee (Coffea arabica L.) genetic resources. CONSERV GENET RESOUR 2021. [DOI: 10.1007/s12686-021-01201-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
16
|
Gonçalves MTV, Morota G, Costa PMDA, Vidigal PMP, Barbosa MHP, Peternelli LA. Near-infrared spectroscopy outperforms genomics for predicting sugarcane feedstock quality traits. PLoS One 2021; 16:e0236853. [PMID: 33661948 PMCID: PMC7932073 DOI: 10.1371/journal.pone.0236853] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Accepted: 01/20/2021] [Indexed: 11/19/2022] Open
Abstract
The main objectives of this study were to evaluate the prediction performance of genomic and near-infrared spectroscopy (NIR) data and whether the integration of genomic and NIR predictor variables can increase the prediction accuracy of two feedstock quality traits (fiber and sucrose content) in a sugarcane population (Saccharum spp.). The following three modeling strategies were compared: M1 (genome-based prediction), M2 (NIR-based prediction), and M3 (integration of genomics and NIR wavenumbers). Data were collected from a commercial population comprised of three hundred and eighty-five individuals, genotyped for single nucleotide polymorphisms and screened using NIR spectroscopy. We compared partial least squares (PLS) and BayesB regression methods to estimate marker and wavenumber effects. In order to assess model performance, we employed random sub-sampling cross-validation to calculate the mean Pearson correlation coefficient between observed and predicted values. Our results showed that models fitted using BayesB were more predictive than PLS models. We found that NIR (M2) provided the highest prediction accuracy, whereas genomics (M1) presented the lowest predictive ability, regardless of the measured traits and regression methods used. The integration of predictors derived from NIR spectroscopy and genomics into a single model (M3) did not significantly improve the prediction accuracy for the two traits evaluated. These findings suggest that NIR-based prediction can be an effective strategy for predicting the genetic merit of sugarcane clones.
Collapse
Affiliation(s)
| | - Gota Morota
- Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, United States of America
| | | | | | | | | |
Collapse
|
17
|
Tong H, Nikoloski Z. Machine learning approaches for crop improvement: Leveraging phenotypic and genotypic big data. JOURNAL OF PLANT PHYSIOLOGY 2021; 257:153354. [PMID: 33385619 DOI: 10.1016/j.jplph.2020.153354] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 12/14/2020] [Accepted: 12/15/2020] [Indexed: 05/07/2023]
Abstract
Highly efficient and accurate selection of elite genotypes can lead to dramatic shortening of the breeding cycle in major crops relevant for sustaining present demands for food, feed, and fuel. In contrast to classical approaches that emphasize the need for resource-intensive phenotyping at all stages of artificial selection, genomic selection dramatically reduces the need for phenotyping. Genomic selection relies on advances in machine learning and the availability of genotyping data to predict agronomically relevant phenotypic traits. Here we provide a systematic review of machine learning approaches applied for genomic selection of single and multiple traits in major crops in the past decade. We emphasize the need to gather data on intermediate phenotypes, e.g. metabolite, protein, and gene expression levels, along with developments of modeling techniques that can lead to further improvements of genomic selection. In addition, we provide a critical view of factors that affect genomic selection, with attention to transferability of models between different environments. Finally, we highlight the future aspects of integrating high-throughput molecular phenotypic data from omics technologies with biological networks for crop improvement.
Collapse
Affiliation(s)
- Hao Tong
- Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany; Bioinformatics and Mathematical Modeling Department, Centre for Plant Systems Biology and Biotechnology, Plovdiv, Bulgaria; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany
| | - Zoran Nikoloski
- Bioinformatics Group, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany; Bioinformatics and Mathematical Modeling Department, Centre for Plant Systems Biology and Biotechnology, Plovdiv, Bulgaria; Systems Biology and Mathematical Modeling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany.
| |
Collapse
|
18
|
Akpertey A, Padi FK, Meinhardt L, Zhang D. Effectiveness of Single Nucleotide Polymorphism Markers in Genotyping Germplasm Collections of Coffea canephora Using KASP Assay. FRONTIERS IN PLANT SCIENCE 2021; 11:612593. [PMID: 33569071 PMCID: PMC7868401 DOI: 10.3389/fpls.2020.612593] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 12/28/2020] [Indexed: 06/12/2023]
Abstract
Accurate genotype identification is imperative for effective use of Coffea canephora L. germplasm to breed new varieties with tolerance or resistance to biotic and abiotic stresses (including moisture stress and pest and disease stresses such as coffee berry borer and rust) and for high yield and improved cup quality. The present study validated 192 published single nucleotide polymorphism (SNP) markers and selected a panel of 120 loci to examine parentage and labeling errors, genetic diversity, and population structure in 400 C. canephora accessions assembled from different coffee-producing countries and planted in a field gene bank in Ghana. Of the 400 genotypes analyzed, both synonymous (trees with same SNP profiles but different names, 12.8%) and homonymous (trees with same name but different SNP profiles, 5.8%) mislabeling were identified. Parentage analysis showed that 33.3% of the progenies derived from controlled crossing and 0% of the progenies derived from an open pollinated biclonal seed garden had parentage (both parents) corresponding to breeder records. The results suggest mislabeling of the mother trees used in seed gardens and pollen contamination from unwanted paternal parents. After removing the duplicated accessions, Bayesian clustering analysis partitioned the 270 unique genotypes into two main populations. Analysis of molecular variance (AMOVA) showed that the between-population variation accounts for 41% of the total molecular variation and the genetic divergence was highly significant (Fst = 0.256; P < 0.001). Taken together, our results demonstrate the effectiveness of using the selected SNP panel in gene bank management, varietal identification, seed garden management, nursery verification, and coffee bean authentication for C. canephora breeding programs.
Collapse
Affiliation(s)
| | | | - Lyndel Meinhardt
- Sustainable Perennial Crops Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD, United States
| | - Dapeng Zhang
- Sustainable Perennial Crops Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD, United States
| |
Collapse
|
19
|
Oliveira GF, Nascimento ACC, Nascimento M, Sant'Anna IDC, Romero JV, Azevedo CF, Bhering LL, Moura ETC. Quantile regression in genomic selection for oligogenic traits in autogamous plants: A simulation study. PLoS One 2021; 16:e0243666. [PMID: 33400704 PMCID: PMC7785117 DOI: 10.1371/journal.pone.0243666] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 11/25/2020] [Indexed: 11/19/2022] Open
Abstract
This study assessed the efficiency of Genomic selection (GS) or genome-wide selection (GWS), based on Regularized Quantile Regression (RQR), in the selection of genotypes to breed autogamous plant populations with oligogenic traits. To this end, simulated data of an F2 population were used, with traits with different heritability levels (0.10, 0.20 and 0.40), controlled by four genes. The generations were advanced (up to F6) at two selection intensities (10% and 20%). The genomic genetic value was computed by RQR for different quantiles (0.10, 0.50 and 0.90), and by the traditional GWS methods, specifically RR-BLUP and BLASSO. A second objective was to find the statistical methodology that allows the fastest fixation of favorable alleles. In general, the results of the RQR model were better than or equal to those of traditional GWS methodologies, achieving the fixation of favorable alleles in most of the evaluated scenarios. At a heritability level of 0.40 and a selection intensity of 10%, RQR (0.50) was the only methodology that fixed the alleles quickly, i.e., in the fourth generation. Thus, it was concluded that the application of RQR in plant breeding, to simulated autogamous plant populations with oligogenic traits, could reduce time and consequently costs, due to the reduction of selfing generations to fix alleles in the evaluated scenarios.
Collapse
Affiliation(s)
| | | | - Moysés Nascimento
- Department of Statistics, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | | - Juan Vicente Romero
- AGROSAVIA, The Colombian Agricultural Research Corporation, Mosquera, Colômbia
| | | | - Leonardo Lopes Bhering
- Department of General Biology, Federal University of Viçosa, Viçosa, Minas Gerais, Brazil
| | | |
Collapse
|
20
|
Luo S, He Y, Li Q, Jiao W, Zhu Y, Zhao X. Nondestructive estimation of potato yield using relative variables derived from multi-period LAI and hyperspectral data based on weighted growth stage. PLANT METHODS 2020; 16:150. [PMID: 33292407 PMCID: PMC7654003 DOI: 10.1186/s13007-020-00693-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Accepted: 11/02/2020] [Indexed: 06/12/2023]
Abstract
BACKGROUND The accurate estimation of potato yield at regional scales is crucial for food security, precision agriculture, and agricultural sustainable development. METHODS In this study, we developed a new method using multi-period relative vegetation indices (rVIs) and relative leaf area index (rLAI) data to improve the accuracy of potato yield estimation based on the weighted growth stage. Two experiments of field and greenhouse (water and nitrogen fertilizer experiments) in 2018 were performed to obtain the spectra and LAI data of the whole growth stage of potato. Then the weighted growth stage was determined by three weighting methods (improved analytic hierarchy process method, IAHP; entropy weight method, EW; and optimal combination weighting method, OCW) and the Slogistic model. A comparison of the estimation performance of rVI-based and rLAI-based models with a single and weighted stage was completed. RESULTS The results showed that among the six test rVIs, the relative red edge chlorophyll index (rCIred edge) was the optimal index of the single-stage estimation models with the correlation with potato yield. The most suitable single stage for potato yield estimation was the tuber expansion stage. For weighted growth stage models, the OCW-LAI model was determined as the best one to accurately predict the potato yield with an adjusted R2 value of 0.8333, and the estimation error about 8%. CONCLUSION This study emphasizes the importance of inconsistent contributions of multi-period or different types of data to the results when they are used together, and the weights need to be considered.
Collapse
Affiliation(s)
- Shanjun Luo
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- School of Remote Sensing and Information Engineering, Wuhan University, Wuhan, 430079, China
| | - Yingbin He
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| | - Qian Li
- School of Economics and Management, Tiangong University, Tianjin, 300387, China
| | - Weihua Jiao
- Center for Agricultural and Rural Economic Research, Shandong University of Finance and Economics, Jinan, 250014, China
| | - Yaqiu Zhu
- School of Economics and Management, Tiangong University, Tianjin, 300387, China
| | - Xihai Zhao
- Institute of Agricultural Resources and Regional Planning, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| |
Collapse
|
21
|
Cortés AJ, Restrepo-Montoya M, Bedoya-Canas LE. Modern Strategies to Assess and Breed Forest Tree Adaptation to Changing Climate. FRONTIERS IN PLANT SCIENCE 2020; 11:583323. [PMID: 33193532 PMCID: PMC7609427 DOI: 10.3389/fpls.2020.583323] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 09/29/2020] [Indexed: 05/02/2023]
Abstract
Studying the genetics of adaptation to new environments in ecologically and industrially important tree species is currently a major research line in the fields of plant science and genetic improvement for tolerance to abiotic stress. Specifically, exploring the genomic basis of local adaptation is imperative for assessing the conditions under which trees will successfully adapt in situ to global climate change. However, this knowledge has scarcely been used in conservation and forest tree improvement because woody perennials face major research limitations such as their outcrossing reproductive systems, long juvenile phase, and huge genome sizes. Therefore, in this review we discuss predictive genomic approaches that promise increasing adaptive selection accuracy and shortening generation intervals. They may also assist the detection of novel allelic variants from tree germplasm, and disclose the genomic potential of adaptation to different environments. For instance, natural populations of tree species invite using tools from the population genomics field to study the signatures of local adaptation. Conventional genetic markers and whole genome sequencing both help identifying genes and markers that diverge between local populations more than expected under neutrality, and that exhibit unique signatures of diversity indicative of "selective sweeps." Ultimately, these efforts inform the conservation and breeding status capable of pivoting forest health, ecosystem services, and sustainable production. Key long-term perspectives include understanding how trees' phylogeographic history may affect the adaptive relevant genetic variation available for adaptation to environmental change. Encouraging "big data" approaches (machine learning-ML) capable of comprehensively merging heterogeneous genomic and ecological datasets is becoming imperative, too.
Collapse
Affiliation(s)
- Andrés J. Cortés
- Corporación Colombiana de Investigación Agropecuaria AGROSAVIA, Rionegro, Colombia
- Departamento de Ciencias Forestales, Facultad de Ciencias Agrarias, Universidad Nacional de Colombia – Sede Medellín, Medellín, Colombia
| | - Manuela Restrepo-Montoya
- Departamento de Ciencias Forestales, Facultad de Ciencias Agrarias, Universidad Nacional de Colombia – Sede Medellín, Medellín, Colombia
| | - Larry E. Bedoya-Canas
- Departamento de Ciencias Forestales, Facultad de Ciencias Agrarias, Universidad Nacional de Colombia – Sede Medellín, Medellín, Colombia
| |
Collapse
|
22
|
Salgotra RK, Stewart CN. Functional Markers for Precision Plant Breeding. Int J Mol Sci 2020; 21:E4792. [PMID: 32640763 PMCID: PMC7370099 DOI: 10.3390/ijms21134792] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 06/19/2020] [Accepted: 07/02/2020] [Indexed: 01/24/2023] Open
Abstract
Advances in molecular biology including genomics, high-throughput sequencing, and genome editing enable increasingly faster and more precise cultivar development. Identifying genes and functional markers (FMs) that are highly associated with plant phenotypic variation is a grand challenge. Functional genomics approaches such as transcriptomics, targeting induced local lesions in genomes (TILLING), homologous recombinant (HR), association mapping, and allele mining are all strategies to identify FMs for breeding goals, such as agronomic traits and biotic and abiotic stress resistance. The advantage of FMs over other markers used in plant breeding is the close genomic association of an FM with a phenotype. Thereby, FMs may facilitate the direct selection of genes associated with phenotypic traits, which serves to increase selection efficiencies to develop varieties. Herein, we review the latest methods in FM development and how FMs are being used in precision breeding for agronomic and quality traits as well as in breeding for biotic and abiotic stress resistance using marker assisted selection (MAS) methods. In summary, this article describes the use of FMs in breeding for development of elite crop cultivars to enhance global food security goals.
Collapse
Affiliation(s)
- Romesh K. Salgotra
- School of Biotechnology, Sher-e-Kashmir University of Agricultural Sciences & Technology of Jammu, Chatha, Jammu 190008, India
| | - C. Neal Stewart
- Department of Plant Sciences, University of Tennessee, Knoxville, TN 37996, USA
| |
Collapse
|