1
|
Inamori M, Kimura T, Mori M, Tarumoto Y, Hattori T, Hayano M, Umeda M, Iwata H. Machine learning for genomic and pedigree prediction in sugarcane. THE PLANT GENOME 2024:e20486. [PMID: 38923818 DOI: 10.1002/tpg2.20486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 05/07/2024] [Accepted: 05/08/2024] [Indexed: 06/28/2024]
Abstract
Sugarcane (Saccharum spp.) plays a crucial role in global sugar production; however, the efficiency of breeding programs has been hindered by its heterozygous polyploid genomes. Considering non-additive genetic effects is essential in genome prediction (GP) models of crops with highly heterozygous polyploid genomes. This study incorporates non-additive genetic effects and pedigree information using machine learning methods to track sugarcane breeding lines and enhance the prediction by assessing the degree of association between genotypes. This study measured the stalk biomass and sugar content of 297 clones from 87 families within a breeding population used in the Japanese sugarcane breeding program. Subsequently, we conducted analyses based on the marker genotypes of 33,149 single-nucleotide polymorphisms. To validate the accuracy of GP in the population, we first predicted the prediction accuracy of the best linear unbiased prediction (BLUP) based on a genomic relationship matrix. Prediction accuracy was assessed using two different cross-validation methods: repeated 10-fold cross-validation and leave-one-family-out cross-validation. The accuracy of GP of the first and second methods ranged from 0.36 to 0.74 and 0.15 to 0.63, respectively. Next, we compared the prediction accuracy of BLUP and two machine learning methods: random forests and simulation annealing ensemble (SAE), a newly developed machine learning method that explicitly models the interaction between variables. Both pedigree and genomic information were utilized as input in these methods. Through repeated 10-fold cross-validation, we found that the accuracy of the machine learning methods consistently surpassed that of BLUP in most cases. In leave-one-family-out cross-validation, SAE demonstrated the highest accuracy among the methods. These results underscore the effectiveness of GP in Japanese sugarcane breeding and highlight the significant potential of machine learning methods.
Collapse
Affiliation(s)
- Minoru Inamori
- Laboratory of Biometry and Bioinformatics, Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| | - Tatsuro Kimura
- Toyota Motor Corporation, New Business Planning Division, Agriculture & Biotechnology Business Department, Toyota, Japan
| | - Masaaki Mori
- Toyota Motor Corporation, Environment Affairs and Engineering Management Division, CN Advanced Engineering Development Center, Tokyo, Japan
| | - Yusuke Tarumoto
- NARO Kyushu Okinawa Agricultural Research Center, Tanegashima Sugarcane Breeding Site, Nishinoomote, Japan
| | - Taiichiro Hattori
- NARO Kyushu Okinawa Agricultural Research Center, Tanegashima Sugarcane Breeding Site, Nishinoomote, Japan
- NARO Kyushu Okinawa Agricultural Research Center, Itoman Resident Office, Itoman, Japan
| | - Michiko Hayano
- NARO Kyushu Okinawa Agricultural Research Center, Tanegashima Sugarcane Breeding Site, Nishinoomote, Japan
- NARO Institute for Agro-Environmental Science, Tsukuba, Japan
| | - Makoto Umeda
- NARO Kyushu Okinawa Agricultural Research Center, Tanegashima Sugarcane Breeding Site, Nishinoomote, Japan
| | - Hiroyoshi Iwata
- Laboratory of Biometry and Bioinformatics, Department of Agricultural and Environmental Biology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
2
|
Daduwal HS, Bhardwaj R, Srivastava RK. Pearl millet a promising fodder crop for changing climate: a review. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:169. [PMID: 38913173 DOI: 10.1007/s00122-024-04671-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 06/05/2024] [Indexed: 06/25/2024]
Abstract
The agricultural sector faces colossal challenges amid environmental changes and a burgeoning human population. In this context, crops must adapt to evolving climatic conditions while meeting increasing production demands. The dairy industry is anticipated to hold the highest value in the agriculture sector in future. The rise in the livestock population is expected to result in an increased demand for fodder feed. Consequently, it is crucial to seek alternative options, as crops demand fewer resources and are resilient to climate change. Pearl millet offers an apposite key to these bottlenecks, as it is a promising climate resilience crop with significantly low energy, water and carbon footprints compared to other crops. Numerous studies have explored its potential as a fodder crop, revealing promising performance. Despite its capabilities, pearl millet has often been overlooked. To date, few efforts have been made to document molecular aspects of fodder-related traits. However, several QTLs and candidate genes related to forage quality have been identified in other fodder crops, which can be harnessed to enhance the forage quality of pearl millet. Lately, excellent genomic resources have been developed in pearl millet allowing deployment of cutting-edge genomics-assisted breeding for achieving a higher rate of genetic gains. This review would facilitate a deeper understanding of various aspects of fodder pearl millet in retrospect along with the future challenges and their solution. This knowledge may pave the way for designing efficient breeding strategies in pearl millet thereby supporting sustainable agriculture and livestock production in a changing world.
Collapse
Affiliation(s)
- Harmanpreet Singh Daduwal
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab, 141004, India
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - Ruchika Bhardwaj
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab, 141004, India
| | - Rakesh K Srivastava
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India.
| |
Collapse
|
3
|
Zhou C, Jiang W, Guo J, Zhu L, Liu L, Liu S, Chen R, Du B, Huang J. Genome-wide association study and genomic prediction for resistance to brown planthopper in rice. FRONTIERS IN PLANT SCIENCE 2024; 15:1373081. [PMID: 38576786 PMCID: PMC10991774 DOI: 10.3389/fpls.2024.1373081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 03/08/2024] [Indexed: 04/06/2024]
Abstract
The brown planthopper (BPH) is the most destructive insect pest that threatens rice production globally. Developing rice varieties incorporating BPH-resistant genes has proven to be an effective control measure against BPH. In this study, we assessed the resistance of a core collection consisting of 502 rice germplasms by evaluating resistance scores, weight gain rates and honeydew excretions. A total of 117 rice varieties (23.31%) exhibited resistance to BPH. Genome-wide association studies (GWAS) were performed on both the entire panel of 502 rice varieties and its subspecies, and 6 loci were significantly associated with resistance scores (P value < 1.0e-8). Within these loci, we identified eight candidate genes encoding receptor-like protein kinase (RLK), nucleotide-binding and leucine-rich repeat (NB-LRR), or LRR proteins. Two loci had not been detected in previous study and were entirely novel. Furthermore, we evaluated the predictive ability of genomic selection for resistance to BPH. The results revealed that the highest prediction accuracy for BPH resistance reached 0.633. As expected, the prediction accuracy increased progressively with an increasing number of SNPs, and a total of 6.7K SNPs displayed comparable accuracy to 268K SNPs. Among various statistical models tested, the random forest model exhibited superior predictive accuracy. Moreover, increasing the size of training population improved prediction accuracy; however, there was no significant difference in prediction accuracy between a training population size of 737 and 1179. Additionally, when there existed close genetic relatedness between the training and validation populations, higher prediction accuracies were observed compared to scenarios when they were genetically distant. These findings provide valuable resistance candidate genes and germplasm resources and are crucial for the application of genomic selection for breeding durable BPH-resistant rice varieties.
Collapse
Affiliation(s)
- Cong Zhou
- Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences/The Key Laboratory of Biology and Genetic Improvement of Oil Crops, The Ministry of Agriculture and Rural Affairs, Wuhan, China
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Weihua Jiang
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Jianping Guo
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Lili Zhu
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Lijiang Liu
- Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences/The Key Laboratory of Biology and Genetic Improvement of Oil Crops, The Ministry of Agriculture and Rural Affairs, Wuhan, China
| | - Shengyi Liu
- Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences/The Key Laboratory of Biology and Genetic Improvement of Oil Crops, The Ministry of Agriculture and Rural Affairs, Wuhan, China
| | - Rongzhi Chen
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Bo Du
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
| | - Jin Huang
- Cash Crops Research Institute, Hubei Academy of Agricultural Sciences, Wuhan, China
| |
Collapse
|
4
|
Blancon J, Buet C, Dubreuil P, Tixier MH, Baret F, Praud S. Maize green leaf area index dynamics: genetic basis of a new secondary trait for grain yield in optimal and drought conditions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:68. [PMID: 38441678 PMCID: PMC10914915 DOI: 10.1007/s00122-024-04572-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 02/03/2024] [Indexed: 03/07/2024]
Abstract
KEY MESSAGE Green Leaf Area Index dynamics is a promising secondary trait for grain yield and drought tolerance. Multivariate GWAS is particularly well suited to identify the genetic determinants of the green leaf area index dynamics. Improvement of maize grain yield is impeded by important genotype-environment interactions, especially under drought conditions. The use of secondary traits, that are correlated with yield, more heritable and less prone to genotype-environment interactions, can increase breeding efficiency. Here, we studied the genetic basis of a new secondary trait: the green leaf area index (GLAI) dynamics over the maize life cycle. For this, we used an unmanned aerial vehicle to characterize the GLAI dynamics of a diverse panel in well-watered and water-deficient trials in two years. From the dynamics, we derived 24 traits (slopes, durations, areas under the curve), and showed that six of them were heritable traits representative of the panel diversity. To identify the genetic determinants of GLAI, we compared two genome-wide association approaches: a univariate (single-trait) method and a multivariate (multi-trait) method combining GLAI traits, grain yield, and precocity. The explicit modeling of correlation structure between secondary traits and grain yield in the multivariate mixed model led to 2.5 times more associations detected. A total of 475 quantitative trait loci (QTLs) were detected. The genetic architecture of GLAI traits appears less complex than that of yield with stronger-effect QTLs that are more stable between environments. We also showed that a subset of GLAI QTLs explains nearly one fifth of yield variability across a larger environmental network of 11 water-deficient trials. GLAI dynamics is a promising grain yield secondary trait in optimal and drought conditions, and the detected QTLs could help to increase breeding efficiency through a marker-assisted approach.
Collapse
Affiliation(s)
- Justin Blancon
- UMR GDEC, INRAE, Université Clermont Auvergne, 63000, Clermont-Ferrand, France.
- Biogemma, Centre de Recherche de Chappes, 63720, Chappes, France.
| | - Clément Buet
- Biogemma, Centre de Recherche de Chappes, 63720, Chappes, France
| | - Pierre Dubreuil
- Biogemma, Centre de Recherche de Chappes, 63720, Chappes, France
| | | | | | - Sébastien Praud
- Biogemma, Centre de Recherche de Chappes, 63720, Chappes, France
| |
Collapse
|
5
|
Hoque A, Anderson JV, Rahman M. Genomic prediction for agronomic traits in a diverse Flax (Linum usitatissimum L.) germplasm collection. Sci Rep 2024; 14:3196. [PMID: 38326469 PMCID: PMC10850546 DOI: 10.1038/s41598-024-53462-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 01/31/2024] [Indexed: 02/09/2024] Open
Abstract
Breeding programs require exhaustive phenotyping of germplasms, which is time-demanding and expensive. Genomic prediction helps breeders harness the diversity of any collection to bypass phenotyping. Here, we examined the genomic prediction's potential for seed yield and nine agronomic traits using 26,171 single nucleotide polymorphism (SNP) markers in a set of 337 flax (Linum usitatissimum L.) germplasm, phenotyped in five environments. We evaluated 14 prediction models and several factors affecting predictive ability based on cross-validation schemes. Models yielded significant variation among predictive ability values across traits for the whole marker set. The ridge regression (RR) model covering additive gene action yielded better predictive ability for most of the traits, whereas it was higher for low heritable traits by models capturing epistatic gene action. Marker subsets based on linkage disequilibrium decay distance gave significantly higher predictive abilities to the whole marker set, but for randomly selected markers, it reached a plateau above 3000 markers. Markers having significant association with traits improved predictive abilities compared to the whole marker set when marker selection was made on the whole population instead of the training set indicating a clear overfitting. The correction for population structure did not increase predictive abilities compared to the whole collection. However, stratified sampling by picking representative genotypes from each cluster improved predictive abilities. The indirect predictive ability for a trait was proportionate to its correlation with other traits. These results will help breeders to select the best models, optimum marker set, and suitable genotype set to perform an indirect selection for quantitative traits in this diverse flax germplasm collection.
Collapse
Affiliation(s)
- Ahasanul Hoque
- Department of Plant Sciences, North Dakota State University, Fargo, ND, USA
- Department of Genetics and Plant Breeding, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
| | - James V Anderson
- USDA-ARS, Edward T. Schafer Agricultural Research Center, Fargo, ND, USA
| | - Mukhlesur Rahman
- Department of Plant Sciences, North Dakota State University, Fargo, ND, USA.
| |
Collapse
|
6
|
Montesinos-López A, Gutiérrez-Pulido H, Ramos-Pulido S, Montesinos-López JC, Montesinos-López OA, Crossa J. Bayesian discrete lognormal regression model for genomic prediction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:21. [PMID: 38221602 DOI: 10.1007/s00122-023-04526-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 12/11/2023] [Indexed: 01/16/2024]
Abstract
KEY MESSAGE Genomic prediction models for quantitative traits assume continuous and normally distributed phenotypes. In this research, we proposed a novel Bayesian discrete lognormal regression model. Genomic selection is a powerful tool in modern breeding programs that uses genomic information to predict the performance of individuals and select those with desirable traits. It has revolutionized animal and plant breeding, as it allows breeders to identify the best candidates without labor-intensive and time-consuming phenotypic evaluations. While several statistical models have been developed, most of them have been for quantitative continuous traits and only a few for count responses. In this paper, we propose a discrete lognormal regression model in the Bayesian context, that with a Gibbs sampler to explore the corresponding posterior distribution and make the predictions. Two datasets of resistance disease is used in the wheat crop and are then evaluated against the traditional Gaussian model and a lognormal model. The results indicate the proposed model is a competitive and natural model for predicting count genomic traits.
Collapse
Affiliation(s)
- Abelardo Montesinos-López
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, C. P. 44430, Guadalajara, Jalisco, México
| | - Humberto Gutiérrez-Pulido
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, C. P. 44430, Guadalajara, Jalisco, México
| | - Sofía Ramos-Pulido
- Departamento de Matemáticas, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara, C. P. 44430, Guadalajara, Jalisco, México
| | | | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Carretera México-Veracruz Km. 45, El Batán, C. P. 56237, Texcoco, Edo. de México, México.
- Colegio de Postgraduados, C. P. 56230, Montecillos, Edo. de México, México.
- Centre for Crop & Food Innovation, Food Futures Institute, Murdoch University, Murdoch, 6150, Australia.
| |
Collapse
|
7
|
Rabieyan E, Darvishzadeh R, Alipour H. Genetic analyses and prediction for lodging‑related traits in a diverse Iranian hexaploid wheat collection. Sci Rep 2024; 14:275. [PMID: 38167972 PMCID: PMC10761700 DOI: 10.1038/s41598-023-49927-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 12/13/2023] [Indexed: 01/05/2024] Open
Abstract
Lodging is one of the most important limiting environmental factors for achieving the maximum yield and quality of grains in cereals, including wheat. However, little is known about the genetic foundation underlying lodging resistance (LR) in wheat. In this study, 208 landraces and 90 cultivars were phenotyped in two cropping seasons (2018-2019 and 2019-2020) for 19 LR-related traits. A genome-wide association study (GWAS) and genomics prediction were carried out to dissect the genomic regions of LR. The number of significant marker pairs (MPs) was highest for genome B in both landraces (427,017) and cultivars (37,359). The strongest linkage disequilibrium (LD) between marker pairs was found on chromosome 4A (0.318). For stem lodging-related traits, 465, 497, and 478 marker-trait associations (MTAs) and 45 candidate genes were identified in year 1, year 2, and pooled. Gene ontology exhibited genomic region on Chr. 2B, 6B, and 7B control lodging. Most of these genes have key roles in defense response, calcium ion transmembrane transport, carbohydrate metabolic process, nitrogen compound metabolic process, and some genes harbor unknown functions that, all together may respond to lodging as a complex network. The module associated with starch and sucrose biosynthesis was highlighted. Regarding genomic prediction, the GBLUP model performed better than BRR and RRBLUP. This suggests that GBLUP would be a good tool for wheat genome selection. As a result of these findings, it has been possible to identify pivotal QTLs and genes that could be used to improve stem lodging resistance in Triticum aestivum L.
Collapse
Affiliation(s)
- Ehsan Rabieyan
- Department of Plant Production and Genetics, Urmia University, Urmia, Iran
| | - Reza Darvishzadeh
- Department of Plant Production and Genetics, Urmia University, Urmia, Iran
| | - Hadi Alipour
- Department of Plant Production and Genetics, Urmia University, Urmia, Iran.
| |
Collapse
|
8
|
Magon G, De Rosa V, Martina M, Falchi R, Acquadro A, Barcaccia G, Portis E, Vannozzi A, De Paoli E. Boosting grapevine breeding for climate-smart viticulture: from genetic resources to predictive genomics. FRONTIERS IN PLANT SCIENCE 2023; 14:1293186. [PMID: 38148866 PMCID: PMC10750425 DOI: 10.3389/fpls.2023.1293186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 11/27/2023] [Indexed: 12/28/2023]
Abstract
The multifaceted nature of climate change is increasing the urgency to select resilient grapevine varieties, or generate new, fitter cultivars, to withstand a multitude of new challenging conditions. The attainment of this goal is hindered by the limiting pace of traditional breeding approaches, which require decades to result in new selections. On the other hand, marker-assisted breeding has proved useful when it comes to traits governed by one or few genes with great effects on the phenotype, but its efficacy is still restricted for complex traits controlled by many loci. On these premises, innovative strategies are emerging which could help guide selection, taking advantage of the genetic diversity within the Vitis genus in its entirety. Multiple germplasm collections are also available as a source of genetic material for the introgression of alleles of interest via adapted and pioneering transformation protocols, which present themselves as promising tools for future applications on a notably recalcitrant species such as grapevine. Genome editing intersects both these strategies, not only by being an alternative to obtain focused changes in a relatively rapid way, but also by supporting a fine-tuning of new genotypes developed with other methods. A review on the state of the art concerning the available genetic resources and the possibilities of use of innovative techniques in aid of selection is presented here to support the production of climate-smart grapevine genotypes.
Collapse
Affiliation(s)
- Gabriele Magon
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), Laboratory of Plant Genetics and Breeding, University of Padova, Agripolis, Viale dell’Università 16, Legnaro, Italy
| | - Valeria De Rosa
- Department of Agricultural, Food, Environmental and Animal Sciences (DI4A), University of Udine, Via delle Scienze, 206, Udine, Italy
| | - Matteo Martina
- Department of Agricultural, Forest and Food Sciences (DISAFA), Plant Genetics, University of Torino, Largo P. Braccini 2, Grugliasco, Italy
| | - Rachele Falchi
- Department of Agricultural, Food, Environmental and Animal Sciences (DI4A), University of Udine, Via delle Scienze, 206, Udine, Italy
| | - Alberto Acquadro
- Department of Agricultural, Forest and Food Sciences (DISAFA), Plant Genetics, University of Torino, Largo P. Braccini 2, Grugliasco, Italy
| | - Gianni Barcaccia
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), Laboratory of Plant Genetics and Breeding, University of Padova, Agripolis, Viale dell’Università 16, Legnaro, Italy
| | - Ezio Portis
- Department of Agricultural, Forest and Food Sciences (DISAFA), Plant Genetics, University of Torino, Largo P. Braccini 2, Grugliasco, Italy
| | - Alessandro Vannozzi
- Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), Laboratory of Plant Genetics and Breeding, University of Padova, Agripolis, Viale dell’Università 16, Legnaro, Italy
| | - Emanuele De Paoli
- Department of Agricultural, Food, Environmental and Animal Sciences (DI4A), University of Udine, Via delle Scienze, 206, Udine, Italy
| |
Collapse
|
9
|
Meher PK, Gupta A, Rustgi S, Mir RR, Kumar A, Kumar J, Balyan HS, Gupta PK. Evaluation of eight Bayesian genomic prediction models for three micronutrient traits in bread wheat (Triticum aestivum L.). THE PLANT GENOME 2023; 16:e20332. [PMID: 37122189 DOI: 10.1002/tpg2.20332] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 02/21/2023] [Accepted: 03/13/2023] [Indexed: 06/19/2023]
Abstract
In wheat, genomic prediction accuracy (GPA) was assessed for three micronutrient traits (grain iron, grain zinc, and β-carotenoid concentrations) using eight Bayesian regression models. For this purpose, data on 246 accessions, each genotyped with 17,937 DArT markers, were utilized. The phenotypic data on traits were available for 2013-2014 from Powerkheda (Madhya Pradesh) and for 2014-2015 from Meerut (Uttar Pradesh), India. The accuracy of the models was measured in terms of reliability, which was computed following a repeated cross-validation approach. The predictions were obtained independently for each of the two environments after adjusting for the local effects and across environments after adjusting for the environmental effects. The Bayes ridge regression (BayesRR) model outperformed the other seven models, whereas BayesLASSO (BayesL) was the least efficient. The GPA increased with an increase in the size of the training set as well as with an increase in marker density. The GPA values differed for the three traits and were higher for the best linear unbiased estimate (BLUE) (obtained after adjusting for the environmental effects) relative to those for the two environments. The GPA also remained unaffected after accounting for the population structure. The results of the present study suggest that only the best model should be used for the estimations of genomic estimated breeding values (GEBVs) before their use for genomic selection to improve the grain micronutrient contents.
Collapse
Affiliation(s)
- Prabina Kumar Meher
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Ajit Gupta
- Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sachin Rustgi
- Department of Plant and Environmental Sciences, Pee Dee Research and Education Centre, Clemson University, Florence, South Carolina, USA
| | - Reyazul Rouf Mir
- Division of Genetics and Plant Breeding, SKUAST-Kashmir, Kashmir, India
| | - Anuj Kumar
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
- Laboratory of Immunity, Shantou University Medical College, Shantou, People's Republic of China
| | - Jitendra Kumar
- National Agri-Food Biotechnology Institute (NABI), Ajitgarh, India
| | - Harindra Singh Balyan
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, India
| | - Pushpendra Kumar Gupta
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, India
| |
Collapse
|
10
|
Rai M, Chucha D, Deepika D, Lap B, Magudeeswari P, Padmavathi G, Singh N, Tyagi W. Pyramiding of qDTY 1.1 and qDTY 3.1 into rice mega-variety Samba Mahsuri-Sub1: physiological performance under water deficit conditions. PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2023; 29:1931-1943. [PMID: 38222275 PMCID: PMC10784446 DOI: 10.1007/s12298-023-01387-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 10/18/2023] [Accepted: 11/09/2023] [Indexed: 01/16/2024]
Abstract
Water deficit is a crucial factor causing huge loss to rice productivity. The present study aimed to develop a multiple stress tolerant genotype by pyramiding drought tolerant yield QTLs qDTY1.1 and qDTY3.1 into high yielding rice variety Samba Mahsuri Sub-1(SMS) through marker assisted pyramiding. To achieve this six introgression lines of SMS carrying qDTY1.1 (SAB) were crossed with DRR-50, an Essentially Derived Variety of SMS carrying qDTY3.1. The SAB lines are taller than SMS due to tight linkage between qDTY1.1 and wild type SD-1. Therefore, F2 generation of crosses were screened for recombinants between SD-1 and qDTY1.1. Phenotyping of 1530 F2 plants representing three F2 populations from 35 F1 hybrids, identified 305 dwarf plants. Three dwarf F2 plants along with three others carrying qDTY1.1 and qDTY3.1 were forwarded to F3 generation. From the six F3 (SABD) lines fourteen pyramided progenies were selected and forward to F4 generation. The six SABD F3 lines SABD-7, SABD-8, SABD-9, SABD-76, SABD-79 and SABD-80 along with parents were evaluated under moisture stress (MS) for various physiological parameters. Chlorophyll and relative water content were more, while canopy temperature and malonaldehyde (MDA) content were lesser in SABD lines compared to parents indicating tolerance under MS. Variance due to genotypes was highly significant for all the yield related traits except test weight. Based on seed morphology, agronomic characters and physiological parameters six superior lines SABD-9-3, SABD-9-2, SABD-9-6, SABD-9-7, SABD-76-2 and SABD-76-6 performing better under MS were identified, which could be released after multi-location evaluation. Supplementary Information The online version contains supplementary material available at 10.1007/s12298-023-01387-5.
Collapse
Affiliation(s)
- Mayank Rai
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
| | - Diezehlouno Chucha
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
| | - Dake Deepika
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
| | - Bharati Lap
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
| | - P. Magudeeswari
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
| | - G. Padmavathi
- ICAR- Indian Institute of Rice Research, Hyderabad, Telangana India
| | - Nagendra Singh
- ICAR- National Institute for Plant Biotechnology, New Delhi, India
| | - Wricha Tyagi
- School of Crop Improvement, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University (Imphal), Umiam, India
- Present Address: International Crops Research Institute for the Semi-Arid Tropics, Patancheru, Telangana India
| |
Collapse
|
11
|
Kinoshita S, Sakurai K, Hamazaki K, Tsusaka T, Sakurai M, Kurosawa T, Aoki Y, Shirasawa K, Isobe S, Iwata H. Assessing the Potential for Genome-Assisted Breeding in Red Perilla Using Quantitative Trait Locus Analysis and Genomic Prediction. Genes (Basel) 2023; 14:2137. [PMID: 38136959 PMCID: PMC10742415 DOI: 10.3390/genes14122137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 11/24/2023] [Accepted: 11/25/2023] [Indexed: 12/24/2023] Open
Abstract
Red perilla is an important medicinal plant used in Kampo medicine. The development of elite varieties of this species is urgently required. Medicinal compounds are generally considered target traits in medicinal plant breeding; however, selection based on compound phenotypes (i.e., conventional selection) is expensive and time consuming. Here, we propose genomic selection (GS) and marker-assisted selection (MAS), which use marker information for selection, as suitable selection methods for medicinal plants, and we evaluate the effectiveness of these methods in perilla breeding. Three breeding populations generated from crosses between one red and three green perilla genotypes were used to elucidate the genetic mechanisms underlying the production of major medicinal compounds using quantitative trait locus analysis and evaluating the accuracy of genomic prediction (GP). We found that GP had a sufficiently high accuracy for all traits, confirming that GS is an effective method for perilla breeding. Moreover, the three populations showed varying degrees of segregation, suggesting that using these populations in breeding may simultaneously enhance multiple target traits. This study contributes to research on the genetic mechanisms of the major medicinal compounds of red perilla, as well as the breeding efficiency of this medicinal plant.
Collapse
Affiliation(s)
- Sei Kinoshita
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo, Tokyo 113-8657, Japan; (S.K.); (K.S.)
| | - Kengo Sakurai
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo, Tokyo 113-8657, Japan; (S.K.); (K.S.)
| | - Kosuke Hamazaki
- RIKEN Center for Advanced Intelligence Project, Kashiwa, Chiba 227-0871, Japan;
| | - Takahiro Tsusaka
- TSUMURA & CO., Ami, Ibaraki 300-1155, Japan; (T.T.); (M.S.); (T.K.); (Y.A.)
| | - Miki Sakurai
- TSUMURA & CO., Ami, Ibaraki 300-1155, Japan; (T.T.); (M.S.); (T.K.); (Y.A.)
| | - Terue Kurosawa
- TSUMURA & CO., Ami, Ibaraki 300-1155, Japan; (T.T.); (M.S.); (T.K.); (Y.A.)
| | - Youichi Aoki
- TSUMURA & CO., Ami, Ibaraki 300-1155, Japan; (T.T.); (M.S.); (T.K.); (Y.A.)
| | - Kenta Shirasawa
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan; (K.S.); (S.I.)
| | - Sachiko Isobe
- Kazusa DNA Research Institute, Kisarazu, Chiba 292-0818, Japan; (K.S.); (S.I.)
| | - Hiroyoshi Iwata
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo, Tokyo 113-8657, Japan; (S.K.); (K.S.)
| |
Collapse
|
12
|
Heinrich F, Lange TM, Kircher M, Ramzan F, Schmitt AO, Gültas M. Exploring the potential of incremental feature selection to improve genomic prediction accuracy. Genet Sel Evol 2023; 55:78. [PMID: 37946104 PMCID: PMC10634161 DOI: 10.1186/s12711-023-00853-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 11/02/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND The ever-increasing availability of high-density genomic markers in the form of single nucleotide polymorphisms (SNPs) enables genomic prediction, i.e. the inference of phenotypes based solely on genomic data, in the field of animal and plant breeding, where it has become an important tool. However, given the limited number of individuals, the abundance of variables (SNPs) can reduce the accuracy of prediction models due to overfitting or irrelevant SNPs. Feature selection can help to reduce the number of irrelevant SNPs and increase the model performance. In this study, we investigated an incremental feature selection approach based on ranking the SNPs according to the results of a genome-wide association study that we combined with random forest as a prediction model, and we applied it on several animal and plant datasets. RESULTS Applying our approach to different datasets yielded a wide range of outcomes, i.e. from a substantial increase in prediction accuracy in a few cases to minor improvements when only a fraction of the available SNPs were used. Compared with models using all available SNPs, our approach was able to achieve comparable performances with a considerably reduced number of SNPs in several cases. Our approach showcased state-of-the-art efficiency and performance while having a faster computation time. CONCLUSIONS The results of our study suggest that our incremental feature selection approach has the potential to improve prediction accuracy substantially. However, this gain seems to depend on the genomic data used. Even for datasets where the number of markers is smaller than the number of individuals, feature selection may still increase the performance of the genomic prediction. Our approach is implemented in R and is available at https://github.com/FelixHeinrich/GP_with_IFS/ .
Collapse
Affiliation(s)
- Felix Heinrich
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075, Göttingen, Germany.
| | - Thomas Martin Lange
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075, Göttingen, Germany
| | - Magdalena Kircher
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Bünteweg 17p, 30559, Hannover, Germany
| | - Faisal Ramzan
- Institute of Animal and Dairy Sciences, University of Agriculture Faisalabad, Jail Road, 38000, Faisalabad, Pakistan
| | - Armin Otto Schmitt
- Breeding Informatics Group, Department of Animal Sciences, Georg-August University, Margarethe von Wrangell-Weg 7, 37075, Göttingen, Germany
- Center for Integrated Breeding Research (CiBreed), Georg-August University, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany
| | - Mehmet Gültas
- Center for Integrated Breeding Research (CiBreed), Georg-August University, Albrecht-Thaer-Weg 3, 37075, Göttingen, Germany.
- Faculty of Agriculture, South Westphalia University of Applied Sciences, 59494, Soest, Germany.
| |
Collapse
|
13
|
Zhang Y, Zhang M, Ye J, Xu Q, Feng Y, Xu S, Hu D, Wei X, Hu P, Yang Y. Integrating genome-wide association study into genomic selection for the prediction of agronomic traits in rice ( Oryza sativa L.). MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2023; 43:81. [PMID: 37965378 PMCID: PMC10641074 DOI: 10.1007/s11032-023-01423-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/09/2023] [Indexed: 11/16/2023]
Abstract
Accurately identifying varieties with targeted agronomic traits was thought to contribute to genetic selection and accelerate rice breeding progress. Genomic selection (GS) is a promising technique that uses markers covering the whole genome to predict the genomic-estimated breeding values (GEBV), with the ability to select before phenotypes are measured. To choose the appropriate GS models for breeding work, we analyzed the predictability of nine agronomic traits measured from a population of 459 diverse rice varieties. By the comparison of eight representative GS models, we found that the prediction accuracies ranged from 0.407 to 0.896, with reproducing kernel Hilbert space (RKHS) having the highest predictive ability in most traits. Further results demonstrated the predictivity of GS is altered by several factors. Moreover, we assessed the method of integrating genome-wide association study (GWAS) into various GS models. The predictabilities of GS combined peak-associated markers generated from six different GWAS models were significantly different; a recommendation of Mixed Linear Model (MLM)-RKHS was given for the GWAS-GS-integrated prediction. Finally, based on the above result, we experimented with applying the P-values obtained from optimal GWAS models into ridge regression best linear unbiased prediction (rrBLUP), which benefited the low predictive traits in rice. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-023-01423-y.
Collapse
Affiliation(s)
- Yuanyuan Zhang
- Zhejiang Lab, Hangzhou, 311121 China
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Mengchen Zhang
- Zhejiang Lab, Hangzhou, 311121 China
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024 China
| | - Junhua Ye
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Qun Xu
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Yue Feng
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024 China
| | - Siliang Xu
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Dongxiu Hu
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Xinghua Wei
- Zhejiang Lab, Hangzhou, 311121 China
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024 China
| | - Peisong Hu
- Zhejiang Lab, Hangzhou, 311121 China
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
| | - Yaolong Yang
- Zhejiang Lab, Hangzhou, 311121 China
- CNRRI-Zhejiang Lab Computational Breeding Joint Laboratory, China National Rice Research Institute, Hangzhou, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024 China
| |
Collapse
|
14
|
de Verdal H, Baertschi C, Frouin J, Quintero C, Ospina Y, Alvarez MF, Cao TV, Bartholomé J, Grenier C. Optimization of Multi-Generation Multi-location Genomic Prediction Models for Recurrent Genomic Selection in an Upland Rice Population. RICE (NEW YORK, N.Y.) 2023; 16:43. [PMID: 37758969 PMCID: PMC10533757 DOI: 10.1186/s12284-023-00661-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 09/19/2023] [Indexed: 09/29/2023]
Abstract
Genomic selection is a worthy breeding method to improve genetic gain in recurrent selection breeding schemes. The integration of multi-generation and multi-location information could significantly improve genomic prediction models in the context of shuttle breeding. The Cirad-CIAT upland rice breeding program applies recurrent genomic selection and seeks to optimize the scheme to increase genetic gain while reducing phenotyping efforts. We used a synthetic population (PCT27) of which S0 plants were all genotyped and advanced by selfing and bulk seed harvest to the S0:2, S0:3, and S0:4 generations. The PCT27 was then divided into two sets. The S0:2 and S0:3 progenies for PCT27A and the S0:4 progenies for PCT27B were phenotyped in two locations: Santa Rosa the target selection location, within the upland rice growing area, and Palmira, the surrogate location, far from the upland rice growing area but easier for experimentation. While the calibration used either one of the two sets phenotyped in one or two locations, the validation population was only the PCT27B phenotyped in Santa Rosa. Five scenarios of genomic prediction and 24 models were performed and compared. Training the prediction model with the PCT27B phenotyped in Santa Rosa resulted in predictive abilities ranging from 0.19 for grain zinc concentration to 0.30 for grain yield. Expanding the training set with the inclusion of the PCT27A resulted in greater predictive abilities for all traits but grain yield, with increases from 5% for plant height to 61% for grain zinc concentration. Models with the PCT27B phenotyped in two locations resulted in higher prediction accuracy when the models assumed no genotype-by-environment (G × E) interaction for flowering (0.38) and grain zinc concentration (0.27). For plant height, the model assuming a single G × E variance provided higher accuracy (0.28). The gain in predictive ability for grain yield was the greatest (0.25) when environment-specific variance deviation effect for G × E was considered. While the best scenario was specific to each trait, the results indicated that the gain in predictive ability provided by the multi-location and multi-generation calibration was low. Yet, this approach could lead to increased selection intensity, acceleration of the breeding cycle, and a sizable economic advantage for the program.
Collapse
Affiliation(s)
- Hugues de Verdal
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France.
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France.
| | - Cédric Baertschi
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Julien Frouin
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Constanza Quintero
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | - Yolima Ospina
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | | | - Tuong-Vi Cao
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
| | - Jérôme Bartholomé
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia
| | - Cécile Grenier
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France.
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, 34398, Montpellier, France.
- Alliance Bioversity-CIAT, A.A.6713, Km 17 Recta Palmira Cali, Cali, Colombia.
| |
Collapse
|
15
|
Bartholomé J, Frouin J, Brottier L, Cao TV, Boisnard A, Ahmadi N, Courtois B. Genomic selection for salinity tolerance in japonica rice. PLoS One 2023; 18:e0291833. [PMID: 37756295 PMCID: PMC10530037 DOI: 10.1371/journal.pone.0291833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 09/06/2023] [Indexed: 09/29/2023] Open
Abstract
Improving plant performance in salinity-prone conditions is a significant challenge in breeding programs. Genomic selection is currently integrated into many plant breeding programs as a tool for increasing selection intensity and precision for complex traits and for reducing breeding cycle length. A rice reference panel (RP) of 241 Oryza sativa L. japonica accessions genotyped with 20,255 SNPs grown in control and mild salinity stress conditions was evaluated at the vegetative stage for eight morphological traits and ion mass fractions (Na and K). Weak to strong genotype-by-condition interactions were found for the traits considered. Cross-validation showed that the predictive ability of genomic prediction methods ranged from 0.25 to 0.64 for multi-environment models with morphological traits and from 0.05 to 0.40 for indices of stress response and ion mass fractions. The performances of a breeding population (BP) comprising 393 japonica accessions were predicted with models trained on the RP. For validation of the predictive performances of the models, a subset of 41 accessions was selected from the BP and phenotyped under the same experimental conditions as the RP. The predictive abilities estimated on this subset ranged from 0.00 to 0.66 for the multi-environment models, depending on the traits, and were strongly correlated with the predictive abilities on cross-validation in the RP in salt condition (r = 0.69). We show here that genomic selection is efficient for predicting the salt stress tolerance of breeding lines. Genomic selection could improve the efficiency of rice breeding strategies for salinity-prone environments.
Collapse
Affiliation(s)
- Jérôme Bartholomé
- UMR AGAP Institut, CIRAD, Cali, Colombia
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- Alliance Bioversity-CIAT, Recta Palmira Cali, Colombia
| | - Julien Frouin
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- CIRAD, UMR AGAP Institut, Montpellier, France
| | - Laurent Brottier
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- CIRAD, UMR AGAP Institut, Montpellier, France
| | - Tuong-Vi Cao
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- CIRAD, UMR AGAP Institut, Montpellier, France
| | | | - Nourollah Ahmadi
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- CIRAD, UMR AGAP Institut, Montpellier, France
| | - Brigitte Courtois
- UMR AGAP Institut, Institut Agro, Univ Montpellier, CIRAD, INRAE, Montpellier, France
- CIRAD, UMR AGAP Institut, Montpellier, France
| |
Collapse
|
16
|
Pereira de Castro A, Breseghello F, Furtini IV, Utumi MM, Pereira JA, Cao TV, Bartholomé J. Population improvement via recurrent selection drives genetic gain in upland rice breeding. Heredity (Edinb) 2023; 131:201-210. [PMID: 37407693 PMCID: PMC10462700 DOI: 10.1038/s41437-023-00636-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Revised: 06/16/2023] [Accepted: 06/19/2023] [Indexed: 07/07/2023] Open
Abstract
One of the main challenges of breeding programs is to identify superior genotypes from a large number of candidates. By gradually increasing the frequency of favorable alleles in the breeding population, recurrent selection improves the population mean for target traits, increasing the chance to identify promising genotypes. In rice, population improvement through recurrent selection has been used very little to date, except in Latin America. At Embrapa (Brazilian Agricultural Research Corporation), the upland rice breeding program is conducted in two phases: population improvement followed by product development. In this study, the CNA6 population, evaluated over five cycles (3 to 7) of selection, including 20 field trials, was used to assess the realized genetic gain. A high rate of genetic gain was observed for grain yield, at 215 kg.ha-1 per cycle or 67.8 kg.ha-1 per year (3.08%). The CNA6 population outperformed the controls only for the last cycle, with a yield difference of 1128 kg.ha-1. An analysis of the product development pipeline, based on 29 advanced yield trials with lines derived from cycles 3 to 6, showed that lines derived from the CNA6 population had high grain yield, but did not outperform the controls. These results demonstrate that the application of recurrent selection to a breeding population with sufficient genetic variability can result in significant genetic gains for quantitative traits, such as grain yield. The integration of this strategy into a two-phase breeding program also makes it possible to increase quantitative traits while selecting for other traits of interest.
Collapse
Affiliation(s)
| | | | | | | | | | - Tuong-Vi Cao
- AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France
- CIRAD, UMR AGAP Institut, F-34398, Montpellier, France
| | - Jérôme Bartholomé
- AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France
- CIRAD, UMR AGAP Institut, F-34398, Montpellier, France
- Alliance Bioversity-CIAT, Cali, Colombia
| |
Collapse
|
17
|
Sinha D, Maurya AK, Abdi G, Majeed M, Agarwal R, Mukherjee R, Ganguly S, Aziz R, Bhatia M, Majgaonkar A, Seal S, Das M, Banerjee S, Chowdhury S, Adeyemi SB, Chen JT. Integrated Genomic Selection for Accelerating Breeding Programs of Climate-Smart Cereals. Genes (Basel) 2023; 14:1484. [PMID: 37510388 PMCID: PMC10380062 DOI: 10.3390/genes14071484] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 07/14/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Rapidly rising population and climate changes are two critical issues that require immediate action to achieve sustainable development goals. The rising population is posing increased demand for food, thereby pushing for an acceleration in agricultural production. Furthermore, increased anthropogenic activities have resulted in environmental pollution such as water pollution and soil degradation as well as alterations in the composition and concentration of environmental gases. These changes are affecting not only biodiversity loss but also affecting the physio-biochemical processes of crop plants, resulting in a stress-induced decline in crop yield. To overcome such problems and ensure the supply of food material, consistent efforts are being made to develop strategies and techniques to increase crop yield and to enhance tolerance toward climate-induced stress. Plant breeding evolved after domestication and initially remained dependent on phenotype-based selection for crop improvement. But it has grown through cytological and biochemical methods, and the newer contemporary methods are based on DNA-marker-based strategies that help in the selection of agronomically useful traits. These are now supported by high-end molecular biology tools like PCR, high-throughput genotyping and phenotyping, data from crop morpho-physiology, statistical tools, bioinformatics, and machine learning. After establishing its worth in animal breeding, genomic selection (GS), an improved variant of marker-assisted selection (MAS), has made its way into crop-breeding programs as a powerful selection tool. To develop novel breeding programs as well as innovative marker-based models for genetic evaluation, GS makes use of molecular genetic markers. GS can amend complex traits like yield as well as shorten the breeding period, making it advantageous over pedigree breeding and marker-assisted selection (MAS). It reduces the time and resources that are required for plant breeding while allowing for an increased genetic gain of complex attributes. It has been taken to new heights by integrating innovative and advanced technologies such as speed breeding, machine learning, and environmental/weather data to further harness the GS potential, an approach known as integrated genomic selection (IGS). This review highlights the IGS strategies, procedures, integrated approaches, and associated emerging issues, with a special emphasis on cereal crops. In this domain, efforts have been taken to highlight the potential of this cutting-edge innovation to develop climate-smart crops that can endure abiotic stresses with the motive of keeping production and quality at par with the global food demand.
Collapse
Affiliation(s)
- Dwaipayan Sinha
- Department of Botany, Government General Degree College, Mohanpur 721436, India
| | - Arun Kumar Maurya
- Department of Botany, Multanimal Modi College, Modinagar, Ghaziabad 201204, India
| | - Gholamreza Abdi
- Department of Biotechnology, Persian Gulf Research Institute, Persian Gulf University, Bushehr 75169, Iran
| | - Muhammad Majeed
- Department of Botany, University of Gujrat, Punjab 50700, Pakistan
| | - Rachna Agarwal
- Applied Genomics Section, Bhabha Atomic Research Centre, Mumbai 400085, India
| | - Rashmi Mukherjee
- Research Center for Natural and Applied Sciences, Department of Botany (UG & PG), Raja Narendralal Khan Women's College, Gope Palace, Midnapur 721102, India
| | - Sharmistha Ganguly
- Department of Dravyaguna, Institute of Post Graduate Ayurvedic Education and Research, Kolkata 700009, India
| | - Robina Aziz
- Department of Botany, Government, College Women University, Sialkot 51310, Pakistan
| | - Manika Bhatia
- TERI School of Advanced Studies, New Delhi 110070, India
| | - Aqsa Majgaonkar
- Department of Botany, St. Xavier's College (Autonomous), Mumbai 400001, India
| | - Sanchita Seal
- Department of Botany, Polba Mahavidyalaya, Polba 712148, India
| | - Moumita Das
- V. Sivaram Research Foundation, Bangalore 560040, India
| | - Swastika Banerjee
- Department of Botany, Kairali College of +3 Science, Champua, Keonjhar 758041, India
| | - Shahana Chowdhury
- Department of Biotechnology, Faculty of Engineering Sciences, German University Bangladesh, TNT Road, Telipara, Chandona Chowrasta, Gazipur 1702, Bangladesh
| | - Sherif Babatunde Adeyemi
- Ethnobotany/Phytomedicine Laboratory, Department of Plant Biology, Faculty of Life Sciences, University of Ilorin, Ilorin P.M.B 1515, Nigeria
| | - Jen-Tsung Chen
- Department of Life Sciences, National University of Kaohsiung, Kaohsiung 811, Taiwan
| |
Collapse
|
18
|
Krishna TPA, Veeramuthu D, Maharajan T, Soosaimanickam M. The Era of Plant Breeding: Conventional Breeding to Genomics-assisted Breeding for Crop Improvement. Curr Genomics 2023; 24:24-35. [PMID: 37920729 PMCID: PMC10334699 DOI: 10.2174/1389202924666230517115912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Revised: 03/31/2023] [Accepted: 04/14/2023] [Indexed: 11/04/2023] Open
Abstract
Plant breeding has made a significant contribution to increasing agricultural production. Conventional breeding based on phenotypic selection is not effective for crop improvement. Because phenotype is considerably influenced by environmental factors, which will affect the selection of breeding materials for crop improvement. The past two decades have seen tremendous progress in plant breeding research. Especially the availability of high-throughput molecular markers followed by genomic-assisted approaches significantly contributed to advancing plant breeding. Integration of speed breeding with genomic and phenomic facilities allowed rapid quantitative trait loci (QTL)/gene identifications and ultimately accelerated crop improvement programs. The advances in sequencing technology helps to understand the genome organization of many crops and helped with genomic selection in crop breeding. Plant breeding has gradually changed from phenotype-to-genotype-based to genotype-to-phenotype-based selection. High-throughput phenomic platforms have played a significant role in the modern breeding program and are considered an essential part of precision breeding. In this review, we discuss the rapid advance in plant breeding technology for efficient crop improvements and provide details on various approaches/platforms that are helpful for crop improvement. This review will help researchers understand the recent developments in crop breeding and improvements.
Collapse
Affiliation(s)
| | - Duraipandiyan Veeramuthu
- Division of Plant Biotechnology, Entomology Research Institute, Loyola College, Chennai, Tamil Nadu, India
| | - Theivanayagam Maharajan
- Division of Plant Biotechnology, Entomology Research Institute, Loyola College, Chennai, Tamil Nadu, India
| | - Mariapackiam Soosaimanickam
- Division of Plant Biotechnology, Entomology Research Institute, Loyola College, Chennai, Tamil Nadu, India
- Department of Advanced Zoology & Biotechnology, Loyola College, Nungambakkam, Chennai, 600034, India
| |
Collapse
|
19
|
He S, Liang S, Meng L, Cao L, Ye G. Sparse Phenotyping and Haplotype-Based Models for Genomic Prediction in Rice. RICE (NEW YORK, N.Y.) 2023; 16:27. [PMID: 37284992 DOI: 10.1186/s12284-023-00643-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 05/20/2023] [Indexed: 06/08/2023]
Abstract
The multi-environment genomic selection enables plant breeders to select varieties resilient to diverse environments or particularly adapted to specific environments, which holds a great potential to be used in rice breeding. To realize the multi-environment genomic selection, a robust training set with multi-environment phenotypic data is of necessity. Considering the huge potential of genomic prediction enhanced sparse phenotyping on the cost saving of multi-environment trials (MET), the establishment of a multi-environment training set could also benefit from it. Optimizing the genomic prediction methods is also crucial to enhance the multi-environment genomic selection. Using haplotype-based genomic prediction models is able to capture local epistatic effects which could be conserved and accumulated across generations much like additive effects thereby benefitting breeding. However, previous studies often used fixed length haplotypes composed by a few adjacent molecular markers disregarding the linkage disequilibrium (LD) which is of essential role in determining the haplotype length. In our study, based on three rice populations with different sizes and compositions, we investigated the usefulness and effectiveness of multi-environment training sets with varying phenotyping intensities and different haplotype-based genomic prediction models based on LD-derived haplotype blocks for two agronomic traits, i.e., days to heading (DTH) and plant height (PH). Results showed that phenotyping merely 30% records in multi-environment training set is able to provide a comparable prediction accuracy to high phenotyping intensities; the local epistatic effects are much likely existent in DTH; dividing the LD-derived haplotype blocks into small segments with two or three single nucleotide polymorphisms (SNPs) helps to maintain the predictive ability of haplotype-based models in large populations; modelling the covariances between environments improves genomic prediction accuracy. Our study provides means to improve the efficiency of multi-environment genomic selection in rice.
Collapse
Affiliation(s)
- Sang He
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
| | - Shanshan Liang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin, 300387, China
| | - Lijun Meng
- Kunpeng Institute of Modern Agriculture at Foshan, Foshan, 528200, China
| | - Liyong Cao
- Key Laboratory for Zhejiang Super Rice Research, China National Rice Research Institute, Hangzhou, 310006, China.
| | - Guoyou Ye
- CAAS-IRRI Joint Laboratory for Genomics-Assisted Germplasm Enhancement, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China.
- Rice Breeding Innovations Platform, International Rice Research Institute, Metro Manila, Philippines.
| |
Collapse
|
20
|
Wu PY, Stich B, Renner J, Muders K, Prigge V, van Inghelandt D. Optimal implementation of genomic selection in clone breeding programs-Exemplified in potato: I. Effect of selection strategy, implementation stage, and selection intensity on short-term genetic gain. THE PLANT GENOME 2023:e20327. [PMID: 37177848 DOI: 10.1002/tpg2.20327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 03/01/2023] [Accepted: 03/03/2023] [Indexed: 05/15/2023]
Abstract
Genomic selection (GS) is used in many animal and plant breeding programs to enhance genetic gain for complex traits. However, its optimal integration in clone breeding programs, for example potato, that up to now relied on phenotypic selection (PS) requires further research. In this study, we performed computer simulations based on an empirical genomic dataset of tetraploid potato to (i) investigate under a fixed budget how the weight of GS relative to PS, the stage of implementing GS, the correlation between an auxiliary trait and the target trait, the variance components, and the prediction accuracy affect the genetic gain of the target trait, (ii) determine the optimal allocation of resources maximizing the genetic gain of the target trait, and (iii) make recommendations to breeders how to implement GS in clone and especially potato breeding programs. In our simulation results, any selection strategy involving GS had a higher short-term genetic gain for the target trait than Standard-PS. In addition, we showed that implementing GS in consecutive selection stages can largely enhance short-term genetic gain and recommend the breeders to implement GS at single hills and A clone stages. Furthermore, we observed for selection strategies involving GS that the optimal allocation of resources maximizing the genetic gain of the target trait differed considerably from those typically used in potato breeding programs and, thus, require the adjustment of the selection and phenotyping intensities. The trends are described in our study. Therefore, our study provides new insight for breeders regarding how to optimally implement GS in a commercial potato breeding program to improve the short-term genetic gain for their target trait.
Collapse
Affiliation(s)
- Po-Ya Wu
- Institute of Quantitative Genetics and Genomics of Plants, Heinrich Heine University, Düsseldorf, Germany
| | - Benjamin Stich
- Institute of Quantitative Genetics and Genomics of Plants, Heinrich Heine University, Düsseldorf, Germany
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University, Düsseldorf, Germany
- Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Juliane Renner
- Böhm-Nordkartoffel Agrarproduktion GmbH & Co. OHG, Hohenmocker, Germany
| | | | | | - Delphine van Inghelandt
- Institute of Quantitative Genetics and Genomics of Plants, Heinrich Heine University, Düsseldorf, Germany
| |
Collapse
|
21
|
Anilkumar C, Muhammed Azharudheen TP, Sah RP, Sunitha NC, Devanna BN, Marndi BC, Patra BC. Gene based markers improve precision of genome-wide association studies and accuracy of genomic predictions in rice breeding. Heredity (Edinb) 2023; 130:335-345. [PMID: 36792661 PMCID: PMC10163052 DOI: 10.1038/s41437-023-00599-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 02/17/2023] Open
Abstract
It is hypothesized that the genome-wide genic markers may increase the prediction accuracy of genomic selection for quantitative traits. To test this hypothesis, a set of candidate gene-based markers for yield and grain traits-related genes cloned across the rice genome were custom-designed. A multi-model, multi-locus genome-wide association study (GWAS) was performed using new genic markers developed to test their effectiveness for gene discovery. Two multi-locus models, FarmCPU and mrMLM, along with a single-locus mixed linear model (MLM), identified 28 significant marker-trait associations. These associations revealed novel causative alleles for grain weight and pleiotropic associations with other traits. For instance, the marker YD91 derived from the gene OsAAP3 on chromosome 1 was consistently associated with grain weight, while the gene has a significant effect on grain yield. Furthermore, nine genomic selection methods, including regression-based and machine learning-based models, were used to predict grain weight using a leave-one-out five-fold cross-validation approach to optimize the genomic selection model with genic markers. Among nine prediction models, Kernel Hilbert Space Regression (RKHS) is the best among regression-based models, and Random Forest Regression (RFR) is the best among machine learning-based models. Genomic prediction accuracies with and without GWAS significant markers were compared to assess the effectiveness of markers. The rapid decreases in prediction accuracy upon dropping GWAS significant markers indicate the effectiveness of new genic markers in genomic selection. Apart from that, the candidate gene-based markers were found to be more effective in genomic selection programs for better accuracy.
Collapse
|
22
|
Maulana F, Perumal R, Serba DD, Tesso T. Genomic prediction of hybrid performance in grain sorghum ( Sorghum bicolor L.). FRONTIERS IN PLANT SCIENCE 2023; 14:1139896. [PMID: 37180401 PMCID: PMC10167770 DOI: 10.3389/fpls.2023.1139896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Accepted: 03/22/2023] [Indexed: 05/16/2023]
Abstract
Genomic selection is expected to improve selection efficiency and genetic gain in breeding programs. The objective of this study was to assess the efficacy of predicting the performance of grain sorghum hybrids using genomic information of parental genotypes. One hundred and two public sorghum inbred parents were genotyped using genotyping-by-sequencing. Ninty-nine of the inbreds were crossed to three tester female parents generating a total of 204 hybrids for evaluation at two environments. The hybrids were sorted in to three sets of 77,59 and 68 and evaluated along with two commercial checks using a randomized complete block design in three replications. The sequence analysis generated 66,265 SNP markers that were used to predict the performance of 204 F1 hybrids resulted from crosses between the parents. Both additive (partial model) and additive and dominance (full model) were constructed and tested using various training population (TP) sizes and cross-validation procedures. Increasing TP size from 41 to 163 increased prediction accuracies for all traits. With the partial model, the five-fold cross validated prediction accuracies ranged from 0.03 for thousand kernel weight (TKW) to 0.58 for grain yield (GY) while it ranged from 0.06 for TKW to 0.67 for GY with the full model. The results suggest that genomic prediction could become an effective tool for predicting the performance of sorghum hybrids based on parental genotypes.
Collapse
Affiliation(s)
- Frank Maulana
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| | - Ramasamy Perumal
- Kansas State University, Agricultural Research Center, Hays, KS, United States
| | - Desalegn D. Serba
- United States Department of Agriculture-Agricultural Research Service (USDA-ARS), U.S. Arid Land Agricultural Research Center, Maricopa, AZ, United States
| | - Tesfaye Tesso
- Department of Agronomy, Kansas State University, Manhattan, KS, United States
| |
Collapse
|
23
|
Zhou X, Xiang X, Zhang M, Cao D, Du C, Zhang L, Hu J. Combining GS-assisted GWAS and transcriptome analysis to mine candidate genes for nitrogen utilization efficiency in Populus cathayana. BMC PLANT BIOLOGY 2023; 23:182. [PMID: 37020197 PMCID: PMC10074878 DOI: 10.1186/s12870-023-04202-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Accepted: 03/29/2023] [Indexed: 06/19/2023]
Abstract
BACKGROUND Forest trees such as poplar, shrub willow, et al. are essential natural resources for sustainable and renewable energy production, and their wood can reduce dependence on fossil fuels and reduce environmental pollution. However, the productivity of forest trees is often limited by the availability of nitrogen (N), improving nitrogen use efficiency (NUE) is an important way to address it. Currently, NUE genetic resources are scarce in forest tree research, and more genetic resources are urgently needed. RESULTS Here, we performed genome-wide association studies (GWAS) using the mixed linear model (MLM) to identify genetic loci regulating growth traits in Populus cathayana at two N levels, and attempted to enhance the signal strength of single nucleotide polymorphism (SNP) detection by performing genome selection (GS) assistance GWAS. The results of the two GWAS analyses identified 55 and 40 SNPs that were respectively associated with plant height (PH) and ground diameter (GD), and 92 and 69 candidate genes, including 30 overlapping genes. The prediction accuracy of the GS model (rrBLUP) for phenotype exceeds 0.9. Transcriptome analysis of 13 genotypes under two N levels showed that genes related to carbon and N metabolism, amino acid metabolism, energy metabolism, and signal transduction were differentially expressed in the xylem of P. cathayana under N treatment. Furthermore, we observed strong regional patterns in gene expression levels of P. cathayana, with significant differences between different regions. Among them, P. cathayana in Longquan region exhibited the highest response to N. Finally, through weighted gene co-expression network analysis (WGCNA), we identified a module closely related to the N metabolic process and eight hub genes. CONCLUSIONS Integrating the GWAS, RNA-seq and WGCNA data, we ultimately identified four key regulatory genes (PtrNAC123, PtrNAC025, Potri.002G233100, and Potri.006G236200) involved in the wood formation process, and they may affect P. cathayana growth and wood formation by regulating nitrogen metabolism. This study will provide strong evidence for N regulation mechanisms, and reliable genetic resources for growth and NUE genetic improvement in poplar.
Collapse
Affiliation(s)
- Xinglu Zhou
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Xiaodong Xiang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Min Zhang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Demei Cao
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Changjian Du
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China
| | - Lei Zhang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China.
| | - Jianjun Hu
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of National Forestry and Grassland Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, 100091, China.
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, 210037, Jiangsu, China.
| |
Collapse
|
24
|
Liang M, Cao S, Deng T, Du L, Li K, An B, Du Y, Xu L, Zhang L, Gao X, Li J, Guo P, Gao H. MAK: a machine learning framework improved genomic prediction via multi-target ensemble regressor chains and automatic selection of assistant traits. Brief Bioinform 2023; 24:7031157. [PMID: 36752363 DOI: 10.1093/bib/bbad043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 01/13/2023] [Accepted: 01/20/2023] [Indexed: 02/09/2023] Open
Abstract
Incorporating the genotypic and phenotypic of the correlated traits into the multi-trait model can significantly improve the prediction accuracy of the target trait in animal and plant breeding, as well as human genetics. However, in most cases, the phenotypic information of the correlated and target trait of the individual to be evaluated was null simultaneously, particularly for the newborn. Therefore, we propose a machine learning framework, MAK, to improve the prediction accuracy of the target trait by constructing the multi-target ensemble regression chains and selecting the assistant trait automatically, which predicted the genomic estimated breeding values of the target trait using genotypic information only. The prediction ability of MAK was significantly more robust than the genomic best linear unbiased prediction, BayesB, BayesRR and the multi trait Bayesian method in the four real animal and plant datasets, and the computational efficiency of MAK was roughly 100 times faster than BayesB and BayesRR.
Collapse
Affiliation(s)
- Mang Liang
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Sheng Cao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Tianyu Deng
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lili Du
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Keanning Li
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Bingxing An
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Yueying Du
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lingyang Xu
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lupei Zhang
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Xue Gao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Junya Li
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | | | - Huijiang Gao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| |
Collapse
|
25
|
Wu PY, Ou JH, Liao CT. Sample size determination for training set optimization in genomic prediction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:57. [PMID: 36912999 PMCID: PMC10011335 DOI: 10.1007/s00122-023-04254-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 11/07/2022] [Indexed: 06/18/2023]
Abstract
A practical approach is developed to determine a cost-effective optimal training set for selective phenotyping in a genomic prediction study. An R function is provided to facilitate the application of the approach. Genomic prediction (GP) is a statistical method used to select quantitative traits in animal or plant breeding. For this purpose, a statistical prediction model is first built that uses phenotypic and genotypic data in a training set. The trained model is then used to predict genomic estimated breeding values (GEBVs) for individuals within a breeding population. Setting the sample size of the training set usually takes into account time and space constraints that are inevitable in an agricultural experiment. However, the determination of the sample size remains an unresolved issue for a GP study. By applying the logistic growth curve to identify prediction accuracy for the GEBVs and the training set size, a practical approach was developed to determine a cost-effective optimal training set for a given genome dataset with known genotypic data. Three real genome datasets were used to illustrate the proposed approach. An R function is provided to facilitate widespread application of this approach to sample size determination, which can help breeders to identify a set of genotypes with an economical sample size for selective phenotyping.
Collapse
Affiliation(s)
- Po-Ya Wu
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
- Institute for Quantitative Genetics and Genomics of Plants, Heinrich Heine University, Düsseldorf, Germany
| | - Jen-Hsiang Ou
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Chen-Tuo Liao
- Department of Agronomy, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
26
|
Fernández-González J, Akdemir D, Isidro Y Sánchez J. A comparison of methods for training population optimization in genomic selection. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:30. [PMID: 36892603 PMCID: PMC9998580 DOI: 10.1007/s00122-023-04265-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 11/21/2022] [Indexed: 06/18/2023]
Abstract
Maximizing CDmean and Avg_GRM_self were the best criteria for training set optimization. A training set size of 50-55% (targeted) or 65-85% (untargeted) is needed to obtain 95% of the accuracy. With the advent of genomic selection (GS) as a widespread breeding tool, mechanisms to efficiently design an optimal training set for GS models became more relevant, since they allow maximizing the accuracy while minimizing the phenotyping costs. The literature described many training set optimization methods, but there is a lack of a comprehensive comparison among them. This work aimed to provide an extensive benchmark among optimization methods and optimal training set size by testing a wide range of them in seven datasets, six different species, different genetic architectures, population structure, heritabilities, and with several GS models to provide some guidelines about their application in breeding programs. Our results showed that targeted optimization (uses information from the test set) performed better than untargeted (does not use test set data), especially when heritability was low. The mean coefficient of determination was the best targeted method, although it was computationally intensive. Minimizing the average relationship within the training set was the best strategy for untargeted optimization. Regarding the optimal training set size, maximum accuracy was obtained when the training set was the entire candidate set. Nevertheless, a 50-55% of the candidate set was enough to reach 95-100% of the maximum accuracy in the targeted scenario, while we needed a 65-85% for untargeted optimization. Our results also suggested that a diverse training set makes GS robust against population structure, while including clustering information was less effective. The choice of the GS model did not have a significant influence on the prediction accuracies.
Collapse
Affiliation(s)
- Javier Fernández-González
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| | - Deniz Akdemir
- CIBMTR (Center for International Blood and Marrow Transplant Research), National Marrow Donor Program/Be The Match, Minneapolis, USA
| | - Julio Isidro Y Sánchez
- Centro de Biotecnologia y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnologia Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223, Madrid, Spain.
| |
Collapse
|
27
|
Application of SolCAP Genotyping in Potato (Solanum tuberosum L.) Association Mapping. Methods Mol Biol 2023; 2638:415-435. [PMID: 36781660 DOI: 10.1007/978-1-0716-3024-2_30] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Potato variety development entails a number of breeding steps, as well as testing and, finally, commercialization. Historically, phenotypic assesment were carried out to select and germplasm development. The US Department of Agriculture (USDA) funded the Solanaceae Coordinated Agricultural Project (SolCAP) to decode genomic resources into tools that breeders and geneticists can use. This project resulted in the creation of a genome-wide single-nucleotide polymorphism (SNP) array that can be used to evaluate elite potato-breeding germplasm. This array was used to genotype a diverse panel of Solanum species, as well as numerous biparental, diploid, and tetraploid populations. It has high marker density to generate genetic maps that can be used to identify numerous quantitative trait loci (QTLs) for agronomic, quality, biotic, and abiotic resistance traits. Up to now, numerous QTLs for important traits have been identified using new diploid and tetraploid genetic maps. SNP markers were used to assess germplasm relationships and fingerprint varieties and identify candidate genes. The Infinium 8303 SolCAP Potato array offers a common set of SNP markers that can be used for mapping, germplasm assessment, and fingerprinting with confidence. This array has also been helpful in furthering our understanding of the potato genome. Furthermore, some other Infinium potato arrays (i.e., 12 K, 20 K, and 25 K) have been genotyped, and breeders can map quantitative trait loci (QTLs) across multiple populations to improve our understanding of economically important traits and lead to marker-assisted selection (MAS) and breeding and, ultimately, improved varieties.
Collapse
|
28
|
Nguyen VH, Morantte RIZ, Lopena V, Verdeprado H, Murori R, Ndayiragije A, Katiyar SK, Islam MR, Juma RU, Flandez-Galvez H, Glaszmann JC, Cobb JN, Bartholomé J. Multi-environment Genomic Selection in Rice Elite Breeding Lines. RICE (NEW YORK, N.Y.) 2023; 16:7. [PMID: 36752880 PMCID: PMC9908796 DOI: 10.1186/s12284-023-00623-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 01/31/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND Assessing the performance of elite lines in target environments is essential for breeding programs to select the most relevant genotypes. One of the main complexities in this task resides in accounting for the genotype by environment interactions. Genomic prediction models that integrate information from multi-environment trials and environmental covariates can be efficient tools in this context. The objective of this study was to assess the predictive ability of different genomic prediction models to optimize the use of multi-environment information. We used 111 elite breeding lines representing the diversity of the international rice research institute breeding program for irrigated ecosystems. The lines were evaluated for three traits (days to flowering, plant height, and grain yield) in 15 environments in Asia and Africa and genotyped with 882 SNP markers. We evaluated the efficiency of genomic prediction to predict untested environments using seven multi-environment models and three cross-validation scenarios. RESULTS The elite lines were found to belong to the indica group and more specifically the indica-1B subgroup which gathered improved material originating from the Green Revolution. Phenotypic correlations between environments were high for days to flowering and plant height (33% and 54% of pairwise correlation greater than 0.5) but low for grain yield (lower than 0.2 in most cases). Clustering analyses based on environmental covariates separated Asia's and Africa's environments into different clusters or subclusters. The predictive abilities ranged from 0.06 to 0.79 for days to flowering, 0.25-0.88 for plant height, and - 0.29-0.62 for grain yield. We found that models integrating genotype-by-environment interaction effects did not perform significantly better than models integrating only main effects (genotypes and environment or environmental covariates). The different cross-validation scenarios showed that, in most cases, the use of all available environments gave better results than a subset. CONCLUSION Multi-environment genomic prediction models with main effects were sufficient for accurate phenotypic prediction of elite lines in targeted environments. These results will help refine the testing strategy to update the genomic prediction models to improve predictive ability.
Collapse
Affiliation(s)
- Van Hieu Nguyen
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
- Institute of Crop Science, College of Agriculture and Food Science, University of the Philippines, Los Baños, Laguna, Philippines
| | - Rose Imee Zhella Morantte
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Vitaliano Lopena
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Holden Verdeprado
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Rosemary Murori
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Alexis Ndayiragije
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Sanjay Kumar Katiyar
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Md Rafiqul Islam
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Roselyne Uside Juma
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
| | - Hayde Flandez-Galvez
- Institute of Crop Science, College of Agriculture and Food Science, University of the Philippines, Los Baños, Laguna, Philippines
| | - Jean-Christophe Glaszmann
- CIRAD, UMR AGAP Institut, 34398, Montpellier, France
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Joshua N Cobb
- Rice Breeding Innovation Platform, International Rice Research Institute, DAPO, Box7777, Metro Manila, Philippines
- RiceTec. Inc, PO Box 1305, Alvin, TX, 77512, USA
| | - Jérôme Bartholomé
- UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France.
- CIRAD, UMR AGAP Institut, Cali, Colombia.
- Alliance Bioversity-CIAT, Cali, Colombia.
| |
Collapse
|
29
|
Jeon D, Kang Y, Lee S, Choi S, Sung Y, Lee TH, Kim C. Digitalizing breeding in plants: A new trend of next-generation breeding based on genomic prediction. FRONTIERS IN PLANT SCIENCE 2023; 14:1092584. [PMID: 36743488 PMCID: PMC9892199 DOI: 10.3389/fpls.2023.1092584] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 01/05/2023] [Indexed: 06/18/2023]
Abstract
As the world's population grows and food needs diversification, the demand for cereals and horticultural crops with beneficial traits increases. In order to meet a variety of demands, suitable cultivars and innovative breeding methods need to be developed. Breeding methods have changed over time following the advance of genetics. With the advent of new sequencing technology in the early 21st century, predictive breeding, such as genomic selection (GS), emerged when large-scale genomic information became available. GS shows good predictive ability for the selection of individuals with traits of interest even for quantitative traits by using various types of the whole genome-scanning markers, breaking away from the limitations of marker-assisted selection (MAS). In the current review, we briefly describe the history of breeding techniques, each breeding method, various statistical models applied to GS and methods to increase the GS efficiency. Consequently, we intend to propose and define the term digital breeding through this review article. Digital breeding is to develop a predictive breeding methods such as GS at a higher level, aiming to minimize human intervention by automatically proceeding breeding design, propagating breeding populations, and to make selections in consideration of various environments, climates, and topography during the breeding process. We also classified the phases of digital breeding based on the technologies and methods applied to each phase. This review paper will provide an understanding and a direction for the final evolution of plant breeding in the future.
Collapse
Affiliation(s)
- Donghyun Jeon
- Plant Computational Genomics Laboratory, Department of Science in Smart Agriculture Systems, Chungnam National University, Daejeon, Republic of Korea
| | - Yuna Kang
- Plant Computational Genomics Laboratory, Department of Crop Science, Chungnam National University, Daejeon, Republic of Korea
| | - Solji Lee
- Plant Computational Genomics Laboratory, Department of Crop Science, Chungnam National University, Daejeon, Republic of Korea
| | - Sehyun Choi
- Plant Computational Genomics Laboratory, Department of Crop Science, Chungnam National University, Daejeon, Republic of Korea
| | - Yeonjun Sung
- Plant Computational Genomics Laboratory, Department of Science in Smart Agriculture Systems, Chungnam National University, Daejeon, Republic of Korea
| | - Tae-Ho Lee
- Genomics Division, National Institute of Agricultural Sciences, Jeonju, Republic of Korea
| | - Changsoo Kim
- Plant Computational Genomics Laboratory, Department of Science in Smart Agriculture Systems, Chungnam National University, Daejeon, Republic of Korea
- Plant Computational Genomics Laboratory, Department of Crop Science, Chungnam National University, Daejeon, Republic of Korea
| |
Collapse
|
30
|
Semalaiyappan J, Selvanayagam S, Rathore A, Gupta SK, Chakraborty A, Gujjula KR, Haktan S, Viswanath A, Malipatil R, Shah P, Govindaraj M, Ignacio JC, Reddy S, Singh AK, Thirunavukkarasu N. Development of a new AgriSeq 4K mid-density SNP genotyping panel and its utility in pearl millet breeding. FRONTIERS IN PLANT SCIENCE 2023; 13:1068883. [PMID: 36704175 PMCID: PMC9871632 DOI: 10.3389/fpls.2022.1068883] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 12/15/2022] [Indexed: 06/18/2023]
Abstract
Pearl millet is a crucial nutrient-rich staple food in Asia and Africa and adapted to the climate of semi-arid topics. Since the genomic resources in pearl millet are very limited, we have developed a brand-new mid-density 4K SNP panel and demonstrated its utility in genetic studies. A set of 4K SNPs were mined from 925 whole-genome sequences through a comprehensive in-silico pipeline. Three hundred and seventy-three genetically diverse pearl millet inbreds were genotyped using the newly-developed 4K SNPs through the AgriSeq Targeted Genotyping by Sequencing technology. The 4K SNPs were uniformly distributed across the pearl millet genome and showed considerable polymorphism information content (0.23), genetic diversity (0.29), expected heterozygosity (0.29), and observed heterozygosity (0.03). The SNP panel successfully differentiated the accessions into two major groups, namely B and R lines, through genetic diversity, PCA, and structure models as per their pedigree. The linkage disequilibrium (LD) analysis showed Chr3 had higher LD regions while Chr1 and Chr2 had more low LD regions. The genetic divergence between the B- and R-line populations was 13%, and within the sub-population variability was 87%. In this experiment, we have mined 4K SNPs and optimized the genotyping protocol through AgriSeq technology for routine use, which is cost-effective, fast, and highly reproducible. The newly developed 4K mid-density SNP panel will be useful in genomics and molecular breeding experiments such as assessing the genetic diversity, trait mapping, backcross breeding, and genomic selection in pearl millet.
Collapse
Affiliation(s)
- Janani Semalaiyappan
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | - Sivasubramani Selvanayagam
- Accelerated Crop Improvement, International Crop Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Abhishek Rathore
- Excellence in Breeding (EiB) Platform, The International Maize and Wheat Improvement Center (CIMMYT), El Batán, Mexico
| | - SK. Gupta
- Accelerated Crop Improvement, International Crop Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Animikha Chakraborty
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | | | - Suren Haktan
- Bioinformatics, Thermo Fisher Scientific, Austin, TX, United States
| | - Aswini Viswanath
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | - Renuka Malipatil
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | - Priya Shah
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | | | - John Carlos Ignacio
- Department of Horticulture and Crop Science, The Ohio State University, Wooster, OH, United States
| | - Sanjana Reddy
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| | | | - Nepolean Thirunavukkarasu
- Genomics and Molecular Breeding Lab, ICAR-Indian Institute of Millets Research, Rajendranagar, India
| |
Collapse
|
31
|
Bhandari P, Kim J, Lee TG. Genetic architecture of fresh-market tomato yield. BMC PLANT BIOLOGY 2023; 23:18. [PMID: 36624387 PMCID: PMC9827693 DOI: 10.1186/s12870-022-04018-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 12/22/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND The fresh-market tomato (Solanum lycopersicum) is bred for direct consumption and is selected for a high yield of large fruits. To understand the genetic variations (distinct types of DNA sequence polymorphism) that influence the yield, we collected the phenotypic variations in the yields of total fruit, extra-large-sized fruit, small-sized fruit, or red-colored fruit from 68 core inbred contemporary U.S. fresh-market tomatoes for three consecutive years and the genomic information in 8,289,741 single nucleotide polymorphism (SNP) positions from the whole-genome resequencing of these tomatoes. RESULTS Genome-wide association (GWA) mapping using the SNP data with or without SNP filtering steps using the regularization methods, validated with quantitative trait loci (QTL) linkage mapping, identified 18 significant association signals for traits evaluated. Among them, 10 of which were not located within genomic regions previously identified as being associated with fruit size/shape. When mapping-driven association signals [558 SNPs associated with 28 yield (component) traits] were used to calculate genomic estimated breeding values (GEBVs) of evaluated traits, the prediction accuracies of the extra-large-sized fruit and small-sized fruit yields were higher than those of the total and red-colored fruit yields, as we tested the generated breeding values in inbred tomatoes and F2 populations. Improved accuracy in GEBV calculation of evaluated traits was achieved by using 364 SNPs identified using the regularization methods. CONCLUSIONS Together, these results provide an understanding of the genetic variations underlying the heritable phenotypic variability in yield in contemporary tomato breeding and the information necessary for improving such economically important and complex quantitative trait through breeding.
Collapse
Affiliation(s)
- Prashant Bhandari
- Horticultural Sciences Department, University of Florida, Gainesville, FL, 32611, USA
| | - Juhee Kim
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL, 33598, USA
| | - Tong Geon Lee
- Horticultural Sciences Department, University of Florida, Gainesville, FL, 32611, USA.
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL, 33598, USA.
- Plant Breeders Working Group, University of Florida, Gainesville, FL, 32611, USA.
- Plant Molecular and Cellular Biology Graduate Program, University of Florida, Gainesville, FL, 32611, USA.
- Bayer, Chesterfield, MO, 63017, USA.
| |
Collapse
|
32
|
Biswas PS, Ahmed MME, Afrin W, Rahman A, Shalahuddin AKM, Islam R, Akter F, Syed MA, Sarker MRA, Ifterkharuddaula KM, Islam MR. Enhancing genetic gain through the application of genomic selection in developing irrigated rice for the favorable ecosystem in Bangladesh. Front Genet 2023; 14:1083221. [PMID: 36911402 PMCID: PMC9992429 DOI: 10.3389/fgene.2023.1083221] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 01/18/2023] [Indexed: 02/24/2023] Open
Abstract
Increasing selection differential and decreasing cycle time, the rate of genetic improvement can be accelerated. Creating and capturing higher genetic with higher accuracy within the shortest possible time is the prerequisite for enhancing genetic gain for any trait. Comprehensive yield testing at multi-locations at early generations together with the shortest line fixation time can expedite the rapid recycling of parents in the breeding program through recurrent selection. Genomic selection is efficient in capturing high breeding value individuals taking additive genetic effects of all genes into account with and without extensive field testing, thus reducing breeding cycle time enhances genetic gain. In the Bangladesh Rice Research Institute, GS technology together with the trait-specific marker-assisted selection at the early generation of RGA-derived breeding lines showed a prediction accuracy of 0.454-0.701 with 0.989-2.623 relative efficiency over the four consecutive years of exercise. This study reports that the application of GS together with trait-specific MAS has expedited the yield improvement by 117 kg ha-1·year-1, which is around seven-fold larger than the baseline annual genetic gain and shortened the breeding cycle by around 1.5 years from the existing 4.5 years.
Collapse
Affiliation(s)
- Partha S Biswas
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - M M Emam Ahmed
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Wazifa Afrin
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Anisar Rahman
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - A K M Shalahuddin
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Rafiqul Islam
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Fahamida Akter
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Md Abu Syed
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - Md Ruhul Amin Sarker
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | - K M Ifterkharuddaula
- Plant Breeding Division, Bangladesh Rice Research Institute, Gazipur, Bangladesh
| | | |
Collapse
|
33
|
Rabieyan E, Bihamta MR, Moghaddam ME, Mohammadi V, Alipour H. Genome-wide association mapping and genomic prediction of agronomical traits and breeding values in Iranian wheat under rain-fed and well-watered conditions. BMC Genomics 2022; 23:831. [PMID: 36522726 PMCID: PMC9753272 DOI: 10.1186/s12864-022-08968-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 10/26/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The markers detected by genome-wide association study (GWAS) make it possible to dissect genetic structure and diversity at many loci. This can enable a wheat breeder to reveal and used genomic loci controlling drought tolerance. This study was focused on determining the population structure of Iranian 208 wheat landraces and 90 cultivars via genotyping-by-sequencing (GBS) and also on detecting marker-trait associations (MTAs) by GWAS and genomic prediction (GS) of wheat agronomic traits for drought-tolerance breeding. GWASs were conducted using both the original phenotypes (pGWAS) and estimated breeding values (eGWAS). The bayesian ridge regression (BRR), genomic best linear unbiased prediction (gBLUP), and ridge regression-best linear unbiased prediction (rrBLUP) approaches were used to estimate breeding values and estimate prediction accuracies in genomic selection. RESULTS Population structure analysis using 2,174,975 SNPs revealed four genetically distinct sub-populations from wheat accessions. D-Genome harbored the lowest number of significant marker pairs and the highest linkage disequilibrium (LD), reflecting different evolutionary histories of wheat genomes. From pGWAS, BRR, gBLUP, and rrBLUP, 284, 363, 359 and 295 significant MTAs were found under normal and 195, 365, 362 and 302 under stress conditions, respectively. The gBLUP with the most similarity (80.98 and 71.28% in well-watered and rain-fed environments, correspondingly) with the pGWAS method in the terms of discovered significant SNPs, suggesting the potential of gBLUP in uncovering SNPs. Results from gene ontology revealed that 29 and 30 SNPs in the imputed dataset were located in protein-coding regions for well-watered and rain-fed conditions, respectively. gBLUP model revealed genetic effects better than other models, suggesting a suitable tool for genome selection in wheat. CONCLUSION We illustrate that Iranian landraces of bread wheat contain novel alleles that are adaptive to drought stress environments. gBLUP model can be helpful for fine mapping and cloning of the relevant QTLs and genes, and for carrying out trait introgression and marker-assisted selection in both normal and drought environments in wheat collections.
Collapse
Affiliation(s)
- Ehsan Rabieyan
- grid.46072.370000 0004 0612 7950Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran
| | - Mohammad Reza Bihamta
- grid.46072.370000 0004 0612 7950Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran
| | | | - Valiollah Mohammadi
- grid.46072.370000 0004 0612 7950Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran
| | - Hadi Alipour
- grid.412763.50000 0004 0442 8645Department of Plant Production and Genetics, Faculty of Agriculture, Urmia University, Urmia, Iran
| |
Collapse
|
34
|
Sun B, Guo R, Liu Z, Shi X, Yang Q, Shi J, Zhang M, Yang C, Zhao S, Zhang J, He J, Zhang J, Su J, Song Q, Yan L. Genetic variation and marker-trait association affect the genomic selection prediction accuracy of soybean protein and oil content. FRONTIERS IN PLANT SCIENCE 2022; 13:1064623. [PMID: 36582644 PMCID: PMC9793221 DOI: 10.3389/fpls.2022.1064623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 11/10/2022] [Indexed: 06/17/2023]
Abstract
INTRODUCTION Genomic selection (GS) is a potential breeding approach for soybean improvement. METHODS In this study, GS was performed on soybean protein and oil content using the Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) based on 1,007 soybean accessions. The SoySNP50K SNP dataset of the accessions was obtained from the USDA-ARS, Beltsville, MD lab, and the protein and oil content of the accessions were obtained from GRIN. RESULTS Our results showed that the prediction accuracy of oil content was higher than that of protein content. When the training population size was 100, the prediction accuracies for protein content and oil content were 0.60 and 0.79, respectively. The prediction accuracy increased with the size of the training population. Training populations with similar phenotype or with close genetic relationships to the prediction population exhibited better prediction accuracy. A greatest prediction accuracy for both protein and oil content was observed when approximately 3,000 markers with -log10(P) greater than 1 were included. DISCUSSION This information will help improve GS efficiency and facilitate the application of GS.
Collapse
Affiliation(s)
- Bo Sun
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
- College of Life Sciences, Hebei Agricultural University, Baoding, China
| | - Rui Guo
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Zhi Liu
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Xiaolei Shi
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Qing Yang
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Jiayao Shi
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Mengchen Zhang
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Chunyan Yang
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Shugang Zhao
- College of Life Sciences, Hebei Agricultural University, Baoding, China
| | - Jie Zhang
- College of Life Sciences, Hebei Agricultural University, Baoding, China
| | - Jianhan He
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| | - Jiaoping Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement, National Center for Soybean Improvement, Nanjing, China
- Key Laboratory for Biology and Genetic Improvement of Soybean, (General, Ministry of Agriculture), Nanjing Agricultural University, Nanjing, China
| | - Jianhui Su
- Agricultural-Regionalization Workstation of Shijiazhuang’s Gaocheng District, Shijiazhuang, China
| | - Qijian Song
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD, United States
| | - Long Yan
- Institute of Cereal and Oil Crops, Hebei Academy of Agricultural and Forestry Sciences, Shijiazhuang Branch Center of National Center for Soybean Improvement, The Key Laboratory of Crop Genetics and Breeding, Shijiazhuang, China
| |
Collapse
|
35
|
Javid S, Bihamta MR, Omidi M, Abbasi AR, Alipour H, Ingvarsson PK. Genome-Wide Association Study (GWAS) and genome prediction of seedling salt tolerance in bread wheat (Triticum aestivum L.). BMC PLANT BIOLOGY 2022; 22:581. [PMID: 36513980 PMCID: PMC9746167 DOI: 10.1186/s12870-022-03936-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Accepted: 11/10/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND Salinity tolerance in wheat is imperative for improving crop genetic capacity in response to the expanding phenomenon of soil salinization. However, little is known about the genetic foundation underlying salinity tolerance at the seedling growth stage of wheat. Herein, a GWAS analysis was carried out by the random-SNP-effect mixed linear model (mrMLM) multi-locus model to uncover candidate genes responsible for salt tolerance at the seedling stage in 298 Iranian bread wheat accessions, including 208 landraces and 90 cultivars. RESULTS A total of 29 functional marker-trait associations (MTAs) were detected under salinity, 100 mM NaCl (sodium chloride). Of these, seven single nucleotide polymorphisms (SNPs) including rs54146, rs257, rs37983, rs18682, rs55629, rs15183, and rs63185 with R2 ≥ 10% were found to be linked with relative water content, root fresh weight, root dry weight, root volume, shoot high, proline, and shoot potassium (K+), respectively. Further, a total of 27 candidate genes were functionally annotated to be involved in response to the saline environment. Most of these genes have key roles in photosynthesis, response to abscisic acid, cell redox homeostasis, sucrose and carbohydrate metabolism, ubiquitination, transmembrane transport, chromatin silencing, and some genes harbored unknown functions that all together may respond to salinity as a complex network. For genomic prediction (GP), the genomic best linear unbiased prediction (GBLUP) model reflected genetic effects better than both bayesian ridge regression (BRR) and ridge regression-best linear unbiased prediction (RRBLUP), suggesting GBLUP as a favorable tool for wheat genomic selection. CONCLUSION The SNPs and candidate genes identified in the current work can be used potentially for developing salt-tolerant varieties at the seedling growth stage by marker-assisted selection.
Collapse
Affiliation(s)
- Saeideh Javid
- Department of Agronomy and Plant Breeding, University of Tehran, Karaj, Iran
| | | | - Mansour Omidi
- Department of Agronomy and Plant Breeding, University of Tehran, Karaj, Iran
| | - Ali Reza Abbasi
- Department of Agronomy and Plant Breeding, University of Tehran, Karaj, Iran
| | - Hadi Alipour
- Department of Plant Production and Genetics, Urmia University, Urmia, Iran
| | - Pär K Ingvarsson
- Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
36
|
Budhlakoti N, Mishra DC, Majumdar SG, Kumar A, Srivastava S, Rai SN, Rai A. Integrated model for genomic prediction under additive and non-additive genetic architecture. FRONTIERS IN PLANT SCIENCE 2022; 13:1027558. [PMID: 36531414 PMCID: PMC9749549 DOI: 10.3389/fpls.2022.1027558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 10/11/2022] [Indexed: 06/17/2023]
Abstract
Using data from genome-wide molecular markers, genomic selection procedures have proved useful for estimating breeding values and phenotypic prediction. The link between an individual genotype and phenotype has been modelled using a number of parametric methods to estimate individual breeding value. It has been observed that parametric methods perform satisfactorily only when the system under study has additive genetic architecture. To capture non-additive (dominance and epistasis) effects, nonparametric approaches have also been developed; however, they typically fall short of capturing additive effects. The idea behind this study is to select the most appropriate model from each parametric and nonparametric category and build an integrated model that can incorporate the best features of both models. It was observed from the results of the current study that GBLUP performed admirably under additive architecture, while SVM's performance in non-additive architecture was found to be encouraging. A robust model for genomic prediction has been developed in light of these findings, which can handle both additive and epistatic effects simultaneously by minimizing their error variance. The developed integrated model has been assessed using standard evaluation measures like predictive ability and error variance.
Collapse
Affiliation(s)
- Neeraj Budhlakoti
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Dwijesh Chandra Mishra
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Sayanti Guha Majumdar
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - Anuj Kumar
- Department of Microbiology and Immunology, Dalhousie University, Halifax, NS, Canada
| | - Sudhir Srivastava
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| | - S. N. Rai
- Bioinformatics and Biostatistics Department, University of Louisville, Louisville, KY, United States
| | - Anil Rai
- Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, New Delhi, India
| |
Collapse
|
37
|
Rai M, Tyagi W. Haplotype breeding for unlocking and utilizing plant genomics data. Front Genet 2022; 13:1006288. [DOI: 10.3389/fgene.2022.1006288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 10/28/2022] [Indexed: 11/17/2022] Open
|
38
|
Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022; 15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]
Abstract
The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; CIMMYT-China Tropical Maize Research Center, School of Food Science and Engineering, Foshan University, Foshan, Guangdong 528231, China; Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China.
| | - Xingping Zhang
- Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China
| | - Huihui Li
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan 572024, China
| | - Hongjian Zheng
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Jianan Zhang
- MolBreeding Biotechnology Co., Ltd., Shijiazhuang, Hebei 050035, China
| | - Michael S Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Rajeev K Varshney
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Australia
| | - Boddupalli M Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Qian Qian
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
39
|
Reyes VP, Kitony JK, Nishiuchi S, Makihara D, Doi K. Utilization of Genotyping-by-Sequencing (GBS) for Rice Pre-Breeding and Improvement: A Review. Life (Basel) 2022; 12:1752. [PMID: 36362909 PMCID: PMC9694628 DOI: 10.3390/life12111752] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 10/27/2022] [Accepted: 10/28/2022] [Indexed: 09/29/2023] Open
Abstract
Molecular markers play a crucial role in the improvement of rice. To benefit from these markers, genotyping is carried out to identify the differences at a specific position in the genome of individuals. The advances in sequencing technologies have led to the development of different genotyping techniques such as genotyping-by-sequencing. Unlike PCR-fragment-based genotyping, genotyping-by-sequencing has enabled the parallel sequencing and genotyping of hundreds of samples in a single run, making it more cost-effective. Currently, GBS is being used in several pre-breeding programs of rice to identify beneficial genes and QTL from different rice genetic resources. In this review, we present the current advances in the utilization of genotyping-by-sequencing for the development of rice pre-breeding materials and the improvement of existing rice cultivars. The challenges and perspectives of using this approach are also highlighted.
Collapse
Affiliation(s)
- Vincent Pamugas Reyes
- Graduate School of Bioagricultural Sciences, Nagoya University, Nagoya 464-8601, Japan
| | | | - Shunsaku Nishiuchi
- Graduate School of Bioagricultural Sciences, Nagoya University, Nagoya 464-8601, Japan
| | - Daigo Makihara
- International Center for Research and Education in Agriculture, Nagoya University, Nagoya 464-8601, Japan
| | - Kazuyuki Doi
- Graduate School of Bioagricultural Sciences, Nagoya University, Nagoya 464-8601, Japan
| |
Collapse
|
40
|
A divide-and-conquer approach for genomic prediction in rubber tree using machine learning. Sci Rep 2022; 12:18023. [PMID: 36289298 PMCID: PMC9605989 DOI: 10.1038/s41598-022-20416-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 09/13/2022] [Indexed: 01/20/2023] Open
Abstract
Rubber tree (Hevea brasiliensis) is the main feedstock for commercial rubber; however, its long vegetative cycle has hindered the development of more productive varieties via breeding programs. With the availability of H. brasiliensis genomic data, several linkage maps with associated quantitative trait loci have been constructed and suggested as a tool for marker-assisted selection. Nonetheless, novel genomic strategies are still needed, and genomic selection (GS) may facilitate rubber tree breeding programs aimed at reducing the required cycles for performance assessment. Even though such a methodology has already been shown to be a promising tool for rubber tree breeding, increased model predictive capabilities and practical application are still needed. Here, we developed a novel machine learning-based approach for predicting rubber tree stem circumference based on molecular markers. Through a divide-and-conquer strategy, we propose a neural network prediction system with two stages: (1) subpopulation prediction and (2) phenotype estimation. This approach yielded higher accuracies than traditional statistical models in a single-environment scenario. By delivering large accuracy improvements, our methodology represents a powerful tool for use in Hevea GS strategies. Therefore, the incorporation of machine learning techniques into rubber tree GS represents an opportunity to build more robust models and optimize Hevea breeding programs.
Collapse
|
41
|
Kim KW, Nawade B, Nam J, Chu SH, Ha J, Park YJ. Development of an inclusive 580K SNP array and its application for genomic selection and genome-wide association studies in rice. FRONTIERS IN PLANT SCIENCE 2022; 13:1036177. [PMID: 36352876 PMCID: PMC9637963 DOI: 10.3389/fpls.2022.1036177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Accepted: 09/30/2022] [Indexed: 06/16/2023]
Abstract
Rice is a globally cultivated crop and is primarily a staple food source for more than half of the world's population. Various single-nucleotide polymorphism (SNP) arrays have been developed and utilized as standard genotyping methods for rice breeding research. Considering the importance of SNP arrays with more inclusive genetic information for GWAS and genomic selection, we integrated SNPs from eight different data resources: resequencing data from the Korean World Rice Collection (KRICE) of 475 accessions, 3,000 rice genome project (3 K-RGP) data, 700 K high-density rice array, Affymetrix 44 K SNP array, QTARO, Reactome, and plastid and GMO information. The collected SNPs were filtered and selected based on the breeder's interest, covering all key traits or research areas to develop an integrated array system representing inclusive genomic polymorphisms. A total of 581,006 high-quality SNPs were synthesized with an average distance of 200 bp between adjacent SNPs, generating a 580 K Axiom Rice Genotyping Chip (580 K _ KNU chip). Further validation of this array on 4,720 genotypes revealed robust and highly efficient genotyping. This has also been demonstrated in genome-wide association studies (GWAS) and genomic selection (GS) of three traits: clum length, heading date, and panicle length. Several SNPs significantly associated with cut-off, -log10 p-value >7.0, were detected in GWAS, and the GS predictabilities for the three traits were more than 0.5, in both rrBLUP and convolutional neural network (CNN) models. The Axiom 580 K Genotyping array will provide a cost-effective genotyping platform and accelerate rice GWAS and GS studies.
Collapse
Affiliation(s)
- Kyu-Won Kim
- Center for Crop Breeding on Omics and Artificial Intelligence, Kongju National University, Yesan, South Korea
| | - Bhagwat Nawade
- Center for Crop Breeding on Omics and Artificial Intelligence, Kongju National University, Yesan, South Korea
| | - Jungrye Nam
- Center for Crop Breeding on Omics and Artificial Intelligence, Kongju National University, Yesan, South Korea
| | - Sang-Ho Chu
- Center for Crop Breeding on Omics and Artificial Intelligence, Kongju National University, Yesan, South Korea
| | - Jungmin Ha
- Department of Plant Science, Gangneung-Wonju National University, Gangneung, South Korea
| | - Yong-Jin Park
- Center for Crop Breeding on Omics and Artificial Intelligence, Kongju National University, Yesan, South Korea
- Department of Plant Resources, College of Industrial Sciences, Kongju National University, Yesan, South Korea
| |
Collapse
|
42
|
Shen Q, Xie Y, Qiu X, Yu J. The era of cultivating smart rice with high light efficiency and heat tolerance has come of age. FRONTIERS IN PLANT SCIENCE 2022; 13:1021203. [PMID: 36275525 PMCID: PMC9585279 DOI: 10.3389/fpls.2022.1021203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 09/16/2022] [Indexed: 06/16/2023]
Abstract
How to improve the yield of crops has always been the focus of breeding research. Due to the population growth and global climate change, the demand for food has increased sharply, which has brought great challenges to agricultural production. In order to make up for the limitation of global cultivated land area, it is necessary to further improve the output of crops. Photosynthesis is the main source of plant assimilate accumulation, which has a profound impact on the formation of its yield. This review focuses on the cultivation of high light efficiency plants, introduces the main technical means and research progress in improving the photosynthetic efficiency of plants, and discusses the main problems and difficulties faced by the cultivation of high light efficiency plants. At the same time, in view of the frequent occurrence of high-temperature disasters caused by global warming, which seriously threatened plant normal production, we reviewed the response mechanism of plants to heat stress, introduced the methods and strategies of how to cultivate heat tolerant crops, especially rice, and briefly reviewed the progress of heat tolerant research at present. Given big progress in these area, the era of cultivating smart rice with high light efficiency and heat tolerance has come of age.
Collapse
Affiliation(s)
- Qiuping Shen
- The Key Laboratory for Quality Improvement of Agricultural Products of Zhejiang Province, Zhejiang A & F University, Hangzhou, China
- College of Advanced Agricultural Sciences, Zhejiang A & F University, Hangzhou, China
| | - Yujun Xie
- The Key Laboratory for Quality Improvement of Agricultural Products of Zhejiang Province, Zhejiang A & F University, Hangzhou, China
- College of Advanced Agricultural Sciences, Zhejiang A & F University, Hangzhou, China
| | - Xinzhe Qiu
- College of Advanced Agricultural Sciences, Zhejiang A & F University, Hangzhou, China
| | - Jinsheng Yu
- The Key Laboratory for Quality Improvement of Agricultural Products of Zhejiang Province, Zhejiang A & F University, Hangzhou, China
- College of Advanced Agricultural Sciences, Zhejiang A & F University, Hangzhou, China
| |
Collapse
|
43
|
Sabadin F, DoVale JC, Platten JD, Fritsche-Neto R. Optimizing self-pollinated crop breeding employing genomic selection: From schemes to updating training sets. FRONTIERS IN PLANT SCIENCE 2022; 13:935885. [PMID: 36275547 PMCID: PMC9583387 DOI: 10.3389/fpls.2022.935885] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 09/12/2022] [Indexed: 06/16/2023]
Abstract
Long-term breeding schemes using genomic selection (GS) can boost the response to selection per year. Although several studies have shown that GS delivers a higher response to selection, only a few analyze which stage GS produces better results and how to update the training population to maintain prediction accuracy. We used stochastic simulation to compare five GS breeding schemes in a self-pollinated long-term breeding program. Also, we evaluated four strategies, using distinct methods and sizes, to update the training set. Finally, regarding breeding schemes, we proposed a new approach using GS to select the best individuals in each F2 progeny, based on genomic estimated breeding values and genetic divergence, to cross them and generate a new recombination event. Our results showed that the best scenario was using GS in F2, followed by the phenotypic selection of new parents in F4. For TS updating, adding new data every cycle (over 768) to update the TS maintains the prediction accuracy at satisfactory levels for more breeding cycles. However, only the last three generations can be kept in the TS, optimizing the genetic relationship between TS and the targeted population and reducing the computing demand and risks. Hence, we believe that our results may help breeders optimize GS in their programs and improve genetic gain in long-term schemes.
Collapse
Affiliation(s)
- Felipe Sabadin
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - Julio César DoVale
- Department of Crop Science, Federal University of Ceará, Fortaleza, Ceará, Brazil
| | | | - Roberto Fritsche-Neto
- International Rice Research Institute (IRRI), Los Baños, Philippines
- H. Rouse Caffey Rice Research Station, Louisiana State University (LSU) AgCenter, Rayne, LA, United States
| |
Collapse
|
44
|
Anilkumar C, Sunitha NC, Devate NB, Ramesh S. Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review. PLANTA 2022; 256:87. [PMID: 36149531 DOI: 10.1007/s00425-022-03996-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 09/11/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection and its importance in crop breeding. Integration of GS with new breeding tools and developing SOP for GS to achieve maximum genetic gain with low cost and time. The success of conventional breeding approaches is not sufficient to meet the demand of a growing population for nutritious food and other plant-based products. Whereas, marker assisted selection (MAS) is not efficient in capturing all the favorable alleles responsible for economic traits in the process of crop improvement. Genomic selection (GS) developed in livestock breeding and then adapted to plant breeding promised to overcome the drawbacks of MAS and significantly improve complicated traits controlled by gene/QTL with small effects. Large-scale deployment of GS in important crops, as well as simulation studies in a variety of contexts, addressed G × E interaction effects and non-additive effects, as well as lowering breeding costs and time. The current study provides a complete overview of genomic selection, its process, and importance in modern plant breeding, along with insights into its application. GS has been implemented in the improvement of complex traits including tolerance to biotic and abiotic stresses. Furthermore, this review hypothesises that using GS in conjunction with other crop improvement platforms accelerates the breeding process to increase genetic gain. The objective of this review is to highlight the development of an appropriate GS model, the global open source network for GS, and trans-disciplinary approaches for effective accelerated crop improvement. The current study focused on the application of data science, including machine learning and deep learning tools, to enhance the accuracy of prediction models. Present study emphasizes on developing plant breeding strategies centered on GS combined with routine conventional breeding principles by developing GS-SOP to achieve enhanced genetic gain.
Collapse
Affiliation(s)
- C Anilkumar
- ICAR-National Rice Research Institute, Cuttack, India
| | - N C Sunitha
- University of Agricultural Sciences, Bangalore, India
| | | | - S Ramesh
- University of Agricultural Sciences, Bangalore, India.
| |
Collapse
|
45
|
Ning C, Xie K, Huang J, Di Y, Wang Y, Yang A, Hu J, Zhang Q, Wang D, Fan X. Marker density and statistical model designs to increase accuracy of genomic selection for wool traits in Angora rabbits. Front Genet 2022; 13:968712. [PMID: 36118881 PMCID: PMC9478554 DOI: 10.3389/fgene.2022.968712] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 08/17/2022] [Indexed: 11/13/2022] Open
Abstract
The Angora rabbit, a well-known breed for fiber production, has been undergoing traditional breeding programs relying mainly on phenotypes. Genomic selection (GS) uses genomic information and promises to accelerate genetic gain. Practically, to implement GS in Angora rabbit breeding, it is necessary to evaluate different marker densities and GS models to develop suitable strategies for an optimized breeding pipeline. Considering a lack in microarray, low-coverage sequencing combined with genotype imputation was used to boost the number of SNPs across the rabbit genome. Here, in a population of 629 Angora rabbits, a total of 18,577,154 high-quality SNPs were imputed (imputation accuracy above 98%) based on low-coverage sequencing of 3.84X genomic coverage, and wool traits and body weight were measured at 70, 140 and 210 days of age. From the original markers, 0.5K, 1K, 3K, 5K, 10K, 50K, 100K, 500K, 1M and 2M were randomly selected and evaluated, resulting in 50K markers as the baseline for the heritability estimation and genomic prediction. Comparing to the GS performance of single-trait models, the prediction accuracy of nearly all traits could be improved by multi-trait models, which might because multiple-trait models used information from genetically correlated traits. Furthermore, we observed high significant negative correlation between the increased prediction accuracy from single-trait to multiple-trait models and estimated heritability. The results indicated that low-heritability traits could borrow more information from correlated traits and hence achieve higher prediction accuracy. The research first reported heritability estimation in rabbits by using genome-wide markers, and provided 50K as an optimal marker density for further microarray design, genetic evaluation and genomic selection in Angora rabbits. We expect that the work could provide strategies for GS in early selection, and optimize breeding programs in rabbits.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Dan Wang
- *Correspondence: Dan Wang, ; Xinzhong Fan,
| | | |
Collapse
|
46
|
Cerioli T, Hernandez CO, Angira B, McCouch SR, Robbins KR, Famoso AN. Development and validation of an optimized marker set for genomic selection in southern U.S. rice breeding programs. THE PLANT GENOME 2022; 15:e20219. [PMID: 35611838 DOI: 10.1002/tpg2.20219] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 03/28/2022] [Indexed: 06/15/2023]
Abstract
The potential of genomic selection (GS) to increase the efficiency of breeding programs has been clearly demonstrated; however, the implementation of GS in rice (Oryza sativa L.) breeding programs has been limited. In recent years, efforts have begun to work toward implementing GS into the Louisiana State University (LSU) Agricultural Center rice breeding program. One of the first steps for successful GS implementation is to establish a suitable marker set for the target germplasm and a reliable, cost-effective genotyping platform capable of providing informative marker data with an adequate turnaround time. The objective of this study was to develop a marker set for routine GS and demonstrate its effectiveness in southern U.S. rice germplasm. The utility of the resulting marker set, the LSU500, for GS applications was demonstrated using four years of breeding data across 7,607 experimental lines and four elite biparental populations. The predictive ability of GS ranged from 0.13 to 0.78 for key traits across different market classes and yield trials. Comparisons between phenotypic selection and GS within biparental populations demonstrates similar performance of GS compared with phenotypic selection in predicting future performance. The prediction accuracies obtained with the LSU500 marker set demonstrates the utility of this marker set for cost-effective GS applications in southern U.S. rice breeding programs. The LSU500 marker set has been established through the genotyping service provider Agriplex Genomics, and in the future, it will undergo improvements to reduce the cost and increase the accuracy of GS.
Collapse
Affiliation(s)
- Tommaso Cerioli
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Christopher O Hernandez
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Brijesh Angira
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| | - Susan R McCouch
- Section of Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14850, USA
- Cornell Institute for Digital Agriculture, Cornell Univ., Ithaca, NY, 14850, USA
| | - Kelly R Robbins
- Section of Plant Breeding and Genetics, School of Integrative Plant Sciences, Cornell Univ., Ithaca, NY, 14850, USA
| | - Adam N Famoso
- H. Rouse Caffey Rice Research Station, Louisiana State Univ. Agricultural Center, Rayne, LA, 70578, USA
| |
Collapse
|
47
|
Meena MR, Appunu C, Arun Kumar R, Manimekalai R, Vasantha S, Krishnappa G, Kumar R, Pandey SK, Hemaprabha G. Recent Advances in Sugarcane Genomics, Physiology, and Phenomics for Superior Agronomic Traits. Front Genet 2022; 13:854936. [PMID: 35991570 PMCID: PMC9382102 DOI: 10.3389/fgene.2022.854936] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 05/26/2022] [Indexed: 11/13/2022] Open
Abstract
Advances in sugarcane breeding have contributed significantly to improvements in agronomic traits and crop yield. However, the growing global demand for sugar and biofuel in the context of climate change requires further improvements in cane and sugar yields. Attempts to achieve the desired rates of genetic gain in sugarcane by conventional breeding means are difficult as many agronomic traits are genetically complex and polygenic, with each gene exerting small effects. Unlike those of many other crops, the sugarcane genome is highly heterozygous due to its autopolyploid nature, which further hinders the development of a comprehensive genetic map. Despite these limitations, many superior agronomic traits/genes for higher cane yield, sugar production, and disease/pest resistance have been identified through the mapping of quantitative trait loci, genome-wide association studies, and transcriptome approaches. Improvements in traits controlled by one or two loci are relatively easy to achieve; however, this is not the case for traits governed by many genes. Many desirable phenotypic traits are controlled by quantitative trait nucleotides (QTNs) with small and variable effects. Assembling these desired QTNs by conventional breeding methods is time consuming and inefficient due to genetic drift. However, recent developments in genomics selection (GS) have allowed sugarcane researchers to select and accumulate desirable alleles imparting superior traits as GS is based on genomic estimated breeding values, which substantially increases the selection efficiency and genetic gain in sugarcane breeding programs. Next-generation sequencing techniques coupled with genome-editing technologies have provided new vistas in harnessing the sugarcane genome to look for desirable agronomic traits such as erect canopy, leaf angle, prolonged greening, high biomass, deep root system, and the non-flowering nature of the crop. Many desirable cane-yielding traits, such as single cane weight, numbers of tillers, numbers of millable canes, as well as cane quality traits, such as sucrose and sugar yield, have been explored using these recent biotechnological tools. This review will focus on the recent advances in sugarcane genomics related to genetic gain and the identification of favorable alleles for superior agronomic traits for further utilization in sugarcane breeding programs.
Collapse
Affiliation(s)
- Mintu Ram Meena
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
- *Correspondence: Mintu Ram Meena, ; Chinnaswamy Appunu,
| | - Chinnaswamy Appunu
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
- *Correspondence: Mintu Ram Meena, ; Chinnaswamy Appunu,
| | - R. Arun Kumar
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | | | - S. Vasantha
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| | | | - Ravinder Kumar
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
| | - S. K. Pandey
- Regional Centre, ICAR-Sugarcane Breeding Institute, Karnal, India
| | - G. Hemaprabha
- ICAR-Sugarcane Breeding Institute, Coimbatore, India
| |
Collapse
|
48
|
Zia B, Shi A, Olaoye D, Xiong H, Ravelombola W, Gepts P, Schwartz HF, Brick MA, Otto K, Ogg B, Chen S. Genome-Wide Association Study and Genomic Prediction for Bacterial Wilt Resistance in Common Bean ( Phaseolus vulgaris) Core Collection. Front Genet 2022; 13:853114. [PMID: 35711938 PMCID: PMC9197503 DOI: 10.3389/fgene.2022.853114] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 04/14/2022] [Indexed: 11/30/2022] Open
Abstract
Common bean (Phaseolus vulgaris) is one of the major legume crops cultivated worldwide. Bacterial wilt (BW) of common bean (Curtobacterium flaccumfaciens pv. flaccumfaciens), being a seed-borne disease, has been a challenge in common bean producing regions. A genome-wide association study (GWAS) was conducted to identify SNP markers associated with BW resistance in the USDA common bean core collection. A total of 168 accessions were evaluated for resistance against three different isolates of BW. Our study identified a total of 14 single nucleotide polymorphism (SNP) markers associated with the resistance to BW isolates 528, 557, and 597 using mixed linear models (MLMs) in BLINK, FarmCPU, GAPIT, and TASSEL 5. These SNPs were located on chromosomes Phaseolus vulgaris [Pv]02, Pv04, Pv08, and Pv09 for isolate 528; Pv07, Pv10, and Pv11 for isolate 557; and Pv04, Pv08, and Pv10 for isolate 597. The genomic prediction accuracy was assessed by utilizing seven GP models with 1) all the 4,568 SNPs and 2) the 14 SNP markers. The overall prediction accuracy (PA) ranged from 0.30 to 0.56 for resistance against the three BW isolates. A total of 14 candidate genes were discovered for BW resistance located on chromosomes Pv02, Pv04, Pv07, Pv08, and Pv09. This study revealed vital information for developing genetic resistance against the BW pathogen in common bean. Accordingly, the identified SNP markers and candidate genes can be utilized in common bean molecular breeding programs to develop novel resistant cultivars.
Collapse
Affiliation(s)
- Bazgha Zia
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Ainong Shi
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Dotun Olaoye
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Haizheng Xiong
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Waltram Ravelombola
- Organic & Specialty Crop Breeding, Texas A&M AgriLife Research, Vernon, TX, United States
| | - Paul Gepts
- Department of Plant Sciences/MS1, University of California, Davis, Davis, CA, United States
| | - Howard F Schwartz
- Department of Agricultural Biology, Colorado State University, Fort Collins, CO, United States
| | - Mark A Brick
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, United States
| | - Kristen Otto
- Department of Agricultural Biology, Colorado State University, Fort Collins, CO, United States
| | - Barry Ogg
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, United States
| | - Senyu Chen
- Department of Plant Pathology, University of Minnesota, Minneapolis, MN, United States
| |
Collapse
|
49
|
Singh M, Nara U. Genetic insights in pearl millet breeding in the genomic era: challenges and prospects. PLANT BIOTECHNOLOGY REPORTS 2022; 17:15-37. [PMID: 35692233 PMCID: PMC9169599 DOI: 10.1007/s11816-022-00767-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 04/30/2022] [Accepted: 05/17/2022] [Indexed: 05/28/2023]
Abstract
Pearl millet, a vital staple food and an important cereal, is emerging as crop having various end-uses as feed, food as well as fodder. Advancement in high-throughput sequencing technology has boosted up pearl millet genomic research in past few years. The available draft genome of pearl millet providing an insight into the advancement of several breeding lines. Comparative and functional genomics have untangled several loci and genes regulating adaptive and agronomic traits in pearl millet. Additionally, the knowledge achieved has far away from being applicable in real breeding practices. We believe that the best path ahead is to adopt genome-based approaches for tailored designing of pearl millet as multi-functional crop with outstanding agronomic traits for various end uses. Presently review highlight several novel concepts and techniques in crop breeding, and summarize the recent advances in pearl millet genomic research, peculiarly genome-wide association dissections of several novel alleles and genes for agronomically important traits.
Collapse
Affiliation(s)
- Mandeep Singh
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Usha Nara
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| |
Collapse
|
50
|
Danilevicz MF, Gill M, Anderson R, Batley J, Bennamoun M, Bayer PE, Edwards D. Plant Genotype to Phenotype Prediction Using Machine Learning. Front Genet 2022; 13:822173. [PMID: 35664329 PMCID: PMC9159391 DOI: 10.3389/fgene.2022.822173] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/07/2022] [Indexed: 12/13/2022] Open
Abstract
Genomic prediction tools support crop breeding based on statistical methods, such as the genomic best linear unbiased prediction (GBLUP). However, these tools are not designed to capture non-linear relationships within multi-dimensional datasets, or deal with high dimension datasets such as imagery collected by unmanned aerial vehicles. Machine learning (ML) algorithms have the potential to surpass the prediction accuracy of current tools used for genotype to phenotype prediction, due to their capacity to autonomously extract data features and represent their relationships at multiple levels of abstraction. This review addresses the challenges of applying statistical and machine learning methods for predicting phenotypic traits based on genetic markers, environment data, and imagery for crop breeding. We present the advantages and disadvantages of explainable model structures, discuss the potential of machine learning models for genotype to phenotype prediction in crop breeding, and the challenges, including the scarcity of high-quality datasets, inconsistent metadata annotation and the requirements of ML models.
Collapse
Affiliation(s)
- Monica F. Danilevicz
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Mitchell Gill
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Robyn Anderson
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Mohammed Bennamoun
- School of Physics, Mathematics and Computing, University of Western Australia, Perth, WA, Australia
| | - Philipp E. Bayer
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
- *Correspondence: David Edwards,
| |
Collapse
|