1
|
Long Y, Wang C, Liu C, Li H, Pu A, Dong Z, Wei X, Wan X. Molecular mechanisms controlling grain size and weight and their biotechnological breeding applications in maize and other cereal crops. J Adv Res 2024; 62:27-46. [PMID: 37739122 DOI: 10.1016/j.jare.2023.09.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 09/03/2023] [Accepted: 09/18/2023] [Indexed: 09/24/2023] Open
Abstract
BACKGROUND Cereal crops are a primary energy source for humans. Grain size and weight affect both evolutionary fitness and grain yield of cereals. Although studies on gene mining and molecular mechanisms controlling grain size and weight are constantly emerging in cereal crops, only a few systematic reviews on the underlying molecular mechanisms and their breeding applications are available so far. AIM OF REVIEW This review provides a general state-of-the-art overview of molecular mechanisms and targeted strategies for improving grain size and weight of cereals as well as insights for future yield-improving biotechnology-assisted breeding. KEY SCIENTIFIC CONCEPTS OF REVIEW In this review, the evolution of research on grain size and weight over the last 20 years is traced based on a bibliometric analysis of 1158 publications and the main signaling pathways and transcriptional factors involved are summarized. In addition, the roles of post-transcriptional regulation and photosynthetic product accumulation affecting grain size and weight in maize and rice are outlined. State-of-the-art strategies for discovering novel genes related to grain size and weight in maize and other cereal crops as well as advanced breeding biotechnology strategies being used for improving yield including marker-assisted selection, genomic selection, transgenic breeding, and genome editing are also discussed.
Collapse
Affiliation(s)
- Yan Long
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Industry Research Institute of Biotechnology Breeding, Yili Normal University, Yining 835000, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Cheng Wang
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Chang Liu
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Huangai Li
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Aqing Pu
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Zhenying Dong
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Industry Research Institute of Biotechnology Breeding, Yili Normal University, Yining 835000, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Xun Wei
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Industry Research Institute of Biotechnology Breeding, Yili Normal University, Yining 835000, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China
| | - Xiangyuan Wan
- Research Institute of Biology and Agriculture, University of Science and Technology Beijing, Beijing 100083, China; Industry Research Institute of Biotechnology Breeding, Yili Normal University, Yining 835000, China; Beijing Engineering Laboratory of Main Crop Bio-Tech Breeding, Beijing International Science and Technology Cooperation Base of Bio-Tech Breeding, Zhongzhi International Institute of Agricultural Biosciences, Beijing 100192, China.
| |
Collapse
|
2
|
Alemu A, Åstrand J, Montesinos-López OA, Isidro Y Sánchez J, Fernández-Gónzalez J, Tadesse W, Vetukuri RR, Carlsson AS, Ceplitis A, Crossa J, Ortiz R, Chawade A. Genomic selection in plant breeding: Key factors shaping two decades of progress. MOLECULAR PLANT 2024; 17:552-578. [PMID: 38475993 DOI: 10.1016/j.molp.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/22/2024] [Accepted: 03/08/2024] [Indexed: 03/14/2024]
Abstract
Genomic selection, the application of genomic prediction (GP) models to select candidate individuals, has significantly advanced in the past two decades, effectively accelerating genetic gains in plant breeding. This article provides a holistic overview of key factors that have influenced GP in plant breeding during this period. We delved into the pivotal roles of training population size and genetic diversity, and their relationship with the breeding population, in determining GP accuracy. Special emphasis was placed on optimizing training population size. We explored its benefits and the associated diminishing returns beyond an optimum size. This was done while considering the balance between resource allocation and maximizing prediction accuracy through current optimization algorithms. The density and distribution of single-nucleotide polymorphisms, level of linkage disequilibrium, genetic complexity, trait heritability, statistical machine-learning methods, and non-additive effects are the other vital factors. Using wheat, maize, and potato as examples, we summarize the effect of these factors on the accuracy of GP for various traits. The search for high accuracy in GP-theoretically reaching one when using the Pearson's correlation as a metric-is an active research area as yet far from optimal for various traits. We hypothesize that with ultra-high sizes of genotypic and phenotypic datasets, effective training population optimization methods and support from other omics approaches (transcriptomics, metabolomics and proteomics) coupled with deep-learning algorithms could overcome the boundaries of current limitations to achieve the highest possible prediction accuracy, making genomic selection an effective tool in plant breeding.
Collapse
Affiliation(s)
- Admas Alemu
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Johanna Åstrand
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden; Lantmännen Lantbruk, Svalöv, Sweden
| | | | - Julio Isidro Y Sánchez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Javier Fernández-Gónzalez
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM) - Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, 28223 Madrid, Spain
| | - Wuletaw Tadesse
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat, Morocco
| | - Ramesh R Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | - Anders S Carlsson
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Km 45, Carretera México-Veracruz, Texcoco, México 52640, Mexico
| | - Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden.
| | - Aakash Chawade
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Alnarp, Sweden
| |
Collapse
|
3
|
Wu C, Luo J, Xiao Y. Multi-omics assists genomic prediction of maize yield with machine learning approaches. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2024; 44:14. [PMID: 38343399 PMCID: PMC10853138 DOI: 10.1007/s11032-024-01454-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 01/19/2024] [Indexed: 02/28/2024]
Abstract
With the improvement of high-throughput technologies in recent years, large multi-dimensional plant omics data have been produced, and big-data-driven yield prediction research has received increasing attention. Machine learning offers promising computational and analytical solutions to interpret the biological meaning of large amounts of data in crops. In this study, we utilized multi-omics datasets from 156 maize recombinant inbred lines, containing 2496 single nucleotide polymorphisms (SNPs), 46 image traits (i-traits) from 16 developmental stages obtained through an automatic phenotyping platform, and 133 primary metabolites. Based on benchmark tests with different types of prediction models, some machine learning methods, such as Partial Least Squares (PLS), Random Forest (RF), and Gaussian process with Radial basis function kernel (GaussprRadial), achieved better prediction for maize yield, albeit slight difference for method preferences among i-traits, genomic, and metabolic data. We found that better yield prediction may be caused by various capabilities in ranking and filtering data features, which is found to be linked with biological meaning such as photosynthesis-related or kernel development-related regulations. Finally, by integrating multiple omics data with the RF machine learning approach, we can further improve the prediction accuracy of grain yield from 0.32 to 0.43. Our research provides new ideas for the application of plant omics data and artificial intelligence approaches to facilitate crop genetic improvements. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-024-01454-z.
Collapse
Affiliation(s)
- Chengxiu Wu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070 China
| | - Jingyun Luo
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070 China
| | - Yingjie Xiao
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070 China
- Hubei Hongshan Laboratory, Wuhan, 430070 China
| |
Collapse
|
4
|
Kitashova A, Brodsky V, Chaturvedi P, Pierides I, Ghatak A, Weckwerth W, Nägele T. Quantifying the impact of dynamic plant-environment interactions on metabolic regulation. JOURNAL OF PLANT PHYSIOLOGY 2023; 290:154116. [PMID: 37839392 DOI: 10.1016/j.jplph.2023.154116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 10/03/2023] [Accepted: 10/06/2023] [Indexed: 10/17/2023]
Abstract
A plant's genome encodes enzymes, transporters and many other proteins which constitute metabolism. Interactions of plants with their environment shape their growth, development and resilience towards adverse conditions. Although genome sequencing technologies and applications have experienced triumphantly rapid development during the last decades, enabling nowadays a fast and cheap sequencing of full genomes, prediction of metabolic phenotypes from genotype × environment interactions remains, at best, very incomplete. The main reasons are a lack of understanding of how different levels of molecular organisation depend on each other, and how they are constituted and expressed within a setup of growth conditions. Phenotypic plasticity, e.g., of the genetic model plant Arabidopsis thaliana, has provided important insights into plant-environment interactions and the resulting genotype x phenotype relationships. Here, we summarize previous and current findings about plant development in a changing environment and how this might be shaped and reflected in metabolism and its regulation. We identify current challenges in the study of plant development and metabolic regulation and provide an outlook of how methodological workflows might support the application of findings made in model systems to crops and their cultivation.
Collapse
Affiliation(s)
- Anastasia Kitashova
- LMU Munich, Faculty of Biology, Plant Evolutionary Cell Biology, 82152, Planegg, Germany.
| | - Vladimir Brodsky
- LMU Munich, Faculty of Biology, Plant Evolutionary Cell Biology, 82152, Planegg, Germany.
| | - Palak Chaturvedi
- University of Vienna, Molecular Systems Biology Lab (MOSYS), Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Djerassiplatz 1, 1030, Vienna, Austria.
| | - Iro Pierides
- University of Vienna, Molecular Systems Biology Lab (MOSYS), Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Djerassiplatz 1, 1030, Vienna, Austria.
| | - Arindam Ghatak
- University of Vienna, Molecular Systems Biology Lab (MOSYS), Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Djerassiplatz 1, 1030, Vienna, Austria; Vienna Metabolomics Center, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria.
| | - Wolfram Weckwerth
- University of Vienna, Molecular Systems Biology Lab (MOSYS), Department of Functional and Evolutionary Ecology, Faculty of Life Sciences, Djerassiplatz 1, 1030, Vienna, Austria; Vienna Metabolomics Center, University of Vienna, Djerassiplatz 1, 1030, Vienna, Austria.
| | - Thomas Nägele
- LMU Munich, Faculty of Biology, Plant Evolutionary Cell Biology, 82152, Planegg, Germany.
| |
Collapse
|
5
|
Ndikuryayo C, Ndayiragije A, Kilasi NL, Kusolwa P. Identification of Drought Tolerant Rice ( Oryza Sativa L.) Genotypes with Asian and African Backgrounds. PLANTS (BASEL, SWITZERLAND) 2023; 12:922. [PMID: 36840269 PMCID: PMC9967662 DOI: 10.3390/plants12040922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 01/18/2023] [Accepted: 01/28/2023] [Indexed: 06/18/2023]
Abstract
Drought is among the major abiotic stresses on rice production that can cause yield losses of up to 100% under severe drought conditions. Neither of the rice varieties currently grown in Burundi can withstand very low and irregular precipitation. This study identified genotypes that have putative quantitative trait loci (QTLs) associated with drought tolerance and determined their performance in the field. Two hundred and fifteen genotypes were grown in the field under both drought and irrigated conditions. Genomic deoxyribonucleic acid (DNA) was extracted from rice leaves for further genotypic screening. The results revealed the presence of the QTLs qDTY12.1, qDTY3.1, qDTY2-2_1, and qDTY1.1 in 90%, 85%, 53%, and 22% of the evaluated genotypes, respectively. The results of the phenotypic evaluation showed a significant yield reduction due to drought stress. Yield components and other agronomic traits were also negatively affected by drought. Genotypes having high yield best linear unbiased predictions (BLUPs) with two or more major QTLs for drought tolerance, including IR 108044-B-B-B-3-B-B, IR 92522-45-3-1-4, and BRRI DHAN 55 are of great interest for breeding programs to improve the drought tolerance of lines or varieties with other preferred traits.
Collapse
Affiliation(s)
- Cyprien Ndikuryayo
- Department of Crop Science and Horticulture, Sokoine University of Agriculture, Morogoro P.O. Box 3001, Tanzania
- International Rice Research Institute (IRRI), Bujumbura P.O. Box 5132, Burundi
- Burundi Institute of Agricultural Sciences (ISABU), Avenue de la Cathédrale, Bujumbura P.O. Box 795, Burundi
| | - Alexis Ndayiragije
- International Rice Research Institute (IRRI), Bujumbura P.O. Box 5132, Burundi
| | - Newton Lwiyiso Kilasi
- Department of Crop Science and Horticulture, Sokoine University of Agriculture, Morogoro P.O. Box 3001, Tanzania
| | - Paul Kusolwa
- Department of Crop Science and Horticulture, Sokoine University of Agriculture, Morogoro P.O. Box 3001, Tanzania
| |
Collapse
|
6
|
Yan J, Wang X. Machine learning bridges omics sciences and plant breeding. TRENDS IN PLANT SCIENCE 2023; 28:199-210. [PMID: 36153276 DOI: 10.1016/j.tplants.2022.08.018] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 08/15/2022] [Accepted: 08/23/2022] [Indexed: 06/16/2023]
Abstract
Some of the biological knowledge obtained from fundamental research will be implemented in applied plant breeding. To bridge basic research and breeding practice, machine learning (ML) holds great promise to translate biological knowledge and omics data into precision-designed plant breeding. Here, we review ML for multi-omics analysis in plants, including data dimensionality reduction, inference of gene-regulation networks, and gene discovery and prioritization. These applications will facilitate understanding trait regulation mechanisms and identifying target genes potentially applicable to knowledge-driven molecular design breeding. We also highlight applications of deep learning in plant phenomics and ML in genomic selection-assisted breeding, such as various ML algorithms that model the correlations among genotypes (genes), phenotypes (traits), and environments, to ultimately achieve data-driven genomic design breeding.
Collapse
Affiliation(s)
- Jun Yan
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China
| | - Xiangfeng Wang
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China.
| |
Collapse
|
7
|
Jubair S, Domaratzki M. Crop genomic selection with deep learning and environmental data: A survey. Front Artif Intell 2023; 5:1040295. [PMID: 36703955 PMCID: PMC9871498 DOI: 10.3389/frai.2022.1040295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open
Abstract
Machine learning techniques for crop genomic selections, especially for single-environment plants, are well-developed. These machine learning models, which use dense genome-wide markers to predict phenotype, routinely perform well on single-environment datasets, especially for complex traits affected by multiple markers. On the other hand, machine learning models for predicting crop phenotype, especially deep learning models, using datasets that span different environmental conditions, have only recently emerged. Models that can accept heterogeneous data sources, such as temperature, soil conditions and precipitation, are natural choices for modeling GxE in multi-environment prediction. Here, we review emerging deep learning techniques that incorporate environmental data directly into genomic selection models.
Collapse
Affiliation(s)
- Sheikh Jubair
- Department of Computer Science, University of Manitoba, Winnipeg, MB, Canada,*Correspondence: Sheikh Jubair ✉
| | - Mike Domaratzki
- Department of Computer Science, University of Western Ontario, London, ON, Canada
| |
Collapse
|
8
|
Mahmood U, Li X, Fan Y, Chang W, Niu Y, Li J, Qu C, Lu K. Multi-omics revolution to promote plant breeding efficiency. FRONTIERS IN PLANT SCIENCE 2022; 13:1062952. [PMID: 36570904 PMCID: PMC9773847 DOI: 10.3389/fpls.2022.1062952] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 11/24/2022] [Indexed: 06/17/2023]
Abstract
Crop production is the primary goal of agricultural activities, which is always taken into consideration. However, global agricultural systems are coming under increasing pressure from the rising food demand of the rapidly growing world population and changing climate. To address these issues, improving high-yield and climate-resilient related-traits in crop breeding is an effective strategy. In recent years, advances in omics techniques, including genomics, transcriptomics, proteomics, and metabolomics, paved the way for accelerating plant/crop breeding to cope with the changing climate and enhance food production. Optimized omics and phenotypic plasticity platform integration, exploited by evolving machine learning algorithms will aid in the development of biological interpretations for complex crop traits. The precise and progressive assembly of desire alleles using precise genome editing approaches and enhanced breeding strategies would enable future crops to excel in combating the changing climates. Furthermore, plant breeding and genetic engineering ensures an exclusive approach to developing nutrient sufficient and climate-resilient crops, the productivity of which can sustainably and adequately meet the world's food, nutrition, and energy needs. This review provides an overview of how the integration of omics approaches could be exploited to select crop varieties with desired traits.
Collapse
Affiliation(s)
- Umer Mahmood
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Xiaodong Li
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Yonghai Fan
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Wei Chang
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Yue Niu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
| | - Jiana Li
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| | - Cunmin Qu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| | - Kun Lu
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City and Southwest University, College of Agronomy and Biotechnology, Southwest University, Chongqing, China
- Academy of Agricultural Sciences, Southwest University, Chongqing, China
- Engineering Research Center of South Upland Agriculture, Ministry of Education, Chongqing, China
| |
Collapse
|
9
|
Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022; 15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]
Abstract
The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; CIMMYT-China Tropical Maize Research Center, School of Food Science and Engineering, Foshan University, Foshan, Guangdong 528231, China; Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China.
| | - Xingping Zhang
- Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China
| | - Huihui Li
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan 572024, China
| | - Hongjian Zheng
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Jianan Zhang
- MolBreeding Biotechnology Co., Ltd., Shijiazhuang, Hebei 050035, China
| | - Michael S Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Rajeev K Varshney
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Australia
| | - Boddupalli M Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Qian Qian
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
10
|
Robert P, Goudemand E, Auzanneau J, Oury FX, Rolland B, Heumez E, Bouchet S, Caillebotte A, Mary-Huard T, Le Gouis J, Rincent R. Phenomic selection in wheat breeding: prediction of the genotype-by-environment interaction in multi-environment breeding trials. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2022; 135:3337-3356. [PMID: 35939074 DOI: 10.1007/s00122-022-04170-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 06/28/2022] [Indexed: 06/15/2023]
Abstract
Phenomic prediction of wheat grain yield and heading date in different multi-environmental trial scenarios is accurate. Modelling the genotype-by-environment interaction effect using phenomic data is a potentially low-cost complement to genomic prediction. The performance of wheat cultivars in multi-environmental trials (MET) is difficult to predict because of the genotype-by-environment interactions (G × E). Phenomic selection is supposed to be efficient for modelling the G × E effect because it accounts for non-additive effects. Here, phenomic data are near-infrared (NIR) spectra obtained from plant material. While phenomic selection has recently been shown to accurately predict wheat grain yield in single environments, its accuracy needs to be investigated for MET. We used four datasets from two winter wheat breeding programs to test and compare the predictive abilities of phenomic and genomic models for grain yield and heading date in different MET scenarios. We also compared different methods to model the G × E using different covariance matrices based on spectra. On average, phenomic and genomic prediction abilities are similar in all different MET scenarios. Better predictive abilities were obtained when G × E effects were modelled with NIR spectra than without them, and it was better to use all the spectra of all genotypes in all environments for modelling the G × E. To facilitate the implementation of phenomic prediction, we tested MET designs where the NIR spectra were measured only on the genotype-environment combinations phenotyped for the target trait. Missing spectra were predicted with a weighted multivariate ridge regression. Intermediate predictive abilities for grain yield were obtained in a sparse testing scenario and for new genotypes, which shows that phenomic selection is an efficient and practicable prediction method for dealing with G × E.
Collapse
Affiliation(s)
- Pauline Robert
- INRAE, CNRS, AgroParisTech, GQE - Le Moulon, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
- INRAE - Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
- Agri-Obtentions, Ferme de Gauvilliers, 78660, Orsonville, France
- Florimond-Desprez Veuve & Fils SAS, 3 rue Florimond-Desprez, BP 41, 59242, Cappelle-en-Pévèle, France
| | - Ellen Goudemand
- Florimond-Desprez Veuve & Fils SAS, 3 rue Florimond-Desprez, BP 41, 59242, Cappelle-en-Pévèle, France
| | - Jérôme Auzanneau
- Agri-Obtentions, Ferme de Gauvilliers, 78660, Orsonville, France
| | - François-Xavier Oury
- INRAE - Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
| | - Bernard Rolland
- INRAE-Agrocampus Ouest-Université Rennes 1, UMR1349, IGEPP, Domaine de la Motte, 35653, Le Rheu, France
| | - Emmanuel Heumez
- INRAE, UE 972, Grandes Cultures Innovation Environnement, 2 Chaussée Brunehaut, 80200, Estrées-Mons, France
| | - Sophie Bouchet
- INRAE - Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
| | - Antoine Caillebotte
- INRAE, CNRS, AgroParisTech, GQE - Le Moulon, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Tristan Mary-Huard
- INRAE, CNRS, AgroParisTech, GQE - Le Moulon, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
- MIA, INRAE, AgroParisTech, Université Paris-Saclay, 75005, Paris, France
| | - Jacques Le Gouis
- INRAE - Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France
| | - Renaud Rincent
- INRAE, CNRS, AgroParisTech, GQE - Le Moulon, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
- INRAE - Université Clermont-Auvergne, UMR1095, GDEC, 5 chemin de Beaulieu, 63000, Clermont-Ferrand, France.
| |
Collapse
|
11
|
Anilkumar C, Sunitha NC, Devate NB, Ramesh S. Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review. PLANTA 2022; 256:87. [PMID: 36149531 DOI: 10.1007/s00425-022-03996-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 09/11/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection and its importance in crop breeding. Integration of GS with new breeding tools and developing SOP for GS to achieve maximum genetic gain with low cost and time. The success of conventional breeding approaches is not sufficient to meet the demand of a growing population for nutritious food and other plant-based products. Whereas, marker assisted selection (MAS) is not efficient in capturing all the favorable alleles responsible for economic traits in the process of crop improvement. Genomic selection (GS) developed in livestock breeding and then adapted to plant breeding promised to overcome the drawbacks of MAS and significantly improve complicated traits controlled by gene/QTL with small effects. Large-scale deployment of GS in important crops, as well as simulation studies in a variety of contexts, addressed G × E interaction effects and non-additive effects, as well as lowering breeding costs and time. The current study provides a complete overview of genomic selection, its process, and importance in modern plant breeding, along with insights into its application. GS has been implemented in the improvement of complex traits including tolerance to biotic and abiotic stresses. Furthermore, this review hypothesises that using GS in conjunction with other crop improvement platforms accelerates the breeding process to increase genetic gain. The objective of this review is to highlight the development of an appropriate GS model, the global open source network for GS, and trans-disciplinary approaches for effective accelerated crop improvement. The current study focused on the application of data science, including machine learning and deep learning tools, to enhance the accuracy of prediction models. Present study emphasizes on developing plant breeding strategies centered on GS combined with routine conventional breeding principles by developing GS-SOP to achieve enhanced genetic gain.
Collapse
Affiliation(s)
- C Anilkumar
- ICAR-National Rice Research Institute, Cuttack, India
| | - N C Sunitha
- University of Agricultural Sciences, Bangalore, India
| | | | - S Ramesh
- University of Agricultural Sciences, Bangalore, India.
| |
Collapse
|
12
|
Hansen PB, Ruud AK, de los Campos G, Malinowska M, Nagy I, Svane SF, Thorup-Kristensen K, Jensen JD, Krusell L, Asp T. Integration of DNA Methylation and Transcriptome Data Improves Complex Trait Prediction in Hordeum vulgare. PLANTS 2022; 11:plants11172190. [PMID: 36079572 PMCID: PMC9459846 DOI: 10.3390/plants11172190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 08/19/2022] [Accepted: 08/21/2022] [Indexed: 11/30/2022]
Abstract
Whole-genome multi-omics profiles contain valuable information for the characterization and prediction of complex traits in plants. In this study, we evaluate multi-omics models to predict four complex traits in barley (Hordeum vulgare); grain yield, thousand kernel weight, protein content, and nitrogen uptake. Genomic, transcriptomic, and DNA methylation data were obtained from 75 spring barley lines tested in the RadiMax semi-field phenomics facility under control and water-scarce treatment. By integrating multi-omics data at genomic, transcriptomic, and DNA methylation regulatory levels, a higher proportion of phenotypic variance was explained (0.72–0.91) than with genomic models alone (0.55–0.86). The correlation between predictions and phenotypes varied from 0.17–0.28 for control plants and 0.23–0.37 for water-scarce plants, and the increase in accuracy was significant for nitrogen uptake and protein content compared to models using genomic information alone. Adding transcriptomic and DNA methylation information to the prediction models explained more of the phenotypic variance attributed to the environment in grain yield and nitrogen uptake. It furthermore explained more of the non-additive genetic effects for thousand kernel weight and protein content. Our results show the feasibility of multi-omics prediction for complex traits in barley.
Collapse
Affiliation(s)
- Pernille Bjarup Hansen
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
- Correspondence: (P.B.H.); (T.A.); Tel.: +45-87158243 (T.A.)
| | - Anja Karine Ruud
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Gustavo de los Campos
- Departments of Epidemiology & Biostatistics and Statistics & Probability, Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Marta Malinowska
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Istvan Nagy
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
| | - Simon Fiil Svane
- Section for Crop Sciences, Department of Plant and Environmental Sciences, Copenhagen University, 2630 Taastrup, Denmark
| | - Kristian Thorup-Kristensen
- Section for Crop Sciences, Department of Plant and Environmental Sciences, Copenhagen University, 2630 Taastrup, Denmark
| | | | - Lene Krusell
- Sejet Plant Breeding, Nørremarksvej 67, 8700 Horsens, Denmark
| | - Torben Asp
- Center for Quantitative Genetics and Genomics, Aarhus University, 4200 Slagelse, Denmark
- Correspondence: (P.B.H.); (T.A.); Tel.: +45-87158243 (T.A.)
| |
Collapse
|
13
|
Sirangelo TM, Ludlow RA, Spadafora ND. Multi-Omics Approaches to Study Molecular Mechanisms in Cannabis sativa. PLANTS (BASEL, SWITZERLAND) 2022; 11:2182. [PMID: 36015485 PMCID: PMC9416457 DOI: 10.3390/plants11162182] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/18/2022] [Accepted: 08/19/2022] [Indexed: 06/15/2023]
Abstract
Cannabis (Cannabis sativa L.), also known as hemp, is one of the oldest cultivated crops, grown for both its use in textile and cordage production, and its unique chemical properties. However, due to the legislation regulating cannabis cultivation, it is not a well characterized crop, especially regarding molecular and genetic pathways. Only recently have regulations begun to ease enough to allow more widespread cannabis research, which, coupled with the availability of cannabis genome sequences, is fuelling the interest of the scientific community. In this review, we provide a summary of cannabis molecular resources focusing on the most recent and relevant genomics, transcriptomics and metabolomics approaches and investigations. Multi-omics methods are discussed, with this combined approach being a powerful tool to identify correlations between biological processes and metabolic pathways across diverse omics layers, and to better elucidate the relationships between cannabis sub-species. The correlations between genotypes and phenotypes, as well as novel metabolites with therapeutic potential are also explored in the context of cannabis breeding programs. However, further studies are needed to fully elucidate the complex metabolomic matrix of this crop. For this reason, some key points for future research activities are discussed, relying on multi-omics approaches.
Collapse
Affiliation(s)
- Tiziana M. Sirangelo
- CREA—Council for Agricultural Research and Agricultural Economy Analysis, Genomics and Bioinformatics Department, 26836 Montanaso Lombardo, Italy
| | - Richard A. Ludlow
- School of Biosciences, Cardiff University, Sir Martin Evans Building, Museum Avenue, Cardiff CF10 3AX, UK
| | - Natasha D. Spadafora
- Department of Chemical, Pharmaceutical and Agricultural Sciences, University of Ferrara, 44121 Ferrara, Italy
| |
Collapse
|
14
|
Ferrão LFV, Sater H, Lyrene P, Amadeu RR, Sims CA, Tieman DM, Munoz PR. Terpene volatiles mediates the chemical basis of blueberry aroma and consumer acceptability. Food Res Int 2022; 158:111468. [DOI: 10.1016/j.foodres.2022.111468] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 05/27/2022] [Accepted: 06/01/2022] [Indexed: 11/04/2022]
|
15
|
Predicting hybrid rice performance using AIHIB model based on artificial intelligence. Sci Rep 2022; 12:9709. [PMID: 35690641 PMCID: PMC9188612 DOI: 10.1038/s41598-022-13805-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 05/27/2022] [Indexed: 11/24/2022] Open
Abstract
Hybrid breeding is fast becoming a key instrument in plants' crop productivity. Grain yield performance of hybrids (F1) under different parental genetic features has consequently received considerable attention in the literature. The main objective of this study was to introduce a new method, known as AI_HIB under different parental genetic features using artificial intelligence (AI) techniques. In so doing, the rice cultivars TAM, KHZ, SPD, GHB, IR28, AHM, SHP and their F1 hybrid were used. Having recorded Grain Yield (GY), Unfertile Panicle Number (UFP), Plant Height (HE), Days to Flowering (DF), Panicle Exertion (PE), Panicle Length (PL), Filled Grain Number (FG), Primary Branches Number (PBN), Flag Leaf Length (FLL), Flag Leaf Width (FLW), Flag Leaf Area (FLA), and Plant Biomass (BI) in the field, we include these features in our proposed model. When using the GA and PSO algorithm to select the features, grain yield had the highest frequency at the input of the Artificial Neural Network (ANN), Adaptive Neuro-Fuzzy Inference System (ANFIS) and Support Vector Machine (SVM) structure. The AI_HIB_ANN result revealed that the trained neural network with parental data enjoyed a good ability to predict the response of hybrid performance. Findings also reflected that the obtained MSE was low and R2 value was greater than 96%. AI_HIB_SVM and AI_HIB_ANFIS showed that measuring attributes could predict number of primary branches, plant height, days to flowering and grain yield per plant with accuracies of 99%. These findings have significant implications as it presents a new promising prediction method for hybrid rice yield based on the characteristics of the parent lines by AI. These findings contribute to provide a basis for designing a smartphone application in terms of the AI_HIB_SVM and AI_HIB_ANFIS methods to easily predict hybrid performance with a high accuracy rate.
Collapse
|
16
|
Wang X, Wen Y. A penalized linear mixed model with generalized method of moments for prediction analysis on high-dimensional multi-omics data. Brief Bioinform 2022; 23:6596990. [PMID: 35649346 PMCID: PMC9310531 DOI: 10.1093/bib/bbac193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 03/18/2022] [Accepted: 04/27/2022] [Indexed: 11/13/2022] Open
Abstract
With the advances in high-throughput biotechnologies, high-dimensional multi-layer omics data become increasingly available. They can provide both confirmatory and complementary information to disease risk and thus have offered unprecedented opportunities for risk prediction studies. However, the high-dimensionality and complex inter/intra-relationships among multi-omics data have brought tremendous analytical challenges. Here we present a computationally efficient penalized linear mixed model with generalized method of moments estimator (MpLMMGMM) for the prediction analysis on multi-omics data. Our method extends the widely used linear mixed model proposed for genomic risk predictions to model multi-omics data, where kernel functions are used to capture various types of predictive effects from different layers of omics data and penalty terms are introduced to reduce the impact of noise. Compared with existing penalized linear mixed models, the proposed method adopts the generalized method of moments estimator and it is much more computationally efficient. Through extensive simulation studies and the analysis of positron emission tomography imaging outcomes, we have demonstrated that MpLMMGMM can simultaneously consider a large number of variables and efficiently select those that are predictive from the corresponding omics layers. It can capture both linear and nonlinear predictive effects and achieves better prediction performance than competing methods.
Collapse
Affiliation(s)
- Xiaqiong Wang
- Department of Statistics, University of Auckland, 38 Princes Street, 1010, Auckland, New Zealand
| | - Yalu Wen
- Department of Statistics, University of Auckland, 38 Princes Street, 1010, Auckland, New Zealand
| |
Collapse
|
17
|
Melandri G, Monteverde E, Riewe D, AbdElgawad H, McCouch SR, Bouwmeester H. Can biochemical traits bridge the gap between genomics and plant performance? A study in rice under drought. PLANT PHYSIOLOGY 2022; 189:1139-1152. [PMID: 35166848 PMCID: PMC9157150 DOI: 10.1093/plphys/kiac053] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 01/17/2022] [Indexed: 05/13/2023]
Abstract
The possibility of introducing metabolic/biochemical phenotyping to complement genomics-based predictions in breeding pipelines has been considered for years. Here we examine to what extent and under what environmental conditions metabolic/biochemical traits can effectively contribute to understanding and predicting plant performance. In this study, multivariable statistical models based on flag leaf central metabolism and oxidative stress status were used to predict grain yield (GY) performance for 271 indica rice (Oryza sativa) accessions grown in the field under well-watered and reproductive stage drought conditions. The resulting models displayed significantly higher predictability than multivariable models based on genomic data for the prediction of GY under drought (Q2 = 0.54-0.56 versus 0.35) and for stress-induced GY loss (Q2 = 0.59-0.64 versus 0.03-0.06). Models based on the combined datasets showed predictabilities similar to metabolic/biochemical-based models alone. In contrast to genetic markers, models with enzyme activities and metabolite values also quantitatively integrated the effect of physiological differences such as plant height on GY. The models highlighted antioxidant enzymes of the ascorbate-glutathione cycle and a lipid oxidation stress marker as important predictors of rice GY stability under drought at the reproductive stage, and these stress-related variables were more predictive than leaf central metabolites. These findings provide evidence that metabolic/biochemical traits can integrate dynamic cellular and physiological responses to the environment and can help bridge the gap between the genome and the phenome of crops as predictors of GY performance under drought.
Collapse
Affiliation(s)
- Giovanni Melandri
- Laboratory of Plant Physiology, Wageningen University and Research, Wageningen, the Netherlands
- School of Integrative Plant Sciences, Plant Breeding and Genetics Section, Cornell University, Ithaca, New York, USA
| | - Eliana Monteverde
- School of Integrative Plant Sciences, Plant Breeding and Genetics Section, Cornell University, Ithaca, New York, USA
- Departamento de Biología Vegetal, Facultad de Agronomía, Laboratorio de Evolución y Domesticación de las Plantas, Universidad de La República, Montevideo, Uruguay
| | - David Riewe
- Julius Kühn-Institute (JKI), Federal Research Centre for Cultivated Plants, Institute for Ecological Chemistry, Plant Analysis and Stored Product Protection, Berlin, Germany
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Seeland, Germany
| | - Hamada AbdElgawad
- Laboratory for Integrated Molecular Plant Physiology Research, University of Antwerp, Antwerp, Belgium
- Department of Botany, Faculty of Science, Beni-Suef University, Beni Suef, Egypt
| | - Susan R McCouch
- School of Integrative Plant Sciences, Plant Breeding and Genetics Section, Cornell University, Ithaca, New York, USA
| | - Harro Bouwmeester
- Laboratory of Plant Physiology, Wageningen University and Research, Wageningen, the Netherlands
- Plant Hormone Biology group, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, the Netherlands
| |
Collapse
|
18
|
Zhou R, Jiang F, Niu L, Song X, Yu L, Yang Y, Wu Z. Increase Crop Resilience to Heat Stress Using Omic Strategies. FRONTIERS IN PLANT SCIENCE 2022; 13:891861. [PMID: 35656008 PMCID: PMC9152541 DOI: 10.3389/fpls.2022.891861] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 04/04/2022] [Indexed: 06/15/2023]
Abstract
Varieties of various crops with high resilience are urgently needed to feed the increased population in climate change conditions. Human activities and climate change have led to frequent and strong weather fluctuation, which cause various abiotic stresses to crops. The understanding of crops' responses to abiotic stresses in different aspects including genes, RNAs, proteins, metabolites, and phenotypes can facilitate crop breeding. Using multi-omics methods, mainly genomics, transcriptomics, proteomics, metabolomics, and phenomics, to study crops' responses to abiotic stresses will generate a better, deeper, and more comprehensive understanding. More importantly, multi-omics can provide multiple layers of information on biological data to understand plant biology, which will open windows for new opportunities to improve crop resilience and tolerance. However, the opportunities and challenges coexist. Interpretation of the multidimensional data from multi-omics and translation of the data into biological meaningful context remained a challenge. More reasonable experimental designs starting from sowing seed, cultivating the plant, and collecting and extracting samples were necessary for a multi-omics study as the first step. The normalization, transformation, and scaling of single-omics data should consider the integration of multi-omics. This review reports the current study of crops at abiotic stresses in particular heat stress using omics, which will help to accelerate crop improvement to better tolerate and adapt to climate change.
Collapse
Affiliation(s)
- Rong Zhou
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
- Department of Food Science, Aarhus University, Aarhus, Denmark
| | - Fangling Jiang
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Lifei Niu
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Xiaoming Song
- College of Life Sciences, North China University of Science and Technology, Tangshan, China
| | - Lu Yu
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Yuwen Yang
- Excellence and Innovation Center, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Zhen Wu
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| |
Collapse
|
19
|
Genome-Enabled Prediction Methods Based on Machine Learning. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2467:189-218. [PMID: 35451777 DOI: 10.1007/978-1-0716-2205-6_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Growth of artificial intelligence and machine learning (ML) methodology has been explosive in recent years. In this class of procedures, computers get knowledge from sets of experiences and provide forecasts or classification. In genome-wide based prediction (GWP), many ML studies have been carried out. This chapter provides a description of main semiparametric and nonparametric algorithms used in GWP in animals and plants. Thirty-four ML comparative studies conducted in the last decade were used to develop a meta-analysis through a Thurstonian model, to evaluate algorithms with the best predictive qualities. It was found that some kernel, Bayesian, and ensemble methods displayed greater robustness and predictive ability. However, the type of study and data distribution must be considered in order to choose the most appropriate model for a given problem.
Collapse
|
20
|
Sakurai N. Recent applications of metabolomics in plant breeding. BREEDING SCIENCE 2022; 72:56-65. [PMID: 36045891 PMCID: PMC8987846 DOI: 10.1270/jsbbs.21065] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 12/19/2021] [Indexed: 05/27/2023]
Abstract
Metabolites play a central role in maintaining organismal life and in defining crop phenotypes, such as nutritional value, fragrance, color, and stress resistance. Among the 'omes' in biology, the metabolome is the closest to the phenotype. Consequently, metabolomics has been applied to crop improvement as method for monitoring changes in chemical compositions, clarifying the mechanisms underlying cellular functions, discovering markers and diagnostics, and phenotyping for mQTL, mGWAS, and metabolite-genome predictions. In this review, 359 reports of the most recent applications of metabolomics to plant breeding-related studies were examined. In addition to the major crops, more than 160 other crops including rare medicinal plants were considered. One bottleneck associated with using metabolomics is the wide array of instruments that are used to obtain data and the ambiguity associated with metabolite identification and quantification. To further the application of metabolomics to plant breeding, the features and perspectives of the technology are discussed.
Collapse
Affiliation(s)
- Nozomu Sakurai
- Bioinformation and DDBJ Center, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| |
Collapse
|
21
|
Shen G, Hu W, Wang X, Zhou X, Han Z, Sherif A, Ayaad M, Xing Y. Assembly of yield heterosis of an elite rice hybrid is promising by manipulating dominant quantitative trait loci. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2022; 64:688-701. [PMID: 34995015 DOI: 10.1111/jipb.13220] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 01/04/2022] [Indexed: 05/27/2023]
Abstract
In the past, rice hybrids with strong heterosis have been obtained empirically, by developing and testing thousands of combinations. Here, we aimed to determine whether heterosis of an elite hybrid could be achieved by manipulating major quantitative trait loci. We used 202 chromosome segment substitution lines from the elite hybrid Shanyou 63 to evaluate single segment heterosis (SSH) of yield per plant and identify heterotic loci. All nine detected heterotic loci acted in a dominant fashion, and no SSH exhibited overdominance. Functional alleles of key yield-related genes Ghd7, Ghd7.1, Hd1, and GS3 were dispersed in both parents. No functional alleles of three investigated genes were expressed at higher levels in the hybrids than in the more desirable parents. A hybrid pyramiding eight heterotic loci in the female parent Zhenshan 97 background had a comparable yield to Shanyou 63 and much higher yield than Zhenshan 97. Five hybrids pyramiding eight or nine heterotic loci in the combined parental genome background showed similar yield performance to that of Shanyou 63. These results suggest that dominance underlying functional complementation is an important contributor to yield heterosis and that heterosis assembly might be successfully promised by manipulating several major dominant heterotic loci.
Collapse
Affiliation(s)
- Guojing Shen
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
| | - Wei Hu
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
| | - Xianmeng Wang
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Hongshan Laboratory, Wuhan, 430070, China
| | - Xiangchun Zhou
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Hongshan Laboratory, Wuhan, 430070, China
| | - Zhongming Han
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
| | - Ahmed Sherif
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Hongshan Laboratory, Wuhan, 430070, China
| | - Mohammed Ayaad
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
- Plant Research Department, Nuclear Research Center, Atomic Energy Authority, Abo-Zaabal, 13759, Egypt
| | - Yongzhong Xing
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
- Hubei Hongshan Laboratory, Wuhan, 430070, China
| |
Collapse
|
22
|
Van Tassel DL, DeHaan LR, Diaz-Garcia L, Hershberger J, Rubin MJ, Schlautman B, Turner K, Miller AJ. Re-imagining crop domestication in the era of high throughput phenomics. CURRENT OPINION IN PLANT BIOLOGY 2022; 65:102150. [PMID: 34883308 DOI: 10.1016/j.pbi.2021.102150] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 10/19/2021] [Accepted: 10/25/2021] [Indexed: 06/13/2023]
Abstract
De novo domestication is an exciting option for increasing species diversity and ecosystem service functionality of agricultural landscapes. Genomic selection (GS), the application of genomic markers to predict phenotypic traits in a breeding population, offers the possibility of rapid genetic improvement, making GS especially attractive for modifying traits of long-lived species. However, for some wild species just entering the domestication pipeline, especially those with large and complex genomes, a lack of funding and/or prior genome characterization, GS is often out of reach. High throughput phenomics has the potential to augment traditional pedigree selection, reduce costs and amplify impacts of genomic selection, and even create new predictive selection approaches independent of sequencing or pedigrees.
Collapse
Affiliation(s)
| | - Lee R DeHaan
- The Land Institute, 2440 E Water Well Rd., Salina, KS, 67401, USA
| | | | - Jenna Hershberger
- The Land Institute, 2440 E Water Well Rd., Salina, KS, 67401, USA; Donald Danforth Plant Science Center, 975 North Warson Road, Saint Louis, MO, 63132, USA
| | - Matthew J Rubin
- Donald Danforth Plant Science Center, 975 North Warson Road, Saint Louis, MO, 63132, USA
| | | | - Kathryn Turner
- The Land Institute, 2440 E Water Well Rd., Salina, KS, 67401, USA
| | - Allison J Miller
- Donald Danforth Plant Science Center, 975 North Warson Road, Saint Louis, MO, 63132, USA; Saint Louis University Department of Biology, 3507 Laclede Avenue, St. Louis, MO, 63103, USA.
| |
Collapse
|
23
|
Bartholomé J, Prakash PT, Cobb JN. Genomic Prediction: Progress and Perspectives for Rice Improvement. Methods Mol Biol 2022; 2467:569-617. [PMID: 35451791 DOI: 10.1007/978-1-0716-2205-6_21] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Genomic prediction can be a powerful tool to achieve greater rates of genetic gain for quantitative traits if thoroughly integrated into a breeding strategy. In rice as in other crops, the interest in genomic prediction is very strong with a number of studies addressing multiple aspects of its use, ranging from the more conceptual to the more practical. In this chapter, we review the literature on rice (Oryza sativa) and summarize important considerations for the integration of genomic prediction in breeding programs. The irrigated breeding program at the International Rice Research Institute is used as a concrete example on which we provide data and R scripts to reproduce the analysis but also to highlight practical challenges regarding the use of predictions. The adage "To someone with a hammer, everything looks like a nail" describes a common psychological pitfall that sometimes plagues the integration and application of new technologies to a discipline. We have designed this chapter to help rice breeders avoid that pitfall and appreciate the benefits and limitations of applying genomic prediction, as it is not always the best approach nor the first step to increasing the rate of genetic gain in every context.
Collapse
Affiliation(s)
- Jérôme Bartholomé
- CIRAD, UMR AGAP Institut, Montpellier, France.
- AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France.
- Rice Breeding Platform, International Rice Research Institute, Manila, Philippines.
| | | | | |
Collapse
|
24
|
Martini JWR, Gao N, Crossa J. Incorporating Omics Data in Genomic Prediction. Methods Mol Biol 2022; 2467:341-357. [PMID: 35451782 DOI: 10.1007/978-1-0716-2205-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this chapter, we discuss the motivation for integrating other types of omics data into genomic prediction methods. We give an overview of literature investigating the performance of omics-enhanced predictions, and highlight potential pitfalls when applying these methods in breeding. We emphasize that the statistical methods available for genomic data can be transferred to the general omics case. However, when using a framework of omic relationship matrices, the standardization of the variables may be more relevant than it is for a genomic relationship matrix based on single-nucleotide polymorphisms.
Collapse
Affiliation(s)
- Johannes W R Martini
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico.
| | - Ning Gao
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - José Crossa
- International Maize and Wheat Improvement Center (CIMMYT), Veracruz, CP, Mexico
| |
Collapse
|
25
|
Hu H, Campbell MT, Yeats TH, Zheng X, Runcie DE, Covarrubias-Pazaran G, Broeckling C, Yao L, Caffe-Treml M, Gutiérrez LA, Smith KP, Tanaka J, Hoekenga OA, Sorrells ME, Gore MA, Jannink JL. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021. [PMID: 34643760 DOI: 10.25739/8p1e-0931] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.
Collapse
Affiliation(s)
- Haixiao Hu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA.
| | - Malachy T Campbell
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Trevor H Yeats
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Xuying Zheng
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, USA
| | - Giovanny Covarrubias-Pazaran
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera México-Veracruz, El Batán, 56130, Texcoco, Edo. de México, México
| | - Corey Broeckling
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Linxing Yao
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Melanie Caffe-Treml
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, 57007, USA
| | - Lucı A Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - James Tanaka
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Owen A Hoekenga
- Cayuga Genetics Consulting Group LLC, Ithaca, NY, 14850, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| |
Collapse
|
26
|
Hu H, Campbell MT, Yeats TH, Zheng X, Runcie DE, Covarrubias-Pazaran G, Broeckling C, Yao L, Caffe-Treml M, Gutiérrez LA, Smith KP, Tanaka J, Hoekenga OA, Sorrells ME, Gore MA, Jannink JL. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:4043-4054. [PMID: 34643760 PMCID: PMC8580906 DOI: 10.1007/s00122-021-03946-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 09/05/2021] [Indexed: 05/26/2023]
Abstract
Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.
Collapse
Affiliation(s)
- Haixiao Hu
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA.
| | - Malachy T Campbell
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Trevor H Yeats
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Xuying Zheng
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Daniel E Runcie
- Department of Plant Sciences, University of California Davis, Davis, CA, 95616, USA
| | - Giovanny Covarrubias-Pazaran
- International Maize and Wheat Improvement Center (CIMMYT), Km. 45, Carretera México-Veracruz, El Batán, 56130, Texcoco, Edo. de México, México
| | - Corey Broeckling
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Linxing Yao
- Proteomics and Metabolomics Facility, Colorado State University, C130 Microbiology, 2021 Campus Delivery, Fort Collins, CO, 80521, USA
| | - Melanie Caffe-Treml
- Department of Agronomy, Horticulture & Plant Science, South Dakota State University, Brookings, SD, 57007, USA
| | - Lucı A Gutiérrez
- Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Kevin P Smith
- Department of Agronomy & Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - James Tanaka
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Owen A Hoekenga
- Cayuga Genetics Consulting Group LLC, Ithaca, NY, 14850, USA
| | - Mark E Sorrells
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Michael A Gore
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
| | - Jean-Luc Jannink
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, 14853, USA
- USDA-ARS, Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA
| |
Collapse
|
27
|
Knoch D, Werner CR, Meyer RC, Riewe D, Abbadi A, Lücke S, Snowdon RJ, Altmann T. Multi-omics-based prediction of hybrid performance in canola. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:1147-1165. [PMID: 33523261 PMCID: PMC7973648 DOI: 10.1007/s00122-020-03759-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 12/19/2020] [Indexed: 05/05/2023]
Abstract
Complementing or replacing genetic markers with transcriptomic data and use of reproducing kernel Hilbert space regression based on Gaussian kernels increases hybrid prediction accuracies for complex agronomic traits in canola. In plant breeding, hybrids gained particular importance due to heterosis, the superior performance of offspring compared to their inbred parents. Since the development of new top performing hybrids requires labour-intensive and costly breeding programmes, including testing of large numbers of experimental hybrids, the prediction of hybrid performance is of utmost interest to plant breeders. In this study, we tested the effectiveness of hybrid prediction models in spring-type oilseed rape (Brassica napus L./canola) employing different omics profiles, individually and in combination. To this end, a population of 950 F1 hybrids was evaluated for seed yield and six other agronomically relevant traits in commercial field trials at several locations throughout Europe. A subset of these hybrids was also evaluated in a climatized glasshouse regarding early biomass production. For each of the 477 parental rapeseed lines, 13,201 single nucleotide polymorphisms (SNPs), 154 primary metabolites, and 19,479 transcripts were determined and used as predictive variables. Both, SNP markers and transcripts, effectively predict hybrid performance using (genomic) best linear unbiased prediction models (gBLUP). Compared to models using pure genetic markers, models incorporating transcriptome data resulted in significantly higher prediction accuracies for five out of seven agronomic traits, indicating that transcripts carry important information beyond genomic data. Notably, reproducing kernel Hilbert space regression based on Gaussian kernels significantly exceeded the predictive abilities of gBLUP models for six of the seven agronomic traits, demonstrating its potential for implementation in future canola breeding programmes.
Collapse
Affiliation(s)
- Dominic Knoch
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| | - Christian R. Werner
- The Roslin Institute, University of Edinburgh, Easter Bush, Midlothian, EH25 9RG Scotland, UK
| | - Rhonda C. Meyer
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| | - David Riewe
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
- Institute for Ecological Chemistry, Plant Analysis and Stored Product Protection, Julius Kühn Institute (JKI)—Federal Research Centre for Cultivated Plants, 14195 Berlin, Germany
| | - Amine Abbadi
- NPZ Innovation GmbH, Hohenlieth, 24363 Holtsee, Germany
- Norddeutsche Pflanzenzucht Hans-Georg Lembke KG, Hohenlieth, 24363 Holtsee, Germany
| | - Sophie Lücke
- Norddeutsche Pflanzenzucht Hans-Georg Lembke KG, Hohenlieth, 24363 Holtsee, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392 Giessen, Germany
| | - Thomas Altmann
- Department of Molecular Genetics, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, OT Gatersleben Germany
| |
Collapse
|
28
|
Xu Y, Zhao Y, Wang X, Ma Y, Li P, Yang Z, Zhang X, Xu C, Xu S. Incorporation of parental phenotypic data into multi-omic models improves prediction of yield-related traits in hybrid rice. PLANT BIOTECHNOLOGY JOURNAL 2021; 19:261-272. [PMID: 32738177 PMCID: PMC7868986 DOI: 10.1111/pbi.13458] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 06/14/2020] [Accepted: 07/22/2020] [Indexed: 05/15/2023]
Abstract
Hybrid breeding has been shown to effectively increase rice productivity. However, identifying desirable hybrids out of numerous potential combinations is a daunting challenge. Genomic selection holds great promise for accelerating hybrid breeding by enabling early selection before phenotypes are measured. With the recent advances in multi-omic technologies, hybrid prediction based on transcriptomic and metabolomic data has received increasing attention. However, the current omic-based hybrid prediction has ignored parental phenotypic information, which is of fundamental importance in plant breeding. In this study, we integrated parental phenotypic information into various multi-omic prediction models applied in hybrid breeding of rice and compared the predictabilities of 15 combinations from four sets of predictors from the parents, that is genome, transcriptome, metabolome and phenome. The predictability for each combination was evaluated using the best linear unbiased prediction and a modified fast HAT method. We found significant interactions between predictors and traits in predictability, but joint prediction with various combinations of the predictors significantly improved predictability relative to prediction of any single source omic data for each trait investigated. Incorporation of parental phenotypic data into various omic predictors increased the predictability, averagely by 13.6%, 54.5%, 19.9% and 8.3%, for grain yield, number of tillers per plant, number of grains per panicle and 1000 grain weight, respectively. Among nine models of incorporating parental traits, the AD-All model was the most effective one. This novel strategy of incorporating parental phenotypic data into multi-omic prediction is expected to improve hybrid breeding progress, especially with the development of high-throughput phenotyping technologies.
Collapse
Affiliation(s)
- Yang Xu
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Yue Zhao
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Xin Wang
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Ying Ma
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Pengcheng Li
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Zefeng Yang
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Xuecai Zhang
- International Maize and Wheat Improvement Center (CIMMYT)MexicoDFMexico
| | - Chenwu Xu
- Jiangsu Key Laboratory of Crop Genetics and PhysiologyKey Laboratory of Plant Functional Genomics of Ministry of EducationJiangsu Key Laboratory of Crop Genomics and Molecular BreedingCo‐Innovation Center for Modern Production Technology of Grain CropsAgricultural College of Yangzhou UniversityYangzhouChina
| | - Shizhong Xu
- Department of Botany and Plant SciencesUniversity of CaliforniaRiversideCAUSA
| |
Collapse
|
29
|
Michel S, Wagner C, Nosenko T, Steiner B, Samad-Zamini M, Buerstmayr M, Mayer K, Buerstmayr H. Merging Genomics and Transcriptomics for Predicting Fusarium Head Blight Resistance in Wheat. Genes (Basel) 2021; 12:114. [PMID: 33477759 PMCID: PMC7832326 DOI: 10.3390/genes12010114] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 01/14/2021] [Accepted: 01/16/2021] [Indexed: 01/13/2023] Open
Abstract
Genomic selection with genome-wide distributed molecular markers has evolved into a well-implemented tool in many breeding programs during the last decade. The resistance against Fusarium head blight (FHB) in wheat is probably one of the most thoroughly studied systems within this framework. Aside from the genome, other biological strata like the transcriptome have likewise shown some potential in predictive breeding strategies but have not yet been investigated for the FHB-wheat pathosystem. The aims of this study were thus to compare the potential of genomic with transcriptomic prediction, and to assess the merit of blending incomplete transcriptomic with complete genomic data by the single-step method. A substantial advantage of gene expression data over molecular markers has been observed for the prediction of FHB resistance in the studied diversity panel of breeding lines and released cultivars. An increase in prediction ability was likewise found for the single-step predictions, although this can mostly be attributed to an increased accuracy among the RNA-sequenced genotypes. The usage of transcriptomics can thus be seen as a complement to already established predictive breeding pipelines with pedigree and genomic data, particularly when more cost-efficient multiplexing techniques for RNA-sequencing will become more accessible in the future.
Collapse
Affiliation(s)
- Sebastian Michel
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
| | - Christian Wagner
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
| | - Tetyana Nosenko
- PGSB Plant Genome and Systems Biology, Helmholtz Center Munich, German Research Center for Environmental Health, 85764 Neuherberg, Germany; (T.N.); (K.M.)
- Research Unit Environmental Simulation (EUS) at the Institute of Biochemical Plant Pathology (BIOP), Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | - Barbara Steiner
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
| | - Mina Samad-Zamini
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
- Saatzucht Edelhof GmbH, 3910 Zwettl, Austria
| | - Maria Buerstmayr
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
| | - Klaus Mayer
- PGSB Plant Genome and Systems Biology, Helmholtz Center Munich, German Research Center for Environmental Health, 85764 Neuherberg, Germany; (T.N.); (K.M.)
| | - Hermann Buerstmayr
- Institute of Biotechnology in Plant Production (IFA-Tulln), University of Natural Resources and Life Sciences Vienna, 3430 Tulln, Austria; (C.W.); (B.S.); (M.S.-Z.); (M.B.); (H.B.)
| |
Collapse
|
30
|
Ye S, Li J, Zhang Z. Multi-omics-data-assisted genomic feature markers preselection improves the accuracy of genomic prediction. J Anim Sci Biotechnol 2020; 11:109. [PMID: 33292577 PMCID: PMC7708144 DOI: 10.1186/s40104-020-00515-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 09/22/2020] [Indexed: 12/02/2022] Open
Abstract
Background Presently, multi-omics data (e.g., genomics, transcriptomics, proteomics, and metabolomics) are available to improve genomic predictors. Omics data not only offers new data layers for genomic prediction but also provides a bridge between organismal phenotypes and genome variation that cannot be readily captured at the genome sequence level. Therefore, using multi-omics data to select feature markers is a feasible strategy to improve the accuracy of genomic prediction. In this study, simultaneously using whole-genome sequencing (WGS) and gene expression level data, four strategies for single-nucleotide polymorphism (SNP) preselection were investigated for genomic predictions in the Drosophila Genetic Reference Panel. Results Using genomic best linear unbiased prediction (GBLUP) with complete WGS data, the prediction accuracies were 0.208 ± 0.020 (0.181 ± 0.022) for the startle response and 0.272 ± 0.017 (0.307 ± 0.015) for starvation resistance in the female (male) lines. Compared with GBLUP using complete WGS data, both GBLUP and the genomic feature BLUP (GFBLUP) did not improve the prediction accuracy using SNPs preselected from complete WGS data based on the results of genome-wide association studies (GWASs) or transcriptome-wide association studies (TWASs). Furthermore, by using SNPs preselected from the WGS data based on the results of the expression quantitative trait locus (eQTL) mapping of all genes, only the startle response had greater accuracy than GBLUP with the complete WGS data. The best accuracy values in the female and male lines were 0.243 ± 0.020 and 0.220 ± 0.022, respectively. Importantly, by using SNPs preselected based on the results of the eQTL mapping of significant genes from TWAS, both GBLUP and GFBLUP resulted in great accuracy and small bias of genomic prediction. Compared with the GBLUP using complete WGS data, the best accuracy values represented increases of 60.66% and 39.09% for the starvation resistance and 27.40% and 35.36% for startle response in the female and male lines, respectively. Conclusions Overall, multi-omics data can assist genomic feature preselection and improve the performance of genomic prediction. The new knowledge gained from this study will enrich the use of multi-omics in genomic prediction.
Collapse
Affiliation(s)
- Shaopan Ye
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Jiaqi Li
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
| | - Zhe Zhang
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, National Engineering Research Centre for Breeding Swine Industry, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China.
| |
Collapse
|
31
|
Xu Y, Liu X, Fu J, Wang H, Wang J, Huang C, Prasanna BM, Olsen MS, Wang G, Zhang A. Enhancing Genetic Gain through Genomic Selection: From Livestock to Plants. PLANT COMMUNICATIONS 2020; 1:100005. [PMID: 33404534 PMCID: PMC7747995 DOI: 10.1016/j.xplc.2019.100005] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Although long-term genetic gain has been achieved through increasing use of modern breeding methods and technologies, the rate of genetic gain needs to be accelerated to meet humanity's demand for agricultural products. In this regard, genomic selection (GS) has been considered most promising for genetic improvement of the complex traits controlled by many genes each with minor effects. Livestock scientists pioneered GS application largely due to livestock's significantly higher individual values and the greater reduction in generation interval that can be achieved in GS. Large-scale application of GS in plants can be achieved by refining field management to improve heritability estimation and prediction accuracy and developing optimum GS models with the consideration of genotype-by-environment interaction and non-additive effects, along with significant cost reduction. Moreover, it would be more effective to integrate GS with other breeding tools and platforms for accelerating the breeding process and thereby further enhancing genetic gain. In addition, establishing an open-source breeding network and developing transdisciplinary approaches would be essential in enhancing breeding efficiency for small- and medium-sized enterprises and agricultural research systems in developing countries. New strategies centered on GS for enhancing genetic gain need to be developed.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
- CIMMYT-China Tropical Maize Research Center, Foshan University, Foshan 528231, China
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Xiaogang Liu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Junjie Fu
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Hongwu Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Jiankang Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Changling Huang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Boddupalli M. Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Michael S. Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Guoying Wang
- Institute of Crop Science/CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Aimin Zhang
- Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
32
|
Da L, Liu Y, Yang J, Tian T, She J, Ma X, Xu W, Su Z. AppleMDO: A Multi-Dimensional Omics Database for Apple Co-Expression Networks and Chromatin States. FRONTIERS IN PLANT SCIENCE 2019; 10:1333. [PMID: 31695717 PMCID: PMC6817610 DOI: 10.3389/fpls.2019.01333] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 09/25/2019] [Indexed: 05/17/2023]
Abstract
As an economically important crop, apple is one of the most cultivated fruit trees in temperate regions worldwide. Recently, a large number of high-quality transcriptomic and epigenomic datasets for apple were made available to the public, which could be helpful in inferring gene regulatory relationships and thus predicting gene function at the genome level. Through integration of the available apple genomic, transcriptomic, and epigenomic datasets, we constructed co-expression networks, identified functional modules, and predicted chromatin states. A total of 112 RNA-seq datasets were integrated to construct a global network and a conditional network (tissue-preferential network). Furthermore, a total of 1,076 functional modules with closely related gene sets were identified to assess the modularity of biological networks and further subjected to functional enrichment analysis. The results showed that the function of many modules was related to development, secondary metabolism, hormone response, and transcriptional regulation. Transcriptional regulation is closely related to epigenetic marks on chromatin. A total of 20 epigenomic datasets, which included ChIP-seq, DNase-seq, and DNA methylation analysis datasets, were integrated and used to classify chromatin states. Based on the ChromHMM algorithm, the genome was divided into 620,122 fragments, which were classified into 24 states according to the combination of epigenetic marks and enriched-feature regions. Finally, through the collaborative analysis of different omics datasets, the online database AppleMDO (http://bioinformatics.cau.edu.cn/AppleMDO/) was established for cross-referencing and the exploration of possible novel functions of apple genes. In addition, gene annotation information and functional support toolkits were also provided. Our database might be convenient for researchers to develop insights into the function of genes related to important agronomic traits and might serve as a reference for other fruit trees.
Collapse
|