1
|
Gaur A, Jindal Y, Singh V, Tiwari R, Juliana P, Kaushik D, Kumar KJY, Ahlawat OP, Singh G, Sheoran S. GWAS elucidated grain yield genetics in Indian spring wheat under diverse water conditions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:177. [PMID: 38972024 DOI: 10.1007/s00122-024-04680-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 06/11/2024] [Indexed: 07/08/2024]
Abstract
KEY MESSAGE Underpinned natural variations and key genes associated with yield under different water regimes, and identified genomic signatures of genetic gain in the Indian wheat breeding program. A novel KASP marker for TKW under water stress was developed and validated. A comprehensive genome-wide association study was conducted on 300 spring wheat genotypes to elucidate the natural variations associated with grain yield and its eleven contributing traits under fully irrigated, restricted water, and simulated no water conditions. Utilizing the 35K Wheat Breeders' Array, we identified 1155 quantitative trait nucleotides (QTNs), with 207 QTNs exhibiting stability across diverse conditions. These QTNs were further delimited into 539 genomic regions using a genome-wide LD value of 3.0 Mbp, revealing pleiotropic control across traits and conditions. Sub-genome A was significantly associated with traits under irrigated conditions, while sub-genome B showed more QTNs under water stressed conditions. Favourable alleles with significantly associated QTNs were delineated, with a notable pyramiding effect for enhancing trait performance. Additionally, allele of only 921 QTNs significantly affected the population mean. Allele profiling highlighted C-306 as a most potential source of drought tolerance. Moreover, 762 genes overlapping significant QTNs were identified, narrowing down to 27 putative candidate genes overlapping 29 novel and functional SNPs expressing (≥ 0.5 tpm) relevance across various growth conditions. A new KASP assay was developed, targeting a gene TraesCS2A03G1123700 regulating thousand kernel weight under severe drought condition. Genomic selection models (GBLUP, BayesB, MxE, and R-Norm) demonstrated an average prediction accuracy of 0.06-0.58 across environments, indicating potential for trait selection. Retrospective analysis of the Indian wheat breeding program supported a genetic gain in GY at the rate of ca. 0.56% per breeding cycle, since 1960, supporting the identification of genomic signatures driving trait selection and genetic gain. These findings offer insight into improving the rate of genetic gain in wheat breeding programs globally.
Collapse
Affiliation(s)
- Arpit Gaur
- Department of Genetics and Plant Breeding, CCS Haryana Agricultural University, Hisar, India
- Crop Improvement, ICAR- Indian Institute of Wheat and Barley Research, Karnal, India
| | - Yogesh Jindal
- Department of Genetics and Plant Breeding, CCS Haryana Agricultural University, Hisar, India
| | - Vikram Singh
- Department of Genetics and Plant Breeding, CCS Haryana Agricultural University, Hisar, India
| | - Ratan Tiwari
- Crop Improvement, ICAR- Indian Institute of Wheat and Barley Research, Karnal, India
| | | | - Deepak Kaushik
- Department of Genetics and Plant Breeding, CCS Haryana Agricultural University, Hisar, India
| | | | - Om Parkash Ahlawat
- Crop Improvement, ICAR- Indian Institute of Wheat and Barley Research, Karnal, India
| | - Gyanendra Singh
- Crop Improvement, ICAR- Indian Institute of Wheat and Barley Research, Karnal, India
| | - Sonia Sheoran
- Crop Improvement, ICAR- Indian Institute of Wheat and Barley Research, Karnal, India.
| |
Collapse
|
2
|
Jeong SW, Lyu JI, Jeong H, Baek J, Moon JK, Lee C, Choi MG, Kim KH, Park YI. SUnSeT: spectral unmixing of hyperspectral images for phenotyping soybean seed traits. PLANT CELL REPORTS 2024; 43:164. [PMID: 38852113 PMCID: PMC11162974 DOI: 10.1007/s00299-024-03249-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 05/06/2024] [Indexed: 06/10/2024]
Abstract
KEY MESSAGE Hyperspectral features enable accurate classification of soybean seeds using linear discriminant analysis and GWAS for novel seed trait genes. Evaluating crop seed traits such as size, shape, and color is crucial for assessing seed quality and improving agricultural productivity. The introduction of the SUnSet toolbox, which employs hyperspectral sensor-derived image analysis, addresses this necessity. In a validation test involving 420 seed accessions from the Korean Soybean Core Collections, the pixel purity index algorithm identified seed- specific hyperspectral endmembers to facilitate segmentation. Various metrics extracted from ventral and lateral side images facilitated the categorization of seeds into three size groups and four shape groups. Additionally, quantitative RGB triplets representing seven seed coat colors, averaged reflectance spectra, and pigment indices were acquired. Machine learning models, trained on a dataset comprising 420 accession seeds and 199 predictors encompassing seed size, shape, and reflectance spectra, achieved accuracy rates of 95.8% for linear discriminant analysis model. Furthermore, a genome-wide association study utilizing hyperspectral features uncovered associations between seed traits and genes governing seed pigmentation and shapes. This comprehensive approach underscores the effectiveness of SUnSet in advancing precision agriculture through meticulous seed trait analysis.
Collapse
Affiliation(s)
- Seok Won Jeong
- Biological Sciences, Chungnam National University, 99 Daehagro, Youseong, Daejon, 34134, Korea
| | - Jae Il Lyu
- Gene Engineering Division, National Institute of Agricultural Sciences, 370 Nongsaengmyeongro, Jeonju, Jeollabuk-do, 54874, Korea
| | - HwangWeon Jeong
- Gene Engineering Division, National Institute of Agricultural Sciences, 370 Nongsaengmyeongro, Jeonju, Jeollabuk-do, 54874, Korea
| | - Jeongho Baek
- Gene Engineering Division, National Institute of Agricultural Sciences, 370 Nongsaengmyeongro, Jeonju, Jeollabuk-do, 54874, Korea
| | - Jung-Kyung Moon
- Crop Foundation Research Division, National Institute of Crop Sciences, 181 Hyeoksinro, Wanju, Jeollabuk-do, 55365, Korea
| | - Chaewon Lee
- Crop Cultivation and Environment Research Division, National Institute of Crop Sciences, 54 Seohoro, Suwon, Kyounggi-do, 16613, Korea
| | - Myoung-Goo Choi
- Wheat Research Team, National Institute of Crop Sciences, RDA, 181 Hyeoksinro, Wanju, Jeollabuk-do, 55365, Korea
| | - Kyoung-Hwan Kim
- Gene Engineering Division, National Institute of Agricultural Sciences, 370 Nongsaengmyeongro, Jeonju, Jeollabuk-do, 54874, Korea
| | - Youn-Il Park
- Biological Sciences, Chungnam National University, 99 Daehagro, Youseong, Daejon, 34134, Korea.
| |
Collapse
|
3
|
Zhou W, Yan Z, Zhang L. A comparative study of 11 non-linear regression models highlighting autoencoder, DBN, and SVR, enhanced by SHAP importance analysis in soybean branching prediction. Sci Rep 2024; 14:5905. [PMID: 38467662 PMCID: PMC10928191 DOI: 10.1038/s41598-024-55243-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/21/2024] [Indexed: 03/13/2024] Open
Abstract
To explore a robust tool for advancing digital breeding practices through an artificial intelligence-driven phenotype prediction expert system, we undertook a thorough analysis of 11 non-linear regression models. Our investigation specifically emphasized the significance of Support Vector Regression (SVR) and SHapley Additive exPlanations (SHAP) in predicting soybean branching. By using branching data (phenotype) of 1918 soybean accessions and 42 k SNP (Single Nucleotide Polymorphism) polymorphic data (genotype), this study systematically compared 11 non-linear regression AI models, including four deep learning models (DBN (deep belief network) regression, ANN (artificial neural network) regression, Autoencoders regression, and MLP (multilayer perceptron) regression) and seven machine learning models (e.g., SVR (support vector regression), XGBoost (eXtreme Gradient Boosting) regression, Random Forest regression, LightGBM regression, GPs (Gaussian processes) regression, Decision Tree regression, and Polynomial regression). After being evaluated by four valuation metrics: R2 (R-squared), MAE (Mean Absolute Error), MSE (Mean Squared Error), and MAPE (Mean Absolute Percentage Error), it was found that the SVR, Polynomial Regression, DBN, and Autoencoder outperformed other models and could obtain a better prediction accuracy when they were used for phenotype prediction. In the assessment of deep learning approaches, we exemplified the SVR model, conducting analyses on feature importance and gene ontology (GO) enrichment to provide comprehensive support. After comprehensively comparing four feature importance algorithms, no notable distinction was observed in the feature importance ranking scores across the four algorithms, namely Variable Ranking, Permutation, SHAP, and Correlation Matrix, but the SHAP value could provide rich information on genes with negative contributions, and SHAP importance was chosen for feature selection. The results of this study offer valuable insights into AI-mediated plant breeding, addressing challenges faced by traditional breeding programs. The method developed has broad applicability in phenotype prediction, minor QTL (quantitative trait loci) mining, and plant smart-breeding systems, contributing significantly to the advancement of AI-based breeding practices and transitioning from experience-based to data-based breeding.
Collapse
Affiliation(s)
- Wei Zhou
- Florida Agricultural and Mechanical University, Tallahassee, FL, 32307, USA.
| | - Zhengxiao Yan
- Florida State University, Tallahassee, FL, 32306, USA
| | - Liting Zhang
- Florida State University, Tallahassee, FL, 32306, USA
| |
Collapse
|
4
|
Pengphorm P, Thongrom S, Daengngam C, Duangpan S, Hussain T, Boonrat P. Optimal-Band Analysis for Chlorophyll Quantification in Rice Leaves Using a Custom Hyperspectral Imaging System. PLANTS (BASEL, SWITZERLAND) 2024; 13:259. [PMID: 38256812 PMCID: PMC10819252 DOI: 10.3390/plants13020259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/03/2024] [Accepted: 01/12/2024] [Indexed: 01/24/2024]
Abstract
Hyperspectral imaging (HSI) is a promising tool in chlorophyll quantification, providing a non-invasive method to collect important information for effective crop management. HSI contributes to food security solutions by optimising crop yields. In this study, we presented a custom HSI system specifically designed to provide a quantitative analysis of leaf chlorophyll content (LCC). To ensure precise estimation, significant wavelengths were identified using optimal-band analysis. Our research was centred on two sets of 120 leaf samples sourced from Thailand's unique Chaew Khing rice variant. The samples were subjected to (i) an analytical LCC assessment and (ii) HSI imaging for spectral reflectance data capture. A linear regression comparison of these datasets revealed that the green (575 ± 2 nm) and near-infrared (788 ± 2 nm) bands were the most outstanding performers. Notably, the green normalised difference vegetation index (GNDVI) was the most reliable during cross-validation (R2=0.78 and RMSE = 2.4 µg∙cm-2), outperforming other examined vegetable indices (VIs), such as the simple ratio (RED/GREEN) and the chlorophyll index. The potential development of a streamlined sensor dependent only on these two wavelengths is a significant outcome of identifying these two optimal bands. This innovation can be seamlessly integrated into farming landscapes or attached to UAVs, allowing real-time monitoring and rapid, targeted N management interventions.
Collapse
Affiliation(s)
- Panuwat Pengphorm
- Division of Physical Science, Faculty of Science, Prince of Songkla University, Hat Yai 90110, Songkhla, Thailand; (P.P.); (S.T.); (C.D.)
- National Astronomical Research Institute of Thailand (Public Organization), Mae Rim 50180, Chiang Mai, Thailand
| | - Sukrit Thongrom
- Division of Physical Science, Faculty of Science, Prince of Songkla University, Hat Yai 90110, Songkhla, Thailand; (P.P.); (S.T.); (C.D.)
- National Astronomical Research Institute of Thailand (Public Organization), Mae Rim 50180, Chiang Mai, Thailand
| | - Chalongrat Daengngam
- Division of Physical Science, Faculty of Science, Prince of Songkla University, Hat Yai 90110, Songkhla, Thailand; (P.P.); (S.T.); (C.D.)
- National Astronomical Research Institute of Thailand (Public Organization), Mae Rim 50180, Chiang Mai, Thailand
| | - Saowapa Duangpan
- Agricultural Innovation and Management Division, Faculty of Natural Resources, Prince of Songkla University, Hat Yai 90110, Songkhla, Thailand;
- Oil Palm Agronomical Research Center, Faculty of Natural Resources, Prince of Songkla University, Hat Yai 90110, Songkhla, Thailand
| | - Tajamul Hussain
- Hermiston Agricultural Research and Extension Center, Oregon State University, Hermiston, OR 97838, USA;
| | - Pawita Boonrat
- Faculty of Technology and Environment, Prince of Songkla University, Phuket Campus, Kathu 83120, Phuket, Thailand
| |
Collapse
|
5
|
Lozada DN, Sandhu KS, Bhatta M. Ridge regression and deep learning models for genome-wide selection of complex traits in New Mexican Chile peppers. BMC Genom Data 2023; 24:80. [PMID: 38110866 PMCID: PMC10726521 DOI: 10.1186/s12863-023-01179-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 12/05/2023] [Indexed: 12/20/2023] Open
Abstract
BACKGROUND Genomewide prediction estimates the genomic breeding values of selection candidates which can be utilized for population improvement and cultivar development. Ridge regression and deep learning-based selection models were implemented for yield and agronomic traits of 204 chile pepper genotypes evaluated in multi-environment trials in New Mexico, USA. RESULTS Accuracy of prediction differed across different models under ten-fold cross-validations, where high prediction accuracy was observed for highly heritable traits such as plant height and plant width. No model was superior across traits using 14,922 SNP markers for genomewide selection. Bayesian ridge regression had the highest average accuracy for first pod date (0.77) and total yield per plant (0.33). Multilayer perceptron (MLP) was the most superior for flowering time (0.76) and plant height (0.73), whereas the genomic BLUP model had the highest accuracy for plant width (0.62). Using a subset of 7,690 SNP loci resulting from grouping markers based on linkage disequilibrium coefficients resulted in improved accuracy for first pod date, ten pod weight, and total yield per plant, even under a relatively small training population size for MLP and random forest models. Genomic and ridge regression BLUP models were sufficient for optimal prediction accuracies for small training population size. Combining phenotypic selection and genomewide selection resulted in improved selection response for yield-related traits, indicating that integrated approaches can result in improved gains achieved through selection. CONCLUSIONS Accuracy values for ridge regression and deep learning prediction models demonstrate the potential of implementing genomewide selection for genetic improvement in chile pepper breeding programs. Ultimately, a large training data is relevant for improved genomic selection accuracy for the deep learning models.
Collapse
Affiliation(s)
- Dennis N Lozada
- Department of Plant and Environmental Sciences, New Mexico State University, Las Cruces, NM, 88003, USA.
- Chile Pepper Institute, New Mexico State University, Las Cruces, NM, 88003, USA.
| | | | | |
Collapse
|
6
|
Chen C, Powell O, Dinglasan E, Ross EM, Yadav S, Wei X, Atkin F, Deomano E, Hayes BJ. Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits. THE PLANT GENOME 2023; 16:e20390. [PMID: 37728221 DOI: 10.1002/tpg2.20390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 08/01/2023] [Accepted: 08/29/2023] [Indexed: 09/21/2023]
Abstract
Sugarcane has a complex, highly polyploid genome with multi-species ancestry. Additive models for genomic prediction of clonal performance might not capture interactions between genes and alleles from different ploidies and ancestral species. As such, genomic prediction in sugarcane presents an interesting case for machine learning (ML) methods, which are purportedly able to deal with high levels of complexity in prediction. Here, we investigated deep learning (DL) neural networks, including multilayer networks (MLP) and convolution neural networks (CNN), and an ensemble machine learning approach, random forest (RF), for genomic prediction in sugarcane. The data set used was 2912 sugarcane clones, scored for 26,086 genome wide single nucleotide polymorphism markers, with final assessment trial data for total cane harvested (TCH), commercial cane sugar (CCS), and fiber content (Fiber). The clones in the latest trial (2017) were used as a validation set. We compared prediction accuracy of these methods to genomic best linear unbiased prediction (GBLUP) extended to include dominance and epistatic effects. The prediction accuracies from GBLUP models were up to 0.37 for TCH, 0.43 for CCS, and 0.48 for Fiber, while the optimized ML models had prediction accuracies of 0.35 for TCH, 0.38 for CCS, and 0.48 for Fiber. Both RF and DL neural network models have comparable predictive ability with the additive GBLUP model but are less accurate than the extended GBLUP model.
Collapse
Affiliation(s)
- Chensong Chen
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| | - Owen Powell
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| | - Eric Dinglasan
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| | - Elizabeth M Ross
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| | - Seema Yadav
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| | | | | | | | - Ben J Hayes
- Queensland Alliance for Agriculture and Food Innovation, University of Queensland, Queensland, Australia
| |
Collapse
|
7
|
Gill HS, Brar N, Halder J, Hall C, Seabourn BW, Chen YR, St Amand P, Bernardo A, Bai G, Glover K, Turnipseed B, Sehgal SK. Multi-trait genomic selection improves the prediction accuracy of end-use quality traits in hard winter wheat. THE PLANT GENOME 2023; 16:e20331. [PMID: 37194433 DOI: 10.1002/tpg2.20331] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 02/16/2023] [Accepted: 03/01/2023] [Indexed: 05/18/2023]
Abstract
Improvement of end-use quality remains one of the most important goals in hard winter wheat (HWW) breeding. Nevertheless, the evaluation of end-use quality traits is confined to later development generations owing to resource-intensive phenotyping. Genomic selection (GS) has shown promise in facilitating selection for end-use quality; however, lower prediction accuracy (PA) for complex traits remains a challenge in GS implementation. Multi-trait genomic prediction (MTGP) models can improve PA for complex traits by incorporating information on correlated secondary traits, but these models remain to be optimized in HWW. A set of advanced breeding lines from 2015 to 2021 were genotyped with 8725 single-nucleotide polymorphisms and was used to evaluate MTGP to predict various end-use quality traits that are otherwise difficult to phenotype in earlier generations. The MTGP model outperformed the ST model with up to a twofold increase in PA. For instance, PA was improved from 0.38 to 0.75 for bake absorption and from 0.32 to 0.52 for loaf volume. Further, we compared MTGP models by including different combinations of easy-to-score traits as covariates to predict end-use quality traits. Incorporation of simple traits, such as flour protein (FLRPRO) and sedimentation weight value (FLRSDS), substantially improved the PA of MT models. Thus, the rapid low-cost measurement of traits like FLRPRO and FLRSDS can facilitate the use of GP to predict mixograph and baking traits in earlier generations and provide breeders an opportunity for selection on end-use quality traits by culling inferior lines to increase selection accuracy and genetic gains.
Collapse
Affiliation(s)
- Harsimardeep S Gill
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Navreet Brar
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Jyotirmoy Halder
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Cody Hall
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Bradford W Seabourn
- USDA-ARS, CGAHR, Hard Winter Wheat Quality Laboratory, Manhattan, Kansas, USA
| | - Yuanhong R Chen
- USDA-ARS, CGAHR, Hard Winter Wheat Quality Laboratory, Manhattan, Kansas, USA
| | - Paul St Amand
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, Kansas, USA
| | - Amy Bernardo
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, Kansas, USA
| | - Guihua Bai
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, Kansas, USA
| | - Karl Glover
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Brent Turnipseed
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| | - Sunish K Sehgal
- Department of Agronomy, Horticulture and Plant Science, South Dakota State University, Brookings, South Dakota, USA
| |
Collapse
|
8
|
Verplaetse N, Passemiers A, Arany A, Moreau Y, Raimondi D. Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease. Genome Biol 2023; 24:224. [PMID: 37798735 PMCID: PMC10552306 DOI: 10.1186/s13059-023-03064-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/20/2023] [Indexed: 10/07/2023] Open
Abstract
BACKGROUND Despite clear evidence of nonlinear interactions in the molecular architecture of polygenic diseases, linear models have so far appeared optimal in genotype-to-phenotype modeling. A key bottleneck for such modeling is that genetic data intrinsically suffers from underdetermination ([Formula: see text]). Millions of variants are present in each individual while the collection of large, homogeneous cohorts is hindered by phenotype incidence, sequencing cost, and batch effects. RESULTS We demonstrate that when we provide enough training data and control the complexity of nonlinear models, a neural network outperforms additive approaches in whole exome sequencing-based inflammatory bowel disease case-control prediction. To do so, we propose a biologically meaningful sparsified neural network architecture, providing empirical evidence for positive and negative epistatic effects present in the inflammatory bowel disease pathogenesis. CONCLUSIONS In this paper, we show that underdetermination is likely a major driver for the apparent optimality of additive modeling in clinical genetics today.
Collapse
Affiliation(s)
- Nora Verplaetse
- Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium.
| | - Antoine Passemiers
- Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Adam Arany
- Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Yves Moreau
- Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Daniele Raimondi
- Department of of Electrical Engineering, Katholieke Universiteit Leuven, Leuven, Belgium.
| |
Collapse
|
9
|
Cembrowska-Lech D, Krzemińska A, Miller T, Nowakowska A, Adamski C, Radaczyńska M, Mikiciuk G, Mikiciuk M. An Integrated Multi-Omics and Artificial Intelligence Framework for Advance Plant Phenotyping in Horticulture. BIOLOGY 2023; 12:1298. [PMID: 37887008 PMCID: PMC10603917 DOI: 10.3390/biology12101298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/27/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023]
Abstract
This review discusses the transformative potential of integrating multi-omics data and artificial intelligence (AI) in advancing horticultural research, specifically plant phenotyping. The traditional methods of plant phenotyping, while valuable, are limited in their ability to capture the complexity of plant biology. The advent of (meta-)genomics, (meta-)transcriptomics, proteomics, and metabolomics has provided an opportunity for a more comprehensive analysis. AI and machine learning (ML) techniques can effectively handle the complexity and volume of multi-omics data, providing meaningful interpretations and predictions. Reflecting the multidisciplinary nature of this area of research, in this review, readers will find a collection of state-of-the-art solutions that are key to the integration of multi-omics data and AI for phenotyping experiments in horticulture, including experimental design considerations with several technical and non-technical challenges, which are discussed along with potential solutions. The future prospects of this integration include precision horticulture, predictive breeding, improved disease and stress response management, sustainable crop management, and exploration of plant biodiversity. The integration of multi-omics and AI holds immense promise for revolutionizing horticultural research and applications, heralding a new era in plant phenotyping.
Collapse
Affiliation(s)
- Danuta Cembrowska-Lech
- Department of Physiology and Biochemistry, Institute of Biology, University of Szczecin, Felczaka 3c, 71-412 Szczecin, Poland;
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
| | - Adrianna Krzemińska
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
- Institute of Biology, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland;
| | - Tymoteusz Miller
- Polish Society of Bioinformatics and Data Science BIODATA, Popiełuszki 4c, 71-214 Szczecin, Poland; (A.K.); (T.M.)
- Institute of Marine and Environmental Sciences, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland
| | - Anna Nowakowska
- Department of Physiology and Biochemistry, Institute of Biology, University of Szczecin, Felczaka 3c, 71-412 Szczecin, Poland;
| | - Cezary Adamski
- Institute of Biology, University of Szczecin, Wąska 13, 71-415 Szczecin, Poland;
| | | | - Grzegorz Mikiciuk
- Department of Horticulture, Faculty of Environmental Management and Agriculture, West Pomeranian University of Technology in Szczecin, Słowackiego 17, 71-434 Szczecin, Poland;
| | - Małgorzata Mikiciuk
- Department of Bioengineering, Faculty of Environmental Management and Agriculture, West Pomeranian University of Technology in Szczecin, Słowackiego 17, 71-434 Szczecin, Poland;
| |
Collapse
|
10
|
Gao P, Zhao H, Luo Z, Lin Y, Feng W, Li Y, Kong F, Li X, Fang C, Wang X. SoyDNGP: a web-accessible deep learning framework for genomic prediction in soybean breeding. Brief Bioinform 2023; 24:bbad349. [PMID: 37824739 DOI: 10.1093/bib/bbad349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 09/13/2023] [Accepted: 09/14/2023] [Indexed: 10/14/2023] Open
Abstract
Soybean is a globally significant crop, playing a vital role in human nutrition and agriculture. Its complex genetic structure and wide trait variation, however, pose challenges for breeders and researchers aiming to optimize its yield and quality. Addressing this biological complexity requires innovative and accurate tools for trait prediction. In response to this challenge, we have developed SoyDNGP, a deep learning-based model that offers significant advancements in the field of soybean trait prediction. Compared to existing methods, such as DeepGS and DNNGP, SoyDNGP boasts a distinct advantage due to its minimal increase in parameter volume and superior predictive accuracy. Through rigorous performance comparison, including prediction accuracy and model complexity, SoyDNGP represents improved performance to its counterparts. Furthermore, it effectively predicted complex traits with remarkable precision, demonstrating robust performance across different sample sizes and trait complexities. We also tested the versatility of SoyDNGP across multiple crop species, including cotton, maize, rice and tomato. Our results showed its consistent and comparable performance, emphasizing SoyDNGP's potential as a versatile tool for genomic prediction across a broad range of crops. To enhance its accessibility to users without extensive programming experience, we designed a user-friendly web server, available at http://xtlab.hzau.edu.cn/SoyDNGP. The server provides two features: 'Trait Lookup', offering users the ability to access pre-existing trait predictions for over 500 soybean accessions, and 'Trait Prediction', allowing for the upload of VCF files for trait estimation. By providing a high-performing, accessible tool for trait prediction, SoyDNGP opens up new possibilities in the quest for optimized soybean breeding.
Collapse
Affiliation(s)
- Pengfei Gao
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Haonan Zhao
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Zheng Luo
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Yifan Lin
- Hubei Hongshan Laboratory, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Wanjie Feng
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Yaling Li
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Fanjiang Kong
- Guangzhou Key Laboratory of Crop Gene Editing, Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
| | - Xia Li
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| | - Chao Fang
- Guangzhou Key Laboratory of Crop Gene Editing, Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
| | - Xutong Wang
- National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
- Hubei Hongshan Laboratory, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
| |
Collapse
|
11
|
Yan Q, Fruzangohar M, Taylor J, Gong D, Walter J, Norman A, Shi JQ, Coram T. Improved genomic prediction using machine learning with Variational Bayesian sparsity. PLANT METHODS 2023; 19:96. [PMID: 37660084 PMCID: PMC10474716 DOI: 10.1186/s13007-023-01073-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 08/22/2023] [Indexed: 09/04/2023]
Abstract
BACKGROUND Genomic prediction has become a powerful modelling tool for assessing line performance in plant and livestock breeding programmes. Among the genomic prediction modelling approaches, linear based models have proven to provide accurate predictions even when the number of genetic markers exceeds the number of data samples. However, breeding programmes are now compiling data from large numbers of lines and test environments for analyses, rendering these approaches computationally prohibitive. Machine learning (ML) now offers a solution to this problem through the construction of fully connected deep learning architectures and high parallelisation of the predictive task. However, the fully connected nature of these architectures immediately generates an over-parameterisation of the network that needs addressing for efficient and accurate predictions. RESULTS In this research we explore the use of an ML architecture governed by variational Bayesian sparsity in its initial layers that we have called VBS-ML. The use of VBS-ML provides a mechanism for feature selection of important markers linked to the trait, immediately reducing the network over-parameterisation. Selected markers then propagate to the remaining fully connected feed-forward components of the ML network to form the final genomic prediction. We illustrated the approach with four large Australian wheat breeding data sets that range from 2665 lines to 10375 lines genotyped across a large set of markers. For all data sets, the use of the VBS-ML architecture improved genomic prediction accuracy over legacy linear based modelling approaches. CONCLUSIONS An ML architecture governed under a variational Bayesian paradigm was shown to improve genomic prediction accuracy over legacy modelling approaches. This VBS-ML approach can be used to dramatically decrease the parameter burden on the network and provide a computationally feasible approach for improving genomic prediction conducted with large breeding population numbers and genetic markers.
Collapse
Affiliation(s)
- Qingsen Yan
- School of Computer Science, Northwestern Polytechnical University, Xi’an, China
| | - Mario Fruzangohar
- School of Food, Agriculture and Wine, University of Adelaide, Adelaide, Australia
| | - Julian Taylor
- School of Food, Agriculture and Wine, University of Adelaide, Adelaide, Australia
| | - Dong Gong
- School of Computer Science and Engineering, The University of New South Wales, Sydney, Australia
| | - James Walter
- Australian Grains Technologies, Roseworthy, Australia
| | - Adam Norman
- Australian Grains Technologies, Roseworthy, Australia
| | - Javen Qinfeng Shi
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
| | - Tristan Coram
- Australian Grains Technologies, Roseworthy, Australia
| |
Collapse
|
12
|
Mora-Poblete F, Maldonado C, Henrique L, Uhdre R, Scapim CA, Mangolim CA. Multi-trait and multi-environment genomic prediction for flowering traits in maize: a deep learning approach. FRONTIERS IN PLANT SCIENCE 2023; 14:1153040. [PMID: 37593046 PMCID: PMC10428628 DOI: 10.3389/fpls.2023.1153040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 07/12/2023] [Indexed: 08/19/2023]
Abstract
Maize (Zea mays L.), the third most widely cultivated cereal crop in the world, plays a critical role in global food security. To improve the efficiency of selecting superior genotypes in breeding programs, researchers have aimed to identify key genomic regions that impact agronomic traits. In this study, the performance of multi-trait, multi-environment deep learning models was compared to that of Bayesian models (Markov Chain Monte Carlo generalized linear mixed models (MCMCglmm), Bayesian Genomic Genotype-Environment Interaction (BGGE), and Bayesian Multi-Trait and Multi-Environment (BMTME)) in terms of the prediction accuracy of flowering-related traits (Anthesis-Silking Interval: ASI, Female Flowering: FF, and Male Flowering: MF). A tropical maize panel of 258 inbred lines from Brazil was evaluated in three sites (Cambira-2018, Sabaudia-2018, and Iguatemi-2020 and 2021) using approximately 290,000 single nucleotide polymorphisms (SNPs). The results demonstrated a 14.4% increase in prediction accuracy when employing multi-trait models compared to the use of a single trait in a single environment approach. The accuracy of predictions also improved by 6.4% when using a single trait in a multi-environment scheme compared to using multi-trait analysis. Additionally, deep learning models consistently outperformed Bayesian models in both single and multiple trait and environment approaches. A complementary genome-wide association study identified associations with 26 candidate genes related to flowering time traits, and 31 marker-trait associations were identified, accounting for 37%, 37%, and 22% of the phenotypic variation of ASI, FF and MF, respectively. In conclusion, our findings suggest that deep learning models have the potential to significantly improve the accuracy of predictions, regardless of the approach used and provide support for the efficacy of this method in genomic selection for flowering-related traits in tropical maize.
Collapse
Affiliation(s)
| | - Carlos Maldonado
- Centro de Genómica y Bioinformática, Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| | - Luma Henrique
- Department of Agronomy, State University of Maringá, Paraná, Brazil
| | - Renan Uhdre
- Department of Agronomy, State University of Maringá, Paraná, Brazil
| | | | | |
Collapse
|
13
|
Bhat JA, Feng X, Mir ZA, Raina A, Siddique KHM. Recent advances in artificial intelligence, mechanistic models, and speed breeding offer exciting opportunities for precise and accelerated genomics-assisted breeding. PHYSIOLOGIA PLANTARUM 2023; 175:e13969. [PMID: 37401892 DOI: 10.1111/ppl.13969] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 06/11/2023] [Accepted: 06/27/2023] [Indexed: 07/05/2023]
Abstract
Given the challenges of population growth and climate change, there is an urgent need to expedite the development of high-yielding stress-tolerant crop cultivars. While traditional breeding methods have been instrumental in ensuring global food security, their efficiency, precision, and labour intensiveness have become increasingly inadequate to address present and future challenges. Fortunately, recent advances in high-throughput phenomics and genomics-assisted breeding (GAB) provide a promising platform for enhancing crop cultivars with greater efficiency. However, several obstacles must be overcome to optimize the use of these techniques in crop improvement, such as the complexity of phenotypic analysis of big image data. In addition, the prevalent use of linear models in genome-wide association studies (GWAS) and genomic selection (GS) fails to capture the nonlinear interactions of complex traits, limiting their applicability for GAB and impeding crop improvement. Recent advances in artificial intelligence (AI) techniques have opened doors to nonlinear modelling approaches in crop breeding, enabling the capture of nonlinear and epistatic interactions in GWAS and GS and thus making this variation available for GAB. While statistical and software challenges persist in AI-based models, they are expected to be resolved soon. Furthermore, recent advances in speed breeding have significantly reduced the time (3-5-fold) required for conventional breeding. Thus, integrating speed breeding with AI and GAB could improve crop cultivar development within a considerably shorter timeframe while ensuring greater accuracy and efficiency. In conclusion, this integrated approach could revolutionize crop breeding paradigms and safeguard food production in the face of population growth and climate change.
Collapse
Affiliation(s)
| | - Xianzhong Feng
- Zhejiang Lab, Hangzhou, China
- Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, China
| | - Zahoor A Mir
- ICAR-National Bureau of Plant Genetic Resources, New Delhi, India
| | - Aamir Raina
- Department of Botany, Faculty of Life Sciences, Aligarh Muslim University, Aligarh, India
| | - Kadambot H M Siddique
- The UWA Institute of Agriculture and School of Agriculture & Environment, The University of Western Australia, Perth, Western Australia, Australia
| |
Collapse
|
14
|
Hashem M, Sandhu KS, Ismail SM, Börner A, Sallam A. Validation and marker-assisted selection of DArT-genomic regions associated with wheat yield-related traits under normal and drought conditions. Front Genet 2023; 14:1195566. [PMID: 37292145 PMCID: PMC10245129 DOI: 10.3389/fgene.2023.1195566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 05/10/2023] [Indexed: 06/10/2023] Open
Abstract
Quantitative trait loci (QTL) is one of the most important steps in marker-assisted selection. Few studies have validated quantitative trait loci for marker-assisted selection of yield traits under drought stress conditions in wheat. A set of 138 highly diverse wheat genotypes were tested under normal and drought stress conditions for 2 years. Plant height, heading date, spike length, grain number per spike, grain yield per spike, and 1000-kernel weight were scored. High genetic variation was found among genotypes in all traits scored under both conditions in the 2 years. The same panel was genotyped using a diversity-array technology (DArT) marker, and a genome-wide association study was performed to find alleles associated with yield traits under all conditions. A set of 191 significant DArT markers were identified in this study. The results of the genome-wide association study revealed eight common markers in wheat that were significantly associated with the same traits under both conditions in the 2 years. Out of the eight markers, seven were located on the D genome except one marker. Four validated markers were located on the 3D chromosome and found in complete linkage disequilibrium. Moreover, these four markers were significantly associated with the heading date under both conditions and the grain yield per spike under drought stress condition in the 2 years. This high-linkage disequilibrium genomic region was located within the TraesCS3D02G002400 gene model. Furthermore, of the eight validated markers, seven were previously reported to be associated with yield traits under normal and drought conditions. The results of this study provided very promising DArT markers that can be used for marker-assisted selection to genetically improve yield traits under normal and drought conditions.
Collapse
Affiliation(s)
- Mostafa Hashem
- Department of Genetics, Faculty of Agriculture, Assiut University, Assuit, Egypt
| | | | - Saleh M. Ismail
- Soils and Water Department, Faculty of Agriculture, Assiut University, Assiut, Egypt
| | - Andreas Börner
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
| | - Ahmed Sallam
- Department of Genetics, Faculty of Agriculture, Assiut University, Assuit, Egypt
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
| |
Collapse
|
15
|
Anilkumar C, Muhammed Azharudheen TP, Sah RP, Sunitha NC, Devanna BN, Marndi BC, Patra BC. Gene based markers improve precision of genome-wide association studies and accuracy of genomic predictions in rice breeding. Heredity (Edinb) 2023; 130:335-345. [PMID: 36792661 PMCID: PMC10163052 DOI: 10.1038/s41437-023-00599-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 02/17/2023] Open
Abstract
It is hypothesized that the genome-wide genic markers may increase the prediction accuracy of genomic selection for quantitative traits. To test this hypothesis, a set of candidate gene-based markers for yield and grain traits-related genes cloned across the rice genome were custom-designed. A multi-model, multi-locus genome-wide association study (GWAS) was performed using new genic markers developed to test their effectiveness for gene discovery. Two multi-locus models, FarmCPU and mrMLM, along with a single-locus mixed linear model (MLM), identified 28 significant marker-trait associations. These associations revealed novel causative alleles for grain weight and pleiotropic associations with other traits. For instance, the marker YD91 derived from the gene OsAAP3 on chromosome 1 was consistently associated with grain weight, while the gene has a significant effect on grain yield. Furthermore, nine genomic selection methods, including regression-based and machine learning-based models, were used to predict grain weight using a leave-one-out five-fold cross-validation approach to optimize the genomic selection model with genic markers. Among nine prediction models, Kernel Hilbert Space Regression (RKHS) is the best among regression-based models, and Random Forest Regression (RFR) is the best among machine learning-based models. Genomic prediction accuracies with and without GWAS significant markers were compared to assess the effectiveness of markers. The rapid decreases in prediction accuracy upon dropping GWAS significant markers indicate the effectiveness of new genic markers in genomic selection. Apart from that, the candidate gene-based markers were found to be more effective in genomic selection programs for better accuracy.
Collapse
|
16
|
Massahiro Yassue R, Galli G, James Chen C, Fritsche‐Neto R, Morota G. Genome-wide association analysis of hyperspectral reflectance data to dissect the genetic architecture of growth-related traits in maize under plant growth-promoting bacteria inoculation. PLANT DIRECT 2023; 7:e492. [PMID: 37102161 PMCID: PMC10123960 DOI: 10.1002/pld3.492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 03/09/2023] [Accepted: 03/13/2023] [Indexed: 06/19/2023]
Abstract
Plant growth-promoting bacteria (PGPB) may be of use for increasing crop yield and plant resilience to biotic and abiotic stressors. Using hyperspectral reflectance data to assess growth-related traits may shed light on the underlying genetics as such data can help assess biochemical and physiological traits. This study aimed to integrate hyperspectral reflectance data with genome-wide association analyses to examine maize growth-related traits under PGPB inoculation. A total of 360 inbred maize lines with 13,826 single nucleotide polymorphisms (SNPs) were evaluated with and without PGPB inoculation; 150 hyperspectral wavelength reflectances at 386-1021 nm and 131 hyperspectral indices were used in the analysis. Plant height, stalk diameter, and shoot dry mass were measured manually. Overall, hyperspectral signatures produced similar or higher genomic heritability estimates than those of manually measured phenotypes, and they were genetically correlated with manually measured phenotypes. Furthermore, several hyperspectral reflectance values and spectral indices were identified by genome-wide association analysis as potential markers for growth-related traits under PGPB inoculation. Eight SNPs were detected, which were commonly associated with manually measured and hyperspectral phenotypes. Different genomic regions were found for plant growth and hyperspectral phenotypes between with and without PGPB inoculation. Moreover, the hyperspectral phenotypes were associated with genes previously reported as candidates for nitrogen uptake efficiency, tolerance to abiotic stressors, and kernel size. In addition, a Shiny web application was developed to explore multiphenotype genome-wide association results interactively. Taken together, our results demonstrate the usefulness of hyperspectral-based phenotyping for studying maize growth-related traits in response to PGPB inoculation.
Collapse
Affiliation(s)
- Rafael Massahiro Yassue
- Department of Genetics, ‘Luiz de Queiroz’ College of AgricultureUniversity of São PauloSão PauloBrazil
- School of Animal SciencesVirginia Polytechnic Institute and State UniversityBlacksburgVirginiaUSA
| | - Giovanni Galli
- Department of Genetics, ‘Luiz de Queiroz’ College of AgricultureUniversity of São PauloSão PauloBrazil
| | - Chun‐Peng James Chen
- School of Animal SciencesVirginia Polytechnic Institute and State UniversityBlacksburgVirginiaUSA
- Center for Advanced Innovation in AgricultureVirginia Polytechnic Institute and State UniversityBlacksburgVirginiaUSA
| | - Roberto Fritsche‐Neto
- Department of Genetics, ‘Luiz de Queiroz’ College of AgricultureUniversity of São PauloSão PauloBrazil
- Quantitative Genetics and Biometrics ClusterInternational Rice Research InstituteLos BañosPhilippines
| | - Gota Morota
- School of Animal SciencesVirginia Polytechnic Institute and State UniversityBlacksburgVirginiaUSA
- Center for Advanced Innovation in AgricultureVirginia Polytechnic Institute and State UniversityBlacksburgVirginiaUSA
| |
Collapse
|
17
|
Bisht A, Saini DK, Kaur B, Batra R, Kaur S, Kaur I, Jindal S, Malik P, Sandhu PK, Kaur A, Gill BS, Wani SH, Kaur B, Mir RR, Sandhu KS, Siddique KHM. Multi-omics assisted breeding for biotic stress resistance in soybean. Mol Biol Rep 2023; 50:3787-3814. [PMID: 36692674 DOI: 10.1007/s11033-023-08260-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/09/2023] [Indexed: 01/25/2023]
Abstract
Biotic stress is a critical factor limiting soybean growth and development. Soybean responses to biotic stresses such as insects, nematodes, fungal, bacterial, and viral pathogens are governed by complex regulatory and defense mechanisms. Next-generation sequencing has availed research techniques and strategies in genomics and post-genomics. This review summarizes the available information on marker resources, quantitative trait loci, and marker-trait associations involved in regulating biotic stress responses in soybean. We discuss the differential expression of related genes and proteins reported in different transcriptomics and proteomics studies and the role of signaling pathways and metabolites reported in metabolomic studies. Recent advances in omics technologies offer opportunities to reshape and improve biotic stress resistance in soybean by altering gene regulation and/or other regulatory networks. We suggest using 'integrated omics' to precisely understand how soybean responds to different biotic stresses. We also discuss the potential challenges of integrating multi-omics for the functional analysis of genes and their regulatory networks and the development of biotic stress-resistant cultivars. This review will help direct soybean breeding programs to develop resistance against different biotic stresses.
Collapse
Affiliation(s)
- Ashita Bisht
- Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
- CSK Himachal Pradesh Krishi Vishvavidyalaya, Highland Agricultural Research and Extension Centre, 175142, Kukumseri, Lahaul and Spiti, India
| | - Dinesh Kumar Saini
- Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India.
| | - Baljeet Kaur
- Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
| | - Ritu Batra
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, 25004, Meerut, India
| | - Sandeep Kaur
- Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
| | - Ishveen Kaur
- Agriculture, Environmental and Sustainability Sciences, College of sciences, University of Texas Rio Grande Valley, 78539, Edinburg, TX, USA
| | - Suruchi Jindal
- Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
| | - Palvi Malik
- , Gurdev Singh Khush Institute of Genetics, Plant Breeding and Biotechnology, Punjab Agricultural University,, 141004, Ludhiana, India
| | - Pawanjit Kaur Sandhu
- Department of Chemistry, University of British Columbia, V1V 1V7, Okanagan, Kelowna, Canada
| | - Amandeep Kaur
- Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
| | - Balwinder Singh Gill
- Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
| | - Shabir Hussain Wani
- MRCFC Khudwani, Sher-e-Kashmir University of Agricultural Sciences and Technology, Kashmir, Shalimar, India
| | - Balwinder Kaur
- Department of Entomology, UF/IFAS Research and Education Center, 33430, Belle Glade, Florida, USA
| | - Reyazul Rouf Mir
- Division of Genetics and Plant Breeding, Faculty of Agriculture, SKUAST-Kashmir, 193201, India
| | - Karansher Singh Sandhu
- Department of Crop and Soil Sciences, Washington State University, 99163, Pullman, WA, USA.
| | - Kadambot H M Siddique
- The UWA Institute of Agriculture, The University of Western Australia, 6001, Perth, WA, Australia.
| |
Collapse
|
18
|
Liang M, Cao S, Deng T, Du L, Li K, An B, Du Y, Xu L, Zhang L, Gao X, Li J, Guo P, Gao H. MAK: a machine learning framework improved genomic prediction via multi-target ensemble regressor chains and automatic selection of assistant traits. Brief Bioinform 2023; 24:7031157. [PMID: 36752363 DOI: 10.1093/bib/bbad043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 01/13/2023] [Accepted: 01/20/2023] [Indexed: 02/09/2023] Open
Abstract
Incorporating the genotypic and phenotypic of the correlated traits into the multi-trait model can significantly improve the prediction accuracy of the target trait in animal and plant breeding, as well as human genetics. However, in most cases, the phenotypic information of the correlated and target trait of the individual to be evaluated was null simultaneously, particularly for the newborn. Therefore, we propose a machine learning framework, MAK, to improve the prediction accuracy of the target trait by constructing the multi-target ensemble regression chains and selecting the assistant trait automatically, which predicted the genomic estimated breeding values of the target trait using genotypic information only. The prediction ability of MAK was significantly more robust than the genomic best linear unbiased prediction, BayesB, BayesRR and the multi trait Bayesian method in the four real animal and plant datasets, and the computational efficiency of MAK was roughly 100 times faster than BayesB and BayesRR.
Collapse
Affiliation(s)
- Mang Liang
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Sheng Cao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Tianyu Deng
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lili Du
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Keanning Li
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Bingxing An
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Yueying Du
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lingyang Xu
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Lupei Zhang
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Xue Gao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | - Junya Li
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| | | | - Huijiang Gao
- Chinese Academy of Agricultural Sciences Institute of Animal Science
| |
Collapse
|
19
|
Kumar M, Kumar S, Sandhu KS, Kumar N, Saripalli G, Prakash R, Nambardar A, Sharma H, Gautam T, Balyan HS, Gupta PK. GWAS and genomic prediction for pre-harvest sprouting tolerance involving sprouting score and two other related traits in spring wheat. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2023; 43:14. [PMID: 37313293 PMCID: PMC10248620 DOI: 10.1007/s11032-023-01357-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 01/26/2023] [Indexed: 06/15/2023]
Abstract
In wheat, a genome-wide association study (GWAS) and genomic prediction (GP) analysis were conducted for pre-harvest sprouting (PHS) tolerance and two of its related traits. For this purpose, an association panel of 190 accessions was phenotyped for PHS (using sprouting score), falling number, and grain color over two years and genotyped with 9904 DArTseq based SNP markers. GWAS for main-effect quantitative trait nucleotides (M-QTNs) using three different models (CMLM, SUPER, and FarmCPU) and epistatic QTNs (E-QTNs) using PLINK were performed. A total of 171 M-QTNs (CMLM, 47; SUPER, 70; FarmCPU, 54) for all three traits, and 15 E-QTNs involved in 20 first-order epistatic interactions were identified. Some of the above QTNs overlapped the previously reported QTLs, MTAs, and cloned genes, allowing delineating 26 PHS-responsive genomic regions that spread over 16 wheat chromosomes. As many as 20 definitive and stable QTNs were considered important for use in marker-assisted recurrent selection (MARS). The gene, TaPHS1, for PHS tolerance (PHST) associated with one of the QTNs was also validated using the KASP assay. Some of the M-QTNs were shown to have a key role in the abscisic acid pathway involved in PHST. Genomic prediction accuracies (based on the cross-validation approach) using three different models ranged from 0.41 to 0.55, which are comparable to the results of previous studies. In summary, the results of the present study improved our understanding of the genetic architecture of PHST and its related traits in wheat and provided novel genomic resources for wheat breeding based on MARS and GP. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-023-01357-5.
Collapse
Affiliation(s)
- Manoj Kumar
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Sachin Kumar
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | | | - Neeraj Kumar
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC USA
| | - Gautam Saripalli
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
- Department of Plant Science and Landscape Architecture, University of Maryland, College Park, MD USA
| | - Ram Prakash
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Akash Nambardar
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Hemant Sharma
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Tinku Gautam
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Harindra Singh Balyan
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| | - Pushpendra Kumar Gupta
- Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, Meerut, UP India
| |
Collapse
|
20
|
Abstract
Over the past decade, advances in plant genotyping have been critical in enabling the identification of genetic diversity, in understanding evolution, and in dissecting important traits in both crops and native plants. The widespread popularity of single-nucleotide polymorphisms (SNPs) has prompted significant improvements to SNP-based genotyping, including SNP arrays, genotyping by sequencing, and whole-genome resequencing. More recent approaches, including genotyping structural variants, utilizing pangenomes to capture species-wide genetic diversity and exploiting machine learning to analyze genotypic data sets, are pushing the boundaries of what plant genotyping can offer. In this chapter, we highlight these innovations and discuss how they will accelerate and advance future genotyping efforts.
Collapse
|
21
|
Singh J, Chhabra B, Raza A, Yang SH, Sandhu KS. Important wheat diseases in the US and their management in the 21st century. FRONTIERS IN PLANT SCIENCE 2023; 13:1010191. [PMID: 36714765 PMCID: PMC9877539 DOI: 10.3389/fpls.2022.1010191] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 11/28/2022] [Indexed: 05/27/2023]
Abstract
Wheat is a crop of historical significance, as it marks the turning point of human civilization 10,000 years ago with its domestication. Due to the rapid increase in population, wheat production needs to be increased by 50% by 2050 and this growth will be mainly based on yield increases, as there is strong competition for scarce productive arable land from other sectors. This increasing demand can be further achieved using sustainable approaches including integrated disease pest management, adaption to warmer climates, less use of water resources and increased frequency of abiotic stress tolerances. Out of 200 diseases of wheat, 50 cause economic losses and are widely distributed. Each year, about 20% of wheat is lost due to diseases. Some major wheat diseases are rusts, smut, tan spot, spot blotch, fusarium head blight, common root rot, septoria blotch, powdery mildew, blast, and several viral, nematode, and bacterial diseases. These diseases badly impact the yield and cause mortality of the plants. This review focuses on important diseases of the wheat present in the United States, with comprehensive information of causal organism, economic damage, symptoms and host range, favorable conditions, and disease management strategies. Furthermore, major genetic and breeding efforts to control and manage these diseases are discussed. A detailed description of all the QTLs, genes reported and cloned for these diseases are provided in this review. This study will be of utmost importance to wheat breeding programs throughout the world to breed for resistance under changing environmental conditions.
Collapse
Affiliation(s)
- Jagdeep Singh
- Department of Crop, Soil & Environmental Sciences, Auburn University, Auburn, AL, United States
| | - Bhavit Chhabra
- Department of Plant Science and Landscape Architecture, University of Maryland, College Park, MD, United States
| | - Ali Raza
- College of Agriculture, Oil Crops Research Institute, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Seung Hwan Yang
- Department of Integrative Biotechnology, Chonnam National University, Yeosu, Republic of Korea
| | | |
Collapse
|
22
|
Jubair S, Domaratzki M. Crop genomic selection with deep learning and environmental data: A survey. Front Artif Intell 2023; 5:1040295. [PMID: 36703955 PMCID: PMC9871498 DOI: 10.3389/frai.2022.1040295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open
Abstract
Machine learning techniques for crop genomic selections, especially for single-environment plants, are well-developed. These machine learning models, which use dense genome-wide markers to predict phenotype, routinely perform well on single-environment datasets, especially for complex traits affected by multiple markers. On the other hand, machine learning models for predicting crop phenotype, especially deep learning models, using datasets that span different environmental conditions, have only recently emerged. Models that can accept heterogeneous data sources, such as temperature, soil conditions and precipitation, are natural choices for modeling GxE in multi-environment prediction. Here, we review emerging deep learning techniques that incorporate environmental data directly into genomic selection models.
Collapse
Affiliation(s)
- Sheikh Jubair
- Department of Computer Science, University of Manitoba, Winnipeg, MB, Canada,*Correspondence: Sheikh Jubair ✉
| | - Mike Domaratzki
- Department of Computer Science, University of Western Ontario, London, ON, Canada
| |
Collapse
|
23
|
Vu NT, Phuc TH, Nguyen NH, Van Sang N. Effects of common full-sib families on accuracy of genomic prediction for tagging weight in striped catfish Pangasianodon hypophthalmus. Front Genet 2023; 13:1081246. [PMID: 36685869 PMCID: PMC9845282 DOI: 10.3389/fgene.2022.1081246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 12/06/2022] [Indexed: 01/06/2023] Open
Abstract
Common full-sib families (c 2 ) make up a substantial proportion of total phenotypic variation in traits of commercial importance in aquaculture species and omission or inclusion of the c 2 resulted in possible changes in genetic parameter estimates and re-ranking of estimated breeding values. However, the impacts of common full-sib families on accuracy of genomic prediction for commercial traits of economic importance are not well known in many species, including aquatic animals. This research explored the impacts of common full-sib families on accuracy of genomic prediction for tagging weight in a population of striped catfish comprising 11,918 fish traced back to the base population (four generations), in which 560 individuals had genotype records of 14,154 SNPs. Our single step genomic best linear unbiased prediction (ssGLBUP) showed that the accuracy of genomic prediction for tagging weight was reduced by 96.5%-130.3% when the common full-sib families were included in statistical models. The reduction in the prediction accuracy was to a smaller extent in multivariate analysis than in univariate models. Imputation of missing genotypes somewhat reduced the upward biases in the prediction accuracy for tagging weight. It is therefore suggested that genomic evaluation models for traits recorded during the early phase of growth development should account for the common full-sib families to minimise possible biases in the accuracy of genomic prediction and hence, selection response.
Collapse
Affiliation(s)
- Nguyen Thanh Vu
- School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD, Australia,Center for Bio-Innovation, University of the Sunshine Coast, Maroochydore, QLD, Australia,Research Institute for Aquaculture No. 2, Ho Chi Minh City, Vietnam
| | - Tran Huu Phuc
- Research Institute for Aquaculture No. 2, Ho Chi Minh City, Vietnam
| | - Nguyen Hong Nguyen
- School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD, Australia,Center for Bio-Innovation, University of the Sunshine Coast, Maroochydore, QLD, Australia,*Correspondence: Nguyen Hong Nguyen, ; Nguyen Van Sang,
| | - Nguyen Van Sang
- Research Institute for Aquaculture No. 2, Ho Chi Minh City, Vietnam,*Correspondence: Nguyen Hong Nguyen, ; Nguyen Van Sang,
| |
Collapse
|
24
|
Nazzicari N, Biscarini F. Stacked kinship CNN vs. GBLUP for genomic predictions of additive and complex continuous phenotypes. Sci Rep 2022; 12:19889. [PMID: 36400808 PMCID: PMC9674857 DOI: 10.1038/s41598-022-24405-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 11/15/2022] [Indexed: 11/19/2022] Open
Abstract
Deep learning is impacting many fields of data science with often spectacular results. However, its application to whole-genome predictions in plant and animal science or in human biology has been rather limited, with mostly underwhelming results. While most works focus on exploring alternative network architectures, in this study we propose an innovative representation of marker genotype data and tested it against the GBLUP (Genomic BLUP) benchmark with linear and nonlinear phenotypes. From publicly available cattle SNP genotype data, different types of genomic kinship matrices are stacked together in a 3D pile from where 2D grayscale slices are extracted and fed to a deep convolutional neural network (DNN). We simulated nine phenotype scenarios with combinations of additivity, dominance and epistasis, and compared the DNN to GBLUP-A (computed using only the additive kinship matrix) and GBLUP-optim (additive, dominance, and epistasis kinship matrices, as needed). Results varied depending on the accuracy metric employed, with DNN performing better in terms of root mean squared error (1-12% lower than GBLUP-A; 1-9% lower than GBLUP-optim) but worse in terms of Pearson's correlation (0.505 for DNN compared to 0.672 and 0.669 of GBLUP-A and GBLUP-optim for fully additive case; 0.274 for DNN, 0.279 for GBLUP-A, and 0.477 for GBLUP-optim for fully dominant case). The proposed approach offers a basis to explore further the application of DNN to tabular data in whole-genome predictions.
Collapse
Affiliation(s)
- Nelson Nazzicari
- CREA Council for Agricultural Research and Analysis of Agricultural Economics, Research Centre for Animal Production and Aquaculture, Viale Piacenza 29, 26900 Lodi, Italy
| | - Filippo Biscarini
- grid.510304.3CNR: National Research Council, Institute of Agricultural Biology and Biotechnology, Via Bassini 15, Milan, 20133 Italy
| |
Collapse
|
25
|
Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022; 15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]
Abstract
The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.
Collapse
Affiliation(s)
- Yunbi Xu
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; CIMMYT-China Tropical Maize Research Center, School of Food Science and Engineering, Foshan University, Foshan, Guangdong 528231, China; Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China.
| | - Xingping Zhang
- Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China
| | - Huihui Li
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan 572024, China
| | - Hongjian Zheng
- CIMMYT-China Specialty Maize Research Center, Shanghai Academy of Agricultural Sciences, Shanghai 201400, China
| | - Jianan Zhang
- MolBreeding Biotechnology Co., Ltd., Shijiazhuang, Hebei 050035, China
| | - Michael S Olsen
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Rajeev K Varshney
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, Australia
| | - Boddupalli M Prasanna
- CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya
| | - Qian Qian
- Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
26
|
John M, Haselbeck F, Dass R, Malisi C, Ricca P, Dreischer C, Schultheiss SJ, Grimm DG. A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species. FRONTIERS IN PLANT SCIENCE 2022; 13:932512. [PMID: 36407627 PMCID: PMC9673477 DOI: 10.3389/fpls.2022.932512] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection is an integral tool for breeders to accurately select plants directly from genotype data leading to faster and more resource-efficient breeding programs. Several prediction methods have been established in the last few years. These range from classical linear mixed models to complex non-linear machine learning approaches, such as Support Vector Regression, and modern deep learning-based architectures. Many of these methods have been extensively evaluated on different crop species with varying outcomes. In this work, our aim is to systematically compare 12 different phenotype prediction models, including basic genomic selection methods to more advanced deep learning-based techniques. More importantly, we assess the performance of these models on simulated phenotype data as well as on real-world data from Arabidopsis thaliana and two breeding datasets from soy and corn. The synthetic phenotypic data allow us to analyze all prediction models and especially the selected markers under controlled and predefined settings. We show that Bayes B and linear regression models with sparsity constraints perform best under different simulation settings with respect to explained variance. Further, we can confirm results from other studies that there is no superiority of more complex neural network-based architectures for phenotype prediction compared to well-established methods. However, on real-world data, for which several prediction models yield comparable results with slight advantages for Elastic Net, this picture is less clear, suggesting that there is a lot of room for future research.
Collapse
Affiliation(s)
- Maura John
- Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing, Germany
- Weihenstephan-Triesdorf University of Applied Sciences, Bioinformatics, Straubing, Germany
| | - Florian Haselbeck
- Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing, Germany
- Weihenstephan-Triesdorf University of Applied Sciences, Bioinformatics, Straubing, Germany
| | | | | | | | | | | | - Dominik G. Grimm
- Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Bioinformatics, Straubing, Germany
- Weihenstephan-Triesdorf University of Applied Sciences, Bioinformatics, Straubing, Germany
- Technical University of Munich, Department of Informatics, Garching, Germany
| |
Collapse
|
27
|
Anilkumar C, Sunitha NC, Devate NB, Ramesh S. Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review. PLANTA 2022; 256:87. [PMID: 36149531 DOI: 10.1007/s00425-022-03996-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 09/11/2022] [Indexed: 06/16/2023]
Abstract
Genomic selection and its importance in crop breeding. Integration of GS with new breeding tools and developing SOP for GS to achieve maximum genetic gain with low cost and time. The success of conventional breeding approaches is not sufficient to meet the demand of a growing population for nutritious food and other plant-based products. Whereas, marker assisted selection (MAS) is not efficient in capturing all the favorable alleles responsible for economic traits in the process of crop improvement. Genomic selection (GS) developed in livestock breeding and then adapted to plant breeding promised to overcome the drawbacks of MAS and significantly improve complicated traits controlled by gene/QTL with small effects. Large-scale deployment of GS in important crops, as well as simulation studies in a variety of contexts, addressed G × E interaction effects and non-additive effects, as well as lowering breeding costs and time. The current study provides a complete overview of genomic selection, its process, and importance in modern plant breeding, along with insights into its application. GS has been implemented in the improvement of complex traits including tolerance to biotic and abiotic stresses. Furthermore, this review hypothesises that using GS in conjunction with other crop improvement platforms accelerates the breeding process to increase genetic gain. The objective of this review is to highlight the development of an appropriate GS model, the global open source network for GS, and trans-disciplinary approaches for effective accelerated crop improvement. The current study focused on the application of data science, including machine learning and deep learning tools, to enhance the accuracy of prediction models. Present study emphasizes on developing plant breeding strategies centered on GS combined with routine conventional breeding principles by developing GS-SOP to achieve enhanced genetic gain.
Collapse
Affiliation(s)
- C Anilkumar
- ICAR-National Rice Research Institute, Cuttack, India
| | - N C Sunitha
- University of Agricultural Sciences, Bangalore, India
| | | | - S Ramesh
- University of Agricultural Sciences, Bangalore, India.
| |
Collapse
|
28
|
Sandhu KS, Shiv A, Kaur G, Meena MR, Raja AK, Vengavasi K, Mall AK, Kumar S, Singh PK, Singh J, Hemaprabha G, Pathak AD, Krishnappa G, Kumar S. Integrated Approach in Genomic Selection to Accelerate Genetic Gain in Sugarcane. PLANTS 2022; 11:plants11162139. [PMID: 36015442 PMCID: PMC9412483 DOI: 10.3390/plants11162139] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 08/08/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022]
Abstract
Marker-assisted selection (MAS) has been widely used in the last few decades in plant breeding programs for the mapping and introgression of genes for economically important traits, which has enabled the development of a number of superior cultivars in different crops. In sugarcane, which is the most important source for sugar and bioethanol, marker development work was initiated long ago; however, marker-assisted breeding in sugarcane has been lagging, mainly due to its large complex genome, high levels of polyploidy and heterozygosity, varied number of chromosomes, and use of low/medium-density markers. Genomic selection (GS) is a proven technology in animal breeding and has recently been incorporated in plant breeding programs. GS is a potential tool for the rapid selection of superior genotypes and accelerating breeding cycle. However, its full potential could be realized by an integrated approach combining high-throughput phenotyping, genotyping, machine learning, and speed breeding with genomic selection. For better understanding of GS integration, we comprehensively discuss the concept of genetic gain through the breeder’s equation, GS methodology, prediction models, current status of GS in sugarcane, challenges of prediction accuracy, challenges of GS in sugarcane, integrated GS, high-throughput phenotyping (HTP), high-throughput genotyping (HTG), machine learning, and speed breeding followed by its prospective applications in sugarcane improvement.
Collapse
Affiliation(s)
- Karansher Singh Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99163, USA
| | - Aalok Shiv
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Gurleen Kaur
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - Mintu Ram Meena
- Regional Center, ICAR-Sugarcane Breeding Institute, Karnal 132001, India
| | - Arun Kumar Raja
- Division of Crop Production, ICAR-Sugarcane Breeding Institute, Coimbatore 641007, India
| | - Krishnapriya Vengavasi
- Division of Crop Production, ICAR-Sugarcane Breeding Institute, Coimbatore 641007, India
| | - Ashutosh Kumar Mall
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Sanjeev Kumar
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Praveen Kumar Singh
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Jyotsnendra Singh
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Govind Hemaprabha
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore 641007, India
| | - Ashwini Dutt Pathak
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
| | - Gopalareddy Krishnappa
- Division of Crop Improvement, ICAR-Sugarcane Breeding Institute, Coimbatore 641007, India
- Correspondence: (G.K.); (S.K.)
| | - Sanjeev Kumar
- Division of Crop Improvement, ICAR-Indian Institute of Sugarcane Research, Lucknow 226002, India
- Correspondence: (G.K.); (S.K.)
| |
Collapse
|
29
|
Chung PY, Liao CT. Selection of parental lines for plant breeding via genomic prediction. FRONTIERS IN PLANT SCIENCE 2022; 13:934767. [PMID: 35968112 PMCID: PMC9363737 DOI: 10.3389/fpls.2022.934767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 07/01/2022] [Indexed: 06/15/2023]
Abstract
A set of superior parental lines is imperative for the development of high-performing inbred lines in any biparental crossing program for crops. The main objectives of this study are to (a) develop a genomic prediction approach to identify superior parental lines for multi-trait selection, and (b) generate a software package for users to execute the proposed approach before conducting field experiments. According to different breeding goals of the target traits, a novel selection index integrating information from genomic-estimated breeding values (GEBVs) of candidate accessions was proposed to evaluate the composite performance of simulated progeny populations. Two rice (Oryza sativa L.) genome datasets were analyzed to illustrate the potential applications of the proposed approach. One dataset applied to the parental selection for producing inbred lines with satisfactory performance in primary and secondary traits simultaneously. The other one applied to demonstrate the application of producing inbred lines with high adaptability to different environments. Overall, the results showed that incorporating GEBV and genomic diversity into a selection strategy based on the proposed selection index could assist in selecting superior parents to meet the desired breeding goals and increasing long-term genetic gain. An R package, called IPLGP, was generated to facilitate the widespread application of the approach.
Collapse
Affiliation(s)
- Ping-Yuan Chung
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| | - Chen-Tuo Liao
- Department of Agronomy, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
30
|
Gill T, Gill SK, Saini DK, Chopra Y, de Koff JP, Sandhu KS. A Comprehensive Review of High Throughput Phenotyping and Machine Learning for Plant Stress Phenotyping. PHENOMICS (CHAM, SWITZERLAND) 2022; 2:156-183. [PMID: 36939773 PMCID: PMC9590503 DOI: 10.1007/s43657-022-00048-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 01/29/2022] [Accepted: 02/11/2022] [Indexed: 02/04/2023]
Abstract
During the last decade, there has been rapid adoption of ground and aerial platforms with multiple sensors for phenotyping various biotic and abiotic stresses throughout the developmental stages of the crop plant. High throughput phenotyping (HTP) involves the application of these tools to phenotype the plants and can vary from ground-based imaging to aerial phenotyping to remote sensing. Adoption of these HTP tools has tried to reduce the phenotyping bottleneck in breeding programs and help to increase the pace of genetic gain. More specifically, several root phenotyping tools are discussed to study the plant's hidden half and an area long neglected. However, the use of these HTP technologies produces big data sets that impede the inference from those datasets. Machine learning and deep learning provide an alternative opportunity for the extraction of useful information for making conclusions. These are interdisciplinary approaches for data analysis using probability, statistics, classification, regression, decision theory, data visualization, and neural networks to relate information extracted with the phenotypes obtained. These techniques use feature extraction, identification, classification, and prediction criteria to identify pertinent data for use in plant breeding and pathology activities. This review focuses on the recent findings where machine learning and deep learning approaches have been used for plant stress phenotyping with data being collected using various HTP platforms. We have provided a comprehensive overview of different machine learning and deep learning tools available with their potential advantages and pitfalls. Overall, this review provides an avenue for studying various HTP platforms with particular emphasis on using the machine learning and deep learning tools for drawing legitimate conclusions. Finally, we propose the conceptual challenges being faced and provide insights on future perspectives for managing those issues.
Collapse
Affiliation(s)
- Taqdeer Gill
- grid.280741.80000 0001 2284 9820Department of Agricultural and Environmental Sciences, Tennessee State University, Nashville, TN 37209 USA
| | - Simranveer K. Gill
- grid.412577.20000 0001 2176 2352College of Agriculture, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Dinesh K. Saini
- grid.412577.20000 0001 2176 2352Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Yuvraj Chopra
- grid.412577.20000 0001 2176 2352College of Agriculture, Punjab Agricultural University, Ludhiana, Punjab 141004 India
| | - Jason P. de Koff
- grid.280741.80000 0001 2284 9820Department of Agricultural and Environmental Sciences, Tennessee State University, Nashville, TN 37209 USA
| | - Karansher S. Sandhu
- grid.30064.310000 0001 2157 6568Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99163 USA
| |
Collapse
|
31
|
Sandhu KS, Patil SS, Aoun M, Carter AH. Multi-Trait Multi-Environment Genomic Prediction for End-Use Quality Traits in Winter Wheat. Front Genet 2022; 13:831020. [PMID: 35173770 PMCID: PMC8841657 DOI: 10.3389/fgene.2022.831020] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/06/2022] [Indexed: 11/13/2022] Open
Abstract
Soft white wheat is a wheat class used in foreign and domestic markets to make various end products requiring specific quality attributes. Due to associated cost, time, and amount of seed needed, phenotyping for the end-use quality trait is delayed until later generations. Previously, we explored the potential of using genomic selection (GS) for selecting superior genotypes earlier in the breeding program. Breeders typically measure multiple traits across various locations, and it opens up the avenue for exploring multi-trait-based GS models. This study's main objective was to explore the potential of using multi-trait GS models for predicting seven different end-use quality traits using cross-validation, independent prediction, and across-location predictions in a wheat breeding program. The population used consisted of 666 soft white wheat genotypes planted for 5 years at two locations in Washington, United States. We optimized and compared the performances of four uni-trait- and multi-trait-based GS models, namely, Bayes B, genomic best linear unbiased prediction (GBLUP), multilayer perceptron (MLP), and random forests. The prediction accuracies for multi-trait GS models were 5.5 and 7.9% superior to uni-trait models for the within-environment and across-location predictions. Multi-trait machine and deep learning models performed superior to GBLUP and Bayes B for across-location predictions, but their advantages diminished when the genotype by environment component was included in the model. The highest improvement in prediction accuracy, that is, 35% was obtained for flour protein content with the multi-trait MLP model. This study showed the potential of using multi-trait-based GS models to enhance prediction accuracy by using information from previously phenotyped traits. It would assist in speeding up the breeding cycle time in a cost-friendly manner.
Collapse
Affiliation(s)
- Karansher S. Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| | - Shruti Sunil Patil
- School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, United States1
| | - Meriem Aoun
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| | - Arron H. Carter
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| |
Collapse
|
32
|
Sandhu KS, Merrick LF, Sankaran S, Zhang Z, Carter AH. Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs. Front Genet 2022. [PMCID: PMC8814369 DOI: 10.3389/fgene.2021.829131] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The last decade witnessed an unprecedented increase in the adoption of genomic selection (GS) and phenomics tools in plant breeding programs, especially in major cereal crops. GS has demonstrated the potential for selecting superior genotypes with high precision and accelerating the breeding cycle. Phenomics is a rapidly advancing domain to alleviate phenotyping bottlenecks and explores new large-scale phenotyping and data acquisition methods. In this review, we discuss the lesson learned from GS and phenomics in six self-pollinated crops, primarily focusing on rice, wheat, soybean, common bean, chickpea, and groundnut, and their implementation schemes are discussed after assessing their impact in the breeding programs. Here, the status of the adoption of genomics and phenomics is provided for those crops, with a complete GS overview. GS’s progress until 2020 is discussed in detail, and relevant information and links to the source codes are provided for implementing this technology into plant breeding programs, with most of the examples from wheat breeding programs. Detailed information about various phenotyping tools is provided to strengthen the field of phenomics for a plant breeder in the coming years. Finally, we highlight the benefits of merging genomic selection, phenomics, and machine and deep learning that have resulted in extraordinary results during recent years in wheat, rice, and soybean. Hence, there is a potential for adopting these technologies into crops like the common bean, chickpea, and groundnut. The adoption of phenomics and GS into different breeding programs will accelerate genetic gain that would create an impact on food security, realizing the need to feed an ever-growing population.
Collapse
Affiliation(s)
- Karansher S. Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
- *Correspondence: Karansher S. Sandhu,
| | - Lance F. Merrick
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| | - Sindhuja Sankaran
- Department of Biological System Engineering, Washington State University, Pullman, WA, United States
| | - Zhiwu Zhang
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| | - Arron H. Carter
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, United States
| |
Collapse
|
33
|
Saini DK, Chopra Y, Singh J, Sandhu KS, Kumar A, Bazzer S, Srivastava P. Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2022; 42:1. [PMID: 37309486 PMCID: PMC10248672 DOI: 10.1007/s11032-021-01272-7] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/10/2021] [Indexed: 06/14/2023]
Abstract
Genome-wide association studies (GWAS) are effectively applied to detect the marker trait associations (MTAs) using whole genome-wide variants for complex quantitative traits in different crop species. GWAS has been applied in wheat for different quality, biotic and abiotic stresses, and agronomic and yield-related traits. Predictions for marker-trait associations are controlled with the development of better statistical models taking population structure and familial relatedness into account. In this review, we have provided a detailed overview of the importance of association mapping, population design, high-throughput genotyping and phenotyping platforms, advancements in statistical models and multiple threshold comparisons, and recent GWA studies conducted in wheat. The information about MTAs utilized for gene characterization and adopted in breeding programs is also provided. In the literature that we surveyed, as many as 86,122 wheat lines have been studied under various GWA studies reporting 46,940 loci. However, further utilization of these is largely limited. The future breakthroughs in area of genomic selection, multi-omics-based approaches, machine, and deep learning models in wheat breeding after exploring the complex genetic structure with the GWAS are also discussed. This is a most comprehensive study of a large number of reports on wheat GWAS and gives a comparison and timeline of technological developments in this area. This will be useful to new researchers or groups who wish to invest in GWAS.
Collapse
Affiliation(s)
- Dinesh K. Saini
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, 141004 India
| | - Yuvraj Chopra
- College of Agriculture, Punjab Agricultural University, Ludhiana, 141004 India
| | - Jagmohan Singh
- Division of Plant Pathology, Indian Agricultural Research Institute, New Delhi, 110012 India
| | - Karansher S. Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99163 USA
| | - Anand Kumar
- Department of Genetics and Plant Breeding, Chandra Shekhar Azad University of Agriculture and Technology, Kanpur, 202002 India
| | - Sumandeep Bazzer
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211 USA
| | - Puja Srivastava
- Department of Plant Breeding and Genetics, Punjab Agricultural University, Ludhiana, 141004 India
| |
Collapse
|
34
|
Strategies to Increase Prediction Accuracy in Genomic Selection of Complex Traits in Alfalfa ( Medicago sativa L.). Cells 2021; 10:cells10123372. [PMID: 34943880 PMCID: PMC8699225 DOI: 10.3390/cells10123372] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 11/19/2021] [Accepted: 11/24/2021] [Indexed: 12/27/2022] Open
Abstract
Agronomic traits such as biomass yield and abiotic stress tolerance are genetically complex and challenging to improve through conventional breeding approaches. Genomic selection (GS) is an alternative approach in which genome-wide markers are used to determine the genomic estimated breeding value (GEBV) of individuals in a population. In alfalfa (Medicago sativa L.), previous results indicated that low to moderate prediction accuracy values (<70%) were obtained in complex traits, such as yield and abiotic stress resistance. There is a need to increase the prediction value in order to employ GS in breeding programs. In this paper we reviewed different statistic models and their applications in polyploid crops, such as alfalfa and potato. Specifically, we used empirical data affiliated with alfalfa yield under salt stress to investigate approaches that use DNA marker importance values derived from machine learning models, and genome-wide association studies (GWAS) of marker-trait association scores based on different GWASpoly models, in weighted GBLUP analyses. This approach increased prediction accuracies from 50% to more than 80% for alfalfa yield under salt stress. Finally, we expended the weighted GBLUP approach to potato and analyzed 13 phenotypic traits and obtained similar results. This is the first report on alfalfa to use variable importance and GWAS-assisted approaches to increase the prediction accuracy of GS, thus helping to select superior alfalfa lines based on their GEBVs.
Collapse
|
35
|
Kaur B, Sandhu KS, Kamal R, Kaur K, Singh J, Röder MS, Muqaddasi QH. Omics for the Improvement of Abiotic, Biotic, and Agronomic Traits in Major Cereal Crops: Applications, Challenges, and Prospects. PLANTS 2021; 10:plants10101989. [PMID: 34685799 PMCID: PMC8541486 DOI: 10.3390/plants10101989] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 09/17/2021] [Accepted: 09/18/2021] [Indexed: 12/22/2022]
Abstract
Omics technologies, namely genomics, transcriptomics, proteomics, metabolomics, and phenomics, are becoming an integral part of virtually every commercial cereal crop breeding program, as they provide substantial dividends per unit time in both pre-breeding and breeding phases. Continuous advances in omics assure time efficiency and cost benefits to improve cereal crops. This review provides a comprehensive overview of the established omics methods in five major cereals, namely rice, sorghum, maize, barley, and bread wheat. We cover the evolution of technologies in each omics section independently and concentrate on their use to improve economically important agronomic as well as biotic and abiotic stress-related traits. Advancements in the (1) identification, mapping, and sequencing of molecular/structural variants; (2) high-density transcriptomics data to study gene expression patterns; (3) global and targeted proteome profiling to study protein structure and interaction; (4) metabolomic profiling to quantify organ-level, small-density metabolites, and their composition; and (5) high-resolution, high-throughput, image-based phenomics approaches are surveyed in this review.
Collapse
Affiliation(s)
- Balwinder Kaur
- Everglades Research and Education Center, University of Florida, 3200 E. Palm Beach Rd., Belle Glade, FL 33430, USA;
| | - Karansher S. Sandhu
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA 99163, USA;
| | - Roop Kamal
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, 06466 Stadt Seeland, Germany; (R.K.); or (M.S.R.)
| | - Kawalpreet Kaur
- Department of Agricultural, Food and Nutritional Science, University of Alberta, Edmonton, AB T6G 2P5, Canada;
| | - Jagmohan Singh
- Division of Plant Pathology, ICAR-Indian Agricultural Research Institute, New Delhi 110012, India;
| | - Marion S. Röder
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, 06466 Stadt Seeland, Germany; (R.K.); or (M.S.R.)
| | - Quddoos H. Muqaddasi
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Corrensstraße 3, 06466 Stadt Seeland, Germany; (R.K.); or (M.S.R.)
- Correspondence: or
| |
Collapse
|