Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sandhu K, Patil SS, Pumphrey M, Carter A. Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program. Plant Genome 2021;14:e20119. [PMID: 34482627 DOI: 10.1002/tpg2.20119] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 05/18/2021] [Indexed: 06/13/2023]

For:	Sandhu K, Patil SS, Pumphrey M, Carter A. Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program. Plant Genome 2021;14:e20119. [PMID: 34482627 DOI: 10.1002/tpg2.20119] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 05/18/2021] [Indexed: 06/13/2023]

Number

Cited by Other Article(s)

Gaur A, Jindal Y, Singh V, Tiwari R, Juliana P, Kaushik D, Kumar KJY, Ahlawat OP, Singh G, Sheoran S. GWAS elucidated grain yield genetics in Indian spring wheat under diverse water conditions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024;137:177. [PMID: 38972024 DOI: 10.1007/s00122-024-04680-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 06/11/2024] [Indexed: 07/08/2024]

Abstract

KEY MESSAGE

Underpinned natural variations and key genes associated with yield under different water regimes, and identified genomic signatures of genetic gain in the Indian wheat breeding program. A novel KASP marker for TKW under water stress was developed and validated. A comprehensive genome-wide association study was conducted on 300 spring wheat genotypes to elucidate the natural variations associated with grain yield and its eleven contributing traits under fully irrigated, restricted water, and simulated no water conditions. Utilizing the 35K Wheat Breeders' Array, we identified 1155 quantitative trait nucleotides (QTNs), with 207 QTNs exhibiting stability across diverse conditions. These QTNs were further delimited into 539 genomic regions using a genome-wide LD value of 3.0 Mbp, revealing pleiotropic control across traits and conditions. Sub-genome A was significantly associated with traits under irrigated conditions, while sub-genome B showed more QTNs under water stressed conditions. Favourable alleles with significantly associated QTNs were delineated, with a notable pyramiding effect for enhancing trait performance. Additionally, allele of only 921 QTNs significantly affected the population mean. Allele profiling highlighted C-306 as a most potential source of drought tolerance. Moreover, 762 genes overlapping significant QTNs were identified, narrowing down to 27 putative candidate genes overlapping 29 novel and functional SNPs expressing (≥ 0.5 tpm) relevance across various growth conditions. A new KASP assay was developed, targeting a gene TraesCS2A03G1123700 regulating thousand kernel weight under severe drought condition. Genomic selection models (GBLUP, BayesB, MxE, and R-Norm) demonstrated an average prediction accuracy of 0.06-0.58 across environments, indicating potential for trait selection. Retrospective analysis of the Indian wheat breeding program supported a genetic gain in GY at the rate of ca. 0.56% per breeding cycle, since 1960, supporting the identification of genomic signatures driving trait selection and genetic gain. These findings offer insight into improving the rate of genetic gain in wheat breeding programs globally.

Collapse

Jeong SW, Lyu JI, Jeong H, Baek J, Moon JK, Lee C, Choi MG, Kim KH, Park YI. SUnSeT: spectral unmixing of hyperspectral images for phenotyping soybean seed traits. PLANT CELL REPORTS 2024;43:164. [PMID: 38852113 PMCID: PMC11162974 DOI: 10.1007/s00299-024-03249-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 05/06/2024] [Indexed: 06/10/2024]

Zhou W, Yan Z, Zhang L. A comparative study of 11 non-linear regression models highlighting autoencoder, DBN, and SVR, enhanced by SHAP importance analysis in soybean branching prediction. Sci Rep 2024;14:5905. [PMID: 38467662 PMCID: PMC10928191 DOI: 10.1038/s41598-024-55243-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/21/2024] [Indexed: 03/13/2024] Open

Abstract

To explore a robust tool for advancing digital breeding practices through an artificial intelligence-driven phenotype prediction expert system, we undertook a thorough analysis of 11 non-linear regression models. Our investigation specifically emphasized the significance of Support Vector Regression (SVR) and SHapley Additive exPlanations (SHAP) in predicting soybean branching. By using branching data (phenotype) of 1918 soybean accessions and 42 k SNP (Single Nucleotide Polymorphism) polymorphic data (genotype), this study systematically compared 11 non-linear regression AI models, including four deep learning models (DBN (deep belief network) regression, ANN (artificial neural network) regression, Autoencoders regression, and MLP (multilayer perceptron) regression) and seven machine learning models (e.g., SVR (support vector regression), XGBoost (eXtreme Gradient Boosting) regression, Random Forest regression, LightGBM regression, GPs (Gaussian processes) regression, Decision Tree regression, and Polynomial regression). After being evaluated by four valuation metrics: R2 (R-squared), MAE (Mean Absolute Error), MSE (Mean Squared Error), and MAPE (Mean Absolute Percentage Error), it was found that the SVR, Polynomial Regression, DBN, and Autoencoder outperformed other models and could obtain a better prediction accuracy when they were used for phenotype prediction. In the assessment of deep learning approaches, we exemplified the SVR model, conducting analyses on feature importance and gene ontology (GO) enrichment to provide comprehensive support. After comprehensively comparing four feature importance algorithms, no notable distinction was observed in the feature importance ranking scores across the four algorithms, namely Variable Ranking, Permutation, SHAP, and Correlation Matrix, but the SHAP value could provide rich information on genes with negative contributions, and SHAP importance was chosen for feature selection. The results of this study offer valuable insights into AI-mediated plant breeding, addressing challenges faced by traditional breeding programs. The method developed has broad applicability in phenotype prediction, minor QTL (quantitative trait loci) mining, and plant smart-breeding systems, contributing significantly to the advancement of AI-based breeding practices and transitioning from experience-based to data-based breeding.

Collapse

Pengphorm P, Thongrom S, Daengngam C, Duangpan S, Hussain T, Boonrat P. Optimal-Band Analysis for Chlorophyll Quantification in Rice Leaves Using a Custom Hyperspectral Imaging System. PLANTS (BASEL, SWITZERLAND) 2024;13:259. [PMID: 38256812 PMCID: PMC10819252 DOI: 10.3390/plants13020259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/03/2024] [Accepted: 01/12/2024] [Indexed: 01/24/2024]

Lozada DN, Sandhu KS, Bhatta M. Ridge regression and deep learning models for genome-wide selection of complex traits in New Mexican Chile peppers. BMC Genom Data 2023;24:80. [PMID: 38110866 PMCID: PMC10726521 DOI: 10.1186/s12863-023-01179-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 12/05/2023] [Indexed: 12/20/2023] Open

Abstract

BACKGROUND

Genomewide prediction estimates the genomic breeding values of selection candidates which can be utilized for population improvement and cultivar development. Ridge regression and deep learning-based selection models were implemented for yield and agronomic traits of 204 chile pepper genotypes evaluated in multi-environment trials in New Mexico, USA.

RESULTS

Accuracy of prediction differed across different models under ten-fold cross-validations, where high prediction accuracy was observed for highly heritable traits such as plant height and plant width. No model was superior across traits using 14,922 SNP markers for genomewide selection. Bayesian ridge regression had the highest average accuracy for first pod date (0.77) and total yield per plant (0.33). Multilayer perceptron (MLP) was the most superior for flowering time (0.76) and plant height (0.73), whereas the genomic BLUP model had the highest accuracy for plant width (0.62). Using a subset of 7,690 SNP loci resulting from grouping markers based on linkage disequilibrium coefficients resulted in improved accuracy for first pod date, ten pod weight, and total yield per plant, even under a relatively small training population size for MLP and random forest models. Genomic and ridge regression BLUP models were sufficient for optimal prediction accuracies for small training population size. Combining phenotypic selection and genomewide selection resulted in improved selection response for yield-related traits, indicating that integrated approaches can result in improved gains achieved through selection.

CONCLUSIONS

Accuracy values for ridge regression and deep learning prediction models demonstrate the potential of implementing genomewide selection for genetic improvement in chile pepper breeding programs. Ultimately, a large training data is relevant for improved genomic selection accuracy for the deep learning models.

Collapse

Chen C, Powell O, Dinglasan E, Ross EM, Yadav S, Wei X, Atkin F, Deomano E, Hayes BJ. Genomic prediction with machine learning in sugarcane, a complex highly polyploid clonally propagated crop with substantial non-additive variation for key traits. THE PLANT GENOME 2023;16:e20390. [PMID: 37728221 DOI: 10.1002/tpg2.20390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 08/01/2023] [Accepted: 08/29/2023] [Indexed: 09/21/2023]

Gill HS, Brar N, Halder J, Hall C, Seabourn BW, Chen YR, St Amand P, Bernardo A, Bai G, Glover K, Turnipseed B, Sehgal SK. Multi-trait genomic selection improves the prediction accuracy of end-use quality traits in hard winter wheat. THE PLANT GENOME 2023;16:e20331. [PMID: 37194433 DOI: 10.1002/tpg2.20331] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 02/16/2023] [Accepted: 03/01/2023] [Indexed: 05/18/2023]

Verplaetse N, Passemiers A, Arany A, Moreau Y, Raimondi D. Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease. Genome Biol 2023;24:224. [PMID: 37798735 PMCID: PMC10552306 DOI: 10.1186/s13059-023-03064-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/20/2023] [Indexed: 10/07/2023] Open

Cembrowska-Lech D, Krzemińska A, Miller T, Nowakowska A, Adamski C, Radaczyńska M, Mikiciuk G, Mikiciuk M. An Integrated Multi-Omics and Artificial Intelligence Framework for Advance Plant Phenotyping in Horticulture. BIOLOGY 2023;12:1298. [PMID: 37887008 PMCID: PMC10603917 DOI: 10.3390/biology12101298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/27/2023] [Accepted: 09/28/2023] [Indexed: 10/28/2023]

Gao P, Zhao H, Luo Z, Lin Y, Feng W, Li Y, Kong F, Li X, Fang C, Wang X. SoyDNGP: a web-accessible deep learning framework for genomic prediction in soybean breeding. Brief Bioinform 2023;24:bbad349. [PMID: 37824739 DOI: 10.1093/bib/bbad349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 09/13/2023] [Accepted: 09/14/2023] [Indexed: 10/14/2023] Open

Abstract

Soybean is a globally significant crop, playing a vital role in human nutrition and agriculture. Its complex genetic structure and wide trait variation, however, pose challenges for breeders and researchers aiming to optimize its yield and quality. Addressing this biological complexity requires innovative and accurate tools for trait prediction. In response to this challenge, we have developed SoyDNGP, a deep learning-based model that offers significant advancements in the field of soybean trait prediction. Compared to existing methods, such as DeepGS and DNNGP, SoyDNGP boasts a distinct advantage due to its minimal increase in parameter volume and superior predictive accuracy. Through rigorous performance comparison, including prediction accuracy and model complexity, SoyDNGP represents improved performance to its counterparts. Furthermore, it effectively predicted complex traits with remarkable precision, demonstrating robust performance across different sample sizes and trait complexities. We also tested the versatility of SoyDNGP across multiple crop species, including cotton, maize, rice and tomato. Our results showed its consistent and comparable performance, emphasizing SoyDNGP's potential as a versatile tool for genomic prediction across a broad range of crops. To enhance its accessibility to users without extensive programming experience, we designed a user-friendly web server, available at http://xtlab.hzau.edu.cn/SoyDNGP. The server provides two features: 'Trait Lookup', offering users the ability to access pre-existing trait predictions for over 500 soybean accessions, and 'Trait Prediction', allowing for the upload of VCF files for trait estimation. By providing a high-performing, accessible tool for trait prediction, SoyDNGP opens up new possibilities in the quest for optimized soybean breeding.

Collapse

Affiliation(s)

Pengfei Gao National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Haonan Zhao National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Zheng Luo National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Yifan Lin Hubei Hongshan Laboratory, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Wanjie Feng National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Yaling Li National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Fanjiang Kong Guangzhou Key Laboratory of Crop Gene Editing, Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
Xia Li National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China
Chao Fang Guangzhou Key Laboratory of Crop Gene Editing, Guangdong Key Laboratory of Plant Adaptation and Molecular Design, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, Guangzhou 510006, China
Xutong Wang National Key Laboratory of Crop Genetic Improvement, College of Plant Science and Technology, Huazhong Agricultural University, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China Hubei Hongshan Laboratory, No. 1 Shizishan Road, Hongshan District, Wuhan, Hubei 430070, China

Collapse

Yan Q, Fruzangohar M, Taylor J, Gong D, Walter J, Norman A, Shi JQ, Coram T. Improved genomic prediction using machine learning with Variational Bayesian sparsity. PLANT METHODS 2023;19:96. [PMID: 37660084 PMCID: PMC10474716 DOI: 10.1186/s13007-023-01073-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 08/22/2023] [Indexed: 09/04/2023]

Abstract

BACKGROUND

Genomic prediction has become a powerful modelling tool for assessing line performance in plant and livestock breeding programmes. Among the genomic prediction modelling approaches, linear based models have proven to provide accurate predictions even when the number of genetic markers exceeds the number of data samples. However, breeding programmes are now compiling data from large numbers of lines and test environments for analyses, rendering these approaches computationally prohibitive. Machine learning (ML) now offers a solution to this problem through the construction of fully connected deep learning architectures and high parallelisation of the predictive task. However, the fully connected nature of these architectures immediately generates an over-parameterisation of the network that needs addressing for efficient and accurate predictions.

RESULTS

In this research we explore the use of an ML architecture governed by variational Bayesian sparsity in its initial layers that we have called VBS-ML. The use of VBS-ML provides a mechanism for feature selection of important markers linked to the trait, immediately reducing the network over-parameterisation. Selected markers then propagate to the remaining fully connected feed-forward components of the ML network to form the final genomic prediction. We illustrated the approach with four large Australian wheat breeding data sets that range from 2665 lines to 10375 lines genotyped across a large set of markers. For all data sets, the use of the VBS-ML architecture improved genomic prediction accuracy over legacy linear based modelling approaches.

CONCLUSIONS

An ML architecture governed under a variational Bayesian paradigm was shown to improve genomic prediction accuracy over legacy modelling approaches. This VBS-ML approach can be used to dramatically decrease the parameter burden on the network and provide a computationally feasible approach for improving genomic prediction conducted with large breeding population numbers and genetic markers.

Collapse

Mora-Poblete F, Maldonado C, Henrique L, Uhdre R, Scapim CA, Mangolim CA. Multi-trait and multi-environment genomic prediction for flowering traits in maize: a deep learning approach. FRONTIERS IN PLANT SCIENCE 2023;14:1153040. [PMID: 37593046 PMCID: PMC10428628 DOI: 10.3389/fpls.2023.1153040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Accepted: 07/12/2023] [Indexed: 08/19/2023]

Abstract

Maize (Zea mays L.), the third most widely cultivated cereal crop in the world, plays a critical role in global food security. To improve the efficiency of selecting superior genotypes in breeding programs, researchers have aimed to identify key genomic regions that impact agronomic traits. In this study, the performance of multi-trait, multi-environment deep learning models was compared to that of Bayesian models (Markov Chain Monte Carlo generalized linear mixed models (MCMCglmm), Bayesian Genomic Genotype-Environment Interaction (BGGE), and Bayesian Multi-Trait and Multi-Environment (BMTME)) in terms of the prediction accuracy of flowering-related traits (Anthesis-Silking Interval: ASI, Female Flowering: FF, and Male Flowering: MF). A tropical maize panel of 258 inbred lines from Brazil was evaluated in three sites (Cambira-2018, Sabaudia-2018, and Iguatemi-2020 and 2021) using approximately 290,000 single nucleotide polymorphisms (SNPs). The results demonstrated a 14.4% increase in prediction accuracy when employing multi-trait models compared to the use of a single trait in a single environment approach. The accuracy of predictions also improved by 6.4% when using a single trait in a multi-environment scheme compared to using multi-trait analysis. Additionally, deep learning models consistently outperformed Bayesian models in both single and multiple trait and environment approaches. A complementary genome-wide association study identified associations with 26 candidate genes related to flowering time traits, and 31 marker-trait associations were identified, accounting for 37%, 37%, and 22% of the phenotypic variation of ASI, FF and MF, respectively. In conclusion, our findings suggest that deep learning models have the potential to significantly improve the accuracy of predictions, regardless of the approach used and provide support for the efficacy of this method in genomic selection for flowering-related traits in tropical maize.

Collapse

Bhat JA, Feng X, Mir ZA, Raina A, Siddique KHM. Recent advances in artificial intelligence, mechanistic models, and speed breeding offer exciting opportunities for precise and accelerated genomics-assisted breeding. PHYSIOLOGIA PLANTARUM 2023;175:e13969. [PMID: 37401892 DOI: 10.1111/ppl.13969] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 06/11/2023] [Accepted: 06/27/2023] [Indexed: 07/05/2023]

Hashem M, Sandhu KS, Ismail SM, Börner A, Sallam A. Validation and marker-assisted selection of DArT-genomic regions associated with wheat yield-related traits under normal and drought conditions. Front Genet 2023;14:1195566. [PMID: 37292145 PMCID: PMC10245129 DOI: 10.3389/fgene.2023.1195566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 05/10/2023] [Indexed: 06/10/2023] Open

Abstract

Quantitative trait loci (QTL) is one of the most important steps in marker-assisted selection. Few studies have validated quantitative trait loci for marker-assisted selection of yield traits under drought stress conditions in wheat. A set of 138 highly diverse wheat genotypes were tested under normal and drought stress conditions for 2 years. Plant height, heading date, spike length, grain number per spike, grain yield per spike, and 1000-kernel weight were scored. High genetic variation was found among genotypes in all traits scored under both conditions in the 2 years. The same panel was genotyped using a diversity-array technology (DArT) marker, and a genome-wide association study was performed to find alleles associated with yield traits under all conditions. A set of 191 significant DArT markers were identified in this study. The results of the genome-wide association study revealed eight common markers in wheat that were significantly associated with the same traits under both conditions in the 2 years. Out of the eight markers, seven were located on the D genome except one marker. Four validated markers were located on the 3D chromosome and found in complete linkage disequilibrium. Moreover, these four markers were significantly associated with the heading date under both conditions and the grain yield per spike under drought stress condition in the 2 years. This high-linkage disequilibrium genomic region was located within the TraesCS3D02G002400 gene model. Furthermore, of the eight validated markers, seven were previously reported to be associated with yield traits under normal and drought conditions. The results of this study provided very promising DArT markers that can be used for marker-assisted selection to genetically improve yield traits under normal and drought conditions.

Collapse

Anilkumar C, Muhammed Azharudheen TP, Sah RP, Sunitha NC, Devanna BN, Marndi BC, Patra BC. Gene based markers improve precision of genome-wide association studies and accuracy of genomic predictions in rice breeding. Heredity (Edinb) 2023;130:335-345. [PMID: 36792661 PMCID: PMC10163052 DOI: 10.1038/s41437-023-00599-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 02/17/2023] Open

Massahiro Yassue R, Galli G, James Chen C, Fritsche‐Neto R, Morota G. Genome-wide association analysis of hyperspectral reflectance data to dissect the genetic architecture of growth-related traits in maize under plant growth-promoting bacteria inoculation. PLANT DIRECT 2023;7:e492. [PMID: 37102161 PMCID: PMC10123960 DOI: 10.1002/pld3.492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 03/09/2023] [Accepted: 03/13/2023] [Indexed: 06/19/2023]

Abstract

Plant growth-promoting bacteria (PGPB) may be of use for increasing crop yield and plant resilience to biotic and abiotic stressors. Using hyperspectral reflectance data to assess growth-related traits may shed light on the underlying genetics as such data can help assess biochemical and physiological traits. This study aimed to integrate hyperspectral reflectance data with genome-wide association analyses to examine maize growth-related traits under PGPB inoculation. A total of 360 inbred maize lines with 13,826 single nucleotide polymorphisms (SNPs) were evaluated with and without PGPB inoculation; 150 hyperspectral wavelength reflectances at 386-1021 nm and 131 hyperspectral indices were used in the analysis. Plant height, stalk diameter, and shoot dry mass were measured manually. Overall, hyperspectral signatures produced similar or higher genomic heritability estimates than those of manually measured phenotypes, and they were genetically correlated with manually measured phenotypes. Furthermore, several hyperspectral reflectance values and spectral indices were identified by genome-wide association analysis as potential markers for growth-related traits under PGPB inoculation. Eight SNPs were detected, which were commonly associated with manually measured and hyperspectral phenotypes. Different genomic regions were found for plant growth and hyperspectral phenotypes between with and without PGPB inoculation. Moreover, the hyperspectral phenotypes were associated with genes previously reported as candidates for nitrogen uptake efficiency, tolerance to abiotic stressors, and kernel size. In addition, a Shiny web application was developed to explore multiphenotype genome-wide association results interactively. Taken together, our results demonstrate the usefulness of hyperspectral-based phenotyping for studying maize growth-related traits in response to PGPB inoculation.

Collapse

Bisht A, Saini DK, Kaur B, Batra R, Kaur S, Kaur I, Jindal S, Malik P, Sandhu PK, Kaur A, Gill BS, Wani SH, Kaur B, Mir RR, Sandhu KS, Siddique KHM. Multi-omics assisted breeding for biotic stress resistance in soybean. Mol Biol Rep 2023;50:3787-3814. [PMID: 36692674 DOI: 10.1007/s11033-023-08260-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/09/2023] [Indexed: 01/25/2023]

Affiliation(s)

Ashita Bisht Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India CSK Himachal Pradesh Krishi Vishvavidyalaya, Highland Agricultural Research and Extension Centre, 175142, Kukumseri, Lahaul and Spiti, India
Dinesh Kumar Saini Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India.
Baljeet Kaur Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Ritu Batra Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, 25004, Meerut, India
Sandeep Kaur Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Ishveen Kaur Agriculture, Environmental and Sustainability Sciences, College of sciences, University of Texas Rio Grande Valley, 78539, Edinburg, TX, USA
Suruchi Jindal Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
Palvi Malik , Gurdev Singh Khush Institute of Genetics, Plant Breeding and Biotechnology, Punjab Agricultural University,, 141004, Ludhiana, India
Pawanjit Kaur Sandhu Department of Chemistry, University of British Columbia, V1V 1V7, Okanagan, Kelowna, Canada
Amandeep Kaur Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
Balwinder Singh Gill Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Shabir Hussain Wani MRCFC Khudwani, Sher-e-Kashmir University of Agricultural Sciences and Technology, Kashmir, Shalimar, India
Balwinder Kaur Department of Entomology, UF/IFAS Research and Education Center, 33430, Belle Glade, Florida, USA
Reyazul Rouf Mir Division of Genetics and Plant Breeding, Faculty of Agriculture, SKUAST-Kashmir, 193201, India
Karansher Singh Sandhu Department of Crop and Soil Sciences, Washington State University, 99163, Pullman, WA, USA.
Kadambot H M Siddique The UWA Institute of Agriculture, The University of Western Australia, 6001, Perth, WA, Australia.

Collapse

Liang M, Cao S, Deng T, Du L, Li K, An B, Du Y, Xu L, Zhang L, Gao X, Li J, Guo P, Gao H. MAK: a machine learning framework improved genomic prediction via multi-target ensemble regressor chains and automatic selection of assistant traits. Brief Bioinform 2023;24:7031157. [PMID: 36752363 DOI: 10.1093/bib/bbad043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 01/13/2023] [Accepted: 01/20/2023] [Indexed: 02/09/2023] Open

Kumar M, Kumar S, Sandhu KS, Kumar N, Saripalli G, Prakash R, Nambardar A, Sharma H, Gautam T, Balyan HS, Gupta PK. GWAS and genomic prediction for pre-harvest sprouting tolerance involving sprouting score and two other related traits in spring wheat. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2023;43:14. [PMID: 37313293 PMCID: PMC10248620 DOI: 10.1007/s11032-023-01357-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 01/26/2023] [Indexed: 06/15/2023]

Innovative Advances in Plant Genotyping. Methods Mol Biol 2023;2638:451-465. [PMID: 36781662 DOI: 10.1007/978-1-0716-3024-2_32] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]

Singh J, Chhabra B, Raza A, Yang SH, Sandhu KS. Important wheat diseases in the US and their management in the 21st century. FRONTIERS IN PLANT SCIENCE 2023;13:1010191. [PMID: 36714765 PMCID: PMC9877539 DOI: 10.3389/fpls.2022.1010191] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 11/28/2022] [Indexed: 05/27/2023]

Jubair S, Domaratzki M. Crop genomic selection with deep learning and environmental data: A survey. Front Artif Intell 2023;5:1040295. [PMID: 36703955 PMCID: PMC9871498 DOI: 10.3389/frai.2022.1040295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open

Vu NT, Phuc TH, Nguyen NH, Van Sang N. Effects of common full-sib families on accuracy of genomic prediction for tagging weight in striped catfish Pangasianodon hypophthalmus. Front Genet 2023;13:1081246. [PMID: 36685869 PMCID: PMC9845282 DOI: 10.3389/fgene.2022.1081246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 12/06/2022] [Indexed: 01/06/2023] Open

Nazzicari N, Biscarini F. Stacked kinship CNN vs. GBLUP for genomic predictions of additive and complex continuous phenotypes. Sci Rep 2022;12:19889. [PMID: 36400808 PMCID: PMC9674857 DOI: 10.1038/s41598-022-24405-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 11/15/2022] [Indexed: 11/19/2022] Open

Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022;15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]

Abstract

The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.

Collapse

John M, Haselbeck F, Dass R, Malisi C, Ricca P, Dreischer C, Schultheiss SJ, Grimm DG. A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species. FRONTIERS IN PLANT SCIENCE 2022;13:932512. [PMID: 36407627 PMCID: PMC9673477 DOI: 10.3389/fpls.2022.932512] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]

Anilkumar C, Sunitha NC, Devate NB, Ramesh S. Advances in integrated genomic selection for rapid genetic gain in crop improvement: a review. PLANTA 2022;256:87. [PMID: 36149531 DOI: 10.1007/s00425-022-03996-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 09/11/2022] [Indexed: 06/16/2023]

Abstract

Genomic selection and its importance in crop breeding. Integration of GS with new breeding tools and developing SOP for GS to achieve maximum genetic gain with low cost and time. The success of conventional breeding approaches is not sufficient to meet the demand of a growing population for nutritious food and other plant-based products. Whereas, marker assisted selection (MAS) is not efficient in capturing all the favorable alleles responsible for economic traits in the process of crop improvement. Genomic selection (GS) developed in livestock breeding and then adapted to plant breeding promised to overcome the drawbacks of MAS and significantly improve complicated traits controlled by gene/QTL with small effects. Large-scale deployment of GS in important crops, as well as simulation studies in a variety of contexts, addressed G × E interaction effects and non-additive effects, as well as lowering breeding costs and time. The current study provides a complete overview of genomic selection, its process, and importance in modern plant breeding, along with insights into its application. GS has been implemented in the improvement of complex traits including tolerance to biotic and abiotic stresses. Furthermore, this review hypothesises that using GS in conjunction with other crop improvement platforms accelerates the breeding process to increase genetic gain. The objective of this review is to highlight the development of an appropriate GS model, the global open source network for GS, and trans-disciplinary approaches for effective accelerated crop improvement. The current study focused on the application of data science, including machine learning and deep learning tools, to enhance the accuracy of prediction models. Present study emphasizes on developing plant breeding strategies centered on GS combined with routine conventional breeding principles by developing GS-SOP to achieve enhanced genetic gain.

Collapse

Sandhu KS, Shiv A, Kaur G, Meena MR, Raja AK, Vengavasi K, Mall AK, Kumar S, Singh PK, Singh J, Hemaprabha G, Pathak AD, Krishnappa G, Kumar S. Integrated Approach in Genomic Selection to Accelerate Genetic Gain in Sugarcane. PLANTS 2022;11:plants11162139. [PMID: 36015442 PMCID: PMC9412483 DOI: 10.3390/plants11162139] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 08/08/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022]

Chung PY, Liao CT. Selection of parental lines for plant breeding via genomic prediction. FRONTIERS IN PLANT SCIENCE 2022;13:934767. [PMID: 35968112 PMCID: PMC9363737 DOI: 10.3389/fpls.2022.934767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 07/01/2022] [Indexed: 06/15/2023]

Gill T, Gill SK, Saini DK, Chopra Y, de Koff JP, Sandhu KS. A Comprehensive Review of High Throughput Phenotyping and Machine Learning for Plant Stress Phenotyping. PHENOMICS (CHAM, SWITZERLAND) 2022;2:156-183. [PMID: 36939773 PMCID: PMC9590503 DOI: 10.1007/s43657-022-00048-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 01/29/2022] [Accepted: 02/11/2022] [Indexed: 02/04/2023]

Abstract

During the last decade, there has been rapid adoption of ground and aerial platforms with multiple sensors for phenotyping various biotic and abiotic stresses throughout the developmental stages of the crop plant. High throughput phenotyping (HTP) involves the application of these tools to phenotype the plants and can vary from ground-based imaging to aerial phenotyping to remote sensing. Adoption of these HTP tools has tried to reduce the phenotyping bottleneck in breeding programs and help to increase the pace of genetic gain. More specifically, several root phenotyping tools are discussed to study the plant's hidden half and an area long neglected. However, the use of these HTP technologies produces big data sets that impede the inference from those datasets. Machine learning and deep learning provide an alternative opportunity for the extraction of useful information for making conclusions. These are interdisciplinary approaches for data analysis using probability, statistics, classification, regression, decision theory, data visualization, and neural networks to relate information extracted with the phenotypes obtained. These techniques use feature extraction, identification, classification, and prediction criteria to identify pertinent data for use in plant breeding and pathology activities. This review focuses on the recent findings where machine learning and deep learning approaches have been used for plant stress phenotyping with data being collected using various HTP platforms. We have provided a comprehensive overview of different machine learning and deep learning tools available with their potential advantages and pitfalls. Overall, this review provides an avenue for studying various HTP platforms with particular emphasis on using the machine learning and deep learning tools for drawing legitimate conclusions. Finally, we propose the conceptual challenges being faced and provide insights on future perspectives for managing those issues.

Collapse

Sandhu KS, Patil SS, Aoun M, Carter AH. Multi-Trait Multi-Environment Genomic Prediction for End-Use Quality Traits in Winter Wheat. Front Genet 2022;13:831020. [PMID: 35173770 PMCID: PMC8841657 DOI: 10.3389/fgene.2022.831020] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/06/2022] [Indexed: 11/13/2022] Open

Abstract

Soft white wheat is a wheat class used in foreign and domestic markets to make various end products requiring specific quality attributes. Due to associated cost, time, and amount of seed needed, phenotyping for the end-use quality trait is delayed until later generations. Previously, we explored the potential of using genomic selection (GS) for selecting superior genotypes earlier in the breeding program. Breeders typically measure multiple traits across various locations, and it opens up the avenue for exploring multi-trait-based GS models. This study's main objective was to explore the potential of using multi-trait GS models for predicting seven different end-use quality traits using cross-validation, independent prediction, and across-location predictions in a wheat breeding program. The population used consisted of 666 soft white wheat genotypes planted for 5 years at two locations in Washington, United States. We optimized and compared the performances of four uni-trait- and multi-trait-based GS models, namely, Bayes B, genomic best linear unbiased prediction (GBLUP), multilayer perceptron (MLP), and random forests. The prediction accuracies for multi-trait GS models were 5.5 and 7.9% superior to uni-trait models for the within-environment and across-location predictions. Multi-trait machine and deep learning models performed superior to GBLUP and Bayes B for across-location predictions, but their advantages diminished when the genotype by environment component was included in the model. The highest improvement in prediction accuracy, that is, 35% was obtained for flour protein content with the multi-trait MLP model. This study showed the potential of using multi-trait-based GS models to enhance prediction accuracy by using information from previously phenotyped traits. It would assist in speeding up the breeding cycle time in a cost-friendly manner.

Collapse

Sandhu KS, Merrick LF, Sankaran S, Zhang Z, Carter AH. Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs. Front Genet 2022. [PMCID: PMC8814369 DOI: 10.3389/fgene.2021.829131] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Saini DK, Chopra Y, Singh J, Sandhu KS, Kumar A, Bazzer S, Srivastava P. Comprehensive evaluation of mapping complex traits in wheat using genome-wide association studies. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2022;42:1. [PMID: 37309486 PMCID: PMC10248672 DOI: 10.1007/s11032-021-01272-7] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/10/2021] [Indexed: 06/14/2023]

Strategies to Increase Prediction Accuracy in Genomic Selection of Complex Traits in Alfalfa (Medicago sativa L.). Cells 2021;10:cells10123372. [PMID: 34943880 PMCID: PMC8699225 DOI: 10.3390/cells10123372] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 11/19/2021] [Accepted: 11/24/2021] [Indexed: 12/27/2022] Open

Kaur B, Sandhu KS, Kamal R, Kaur K, Singh J, Röder MS, Muqaddasi QH. Omics for the Improvement of Abiotic, Biotic, and Agronomic Traits in Major Cereal Crops: Applications, Challenges, and Prospects. PLANTS 2021;10:plants10101989. [PMID: 34685799 PMCID: PMC8541486 DOI: 10.3390/plants10101989] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 09/17/2021] [Accepted: 09/18/2021] [Indexed: 12/22/2022]