Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Liu Y, Wang D, He F, Wang J, Joshi T, Xu D. Phenotype Prediction and Genome-Wide Association Study Using Deep Convolutional Neural Network of Soybean. Front Genet 2019;10:1091. [PMID: 31824557 PMCID: PMC6883005 DOI: 10.3389/fgene.2019.01091] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Received: 07/22/2019] [Accepted: 10/09/2019] [Indexed: 12/21/2022] Open

Cited by Other Article(s)

Wu H, Gao B, Zhang R, Huang Z, Yin Z, Hu X, Yang CX, Du ZQ. Residual network improves the prediction accuracy of genomic selection. Anim Genet 2024;55:599-611. [PMID: 38746973 DOI: 10.1111/age.13445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2023] [Revised: 04/21/2024] [Accepted: 04/29/2024] [Indexed: 07/04/2024]

Li J, Zhang D, Yang F, Zhang Q, Pan S, Zhao X, Zhang Q, Han Y, Yang J, Wang K, Zhao C. TrG2P: A transfer-learning-based tool integrating multi-trait data for accurate prediction of crop yield. Plant Commun 2024;5:100975. [PMID: 38751121 DOI: 10.1016/j.xplc.2024.100975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/04/2023] [Revised: 04/14/2024] [Accepted: 05/11/2024] [Indexed: 06/24/2024]

Abstract

Yield prediction is the primary goal of genomic selection (GS)-assisted crop breeding. Because yield is a complex quantitative trait, making predictions from genotypic data is challenging. Transfer learning can produce an effective model for a target task by leveraging knowledge from a different, but related, source domain and is considered a promising method for improving yield prediction by integrating multi-trait data. However, it has not previously been applied to genotype-to-phenotype prediction owing to the lack of an efficient implementation framework. We therefore developed TrG2P, a transfer-learning-based framework. TrG2P first employs convolutional neural networks (CNN) to train models using non-yield-trait phenotypic and genotypic data, thus obtaining pre-trained models. Subsequently, the convolutional layer parameters from these pre-trained models are transferred to the yield prediction task, and the fully connected layers are retrained, thus obtaining fine-tuned models. Finally, the convolutional layer and the first fully connected layer of the fine-tuned models are fused, and the last fully connected layer is trained to enhance prediction performance. We applied TrG2P to five sets of genotypic and phenotypic data from maize (Zea mays), rice (Oryza sativa), and wheat (Triticum aestivum) and compared its model precision to that of seven other popular GS tools: ridge regression best linear unbiased prediction (rrBLUP), random forest, support vector regression, light gradient boosting machine (LightGBM), CNN, DeepGS, and deep neural network for genomic prediction (DNNGP). TrG2P improved the accuracy of yield prediction by 39.9%, 6.8%, and 1.8% in rice, maize, and wheat, respectively, compared with predictions generated by the best-performing comparison model. Our work therefore demonstrates that transfer learning is an effective strategy for improving yield prediction by integrating information from non-yield-trait data. We attribute its enhanced prediction accuracy to the valuable information available from traits associated with yield and to training dataset augmentation. The Python implementation of TrG2P is available at https://github.com/lijinlong1991/TrG2P. The web-based tool is available at http://trg2p.ebreed.cn:81.

Affiliation(s)

Jinlong Li Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Dongfeng Zhang Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Feng Yang Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Qiusi Zhang Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Shouhui Pan Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Xiangyu Zhao Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Qi Zhang Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Yanyun Han Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Jinliang Yang Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE 68583, USA; Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE 68583, USA
Kaiyi Wang Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China
Chunjiang Zhao Information Technology Research Center, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China; National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China

Graça M, Nobre R, Sousa L, Ilic A. Distributed transformer for high order epistasis detection in large-scale datasets. Sci Rep 2024;14:14579. [PMID: 38918413 PMCID: PMC11199512 DOI: 10.1038/s41598-024-65317-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/24/2024] [Accepted: 06/19/2024] [Indexed: 06/27/2024] Open

Bose S, Banerjee S, Kumar S, Saha A, Nandy D, Hazra S. Review of applications of artificial intelligence (AI) methods in crop research. J Appl Genet 2024;65:225-240. [PMID: 38216788 DOI: 10.1007/s13353-023-00826-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/13/2023] [Revised: 12/23/2023] [Accepted: 12/26/2023] [Indexed: 01/14/2024]

Yurtseven A, Buyanova S, Agrawal AA, Bochkareva OO, Kalinina OV. Machine learning and phylogenetic analysis allow for predicting antibiotic resistance in M. tuberculosis. BMC Microbiol 2023;23:404. [PMID: 38124060 PMCID: PMC10731705 DOI: 10.1186/s12866-023-03147-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/12/2023] [Accepted: 12/07/2023] [Indexed: 12/23/2023] Open

Wu C, Zhang Y, Ying Z, Li L, Wang J, Yu H, Zhang M, Feng X, Wei X, Xu X. A transformer-based genomic prediction method fused with knowledge-guided module. Brief Bioinform 2023;25:bbad438. [PMID: 38058185 PMCID: PMC10701102 DOI: 10.1093/bib/bbad438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/07/2023] [Revised: 10/15/2023] [Accepted: 11/03/2023] [Indexed: 12/08/2023] Open

Khvorykh GV, Sapozhnikov NA, Limborska SA, Khrunin AV. Evaluation of Density-Based Spatial Clustering for Identifying Genomic Loci Associated with Ischemic Stroke in Genome-Wide Data. Int J Mol Sci 2023;24:15355. [PMID: 37895035 PMCID: PMC10607504 DOI: 10.3390/ijms242015355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/22/2023] [Revised: 09/19/2023] [Accepted: 09/28/2023] [Indexed: 10/29/2023] Open

Canella Vieira C, Zhou J, Jarquin D, Zhou J, Diers B, Riechers DE, Nguyen HT, Shannon G. Genetic architecture of soybean tolerance to off-target dicamba. Front Plant Sci 2023;14:1230068. [PMID: 37877091 PMCID: PMC10590897 DOI: 10.3389/fpls.2023.1230068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/27/2023] [Accepted: 09/27/2023] [Indexed: 10/26/2023]

Abstract

The adoption of dicamba-tolerant (DT) soybean in the United States resulted in extensive off-target dicamba damage to non-DT vegetation across soybean-producing states. Although soybeans are highly sensitive to dicamba, the intensity of observed symptoms and yield losses are affected by the genetic background of genotypes. Thus, the objective of this study was to detect novel marker-trait associations and expand on previously identified genomic regions related to soybean response to off-target dicamba. A total of 551 non-DT advanced breeding lines derived from 232 unique bi-parental populations were phenotyped for off-target dicamba across nine environments for three years. Breeding lines were genotyped using the Illumina Infinium BARCSoySNP6K BeadChip. Filtered SNPs were included as predictors in Random Forest (RF) and Support Vector Machine (SVM) models in a forward stepwise selection loop to identify the combination of SNPs yielding the highest classification accuracy. Both RF and SVM models yielded high classification accuracies (0.76 and 0.79, respectively) with minor extreme misclassifications (observed tolerant predicted as susceptible, and vice-versa). Eight genomic regions associated with off-target dicamba tolerance were identified on chromosomes 6 [Linkage Group (LG) C2], 8 (LG A2), 9 (LG K), 10 (LG O), and 19 (LG L). Although the genetic architecture of tolerance is complex, high classification accuracies were obtained when including the major effect SNP identified on chromosome 6 as the sole predictor. In addition, candidate genes with annotated functions associated with phases II (conjugation of hydroxylated herbicides to endogenous sugar molecules) and III (transportation of herbicide conjugates into the vacuole) of herbicide detoxification in plants were co-localized with significant markers within each genomic region. 
Genomic prediction models, as reported in this study, can greatly facilitate the identification of genotypes with superior tolerance to off-target dicamba.
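The forward stepwise selection loop mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' code: the scorer `loo_accuracy` (a leave-one-out nearest-centroid classifier standing in for the Random Forest and SVM models), the function `forward_stepwise`, and the toy genotype matrix are all hypothetical names and data chosen only to keep the example dependency-free.

```python
from typing import Callable, List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]
Scorer = Callable[[Matrix, Sequence[int], List[int]], float]


def loo_accuracy(X: Matrix, y: Sequence[int], features: List[int]) -> float:
    """Leave-one-out accuracy of a nearest-centroid classifier.

    Hypothetical stand-in for the RF/SVM models named in the abstract.
    Assumes every class keeps at least one sample when one is held out.
    """
    n = len(X)
    correct = 0
    for i in range(n):
        sums, counts = {}, {}
        for j in range(n):
            if j == i:
                continue  # hold out sample i
            vec = [X[j][f] for f in features]
            acc = sums.setdefault(y[j], [0.0] * len(features))
            sums[y[j]] = [a + v for a, v in zip(acc, vec)]
            counts[y[j]] = counts.get(y[j], 0) + 1
        target = [X[i][f] for f in features]
        # Predict the class whose centroid is closest (squared Euclidean).
        predicted = min(
            sorted(sums),
            key=lambda lab: sum(
                (s / counts[lab] - t) ** 2 for s, t in zip(sums[lab], target)
            ),
        )
        correct += predicted == y[i]
    return correct / n


def forward_stepwise(
    X: Matrix, y: Sequence[int], score: Scorer
) -> Tuple[List[int], float]:
    """Greedily add the SNP column that most improves the score; stop when none does."""
    selected: List[int] = []
    best = 0.0
    remaining = list(range(len(X[0])))
    while remaining:
        scored = [(score(X, y, selected + [f]), f) for f in remaining]
        # Ties are broken toward the lowest column index.
        top_score, top_f = max(scored, key=lambda c: (c[0], -c[1]))
        if top_score <= best:
            break
        selected.append(top_f)
        remaining.remove(top_f)
        best = top_score
    return selected, best


# Toy data: genotypes coded 0/1/2; column 0 separates the two classes.
X = [[0, 1, 2], [0, 2, 0], [0, 0, 1], [0, 1, 0],
     [2, 1, 2], [2, 2, 0], [2, 0, 1], [2, 1, 0]]
y = [0, 0, 0, 0, 1, 1, 1, 1]
selected, best = forward_stepwise(X, y, loo_accuracy)
```

On this toy matrix the loop stops after selecting a single column, which loosely mirrors the abstract's observation that one major-effect SNP can carry most of the classification accuracy. In practice the scorer would be a cross-validated accuracy from whichever model is under study.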

Ferrão LFV, Dhakal R, Dias R, Tieman D, Whitaker V, Gore MA, Messina C, Resende MFR. Machine learning applications to improve flavor and nutritional content of horticultural crops through breeding and genetics. Curr Opin Biotechnol 2023;83:102968. [PMID: 37515935 DOI: 10.1016/j.copbio.2023.102968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/21/2023] [Revised: 06/19/2023] [Accepted: 06/21/2023] [Indexed: 07/31/2023]

Susmitha P, Kumar P, Yadav P, Sahoo S, Kaur G, Pandey MK, Singh V, Tseng TM, Gangurde SS. Genome-wide association study as a powerful tool for dissecting competitive traits in legumes. Front Plant Sci 2023;14:1123631. [PMID: 37645459 PMCID: PMC10461012 DOI: 10.3389/fpls.2023.1123631] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/14/2022] [Accepted: 06/08/2023] [Indexed: 08/31/2023]

Abstract

Legumes are extremely valuable because of their high protein content and several other nutritional components. The major challenge lies in maintaining the quantity and quality of protein and other nutritional compounds in view of climate change conditions. The global need for plant-based proteins has increased the demand for seeds with a high protein content that includes essential amino acids. Genome-wide association studies (GWAS) have evolved as a standard approach in agricultural genetics for examining such intricate characters. Recent development in machine learning methods shows promising applications for dimensionality reduction, which is a major challenge in GWAS. With the advancement in biotechnology, sequencing, and bioinformatics tools, estimation of linkage disequilibrium (LD) based associations between a genome-wide collection of single-nucleotide polymorphisms (SNPs) and desired phenotypic traits has become accessible. The markers from GWAS could be utilized for genomic selection (GS) to predict superior lines by calculating genomic estimated breeding values (GEBVs). For prediction accuracy, an assortment of statistical models could be utilized, such as ridge regression best linear unbiased prediction (rrBLUP), genomic best linear unbiased predictor (gBLUP), Bayesian, and random forest (RF). Both naturally diverse germplasm panels and family-based breeding populations can be used for association mapping based on the nature of the breeding system (inbred or outbred) in the plant species. MAGIC, MCILs, RIAILs, NAM, and ROAM are being used for association mapping in several crops. Several modifications of NAM, such as doubled haploid NAM (DH-NAM), backcross NAM (BC-NAM), and advanced backcross NAM (AB-NAM), have also been used in crops like rice, wheat, maize, barley, mustard, etc. For reliable marker-trait associations (MTAs), phenotyping accuracy is as important as genotyping. High-throughput genotyping, phenomics, and computational techniques have advanced during the past few years, making it possible to explore such enormous datasets. Each population has unique virtues and flaws at the genomics and phenomics levels, which will be covered in more detail in this review study. The current investigation includes utilizing elite breeding lines as an association mapping population, optimizing the choice of GWAS selection, population size, and hurdles in phenotyping, and statistical methods which will analyze competitive traits in legume breeding.

Bhat JA, Feng X, Mir ZA, Raina A, Siddique KHM. Recent advances in artificial intelligence, mechanistic models, and speed breeding offer exciting opportunities for precise and accelerated genomics-assisted breeding. Physiol Plant 2023;175:e13969. [PMID: 37401892 DOI: 10.1111/ppl.13969] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 04/10/2023] [Revised: 06/11/2023] [Accepted: 06/27/2023] [Indexed: 07/05/2023]

Mowlaei ME, Shi X. FSF-GA: A Feature Selection Framework for Phenotype Prediction Using Genetic Algorithms. Genes (Basel) 2023;14:1059. [PMID: 37239419 DOI: 10.3390/genes14051059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/17/2023] [Revised: 05/04/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023] Open

Kick DR, Wallace JG, Schnable JC, Kolkman JM, Alaca B, Beissinger TM, Edwards J, Ertl D, Flint-Garcia S, Gage JL, Hirsch CN, Knoll JE, de Leon N, Lima DC, Moreta DE, Singh MP, Thompson A, Weldekidan T, Washburn JD. Yield prediction through integration of genetic, environment, and management data through deep learning. G3 (Bethesda) 2023;13:jkad006. [PMID: 36625555 PMCID: PMC10085787 DOI: 10.1093/g3journal/jkad006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2022] [Revised: 07/28/2022] [Accepted: 12/23/2022] [Indexed: 01/11/2023]

Affiliation(s)

Daniel R Kick United States Department of Agriculture, Agricultural Research Service Plant Genetics Research Unit, Columbia, MO 65211, USA; Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
Jason G Wallace Department of Crop & Soil Science, University of Georgia, Athens, GA 30602, USA
James C Schnable Center for Plant Science Innovation and Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE 68588, USA
Judith M Kolkman School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
Barış Alaca Division of Plant Breeding Methodology, Department of Crop Science, University of Goettingen, Goettingen 37073, Germany; Center for Integrated Breeding Research, University of Goettingen, Goettingen 37073, Germany
Timothy M Beissinger Division of Plant Breeding Methodology, Department of Crop Science, University of Goettingen, Goettingen 37073, Germany; Center for Integrated Breeding Research, University of Goettingen, Goettingen 37073, Germany
Jode Edwards United States Department of Agriculture, Agricultural Research Service, Ames, IA 50011, USA
David Ertl Research and Business Development, Iowa Corn Promotion Board, Johnston, IA 50131, USA
Sherry Flint-Garcia United States Department of Agriculture, Agricultural Research Service Plant Genetics Research Unit, Columbia, MO 65211, USA
Joseph L Gage Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA
Candice N Hirsch Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
Joseph E Knoll United States Department of Agriculture, Agricultural Research Service Crop Genetics and Breeding Research Unit, Tifton, GA 31793, USA
Natalia de Leon Department of Agronomy, University of Wisconsin, Madison, WI 53706, USA
Dayane C Lima Plant Breeding and Plant Genetics Program, University of Wisconsin, Madison, WI 53706, USA
Danilo E Moreta School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
Maninder P Singh Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, USA
Addie Thompson Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, USA
Teclemariam Weldekidan Plant and Soil Sciences, University of Delaware, Newark, DE 19716, USA
Jacob D Washburn United States Department of Agriculture, Agricultural Research Service Plant Genetics Research Unit, Columbia, MO 65211, USA; Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA

Du J, Wang C, Wang L, Mao S, Zhu B, Li Z, Fan X. Automatic block-wise genotype-phenotype association detection based on hidden Markov model. BMC Bioinformatics 2023;24:138. [PMID: 37029361 PMCID: PMC10082540 DOI: 10.1186/s12859-023-05265-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/31/2022] [Accepted: 03/31/2023] [Indexed: 04/09/2023] Open

Bisht A, Saini DK, Kaur B, Batra R, Kaur S, Kaur I, Jindal S, Malik P, Sandhu PK, Kaur A, Gill BS, Wani SH, Kaur B, Mir RR, Sandhu KS, Siddique KHM. Multi-omics assisted breeding for biotic stress resistance in soybean. Mol Biol Rep 2023;50:3787-3814. [PMID: 36692674 DOI: 10.1007/s11033-023-08260-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/01/2022] [Accepted: 01/09/2023] [Indexed: 01/25/2023]

Affiliation(s)

Ashita Bisht Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India; CSK Himachal Pradesh Krishi Vishvavidyalaya, Highland Agricultural Research and Extension Centre, 175142, Kukumseri, Lahaul and Spiti, India
Dinesh Kumar Saini Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Baljeet Kaur Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Ritu Batra Department of Genetics and Plant Breeding, Chaudhary Charan Singh University, 25004, Meerut, India
Sandeep Kaur Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Ishveen Kaur Agriculture, Environmental and Sustainability Sciences, College of sciences, University of Texas Rio Grande Valley, 78539, Edinburg, TX, USA
Suruchi Jindal Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
Palvi Malik Gurdev Singh Khush Institute of Genetics, Plant Breeding and Biotechnology, Punjab Agricultural University, 141004, Ludhiana, India
Pawanjit Kaur Sandhu Department of Chemistry, University of British Columbia, V1V 1V7, Okanagan, Kelowna, Canada
Amandeep Kaur Division of Molecular Biology and Genetic Engineering, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
Balwinder Singh Gill Department of Plant Breeding and Genetics, Punjab Agricultural University, 141004, Ludhiana, India
Shabir Hussain Wani MRCFC Khudwani, Sher-e-Kashmir University of Agricultural Sciences and Technology, Kashmir, Shalimar, India
Balwinder Kaur Department of Entomology, UF/IFAS Research and Education Center, 33430, Belle Glade, Florida, USA
Reyazul Rouf Mir Division of Genetics and Plant Breeding, Faculty of Agriculture, SKUAST-Kashmir, 193201, India
Karansher Singh Sandhu Department of Crop and Soil Sciences, Washington State University, 99163, Pullman, WA, USA
Kadambot H M Siddique The UWA Institute of Agriculture, The University of Western Australia, 6001, Perth, WA, Australia

Innovative Advances in Plant Genotyping. Methods Mol Biol 2023;2638:451-465. [PMID: 36781662 DOI: 10.1007/978-1-0716-3024-2_32] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 02/15/2023]

Jeon D, Kang Y, Lee S, Choi S, Sung Y, Lee TH, Kim C. Digitalizing breeding in plants: A new trend of next-generation breeding based on genomic prediction. Front Plant Sci 2023;14:1092584. [PMID: 36743488 PMCID: PMC9892199 DOI: 10.3389/fpls.2023.1092584] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 11/08/2022] [Accepted: 01/05/2023] [Indexed: 06/18/2023]

Jubair S, Domaratzki M. Crop genomic selection with deep learning and environmental data: A survey. Front Artif Intell 2023;5:1040295. [PMID: 36703955 PMCID: PMC9871498 DOI: 10.3389/frai.2022.1040295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/09/2022] [Accepted: 12/22/2022] [Indexed: 01/12/2023] Open

Wang K, Abid MA, Rasheed A, Crossa J, Hearne S, Li H. DNNGP, a deep neural network-based method for genomic prediction using multi-omics data in plants. Mol Plant 2023;16:279-293. [PMID: 36366781 DOI: 10.1016/j.molp.2022.11.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 05/06/2022] [Revised: 09/28/2022] [Accepted: 11/08/2022] [Indexed: 06/16/2023]

Abstract

Genomic prediction is an effective way to accelerate the rate of agronomic trait improvement in plants. Traditional methods typically use linear regression models with clear assumptions; such methods are unable to capture the complex relationships between genotypes and phenotypes. Non-linear models (e.g., deep neural networks) have been proposed as a superior alternative to linear models because they can capture complex non-additive effects. Here we introduce a deep learning (DL) method, deep neural network genomic prediction (DNNGP), for integration of multi-omics data in plants. We trained DNNGP on four datasets and compared its performance with methods built with five classic models: genomic best linear unbiased prediction (GBLUP); two methods based on a machine learning (ML) framework, light gradient boosting machine (LightGBM) and support vector regression (SVR); and two methods based on a DL framework, deep learning genomic selection (DeepGS) and deep learning genome-wide association study (DLGWAS). DNNGP is novel in five ways. First, it can be applied to a variety of omics data to predict phenotypes. Second, the multilayered hierarchical structure of DNNGP dynamically learns features from raw data, avoiding overfitting and improving the convergence rate using a batch normalization layer and early stopping and rectified linear activation (rectified linear unit) functions. Third, when small datasets were used, DNNGP produced results that are competitive with results from the other five methods, showing greater prediction accuracy than the other methods when large-scale breeding data were used. Fourth, the computation time required by DNNGP was comparable with that of commonly used methods, up to 10 times faster than DeepGS. Fifth, hyperparameters can easily be batch tuned on a local machine. Compared with GBLUP, LightGBM, SVR, DeepGS and DLGWAS, DNNGP is superior to these existing widely used genomic selection (GS) methods. 
Moreover, DNNGP can generate robust assessments from diverse datasets, including omics data, and quickly incorporate complex and large datasets into usable models, making it a promising and practical approach for straightforward integration into existing GS platforms.

Tirnaz S, Zandberg J, Thomas WJW, Marsh J, Edwards D, Batley J. Application of crop wild relatives in modern breeding: An overview of resources, experimental and computational methodologies. Front Plant Sci 2022;13:1008904. [PMID: 36466237 PMCID: PMC9712971 DOI: 10.3389/fpls.2022.1008904] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 08/01/2022] [Accepted: 10/25/2022] [Indexed: 06/01/2023]

John M, Haselbeck F, Dass R, Malisi C, Ricca P, Dreischer C, Schultheiss SJ, Grimm DG. A comparison of classical and machine learning-based phenotype prediction methods on simulated data and three plant species. Front Plant Sci 2022;13:932512. [PMID: 36407627 PMCID: PMC9673477 DOI: 10.3389/fpls.2022.932512] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 04/29/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]

A divide-and-conquer approach for genomic prediction in rubber tree using machine learning. Sci Rep 2022;12:18023. [PMID: 36289298 PMCID: PMC9605989 DOI: 10.1038/s41598-022-20416-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 04/19/2022] [Accepted: 09/13/2022] [Indexed: 01/20/2023] Open

Ali A, Altaf MT, Nadeem MA, Karaköy T, Shah AN, Azeem H, Baloch FS, Baran N, Hussain T, Duangpan S, Aasim M, Boo KH, Abdelsalam NR, Hasan ME, Chung YS. Recent advancement in OMICS approaches to enhance abiotic stress tolerance in legumes. Front Plant Sci 2022;13:952759. [PMID: 36247536 PMCID: PMC9554552 DOI: 10.3389/fpls.2022.952759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/25/2022] [Accepted: 08/12/2022] [Indexed: 06/16/2023]

Abstract

The world is facing rapid climate change and a fast-growing global population. It is believed that the world population will be 9.7 billion in 2050. However, current agricultural production is not enough to feed the current population of 7.9 billion people, which is causing a huge hunger problem. Therefore, feeding the 9.7 billion population in 2050 will be a huge target. Climate change is becoming a huge threat to global agricultural production, and it is expected to become the worst threat to it in the upcoming years. Keeping this in view, it is very important to breed climate-resilient plants. Legumes are considered an important pillar of the agriculture production system and a great source of high-quality protein, minerals, and vitamins. During the last two decades, advancements in OMICs technology revolutionized plant breeding and emerged as a crop-saving tool in the wake of climate change. Various OMICs approaches like Next-Generation sequencing (NGS), Transcriptomics, Proteomics, and Metabolomics have been used in legumes under abiotic stresses. The scientific community successfully utilized these platforms and investigated the Quantitative Trait Loci (QTL), linked markers through genome-wide association studies, and developed KASP markers that can be helpful for the marker-assisted breeding of legumes. Gene-editing techniques have been successfully proven for soybean, cowpea, chickpea, and model legumes such as Medicago truncatula and Lotus japonicus. A number of efforts have been made to perform gene editing in legumes. Moreover, the scientific community did a great job of identifying various genes involved in the metabolic pathways and utilizing the resulting information in the development of climate-resilient legume cultivars at a rapid pace. With this in view, this review highlights the contribution of OMICs approaches to abiotic stresses in legumes. We envisage that the presented information will be helpful for the scientific community to develop climate-resilient legume cultivars.

Affiliation(s)

Amjad Ali Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Muhammad Tanveer Altaf Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Muhammad Azhar Nadeem Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Tolga Karaköy Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Adnan Noor Shah Department of Agricultural Engineering, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan
Hajra Azeem Department of Plant Pathology, Faculty of Agricultural Sciences & Technology, Bahauddin Zakariya University, Multan, Pakistan
Faheem Shehzad Baloch Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Nurettin Baran Department of Plant Production and Technologies, Faculty of Applied Sciences, Mus Alparslan University, Mus, Turkey
Tajamul Hussain Laboratory of Plant Breeding and Climate Resilient Agriculture, Agricultural Innovation and Management Division, Faculty of Natural Resources, Prince of Songkla University, Hat Yai, Thailand
Saowapa Duangpan Laboratory of Plant Breeding and Climate Resilient Agriculture, Agricultural Innovation and Management Division, Faculty of Natural Resources, Prince of Songkla University, Hat Yai, Thailand
Muhammad Aasim Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas, Turkey
Kyung-Hwan Boo Subtropical/Tropical Organism Gene Bank, Department of Biotechnology, College of Applied Life Science, Jeju National University, Jeju, South Korea
Nader R. Abdelsalam Agricultural Botany Department, Faculty of Agriculture (Saba Basha), Alexandria University, Alexandria, Egypt
Mohamed E. Hasan Bioinformatics Department, Genetic Engineering and Biotechnology Research Institute, University of Sadat City, Sadat City, Egypt
Yong Suk Chung Department of Plant Resources and Environment, Jeju National University, Jeju, South Korea

A joint learning approach for genomic prediction in polyploid grasses. Sci Rep 2022;12:12499. [PMID: 35864135 PMCID: PMC9304331 DOI: 10.1038/s41598-022-16417-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 07/11/2022] [Indexed: 12/20/2022] Open

Zhang Z, Pope M, Shakoor N, Pless R, Mockler TC, Stylianou A. Comparing Deep Learning Approaches for Understanding Genotype × Phenotype Interactions in Biomass Sorghum. Front Artif Intell 2022;5:872858. [PMID: 35860344 PMCID: PMC9289439 DOI: 10.3389/frai.2022.872858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 06/09/2022] [Indexed: 11/13/2022] Open

Muneeb M, Feng SF, Henschel A. Can We Convert Genotype Sequences Into Images for Cases/Controls Classification? FRONTIERS IN BIOINFORMATICS 2022;2:914435. [PMID: 36304278 PMCID: PMC9580854 DOI: 10.3389/fbinf.2022.914435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 06/01/2022] [Indexed: 11/13/2022] Open

Danilevicz MF, Gill M, Anderson R, Batley J, Bennamoun M, Bayer PE, Edwards D. Plant Genotype to Phenotype Prediction Using Machine Learning. Front Genet 2022;13:822173. [PMID: 35664329 PMCID: PMC9159391 DOI: 10.3389/fgene.2022.822173] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/07/2022] [Indexed: 12/13/2022] Open

Deciphering Pleiotropic Signatures of Regulatory SNPs in Zea mays L. Using Multi-Omics Data and Machine Learning Algorithms. Int J Mol Sci 2022;23:ijms23095121. [PMID: 35563516 PMCID: PMC9100765 DOI: 10.3390/ijms23095121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 04/28/2022] [Accepted: 05/02/2022] [Indexed: 01/25/2023] Open

Canella Vieira C, Zhou J, Usovsky M, Vuong T, Howland AD, Lee D, Li Z, Zhou J, Shannon G, Nguyen HT, Chen P. Exploring Machine Learning Algorithms to Unveil Genomic Regions Associated With Resistance to Southern Root-Knot Nematode in Soybeans. FRONTIERS IN PLANT SCIENCE 2022;13:883280. [PMID: 35592556 PMCID: PMC9111516 DOI: 10.3389/fpls.2022.883280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 04/08/2022] [Indexed: 06/15/2023]

Abstract

Southern root-knot nematode [SRKN, Meloidogyne incognita (Kofoid & White) Chitwood] is a plant-parasitic nematode that is challenging to control due to its short life cycle, wide host range, and limited management options, of which genetic resistance is the main option to efficiently control the damage caused by SRKN. To date, a major quantitative trait locus (QTL) mapped on chromosome (Chr.) 10 plays an essential role in resistance to SRKN in soybean varieties. The confidence of trait-loci associations discovered by traditional methods is often limited by the assumptions that individual single nucleotide polymorphisms (SNPs) always act independently and that the phenotype follows a Gaussian distribution. Therefore, the objective of this study was to conduct machine learning (ML)-based genome-wide association studies (GWAS) utilizing Random Forest (RF) and Support Vector Machine (SVM) algorithms to unveil novel regions of the soybean genome associated with resistance to SRKN. A total of 717 breeding lines derived from 330 unique bi-parental populations were genotyped with the Illumina Infinium BARCSoySNP6K BeadChip and phenotyped for SRKN resistance in a greenhouse. A GWAS pipeline involving a supervised feature dimension reduction based on Variable Importance in Projection (VIP) and SNP detection based on classification accuracy was proposed. Minor-effect SNPs were detected by the proposed ML-GWAS methodology but not identified using Bayesian-information and linkage-disequilibrium Iteratively Nested Keyway (BLINK), Fixed and Random Model Circulating Probability Unification (FarmCPU), and Enriched Compressed Mixed Linear Model (ECMLM) models. Besides the genomic region on Chr. 10 that can explain most of the SRKN resistance variance, additional minor-effect SNPs were also identified on Chrs. 10 and 11. The findings in this study demonstrated that overfitting in GWAS may lead to lower prediction accuracy, and the detection of significant SNPs based on classification accuracy limited false-positive associations. The expansion of the basis of the genetic resistance to SRKN can potentially reduce the selection pressure over the major QTL on Chr. 10 and achieve higher levels of resistance.

Mathew B, Hauptmann A, Léon J, Sillanpää MJ. NeuralLasso: Neural Networks Meet Lasso in Genomic Prediction. FRONTIERS IN PLANT SCIENCE 2022;13:800161. [PMID: 35574107 PMCID: PMC9100816 DOI: 10.3389/fpls.2022.800161] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 03/18/2022] [Indexed: 06/15/2023]

Genome-Enabled Prediction Methods Based on Machine Learning. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022;2467:189-218. [PMID: 35451777 DOI: 10.1007/978-1-0716-2205-6_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Tay Fernandez CG, Nestor BJ, Danilevicz MF, Gill M, Petereit J, Bayer PE, Finnegan PM, Batley J, Edwards D. Pangenomes as a Resource to Accelerate Breeding of Under-Utilised Crop Species. Int J Mol Sci 2022;23:2671. [PMID: 35269811 PMCID: PMC8910360 DOI: 10.3390/ijms23052671] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 02/21/2022] [Accepted: 02/21/2022] [Indexed: 02/01/2023] Open

Nguyen Ba AN, Lawrence KR, Rego-Costa A, Gopalakrishnan S, Temko D, Michor F, Desai MM. Barcoded Bulk QTL mapping reveals highly polygenic and epistatic architecture of complex traits in yeast. eLife 2022;11:73983. [PMID: 35147078 PMCID: PMC8979589 DOI: 10.7554/elife.73983] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 02/11/2022] [Indexed: 11/25/2022] Open

Sandhu KS, Patil SS, Aoun M, Carter AH. Multi-Trait Multi-Environment Genomic Prediction for End-Use Quality Traits in Winter Wheat. Front Genet 2022;13:831020. [PMID: 35173770 PMCID: PMC8841657 DOI: 10.3389/fgene.2022.831020] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/06/2022] [Indexed: 11/13/2022] Open

Abstract

Soft white wheat is a wheat class used in foreign and domestic markets to make various end products requiring specific quality attributes. Due to associated cost, time, and amount of seed needed, phenotyping for the end-use quality trait is delayed until later generations. Previously, we explored the potential of using genomic selection (GS) for selecting superior genotypes earlier in the breeding program. Breeders typically measure multiple traits across various locations, and it opens up the avenue for exploring multi-trait-based GS models. This study's main objective was to explore the potential of using multi-trait GS models for predicting seven different end-use quality traits using cross-validation, independent prediction, and across-location predictions in a wheat breeding program. The population used consisted of 666 soft white wheat genotypes planted for 5 years at two locations in Washington, United States. We optimized and compared the performances of four uni-trait- and multi-trait-based GS models, namely, Bayes B, genomic best linear unbiased prediction (GBLUP), multilayer perceptron (MLP), and random forests. The prediction accuracies for multi-trait GS models were 5.5 and 7.9% superior to uni-trait models for the within-environment and across-location predictions. Multi-trait machine and deep learning models performed superior to GBLUP and Bayes B for across-location predictions, but their advantages diminished when the genotype by environment component was included in the model. The highest improvement in prediction accuracy, that is, 35% was obtained for flour protein content with the multi-trait MLP model. This study showed the potential of using multi-trait-based GS models to enhance prediction accuracy by using information from previously phenotyped traits. It would assist in speeding up the breeding cycle time in a cost-friendly manner.

Coelho de Sousa I, Nascimento M, de Castro Sant’anna I, Teixeira Caixeta E, Ferreira Azevedo C, Damião Cruz C, Lopes da Silva F, Ruas Alkimim E, Campana Nascimento AC, Vergara Lopes Serão N. Marker effects and heritability estimates using additive-dominance genomic architectures via artificial neural networks in Coffea canephora. PLoS One 2022;17:e0262055. [PMID: 35081139 PMCID: PMC8791507 DOI: 10.1371/journal.pone.0262055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 12/15/2021] [Indexed: 11/18/2022] Open

Abstract

Many methodologies are used to predict the genetic merit in animals and plants, but some of them require a priori assumptions that may increase the complexity of the model. An artificial neural network (ANN) has the advantage of not requiring a priori assumptions about the relationships between inputs and the output, allowing great flexibility to handle different types of complex non-additive effects, such as dominance and epistasis. Despite this advantage, the biological interpretability of ANNs is still limited. The aim of this research was to estimate the heritability and marker effects for two traits in Coffea canephora using an additive-dominance architecture ANN and to compare it with genomic best linear unbiased prediction (GBLUP). The data used consist of 51 clones of C. canephora varietal group Conilon, 32 of varietal group Robusta and 82 intervarietal hybrids. In total, these 165 phenotyped individuals were genotyped for 14,387 SNPs. Due to the high computational cost of ANNs, we used a Bagging decision tree to reduce the dimensionality of the data, selecting the markers that accumulated 70% of the total importance. ANNs with three hidden layers were run, with each layer varying from 1 to 40 neurons, for a total of 64,000 neural networks. The network architectures with the best predictive ability were selected. The best architectures were composed of 4, 15, and 33 neurons in the first, second and third hidden layers, respectively, for yield, and of 13, 20, and 24 neurons, respectively, for rust resistance. Predictive ability was greater with the three-hidden-layer ANN than with one hidden layer or GBLUP, reaching 0.72 and 0.88 for yield and coffee leaf rust resistance, respectively. The concordance rate (CR) of the 10% largest marker effects among the methods varied between 10% and 13.8% for additive effects and between 5.4% and 11.9% for dominance effects. The narrow-sense and dominance-only heritability estimates were 0.25 and 0.06, respectively, for yield, and 0.67 and 0.03, respectively, for rust resistance. The ANN was able to estimate the heritabilities from an additive-dominance genomic architecture, and the ANN with three hidden layers obtained the best predictive ability when compared with GBLUP and the ANN with one hidden layer.

Sandhu KS, Merrick LF, Sankaran S, Zhang Z, Carter AH. Prospectus of Genomic Selection and Phenomics in Cereal, Legume and Oilseed Breeding Programs. Front Genet 2022. [PMCID: PMC8814369 DOI: 10.3389/fgene.2021.829131] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Mathema VB, Duangkumpha K, Wanichthanarak K, Jariyasopit N, Dhakal E, Sathirapongsasuti N, Kitiyakara C, Sirivatanauksorn Y, Khoomrung S. CRISP: a deep learning architecture for GC × GC-TOFMS contour ROI identification, simulation and analysis in imaging metabolomics. Brief Bioinform 2022;23:6503617. [PMID: 35022651 PMCID: PMC8921635 DOI: 10.1093/bib/bbab550] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 11/19/2021] [Accepted: 11/29/2021] [Indexed: 01/16/2023] Open

Abstract

Two-dimensional gas chromatography–time-of-flight mass spectrometry (GC × GC–TOFMS) provides a large amount of molecular information from biological samples. However, the lack of a comprehensive compound library or customizable bioinformatics tool is currently a challenge in GC × GC–TOFMS data analysis. We present an open-source deep learning (DL) software called contour regions of interest (ROI) identification, simulation and untargeted metabolomics profiler (CRISP). CRISP integrates multiple customizable deep neural network architectures for assisting the semi-automated identification of ROIs, contour synthesis, resolution enhancement and classification of GC × GC–TOFMS-based contour images. The approach includes the novel aggregate feature representative contour (AFRC) construction and stacked ROIs. This generates an unbiased contour image dataset that enhances the contrasting characteristics between different test groups and can be suitable for small sample sizes. The utility of the generative models and the accuracy and efficacy of the platform were demonstrated using a dataset of GC × GC–TOFMS contour images from patients with late-stage diabetic nephropathy and healthy control groups. CRISP successfully constructed AFRC images and identified over five ROIs to create a deepstacked dataset. The high fidelity, 512 × 512-pixels generative model was trained as a generator with a Fréchet inception distance of <47.00. The trained classifier achieved an AUROC of >0.96 and a classification accuracy of >95.00% for datasets with and without column bleed. Overall, CRISP demonstrates good potential as a DL-based approach for the rapid analysis of 4-D GC × GC–TOFMS untargeted metabolite profiles by directly implementing contour images. CRISP is available at https://github.com/vivekmathema/GCxGC-CRISP.

Affiliation(s)

Vivek Bhakta Mathema Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Kassaporn Duangkumpha Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Kwanjeera Wanichthanarak Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Narumol Jariyasopit Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Esha Dhakal Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Nuankanya Sathirapongsasuti Section of Translational Medicine, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand; Research Network of NANOTEC - MU Ramathibodi on Nanomedicine, Bangkok, Thailand
Chagriya Kitiyakara Department of Medicine, Faculty of Medicine, Ramathibodi Hospital, Rama VI Rd., Ratchathewi, Bangkok 10400, Thailand
Yongyut Sirivatanauksorn Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand
Sakda Khoomrung Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok 10700, Thailand; Center of Excellence for Innovation in Chemistry (PERCH-CIC), Faculty of Science, Mahidol University, Bangkok, Thailand

Kwon OS, Hong M, Kim TH, Hwang I, Shim J, Choi EK, Lim HE, Yu HT, Uhm JS, Joung B, Oh S, Lee MH, Kim YH, Pak HN. Genome-wide association study-based prediction of atrial fibrillation using artificial intelligence. Open Heart 2022;9:e001898. [PMID: 35086918 PMCID: PMC8796259 DOI: 10.1136/openhrt-2021-001898] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 01/05/2022] [Indexed: 12/12/2022] Open

Predicting Heritability of Oil Palm Breeding Using Phenotypic Traits and Machine Learning. SUSTAINABILITY 2021. [DOI: 10.3390/su132212613] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Ubbens J, Parkin I, Eynck C, Stavness I, Sharpe AG. Deep neural networks for genomic prediction do not estimate marker effects. THE PLANT GENOME 2021;14:e20147. [PMID: 34596363 DOI: 10.1002/tpg2.20147] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 07/09/2021] [Indexed: 06/13/2023]

Sandhu K, Patil SS, Pumphrey M, Carter A. Multitrait machine- and deep-learning models for genomic selection using spectral information in a wheat breeding program. THE PLANT GENOME 2021;14:e20119. [PMID: 34482627 DOI: 10.1002/tpg2.20119] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 05/18/2021] [Indexed: 06/13/2023]

Molecular Classification and Interpretation of Amyotrophic Lateral Sclerosis Using Deep Convolution Neural Networks and Shapley Values. Genes (Basel) 2021;12:genes12111754. [PMID: 34828360 PMCID: PMC8626003 DOI: 10.3390/genes12111754] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 10/23/2021] [Accepted: 10/23/2021] [Indexed: 11/17/2022] Open

Singh D, Chaudhary P, Taunk J, Singh CK, Singh D, Tomar RSS, Aski M, Konjengbam NS, Raje RS, Singh S, Sengar RS, Yadav RK, Pal M. Fab Advances in Fabaceae for Abiotic Stress Resilience: From 'Omics' to Artificial Intelligence. Int J Mol Sci 2021;22:10535. [PMID: 34638885 PMCID: PMC8509049 DOI: 10.3390/ijms221910535] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 09/17/2021] [Accepted: 09/23/2021] [Indexed: 11/16/2022] Open

Passamonti MM, Somenzi E, Barbato M, Chillemi G, Colli L, Joost S, Milanesi M, Negrini R, Santini M, Vajana E, Williams JL, Ajmone-Marsan P. The Quest for Genes Involved in Adaptation to Climate Change in Ruminant Livestock. Animals (Basel) 2021;11:2833. [PMID: 34679854 PMCID: PMC8532622 DOI: 10.3390/ani11102833] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 09/21/2021] [Accepted: 09/23/2021] [Indexed: 12/14/2022] Open

Affiliation(s)

Matilde Maria Passamonti Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Elisa Somenzi Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Mario Barbato Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Giovanni Chillemi Department for Innovation in Biological, Agro-Food and Forest Systems–DIBAF, Università Della Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
Licia Colli Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy; Research Center on Biodiversity and Ancient DNA—BioDNA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Stéphane Joost Laboratory of Geographic Information Systems (LASIG), School of Architecture, Civil and Environmental Engineering (ENAC), Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
Marco Milanesi Department for Innovation in Biological, Agro-Food and Forest Systems–DIBAF, Università Della Tuscia, Via S. Camillo de Lellis snc, 01100 Viterbo, Italy
Riccardo Negrini Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Monia Santini Impacts on Agriculture, Forests and Ecosystem Services (IAFES) Division, Fondazione Centro Euro-Mediterraneo Sui Cambiamenti Climatici (CMCC), Viale Trieste 127, 01100 Viterbo, Italy
Elia Vajana Laboratory of Geographic Information Systems (LASIG), School of Architecture, Civil and Environmental Engineering (ENAC), Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
John Lewis Williams Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy
Paolo Ajmone-Marsan Department of Animal Science, Food and Nutrition—DIANA, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy; Nutrigenomics and Proteomics Research Center—PRONUTRIGEN, Università Cattolica del Sacro Cuore, Via Emilia Parmense, 84, 29122 Piacenza, Italy

Nakhle F, Harfouche AL. Ready, Steady, Go AI: A practical tutorial on fundamentals of artificial intelligence and its applications in phenomics image analysis. PATTERNS (NEW YORK, N.Y.) 2021;2:100323. [PMID: 34553170 PMCID: PMC8441561 DOI: 10.1016/j.patter.2021.100323] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Razzaq A, Kaur P, Akhter N, Wani SH, Saleem F. Next-Generation Breeding Strategies for Climate-Ready Crops. FRONTIERS IN PLANT SCIENCE 2021;12:620420. [PMID: 34367194 PMCID: PMC8336580 DOI: 10.3389/fpls.2021.620420] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 06/14/2021] [Indexed: 05/17/2023]

Abstract

Climate change is a threat to global food security due to the reduction of crop productivity around the globe. Food security is a matter of concern for stakeholders and policymakers as the global population is predicted to surpass 10 billion in the coming years. Crop improvement via modern breeding techniques, along with efficient agronomic practices, innovations in microbiome applications, and exploitation of the natural variations in underutilized crops, is an excellent way forward to fulfill future food requirements. In this review, we describe the next-generation breeding tools that can be used to increase crop production by developing climate-resilient superior genotypes to cope with the future challenges of global food security. Recent innovations in genomic-assisted breeding (GAB) strategies allow the construction of highly annotated crop pan-genomes to give a snapshot of the full landscape of genetic diversity (GD) and recapture the lost gene repertoire of a species. Pan-genomes provide new platforms to exploit these unique genes or genetic variation for optimizing breeding programs. The advent of next-generation clustered regularly interspaced short palindromic repeat/CRISPR-associated (CRISPR/Cas) systems, such as prime editing, base editing, and de novo domestication, has institutionalized the idea that genome editing is revamped for crop improvement. Also, the availability of versatile Cas orthologs, including Cas9, Cas12, Cas13, and Cas14, has improved the editing efficiency. CRISPR/Cas systems now have numerous applications in crop research and have been used successfully to edit major crops to develop resistance against abiotic and biotic stress. By adopting high-throughput phenotyping approaches and big data analytics tools like artificial intelligence (AI) and machine learning (ML), agriculture is heading toward automation or digitalization. The integration of speed breeding with genomic and phenomic tools can allow rapid gene identifications and ultimately accelerate crop improvement programs. In addition, the integration of next-generation multidisciplinary breeding platforms can open exciting avenues to develop climate-ready crops toward global food security.

Sandhu KS, Aoun M, Morris CF, Carter AH. Genomic Selection for End-Use Quality and Processing Traits in Soft White Winter Wheat Breeding Program with Machine and Deep Learning Models. BIOLOGY 2021;10:689. [PMID: 34356544 PMCID: PMC8301459 DOI: 10.3390/biology10070689] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Revised: 07/13/2021] [Accepted: 07/17/2021] [Indexed: 01/12/2023]

Abstract

Breeding for grain yield, biotic and abiotic stress resistance, and end-use quality are important goals of wheat breeding programs. Screening for end-use quality traits is usually secondary to grain yield due to high labor needs, cost of testing, and large seed requirements for phenotyping. Genomic selection provides an alternative to predict performance using genome-wide markers under forward and across-location predictions, where a previous year's dataset can be used to build the models. Due to large datasets in breeding programs, we explored the potential of machine and deep learning models to predict fourteen end-use quality traits in a winter wheat breeding program. The population used consisted of 666 wheat genotypes screened for five years (2015-19) at two locations (Pullman and Lind, WA, USA). Nine different models, including two machine learning (random forest and support vector machine) and two deep learning models (convolutional neural network and multilayer perceptron), were explored for cross-validation, forward, and across-location predictions. The prediction accuracies for different traits varied from 0.45-0.81, 0.29-0.55, and 0.27-0.50 under cross-validation, forward, and across-location predictions, respectively. In general, forward prediction accuracies kept increasing over time due to increments in training data size, and this was more evident for machine and deep learning models. Deep learning models were superior to the traditional ridge regression best linear unbiased prediction (RRBLUP) and Bayesian models under all prediction scenarios. The high accuracy observed for end-use quality traits in this study supports predicting them in early generations, leading to the advancement of superior genotypes to more extensive grain yield trials. Furthermore, the superior performance of machine and deep learning models strengthens the idea of including them in large-scale breeding programs for predicting complex traits.

Zeng S, Mao Z, Ren Y, Wang D, Xu D, Joshi T. G2PDeep: a web-based deep-learning framework for quantitative phenotype prediction and discovery of genomic markers. Nucleic Acids Res 2021;49:W228-W236. [PMID: 34037802 PMCID: PMC8262736 DOI: 10.1093/nar/gkab407] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 04/28/2021] [Accepted: 05/03/2021] [Indexed: 11/13/2022] Open

Marsh JI, Hu H, Gill M, Batley J, Edwards D. Crop breeding for a changing climate: integrating phenomics and genomics with bioinformatics. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021;134:1677-1690. [PMID: 33852055 DOI: 10.1007/s00122-021-03820-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 03/18/2021] [Indexed: 05/05/2023]

Gaillochet C, Develtere W, Jacobs TB. CRISPR screens in plants: approaches, guidelines, and future prospects. THE PLANT CELL 2021;33:794-813. [PMID: 33823021 PMCID: PMC8226290 DOI: 10.1093/plcell/koab099] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 04/02/2021] [Indexed: 05/20/2023]