Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Grinberg NF, Orhobor OI, King RD. An evaluation of machine-learning for predicting phenotype: studies in yeast, rice, and wheat. Mach Learn 2019;109:251-277. [PMID: 32174648 PMCID: PMC7048706 DOI: 10.1007/s10994-019-05848-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 09/17/2019] [Accepted: 09/19/2019] [Indexed: 11/01/2022]

For:	Grinberg NF, Orhobor OI, King RD. An evaluation of machine-learning for predicting phenotype: studies in yeast, rice, and wheat. Mach Learn 2019;109:251-277. [PMID: 32174648 PMCID: PMC7048706 DOI: 10.1007/s10994-019-05848-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 09/17/2019] [Accepted: 09/19/2019] [Indexed: 11/01/2022]

Number

Cited by Other Article(s)

Botkin J, Medina C, Park S, Poudel K, Cha M, Lee Y, Prom LK, Curtin SJ, Xu Z, Ahn E. Analyzing Medicago spp. seed morphology using GWAS and machine learning. Sci Rep 2024;14:17588. [PMID: 39080407 PMCID: PMC11289399 DOI: 10.1038/s41598-024-67790-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 07/16/2024] [Indexed: 08/02/2024] Open

Odriozola I, Rasmussen JA, Gilbert MTP, Limborg MT, Alberdi A. A practical introduction to holo-omics. CELL REPORTS METHODS 2024;4:100820. [PMID: 38986611 PMCID: PMC11294832 DOI: 10.1016/j.crmeth.2024.100820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 04/17/2024] [Accepted: 06/20/2024] [Indexed: 07/12/2024]

Li X, Chen X, Wang Q, Yang N, Sun C. Integrating Bioinformatics and Machine Learning for Genomic Prediction in Chickens. Genes (Basel) 2024;15:690. [PMID: 38927626 PMCID: PMC11202573 DOI: 10.3390/genes15060690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 05/12/2024] [Accepted: 05/23/2024] [Indexed: 06/28/2024] Open

Cortés AJ. Abiotic Stress Tolerance Boosted by Genetic Diversity in Plants. Int J Mol Sci 2024;25:5367. [PMID: 38791404 PMCID: PMC11121514 DOI: 10.3390/ijms25105367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 03/14/2024] [Indexed: 05/26/2024] Open

Sandell FL, Holzweber T, Street NR, Dohm JC, Himmelbauer H. Genomic basis of seed colour in quinoa inferred from variant patterns using extreme gradient boosting. PLANT BIOTECHNOLOGY JOURNAL 2024;22:1312-1324. [PMID: 38213076 PMCID: PMC11022794 DOI: 10.1111/pbi.14267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 11/03/2023] [Accepted: 11/28/2023] [Indexed: 01/13/2024]

Chang-Brahim I, Koppensteiner LJ, Beltrame L, Bodner G, Saranti A, Salzinger J, Fanta-Jende P, Sulzbachner C, Bruckmüller F, Trognitz F, Samad-Zamini M, Zechner E, Holzinger A, Molin EM. Reviewing the essential roles of remote phenotyping, GWAS and explainable AI in practical marker-assisted selection for drought-tolerant winter wheat breeding. FRONTIERS IN PLANT SCIENCE 2024;15:1319938. [PMID: 38699541 PMCID: PMC11064034 DOI: 10.3389/fpls.2024.1319938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 03/13/2024] [Indexed: 05/05/2024]

Abstract

Marker-assisted selection (MAS) plays a crucial role in crop breeding improving the speed and precision of conventional breeding programmes by quickly and reliably identifying and selecting plants with desired traits. However, the efficacy of MAS depends on several prerequisites, with precise phenotyping being a key aspect of any plant breeding programme. Recent advancements in high-throughput remote phenotyping, facilitated by unmanned aerial vehicles coupled to machine learning, offer a non-destructive and efficient alternative to traditional, time-consuming, and labour-intensive methods. Furthermore, MAS relies on knowledge of marker-trait associations, commonly obtained through genome-wide association studies (GWAS), to understand complex traits such as drought tolerance, including yield components and phenology. However, GWAS has limitations that artificial intelligence (AI) has been shown to partially overcome. Additionally, AI and its explainable variants, which ensure transparency and interpretability, are increasingly being used as recognised problem-solving tools throughout the breeding process. Given these rapid technological advancements, this review provides an overview of state-of-the-art methods and processes underlying each MAS, from phenotyping, genotyping and association analyses to the integration of explainable AI along the entire workflow. In this context, we specifically address the challenges and importance of breeding winter wheat for greater drought tolerance with stable yields, as regional droughts during critical developmental stages pose a threat to winter wheat production. Finally, we explore the transition from scientific progress to practical implementation and discuss ways to bridge the gap between cutting-edge developments and breeders, expediting MAS-based winter wheat breeding for drought tolerance.

Collapse

Affiliation(s)

Ignacio Chang-Brahim Unit Bioresources, Center for Health & Bioresources, AIT Austrian Institute of Technology, Tulln, Austria
Lukas J. Koppensteiner Saatzucht Edelhof GmbH, Zwettl, Austria
Lorenzo Beltrame Unit Assistive and Autonomous Systems, Center for Vision, Automation & Control, AIT Austrian Institute of Technology, Vienna, Austria
Gernot Bodner Department of Crop Sciences, Institute of Agronomy, University of Natural Resources and Life Sciences Vienna, Tulln, Austria
Anna Saranti Human-Centered AI Lab, Department of Forest- and Soil Sciences, Institute of Forest Engineering, University of Natural Resources and Life Sciences Vienna, Vienna, Austria
Jules Salzinger Unit Assistive and Autonomous Systems, Center for Vision, Automation & Control, AIT Austrian Institute of Technology, Vienna, Austria
Phillipp Fanta-Jende Unit Assistive and Autonomous Systems, Center for Vision, Automation & Control, AIT Austrian Institute of Technology, Vienna, Austria
Christoph Sulzbachner Unit Assistive and Autonomous Systems, Center for Vision, Automation & Control, AIT Austrian Institute of Technology, Vienna, Austria
Felix Bruckmüller Unit Assistive and Autonomous Systems, Center for Vision, Automation & Control, AIT Austrian Institute of Technology, Vienna, Austria
Friederike Trognitz Unit Bioresources, Center for Health & Bioresources, AIT Austrian Institute of Technology, Tulln, Austria
Mina Samad-Zamini Saatzucht Edelhof GmbH, Zwettl, Austria
Elisabeth Zechner Verein zur Förderung einer nachhaltigen und regionalen Pflanzenzüchtung, Zwettl, Austria
Andreas Holzinger Human-Centered AI Lab, Department of Forest- and Soil Sciences, Institute of Forest Engineering, University of Natural Resources and Life Sciences Vienna, Vienna, Austria
Eva M. Molin Unit Bioresources, Center for Health & Bioresources, AIT Austrian Institute of Technology, Tulln, Austria Human-Centered AI Lab, Department of Forest- and Soil Sciences, Institute of Forest Engineering, University of Natural Resources and Life Sciences Vienna, Vienna, Austria

Collapse

Egebjerg JM, Szomek M, Thaysen K, Juhl AD, Kozakijevic S, Werner S, Pratsch C, Schneider G, Kapishnikov S, Ekman A, Röttger R, Wüstner D. Automated quantification of vacuole fusion and lipophagy in Saccharomyces cerevisiae from fluorescence and cryo-soft X-ray microscopy data using deep learning. Autophagy 2024;20:902-922. [PMID: 37908116 PMCID: PMC11062380 DOI: 10.1080/15548627.2023.2270378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 10/02/2023] [Indexed: 11/02/2023] Open

Abstract

During starvation in the yeast Saccharomyces cerevisiae vacuolar vesicles fuse and lipid droplets (LDs) can become internalized into the vacuole in an autophagic process named lipophagy. There is a lack of tools to quantitatively assess starvation-induced vacuole fusion and lipophagy in intact cells with high resolution and throughput. Here, we combine soft X-ray tomography (SXT) with fluorescence microscopy and use a deep-learning computational approach to visualize and quantify these processes in yeast. We focus on yeast homologs of mammalian NPC1 (NPC intracellular cholesterol transporter 1; Ncr1 in yeast) and NPC2 proteins, whose dysfunction leads to Niemann Pick type C (NPC) disease in humans. We developed a convolutional neural network (CNN) model which classifies fully fused versus partially fused vacuoles based on fluorescence images of stained cells. This CNN, named Deep Yeast Fusion Network (DYFNet), revealed that cells lacking Ncr1 (ncr1∆ cells) or Npc2 (npc2∆ cells) have a reduced capacity for vacuole fusion. Using a second CNN model, we implemented a pipeline named LipoSeg to perform automated instance segmentation of LDs and vacuoles from high-resolution reconstructions of X-ray tomograms. From that, we obtained 3D renderings of LDs inside and outside of the vacuole in a fully automated manner and additionally measured droplet volume, number, and distribution. We find that ncr1∆ and npc2∆ cells could ingest LDs into vacuoles normally but showed compromised degradation of LDs and accumulation of lipid vesicles inside vacuoles. Our new method is versatile and allows for analysis of vacuole fusion, droplet size and lipophagy in intact cells.Abbreviations: BODIPY493/503: 4,4-difluoro-1,3,5,7,8-pentamethyl-4-bora-3a,4a-diaza-s-Indacene; BPS: bathophenanthrolinedisulfonic acid disodium salt hydrate; CNN: convolutional neural network; DHE; dehydroergosterol; npc2∆, yeast deficient in Npc2; DSC, Dice similarity coefficient; EM, electron microscopy; EVs, extracellular vesicles; FIB-SEM, focused ion beam milling-scanning electron microscopy; FM 4-64, N-(3-triethylammoniumpropyl)-4-(6-[4-{diethylamino} phenyl] hexatrienyl)-pyridinium dibromide; LDs, lipid droplets; Ncr1, yeast homolog of human NPC1 protein; ncr1∆, yeast deficient in Ncr1; NPC, Niemann Pick type C; NPC2, Niemann Pick type C homolog; OD600, optical density at 600 nm; ReLU, rectifier linear unit; PPV, positive predictive value; NPV, negative predictive value; MCC, Matthews correlation coefficient; SXT, soft X-ray tomography; UV, ultraviolet; YPD, yeast extract peptone dextrose.

Collapse

Zhou W, Yan Z, Zhang L. A comparative study of 11 non-linear regression models highlighting autoencoder, DBN, and SVR, enhanced by SHAP importance analysis in soybean branching prediction. Sci Rep 2024;14:5905. [PMID: 38467662 PMCID: PMC10928191 DOI: 10.1038/s41598-024-55243-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/21/2024] [Indexed: 03/13/2024] Open

Abstract

To explore a robust tool for advancing digital breeding practices through an artificial intelligence-driven phenotype prediction expert system, we undertook a thorough analysis of 11 non-linear regression models. Our investigation specifically emphasized the significance of Support Vector Regression (SVR) and SHapley Additive exPlanations (SHAP) in predicting soybean branching. By using branching data (phenotype) of 1918 soybean accessions and 42 k SNP (Single Nucleotide Polymorphism) polymorphic data (genotype), this study systematically compared 11 non-linear regression AI models, including four deep learning models (DBN (deep belief network) regression, ANN (artificial neural network) regression, Autoencoders regression, and MLP (multilayer perceptron) regression) and seven machine learning models (e.g., SVR (support vector regression), XGBoost (eXtreme Gradient Boosting) regression, Random Forest regression, LightGBM regression, GPs (Gaussian processes) regression, Decision Tree regression, and Polynomial regression). After being evaluated by four valuation metrics: R2 (R-squared), MAE (Mean Absolute Error), MSE (Mean Squared Error), and MAPE (Mean Absolute Percentage Error), it was found that the SVR, Polynomial Regression, DBN, and Autoencoder outperformed other models and could obtain a better prediction accuracy when they were used for phenotype prediction. In the assessment of deep learning approaches, we exemplified the SVR model, conducting analyses on feature importance and gene ontology (GO) enrichment to provide comprehensive support. After comprehensively comparing four feature importance algorithms, no notable distinction was observed in the feature importance ranking scores across the four algorithms, namely Variable Ranking, Permutation, SHAP, and Correlation Matrix, but the SHAP value could provide rich information on genes with negative contributions, and SHAP importance was chosen for feature selection. The results of this study offer valuable insights into AI-mediated plant breeding, addressing challenges faced by traditional breeding programs. The method developed has broad applicability in phenotype prediction, minor QTL (quantitative trait loci) mining, and plant smart-breeding systems, contributing significantly to the advancement of AI-based breeding practices and transitioning from experience-based to data-based breeding.

Collapse

Kerruish DWM, Cormican P, Kenny EM, Kearns J, Colgan E, Boulton CA, Stelma SNE. The origins of the Guinness stout yeast. Commun Biol 2024;7:68. [PMID: 38216745 PMCID: PMC10786833 DOI: 10.1038/s42003-023-05587-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 11/14/2023] [Indexed: 01/14/2024] Open

Bonet D, Levin M, Montserrat DM, Ioannidis AG. Machine Learning Strategies for Improved Phenotype Prediction in Underrepresented Populations. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2024;29:404-418. [PMID: 38160295 PMCID: PMC10799683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/03/2024]

Heinrich F, Lange TM, Kircher M, Ramzan F, Schmitt AO, Gültas M. Exploring the potential of incremental feature selection to improve genomic prediction accuracy. Genet Sel Evol 2023;55:78. [PMID: 37946104 PMCID: PMC10634161 DOI: 10.1186/s12711-023-00853-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 11/02/2023] [Indexed: 11/12/2023] Open

Abstract

BACKGROUND

The ever-increasing availability of high-density genomic markers in the form of single nucleotide polymorphisms (SNPs) enables genomic prediction, i.e. the inference of phenotypes based solely on genomic data, in the field of animal and plant breeding, where it has become an important tool. However, given the limited number of individuals, the abundance of variables (SNPs) can reduce the accuracy of prediction models due to overfitting or irrelevant SNPs. Feature selection can help to reduce the number of irrelevant SNPs and increase the model performance. In this study, we investigated an incremental feature selection approach based on ranking the SNPs according to the results of a genome-wide association study that we combined with random forest as a prediction model, and we applied it on several animal and plant datasets.

RESULTS

Applying our approach to different datasets yielded a wide range of outcomes, i.e. from a substantial increase in prediction accuracy in a few cases to minor improvements when only a fraction of the available SNPs were used. Compared with models using all available SNPs, our approach was able to achieve comparable performances with a considerably reduced number of SNPs in several cases. Our approach showcased state-of-the-art efficiency and performance while having a faster computation time.

CONCLUSIONS

The results of our study suggest that our incremental feature selection approach has the potential to improve prediction accuracy substantially. However, this gain seems to depend on the genomic data used. Even for datasets where the number of markers is smaller than the number of individuals, feature selection may still increase the performance of the genomic prediction. Our approach is implemented in R and is available at https://github.com/FelixHeinrich/GP_with_IFS/ .

Collapse

Bonet D, Levin M, Montserrat DM, Ioannidis AG. Machine Learning Strategies for Improved Phenotype Prediction in Underrepresented Populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.12.561949. [PMID: 37904983 PMCID: PMC10614800 DOI: 10.1101/2023.10.12.561949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]

Verplaetse N, Passemiers A, Arany A, Moreau Y, Raimondi D. Large sample size and nonlinear sparse models outline epistatic effects in inflammatory bowel disease. Genome Biol 2023;24:224. [PMID: 37798735 PMCID: PMC10552306 DOI: 10.1186/s13059-023-03064-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/20/2023] [Indexed: 10/07/2023] Open

Sadeqi MB, Ballvora A, Dadshani S, Léon J. Genetic Parameter and Hyper-Parameter Estimation Underlie Nitrogen Use Efficiency in Bread Wheat. Int J Mol Sci 2023;24:14275. [PMID: 37762585 PMCID: PMC10531695 DOI: 10.3390/ijms241814275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 09/07/2023] [Accepted: 09/14/2023] [Indexed: 09/29/2023] Open

Duc NT, Ramlal A, Rajendran A, Raju D, Lal SK, Kumar S, Sahoo RN, Chinnusamy V. Image-based phenotyping of seed architectural traits and prediction of seed weight using machine learning models in soybean. FRONTIERS IN PLANT SCIENCE 2023;14:1206357. [PMID: 37771485 PMCID: PMC10523016 DOI: 10.3389/fpls.2023.1206357] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 07/26/2023] [Indexed: 09/30/2023]

Abstract

Among seed attributes, weight is one of the main factors determining the soybean harvest index. Recently, the focus of soybean breeding has shifted to improving seed size and weight for crop optimization in terms of seed and oil yield. With recent technological advancements, there is an increasing application of imaging sensors that provide simple, real-time, non-destructive, and inexpensive image data for rapid image-based prediction of seed traits in plant breeding programs. The present work is related to digital image analysis of seed traits for the prediction of hundred-seed weight (HSW) in soybean. The image-based seed architectural traits (i-traits) measured were area size (AS), perimeter length (PL), length (L), width (W), length-to-width ratio (LWR), intersection of length and width (IS), seed circularity (CS), and distance between IS and CG (DS). The phenotypic investigation revealed significant genetic variability among 164 soybean genotypes for both i-traits and manually measured seed weight. Seven popular machine learning (ML) algorithms, namely Simple Linear Regression (SLR), Multiple Linear Regression (MLR), Random Forest (RF), Support Vector Regression (SVR), LASSO Regression (LR), Ridge Regression (RR), and Elastic Net Regression (EN), were used to create models that can predict the weight of soybean seeds based on the image-based novel features derived from the Red-Green-Blue (RGB)/visual image. Among the models, random forest and multiple linear regression models that use multiple explanatory variables related to seed size traits (AS, L, W, and DS) were identified as the best models for predicting seed weight with the highest prediction accuracy (coefficient of determination, R2=0.98 and 0.94, respectively) and the lowest prediction error, i.e., root mean square error (RMSE) and mean absolute error (MAE). Finally, principal components analysis (PCA) and a hierarchical clustering approach were used to identify IC538070 as a superior genotype with a larger seed size and weight. The identified donors/traits can potentially be used in soybean improvement programs.

Collapse

Kovuri P, Yadav A, Sinha H. Role of genetic architecture in phenotypic plasticity. Trends Genet 2023;39:703-714. [PMID: 37173192 DOI: 10.1016/j.tig.2023.04.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 04/06/2023] [Accepted: 04/11/2023] [Indexed: 05/15/2023]

Xu B, Meng R, Chen G, Liang L, Lv Z, Zhou L, Sun R, Zhao F, Yang W. Improved weed mapping in corn fields by combining UAV-based spectral, textural, structural, and thermal measurements. PEST MANAGEMENT SCIENCE 2023;79:2591-2602. [PMID: 36883563 DOI: 10.1002/ps.7443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 01/20/2023] [Accepted: 03/08/2023] [Indexed: 06/02/2023]

Abstract

BACKGROUND

Spatial-explicit weed information is critical for controlling weed infestation and reducing corn yield losses. The development of unmanned aerial vehicle (UAV)-based remote sensing presents an unprecedented opportunity for efficient, timely weed mapping. Spectral, textural, and structural measurements have been used for weed mapping, whereas thermal measurements-for example, canopy temperature (CT)-were seldom considered and used. In this study, we quantified the optimal combination of spectral, textural, structural, and CT measurements based on different machine-learning algorithms for weed mapping.

RESULTS

CT improved weed-mapping accuracies as complementary information for spectral, textural, and structural features (up to 5% and 0.051 improvements in overall accuracy [OA] and Marco-F1, respectively). The fusion of textural, structural, and thermal features achieved the best performance in weed mapping (OA = 96.4%, Marco-F1 = 0.964), followed by the fusion of structural and thermal features (OA = 93.6%, Marco-F1 = 0.936). The Support Vector Machine-based model achieved the best performance in weed mapping, with 3.5% and 7.1% improvements in OA and 0.036 and 0.071 in Marco-F1 respectively, compared with the best models of Random Forest and Naïve Bayes Classifier.

CONCLUSION

Thermal measurement can complement other types of remote-sensing measurements and improve the weed-mapping accuracy within the data-fusion framework. Importantly, integrating textural, structural, and thermal features achieved the best performance for weed mapping. Our study provides a novel method for weed mapping using UAV-based multisource remote sensing measurements, which is critical for ensuring crop production in precision agriculture. © 2023 The Authors. Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.

Collapse

Zhao L, Walkowiak S, Fernando WGD. Artificial Intelligence: A Promising Tool in Exploring the Phytomicrobiome in Managing Disease and Promoting Plant Health. PLANTS (BASEL, SWITZERLAND) 2023;12:plants12091852. [PMID: 37176910 PMCID: PMC10180744 DOI: 10.3390/plants12091852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 04/25/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023]

Liang M, Cao S, Deng T, Du L, Li K, An B, Du Y, Xu L, Zhang L, Gao X, Li J, Guo P, Gao H. MAK: a machine learning framework improved genomic prediction via multi-target ensemble regressor chains and automatic selection of assistant traits. Brief Bioinform 2023;24:7031157. [PMID: 36752363 DOI: 10.1093/bib/bbad043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 01/13/2023] [Accepted: 01/20/2023] [Indexed: 02/09/2023] Open

Wang W, Guo W, Le L, Yu J, Wu Y, Li D, Wang Y, Wang H, Lu X, Qiao H, Gu X, Tian J, Zhang C, Pu L. Integration of high-throughput phenotyping, GWAS, and predictive models reveals the genetic architecture of plant height in maize. MOLECULAR PLANT 2023;16:354-373. [PMID: 36447436 DOI: 10.1016/j.molp.2022.11.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 09/05/2022] [Accepted: 11/27/2022] [Indexed: 06/16/2023]

Abstract

Plant height (PH) is an essential trait in maize (Zea mays) that is tightly associated with planting density, biomass, lodging resistance, and grain yield in the field. Dissecting the dynamics of maize plant architecture will be beneficial for ideotype-based maize breeding and prediction, as the genetic basis controlling PH in maize remains largely unknown. In this study, we developed an automated high-throughput phenotyping platform (HTP) to systematically and noninvasively quantify 77 image-based traits (i-traits) and 20 field traits (f-traits) for 228 maize inbred lines across all developmental stages. Time-resolved i-traits with novel digital phenotypes and complex correlations with agronomic traits were characterized to reveal the dynamics of maize growth. An i-trait-based genome-wide association study identified 4945 trait-associated SNPs, 2603 genetic loci, and 1974 corresponding candidate genes. We found that rapid growth of maize plants occurs mainly at two developmental stages, stage 2 (S2) to S3 and S5 to S6, accounting for the final PH indicators. By integrating the PH-association network with the transcriptome profiles of specific internodes, we revealed 13 hub genes that may play vital roles during rapid growth. The candidate genes and novel i-traits identified at multiple growth stages may be used as potential indicators for final PH in maize. One candidate gene, ZmVATE, was functionally validated and shown to regulate PH-related traits in maize using genetic mutation. Furthermore, machine learning was used to build predictive models for final PH based on i-traits, and their performance was assessed across developmental stages. Moderate, strong, and very strong correlations between predictions and experimental datasets were achieved from the early S4 (tenth-leaf) stage. Colletively, our study provides a valuable tool for dissecting the spatiotemporal formation of specific internodes and the genetic architecture of PH, as well as resources and predictive models that are useful for molecular design breeding and predicting maize varieties with ideal plant architectures.

Collapse

Affiliation(s)

Weixuan Wang Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China
Weijun Guo Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Liang Le Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Jia Yu Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Yue Wu Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Dongwei Li Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Yifan Wang Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Huan Wang Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Xiaoduo Lu Institute of Molecular Breeding for Maize, Qilu Normal University, Jinan 250200, China
Hong Qiao Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, TX 78712, USA; Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
Xiaofeng Gu Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Jian Tian Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
Chunyi Zhang Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; Sanya Institute, Hainan Academy of Agricultural Sciences, Sanya 572000, China.
Li Pu Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya 572024, China.

Collapse

Guo T, Li X. Machine learning for predicting phenotype from genotype and environment. Curr Opin Biotechnol 2023;79:102853. [PMID: 36463837 DOI: 10.1016/j.copbio.2022.102853] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 11/01/2022] [Accepted: 11/07/2022] [Indexed: 12/03/2022]

Farooq M, van Dijk AD, Nijveen H, Mansoor S, de Ridder D. Genomic prediction in plants: opportunities for ensemble machine learning based approaches. F1000Res 2023;11:802. [PMID: 37035464 PMCID: PMC10080209 DOI: 10.12688/f1000research.122437.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/04/2023] [Indexed: 01/12/2023] Open

Raimondi D, Orlando G, Verplaetse N, Fariselli P, Moreau Y. Editorial: Towards genome interpretation: Computational methods to model the genotype-phenotype relationship. FRONTIERS IN BIOINFORMATICS 2022;2:1098941. [PMID: 36530385 PMCID: PMC9749061 DOI: 10.3389/fbinf.2022.1098941] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 11/17/2022] [Indexed: 11/12/2023] Open

Wang K, Yang B, Li Q, Liu S. Systematic Evaluation of Genomic Prediction Algorithms for Genomic Prediction and Breeding of Aquatic Animals. Genes (Basel) 2022;13:genes13122247. [PMID: 36553514 PMCID: PMC9778314 DOI: 10.3390/genes13122247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Revised: 11/18/2022] [Accepted: 11/25/2022] [Indexed: 12/04/2022] Open

Durge AR, Shrimankar DD, Sawarkar AD. Heuristic Analysis of Genomic Sequence Processing Models for High Efficiency Prediction: A Statistical Perspective. Curr Genomics 2022;23:299-317. [PMID: 36778194 PMCID: PMC9878859 DOI: 10.2174/1389202923666220927105311] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 11/22/2022] Open

Xu Y, Zhang X, Li H, Zheng H, Zhang J, Olsen MS, Varshney RK, Prasanna BM, Qian Q. Smart breeding driven by big data, artificial intelligence, and integrated genomic-enviromic prediction. MOLECULAR PLANT 2022;15:1664-1695. [PMID: 36081348 DOI: 10.1016/j.molp.2022.09.001] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 09/02/2022] [Indexed: 05/12/2023]

Abstract

The first paradigm of plant breeding involves direct selection-based phenotypic observation, followed by predictive breeding using statistical models for quantitative traits constructed based on genetic experimental design and, more recently, by incorporation of molecular marker genotypes. However, plant performance or phenotype (P) is determined by the combined effects of genotype (G), envirotype (E), and genotype by environment interaction (GEI). Phenotypes can be predicted more precisely by training a model using data collected from multiple sources, including spatiotemporal omics (genomics, phenomics, and enviromics across time and space). Integration of 3D information profiles (G-P-E), each with multidimensionality, provides predictive breeding with both tremendous opportunities and great challenges. Here, we first review innovative technologies for predictive breeding. We then evaluate multidimensional information profiles that can be integrated with a predictive breeding strategy, particularly envirotypic data, which have largely been neglected in data collection and are nearly untouched in model construction. We propose a smart breeding scheme, integrated genomic-enviromic prediction (iGEP), as an extension of genomic prediction, using integrated multiomics information, big data technology, and artificial intelligence (mainly focused on machine and deep learning). We discuss how to implement iGEP, including spatiotemporal models, environmental indices, factorial and spatiotemporal structure of plant breeding data, and cross-species prediction. A strategy is then proposed for prediction-based crop redesign at both the macro (individual, population, and species) and micro (gene, metabolism, and network) scales. Finally, we provide perspectives on translating smart breeding into genetic gain through integrative breeding platforms and open-source breeding initiatives. We call for coordinated efforts in smart breeding through iGEP, institutional partnerships, and innovative technological support.

Collapse

Pedrini S, Doecke JD, Hone E, Wang P, Thota R, Bush AI, Rowe CC, Dore V, Villemagne VL, Ames D, Rainey‐Smith S, Verdile G, Sohrabi HR, Raida MR, Taddei K, Gandy S, Masters CL, Chatterjee P, Martins R. Plasma high-density lipoprotein cargo is altered in Alzheimer's disease and is associated with regional brain volume. J Neurochem 2022;163:53-67. [PMID: 36000528 PMCID: PMC9804612 DOI: 10.1111/jnc.15681] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 07/12/2022] [Accepted: 07/22/2022] [Indexed: 01/05/2023]

Affiliation(s)

Steve Pedrini School of Medical SciencesEdith Cowan UniversityJoondalupWestern AustraliaAustralia,CRC for Mental HealthMelbourneVictoriaAustralia
James D. Doecke Australian E‐Health Research CentreCSIROBrisbaneQueenslandAustralia
Eugene Hone School of Medical SciencesEdith Cowan UniversityJoondalupWestern AustraliaAustralia,CRC for Mental HealthMelbourneVictoriaAustralia
Penghao Wang College of Science, Health, Engineering and EducationMurdoch UniversityMurdochWestern AustraliaAustralia
Rohith Thota Faculty of Medicine, Health and Human Sciences, Department of Biomedical SciencesMacquarie UniversitySydneyNew South WalesAustralia
Ashley I. Bush CRC for Mental HealthMelbourneVictoriaAustralia,The Florey Institute, The University of MelbourneParkvilleVictoriaAustralia
Christopher C. Rowe Department of Nuclear Medicine and Centre for PETAustin HealthHeidelbergVictoriaAustralia
Vincent Dore Department of Nuclear Medicine and Centre for PETAustin HealthHeidelbergVictoriaAustralia
Victor L. Villemagne Department of PsychiatryUniversity of PittsburghPittsburghPennsylvaniaUSA
David Ames National Ageing Research InstituteParkvilleVictoriaAustralia,University of Melbourne Academic unit for Psychiatry of Old AgeSt George's HospitalKewVictoriaAustralia
Stephanie Rainey‐Smith School of Medical SciencesEdith Cowan UniversityJoondalupWestern AustraliaAustralia,Centre for Healthy Ageing, Health Futures InstituteMurdoch UniversityMurdochWestern AustraliaAustralia
Giuseppe Verdile Curtin Medical SchoolCurtin UniversityBentleyWestern AustraliaAustralia,Curtin Health Innovation Research InstituteCurtin UniversityBentleyWestern AustraliaAustralia
Hamid R. Sohrabi Centre for Healthy Ageing, Health Futures InstituteMurdoch UniversityMurdochWestern AustraliaAustralia
Manfred R. Raida Life Science Institute, Singapore Lipidomics IncubatorNational University of SingaporeSingapore CitySingapore
Kevin Taddei School of Medical SciencesEdith Cowan UniversityJoondalupWestern AustraliaAustralia,CRC for Mental HealthMelbourneVictoriaAustralia
Sam Gandy Department of NeurologyIcahn School of Medicine at Mount SinaiNew York CityNew YorkUSA
Colin L. Masters The Florey Institute, The University of MelbourneParkvilleVictoriaAustralia
Pratishtha Chatterjee Faculty of Medicine, Health and Human Sciences, Department of Biomedical SciencesMacquarie UniversitySydneyNew South WalesAustralia
Ralph N. Martins School of Medical SciencesEdith Cowan UniversityJoondalupWestern AustraliaAustralia,CRC for Mental HealthMelbourneVictoriaAustralia,Faculty of Medicine, Health and Human Sciences, Department of Biomedical SciencesMacquarie UniversitySydneyNew South WalesAustralia,School of Psychiatry and Clinical NeurosciencesUniversity of Western AustraliaCrawleyWestern AustraliaAustralia
the AIBL Research Group

Collapse

Ayat M, Domaratzki M. Sparse bayesian learning for genomic selection in yeast. FRONTIERS IN BIOINFORMATICS 2022;2:960889. [PMID: 36304259 PMCID: PMC9580947 DOI: 10.3389/fbinf.2022.960889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Accepted: 08/02/2022] [Indexed: 11/13/2022] Open

Farooq M, van Dijk AD, Nijveen H, Mansoor S, de Ridder D. Genomic prediction in plants: opportunities for ensemble machine learning based approaches. F1000Res 2022;11:802. [PMID: 37035464 PMCID: PMC10080209 DOI: 10.12688/f1000research.122437.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/08/2022] [Indexed: 12/15/2022] Open

Imbalanced regression using regressor-classifier ensembles. Mach Learn 2022. [DOI: 10.1007/s10994-022-06199-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

Zhang Q, Zhang Q, Jensen J. Association Studies and Genomic Prediction for Genetic Improvements in Agriculture. FRONTIERS IN PLANT SCIENCE 2022;13:904230. [PMID: 35720549 PMCID: PMC9201771 DOI: 10.3389/fpls.2022.904230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Accepted: 05/16/2022] [Indexed: 06/15/2023]

Wang W, Cheng Y, Ren Y, Zhang Z, Geng H. Prediction of Chlorophyll Content in Multi-Temporal Winter Wheat Based on Multispectral and Machine Learning. FRONTIERS IN PLANT SCIENCE 2022;13:896408. [PMID: 35712585 PMCID: PMC9197342 DOI: 10.3389/fpls.2022.896408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 04/19/2022] [Indexed: 06/15/2023]

Abstract

To obtain the canopy chlorophyll content of winter wheat in a rapid and non-destructive high-throughput manner, the study was conducted on winter wheat in Xinjiang Manas Experimental Base in 2021, and the multispectral images of two water treatments' normal irrigation (NI) and drought stress (DS) in three key fertility stages (heading, flowering, and filling) of winter wheat were obtained by DJI P4M unmanned aerial vehicle (UAV). The flag leaf chlorophyll content (CC) data of different genotypes in the field were obtained by SPAD-502 Plus chlorophyll meter. Firstly, the CC distribution of different genotypes was studied, then, 13 vegetation indices, combined with the Random Forest algorithm and correlation evaluation of CC, and 14 vegetation indices were used for vegetation index preference. Finally, preferential vegetation indices and nine machine learning algorithms, Ridge regression with cross-validation (RidgeCV), Ridge, Adaboost Regression, Bagging_Regressor, K_Neighbor, Gradient_Boosting_Regressor, Random Forest, Support Vector Machine (SVM), and Least absolute shrinkage and selection operator (Lasso), were preferentially selected to construct the CC estimation models under two water treatments at three different fertility stages, which were evaluated by correlation coefficient (r), root means square error (RMSE) and the normalized root mean square error (NRMSE) to select the optimal estimation model. The results showed that the CC values under normal irrigation were higher than those underwater limitation treatment at different fertility stages; several vegetation indices and CC values showed a highly significant correlation, with the highest correlation reaching.51; in the prediction model construction of CC values, different models under normal irrigation and water limitation treatment had high estimation accuracy, among which the model with the highest prediction accuracy under normal irrigation was at the heading stage. The highest precision of the model prediction under normal irrigation was in the RidgeCV model (r = 0.63, RMSE = 3.28, NRMSE = 16.2%) and the highest precision of the model prediction under water limitation treatment was in the SVM model (r = 0.63, RMSE = 3.47, NRMSE = 19.2%).

Collapse

Danilevicz MF, Gill M, Anderson R, Batley J, Bennamoun M, Bayer PE, Edwards D. Plant Genotype to Phenotype Prediction Using Machine Learning. Front Genet 2022;13:822173. [PMID: 35664329 PMCID: PMC9159391 DOI: 10.3389/fgene.2022.822173] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 03/07/2022] [Indexed: 12/13/2022] Open

Genome-Enabled Prediction Methods Based on Machine Learning. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022;2467:189-218. [PMID: 35451777 DOI: 10.1007/978-1-0716-2205-6_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Obesity-Associated Differentially Methylated Regions in Colon Cancer. J Pers Med 2022;12:jpm12050660. [PMID: 35629083 PMCID: PMC9142939 DOI: 10.3390/jpm12050660] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 04/11/2022] [Accepted: 04/18/2022] [Indexed: 02/01/2023] Open

Parvandeh S, Donehower LA, Katsonis P, Hsu TK, Asmussen J, Lee K, Lichtarge O. EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants. Nucleic Acids Res 2022;50:e70. [PMID: 35412634 PMCID: PMC9262594 DOI: 10.1093/nar/gkac215] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 03/17/2022] [Accepted: 03/21/2022] [Indexed: 02/01/2023] Open

Perez BC, Bink MCAM, Svenson KL, Churchill GA, Calus MPL. Prediction performance of linear models and gradient boosting machine on complex phenotypes in outbred mice. G3 (BETHESDA, MD.) 2022;12:6528848. [PMID: 35166767 PMCID: PMC8982369 DOI: 10.1093/g3journal/jkac039] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 01/29/2022] [Indexed: 12/14/2022]

Bartholomé J, Prakash PT, Cobb JN. Genomic Prediction: Progress and Perspectives for Rice Improvement. Methods Mol Biol 2022;2467:569-617. [PMID: 35451791 DOI: 10.1007/978-1-0716-2205-6_21] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Dalla Lana F, Madden LV, Paul PA. Logistic Models Derived via LASSO Methods for Quantifying the Risk of Natural Contamination of Maize Grain with Deoxynivalenol. PHYTOPATHOLOGY 2021;111:2250-2267. [PMID: 34009008 DOI: 10.1094/phyto-03-21-0104-r] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Raimondi D, Corso M, Fariselli P, Moreau Y. From genotype to phenotype in Arabidopsis thaliana: in-silico genome interpretation predicts 288 phenotypes from sequencing data. Nucleic Acids Res 2021;50:e16. [PMID: 34792168 PMCID: PMC8860592 DOI: 10.1093/nar/gkab1099] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 10/06/2021] [Accepted: 10/22/2021] [Indexed: 01/09/2023] Open

Predicting Heritability of Oil Palm Breeding Using Phenotypic Traits and Machine Learning. SUSTAINABILITY 2021. [DOI: 10.3390/su132212613] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Zhao Y, Lyu X, Xiao W, Tian S, Zhang J, Hu Z, Fu Y. Evaluation of the soil profile quality of subsided land in a coal mining area backfilled with river sediment based on monitoring wheat growth biomass with UAV systems. ENVIRONMENTAL MONITORING AND ASSESSMENT 2021;193:576. [PMID: 34392439 DOI: 10.1007/s10661-021-09250-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 06/28/2021] [Indexed: 06/13/2023]

Awlia M, Alshareef N, Saber N, Korte A, Oakey H, Panzarová K, Trtílek M, Negrão S, Tester M, Julkowska MM. Genetic mapping of the early responses to salt stress in Arabidopsis thaliana. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021;107:544-563. [PMID: 33964046 DOI: 10.1111/tpj.15310] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 03/05/2021] [Accepted: 04/19/2021] [Indexed: 06/12/2023]

Mores A, Borrelli GM, Laidò G, Petruzzino G, Pecchioni N, Amoroso LGM, Desiderio F, Mazzucotelli E, Mastrangelo AM, Marone D. Genomic Approaches to Identify Molecular Bases of Crop Resistance to Diseases and to Develop Future Breeding Strategies. Int J Mol Sci 2021;22:5423. [PMID: 34063853 PMCID: PMC8196592 DOI: 10.3390/ijms22115423] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 04/30/2021] [Accepted: 05/15/2021] [Indexed: 12/16/2022] Open

Affiliation(s)

Antonia Mores Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Grazia Maria Borrelli Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Giovanni Laidò Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Giuseppe Petruzzino Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Nicola Pecchioni Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Luca Giuseppe Maria Amoroso IEEE, Institute of Electrical and Electronics Engineers, 22040 Anzano del Parco, Italy;
Francesca Desiderio Council for Agricultural Research and Economics, Genomics and Bioinformatics Research Center, Via San Protaso 302, 29017 Fiorenzuola d’Arda, Italy; (F.D.); (E.M.)
Elisabetta Mazzucotelli Council for Agricultural Research and Economics, Genomics and Bioinformatics Research Center, Via San Protaso 302, 29017 Fiorenzuola d’Arda, Italy; (F.D.); (E.M.)
Anna Maria Mastrangelo Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)
Daniela Marone Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, S.S. 673, Km 25,200, 71122 Foggia, Italy; (A.M.); (G.M.B.); (G.L.); (G.P.); (N.P.); (A.M.M.)

Collapse

Cortés AJ, López-Hernández F. Harnessing Crop Wild Diversity for Climate Change Adaptation. Genes (Basel) 2021;12:783. [PMID: 34065368 PMCID: PMC8161384 DOI: 10.3390/genes12050783] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 04/28/2021] [Accepted: 05/19/2021] [Indexed: 12/20/2022] Open

Abstract

Warming and drought are reducing global crop production with a potential to substantially worsen global malnutrition. As with the green revolution in the last century, plant genetics may offer concrete opportunities to increase yield and crop adaptability. However, the rate at which the threat is happening requires powering new strategies in order to meet the global food demand. In this review, we highlight major recent 'big data' developments from both empirical and theoretical genomics that may speed up the identification, conservation, and breeding of exotic and elite crop varieties with the potential to feed humans. We first emphasize the major bottlenecks to capture and utilize novel sources of variation in abiotic stress (i.e., heat and drought) tolerance. We argue that adaptation of crop wild relatives to dry environments could be informative on how plant phenotypes may react to a drier climate because natural selection has already tested more options than humans ever will. Because isolated pockets of cryptic diversity may still persist in remote semi-arid regions, we encourage new habitat-based population-guided collections for genebanks. We continue discussing how to systematically study abiotic stress tolerance in these crop collections of wild and landraces using geo-referencing and extensive environmental data. By uncovering the genes that underlie the tolerance adaptive trait, natural variation has the potential to be introgressed into elite cultivars. However, unlocking adaptive genetic variation hidden in related wild species and early landraces remains a major challenge for complex traits that, as abiotic stress tolerance, are polygenic (i.e., regulated by many low-effect genes). Therefore, we finish prospecting modern analytical approaches that will serve to overcome this issue. Concretely, genomic prediction, machine learning, and multi-trait gene editing, all offer innovative alternatives to speed up more accurate pre- and breeding efforts toward the increase in crop adaptability and yield, while matching future global food demands in the face of increased heat and drought. In order for these 'big data' approaches to succeed, we advocate for a trans-disciplinary approach with open-source data and long-term funding. The recent developments and perspectives discussed throughout this review ultimately aim to contribute to increased crop adaptability and yield in the face of heat waves and drought events.

Collapse

Rohde PD, Kristensen TN, Sarup P, Muñoz J, Malmendal A. Prediction of complex phenotypes using the Drosophila melanogaster metabolome. Heredity (Edinb) 2021;126:717-732. [PMID: 33510469 PMCID: PMC8102504 DOI: 10.1038/s41437-021-00404-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/04/2021] [Accepted: 01/04/2021] [Indexed: 01/30/2023] Open

Grinberg NF, Wallace C. Multi-tissue transcriptome-wide association studies. Genet Epidemiol 2021;45:324-337. [PMID: 33369784 PMCID: PMC8048510 DOI: 10.1002/gepi.22374] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 11/04/2020] [Accepted: 11/18/2020] [Indexed: 12/20/2022]

Maldonado C, Mora-Poblete F, Contreras-Soto RI, Ahmar S, Chen JT, do Amaral Júnior AT, Scapim CA. Genome-Wide Prediction of Complex Traits in Two Outcrossing Plant Species Through Deep Learning and Bayesian Regularized Neural Network. FRONTIERS IN PLANT SCIENCE 2020;11:593897. [PMID: 33329658 PMCID: PMC7728740 DOI: 10.3389/fpls.2020.593897] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 10/27/2020] [Indexed: 05/25/2023]

Orhobor OI, Alexandrov NN, King RD. Predicting rice phenotypes with meta and multi-target learning. Mach Learn 2020. [DOI: 10.1007/s10994-020-05881-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Han Y, Adolphs R. Estimating the heritability of psychological measures in the Human Connectome Project dataset. PLoS One 2020;15:e0235860. [PMID: 32645058 PMCID: PMC7347217 DOI: 10.1371/journal.pone.0235860] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 06/24/2020] [Indexed: 12/03/2022] Open

Abstract

The Human Connectome Project (HCP) is a large structural and functional MRI dataset with a rich array of behavioral and genotypic measures, as well as a biologically verified family structure. This makes it a valuable resource for investigating questions about individual differences, including questions about heritability. While its MRI data have been analyzed extensively in this regard, to our knowledge a comprehensive estimation of the heritability of the behavioral dataset has never been conducted. Using a set of behavioral measures of personality, emotion and cognition, we show that it is possible to re-identify the same individual across two testing times (fingerprinting), and to identify identical twins significantly above chance. Standard heritability estimates of 37 behavioral measures were derived from twin correlations, and machine-learning models (univariate linear model, Ridge classifier and Random Forest model) were trained to classify monozygotic twins and dizygotic twins. Correlations between the standard heritability metric and each set of model weights ranged from 0.36 to 0.7, and questionnaire-based and task-based measures did not differ significantly in their heritability. We further explored the heritability of a smaller number of latent factors extracted from the 37 measures and repeated the heritability estimation; in this case, the correlations between the standard heritability and each set of model weights were lower, ranging from 0.05 to 0.43. One specific discrepancy arose for the general intelligence factor, which all models assigned high importance, but the standard heritability calculation did not. We present a thorough investigation of the heritabilities of the behavioral measures in the HCP as a resource for other investigators, and illustrate the utility of machine-learning methods for qualitative characterization of the differential heritability across diverse measures.

Collapse