1
|
Peng S, Rajjou L. Advancing plant biology through deep learning-powered natural language processing. PLANT CELL REPORTS 2024; 43:208. [PMID: 39102077 DOI: 10.1007/s00299-024-03294-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 07/19/2024] [Indexed: 08/06/2024]
Abstract
The application of deep learning methods, specifically the utilization of Large Language Models (LLMs), in the field of plant biology holds significant promise for generating novel knowledge on plant cell systems. The LLM framework exhibits exceptional potential, particularly with the development of Protein Language Models (PLMs), allowing for in-depth analyses of nucleic acid and protein sequences. This analytical capacity facilitates the discernment of intricate patterns and relationships within biological data, encompassing multi-scale information within DNA or protein sequences. The contribution of PLMs extends beyond mere sequence patterns and structure--function recognition; it also supports advancements in genetic improvements for agriculture. The integration of deep learning approaches into the domain of plant sciences offers opportunities for major breakthroughs in basic research across multi-scale plant traits. Consequently, the strategic application of deep learning methodologies, particularly leveraging the potential of LLMs, will undoubtedly play a pivotal role in advancing plant sciences, plant production, plant uses and propelling the trajectory toward sustainable agroecological and agro-food transitions.
Collapse
Affiliation(s)
- Shuang Peng
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin for Plant Sciences (IJPB), 78000, Versailles, France
| | - Loïc Rajjou
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin for Plant Sciences (IJPB), 78000, Versailles, France.
| |
Collapse
|
2
|
Wang H, Moghe GD, Kovaleski AP, Keller M, Martinson TE, Wright AH, Franklin JL, Hébert-Haché A, Provost C, Reinke M, Atucha A, North MG, Russo JP, Helwi P, Centinari M, Londo JP. NYUS.2: an automated machine learning prediction model for the large-scale real-time simulation of grapevine freezing tolerance in North America. HORTICULTURE RESEARCH 2024; 11:uhad286. [PMID: 38487294 PMCID: PMC10939402 DOI: 10.1093/hr/uhad286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 12/17/2023] [Indexed: 03/17/2024]
Abstract
Accurate and real-time monitoring of grapevine freezing tolerance is crucial for the sustainability of the grape industry in cool climate viticultural regions. However, on-site data are limited due to the complexity of measurement. Current prediction models underperform under diverse climate conditions, which limits the large-scale deployment of these methods. We combined grapevine freezing tolerance data from multiple regions in North America and generated a predictive model based on hourly temperature-derived features and cultivar features using AutoGluon, an automated machine learning engine. Feature importance was quantified by AutoGluon and SHAP (SHapley Additive exPlanations) value. The final model was evaluated and compared with previous models for its performance under different climate conditions. The final model achieved an overall 1.36°C root-mean-square error during model testing and outperformed two previous models using three test cultivars at all testing regions. Two feature importance quantification methods identified five shared essential features. Detailed analysis of the features indicates that the model has adequately extracted some biological mechanisms during training. The final model, named NYUS.2, was deployed along with two previous models as an R shiny-based application in the 2022-23 dormancy season, enabling large-scale and real-time simulation of grapevine freezing tolerance in North America for the first time.
Collapse
Affiliation(s)
- Hongrui Wang
- School of Integrative Plant Science, Horticulture Section, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
| | - Gaurav D Moghe
- School of Integrative Plant Science, Plant Biology Section, Cornell University, Ithaca, NY 14850, USA
| | - Al P Kovaleski
- Plant and Agroecosystem Sciences Department, University of Wisconsin–Madison, Madison, WI 53706, USA
| | - Markus Keller
- Department of Viticulture and Enology, Irrigated Agriculture Research and Extension Center, Washington State University, Prosser, WA 99350, USA
| | - Timothy E Martinson
- School of Integrative Plant Science, Horticulture Section, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
| | - A Harrison Wright
- Kentville Research and Development Centre, Agriculture and Agri-Food Canada, Kentville, Nova Scotia, B4N 1J5, Canada
| | - Jeffrey L Franklin
- Kentville Research and Development Centre, Agriculture and Agri-Food Canada, Kentville, Nova Scotia, B4N 1J5, Canada
| | | | - Caroline Provost
- Centre de Recherche Agroalimentaire de Mirabel, Mirabel, Québec, J7N 2X8, Canada
| | - Michael Reinke
- Southwest Michigan Research and Extension Center, Michigan State University, Benton Harbor, MI 49022, USA
| | - Amaya Atucha
- Plant and Agroecosystem Sciences Department, University of Wisconsin–Madison, Madison, WI 53706, USA
| | - Michael G North
- Plant and Agroecosystem Sciences Department, University of Wisconsin–Madison, Madison, WI 53706, USA
| | - Jennifer P Russo
- School of Integrative Plant Science, Horticulture Section, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
| | - Pierre Helwi
- Martell & Co., 7 place Edouard Martell, Cognac 16100, France
| | - Michela Centinari
- Department of Plant Science, The Pennsylvania State University, University Park, PA 16802, USA
| | - Jason P Londo
- School of Integrative Plant Science, Horticulture Section, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
| |
Collapse
|
3
|
Kirschner GK. Because size matters: measuring chloroplast volumes by confocal and scanning electron microscopy. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 117:330-331. [PMID: 38285017 DOI: 10.1111/tpj.16603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2024]
|
4
|
Knoblauch J, Waadt R, Cousins AB, Kunz HH. Probing the in situ volumes of Arabidopsis leaf plastids using three-dimensional confocal and scanning electron microscopy. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 117:332-341. [PMID: 37985241 DOI: 10.1111/tpj.16554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 10/31/2023] [Accepted: 11/08/2023] [Indexed: 11/22/2023]
Abstract
Leaf plastids harbor a plethora of biochemical reactions including photosynthesis, one of the most important metabolic pathways on Earth. Scientists are eager to unveil the physiological processes within the organelle but also their interconnection with the rest of the plant cell. An increasingly important feature of this venture is to use experimental data in the design of metabolic models. A remaining obstacle has been the limited in situ volume information of plastids and other cell organelles. To fill this gap for chloroplasts, we established three microscopy protocols delivering in situ volumes based on: (i) chlorophyll fluorescence emerging from the thylakoid membrane, (ii) a CFP marker embedded in the envelope, and (iii) calculations from serial block-face scanning electron microscopy (SBFSEM). The obtained data were corroborated by comparing wild-type data with two mutant lines affected in the plastid division machinery known to produce small and large mesophyll chloroplasts, respectively. Furthermore, we also determined the volume of the much smaller guard cell plastids. Interestingly, their volume is not governed by the same components of the division machinery which defines mesophyll plastid size. Based on our three approaches, the average volume of a mature Col-0 wild-type mesophyll chloroplasts is 93 μm3 . Wild-type guard cell plastids are approximately 18 μm3 . Lastly, our comparative analysis shows that the chlorophyll fluorescence analysis can accurately determine chloroplast volumes, providing an important tool to research groups without access to transgenic marker lines expressing genetically encoded fluorescence proteins or costly SBFSEM equipment.
Collapse
Affiliation(s)
- Jan Knoblauch
- School of Biological Sciences, Washington State University, P.O. Box 644236, Pullman, Washington, 99164-4236, USA
| | - Rainer Waadt
- Institute of Plant Biology and Biotechnology, University of Münster, Schlossplatz 7, 48149, Münster, Germany
| | - Asaph B Cousins
- School of Biological Sciences, Washington State University, P.O. Box 644236, Pullman, Washington, 99164-4236, USA
| | - Hans-Henning Kunz
- School of Biological Sciences, Washington State University, P.O. Box 644236, Pullman, Washington, 99164-4236, USA
- LMU Munich, Plant Biochemistry, Großhadernerstr. 2-4, 82152, Planegg-Martinsried, Germany
| |
Collapse
|
5
|
Weil HL, Schneider K, Tschöpe M, Bauer J, Maus O, Frey K, Brilhaus D, Martins Rodrigues C, Doniparthi G, Wetzels F, Lukasczyk J, Kranz A, Grüning B, Zimmer D, Deßloch S, von Suchodoletz D, Usadel B, Garth C, Mühlhaus T. PLANTdataHUB: a collaborative platform for continuous FAIR data sharing in plant research. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023; 116:974-988. [PMID: 37818860 DOI: 10.1111/tpj.16474] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 09/08/2023] [Indexed: 10/13/2023]
Abstract
In modern reproducible, hypothesis-driven plant research, scientists are increasingly relying on research data management (RDM) services and infrastructures to streamline the processes of collecting, processing, sharing, and archiving research data. FAIR (i.e., findable, accessible, interoperable, and reusable) research data play a pivotal role in enabling the integration of interdisciplinary knowledge and facilitating the comparison and synthesis of a wide range of analytical findings. The PLANTdataHUB offers a solution that realizes RDM of scientific (meta)data as evolving collections of files in a directory - yielding FAIR digital objects called ARCs - with tools that enable scientists to plan, communicate, collaborate, publish, and reuse data on the same platform while gaining continuous quality control insights. The centralized platform is scalable from personal use to global communities and provides advanced federation capabilities for institutions that prefer to host their own satellite instances. This approach borrows many concepts from software development and adapts them to fit the challenges of the field of modern plant science undergoing digital transformation. The PLANTdataHUB supports researchers in each stage of a scientific project with adaptable continuous quality control insights, from the early planning phase to data publication. The central live instance of PLANTdataHUB is accessible at (https://git.nfdi4plants.org), and it will continue to evolve as a community-driven and dynamic resource that serves the needs of contemporary plant science.
Collapse
Affiliation(s)
- Heinrich Lukas Weil
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Kevin Schneider
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Marcel Tschöpe
- Computer Center, University of Freiburg, Freiburg im Breisgau, Germany
| | - Jonathan Bauer
- Computer Center, University of Freiburg, Freiburg im Breisgau, Germany
| | - Oliver Maus
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Kevin Frey
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Dominik Brilhaus
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | | | - Gajendra Doniparthi
- Heterogenous Information Systems, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Florian Wetzels
- Scientific Visualization Lab, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Jonas Lukasczyk
- Scientific Visualization Lab, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Angela Kranz
- IBG-4 Bioinformatics, BioSC, Forschungszentrum Jülich, Jülich, Germany
| | - Björn Grüning
- Bioinformatics Group, University of Freiburg, Freiburg im Breisgau, Germany
| | - David Zimmer
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Stefan Deßloch
- Heterogenous Information Systems, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | | | - Björn Usadel
- Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- IBG-4 Bioinformatics, BioSC, Forschungszentrum Jülich, Jülich, Germany
| | - Christoph Garth
- Scientific Visualization Lab, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| | - Timo Mühlhaus
- Computational Systems Biology, University of Kaiserslautern-Landau, Kaiserslautern, Germany
| |
Collapse
|
6
|
Gouesbet G. Deciphering Macromolecular Interactions Involved in Abiotic Stress Signaling: A Review of Bioinformatics Analysis. Methods Mol Biol 2023; 2642:257-294. [PMID: 36944884 DOI: 10.1007/978-1-0716-3044-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
Plant functioning and responses to abiotic stresses largely involve regulations at the transcriptomic level via complex interactions of signal molecules, signaling cascades, and regulators. Nevertheless, all the signaling networks involved in responses to abiotic stresses have not yet been fully established. The in-depth analysis of transcriptomes in stressed plants has become a relevant state-of-the-art methodology to study these regulations and signaling pathways that allow plants to cope with or attempt to survive abiotic stresses. The plant science and molecular biology community has developed databases about genes, proteins, protein-protein interactions, protein-DNA interactions and ontologies, which are valuable sources of knowledge for deciphering such regulatory and signaling networks. The use of these data and the development of bioinformatics tools help to make sense of transcriptomic data in specific contexts, such as that of abiotic stress signaling, using functional biological approaches. The aim of this chapter is to present and assess some of the essential online tools and resources that will allow novices in bioinformatics to decipher transcriptomic data in order to characterize the cellular processes and functions involved in abiotic stress responses and signaling. The analysis of case studies further describes how these tools can be used to conceive signaling networks on the basis of transcriptomic data. In these case studies, particular attention was paid to the characterization of abiotic stress responses and signaling related to chemical and xenobiotic stressors.
Collapse
Affiliation(s)
- Gwenola Gouesbet
- University of Rennes, CNRS, ECOBIO [(Ecosystèmes, Biodiversité, Evolution)] - UMR 6553, Rennes, France.
| |
Collapse
|
7
|
Kuo EY, Yang RY, Chin YY, Chien YL, Chen YC, Wei CY, Kao LJ, Chang YH, Li YJ, Chen TY, Lee TM. Multi-omics approaches and genetic engineering of metabolism for improved biorefinery and wastewater treatment in microalgae. Biotechnol J 2022; 17:e2100603. [PMID: 35467782 DOI: 10.1002/biot.202100603] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 03/12/2022] [Accepted: 04/01/2022] [Indexed: 11/06/2022]
Abstract
Microalgae, a group of photosynthetic microorganisms rich in diverse and novel bioactive metabolites, have been explored for the production of biofuels, high value-added compounds as food and feeds, and pharmaceutical chemicals as agents with therapeutic benefits. This article reviews the development of omics resources and genetic engineering techniques including gene transformation methodologies, mutagenesis, and genome-editing tools in microalgae biorefinery and wastewater treatment. The introduction of these enlisted techniques has simplified the understanding of complex metabolic pathways undergoing microalgal cells. The multiomics approach of the integrated omics datasets, big data analysis, and machine learning for the discovery of objective traits and genes responsible for metabolic pathways was reviewed. Recent advances and limitations of multiomics analysis and genetic bioengineering technology to facilitate the improvement of microalgae as the dual role of wastewater treatment and biorefinery feedstock production are discussed. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Eva YuHua Kuo
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan.,Frontier Center for Ocean Science and Technology, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Ru-Yin Yang
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Yuan Yu Chin
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Yi-Lin Chien
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan.,Frontier Center for Ocean Science and Technology, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Yu Chu Chen
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Cheng-Yu Wei
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Li-Jung Kao
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Yi-Hua Chang
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Yu-Jia Li
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Te-Yuan Chen
- Doctoral Degree Program in Marine Biotechnology, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| | - Tse-Min Lee
- Department of Marine Biotechnology and Resources, National Sun Yat-sen University, Kaohsiung, 804, Taiwan.,Frontier Center for Ocean Science and Technology, National Sun Yat-sen University, Kaohsiung, 804, Taiwan.,Doctoral Degree Program in Marine Biotechnology, National Sun Yat-sen University, Kaohsiung, 804, Taiwan
| |
Collapse
|
8
|
Beck AE, Kleiner M, Garrell AK. Elucidating Plant-Microbe-Environment Interactions Through Omics-Enabled Metabolic Modelling Using Synthetic Communities. FRONTIERS IN PLANT SCIENCE 2022; 13:910377. [PMID: 35795346 PMCID: PMC9251461 DOI: 10.3389/fpls.2022.910377] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 05/16/2022] [Indexed: 05/10/2023]
Abstract
With a growing world population and increasing frequency of climate disturbance events, we are in dire need of methods to improve plant productivity, resilience, and resistance to both abiotic and biotic stressors, both for agriculture and conservation efforts. Microorganisms play an essential role in supporting plant growth, environmental response, and susceptibility to disease. However, understanding the specific mechanisms by which microbes interact with each other and with plants to influence plant phenotypes is a major challenge due to the complexity of natural communities, simultaneous competition and cooperation effects, signalling interactions, and environmental impacts. Synthetic communities are a major asset in reducing the complexity of these systems by simplifying to dominant components and isolating specific variables for controlled experiments, yet there still remains a large gap in our understanding of plant microbiome interactions. This perspectives article presents a brief review discussing ways in which metabolic modelling can be used in combination with synthetic communities to continue progress toward understanding the complexity of plant-microbe-environment interactions. We highlight the utility of metabolic models as applied to a community setting, identify different applications for both flux balance and elementary flux mode simulation approaches, emphasize the importance of ecological theory in guiding data interpretation, and provide ideas for how the integration of metabolic modelling techniques with big data may bridge the gap between simplified synthetic communities and the complexity of natural plant-microbe systems.
Collapse
Affiliation(s)
- Ashley E. Beck
- Department of Biological and Environmental Sciences, Carroll College, Helena, MT, United States
- *Correspondence: Ashley E. Beck,
| | - Manuel Kleiner
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, United States
| | - Anna-Katharina Garrell
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, United States
| |
Collapse
|