1
|
Murmu S, Sinha D, Chaurasia H, Sharma S, Das R, Jha GK, Archak S. A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions. FRONTIERS IN PLANT SCIENCE 2024; 15:1292054. [PMID: 38504888 PMCID: PMC10948452 DOI: 10.3389/fpls.2024.1292054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Accepted: 01/24/2024] [Indexed: 03/21/2024]
Abstract
Plants intricately deploy defense systems to counter diverse biotic and abiotic stresses. Omics technologies, spanning genomics, transcriptomics, proteomics, and metabolomics, have revolutionized the exploration of plant defense mechanisms, unraveling molecular intricacies in response to various stressors. However, the complexity and scale of omics data necessitate sophisticated analytical tools for meaningful insights. This review delves into the application of artificial intelligence algorithms, particularly machine learning and deep learning, as promising approaches for deciphering complex omics data in plant defense research. The overview encompasses key omics techniques and addresses the challenges and limitations inherent in current AI-assisted omics approaches. Moreover, it contemplates potential future directions in this dynamic field. In summary, AI-assisted omics techniques present a robust toolkit, enabling a profound understanding of the molecular foundations of plant defense and paving the way for more effective crop protection strategies amidst climate change and emerging diseases.
Collapse
Affiliation(s)
- Sneha Murmu
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Dipro Sinha
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Himanshushekhar Chaurasia
- Central Institute for Research on Cotton Technology, Indian Council of Agricultural Research (ICAR), Mumbai, India
| | - Soumya Sharma
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Ritwika Das
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Girish Kumar Jha
- Indian Agricultural Statistics Research Institute, Indian Council of Agricultural Research (ICAR), New Delhi, India
| | - Sunil Archak
- National Bureau of Plant Genetic Resources, Indian Council of Agricultural Research (ICAR), New Delhi, India
| |
Collapse
|
2
|
Yan J, Wang X. Machine learning bridges omics sciences and plant breeding. TRENDS IN PLANT SCIENCE 2023; 28:199-210. [PMID: 36153276 DOI: 10.1016/j.tplants.2022.08.018] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 08/15/2022] [Accepted: 08/23/2022] [Indexed: 06/16/2023]
Abstract
Some of the biological knowledge obtained from fundamental research will be implemented in applied plant breeding. To bridge basic research and breeding practice, machine learning (ML) holds great promise to translate biological knowledge and omics data into precision-designed plant breeding. Here, we review ML for multi-omics analysis in plants, including data dimensionality reduction, inference of gene-regulation networks, and gene discovery and prioritization. These applications will facilitate understanding trait regulation mechanisms and identifying target genes potentially applicable to knowledge-driven molecular design breeding. We also highlight applications of deep learning in plant phenomics and ML in genomic selection-assisted breeding, such as various ML algorithms that model the correlations among genotypes (genes), phenotypes (traits), and environments, to ultimately achieve data-driven genomic design breeding.
Collapse
Affiliation(s)
- Jun Yan
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China
| | - Xiangfeng Wang
- National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing 100094, China; Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing 100094, China.
| |
Collapse
|
3
|
Olivares-Yañez C, Sánchez E, Pérez-Lara G, Seguel A, Camejo PY, Larrondo LF, Vidal EA, Canessa P. A comprehensive transcription factor and DNA-binding motif resource for the construction of gene regulatory networks in Botrytis cinerea and Trichoderma atroviride. Comput Struct Biotechnol J 2021; 19:6212-6228. [PMID: 34900134 PMCID: PMC8637145 DOI: 10.1016/j.csbj.2021.11.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/11/2021] [Accepted: 11/11/2021] [Indexed: 11/25/2022] Open
Abstract
Botrytis cinerea and Trichoderma atroviride are two relevant fungi in agricultural systems. To gain insights into these organisms' transcriptional gene regulatory networks (GRNs), we generated a manually curated transcription factor (TF) dataset for each of them, followed by a GRN inference utilizing available sequence motifs describing DNA-binding specificity and global gene expression data. As a proof of concept of the usefulness of this resource to pinpoint key transcriptional regulators, we employed publicly available transcriptomics data and a newly generated dual RNA-seq dataset to build context-specific Botrytis and Trichoderma GRNs under two different biological paradigms: exposure to continuous light and Botrytis-Trichoderma confrontation assays. Network analysis of fungal responses to constant light revealed striking differences in the transcriptional landscape of both fungi. On the other hand, we found that the confrontation of both microorganisms elicited a distinct set of differentially expressed genes with changes in T. atroviride exceeding those in B. cinerea. Using our regulatory network data, we were able to determine, in both fungi, central TFs involved in this interaction response, including TFs controlling a large set of extracellular peptidases in the biocontrol agent T. atroviride. In summary, our work provides a comprehensive catalog of transcription factors and regulatory interactions for both organisms. This catalog can now serve as a basis for generating novel hypotheses on transcriptional regulatory circuits in different experimental contexts.
Collapse
Affiliation(s)
- Consuelo Olivares-Yañez
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Centro de Biotecnologia Vegetal, Universidad Andres Bello, Republica 330, Santiago, Chile
| | - Evelyn Sánchez
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Centro de Genomica y Bioinformatica, Facultad de Ciencias, Universidad Mayor, Camino la Pirámide 5750, Huechuraba, Santiago, Chile
| | - Gabriel Pérez-Lara
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Centro de Biotecnologia Vegetal, Universidad Andres Bello, Republica 330, Santiago, Chile
| | - Aldo Seguel
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Departamento de Genetica Molecular y Microbiologia, Facultad de Ciencias Biologicas, Pontificia Universidad Catolica de Chile, Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile
| | - Pamela Y Camejo
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile
| | - Luis F Larrondo
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Departamento de Genetica Molecular y Microbiologia, Facultad de Ciencias Biologicas, Pontificia Universidad Catolica de Chile, Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile
| | - Elena A Vidal
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Centro de Genomica y Bioinformatica, Facultad de Ciencias, Universidad Mayor, Camino la Pirámide 5750, Huechuraba, Santiago, Chile.,Escuela de Biotecnologia, Facultad de Ciencias, Universidad Mayor, Camino la Pirámide 5750, Huechuraba, Santiago, Chile
| | - Paulo Canessa
- ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Avda. Libertador Bernardo O'Higgins 340, Santiago, Chile.,Centro de Biotecnologia Vegetal, Universidad Andres Bello, Republica 330, Santiago, Chile
| |
Collapse
|
4
|
An insight into phytic acid biosynthesis and its reduction strategies to improve mineral bioavailability. THE NUCLEUS 2021. [DOI: 10.1007/s13237-021-00371-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
|
5
|
DeMers LC, Raboy V, Li S, Saghai Maroof MA. Network Inference of Transcriptional Regulation in Germinating Low Phytic Acid Soybean Seeds. FRONTIERS IN PLANT SCIENCE 2021; 12:708286. [PMID: 34531883 PMCID: PMC8438133 DOI: 10.3389/fpls.2021.708286] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 06/23/2021] [Indexed: 05/14/2023]
Abstract
The low phytic acid (lpa) trait in soybeans can be conferred by loss-of-function mutations in genes encoding myo-inositol phosphate synthase and two epistatically interacting genes encoding multidrug-resistance protein ATP-binding cassette (ABC) transporters. However, perturbations in phytic acid biosynthesis are associated with poor seed vigor. Since the benefits of the lpa trait, in terms of end-use quality and sustainability, far outweigh the negatives associated with poor seed performance, a fuller understanding of the molecular basis behind the negatives will assist crop breeders and engineers in producing variates with lpa and better germination rate. The gene regulatory network (GRN) for developing low and normal phytic acid soybean seeds was previously constructed, with genes modulating a variety of processes pertinent to phytic acid metabolism and seed viability being identified. In this study, a comparative time series analysis of low and normal phytic acid soybeans was carried out to investigate the transcriptional regulatory elements governing the transitional dynamics from dry seed to germinated seed. GRNs were reverse engineered from time series transcriptomic data of three distinct genotypic subsets composed of lpa soybean lines and their normal phytic acid sibling lines. Using a robust unsupervised network inference scheme, putative regulatory interactions were inferred for each subset of genotypes. These interactions were further validated by published regulatory interactions found in Arabidopsis thaliana and motif sequence analysis. Results indicate that lpa seeds have increased sensitivity to stress, which could be due to changes in phytic acid levels, disrupted inositol phosphate signaling, disrupted phosphate ion (Pi) homeostasis, and altered myo-inositol metabolism. Putative regulatory interactions were identified for the latter two processes. Changes in abscisic acid (ABA) signaling candidate transcription factors (TFs) putatively regulating genes in this process were identified as well. Analysis of the GRNs reveal altered regulation in processes that may be affecting the germination of lpa soybean seeds. Therefore, this work contributes to the ongoing effort to elucidate molecular mechanisms underlying altered seed viability, germination and field emergence of lpa crops, understanding of which is necessary in order to mitigate these problems.
Collapse
Affiliation(s)
- Lindsay C. DeMers
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - Victor Raboy
- National Small Grains Germplasm Research Center, Agricultural Research Service (USDA), Aberdeen, ID, United States
| | - Song Li
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| | - M. A. Saghai Maroof
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, VA, United States
| |
Collapse
|
6
|
Gupta C, Ramegowda V, Basu S, Pereira A. Using Network-Based Machine Learning to Predict Transcription Factors Involved in Drought Resistance. Front Genet 2021; 12:652189. [PMID: 34249082 PMCID: PMC8264776 DOI: 10.3389/fgene.2021.652189] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 05/13/2021] [Indexed: 12/13/2022] Open
Abstract
Gene regulatory networks underpin stress response pathways in plants. However, parsing these networks to prioritize key genes underlying a particular trait is challenging. Here, we have built the Gene Regulation and Association Network (GRAiN) of rice (Oryza sativa). GRAiN is an interactive query-based web-platform that allows users to study functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses. We built GRAiN by applying a combination of different network inference algorithms to publicly available gene expression data. We propose a supervised machine learning framework that complements GRAiN in prioritizing genes that regulate stress signal transduction and modulate gene expression under drought conditions. Our framework converts intricate network connectivity patterns of 2160 TFs into a single drought score. We observed that TFs with the highest drought scores define the functional, structural, and evolutionary characteristics of drought resistance in rice. Our approach accurately predicted the function of OsbHLH148 TF, which we validated using in vitro protein-DNA binding assays and mRNA sequencing loss-of-function mutants grown under control and drought stress conditions. Our network and the complementary machine learning strategy lends itself to predicting key regulatory genes underlying other agricultural traits and will assist in the genetic engineering of desirable rice varieties.
Collapse
Affiliation(s)
- Chirag Gupta
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Venkategowda Ramegowda
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Supratim Basu
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Andy Pereira
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| |
Collapse
|
7
|
Abstract
Single-cell RNAseq is an emerging technology that allows the quantification of gene expression in individual cells. In plants, single-cell sequencing technology has been applied to generate root cell expression maps under many experimental conditions. DAP-seq and ATAC-seq have also been used to generate genome-scale maps of protein-DNA interactions and open chromatin regions in plants. In this protocol, we describe a multistep computational pipeline for the integration of single-cell RNAseq data with DAP-seq and ATAC-seq data to predict regulatory networks and key regulatory genes. Our approach utilizes machine learning methods including feature selection and stability selection to identify candidate regulatory genes. The network generated by this pipeline can be used to provide a putative annotation of gene regulatory modules and to identify candidate transcription factors that could play a key role in specific cell types.
Collapse
|
8
|
Gupta C, Ramegowda V, Basu S, Pereira A. Using Network-Based Machine Learning to Predict Transcription Factors Involved in Drought Resistance. Front Genet 2021. [PMID: 34249082 DOI: 10.1101/2020.04.29.068379] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023] Open
Abstract
Gene regulatory networks underpin stress response pathways in plants. However, parsing these networks to prioritize key genes underlying a particular trait is challenging. Here, we have built the Gene Regulation and Association Network (GRAiN) of rice (Oryza sativa). GRAiN is an interactive query-based web-platform that allows users to study functional relationships between transcription factors (TFs) and genetic modules underlying abiotic-stress responses. We built GRAiN by applying a combination of different network inference algorithms to publicly available gene expression data. We propose a supervised machine learning framework that complements GRAiN in prioritizing genes that regulate stress signal transduction and modulate gene expression under drought conditions. Our framework converts intricate network connectivity patterns of 2160 TFs into a single drought score. We observed that TFs with the highest drought scores define the functional, structural, and evolutionary characteristics of drought resistance in rice. Our approach accurately predicted the function of OsbHLH148 TF, which we validated using in vitro protein-DNA binding assays and mRNA sequencing loss-of-function mutants grown under control and drought stress conditions. Our network and the complementary machine learning strategy lends itself to predicting key regulatory genes underlying other agricultural traits and will assist in the genetic engineering of desirable rice varieties.
Collapse
Affiliation(s)
- Chirag Gupta
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Venkategowda Ramegowda
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Supratim Basu
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| | - Andy Pereira
- Department of Crop, Soil, and Environmental Sciences, University of Arkansas, Fayetteville, AR, United States
| |
Collapse
|
9
|
Redekar NR, Glover NM, Biyashev RM, Ha BK, Raboy V, Maroof MAS. Genetic interactions regulating seed phytate and oligosaccharides in soybean (Glycine max L.). PLoS One 2020; 15:e0235120. [PMID: 32584851 PMCID: PMC7316244 DOI: 10.1371/journal.pone.0235120] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Accepted: 06/08/2020] [Indexed: 12/20/2022] Open
Abstract
Two low-phytate soybean (Glycine max (L.) Merr.) mutant lines- V99-5089 (mips mutation on chromosome 11) and CX-1834 (mrp-l and mrp-n mutations on chromosomes 19 and 3, respectively) have proven to be valuable resources for breeding of low-phytate, high-sucrose, and low-raffinosaccharide soybeans, traits that are highly desirable from a nutritional and environmental standpoint. A recombinant inbred population derived from the cross CX1834 x V99-5089 provides an opportunity to study the effect of different combinations of these three mutations on soybean phytate and oligosaccharides levels. Of the 173 recombinant inbred lines tested, 163 lines were homozygous for various combinations of MIPS and two MRP loci alleles. These individuals were grouped into eight genotypic classes based on the combination of SNP alleles at the three mutant loci. The two genotypic classes that were homozygous mrp-l/mrp-n and either homozygous wild-type or mutant at the mips locus (MIPS/mrp-l/mrp-n or mips/mrp-l/mrp-n) displayed relatively similar ~55% reductions in seed phytate, 6.94 mg g -1 and 6.70 mg g-1 respectively, as compared with 15.2 mg g-1 in the wild-type MIPS/MRP-L/MRP-N seed. Therefore, in the presence of the double mutant mrp-l/mrp-n, the mips mutation did not cause a substantially greater decrease in seed phytate level. However, the nutritionally-desirable high-sucrose/low-stachyose/low-raffinose seed phenotype originally observed in soybeans homozygous for the mips allele was reversed in the presence of mrp-l/mrp-n mutations: homozygous mips/mrp-l/mrp-n seed displayed low-sucrose (7.70%), high-stachyose (4.18%), and the highest observed raffinose (0.94%) contents per gram of dry seed. Perhaps the block in phytic acid transport from its cytoplasmic synthesis site to its storage site, conditioned by mrp-l/mrp-n, alters myo-inositol flux in mips seeds in a way that restores to wild-type levels the mips conditioned reductions in raffinosaccharides. Overall this study determined the combinatorial effects of three low phytic acid causing mutations on regulation of seed phytate and oligosaccharides in soybean.
Collapse
Affiliation(s)
- Neelam R. Redekar
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Natasha M. Glover
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Ruslan M. Biyashev
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Bo-Keun Ha
- Institute of Plant Breeding, Genetics & Genomics, University of Georgia, Athens, Georgia, United States of America
| | - Victor Raboy
- National Small Grains Germplasm Center, USDA-ARS, Aberdeen, Idaho, United States of America
| | - M. A. Saghai Maroof
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|
10
|
Song Q, Lee J, Akter S, Rogers M, Grene R, Li S. Prediction of condition-specific regulatory genes using machine learning. Nucleic Acids Res 2020; 48:e62. [PMID: 32329779 PMCID: PMC7293043 DOI: 10.1093/nar/gkaa264] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 02/19/2020] [Accepted: 04/20/2020] [Indexed: 12/31/2022] Open
Abstract
Recent advances in genomic technologies have generated data on large-scale protein-DNA interactions and open chromatin regions for many eukaryotic species. How to identify condition-specific functions of transcription factors using these data has become a major challenge in genomic research. To solve this problem, we have developed a method called ConSReg, which provides a novel approach to integrate regulatory genomic data into predictive machine learning models of key regulatory genes. Using Arabidopsis as a model system, we tested our approach to identify regulatory genes in data sets from single cell gene expression and from abiotic stress treatments. Our results showed that ConSReg accurately predicted transcription factors that regulate differentially expressed genes with an average auROC of 0.84, which is 23.5-25% better than enrichment-based approaches. To further validate the performance of ConSReg, we analyzed an independent data set related to plant nitrogen responses. ConSReg provided better rankings of the correct transcription factors in 61.7% of cases, which is three times better than other plant tools. We applied ConSReg to Arabidopsis single cell RNA-seq data, successfully identifying candidate regulatory genes that control cell wall formation. Our methods provide a new approach to define candidate regulatory genes using integrated genomic data in plants.
Collapse
Affiliation(s)
- Qi Song
- Graduate program in Genetics, Bioinformatics and Computational Biology. Virginia Tech., Blacksburg, VA 24061, USA
| | - Jiyoung Lee
- Graduate program in Genetics, Bioinformatics and Computational Biology. Virginia Tech., Blacksburg, VA 24061, USA
| | - Shamima Akter
- School of Plant and Environmental Sciences. Virginia Tech., Blacksburg, VA 24061, USA
| | - Matthew Rogers
- Department of Statistics. Virginia Tech., Blacksburg, VA 24061, USA
| | - Ruth Grene
- Graduate program in Genetics, Bioinformatics and Computational Biology. Virginia Tech., Blacksburg, VA 24061, USA
- School of Plant and Environmental Sciences. Virginia Tech., Blacksburg, VA 24061, USA
| | - Song Li
- Graduate program in Genetics, Bioinformatics and Computational Biology. Virginia Tech., Blacksburg, VA 24061, USA
- School of Plant and Environmental Sciences. Virginia Tech., Blacksburg, VA 24061, USA
| |
Collapse
|
11
|
Van den Broeck L, Gordon M, Inzé D, Williams C, Sozzani R. Gene Regulatory Network Inference: Connecting Plant Biology and Mathematical Modeling. Front Genet 2020; 11:457. [PMID: 32547596 PMCID: PMC7270862 DOI: 10.3389/fgene.2020.00457] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 04/14/2020] [Indexed: 12/26/2022] Open
Abstract
Plant responses to environmental and intrinsic signals are tightly controlled by multiple transcription factors (TFs). These TFs and their regulatory connections form gene regulatory networks (GRNs), which provide a blueprint of the transcriptional regulations underlying plant development and environmental responses. This review provides examples of experimental methodologies commonly used to identify regulatory interactions and generate GRNs. Additionally, this review describes network inference techniques that leverage gene expression data to predict regulatory interactions. These computational and experimental methodologies yield complex networks that can identify new regulatory interactions, driving novel hypotheses. Biological properties that contribute to the complexity of GRNs are also described in this review. These include network topology, network size, transient binding of TFs to DNA, and competition between multiple upstream regulators. Finally, this review highlights the potential of machine learning approaches to leverage gene expression data to predict phenotypic outputs.
Collapse
Affiliation(s)
- Lisa Van den Broeck
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, United States
| | - Max Gordon
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC, United States
| | - Dirk Inzé
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.,VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Cranos Williams
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC, United States
| | - Rosangela Sozzani
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, United States
| |
Collapse
|
12
|
DeMers LC, Redekar NR, Kachroo A, Tolin SA, Li S, Saghai Maroof MA. A transcriptional regulatory network of Rsv3-mediated extreme resistance against Soybean mosaic virus. PLoS One 2020; 15:e0231658. [PMID: 32315334 PMCID: PMC7173922 DOI: 10.1371/journal.pone.0231658] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 03/29/2020] [Indexed: 01/02/2023] Open
Abstract
Resistance genes are an effective means for disease control in plants. They predominantly function by inducing a hypersensitive reaction, which results in localized cell death restricting pathogen spread. Some resistance genes elicit an atypical response, termed extreme resistance, where resistance is not associated with a hypersensitive reaction and its standard defense responses. Unlike hypersensitive reaction, the molecular regulatory mechanism(s) underlying extreme resistance is largely unexplored. One of the few known, naturally occurring, instances of extreme resistance is resistance derived from the soybean Rsv3 gene, which confers resistance against the most virulent Soybean mosaic virus strains. To discern the regulatory mechanism underlying Rsv3-mediated extreme resistance, we generated a gene regulatory network using transcriptomic data from time course comparisons of Soybean mosaic virus-G7-inoculated resistant (L29, Rsv3-genotype) and susceptible (Williams82, rsv3-genotype) soybean cultivars. Our results show Rsv3 begins mounting a defense by 6 hpi via a complex phytohormone network, where abscisic acid, cytokinin, jasmonic acid, and salicylic acid pathways are suppressed. We identified putative regulatory interactions between transcription factors and genes in phytohormone regulatory pathways, which is consistent with the demonstrated involvement of these pathways in Rsv3-mediated resistance. One such transcription factor identified as a putative transcriptional regulator was MYC2 encoded by Glyma.07G051500. Known as a master regulator of abscisic acid and jasmonic acid signaling, MYC2 specifically recognizes the G-box motif ("CACGTG"), which was significantly enriched in our data among differentially expressed genes implicated in abscisic acid- and jasmonic acid-related activities. This suggests an important role for Glyma.07G051500 in abscisic acid- and jasmonic acid-derived defense signaling in Rsv3. Resultantly, the findings from our network offer insights into genes and biological pathways underlying the molecular defense mechanism of Rsv3-mediated extreme resistance against Soybean mosaic virus. The computational pipeline used to reconstruct the gene regulatory network in this study is freely available at https://github.com/LiLabAtVT/rsv3-network.
Collapse
Affiliation(s)
- Lindsay C. DeMers
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Neelam R. Redekar
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Aardra Kachroo
- Department of Plant Pathology, University of Kentucky, Lexington, Virginia, United States of America
| | - Sue A. Tolin
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Song Li
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| | - M. A. Saghai Maroof
- School of Plant and Environmental Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
| |
Collapse
|
13
|
Haque S, Ahmad JS, Clark NM, Williams CM, Sozzani R. Computational prediction of gene regulatory networks in plant growth and development. CURRENT OPINION IN PLANT BIOLOGY 2019; 47:96-105. [PMID: 30445315 DOI: 10.1016/j.pbi.2018.10.005] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/05/2018] [Accepted: 10/18/2018] [Indexed: 05/22/2023]
Abstract
Plants integrate a wide range of cellular, developmental, and environmental signals to regulate complex patterns of gene expression. Recent advances in genomic technologies enable differential gene expression analysis at a systems level, allowing for improved inference of the network of regulatory interactions between genes. These gene regulatory networks, or GRNs, are used to visualize the causal regulatory relationships between regulators and their downstream target genes. Accordingly, these GRNs can represent spatial, temporal, and/or environmental regulations and can identify functional genes. This review summarizes recent computational approaches applied to different types of gene expression data to infer GRNs in the context of plant growth and development. Three stages of GRN inference are described: first, data collection and analysis based on the dataset type; second, network inference application based on data availability and proposed hypotheses; and third, validation based on in silico, in vivo, and in planta methods. In addition, this review relates data collection strategies to biological questions, organizes inference algorithms based on statistical methods and data types, discusses experimental design considerations, and provides guidelines for GRN inference with an emphasis on the benefits of integrative approaches, especially when a priori information is limited. Finally, this review concludes that computational frameworks integrating large-scale heterogeneous datasets are needed for a more accurate (e.g. fewer false interactions), detailed (e.g. discrimination between direct versus indirect interactions), and comprehensive (e.g. genetic regulation under various conditions and spatial locations) inference of GRNs.
Collapse
Affiliation(s)
- Samiul Haque
- Electrical and Computer Engineering, North Carolina State University, Raleigh, USA
| | - Jabeen S Ahmad
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA
| | - Natalie M Clark
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA
| | - Cranos M Williams
- Electrical and Computer Engineering, North Carolina State University, Raleigh, USA.
| | - Rosangela Sozzani
- Plant and Microbial Biology, North Carolina State University, Raleigh, USA.
| |
Collapse
|
14
|
Mochida K, Koda S, Inoue K, Nishii R. Statistical and Machine Learning Approaches to Predict Gene Regulatory Networks From Transcriptome Datasets. FRONTIERS IN PLANT SCIENCE 2018; 9:1770. [PMID: 30555503 PMCID: PMC6281826 DOI: 10.3389/fpls.2018.01770] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2018] [Accepted: 11/14/2018] [Indexed: 05/20/2023]
Abstract
Statistical and machine learning (ML)-based methods have recently advanced in construction of gene regulatory network (GRNs) based on high-throughput biological datasets. GRNs underlie almost all cellular phenomena; hence, comprehensive GRN maps are essential tools to elucidate gene function, thereby facilitating the identification and prioritization of candidate genes for functional analysis. High-throughput gene expression datasets have yielded various statistical and ML-based algorithms to infer causal relationship between genes and decipher GRNs. This review summarizes the recent advancements in the computational inference of GRNs, based on large-scale transcriptome sequencing datasets of model plants and crops. We highlight strategies to select contextual genes for GRN inference, and statistical and ML-based methods for inferring GRNs based on transcriptome datasets from plants. Furthermore, we discuss the challenges and opportunities for the elucidation of GRNs based on large-scale datasets obtained from emerging transcriptomic applications, such as from population-scale, single-cell level, and life-course transcriptome analyses.
Collapse
Affiliation(s)
- Keiichi Mochida
- Bioproductivity Informatics Research Team, RIKEN Center for Sustainable Resource Science, Yokohama, Japan
- Microalgae Production Control Technology Laboratory, RIKEN Baton Zone Program, RIKEN Cluster for Science, Technology and Innovation Hub, Yokohama, Japan
- Institute of Plant Science and Resources, Okayama University, Kurashiki, Japan
- Kihara Institute for Biological Research, Yokohama City University, Yokohama, Japan
- *Correspondence: Keiichi Mochida, Ryuei Nishii,
| | - Satoru Koda
- Graduate School of Mathematics, Kyushu University, Fukuoka, Japan
| | - Komaki Inoue
- Bioproductivity Informatics Research Team, RIKEN Center for Sustainable Resource Science, Yokohama, Japan
| | - Ryuei Nishii
- Institute of Mathematics for Industry, Kyushu University, Fukuoka, Japan
- *Correspondence: Keiichi Mochida, Ryuei Nishii,
| |
Collapse
|