1
|
Gomez-Cano F, Rodriguez J, Zhou P, Chu YH, Magnusson E, Gomez-Cano L, Krishnan A, Springer NM, de Leon N, Grotewold E. Prioritizing Metabolic Gene Regulators through Multi-Omic Network Integration in Maize. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.26.582075. [PMID: 38464086 PMCID: PMC10925184 DOI: 10.1101/2024.02.26.582075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Elucidating gene regulatory networks (GRNs) is a major area of study within plant systems biology. Phenotypic traits are intricately linked to specific gene expression profiles. These expression patterns arise primarily from regulatory connections between sets of transcription factors (TFs) and their target genes. In this study, we integrated publicly available co-expression networks derived from more than 6,000 RNA-seq samples, 283 protein-DNA interaction assays, and 16 million of SNPs used to identify expression quantitative loci (eQTL), to construct TF-target networks. In total, we analyzed ~4.6M interactions to generate four distinct types of TF-target networks: co-expression, protein-DNA interaction (PDI), trans-expression quantitative loci (trans-eQTL), and cis-eQTL combined with PDIs. To improve the functional annotation of TFs based on its target genes, we implemented three different strategies to integrate these four types of networks. We subsequently evaluated the effectiveness of our method through loss-of function mutant and random networks. The multi-network integration allowed us to identify transcriptional regulators of hormone-, metabolic- and development-related processes. Finally, using the topological properties of the fully integrated network, we identified potentially functional redundant TF paralogs. Our findings retrieved functions previously documented for numerous TFs and revealed novel functions that are crucial for informing the design of future experiments. The approach here-described lays the foundation for the integration of multi-omic datasets in maize and other plant systems.
Collapse
Affiliation(s)
- Fabio Gomez-Cano
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
- Current address: Department of Molecular, Cellular, and Development Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jonas Rodriguez
- Department of Plant and Agroecosystem Sciences, University of Wisconsin Madison, Madison, WI 53706, USA
| | - Peng Zhou
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
| | - Yi-Hsuan Chu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
| | - Erika Magnusson
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
| | - Lina Gomez-Cano
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
| | - Arjun Krishnan
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108
- Current address: Global Breeding, Bayer Crop Sciences, Chesterfield MO 63017, USA
| | - Natalia de Leon
- Department of Plant and Agroecosystem Sciences, University of Wisconsin Madison, Madison, WI 53706, USA
| | - Erich Grotewold
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824-6473, USA
| |
Collapse
|
2
|
Gavgani HN, Grotewold E, Gray J. Methodology for Constructing a Knowledgebase for Plant Gene Regulation Information. Methods Mol Biol 2023; 2698:277-300. [PMID: 37682481 DOI: 10.1007/978-1-0716-3354-0_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]
Abstract
The amount of biological data is growing at a rapid pace as many high-throughput omics technologies and data pipelines are developed. This is resulting in the growth of databases for DNA and protein sequences, gene expression, protein accumulation, structural, and localization information. The diversity and multi-omics nature of such bioinformatic data requires well-designed databases for flexible organization and presentation. Besides general-purpose online bioinformatic databases, users need narrowly focused online databases to quickly access a meaningful collection of related data for their research. Here, we describe the methodology used to implement a plant gene regulatory knowledgebase, with data, query, and tool features, as well as the ability to expand to accommodate future datasets. We exemplify this methodology for the GRASSIUS knowledgebase, but it is applicable to developing and updating similar plant gene regulatory knowledgebases. GRASSIUS organizes and presents gene regulatory data from grass species with a central focus on maize (Zea mays). The main class of data presented include not only the families of transcription factors (TFs) and co-regulators (CRs) but also protein-DNA interaction data, where available.
Collapse
Affiliation(s)
- Hadi Nayebi Gavgani
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
- Dandelions Therapeutics Inc., San Francisco, CA, USA
| | - Erich Grotewold
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - John Gray
- Department of Biological Sciences, University of Toledo, Toledo, OH, USA.
| |
Collapse
|
3
|
Schmitz RJ, Grotewold E, Stam M. Cis-regulatory sequences in plants: Their importance, discovery, and future challenges. THE PLANT CELL 2022; 34:718-741. [PMID: 34918159 PMCID: PMC8824567 DOI: 10.1093/plcell/koab281] [Citation(s) in RCA: 89] [Impact Index Per Article: 44.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 10/20/2021] [Indexed: 05/19/2023]
Abstract
The identification and characterization of cis-regulatory DNA sequences and how they function to coordinate responses to developmental and environmental cues is of paramount importance to plant biology. Key to these regulatory processes are cis-regulatory modules (CRMs), which include enhancers and silencers. Despite the extraordinary advances in high-quality sequence assemblies and genome annotations, the identification and understanding of CRMs, and how they regulate gene expression, lag significantly behind. This is especially true for their distinguishing characteristics and activity states. Here, we review the current knowledge on CRMs and breakthrough technologies enabling identification, characterization, and validation of CRMs; we compare the genomic distributions of CRMs with respect to their target genes between different plant species, and discuss the role of transposable elements harboring CRMs in the evolution of gene expression. This is an exciting time to study cis-regulomes in plants; however, significant existing challenges need to be overcome to fully understand and appreciate the role of CRMs in plant biology and in crop improvement.
Collapse
Affiliation(s)
- Robert J Schmitz
- Department of Genetics, University of Georgia, Athens, Georgia 30602, USA
| | - Erich Grotewold
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, USA
| | | |
Collapse
|
4
|
Jiang L, Dong C, Liu T, Shi Y, Wang H, Tao Z, Liang Y, Lian J. Improved Functional Expression of Cytochrome P450s in Saccharomyces cerevisiae Through Screening a cDNA Library From Arabidopsis thaliana. Front Bioeng Biotechnol 2021; 9:764851. [PMID: 34957066 PMCID: PMC8696027 DOI: 10.3389/fbioe.2021.764851] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 11/24/2021] [Indexed: 01/08/2023] Open
Abstract
Cytochrome P450 enzymes (P450s) are a superfamily of heme-thiolate proteins widely existing in various organisms and play a key role in the metabolic network and secondary metabolism. However, the low expression levels and activities have become the biggest challenge for P450s studies. To improve the functional expression of P450s in Saccharomyces cerevisiae, an Arabidopsis thaliana cDNA library was expressed in the betaxanthin-producing yeast strain, which functioned as a biosensor for high throughput screening. Three new target genes AtGRP7, AtMSBP1, and AtCOL4 were identified to improve the functional expression of CYP76AD1 in yeast, with accordingly the accumulation of betaxanthin increased for 1.32-, 1.86-, and 1.10-fold, respectively. In addition, these three targets worked synergistically/additively to improve the production of betaxanthin, representing a total of 2.36-fold improvement when compared with the parent strain. More importantly, these genes were also determined to effectively increase the activity of another P450 enzyme (CYP736A167), catalyzing the hydroxylation of α-santalene to produce Z-α-santalol. Simultaneous overexpression of AtGRP7, AtMSBP1, and AtCOL4 increased α-santalene to Z-α-santalol conversion rate for more than 2.97-fold. The present study reported a novel strategy to improve the functional expression of P450s in S. cerevisiae and promises the construction of platform yeast strains for the production of natural products.
Collapse
Affiliation(s)
- Lihong Jiang
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China
| | - Chang Dong
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China.,Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| | - Tengfei Liu
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China
| | - Yi Shi
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China
| | - Handing Wang
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China.,Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| | - Zeng Tao
- Ministry of Agriculture Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Yan Liang
- Ministry of Agriculture Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Biotechnology, Zhejiang University, Hangzhou, China
| | - Jiazhang Lian
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China.,Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| |
Collapse
|
5
|
Springer N, de León N, Grotewold E. Challenges of Translating Gene Regulatory Information into Agronomic Improvements. TRENDS IN PLANT SCIENCE 2019; 24:1075-1082. [PMID: 31377174 DOI: 10.1016/j.tplants.2019.07.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/26/2019] [Accepted: 07/05/2019] [Indexed: 06/10/2023]
Abstract
Improvement of agricultural species has exploited the genetic variation responsible for complex quantitative traits. Much of the functional variation is regulatory, in cis-regulatory elements and trans-acting factors that ultimately contribute to gene expression differences. However, the identification of gene regulatory network components that, when modulated, will increase plant productivity or resilience, is challenging, yet essential to provide increased predictive power for genome engineering approaches that are likely to benefit useful traits. Here, we discuss the opportunities and limitations of using data obtained from gene coexpression, transcription factor binding, and genome-wide association mapping analyses to predict regulatory interactions that impact crop improvement. It is apparent that a combination of information from these data types is necessary for the reliable identification and utilization of important regulatory interactions that underlie complex agronomic traits.
Collapse
Affiliation(s)
- Nathan Springer
- Department of Plant and Microbial Biology, University of Minnesota, St Paul, MN 55108, USA.
| | - Natalia de León
- Department of Agronomy, University of Wisconsin, Madison, WI 56706, USA
| | - Erich Grotewold
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.
| |
Collapse
|
6
|
Jahan MA, Harris B, Lowery M, Coburn K, Infante AM, Percifield RJ, Ammer AG, Kovinich N. The NAC family transcription factor GmNAC42-1 regulates biosynthesis of the anticancer and neuroprotective glyceollins in soybean. BMC Genomics 2019; 20:149. [PMID: 30786857 PMCID: PMC6381636 DOI: 10.1186/s12864-019-5524-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Accepted: 02/11/2019] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Glyceollins are isoflavonoid-derived pathogen-inducible defense metabolites (phytoalexins) from soybean (Glycine max L. Merr) that have important roles in providing defense against pathogens. They also have impressive anticancer and neuroprotective activities in mammals. Despite their potential usefulness as therapeutics, glyceollins are not economical to synthesize and are biosynthesized only transiently and in low amounts in response to specific stresses. Engineering the regulation of glyceollin biosynthesis may be a promising approach to enhance their bioproduction, yet the transcription factors (TFs) that regulate their biosynthesis have remained elusive. To address this, we first aimed to identify novel abiotic stresses that enhance or suppress the elicitation of glyceollins and then used a comparative transcriptomics approach to search for TF gene candidates that may positively regulate glyceollin biosynthesis. RESULTS Acidity stress (pH 3.0 medium) and dehydration exerted prolonged (week-long) inductive or suppressive effects on glyceollin biosynthesis, respectively. RNA-seq found that all known biosynthetic genes were oppositely regulated by acidity stress and dehydration, but known isoflavonoid TFs were not. Systemic acquired resistance (SAR) genes were highly enriched in the geneset. We chose to functionally characterize the NAC (NAM/ATAF1/2/CUC2)-family TF GmNAC42-1 that was annotated as an SAR gene and a homolog of the Arabidopsis thaliana (Arabidopsis) indole alkaloid phytoalexin regulator ANAC042. Overexpressing and silencing GmNAC42-1 in elicited soybean hairy roots dramatically enhanced and suppressed the amounts of glyceollin metabolites and biosynthesis gene mRNAs, respectively. Yet, overexpressing GmNAC42-1 in non-elicited hairy roots failed to stimulate the expressions of all biosynthesis genes. Thus, GmNAC42-1 was necessary but not sufficient to activate all biosynthesis genes on its own, suggesting an important role in the glyceollin gene regulatory network (GRN). The GmNAC42-1 protein directly bound the promoters of biosynthesis genes IFS2 and G4DT in the yeast one-hybrid (Y1H) system. CONCLUSIONS Acidity stress is a novel elicitor and dehydration is a suppressor of glyceollin biosynthesis. The TF gene GmNAC42-1 is an essential positive regulator of glyceollin biosynthesis. Overexpressing GmNAC42-1 in hairy roots can be used to increase glyceollin yields > 10-fold upon elicitation. Thus, manipulating the expressions of glyceollin TFs is an effective strategy for enhancing the bioproduction of glyceollins in soybean.
Collapse
Affiliation(s)
- Md Asraful Jahan
- Division of Plant and Soil Sciences, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Brianna Harris
- Department of Biology, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Matthew Lowery
- Department of Biochemistry, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Katie Coburn
- Department of Biology, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Aniello M. Infante
- Department of Biostatistics, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Ryan J. Percifield
- Department of Biology, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Amanda G. Ammer
- Microscope Imaging Facility, West Virginia University, Morgantown, West Virginia 26506 USA
| | - Nik Kovinich
- Division of Plant and Soil Sciences, West Virginia University, Morgantown, West Virginia 26506 USA
| |
Collapse
|
7
|
Yang F, Li W, Jiang N, Yu H, Morohashi K, Ouma WZ, Morales-Mantilla DE, Gomez-Cano FA, Mukundi E, Prada-Salcedo LD, Velazquez RA, Valentin J, Mejía-Guerra MK, Gray J, Doseff AI, Grotewold E. A Maize Gene Regulatory Network for Phenolic Metabolism. MOLECULAR PLANT 2017; 10:498-515. [PMID: 27871810 DOI: 10.1016/j.molp.2016.10.020] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Revised: 09/20/2016] [Accepted: 10/31/2016] [Indexed: 05/23/2023]
Abstract
The translation of the genotype into phenotype, represented for example by the expression of genes encoding enzymes required for the biosynthesis of phytochemicals that are important for interaction of plants with the environment, is largely carried out by transcription factors (TFs) that recognize specific cis-regulatory elements in the genes that they control. TFs and their target genes are organized in gene regulatory networks (GRNs), and thus uncovering GRN architecture presents an important biological challenge necessary to explain gene regulation. Linking TFs to the genes they control, central to understanding GRNs, can be carried out using gene- or TF-centered approaches. In this study, we employed a gene-centered approach utilizing the yeast one-hybrid assay to generate a network of protein-DNA interactions that participate in the transcriptional control of genes involved in the biosynthesis of maize phenolic compounds including general phenylpropanoids, lignins, and flavonoids. We identified 1100 protein-DNA interactions involving 54 phenolic gene promoters and 568 TFs. A set of 11 TFs recognized 10 or more promoters, suggesting a role in coordinating pathway gene expression. The integration of the gene-centered network with information derived from TF-centered approaches provides a foundation for a phenolics GRN characterized by interlaced feed-forward loops that link developmental regulators with biosynthetic genes.
Collapse
Affiliation(s)
- Fan Yang
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Wei Li
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA
| | - Nan Jiang
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Haidong Yu
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Kengo Morohashi
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Wilberforce Zachary Ouma
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Molecular, Cellular, and Developmental Biology (MCDB) Graduate Program, The Ohio State University, Columbus, OH 43210, USA
| | - Daniel E Morales-Mantilla
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA; Success in Graduate Education (SiGuE) Program, The Ohio State University, Columbus, OH 43210, USA
| | - Fabio Andres Gomez-Cano
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Eric Mukundi
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - Luis Daniel Prada-Salcedo
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA
| | - Roberto Alers Velazquez
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA; Success in Graduate Education (SiGuE) Program, The Ohio State University, Columbus, OH 43210, USA
| | - Jasmin Valentin
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA; Success in Graduate Education (SiGuE) Program, The Ohio State University, Columbus, OH 43210, USA
| | - Maria Katherine Mejía-Guerra
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA
| | - John Gray
- Department of Biological Sciences, University of Toledo, Toledo, OH 43560, USA
| | - Andrea I Doseff
- Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA; Department of Physiology and Cell Biology, Heart and Lung Research Institute, The Ohio State University, Columbus, OH 43210, USA
| | - Erich Grotewold
- Center for Applied Sciences (CAPS), The Ohio State University, Columbus, OH 43210, USA; Department of Molecular Genetics, The Ohio State University, Columbus, OH 43210, USA.
| |
Collapse
|