1
|
Zuluaga DL, Blanco E, Mangini G, Sonnante G, Curci PL. A Survey of the Transcriptomic Resources in Durum Wheat: Stress Responses, Data Integration and Exploitation. PLANTS (BASEL, SWITZERLAND) 2023; 12:1267. [PMID: 36986956 PMCID: PMC10056183 DOI: 10.3390/plants12061267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/28/2023] [Accepted: 03/04/2023] [Indexed: 06/19/2023]
Abstract
Durum wheat (Triticum turgidum subsp. durum (Desf.) Husn.) is an allotetraploid cereal crop of worldwide importance, given its use for making pasta, couscous, and bulgur. Under climate change scenarios, abiotic (e.g., high and low temperatures, salinity, drought) and biotic (mainly exemplified by fungal pathogens) stresses represent a significant limit for durum cultivation because they can severely affect yield and grain quality. The advent of next-generation sequencing technologies has brought a huge development in transcriptomic resources with many relevant datasets now available for durum wheat, at various anatomical levels, also focusing on phenological phases and environmental conditions. In this review, we cover all the transcriptomic resources generated on durum wheat to date and focus on the corresponding scientific insights gained into abiotic and biotic stress responses. We describe relevant databases, tools and approaches, including connections with other "omics" that could assist data integration for candidate gene discovery for bio-agronomical traits. The biological knowledge summarized here will ultimately help in accelerating durum wheat breeding.
Collapse
Affiliation(s)
- Diana Lucia Zuluaga
- Institute of Biosciences and Bioresources, National Research Council (CNR), Via Amendola 165/A, 70126 Bari, Italy
| | | | | | | | - Pasquale Luca Curci
- Institute of Biosciences and Bioresources, National Research Council (CNR), Via Amendola 165/A, 70126 Bari, Italy
| |
Collapse
|
2
|
Julca I, Tan QW, Mutwil M. Toward kingdom-wide analyses of gene expression. TRENDS IN PLANT SCIENCE 2023; 28:235-249. [PMID: 36344371 DOI: 10.1016/j.tplants.2022.09.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 09/22/2022] [Accepted: 09/30/2022] [Indexed: 06/16/2023]
Abstract
Gene expression data for Archaeplastida are accumulating exponentially, with more than 300 000 RNA-sequencing (RNA-seq) experiments available for hundreds of species. The gene expression data stem from thousands of experiments that capture gene expression in various organs, tissues, cell types, (a)biotic perturbations, and genotypes. Advances in software tools make it possible to process all these data in a matter of weeks on modern office computers, giving us the possibility to study gene expression in a kingdom-wide manner for the first time. We discuss how the expression data can be accessed and processed and outline analyses that take advantage of cross-species analyses, allowing us to generate powerful and robust hypotheses about gene function and evolution.
Collapse
Affiliation(s)
- Irene Julca
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| | - Qiao Wen Tan
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| | - Marek Mutwil
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore.
| |
Collapse
|
3
|
Curci PL, Zhang J, Mähler N, Seyfferth C, Mannapperuma C, Diels T, Van Hautegem T, Jonsen D, Street N, Hvidsten TR, Hertzberg M, Nilsson O, Inzé D, Nelissen H, Vandepoele K. Identification of growth regulators using cross-species network analysis in plants. PLANT PHYSIOLOGY 2022; 190:2350-2365. [PMID: 35984294 PMCID: PMC9706488 DOI: 10.1093/plphys/kiac374] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 07/05/2022] [Indexed: 05/11/2023]
Abstract
With the need to increase plant productivity, one of the challenges plant scientists are facing is to identify genes that play a role in beneficial plant traits. Moreover, even when such genes are found, it is generally not trivial to transfer this knowledge about gene function across species to identify functional orthologs. Here, we focused on the leaf to study plant growth. First, we built leaf growth transcriptional networks in Arabidopsis (Arabidopsis thaliana), maize (Zea mays), and aspen (Populus tremula). Next, known growth regulators, here defined as genes that when mutated or ectopically expressed alter plant growth, together with cross-species conserved networks, were used as guides to predict novel Arabidopsis growth regulators. Using an in-depth literature screening, 34 out of 100 top predicted growth regulators were confirmed to affect leaf phenotype when mutated or overexpressed and thus represent novel potential growth regulators. Globally, these growth regulators were involved in cell cycle, plant defense responses, gibberellin, auxin, and brassinosteroid signaling. Phenotypic characterization of loss-of-function lines confirmed two predicted growth regulators to be involved in leaf growth (NPF6.4 and LATE MERISTEM IDENTITY2). In conclusion, the presented network approach offers an integrative cross-species strategy to identify genes involved in plant growth and development.
Collapse
Affiliation(s)
- Pasquale Luca Curci
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
- Institute of Biosciences and Bioresources, National Research Council (CNR), Via Amendola 165/A, 70126 Bari, Italy
| | - Jie Zhang
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Niklas Mähler
- Department of Plant Physiology, Umea Plant Science Centre (UPSC), Umeå University, 90187 Umeå, Sweden
| | - Carolin Seyfferth
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
- Department of Plant Physiology, Umea Plant Science Centre (UPSC), Umeå University, 90187 Umeå, Sweden
| | - Chanaka Mannapperuma
- Department of Plant Physiology, Umea Plant Science Centre (UPSC), Umeå University, 90187 Umeå, Sweden
| | - Tim Diels
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Tom Van Hautegem
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - David Jonsen
- SweTree Technologies AB, Skogsmarksgränd 7, SE-907 36 Umeå, Sweden
| | - Nathaniel Street
- Department of Plant Physiology, Umea Plant Science Centre (UPSC), Umeå University, 90187 Umeå, Sweden
| | - Torgeir R Hvidsten
- Department of Plant Physiology, Umea Plant Science Centre (UPSC), Umeå University, 90187 Umeå, Sweden
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, 1432 Ås, Norway
| | - Magnus Hertzberg
- SweTree Technologies AB, Skogsmarksgränd 7, SE-907 36 Umeå, Sweden
| | - Ove Nilsson
- Department of Forest Genetics and Plant Physiology, Umea Plant Science Centre (UPSC), Swedish University of Agricultural Sciences, 90183 Umeå, Sweden
| | - Dirk Inzé
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Hilde Nelissen
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
- VIB Center for Plant Systems Biology, Technologiepark 71, 9052 Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Technologiepark 71, 9052 Ghent, Belgium
| |
Collapse
|
4
|
Du Q, Campbell MT, Yu H, Liu K, Walia H, Zhang Q, Zhang C. Gene Co-expression Network Analysis and Linking Modules to Phenotyping Response in Plants. Methods Mol Biol 2022; 2539:261-268. [PMID: 35895209 DOI: 10.1007/978-1-0716-2537-8_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Environmental factors, including different stresses, can have an impact on the expression of genes and subsequently the phenotype and development of plants. Since a large number of genes are involved in response to the perturbation of the environment, identifying groups of co-expressed genes is meaningful. The gene co-expression network models can be used for the exploration, interpretation, and identification of genes responding to environmental changes. Once a gene co-expression network is constructed, one can determine gene modules and the association of gene modules to the phenotypic response. To link modules to phenotype, one approach is to find the correlated eigengenes of given modules or to integrate all eigengenes in regularized linear model. This manuscript describes the method from construction of co-expression network, module discovery, association between modules and phenotypic data, and finally to annotation/visualization.
Collapse
Affiliation(s)
- Qian Du
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA
| | - Malachy T Campbell
- Department of Agronomy and Horticulture, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA
| | - Huihui Yu
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA
| | - Kan Liu
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA
| | - Harkamal Walia
- Department of Agronomy and Horticulture, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA
| | - Qi Zhang
- Department of Mathematics and Statistics, College of Engineering and Physical Sciences (CEPS), University of New Hampshire, Durham, NH, USA
| | - Chi Zhang
- School of Biological Sciences, Center for Plant Science and Innovation, University of Nebraska, Lincoln, NE, USA.
| |
Collapse
|
5
|
Christie N, Mannapperuma C, Ployet R, van der Merwe K, Mähler N, Delhomme N, Naidoo S, Mizrachi E, Street NR, Myburg AA. qtlXplorer: an online systems genetics browser in the Eucalyptus Genome Integrative Explorer (EucGenIE). BMC Bioinformatics 2021; 22:595. [PMID: 34911434 PMCID: PMC8672637 DOI: 10.1186/s12859-021-04514-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 12/06/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Affordable high-throughput DNA and RNA sequencing technologies are allowing genomic analysis of plant and animal populations and as a result empowering new systems genetics approaches to study complex traits. The availability of intuitive tools to browse and analyze the resulting large-scale genetic and genomic datasets remain a significant challenge. Furthermore, these integrative genomics approaches require innovative methods to dissect the flow and interconnectedness of biological information underlying complex trait variation. The Plant Genome Integrative Explorer (PlantGenIE.org) is a multi-species database and domain that houses online tools for model and woody plant species including Eucalyptus. Since the Eucalyptus Genome Integrative Explorer (EucGenIE) is integrated within PlantGenIE, it shares genome and expression analysis tools previously implemented within the various subdomains (ConGenIE, PopGenIE and AtGenIE). Despite the success in setting up integrative genomics databases, online tools for systems genetics modelling and high-resolution dissection of complex trait variation in plant populations have been lacking. RESULTS We have developed qtlXplorer ( https://eucgenie.org/QTLXplorer ) for visualizing and exploring systems genetics data from genome-wide association studies including quantitative trait loci (QTLs) and expression-based QTL (eQTL) associations. This module allows users to, for example, find co-located QTLs and eQTLs using an interactive version of Circos, or explore underlying genes using JBrowse. It provides users with a means to build systems genetics models and generate hypotheses from large-scale population genomics data. We also substantially upgraded the EucGenIE resource and show how it enables users to combine genomics and systems genetics approaches to discover candidate genes involved in biotic stress responses and wood formation by focusing on two multigene families, laccases and peroxidases. CONCLUSIONS qtlXplorer adds a new dimension, population genomics, to the EucGenIE and PlantGenIE environment. The resource will be of interest to researchers and molecular breeders working in Eucalyptus and other woody plant species. It provides an example of how systems genetics data can be integrated with functional genetics data to provide biological insight and formulate hypotheses. Importantly, integration within PlantGenIE enables novel comparative genomics analyses to be performed from population-scale data.
Collapse
Affiliation(s)
- Nanette Christie
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa.
| | - Chanaka Mannapperuma
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 907 81, Umeå, Sweden
| | - Raphael Ployet
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
| | - Karen van der Merwe
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
| | - Niklas Mähler
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 907 81, Umeå, Sweden
| | - Nicolas Delhomme
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 83, Umeå, Sweden
| | - Sanushka Naidoo
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
| | - Eshchar Mizrachi
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
| | - Nathaniel R Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 907 81, Umeå, Sweden.
| | - Alexander A Myburg
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Private bag X20, Pretoria, 0028, South Africa
| |
Collapse
|
6
|
Li J, Singh U, Arendsee Z, Wurtele ES. Landscape of the Dark Transcriptome Revealed Through Re-mining Massive RNA-Seq Data. Front Genet 2021; 12:722981. [PMID: 34484307 PMCID: PMC8415361 DOI: 10.3389/fgene.2021.722981] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 07/26/2021] [Indexed: 12/13/2022] Open
Abstract
The "dark transcriptome" can be considered the multitude of sequences that are transcribed but not annotated as genes. We evaluated expression of 6,692 annotated genes and 29,354 unannotated open reading frames (ORFs) in the Saccharomyces cerevisiae genome across diverse environmental, genetic and developmental conditions (3,457 RNA-Seq samples). Over 30% of the highly transcribed ORFs have translation evidence. Phylostratigraphic analysis infers most of these transcribed ORFs would encode species-specific proteins ("orphan-ORFs"); hundreds have mean expression comparable to annotated genes. These data reveal unannotated ORFs most likely to be protein-coding genes. We partitioned a co-expression matrix by Markov Chain Clustering; the resultant clusters contain 2,468 orphan-ORFs. We provide the aggregated RNA-Seq yeast data with extensive metadata as a project in MetaOmGraph (MOG), a tool designed for interactive analysis and visualization. This approach enables reuse of public RNA-Seq data for exploratory discovery, providing a rich context for experimentalists to make novel, experimentally testable hypotheses about candidate genes.
Collapse
Affiliation(s)
- Jing Li
- Genetics and Genomics Graduate Program, Iowa State University, Ames, IA, United States
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
| | - Urminder Singh
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, United States
| | - Zebulun Arendsee
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, United States
| | - Eve Syrkin Wurtele
- Genetics and Genomics Graduate Program, Iowa State University, Ames, IA, United States
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, United States
- Center for Metabolic Biology, Iowa State University, Ames, IA, United States
- Bioinformatics and Computational Biology Program, Iowa State University, Ames, IA, United States
| |
Collapse
|
7
|
Ovens K, Eames BF, McQuillan I. Comparative Analyses of Gene Co-expression Networks: Implementations and Applications in the Study of Evolution. Front Genet 2021; 12:695399. [PMID: 34484293 PMCID: PMC8414652 DOI: 10.3389/fgene.2021.695399] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open
Abstract
Similarities and differences in the associations of biological entities among species can provide us with a better understanding of evolutionary relationships. Often the evolution of new phenotypes results from changes to interactions in pre-existing biological networks and comparing networks across species can identify evidence of conservation or adaptation. Gene co-expression networks (GCNs), constructed from high-throughput gene expression data, can be used to understand evolution and the rise of new phenotypes. The increasing abundance of gene expression data makes GCNs a valuable tool for the study of evolution in non-model organisms. In this paper, we cover motivations for why comparing these networks across species can be valuable for the study of evolution. We also review techniques for comparing GCNs in the context of evolution, including local and global methods of graph alignment. While some protein-protein interaction (PPI) bioinformatic methods can be used to compare co-expression networks, they often disregard highly relevant properties, including the existence of continuous and negative values for edge weights. Also, the lack of comparative datasets in non-model organisms has hindered the study of evolution using PPI networks. We also discuss limitations and challenges associated with cross-species comparison using GCNs, and provide suggestions for utilizing co-expression network alignments as an indispensable tool for evolutionary studies going forward.
Collapse
Affiliation(s)
- Katie Ovens
- Augmented Intelligence & Precision Health Laboratory (AIPHL), Research Institute of the McGill University Health Centre, Montreal, QC, Canada
| | - B. Frank Eames
- Department of Anatomy, Physiology, & Pharmacology, University of Saskatchewan, Saskatoon, SK, Canada
| | - Ian McQuillan
- Department of Computer Science, University of Saskatchewan, Saskatoon, SK, Canada
| |
Collapse
|
8
|
Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) Co-Expression Analysis of Gene Expression: A Survey of Best Practices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:1154-1173. [PMID: 30668502 DOI: 10.1109/tcbb.2019.2893170] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Analysis of gene expression data is widely used in transcriptomic studies to understand functions of molecules inside a cell and interactions among molecules. Differential co-expression analysis studies diseases and phenotypic variations by finding modules of genes whose co-expression patterns vary across conditions. We review the best practices in gene expression data analysis in terms of analysis of (differential) co-expression, co-expression network, differential networking, and differential connectivity considering both microarray and RNA-seq data along with comparisons. We highlight hurdles in RNA-seq data analysis using methods developed for microarrays. We include discussion of necessary tools for gene expression analysis throughout the paper. In addition, we shed light on scRNA-seq data analysis by including preprocessing and scRNA-seq in co-expression analysis along with useful tools specific to scRNA-seq. To get insights, biological interpretation and functional profiling is included. Finally, we provide guidelines for the analyst, along with research issues and challenges which should be addressed.
Collapse
|
9
|
Watson A, Habib M, Bapteste E. Phylosystemics: Merging Phylogenomics, Systems Biology, and Ecology to Study Evolution. Trends Microbiol 2020; 28:176-190. [DOI: 10.1016/j.tim.2019.10.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Revised: 10/21/2019] [Accepted: 10/22/2019] [Indexed: 11/28/2022]
|
10
|
Weighill D, Tschaplinski TJ, Tuskan GA, Jacobson D. Data Integration in Poplar: 'Omics Layers and Integration Strategies. Front Genet 2019; 10:874. [PMID: 31608114 PMCID: PMC6773870 DOI: 10.3389/fgene.2019.00874] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2019] [Accepted: 08/20/2019] [Indexed: 12/20/2022] Open
Abstract
Populus trichocarpa is an important biofuel feedstock that has been the target of extensive research and is emerging as a model organism for plants, especially woody perennials. This research has generated several large ‘omics datasets. However, only few studies in Populus have attempted to integrate various data types. This review will summarize various ‘omics data layers, focusing on their application in Populus species. Subsequently, network and signal processing techniques for the integration and analysis of these data types will be discussed, with particular reference to examples in Populus.
Collapse
Affiliation(s)
- Deborah Weighill
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Timothy J Tschaplinski
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Gerald A Tuskan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Daniel Jacobson
- The Bredesen Center for Interdisciplinary Research and Graduate Education, University of Tennessee, Knoxville, Knoxville, TN, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| |
Collapse
|
11
|
Rao X, Dixon RA. Co-expression networks for plant biology: why and how. Acta Biochim Biophys Sin (Shanghai) 2019; 51:981-988. [PMID: 31436787 DOI: 10.1093/abbs/gmz080] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 06/20/2019] [Accepted: 07/01/2019] [Indexed: 12/29/2022] Open
Abstract
Co-expression network analysis is one of the most powerful approaches for interpretation of large transcriptomic datasets. It enables characterization of modules of co-expressed genes that may share biological functional linkages. Such networks provide an initial way to explore functional associations from gene expression profiling and can be applied to various aspects of plant biology. This review presents the applications of co-expression network analysis in plant biology and addresses optimized strategies from the recent literature for performing co-expression analysis on plant biological systems. Additionally, we describe the combined interpretation of co-expression analysis with other genomic data to enhance the generation of biologically relevant information.
Collapse
Affiliation(s)
- Xiaolan Rao
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX 76203, USA
| | - Richard A Dixon
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX 76203, USA
| |
Collapse
|
12
|
Proost S, Mutwil M. CoNekT: an open-source framework for comparative genomic and transcriptomic network analyses. Nucleic Acids Res 2019; 46:W133-W140. [PMID: 29718322 PMCID: PMC6030989 DOI: 10.1093/nar/gky336] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 04/18/2018] [Indexed: 12/22/2022] Open
Abstract
The recent accumulation of gene expression data in the form of RNA sequencing creates unprecedented opportunities to study gene regulation and function. Furthermore, comparative analysis of the expression data from multiple species can elucidate which functional gene modules are conserved across species, allowing the study of the evolution of these modules. However, performing such comparative analyses on raw data is not feasible for many biologists. Here, we present CoNekT (Co-expression Network Toolkit), an open source web server, that contains user-friendly tools and interactive visualizations for comparative analyses of gene expression data and co-expression networks. These tools allow analysis and cross-species comparison of (i) gene expression profiles; (ii) co-expression networks; (iii) co-expressed clusters involved in specific biological processes; (iv) tissue-specific gene expression; and (v) expression profiles of gene families. To demonstrate these features, we constructed CoNekT-Plants for green alga, seed plants and flowering plants (Picea abies, Chlamydomonas reinhardtii, Vitis vinifera, Arabidopsis thaliana, Oryza sativa, Zea mays and Solanum lycopersicum) and thus provide a web-tool with the broadest available collection of plant phyla. CoNekT-Plants is freely available from http://conekt.plant.tools, while the CoNekT source code and documentation can be found at https://github.molgen.mpg.de/proost/CoNekT/.
Collapse
Affiliation(s)
- Sebastian Proost
- Max-Planck Institute for Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam, Germany
| | - Marek Mutwil
- Max-Planck Institute for Molecular Plant Physiology, Am Muehlenberg 1, 14476 Potsdam, Germany.,School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
| |
Collapse
|
13
|
Wegrzyn JL, Staton MA, Street NR, Main D, Grau E, Herndon N, Buehler S, Falk T, Zaman S, Ramnath R, Richter P, Sun L, Condon B, Almsaeed A, Chen M, Mannapperuma C, Jung S, Ficklin S. Cyberinfrastructure to Improve Forest Health and Productivity: The Role of Tree Databases in Connecting Genomes, Phenomes, and the Environment. FRONTIERS IN PLANT SCIENCE 2019; 10:813. [PMID: 31293610 PMCID: PMC6603172 DOI: 10.3389/fpls.2019.00813] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 06/05/2019] [Indexed: 05/11/2023]
Abstract
Despite tremendous advancements in high throughput sequencing, the vast majority of tree genomes, and in particular, forest trees, remain elusive. Although primary databases store genetic resources for just over 2,000 forest tree species, these are largely focused on sequence storage, basic genome assemblies, and functional assignment through existing pipelines. The tree databases reviewed here serve as secondary repositories for community data. They vary in their focal species, the data they curate, and the analytics provided, but they are united in moving toward a goal of centralizing both data access and analysis. They provide frameworks to view and update annotations for complex genomes, interrogate systems level expression profiles, curate data for comparative genomics, and perform real-time analysis with genotype and phenotype data. The organism databases of today are no longer simply catalogs or containers of genetic information. These repositories represent integrated cyberinfrastructure that support cross-site queries and analysis in web-based environments. These resources are striving to integrate across diverse experimental designs, sequence types, and related measures through ontologies, community standards, and web services. Efficient, simple, and robust platforms that enhance the data generated by the research community, contribute to improving forest health and productivity.
Collapse
Affiliation(s)
- Jill L. Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Margaret A. Staton
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Nathaniel R. Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA, United States
| | - Emily Grau
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Nic Herndon
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Sean Buehler
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Taylor Falk
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Sumaira Zaman
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Risharde Ramnath
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Peter Richter
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Lang Sun
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, United States
| | - Bradford Condon
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Abdullah Almsaeed
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Ming Chen
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Chanaka Mannapperuma
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Sook Jung
- Department of Horticulture, Washington State University, Pullman, WA, United States
| | - Stephen Ficklin
- Department of Horticulture, Washington State University, Pullman, WA, United States
| |
Collapse
|
14
|
Lee J, Heath LS, Grene R, Li S. Comparing time series transcriptome data between plants using a network module finding algorithm. PLANT METHODS 2019; 15:61. [PMID: 31164912 PMCID: PMC6544932 DOI: 10.1186/s13007-019-0440-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Accepted: 05/17/2019] [Indexed: 06/01/2023]
Abstract
BACKGROUND Comparative transcriptome analysis is the comparison of expression patterns between homologous genes in different species. Since most molecular mechanistic studies in plants have been performed in model species, including Arabidopsis and rice, comparative transcriptome analysis is particularly important for functional annotation of genes in diverse plant species. Many biological processes, such as embryo development, are highly conserved between different plant species. The challenge is to establish one-to-one mapping of the developmental stages between two species. RESULTS In this manuscript, we solve this problem by converting the gene expression patterns into co-expression networks and then apply network module finding algorithms to the cross-species co-expression network. We describe how such analyses are carried out using bash scripts for preliminary data processing followed by using the R programming language for module finding with a simulated annealing method. We also provide instructions on how to visualize the resulting co-expression networks across species. CONCLUSIONS We provide a comprehensive pipeline from installing software and downloading raw transcriptome data to predicting homologous genes and finding orthologous co-expression networks. From the example provided, we demonstrate the application of our method to reveal functional conservation and divergence of genes in two plant species.
Collapse
Affiliation(s)
- Jiyoung Lee
- Genetics, Bioinformatics and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
- School of Plant and Environmental Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
| | - Lenwood S. Heath
- Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
| | - Ruth Grene
- School of Plant and Environmental Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
| | - Song Li
- Genetics, Bioinformatics and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
- School of Plant and Environmental Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061 USA
| |
Collapse
|
15
|
Naidoo S, Slippers B, Plett JM, Coles D, Oates CN. The Road to Resistance in Forest Trees. FRONTIERS IN PLANT SCIENCE 2019; 10:273. [PMID: 31001287 PMCID: PMC6455082 DOI: 10.3389/fpls.2019.00273] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 02/19/2019] [Indexed: 05/09/2023]
Abstract
In recent years, forests have been exposed to an unprecedented rise in pests and pathogens. This, coupled with the added challenge of climate change, renders forest plantation stock vulnerable to attack and severely limits productivity. Genotypes resistant to such biotic challenges are desired in plantation forestry to reduce losses. Conventional breeding has been a main avenue to obtain resistant genotypes. More recently, genetic engineering has become a viable approach to develop resistance against pests and pathogens in forest trees. Tree genomic resources have contributed to advancements in both these approaches. Genome-wide association studies and genomic selection in tree populations have accelerated breeding tools while integration of various levels of omics information facilitates the selection of candidate genes for genetic engineering. Furthermore, tree associations with non-pathogenic endophytic and subterranean microbes play a critical role in plant health and may be engineered in forest trees to improve resistance in the future. We look at recent studies in forest trees describing defense mechanisms using such approaches and propose the way forward to developing superior genotypes with enhanced resistance against biotic stress.
Collapse
Affiliation(s)
- Sanushka Naidoo
- Division of Genetics, Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Bernard Slippers
- Division of Genetics, Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Jonathan M. Plett
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| | - Donovin Coles
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| | - Caryn N. Oates
- Division of Genetics, Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| |
Collapse
|
16
|
Ambrosino L, Ruggieri V, Bostan H, Miralto M, Vitulo N, Zouine M, Barone A, Bouzayen M, Frusciante L, Pezzotti M, Valle G, Chiusano ML. Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations: an example for tomato and grapevine. BMC Bioinformatics 2018; 19:435. [PMID: 30497367 PMCID: PMC6266932 DOI: 10.1186/s12859-018-2420-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Background “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at http://biosrv.cab.unina.it/comparalogs/gene/search. Conclusions The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species. Electronic supplementary material The online version of this article (10.1186/s12859-018-2420-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Luca Ambrosino
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy.,Current address: Research Infrastructures for Marine Biological Resources, Stazione Zoologica Anton Dohrn, Naples, Italy
| | - Valentino Ruggieri
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy.,Current address: Center for Research in Agricultural Genomics, Cerdanyola, Barcelona, Spain
| | - Hamed Bostan
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy.,Current address: Plants for Human Health Institute, North Carolina State University, Kannapolis, NC, USA
| | - Marco Miralto
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy.,Current address: Research Infrastructures for Marine Biological Resources, Stazione Zoologica Anton Dohrn, Naples, Italy
| | - Nicola Vitulo
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Mohamed Zouine
- Génomique et Biotechnologie des Fruits, UMR990 INRA / INP-Toulouse, Université de Toulouse, Castanet-Tolosan, France
| | - Amalia Barone
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy
| | - Mondher Bouzayen
- Génomique et Biotechnologie des Fruits, UMR990 INRA / INP-Toulouse, Université de Toulouse, Castanet-Tolosan, France
| | - Luigi Frusciante
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy
| | - Mario Pezzotti
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Giorgio Valle
- CRIBI Biotechnology Centre, University of Padova, Padova, Italy
| | - Maria Luisa Chiusano
- Department of Agriculture, University of Naples "Federico II,", Portici, Naples, Italy. .,Research Infrastructures for Marine Biological Resources, Stazione Zoologica Anton Dohrn, Naples, Italy.
| |
Collapse
|
17
|
McClure RS, Overall CC, Hill EA, Song HS, Charania M, Bernstein HC, McDermott JE, Beliaev AS. Species-specific transcriptomic network inference of interspecies interactions. THE ISME JOURNAL 2018; 12:2011-2023. [PMID: 29795448 PMCID: PMC6052077 DOI: 10.1038/s41396-018-0145-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 02/22/2018] [Accepted: 03/26/2018] [Indexed: 12/25/2022]
Abstract
The advent of high-throughput 'omics approaches coupled with computational analyses to reconstruct individual genomes from metagenomes provides a basis for species-resolved functional studies. Here, a mutual information approach was applied to build a gene association network of a commensal consortium, in which a unicellular cyanobacterium Thermosynechococcus elongatus BP1 supported the heterotrophic growth of Meiothermus ruber strain A. Specifically, we used the context likelihood of relatedness (CLR) algorithm to generate a gene association network from 25 transcriptomic datasets representing distinct growth conditions. The resulting interspecies network revealed a number of linkages between genes in each species. While many of the linkages were supported by the existing knowledge of phototroph-heterotroph interactions and the metabolism of these two species several new interactions were inferred as well. These include linkages between amino acid synthesis and uptake genes, as well as carbohydrate and vitamin metabolism, terpenoid metabolism and cell adhesion genes. Further topological examination and functional analysis of specific gene associations suggested that the interactions are likely to center around the exchange of energetically costly metabolites between T. elongatus and M. ruber. Both the approach and conclusions derived from this work are widely applicable to microbial communities for identification of the interactions between species and characterization of community functioning as a whole.
Collapse
Affiliation(s)
- Ryan S McClure
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Christopher C Overall
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Eric A Hill
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Hyun-Seob Song
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Moiz Charania
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
| | - Hans C Bernstein
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
- The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University, Pullman, WA, USA
| | - Jason E McDermott
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA
- Department of Molecular Microbiology and Immunology, Oregon Health and Sciences University, Portland, OR, USA
| | - Alexander S Beliaev
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, 99352, USA.
- Institute for Future Environments, Queensland University of Technology, Brisbane, Australia.
- Center for Tropical Crops and Biocommodities, Queensland University of Technology, Brisbane, Australia.
| |
Collapse
|
18
|
Abstract
The classic Darwinian theory and the Synthetic evolutionary theory and their linear models, while invaluable to study the origins and evolution of species, are not primarily designed to model the evolution of organisations, typically that of ecosystems, nor that of processes. How could evolutionary theory better explain the evolution of biological complexity and diversity? Inclusive network-based analyses of dynamic systems could retrace interactions between (related or unrelated) components. This theoretical shift from a Tree of Life to a Dynamic Interaction Network of Life, which is supported by diverse molecular, cellular, microbiological, organismal, ecological and evolutionary studies, would further unify evolutionary biology.
Collapse
Affiliation(s)
- Eric Bapteste
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), F-75005 Paris, France
- CNRS, UMR7138, Institut de Biologie Paris-Seine, F-75005 Paris, France
| | - Philippe Huneman
- Institut d’Histoire et de Philosophie des Sciences et des Techniques (CNRS / Paris I Sorbonne), F-75006 Paris, France
| |
Collapse
|
19
|
Flores-Sandoval E, Romani F, Bowman JL. Co-expression and Transcriptome Analysis of Marchantia polymorpha Transcription Factors Supports Class C ARFs as Independent Actors of an Ancient Auxin Regulatory Module. FRONTIERS IN PLANT SCIENCE 2018; 9:1345. [PMID: 30327658 PMCID: PMC6174852 DOI: 10.3389/fpls.2018.01345] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 08/27/2018] [Indexed: 05/07/2023]
Abstract
We performed differential gene expression (DGE) and co-expression analyses with genes encoding components of hormonal signaling pathways and the ∼400 annotated transcription factors (TFs) of M. polymorpha across multiple developmental stages of the life cycle. We identify a putative auxin-related co-expression module that has significant overlap with transcripts induced in auxin-treated tissues. Consistent with phylogenetic and functional studies, the class C ARF, MpARF3, is not part of the auxin-related co-expression module and instead is associated with transcripts enriched in gamete-producing gametangiophores. We analyze the Mparf3 and MpmiR160 mutant transcriptomes in the context of coexpression to suggest that MpARF3 may antagonize the reproductive transition via activating the MpMIR11671 and MpMIR529c precursors whose encoded microRNAs target SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL) transcripts of MpSPL1 and MpSPL2. Both MpSPL genes are part of the MpARF3 co-expression group corroborating their functional significance. We provide evidence of the independence of MpARF3 from the auxin-signaling module and provide new testable hypotheses on the role of auxin-related genes in patterning meristems and differentiation events in liverworts.
Collapse
Affiliation(s)
| | - Facundo Romani
- Facultad de Bioquímica y Ciencias Biológicas, Centro Científico Tecnológico CONICET Santa Fe, Instituto de Agrobiotecnología del Litoral, Universidad Nacional del Litoral – CONICET, Santa Fe, Argentina
| | - John L. Bowman
- School of Biological Sciences, Monash University, Melbourne, VIC, Australia
- *Correspondence: John L. Bowman,
| |
Collapse
|
20
|
Eidsaa M, Stubbs L, Almaas E. Comparative analysis of weighted gene co-expression networks in human and mouse. PLoS One 2017; 12:e0187611. [PMID: 29161290 PMCID: PMC5697817 DOI: 10.1371/journal.pone.0187611] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2017] [Accepted: 10/23/2017] [Indexed: 01/21/2023] Open
Abstract
The application of complex network modeling to analyze large co-expression data sets has gained traction during the last decade. In particular, the use of the weighted gene co-expression network analysis framework has allowed an unbiased and systems-level investigation of genotype-phenotype relationships in a wide range of systems. Since mouse is an important model organism for biomedical research on human disease, it is of great interest to identify similarities and differences in the functional roles of human and mouse orthologous genes. Here, we develop a novel network comparison approach which we demonstrate by comparing two gene-expression data sets from a large number of human and mouse tissues. The method uses weighted topological overlap alongside the recently developed network-decomposition method of s-core analysis, which is suitable for making gene-centrality rankings for weighted networks. The aim is to identify globally central genes separately in the human and mouse networks. By comparing the ranked gene lists, we identify genes that display conserved or diverged centrality-characteristics across the networks. This framework only assumes a single threshold value that is chosen from a statistical analysis, and it may be applied to arbitrary network structures and edge-weight distributions, also outside the context of biology. When conducting the comparative network analysis, both within and across the two species, we find a clear pattern of enrichment of transcription factors, for the homeobox domain in particular, among the globally central genes. We also perform gene-ontology term enrichment analysis and look at disease-related genes for the separate networks as well as the network comparisons. We find that gene ontology terms related to regulation and development are generally enriched across the networks. In particular, the genes FOXE3, RHO, RUNX2, ALX3 and RARA, which are disease genes in either human or mouse, are on the top-10 list of globally central genes in the human and mouse networks.
Collapse
Affiliation(s)
- Marius Eidsaa
- Department of Biotechnology, NTNU - Norwegian University of Science and Technology, N-7491 Trondheim, Norway
| | - Lisa Stubbs
- Institute for Genomic Biology, Neuroscience Program, Cell and Developmental Biology, University of Illinois at Urbana-Champaigne, Urbana, IL 61801, United States of America
| | - Eivind Almaas
- Department of Biotechnology, NTNU - Norwegian University of Science and Technology, N-7491 Trondheim, Norway
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and General Practice, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- * E-mail:
| |
Collapse
|
21
|
Jokipii‐Lukkari S, Sundell D, Nilsson O, Hvidsten TR, Street NR, Tuominen H. NorWood: a gene expression resource for evo-devo studies of conifer wood development. THE NEW PHYTOLOGIST 2017; 216:482-494. [PMID: 28186632 PMCID: PMC6079643 DOI: 10.1111/nph.14458] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Accepted: 12/22/2016] [Indexed: 05/04/2023]
Abstract
The secondary xylem of conifers is composed mainly of tracheids that differ anatomically and chemically from angiosperm xylem cells. There is currently no high-spatial-resolution data available profiling gene expression during wood formation for any coniferous species, which limits insight into tracheid development. RNA-sequencing data from replicated, high-spatial-resolution section series throughout the cambial and woody tissues of Picea abies were used to generate the NorWood.conGenIE.org web resource, which facilitates exploration of the associated gene expression profiles and co-expression networks. Integration within PlantGenIE.org enabled a comparative regulomics analysis, revealing divergent co-expression networks between P. abies and the two angiosperm species Arabidopsis thaliana and Populus tremula for the secondary cell wall (SCW) master regulator NAC Class IIB transcription factors. The SCW cellulose synthase genes (CesAs) were located in the neighbourhoods of the NAC factors in A. thaliana and P. tremula, but not in P. abies. The NorWood co-expression network enabled identification of potential SCW CesA regulators in P. abies. The NorWood web resource represents a powerful community tool for generating evo-devo insights into the divergence of wood formation between angiosperms and gymnosperms and for advancing understanding of the regulation of wood development in P. abies.
Collapse
Affiliation(s)
- Soile Jokipii‐Lukkari
- Umeå Plant Science CentreDepartment of Plant PhysiologyUmeå UniversitySE‐901 87UmeåSweden
- Umeå Plant Science CentreDepartment of Forest Genetics and Plant PhysiologySwedish University of Agricultural SciencesSE‐901 84UmeåSweden
| | - David Sundell
- Umeå Plant Science CentreDepartment of Plant PhysiologyUmeå UniversitySE‐901 87UmeåSweden
| | - Ove Nilsson
- Umeå Plant Science CentreDepartment of Forest Genetics and Plant PhysiologySwedish University of Agricultural SciencesSE‐901 84UmeåSweden
| | - Torgeir R. Hvidsten
- Umeå Plant Science CentreDepartment of Plant PhysiologyUmeå UniversitySE‐901 87UmeåSweden
- Department of Chemistry, Biotechnology and Food ScienceNorwegian University of Life Sciences1430ÅsNorway
| | - Nathaniel R. Street
- Umeå Plant Science CentreDepartment of Plant PhysiologyUmeå UniversitySE‐901 87UmeåSweden
| | - Hannele Tuominen
- Umeå Plant Science CentreDepartment of Plant PhysiologyUmeå UniversitySE‐901 87UmeåSweden
| |
Collapse
|
22
|
Sundell D, Street NR, Kumar M, Mellerowicz EJ, Kucukoglu M, Johnsson C, Kumar V, Mannapperuma C, Delhomme N, Nilsson O, Tuominen H, Pesquet E, Fischer U, Niittylä T, Sundberg B, Hvidsten TR. AspWood: High-Spatial-Resolution Transcriptome Profiles Reveal Uncharacterized Modularity of Wood Formation in Populus tremula. THE PLANT CELL 2017; 29:1585-1604. [PMID: 28655750 PMCID: PMC5559752 DOI: 10.1105/tpc.17.00153] [Citation(s) in RCA: 159] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 06/12/2017] [Accepted: 06/24/2017] [Indexed: 05/17/2023]
Abstract
Trees represent the largest terrestrial carbon sink and a renewable source of ligno-cellulose. There is significant scope for yield and quality improvement in these largely undomesticated species, and efforts to engineer elite varieties will benefit from improved understanding of the transcriptional network underlying cambial growth and wood formation. We generated high-spatial-resolution RNA sequencing data spanning the secondary phloem, vascular cambium, and wood-forming tissues of Populus tremula The transcriptome comprised 28,294 expressed, annotated genes, 78 novel protein-coding genes, and 567 putative long intergenic noncoding RNAs. Most paralogs originating from the Salicaceae whole-genome duplication had diverged expression, with the exception of those highly expressed during secondary cell wall deposition. Coexpression network analyses revealed that regulation of the transcriptome underlying cambial growth and wood formation comprises numerous modules forming a continuum of active processes across the tissues. A comparative analysis revealed that a majority of these modules are conserved in Picea abies The high spatial resolution of our data enabled identification of novel roles for characterized genes involved in xylan and cellulose biosynthesis, regulators of xylem vessel and fiber differentiation and lignification. An associated web resource (AspWood, http://aspwood.popgenie.org) provides interactive tools for exploring the expression profiles and coexpression network.
Collapse
Affiliation(s)
- David Sundell
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Nathaniel R Street
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Manoj Kumar
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Ewa J Mellerowicz
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Melis Kucukoglu
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Christoffer Johnsson
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Vikash Kumar
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Chanaka Mannapperuma
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Nicolas Delhomme
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Ove Nilsson
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Hannele Tuominen
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
| | - Edouard Pesquet
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
- Department of Ecology, Environment and Plant Sciences, Stockholm University, 106 91 Stockholm, Sweden
| | - Urs Fischer
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Totte Niittylä
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Björn Sundberg
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, 901 87 Umeå, Sweden
| | - Torgeir R Hvidsten
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, 901 87 Umeå, Sweden
- Department of Chemistry, Biotechnology and Food Sciences, Norwegian University of Life Sciences, 1433 Ås, Norway
| |
Collapse
|
23
|
Shi R, Wang JP, Lin YC, Li Q, Sun YH, Chen H, Sederoff RR, Chiang VL. Tissue and cell-type co-expression networks of transcription factors and wood component genes in Populus trichocarpa. PLANTA 2017; 245:927-938. [PMID: 28083709 DOI: 10.1007/s00425-016-2640-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 12/09/2016] [Indexed: 05/21/2023]
Abstract
Co-expression networks based on transcriptomes of Populus trichocarpa major tissues and specific cell types suggest redundant control of cell wall component biosynthetic genes by transcription factors in wood formation. We analyzed the transcriptomes of five tissues (xylem, phloem, shoot, leaf, and root) and two wood forming cell types (fiber and vessel) of Populus trichocarpa to assemble gene co-expression subnetworks associated with wood formation. We identified 165 transcription factors (TFs) that showed xylem-, fiber-, and vessel-specific expression. Of these 165 TFs, 101 co-expressed (correlation coefficient, r > 0.7) with the 45 secondary cell wall cellulose, hemicellulose, and lignin biosynthetic genes. Each cell wall component gene co-expressed on average with 34 TFs, suggesting redundant control of the cell wall component gene expression. Co-expression analysis showed that the 101 TFs and the 45 cell wall component genes each has two distinct groups (groups 1 and 2), based on their co-expression patterns. The group 1 TFs (44 members) are predominantly xylem and fiber specific, and are all highly positively co-expressed with the group 1 cell wall component genes (30 members), suggesting their roles as major wood formation regulators. Group 1 TFs include a lateral organ boundary domain gene (LBD) that has the highest number of positively correlated cell wall component genes (36) and TFs (47). The group 2 TFs have 57 members, including 14 vessel-specific TFs, and are generally less correlated with the cell wall component genes. An exception is a vessel-specific basic helix-loop-helix (bHLH) gene that negatively correlates with 20 cell wall component genes, and may function as a key transcriptional suppressor. The co-expression networks revealed here suggest a well-structured transcriptional homeostasis for cell wall component biosynthesis during wood formation.
Collapse
Affiliation(s)
- Rui Shi
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
- Mountain Horticultural Crops Research and Extension Center, Department of Horticulture, North Carolina State University, Mills River, NC, 28759, USA
| | - Jack P Wang
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
| | - Ying-Chung Lin
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China
- Department of Life Sciences, College of Life Science, National Taiwan University, Taipei, 10617, Taiwan
| | - Quanzi Li
- State Key Laboratory of Tree Genetics and Breeding, Chinese Academy of Forestry, Beijing, 100091, China
| | - Ying-Hsuan Sun
- Department of Forestry, National Chung Hsing University, Taichung, 40227, Taiwan
| | - Hao Chen
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA
| | - Ronald R Sederoff
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA.
| | - Vincent L Chiang
- Forest Biotechnology Group, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, 27695, USA.
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, 150040, China.
- Department of Forest Biomaterials, North Carolina State University, Raleigh, NC, 27695, USA.
| |
Collapse
|
24
|
Briones-Moreno A, Hernández-García J, Vargas-Chávez C, Romero-Campero FJ, Romero JM, Valverde F, Blázquez MA. Evolutionary Analysis of DELLA-Associated Transcriptional Networks. FRONTIERS IN PLANT SCIENCE 2017; 8:626. [PMID: 28487716 PMCID: PMC5404181 DOI: 10.3389/fpls.2017.00626] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2017] [Accepted: 04/07/2017] [Indexed: 05/18/2023]
Abstract
DELLA proteins are transcriptional regulators present in all land plants which have been shown to modulate the activity of over 100 transcription factors in Arabidopsis, involved in multiple physiological and developmental processes. It has been proposed that DELLAs transduce environmental information to pre-wired transcriptional circuits because their stability is regulated by gibberellins (GAs), whose homeostasis largely depends on environmental signals. The ability of GAs to promote DELLA degradation coincides with the origin of vascular plants, but the presence of DELLAs in other land plants poses at least two questions: what regulatory properties have DELLAs provided to the behavior of transcriptional networks in land plants, and how has the recruitment of DELLAs by GA signaling affected this regulation. To address these issues, we have constructed gene co-expression networks of four different organisms within the green lineage with different properties regarding DELLAs: Arabidopsis thaliana and Solanum lycopersicum (both with GA-regulated DELLA proteins), Physcomitrella patens (with GA-independent DELLA proteins) and Chlamydomonas reinhardtii (a green alga without DELLA), and we have examined the relative evolution of the subnetworks containing the potential DELLA-dependent transcriptomes. Network analysis indicates a relative increase in parameters associated with the degree of interconnectivity in the DELLA-associated subnetworks of land plants, with a stronger effect in species with GA-regulated DELLA proteins. These results suggest that DELLAs may have played a role in the coordination of multiple transcriptional programs along evolution, and the function of DELLAs as regulatory 'hubs' became further consolidated after their recruitment by GA signaling in higher plants.
Collapse
Affiliation(s)
- Asier Briones-Moreno
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas – Universidad Politécnica de ValenciaValencia, Spain
| | - Jorge Hernández-García
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas – Universidad Politécnica de ValenciaValencia, Spain
| | - Carlos Vargas-Chávez
- Institute for Integrative Systems Biology (I2SysBio), University of ValenciaValencia, Spain
| | - Francisco J. Romero-Campero
- Department of Computer Science and Artificial Intelligence, Universidad de SevillaSevilla, Spain
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas – Universidad de SevillaSevilla, Spain
| | - José M. Romero
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas – Universidad de SevillaSevilla, Spain
| | - Federico Valverde
- Instituto de Bioquímica Vegetal y Fotosíntesis, Consejo Superior de Investigaciones Científicas – Universidad de SevillaSevilla, Spain
| | - Miguel A. Blázquez
- Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas – Universidad Politécnica de ValenciaValencia, Spain
- *Correspondence: Miguel A. Blázquez,
| |
Collapse
|
25
|
Kudo T, Terashima S, Takaki Y, Tomita K, Saito M, Kanno M, Yokoyama K, Yano K. PlantExpress: A Database Integrating OryzaExpress and ArthaExpress for Single-species and Cross-species Gene Expression Network Analyses with Microarray-Based Transcriptome Data. PLANT & CELL PHYSIOLOGY 2017; 58:e1. [PMID: 28158643 DOI: 10.1093/pcp/pcw208] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2016] [Accepted: 11/19/2016] [Indexed: 06/06/2023]
Abstract
Publicly available microarray-based transcriptome data on plants are remarkably valuable in terms of abundance and variation of samples, particularly for Oryza sativa (rice) and Arabidopsis thaliana (Arabidopsis). Here, we introduce the web database PlantExpress (http://plantomics.mind.meiji.ac.jp/PlantExpress/) as a platform for gene expression network (GEN) analysis with the public microarray data of rice and Arabidopsis. PlantExpress has two functional modes. The single-species mode is specialized for GEN analysis within one of the species, while the cross-species mode is optimized for comparative GEN analysis between the species. The single-species mode for rice is the new version of OryzaExpress, which we have maintained since 2006. The single-species mode for Arabidopsis, named ArthaExpress, was newly developed. PlantExpress stores data obtained from three microarrays, the Affymetrix Rice Genome Array, the Agilent Rice Gene Expression 4x44K Microarray, and the Affymetrix Arabidopsis ATH1 Genome Array, with respective totals of 2,678, 1,206, and 10,940 samples. This database employs a ‘MyList’ function with which users may save lists of arbitrary genes and samples (experimental conditions) to use in analyses. In cross-species mode, the MyList function allows performing comparative GEN analysis between rice and Arabidopsis. In addition, the gene lists saved in MyList can be directly exported to the PODC database, which provides information and a platform for comparative GEN analysis based on RNA-seq data and knowledge-based functional annotation of plant genes. PlantExpress will facilitate understanding the biological functions of plant genes.
Collapse
Affiliation(s)
- Toru Kudo
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Shin Terashima
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Yuno Takaki
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Ken Tomita
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Misa Saito
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Maasa Kanno
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Koji Yokoyama
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| | - Kentaro Yano
- Bioinformatics Laboratory, School of Agriculture, Meiji University, Higashi-mita, Tama-ku, Kawasaki, Kanagawa, Japan
| |
Collapse
|
26
|
Ingvarsson PK, Hvidsten TR, Street NR. Towards integration of population and comparative genomics in forest trees. THE NEW PHYTOLOGIST 2016; 212:338-44. [PMID: 27575589 DOI: 10.1111/nph.14153] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 06/27/2016] [Indexed: 05/08/2023]
Abstract
Contents 338 I. 338 II. 339 III. 340 IV. 342 343 References 343 SUMMARY: The past decade saw the initiation of an ongoing revolution in sequencing technologies that is transforming all fields of biology. This has been driven by the advent and widespread availability of high-throughput, massively parallel short-read sequencing (MPS) platforms. These technologies have enabled previously unimaginable studies, including draft assemblies of the massive genomes of coniferous species and population-scale resequencing. Transcriptomics studies have likewise been transformed, with RNA-sequencing enabling studies in nonmodel organisms, the discovery of previously unannotated genes (novel transcripts), entirely new classes of RNAs and previously unknown regulatory mechanisms. Here we touch upon current developments in the areas of genome assembly, comparative regulomics and population genetics as they relate to studies of forest tree species.
Collapse
Affiliation(s)
- Pär K Ingvarsson
- Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, 901 87, Umeå, Sweden
| | - Torgeir R Hvidsten
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, 1432, Ås, Norway
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 901 87, Umeå, Sweden
| | - Nathaniel R Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, 901 87, Umeå, Sweden.
| |
Collapse
|
27
|
McClure RS, Overall CC, McDermott JE, Hill EA, Markillie LM, McCue LA, Taylor RC, Ludwig M, Bryant DA, Beliaev AS. Network analysis of transcriptomics expands regulatory landscapes in Synechococcus sp. PCC 7002. Nucleic Acids Res 2016; 44:8810-8825. [PMID: 27568004 PMCID: PMC5062996 DOI: 10.1093/nar/gkw737] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2015] [Accepted: 08/05/2016] [Indexed: 12/29/2022] Open
Abstract
Cyanobacterial regulation of gene expression must contend with a genome organization that lacks apparent functional context, as the majority of cellular processes and metabolic pathways are encoded by genes found at disparate locations across the genome and relatively few transcription factors exist. In this study, global transcript abundance data from the model cyanobacterium Synechococcus sp. PCC 7002 grown under 42 different conditions was analyzed using Context-Likelihood of Relatedness (CLR). The resulting network, organized into 11 modules, provided insight into transcriptional network topology as well as grouping genes by function and linking their response to specific environmental variables. When used in conjunction with genome sequences, the network allowed identification and expansion of novel potential targets of both DNA binding proteins and sRNA regulators. These results offer a new perspective into the multi-level regulation that governs cellular adaptations of the fast-growing physiologically robust cyanobacterium Synechococcus sp. PCC 7002 to changing environmental variables. It also provides a methodological high-throughput approach to studying multi-scale regulatory mechanisms that operate in cyanobacteria. Finally, it provides valuable context for integrating systems-level data to enhance gene grouping based on annotated function, especially in organisms where traditional context analyses cannot be implemented due to lack of operon-based functional organization.
Collapse
Affiliation(s)
- Ryan S McClure
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Christopher C Overall
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Jason E McDermott
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Eric A Hill
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Lye Meng Markillie
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Lee Ann McCue
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Ronald C Taylor
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Marcus Ludwig
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, State College, PA 16802, USA
| | - Donald A Bryant
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, State College, PA 16802, USA Department of Chemistry and Biochemistry, Montana State University, Bozeman, MT 59717, USA
| | - Alexander S Beliaev
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| |
Collapse
|
28
|
Proost S, Mutwil M. Tools of the trade: studying molecular networks in plants. CURRENT OPINION IN PLANT BIOLOGY 2016; 30:143-150. [PMID: 26990519 DOI: 10.1016/j.pbi.2016.02.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 02/23/2016] [Accepted: 02/29/2016] [Indexed: 06/05/2023]
Abstract
Driven by recent technological improvements, genes can be now studied in a larger biological context. Genes and their protein products rarely operate as a single entity and large-scale mapping by protein-protein interactions can unveil the molecular complexes that form in the cell to carry out various functions. Expression analysis under multiple conditions, supplemented with protein-DNA binding data can highlight when genes are active and how they are regulated. Representing these data in networks and finding strongly connected sub-graphs has proven to be a powerful tool to predict the function of unknown genes. As such networks are gradually becoming available for various plant species, it becomes possible to study how networks evolve. This review summarizes currently available network data and related tools for plants. Furthermore we aim to provide an outlook of future analyses that can be done in plants based on work done in other fields.
Collapse
Affiliation(s)
- Sebastian Proost
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Marek Mutwil
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany.
| |
Collapse
|
29
|
Tzfadia O, Diels T, De Meyer S, Vandepoele K, Aharoni A, Van de Peer Y. CoExpNetViz: Comparative Co-Expression Networks Construction and Visualization Tool. FRONTIERS IN PLANT SCIENCE 2016; 6:1194. [PMID: 26779228 PMCID: PMC4700130 DOI: 10.3389/fpls.2015.01194] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 12/11/2015] [Indexed: 05/23/2023]
Abstract
MOTIVATION Comparative transcriptomics is a common approach in functional gene discovery efforts. It allows for finding conserved co-expression patterns between orthologous genes in closely related plant species, suggesting that these genes potentially share similar function and regulation. Several efficient co-expression-based tools have been commonly used in plant research but most of these pipelines are limited to data from model systems, which greatly limit their utility. Moreover, in addition, none of the existing pipelines allow plant researchers to make use of their own unpublished gene expression data for performing a comparative co-expression analysis and generate multi-species co-expression networks. RESULTS We introduce CoExpNetViz, a computational tool that uses a set of query or "bait" genes as an input (chosen by the user) and a minimum of one pre-processed gene expression dataset. The CoExpNetViz algorithm proceeds in three main steps; (i) for every bait gene submitted, co-expression values are calculated using mutual information and Pearson correlation coefficients, (ii) non-bait (or target) genes are grouped based on cross-species orthology, and (iii) output files are generated and results can be visualized as network graphs in Cytoscape. AVAILABILITY The CoExpNetViz tool is freely available both as a PHP web server (link: http://bioinformatics.psb.ugent.be/webtools/coexpr/) (implemented in C++) and as a Cytoscape plugin (implemented in Java). Both versions of the CoExpNetViz tool support LINUX and Windows platforms.
Collapse
Affiliation(s)
- Oren Tzfadia
- Department of Plant Systems Biology, Vlaams Instituut voor BiotechnologieGhent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent UniversityGhent, Belgium
- Bioinformatics Institute Ghent, Ghent UniversityGhent, Belgium
| | - Tim Diels
- Department of Plant Systems Biology, Vlaams Instituut voor BiotechnologieGhent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent UniversityGhent, Belgium
- Bioinformatics Institute Ghent, Ghent UniversityGhent, Belgium
| | - Sam De Meyer
- Department of Plant Systems Biology, Vlaams Instituut voor BiotechnologieGhent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent UniversityGhent, Belgium
| | - Klaas Vandepoele
- Department of Plant Systems Biology, Vlaams Instituut voor BiotechnologieGhent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent UniversityGhent, Belgium
- Bioinformatics Institute Ghent, Ghent UniversityGhent, Belgium
| | - Asaph Aharoni
- Department of Plant Sciences and the Environment, Weizmann Institute of ScienceRehovot, Israel
| | - Yves Van de Peer
- Department of Plant Systems Biology, Vlaams Instituut voor BiotechnologieGhent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent UniversityGhent, Belgium
- Bioinformatics Institute Ghent, Ghent UniversityGhent, Belgium
- Genomics Research Institute, University of PretoriaPretoria, South Africa
| |
Collapse
|
30
|
Serin EAR, Nijveen H, Hilhorst HWM, Ligterink W. Learning from Co-expression Networks: Possibilities and Challenges. FRONTIERS IN PLANT SCIENCE 2016; 7:444. [PMID: 27092161 PMCID: PMC4825623 DOI: 10.3389/fpls.2016.00444] [Citation(s) in RCA: 186] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Accepted: 03/21/2016] [Indexed: 05/18/2023]
Abstract
Plants are fascinating and complex organisms. A comprehensive understanding of the organization, function and evolution of plant genes is essential to disentangle important biological processes and to advance crop engineering and breeding strategies. The ultimate aim in deciphering complex biological processes is the discovery of causal genes and regulatory mechanisms controlling these processes. The recent surge of omics data has opened the door to a system-wide understanding of the flow of biological information underlying complex traits. However, dealing with the corresponding large data sets represents a challenging endeavor that calls for the development of powerful bioinformatics methods. A popular approach is the construction and analysis of gene networks. Such networks are often used for genome-wide representation of the complex functional organization of biological systems. Network based on similarity in gene expression are called (gene) co-expression networks. One of the major application of gene co-expression networks is the functional annotation of unknown genes. Constructing co-expression networks is generally straightforward. In contrast, the resulting network of connected genes can become very complex, which limits its biological interpretation. Several strategies can be employed to enhance the interpretation of the networks. A strategy in coherence with the biological question addressed needs to be established to infer reliable networks. Additional benefits can be gained from network-based strategies using prior knowledge and data integration to further enhance the elucidation of gene regulatory relationships. As a result, biological networks provide many more applications beyond the simple visualization of co-expressed genes. In this study we review the different approaches for co-expression network inference in plants. We analyse integrative genomics strategies used in recent studies that successfully identified candidate genes taking advantage of gene co-expression networks. Additionally, we discuss promising bioinformatics approaches that predict networks for specific purposes.
Collapse
Affiliation(s)
- Elise A. R. Serin
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
| | - Harm Nijveen
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
- Laboratory of Bioinformatics, Wageningen UniversityWageningen, Netherlands
| | - Henk W. M. Hilhorst
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
| | - Wilco Ligterink
- Wageningen Seed Lab, Laboratory of Plant Physiology, Wageningen UniversityWageningen, Netherlands
- *Correspondence: Wilco Ligterink
| |
Collapse
|
31
|
Sundell D, Mannapperuma C, Netotea S, Delhomme N, Lin YC, Sjödin A, Van de Peer Y, Jansson S, Hvidsten TR, Street NR. The Plant Genome Integrative Explorer Resource: PlantGenIE.org. THE NEW PHYTOLOGIST 2015; 208:1149-56. [PMID: 26192091 DOI: 10.1111/nph.13557] [Citation(s) in RCA: 168] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 06/08/2015] [Indexed: 05/18/2023]
Abstract
Accessing and exploring large-scale genomics data sets remains a significant challenge to researchers without specialist bioinformatics training. We present the integrated PlantGenIE.org platform for exploration of Populus, conifer and Arabidopsis genomics data, which includes expression networks and associated visualization tools. Standard features of a model organism database are provided, including genome browsers, gene list annotation, Blast homology searches and gene information pages. Community annotation updating is supported via integration of WebApollo. We have produced an RNA-sequencing (RNA-Seq) expression atlas for Populus tremula and have integrated these data within the expression tools. An updated version of the ComPlEx resource for performing comparative plant expression analyses of gene coexpression network conservation between species has also been integrated. The PlantGenIE.org platform provides intuitive access to large-scale and genome-wide genomics data from model forest tree species, facilitating both community contributions to annotation improvement and tools supporting use of the included data resources to inform biological insight.
Collapse
Affiliation(s)
- David Sundell
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
- Computational Life Science Cluster (CLiC), Umeå University, SE-907 81, Umeå, Sweden
| | - Chanaka Mannapperuma
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
| | - Sergiu Netotea
- Computational Life Science Cluster (CLiC), Umeå University, SE-907 81, Umeå, Sweden
| | - Nicolas Delhomme
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
| | - Yao-Cheng Lin
- Department of Plant Systems Biology, VIB, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Andreas Sjödin
- Computational Life Science Cluster (CLiC), Umeå University, SE-907 81, Umeå, Sweden
- Department of Chemistry, Umeå University, SE-907 81, Umeå, Sweden
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Genomics Research Institute, University of Pretoria, Hatfield Campus, 0028, Pretoria, South Africa
| | - Stefan Jansson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
| | - Torgeir R Hvidsten
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, 1432, Ås, Norway
| | - Nathaniel R Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, SE-907 81, Umeå, Sweden
- Computational Life Science Cluster (CLiC), Umeå University, SE-907 81, Umeå, Sweden
| |
Collapse
|
32
|
A comparison of human and mouse gene co-expression networks reveals conservation and divergence at the tissue, pathway and disease levels. BMC Evol Biol 2015; 15:259. [PMID: 26589719 PMCID: PMC4654840 DOI: 10.1186/s12862-015-0534-7] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Accepted: 11/09/2015] [Indexed: 12/25/2022] Open
Abstract
Background A deeper understanding of differences and similarities in transcriptional regulation between species can uncover important information about gene functions and the role of genes in disease. Deciphering such patterns between mice and humans is especially important since mice play an essential role in biomedical research. Results Here, in order to characterize evolutionary changes between humans and mice, we compared gene co-expression maps to evaluate the conservation of co-expression. We show that the conservation of co-expression connectivity of homologous genes is negatively correlated with molecular evolution rates, as expected. Then we investigated evolutionary aspects of gene sets related to functions, tissues, pathways and diseases. Genes expressed in the testis, eye and skin, and those associated with regulation of transcription, olfaction, PI3K signalling, response to virus and bacteria were more divergent between mice and humans in terms of co-expression connectivity. Surprisingly, a deeper investigation of the PI3K signalling cascade revealed that its divergence is caused by the most crucial genes of this pathway, such as mTOR and AKT2. On the other hand, our analysis revealed that genes expressed in the brain and in the bone, and those associated with cell adhesion, cell cycle, DNA replication and DNA repair are most strongly conserved in terms of co-expression network connectivity as well as having a lower rate of duplication events. Genes involved in lipid metabolism and genes specific to blood showed a signature of increased co-expression connectivity in the mouse. In terms of diseases, co-expression connectivity of genes related to metabolic disorders is the most strongly conserved between mice and humans and tumor-related genes the most divergent. Conclusions This work contributes to discerning evolutionary patterns between mice and humans in terms of gene interactions. Conservation of co-expression is a powerful approach to identify gene targets and processes with potential similarity and divergence between mice and humans, which has implications for drug testing and other studies employing the mouse as a model organism. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0534-7) contains supplementary material, which is available to authorized users.
Collapse
|
33
|
Gehan MA, Greenham K, Mockler TC, McClung CR. Transcriptional networks-crops, clocks, and abiotic stress. CURRENT OPINION IN PLANT BIOLOGY 2015; 24:39-46. [PMID: 25646668 DOI: 10.1016/j.pbi.2015.01.004] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 01/07/2015] [Accepted: 01/08/2015] [Indexed: 05/20/2023]
Abstract
Several factors affect the yield potential and geographical range of crops including the circadian clock, water availability, and seasonal temperature changes. In order to sustain and increase plant productivity on marginal land in the face of both biotic and abiotic stresses, we need to more efficiently generate stress-resistant crops through marker-assisted breeding, genetic modification, and new genome-editing technologies. To leverage these strategies for producing the next generation of crops, future transcriptomic data acquisition should be pursued with an appropriate temporal design and analyzed with a network-centric approach. The following review focuses on recent developments in abiotic stress transcriptional networks in economically important crops and will highlight the utility of correlation-based network analysis and applications.
Collapse
Affiliation(s)
- Malia A Gehan
- Donald Danforth Plant Science Center, St. Louis, MO 63132, United States
| | - Kathleen Greenham
- Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, United States
| | - Todd C Mockler
- Donald Danforth Plant Science Center, St. Louis, MO 63132, United States
| | - C Robertson McClung
- Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, United States.
| |
Collapse
|
34
|
Davila-Velderrain J, Martinez-Garcia JC, Alvarez-Buylla ER. Descriptive vs. mechanistic network models in plant development in the post-genomic era. Methods Mol Biol 2015; 1284:455-79. [PMID: 25757787 DOI: 10.1007/978-1-4939-2444-8_23] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2023]
Abstract
Network modeling is now a widespread practice in systems biology, as well as in integrative genomics, and it constitutes a rich and diverse scientific research field. A conceptually clear understanding of the reasoning behind the main existing modeling approaches, and their associated technical terminologies, is required to avoid confusions and accelerate the transition towards an undeniable necessary more quantitative, multidisciplinary approach to biology. Herein, we focus on two main network-based modeling approaches that are commonly used depending on the information available and the intended goals: inference-based methods and system dynamics approaches. As far as data-based network inference methods are concerned, they enable the discovery of potential functional influences among molecular components. On the other hand, experimentally grounded network dynamical models have been shown to be perfectly suited for the mechanistic study of developmental processes. How do these two perspectives relate to each other? In this chapter, we describe and compare both approaches and then apply them to a given specific developmental module. Along with the step-by-step practical implementation of each approach, we also focus on discussing their respective goals, utility, assumptions, and associated limitations. We use the gene regulatory network (GRN) involved in Arabidopsis thaliana Root Stem Cell Niche patterning as our illustrative example. We show that descriptive models based on functional genomics data can provide important background information consistent with experimentally supported functional relationships integrated in mechanistic GRN models. The rationale of analysis and modeling can be applied to any other well-characterized functional developmental module in multicellular organisms, like plants and animals.
Collapse
Affiliation(s)
- J Davila-Velderrain
- Laboratorio de Genética Molecular, Desarrollo y Evolución de Plantas, Instituto de Ecología, Universidad Nacional Autónoma de México, Ciudad Universitaria, Av. Universidad 3000, México D.F., 04510, Mexico
| | | | | |
Collapse
|
35
|
De La Torre AR, Birol I, Bousquet J, Ingvarsson PK, Jansson S, Jones SJM, Keeling CI, MacKay J, Nilsson O, Ritland K, Street N, Yanchuk A, Zerbe P, Bohlmann J. Insights into conifer giga-genomes. PLANT PHYSIOLOGY 2014; 166:1724-32. [PMID: 25349325 PMCID: PMC4256843 DOI: 10.1104/pp.114.248708] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes.
Collapse
Affiliation(s)
- Amanda R De La Torre
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Inanc Birol
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Jean Bousquet
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Pär K Ingvarsson
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Stefan Jansson
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Steven J M Jones
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Christopher I Keeling
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - John MacKay
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Ove Nilsson
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Kermit Ritland
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Nathaniel Street
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Alvin Yanchuk
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Philipp Zerbe
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| | - Jörg Bohlmann
- Department of Ecology and Environmental Sciences (A.R.D.L.T., P.K.I.) and Umeå Plant Science Center, Department of Plant Physiology (P.K.I., S.J., O.N., N.S.), Umeå University, SE-901 87 Umea, Sweden;Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada V5Z 4S6 (I.B., S.J.M.J.);Canada Research Chair in Forest and Environmental Genomics (J.Bou.) and Center for Forest Research and Institute for Systems and Integrative Biology (J.Bou., J.M.), Université Laval, Quebec, Quebec, Canada G1V 0A6;Michael Smith Laboratories (C.I.K., P.Z., J.Boh.) and Department of Forest and Conservation Sciences (K.R., J.Boh.), University of British Columbia, Vancouver, British Columbia, Canada V6T 1Z4; andBritish Columbia Ministry of Forests, Lands, and Natural Resource Operations, Victoria, British Columbia, Canada V8W 9C2 (A.Y.)
| |
Collapse
|
36
|
Mähler N, Cheregi O, Funk C, Netotea S, Hvidsten TR. Synergy: a web resource for exploring gene regulation in Synechocystis sp. PCC6803. PLoS One 2014; 9:e113496. [PMID: 25420108 PMCID: PMC4242644 DOI: 10.1371/journal.pone.0113496] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 10/24/2014] [Indexed: 12/22/2022] Open
Abstract
Despite being a highly studied model organism, most genes of the cyanobacterium Synechocystis sp. PCC 6803 encode proteins with completely unknown function. To facilitate studies of gene regulation in Synechocystis, we have developed Synergy (http://synergy.plantgenie.org), a web application integrating co-expression networks and regulatory motif analysis. Co-expression networks were inferred from publicly available microarray experiments, while regulatory motifs were identified using a phylogenetic footprinting approach. Automatically discovered motifs were shown to be enriched in the network neighborhoods of regulatory proteins much more often than in the neighborhoods of non-regulatory genes, showing that the data provide a sound starting point for studying gene regulation in Synechocystis. Concordantly, we provide several case studies demonstrating that Synergy can be used to find biologically relevant regulatory mechanisms in Synechocystis. Synergy can be used to interactively perform analyses such as gene/motif search, network visualization and motif/function enrichment. Considering the importance of Synechocystis for photosynthesis and biofuel research, we believe that Synergy will become a valuable resource to the research community.
Collapse
Affiliation(s)
- Niklas Mähler
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | | | - Christiane Funk
- Department of Chemistry, Umeå University, Umeå, Sweden
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Sergiu Netotea
- Department of Chemistry, Umeå University, Umeå, Sweden
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
- Computational Life Science Cluster, Umeå University, Umeå, Sweden
| | - Torgeir R. Hvidsten
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| |
Collapse
|
37
|
Wong DCJ, Sweetman C, Ford CM. Annotation of gene function in citrus using gene expression information and co-expression networks. BMC PLANT BIOLOGY 2014; 14:186. [PMID: 25023870 PMCID: PMC4108274 DOI: 10.1186/1471-2229-14-186] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 06/30/2014] [Indexed: 05/20/2023]
Abstract
BACKGROUND The genus Citrus encompasses major cultivated plants such as sweet orange, mandarin, lemon and grapefruit, among the world's most economically important fruit crops. With increasing volumes of transcriptomics data available for these species, Gene Co-expression Network (GCN) analysis is a viable option for predicting gene function at a genome-wide scale. GCN analysis is based on a "guilt-by-association" principle whereby genes encoding proteins involved in similar and/or related biological processes may exhibit similar expression patterns across diverse sets of experimental conditions. While bioinformatics resources such as GCN analysis are widely available for efficient gene function prediction in model plant species including Arabidopsis, soybean and rice, in citrus these tools are not yet developed. RESULTS We have constructed a comprehensive GCN for citrus inferred from 297 publicly available Affymetrix Genechip Citrus Genome microarray datasets, providing gene co-expression relationships at a genome-wide scale (33,000 transcripts). The comprehensive citrus GCN consists of a global GCN (condition-independent) and four condition-dependent GCNs that survey the sweet orange species only, all citrus fruit tissues, all citrus leaf tissues, or stress-exposed plants. All of these GCNs are clustered using genome-wide, gene-centric (guide) and graph clustering algorithms for flexibility of gene function prediction. For each putative cluster, gene ontology (GO) enrichment and gene expression specificity analyses were performed to enhance gene function, expression and regulation pattern prediction. The guide-gene approach was used to infer novel roles of genes involved in disease susceptibility and vitamin C metabolism, and graph-clustering approaches were used to investigate isoprenoid/phenylpropanoid metabolism in citrus peel, and citric acid catabolism via the GABA shunt in citrus fruit. CONCLUSIONS Integration of citrus gene co-expression networks, functional enrichment analysis and gene expression information provide opportunities to infer gene function in citrus. We present a publicly accessible tool, Network Inference for Citrus Co-Expression (NICCE, http://citrus.adelaide.edu.au/nicce/home.aspx), for the gene co-expression analysis in citrus.
Collapse
Affiliation(s)
- Darren CJ Wong
- School of Agriculture, Food and Wine, University of Adelaide, Adelaide 5064, South Australia, Australia
| | - Crystal Sweetman
- School of Agriculture, Food and Wine, University of Adelaide, Adelaide 5064, South Australia, Australia
| | - Christopher M Ford
- School of Agriculture, Food and Wine, University of Adelaide, Adelaide 5064, South Australia, Australia
| |
Collapse
|