51
|
Fadhal E, Gamieldien J, Mwambene EC. Protein interaction networks as metric spaces: a novel perspective on distribution of hubs. BMC SYSTEMS BIOLOGY 2014; 8:6. [PMID: 24438364 PMCID: PMC3902029 DOI: 10.1186/1752-0509-8-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Accepted: 01/07/2014] [Indexed: 02/02/2023]
Abstract
Background In the post-genomic era, a central and overarching question in the analysis of protein-protein interaction networks continues to be whether biological characteristics and functions of proteins such as lethality, physiological malfunctions and malignancy are intimately linked to the topological role proteins play in the network as a mathematical structure. One of the key features that have implicitly been presumed is the existence of hubs, highly connected proteins considered to play a crucial role in biological networks. We explore the structure of protein interaction networks of a number of organisms as metric spaces and show that hubs are non randomly positioned and, from a distance point of view, centrally located. Results By analysing how the human functional protein interaction network, the human signalling network, Saccharomyces cerevisiae, Arabidopsis thaliana and Escherichia coli protein-protein interaction networks from various databases are distributed as metric spaces, we found that proteins interact radially through a central node, high degree proteins coagulate in the centre of the network, and those far away from the centre have low degree. We further found that the distribution of proteins from the centre is in some hierarchy of importance and has biological significance. Conclusions We conclude that structurally, protein interaction networks are mathematical entities that share properties between organisms but not necessarily with other networks that follow power-law. We therefore conclude that (i) if there are hubs defined by degree, they are not distributed randomly; (ii) zones closest to the centre of the network are enriched for critically important proteins and are also functionally very specialised for specific 'house keeping’ functions; (iii) proteins closest to the network centre are functionally less dispensable and may present good targets for therapy development; and (iv) network biology requires its own network theory modelled on actual biological evidence and that simply adopting theories from the social sciences may be misleading.
Collapse
Affiliation(s)
| | | | - Eric C Mwambene
- Department of Mathematics and Applied Mathematics, University of the Western Cape, P/Bag X17, Bellville, South Africa.
| |
Collapse
|
52
|
Fukushima A, Kanaya S, Nishida K. Integrated network analysis and effective tools in plant systems biology. FRONTIERS IN PLANT SCIENCE 2014; 5:598. [PMID: 25408696 PMCID: PMC4219401 DOI: 10.3389/fpls.2014.00598] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Accepted: 10/14/2014] [Indexed: 05/18/2023]
Abstract
One of the ultimate goals in plant systems biology is to elucidate the genotype-phenotype relationship in plant cellular systems. Integrated network analysis that combines omics data with mathematical models has received particular attention. Here we focus on the latest cutting-edge computational advances that facilitate their combination. We highlight (1) network visualization tools, (2) pathway analyses, (3) genome-scale metabolic reconstruction, and (4) the integration of high-throughput experimental data and mathematical models. Multi-omics data that contain the genome, transcriptome, proteome, and metabolome and mathematical models are expected to integrate and expand our knowledge of complex plant metabolisms.
Collapse
Affiliation(s)
- Atsushi Fukushima
- RIKEN Center for Sustainable Resource ScienceTsurumi, Yokohama, Japan
- Japan Science and Technology Agency, National Bioscience Database CenterTokyo, Japan
- *Correspondence: Atsushi Fukushima, RIKEN Center for Sustainable Resource Science, 1-7-22 Suehirocho, Tsurumi, Yokohama 230-0045, Japan e-mail:
| | - Shigehiko Kanaya
- Graduate School of Information Science, Nara Institute of Science and TechnologyNara, Japan
| | - Kozo Nishida
- Japan Science and Technology Agency, National Bioscience Database CenterTokyo, Japan
- Laboratory for Biochemical Simulation, RIKEN Quantitative Biology CenterOsaka, Japan
| |
Collapse
|
53
|
Duan G, Walther D, Schulze WX. Reconstruction and analysis of nutrient-induced phosphorylation networks in Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2013; 4:540. [PMID: 24400017 PMCID: PMC3872036 DOI: 10.3389/fpls.2013.00540] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2013] [Accepted: 12/12/2013] [Indexed: 05/23/2023]
Abstract
Elucidating the dynamics of molecular processes in living organisms in response to external perturbations is a central goal in modern systems biology. We investigated the dynamics of protein phosphorylation events in Arabidopsis thaliana exposed to changing nutrient conditions. Phosphopeptide expression levels were detected at five consecutive time points over a time interval of 30 min after nutrient resupply following prior starvation. The three tested inorganic, ionic nutrients NH(+) 4, NO(-) 3, PO(3-) 4 elicited similar phosphosignaling responses that were distinguishable from those invoked by the sugars mannitol, sucrose. When embedded in the protein-protein interaction network of Arabidopsis thaliana, phosphoproteins were found to exhibit a higher degree compared to average proteins. Based on the time-series data, we reconstructed a network of regulatory interactions mediated by phosphorylation. The performance of different network inference methods was evaluated by the observed likelihood of physical interactions within and across different subcellular compartments and based on gene ontology semantic similarity. The dynamic phosphorylation network was then reconstructed using a Pearson correlation method with added directionality based on partial variance differences. The topology of the inferred integrated network corresponds to an information dissemination architecture, in which the phosphorylation signal is passed on to an increasing number of phosphoproteins stratified into an initiation, processing, and effector layer. Specific phosphorylation peptide motifs associated with the distinct layers were identified indicating the action of layer-specific kinases. Despite the limited temporal resolution, combined with information on subcellular location, the available time-series data proved useful for reconstructing the dynamics of the molecular signaling cascade in response to nutrient stress conditions in the plant Arabidopsis thaliana.
Collapse
Affiliation(s)
- Guangyou Duan
- Max Planck Institute of Molecular Plant PhysiologyPotsdam, Germany
| | - Dirk Walther
- Max Planck Institute of Molecular Plant PhysiologyPotsdam, Germany
| | - Waltraud X. Schulze
- Max Planck Institute of Molecular Plant PhysiologyPotsdam, Germany
- Department of Plant Systems Biology, Universität HohenheimStuttgart, Germany
| |
Collapse
|
54
|
Lu T, Dou Y, Zhang C. Fuzzy clustering of CPP family in plants with evolution and interaction analyses. BMC Bioinformatics 2013; 14 Suppl 13:S10. [PMID: 24268301 PMCID: PMC3849782 DOI: 10.1186/1471-2105-14-s13-s10] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Transcription factors have been studied intensively because they play an important role in gene expression regulation. However, the transcription factors in the CPP family (cystein-rich polycomb-like protein), compared with other transcription factor families, have not received sufficient attention, despite their wide prevalence in a broad spectrum of species, from plants to animals. The total number of known CPP transcription factors in plants is 111 from 16 plants, but only 2 of them have been studied so far, namely TSO1 and CPP1 in Arabidopsis thaliana and soybean, respectively. Methods In this work, to study their functions, we applied the fuzzy clustering method to all plant CPP transcription factors. The feature vector of each protein sequence for the fuzzy clustering method is encoded by the short length peptides and the combination of functional domain models. Results and conclusions With the fuzzy clustering method, all plant CPP transcription factors are grouped into two subfamilies. A systems approach, including Expressed Sequence Tag analysis, evolutionary analysis, protein-protein interaction network analysis and co-expression analysis, is employed to validate the clustering results, the results of which also indicates that the transcription factors from different subfamilies show uncorrelated responses.
Collapse
|
55
|
Ye CY, Li T, Yin H, Weston DJ, Tuskan GA, Tschaplinski TJ, Yang X. Evolutionary analyses of non-family genes in plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 73:788-797. [PMID: 23145488 DOI: 10.1111/tpj.12073] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Revised: 10/16/2012] [Accepted: 11/07/2012] [Indexed: 06/01/2023]
Abstract
There are a large number of 'non-family' (NF) genes that do not cluster into families with three or more members per genome. While gene families have been extensively studied, a systematic analysis of NF genes has not been reported. We performed comparative studies on NF genes in 14 plant species. Based on the clustering of protein sequences, we identified ~94,000 NF genes across these species that were divided into five evolutionary groups: Viridiplantae wide, angiosperm specific, monocot specific, dicot specific, and those that were species specific. Our analysis revealed that the NF genes resulted largely from less frequent gene duplications and/or a higher rate of gene loss after segmental duplication relative to genes in both low-copy-number families (LF; 3-10 copies per genome) and high-copy-number families (HF; >10 copies). Furthermore, we identified functions enriched in the NF gene set as compared with the HF genes. We found that NF genes were involved in essential biological processes shared by all plant lineages (e.g. photosynthesis and translation), as well as gene regulation and stress responses associated with phylogenetic diversification. In particular, our analysis of an Arabidopsis protein-protein interaction network revealed that hub proteins with the top 10% most connections were over-represented in the NF set relative to the HF set. This research highlights the roles that NF genes may play in evolutionary and functional genomics research.
Collapse
Affiliation(s)
- Chu-Yu Ye
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | | | | | | | | | | | | |
Collapse
|
56
|
Tsai WC, Fu CH, Hsiao YY, Huang YM, Chen LJ, Wang M, Liu ZJ, Chen HH. OrchidBase 2.0: comprehensive collection of Orchidaceae floral transcriptomes. PLANT & CELL PHYSIOLOGY 2013; 54:e7. [PMID: 23314755 DOI: 10.1093/pcp/pcs187] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Both floral development and evolutionary trends of orchid flowers have long attracted the interest of biologists. However, expressed sequences derived from the flowers of other orchid subfamilies are still scarce except for a few species in Epidendroideae. In order to broadly increase our scope of Orchidaceae genetic information, we updated the OrchidBase to version 2.0 which has 1,562,071 newly added floral non-redundant transcribed sequences (unigenes) collected comprehensively from 10 orchid species across five subfamilies of Orchidaceae. A total of 662,671,362 reads were obtained by using next-generation sequencing (NGS) Solexa Illumina sequencers. After assembly, on average 156,207 unigenes were generated for each species. The average length of a unigene is 347 bp. We made a detailed annotation including general information, relative expression level, gene ontology (GO), KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway mapping and gene network prediction. The online resources for putative annotation can be searched either by text or by using BLAST, and the results can be explored on the website and downloaded. We have re-designed the user interface in the new version. Users can enter the Phalaenopsis transcriptome or Orchidaceae floral transcriptome to browse or search the unigenes. OrchidBase 2.0 is freely available at http://orchidbase.itps.ncku.edu.tw/.
Collapse
Affiliation(s)
- Wen-Chieh Tsai
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan, Taiwan
| | | | | | | | | | | | | | | |
Collapse
|
57
|
Giorgi FM, Del Fabbro C, Licausi F. Comparative study of RNA-seq- and microarray-derived coexpression networks in Arabidopsis thaliana. ACTA ACUST UNITED AC 2013; 29:717-24. [PMID: 23376351 DOI: 10.1093/bioinformatics/btt053] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
MOTIVATION Coexpression networks are data-derived representations of genes behaving in a similar way across tissues and experimental conditions. They have been used for hypothesis generation and guilt-by-association approaches for inferring functions of previously unknown genes. So far, the main platform for expression data has been DNA microarrays; however, the recent development of RNA-seq allows for higher accuracy and coverage of transcript populations. It is therefore important to assess the potential for biological investigation of coexpression networks derived from this novel technique in a condition-independent dataset. RESULTS We collected 65 publicly available Illumina RNA-seq high quality Arabidopsis thaliana samples and generated Pearson correlation coexpression networks. These networks were then compared with those derived from analogous microarray data. We show how Variance-Stabilizing Transformed (VST) RNA-seq data samples are the most similar to microarray ones, with respect to inter-sample variation, correlation coefficient distribution and network topological architecture. Microarray networks show a slightly higher score in biology-derived quality assessments such as overlap with the known protein-protein interaction network and edge ontological agreement. Different coexpression network centralities are investigated; in particular, we show how betweenness centrality is generally a positive marker for essential genes in A.thaliana, regardless of the platform originating the data. In the end, we focus on a specific gene network case, showing that although microarray data seem more suited for gene network reverse engineering, RNA-seq offers the great advantage of extending coexpression analyses to the entire transcriptome.
Collapse
|
58
|
A computational prediction of structure and function of novel homologue of Arabidopsis thaliana Vps51/Vps67 subunit in Corchorus olitorius. Interdiscip Sci 2013; 4:256-67. [PMID: 23354814 DOI: 10.1007/s12539-012-0139-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 06/05/2012] [Accepted: 07/29/2012] [Indexed: 10/27/2022]
Abstract
Vps mediated vesicular transport is important for transferring macromolecules trapped inside a vesicle. Although highly abundant, Vps shows tremendous sequence variation among diverse array of species. However, this difference in sequence, which seems to also translate into substantial functional variation, is hardly characterized in Corchorus spp. Here, our computational study investigates structural and functional features of one of the Vps subunit namely Vps51/Vps67 in C. olitorius. Broad scale structural characterization revealed novel information about the overall Vps structure and binding sites. Moreover, functional analyses indicate interaction partners which were unexplored to date. Since membrane trafficking is essentially associated with nutrient uptake and chemical de-toxification, characterization of the Vps subunit can well provide us with better insight into important agronomic traits such as stress response, immune response and phytoremediation capacity.
Collapse
|
59
|
Braun P, Aubourg S, Van Leene J, De Jaeger G, Lurin C. Plant protein interactomes. ANNUAL REVIEW OF PLANT BIOLOGY 2013; 64:161-87. [PMID: 23330791 DOI: 10.1146/annurev-arplant-050312-120140] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Protein-protein interactions are a critical element of biological systems, and the analysis of interaction partners can provide valuable hints about unknown functions of a protein. In recent years, several large-scale protein interaction studies have begun to unravel the complex networks through which plant proteins exert their functions. Two major classes of experimental approaches are used for protein interaction mapping: analysis of direct interactions using binary methods such as yeast two-hybrid or split ubiquitin, and analysis of protein complexes through affinity purification followed by mass spectrometry. In addition, bioinformatics predictions can suggest interactions that have evaded detection by other methods or those of proteins that have not been investigated. Here we review the major approaches to construct, analyze, use, and carry out quality control on plant protein interactome networks. We present experimental and computational approaches for large-scale mapping, methods for validation or smaller-scale functional studies, important bioinformatics resources, and findings from recently published large-scale plant interactome network maps.
Collapse
Affiliation(s)
- Pascal Braun
- Department of Plant Systems Biology, Center for Life and Food Sciences Weihenstephan, Technische Universität München (TUM), 85354 Freising-Weihenstephan, Germany.
| | | | | | | | | |
Collapse
|
60
|
Hollister JD, Arnold BJ, Svedin E, Xue KS, Dilkes BP, Bomblies K. Genetic adaptation associated with genome-doubling in autotetraploid Arabidopsis arenosa. PLoS Genet 2012; 8:e1003093. [PMID: 23284289 PMCID: PMC3527224 DOI: 10.1371/journal.pgen.1003093] [Citation(s) in RCA: 113] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 09/27/2012] [Indexed: 11/18/2022] Open
Abstract
Genome duplication, which results in polyploidy, is disruptive to fundamental biological processes. Genome duplications occur spontaneously in a range of taxa and problems such as sterility, aneuploidy, and gene expression aberrations are common in newly formed polyploids. In mammals, genome duplication is associated with cancer and spontaneous abortion of embryos. Nevertheless, stable polyploid species occur in both plants and animals. Understanding how natural selection enabled these species to overcome early challenges can provide important insights into the mechanisms by which core cellular functions can adapt to perturbations of the genomic environment. Arabidopsis arenosa includes stable tetraploid populations and is related to well-characterized diploids A. lyrata and A. thaliana. It thus provides a rare opportunity to leverage genomic tools to investigate the genetic basis of polyploid stabilization. We sequenced the genomes of twelve A. arenosa individuals and found signatures suggestive of recent and ongoing selective sweeps throughout the genome. Many of these are at genes implicated in genome maintenance functions, including chromosome cohesion and segregation, DNA repair, homologous recombination, transcriptional regulation, and chromatin structure. Numerous encoded proteins are predicted to interact with one another. For a critical meiosis gene, ASYNAPSIS1, we identified a non-synonymous mutation that is highly differentiated by cytotype, but present as a rare variant in diploid A. arenosa, indicating selection may have acted on standing variation already present in the diploid. Several genes we identified that are implicated in sister chromatid cohesion and segregation are homologous to genes identified in a yeast mutant screen as necessary for survival of polyploid cells, and also implicated in genome instability in human diseases including cancer. This points to commonalities across kingdoms and supports the hypothesis that selection has acted on genes controlling genome integrity in A. arenosa as an adaptive response to genome doubling.
Collapse
Affiliation(s)
- Jesse D. Hollister
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Brian J. Arnold
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Elisabeth Svedin
- Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, United States of America
- Molecular Evolutionary Genetics, Interdisciplinary Life Science Program, Purdue University, West Lafayette, Indiana, United States of America
| | - Katherine S. Xue
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Brian P. Dilkes
- Department of Horticulture and Landscape Architecture, Purdue University, West Lafayette, Indiana, United States of America
- Molecular Evolutionary Genetics, Interdisciplinary Life Science Program, Purdue University, West Lafayette, Indiana, United States of America
| | - Kirsten Bomblies
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
61
|
Junker A, Rohn H, Schreiber F. Visual analysis of transcriptome data in the context of anatomical structures and biological networks. FRONTIERS IN PLANT SCIENCE 2012; 3:252. [PMID: 23162564 PMCID: PMC3498740 DOI: 10.3389/fpls.2012.00252] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2012] [Accepted: 10/22/2012] [Indexed: 05/12/2023]
Abstract
The complexity and temporal as well as spatial resolution of transcriptome datasets is constantly increasing due to extensive technological developments. Here we present methods for advanced visualization and intuitive exploration of transcriptomics data as necessary prerequisites in order to facilitate the gain of biological knowledge. Color-coding of structural images based on the expression level enables a fast visual data analysis in the background of the examined biological system. The network-based exploration of these visualizations allows for comparative analysis of genes with specific transcript patterns and supports the extraction of functional relationships even from large datasets. In order to illustrate the presented methods, the tool HIVE was applied for visualization and exploration of database-retrieved expression data for master regulators of Arabidopsis thaliana flower and seed development in the context of corresponding tissue-specific regulatory networks.
Collapse
Affiliation(s)
- Astrid Junker
- Leibniz Institute of Plant Genetics and Crop Plant Research GaterslebenGatersleben, Germany
| | - Hendrik Rohn
- Leibniz Institute of Plant Genetics and Crop Plant Research GaterslebenGatersleben, Germany
| | - Falk Schreiber
- Leibniz Institute of Plant Genetics and Crop Plant Research GaterslebenGatersleben, Germany
- Institute of Computer Science, Martin Luther University Halle-WittenbergHalle, Germany
- Clayton School of Information Technology, Monash UniversityClayton, VIC, Australia
| |
Collapse
|
62
|
Bassel GW, Gaudinier A, Brady SM, Hennig L, Rhee SY, De Smet I. Systems analysis of plant functional, transcriptional, physical interaction, and metabolic networks. THE PLANT CELL 2012; 24:3859-75. [PMID: 23110892 PMCID: PMC3517224 DOI: 10.1105/tpc.112.100776] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2012] [Revised: 08/21/2012] [Accepted: 10/11/2012] [Indexed: 05/19/2023]
Abstract
Physiological responses, developmental programs, and cellular functions rely on complex networks of interactions at different levels and scales. Systems biology brings together high-throughput biochemical, genetic, and molecular approaches to generate omics data that can be analyzed and used in mathematical and computational models toward uncovering these networks on a global scale. Various approaches, including transcriptomics, proteomics, interactomics, and metabolomics, have been employed to obtain these data on the cellular, tissue, organ, and whole-plant level. We summarize progress on gene regulatory, cofunction, protein interaction, and metabolic networks. We also illustrate the main approaches that have been used to obtain these networks, with specific examples from Arabidopsis thaliana, and describe the pros and cons of each approach.
Collapse
Affiliation(s)
- George W. Bassel
- School of Biosciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
- Division of Plant and Crop Sciences, School of Biosciences and Centre for Plant Integrative Biology, University of Nottingham, Loughborough LE12 5RD, United Kingdom
| | - Allison Gaudinier
- Department of Plant Biology and Genome Center, University of California, Davis, California 95616
| | - Siobhan M. Brady
- Department of Plant Biology and Genome Center, University of California, Davis, California 95616
| | - Lars Hennig
- Department of Plant Biology and Forest Genetics, Uppsala BioCenter, Swedish University of Agricultural Sciences and Linnean Center for Plant Biology, SE-75007 Uppsala, Sweden
| | - Seung Y. Rhee
- Department of Plant Biology, Carnegie Institution for Science, Stanford, California 94305
| | - Ive De Smet
- Division of Plant and Crop Sciences, School of Biosciences and Centre for Plant Integrative Biology, University of Nottingham, Loughborough LE12 5RD, United Kingdom
| |
Collapse
|
63
|
Tackling drought stress: receptor-like kinases present new approaches. THE PLANT CELL 2012; 24:2262-78. [PMID: 22693282 DOI: 10.1105/tpc.112.096677] [Citation(s) in RCA: 125] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Global climate change and a growing population require tackling the reduction in arable land and improving biomass production and seed yield per area under varying conditions. One of these conditions is suboptimal water availability. Here, we review some of the classical approaches to dealing with plant response to drought stress and we evaluate how research on RECEPTOR-LIKE KINASES (RLKs) can contribute to improving plant performance under drought stress. RLKs are considered as key regulators of plant architecture and growth behavior, but they also function in defense and stress responses. The available literature and analyses of available transcript profiling data indeed suggest that RLKs can play an important role in optimizing plant responses to drought stress. In addition, RLK pathways are ideal targets for nontransgenic approaches, such as synthetic molecules, providing a novel strategy to manipulate their activity and supporting translational studies from model species, such as Arabidopsis thaliana, to economically useful crops.
Collapse
|
64
|
Wang C, Marshall A, Zhang D, Wilson ZA. ANAP: an integrated knowledge base for Arabidopsis protein interaction network analysis. PLANT PHYSIOLOGY 2012; 158:1523-33. [PMID: 22345505 PMCID: PMC3320167 DOI: 10.1104/pp.111.192203] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2011] [Accepted: 02/12/2012] [Indexed: 05/18/2023]
Abstract
Protein interactions are fundamental to the molecular processes occurring within an organism and can be utilized in network biology to help organize, simplify, and understand biological complexity. Currently, there are more than 10 publicly available Arabidopsis (Arabidopsis thaliana) protein interaction databases. However, there are limitations with these databases, including different types of interaction evidence, a lack of defined standards for protein identifiers, differing levels of information, and, critically, a lack of integration between them. In this paper, we present an interactive bioinformatics Web tool, ANAP (Arabidopsis Network Analysis Pipeline), which serves to effectively integrate the different data sets and maximize access to available data. ANAP has been developed for Arabidopsis protein interaction integration and network-based study to facilitate functional protein network analysis. ANAP integrates 11 Arabidopsis protein interaction databases, comprising 201,699 unique protein interaction pairs, 15,208 identifiers (including 11,931 The Arabidopsis Information Resource Arabidopsis Genome Initiative codes), 89 interaction detection methods, 73 species that interact with Arabidopsis, and 6,161 references. ANAP can be used as a knowledge base for constructing protein interaction networks based on user input and supports both direct and indirect interaction analysis. It has an intuitive graphical interface allowing easy network visualization and provides extensive detailed evidence for each interaction. In addition, ANAP displays the gene and protein annotation in the generated interactive network with links to The Arabidopsis Information Resource, the AtGenExpress Visualization Tool, the Arabidopsis 1,001 Genomes GBrowse, the Protein Knowledgebase, the Kyoto Encyclopedia of Genes and Genomes, and the Ensembl Genome Browser to significantly aid functional network analysis. The tool is available open access at http://gmdd.shgmo.org/Computational-Biology/ANAP.
Collapse
|
65
|
Abstract
The study of protein-protein interactions (PPIs) is essential to uncover unknown functions of proteins at the molecular level and to gain insight into complex cellular networks. Affinity purification and mass spectrometry (AP-MS), yeast two-hybrid, imaging approaches and numerous diverse databases have been developed as strategies to analyze PPIs. The past decade has seen an increase in the number of identified proteins with the development of MS and large-scale proteome analyses. Consequently, the false-positive protein identification rate has also increased. Therefore, the general consensus is to confirm PPI data using one or more independent approaches for an accurate evaluation. Furthermore, identifying minor PPIs is fundamental for understanding the functions of transient interactions and low-abundance proteins. Besides establishing PPI methodologies, we are now seeing the development of new methods and/or improvements in existing methods, which involve identifying minor proteins by MS, multidimensional protein identification technology or OFFGEL electrophoresis analyses, one-shot analysis with a long column or filter-aided sample preparation methods. These advanced techniques should allow thousands of proteins to be identified, whereas in-depth proteomic methods should permit the identification of transient binding or PPIs with weak affinity. Here, the current status of PPI analysis is reviewed and some advanced techniques are discussed briefly along with future challenges for plant proteomics.
Collapse
Affiliation(s)
- Yoichiro Fukao
- Plant Global Educational Project, Nara Institute of Science and Technology, Ikoma, Japan
| |
Collapse
|
66
|
Yang J, Osman K, Iqbal M, Stekel DJ, Luo Z, Armstrong SJ, Franklin FCH. Inferring the Brassica rapa Interactome Using Protein-Protein Interaction Data from Arabidopsis thaliana. FRONTIERS IN PLANT SCIENCE 2012; 3:297. [PMID: 23293649 PMCID: PMC3537189 DOI: 10.3389/fpls.2012.00297] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Accepted: 12/11/2012] [Indexed: 05/06/2023]
Abstract
Following successful completion of the Brassica rapa sequencing project, the next step is to investigate functions of individual genes/proteins. For Arabidopsis thaliana, large amounts of protein-protein interaction (PPI) data are available from the major PPI databases (DBs). It is known that Brassica crop species are closely related to A. thaliana. This provides an opportunity to infer the B. rapa interactome using PPI data available from A. thaliana. In this paper, we present an inferred B. rapa interactome that is based on the A. thaliana PPI data from two resources: (i) A. thaliana PPI data from three major DBs, BioGRID, IntAct, and TAIR. (ii) ortholog-based A. thaliana PPI predictions. Linking between B. rapa and A. thaliana was accomplished in three complementary ways: (i) ortholog predictions, (ii) identification of gene duplication based on synteny and collinearity, and (iii) BLAST sequence similarity search. A complementary approach was also applied, which used known/predicted domain-domain interaction data. Specifically, since the two species are closely related, we used PPI data from A. thaliana to predict interacting domains that might be conserved between the two species. The predicted interactome was investigated for the component that contains known A. thaliana meiotic proteins to demonstrate its usability.
Collapse
Affiliation(s)
- Jianhua Yang
- University of BirminghamBirmingham, UK
- *Correspondence: Jianhua Yang and F. Chris H. Franklin, University of Birmingham, B152TT Birmingham, UK. e-mail: ,
| | - Kim Osman
- University of BirminghamBirmingham, UK
| | | | | | - Zewei Luo
- University of BirminghamBirmingham, UK
| | | | - F. Chris H. Franklin
- University of BirminghamBirmingham, UK
- *Correspondence: Jianhua Yang and F. Chris H. Franklin, University of Birmingham, B152TT Birmingham, UK. e-mail: ,
| |
Collapse
|
67
|
Redestig H, Costa IG. Detection and interpretation of metabolite-transcript coresponses using combined profiling data. ACTA ACUST UNITED AC 2011; 27:i357-65. [PMID: 21685093 PMCID: PMC3117345 DOI: 10.1093/bioinformatics/btr231] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Motivation: Studying the interplay between gene expression and metabolite levels can yield important information on the physiology of stress responses and adaptation strategies. Performing transcriptomics and metabolomics in parallel during time-series experiments represents a systematic way to gain such information. Several combined profiling datasets have been added to the public domain and they form a valuable resource for hypothesis generating studies. Unfortunately, detecting coresponses between transcript levels and metabolite abundances is non-trivial: they cannot be assumed to overlap directly with underlying biochemical pathways and they may be subject to time delays and obscured by considerable noise. Results: Our aim was to predict pathway comemberships between metabolites and genes based on their coresponses to applied stress. We found that in the presence of strong noise and time-shifted responses, a hidden Markov model-based similarity outperforms the simpler Pearson correlation but performs comparably or worse in their absence. Therefore, we propose a supervised method that applies pathway information to summarize similarity statistics to a consensus statistic that is more informative than any of the single measures. Using four combined profiling datasets, we show that comembership between metabolites and genes can be predicted for numerous KEGG pathways; this opens opportunities for the detection of transcriptionally regulated pathways and novel metabolically related genes. Availability: A command-line software tool is available at http://www.cin.ufpe.br/~igcf/Metabolites. Contact:henning@psc.riken.jp; igcf@cin.ufpe.br Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
|
68
|
Severin AJ, Cannon SB, Graham MM, Grant D, Shoemaker RC. Changes in twelve homoeologous genomic regions in soybean following three rounds of polyploidy. THE PLANT CELL 2011; 23:3129-36. [PMID: 21917551 PMCID: PMC3203428 DOI: 10.1105/tpc.111.089573] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
With the advent of high-throughput sequencing, the availability of genomic sequence for comparative genomics is increasing exponentially. Numerous completed plant genome sequences enable characterization of patterns of the retention and evolution of genes within gene families due to multiple polyploidy events, gene loss and fractionation, and differential evolutionary pressures over time and across different gene families. In this report, we trace the changes that have occurred in 12 surviving homoeologous genomic regions from three rounds of polyploidy that contributed to the current Glycine max genome: a genome triplication before the origin of the rosids (~130 to 240 million years ago), a genome duplication early in the legumes (~58 million years ago), and a duplication in the Glycine lineage (~13 million years ago). Patterns of gene retention following the genome triplication event generally support predictions of the Gene Balance Hypothesis. Finally, we find that genes in networks with a high level of connectivity are more strongly conserved than those with low connectivity and that the enrichment of these highly connected genes in the 12 highly conserved homoeologous segments may in part explain their retention over more than 100 million years and repeated polyploidy events.
Collapse
Affiliation(s)
- Andrew J Severin
- Department of Agronomy, Iowa State University, Ames, Iowa 50011, USA
| | | | | | | | | |
Collapse
|
69
|
Remmerie N, De Vijlder T, Valkenborg D, Laukens K, Smets K, Vreeken J, Mertens I, Carpentier SC, Panis B, De Jaeger G, Blust R, Prinsen E, Witters E. Unraveling tobacco BY-2 protein complexes with BN PAGE/LC-MS/MS and clustering methods. J Proteomics 2011; 74:1201-17. [PMID: 21443973 DOI: 10.1016/j.jprot.2011.03.023] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2011] [Revised: 03/13/2011] [Accepted: 03/21/2011] [Indexed: 11/26/2022]
Abstract
To understand physiological processes, insight into protein complexes is very important. Through a combination of blue native gel electrophoresis and LC-MS/MS, we were able to isolate protein complexes and identify their potential subunits from Nicotiana tabacum cv. Bright Yellow-2. For this purpose, a bioanalytical approach was used that works without a priori knowledge of the interacting proteins. Different clustering methods (e.g., k-means and hierarchical clustering) and a biclustering approach were evaluated according to their ability to group proteins by their migration profile and to correlate the proteins to a specific complex. The biclustering approach was identified as a very powerful tool for the exploration of protein complexes of whole cell lysates since it allows for the promiscuous nature of proteins. Furthermore, it searches for associations between proteins that co-occur frequently throughout the BN gel, which increases the confidence of the putative associations between co-migrating proteins. The statistical significance and biological relevance of the profile clusters were verified using functional gene ontology annotation. The proof of concept for identifying protein complexes by our BN PAGE/LC-MS/MS approach is provided through the analysis of known protein complexes. Both well characterized long-lived protein complexes as well as potential temporary sequential multi-enzyme complexes were characterized.
Collapse
Affiliation(s)
- Noor Remmerie
- Center for Proteomics (CFP), Groenenborgerlaan 171, B-2020 Antwerp, Belgium
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
70
|
Reddy ASN, Ben-Hur A, Day IS. Experimental and computational approaches for the study of calmodulin interactions. PHYTOCHEMISTRY 2011; 72:1007-19. [PMID: 21338992 DOI: 10.1016/j.phytochem.2010.12.022] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2010] [Revised: 11/10/2010] [Accepted: 12/28/2010] [Indexed: 05/22/2023]
Abstract
Ca(2+), a universal messenger in eukaryotes, plays a major role in signaling pathways that control many growth and developmental processes in plants as well as their responses to various biotic and abiotic stresses. Cellular changes in Ca(2+) in response to diverse signals are recognized by protein sensors that either have their activity modulated or that interact with other proteins and modulate their activity. Calmodulins (CaMs) and CaM-like proteins (CMLs) are Ca(2+) sensors that have no enzymatic activity of their own but upon binding Ca(2+) interact and modulate the activity of other proteins involved in a large number of plant processes. Protein-protein interactions play a key role in Ca(2+)/CaM-mediated in signaling pathways. In this review, using CaM as an example, we discuss various experimental approaches and computational tools to identify protein-protein interactions. During the last two decades hundreds of CaM-binding proteins in plants have been identified using a variety of approaches ranging from simple screening of expression libraries with labeled CaM to high-throughput screens using protein chips. However, the high-throughput methods have not been applied to the entire proteome of any plant system. Nevertheless, the data provided by these screens allows the development of computational tools to predict CaM-interacting proteins. Using all known binding sites of CaM, we developed a computational method that predicted over 700 high confidence CaM interactors in the Arabidopsis proteome. Most (>600) of these are not known to bind calmodulin, suggesting that there are likely many more CaM targets than previously known. Functional analyses of some of the experimentally identified Ca(2+) sensor target proteins have uncovered their precise role in Ca(2+)-mediated processes. Further studies on identifying novel targets of CaM and CMLs and generating their interaction network - "calcium sensor interactome" - will help us in understanding how Ca(2+) regulates a myriad of cellular and physiological processes.
Collapse
Affiliation(s)
- A S N Reddy
- Department of Biology, Program in Molecular Plant Biology, Program in Cell and Molecular Biology, Colorado State University, Fort Collins, CO 80523, USA.
| | | | | |
Collapse
|
71
|
Lysenko A, Defoin-Platel M, Hassani-Pak K, Taubert J, Hodgman C, Rawlings CJ, Saqi M. Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis. BMC Bioinformatics 2011; 12:203. [PMID: 21612636 PMCID: PMC3118170 DOI: 10.1186/1471-2105-12-203] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2010] [Accepted: 05/25/2011] [Indexed: 12/18/2022] Open
Abstract
Background Combining multiple evidence-types from different information sources has the potential to reveal new relationships in biological systems. The integrated information can be represented as a relationship network, and clustering the network can suggest possible functional modules. The value of such modules for gaining insight into the underlying biological processes depends on their functional coherence. The challenges that we wish to address are to define and quantify the functional coherence of modules in relationship networks, so that they can be used to infer function of as yet unannotated proteins, to discover previously unknown roles of proteins in diseases as well as for better understanding of the regulation and interrelationship between different elements of complex biological systems. Results We have defined the functional coherence of modules with respect to the Gene Ontology (GO) by considering two complementary aspects: (i) the fragmentation of the GO functional categories into the different modules and (ii) the most representative functions of the modules. We have proposed a set of metrics to evaluate these two aspects and demonstrated their utility in Arabidopsis thaliana. We selected 2355 proteins for which experimentally established protein-protein interaction (PPI) data were available. From these we have constructed five relationship networks, four based on single types of data: PPI, co-expression, co-occurrence of protein names in scientific literature abstracts and sequence similarity and a fifth one combining these four evidence types. The ability of these networks to suggest biologically meaningful grouping of proteins was explored by applying Markov clustering and then by measuring the functional coherence of the clusters. Conclusions Relationship networks integrating multiple evidence-types are biologically informative and allow more proteins to be assigned to a putative functional module. Using additional evidence types concentrates the functional annotations in a smaller number of modules without unduly compromising their consistency. These results indicate that integration of more data sources improves the ability to uncover functional association between proteins, both by allowing more proteins to be linked and producing a network where modular structure more closely reflects the hierarchy in the gene ontology.
Collapse
Affiliation(s)
- Artem Lysenko
- Centre for Mathematical and Computational Biology, Rothamsted Research, Harpenden, Herts, AL5, 2JQ, UK.
| | | | | | | | | | | | | |
Collapse
|
72
|
Gu H, Zhu P, Jiao Y, Meng Y, Chen M. PRIN: a predicted rice interactome network. BMC Bioinformatics 2011; 12:161. [PMID: 21575196 PMCID: PMC3118165 DOI: 10.1186/1471-2105-12-161] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2010] [Accepted: 05/16/2011] [Indexed: 12/22/2022] Open
Abstract
Background Protein-protein interactions play a fundamental role in elucidating the molecular mechanisms of biomolecular function, signal transductions and metabolic pathways of living organisms. Although high-throughput technologies such as yeast two-hybrid system and affinity purification followed by mass spectrometry are widely used in model organisms, the progress of protein-protein interactions detection in plants is rather slow. With this motivation, our work presents a computational approach to predict protein-protein interactions in Oryza sativa. Results To better understand the interactions of proteins in Oryza sativa, we have developed PRIN, a Predicted Rice Interactome Network. Protein-protein interaction data of PRIN are based on the interologs of six model organisms where large-scale protein-protein interaction experiments have been applied: yeast (Saccharomyces cerevisiae), worm (Caenorhabditis elegans), fruit fly (Drosophila melanogaster), human (Homo sapiens), Escherichia coli K12 and Arabidopsis thaliana. With certain quality controls, altogether we obtained 76,585 non-redundant rice protein interaction pairs among 5,049 rice proteins. Further analysis showed that the topology properties of predicted rice protein interaction network are more similar to yeast than to the other 5 organisms. This may not be surprising as the interologs based on yeast contribute nearly 74% of total interactions. In addition, GO annotation, subcellular localization information and gene expression data are also mapped to our network for validation. Finally, a user-friendly web interface was developed to offer convenient database search and network visualization. Conclusions PRIN is the first well annotated protein interaction database for the important model plant Oryza sativa. It has greatly extended the current available protein-protein interaction data of rice with a computational approach, which will certainly provide further insights into rice functional genomics and systems biology. PRIN is available online at http://bis.zju.edu.cn/prin/.
Collapse
Affiliation(s)
- Haibin Gu
- Department of Bioinformatics, State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | | | | | | | | |
Collapse
|
73
|
Lin M, Zhou X, Shen X, Mao C, Chen X. The predicted Arabidopsis interactome resource and network topology-based systems biology analyses. THE PLANT CELL 2011; 23:911-22. [PMID: 21441435 PMCID: PMC3082272 DOI: 10.1105/tpc.110.082529] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2010] [Revised: 12/30/2010] [Accepted: 03/10/2011] [Indexed: 05/17/2023]
Abstract
Predicted interactions are a valuable complement to experimentally reported interactions in molecular mechanism studies, particularly for higher organisms, for which reported experimental interactions represent only a small fraction of their total interactomes. With careful engineering consideration of the lessons from previous efforts, the predicted arabidopsis interactome resource (PAIR; ) presents 149,900 potential molecular interactions, which are expected to cover approximately 24% of the entire interactome with approximately 40% precision. This study demonstrates that, although PAIR still has limited coverage, it is rich enough to capture many significant functional linkages within and between higher-order biological systems, such as pathways and biological processes. These inferred interactions can nicely power several network topology-based systems biology analyses, such as gene set linkage analysis, protein function prediction, and identification of regulatory genes demonstrating insignificant expression changes. The drastically expanded molecular network in PAIR has considerably improved the capability of these analyses to integrate existing knowledge and suggest novel insights into the function and coordination of genes and gene networks.
Collapse
Affiliation(s)
- Mingzhi Lin
- State Key Laboratory of Plant Physiology and Biochemistry, Zhejiang University, Hangzhou 310058, People’s Republic of China
- Department of Bioinformatics, Zhejiang University, Hangzhou 310058, People’s Republic of China
| | - Xi Zhou
- Department of Bioinformatics, Zhejiang University, Hangzhou 310058, People’s Republic of China
| | - Xueling Shen
- Institute of Biochemistry, Zhejiang University, Hangzhou 310058, People’s Republic of China
| | - Chuanzao Mao
- State Key Laboratory of Plant Physiology and Biochemistry, Zhejiang University, Hangzhou 310058, People’s Republic of China
| | - Xin Chen
- State Key Laboratory of Plant Physiology and Biochemistry, Zhejiang University, Hangzhou 310058, People’s Republic of China
- Department of Bioinformatics, Zhejiang University, Hangzhou 310058, People’s Republic of China
- Institute of Biochemistry, Zhejiang University, Hangzhou 310058, People’s Republic of China
| |
Collapse
|
74
|
Demartini DR, Carlini CR, Thelen JJ. Proteome databases and other online resources for chloroplast research in Arabidopsis. Methods Mol Biol 2011; 775:93-115. [PMID: 21863440 DOI: 10.1007/978-1-61779-237-3_6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Proteomics aimed at addressing sub cellular fractions, such as chloroplasts, are a complex challenge. In the past few years, several studies in different laboratories have identified and, more recently, quantified, thousands of proteins within whole chloroplasts or chloroplast fractions. A considerable number of these studies are available for querying, using online resources, such as databases containing the proteins identified, encoding genes, acquired spectra, and phosphopeptides. The main purpose of this review is to identity and highlight useful features of these online resourses, mainly focused in proteomics databases related to chloroplast research in Arabidopsis thaliana. Several web sites were consulted. Among them, 11 were selected and discussed herein. The databases were classified into Plastid Databases, General Organelle Proteome Databases, and General Arabidopsis Proteome Databases. Special care was taken to present information regarding protein identification, protein quantification, and data integration. A selected list of online resources is presented in two tables. The databases analyzed are a useful source of information for researchers in the plastid organelle and plant proteomics fields.
Collapse
Affiliation(s)
- Diogo Ribeiro Demartini
- Department of Biophysics, Center of Biotechnology, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil.
| | | | | |
Collapse
|
75
|
Dietz KJ, Jacquot JP, Harris G. Hubs and bottlenecks in plant molecular signalling networks. THE NEW PHYTOLOGIST 2010; 188:919-38. [PMID: 20958306 DOI: 10.1111/j.1469-8137.2010.03502.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Conditional control of plant cell function and development relies on appropriate signal perception, signal integration and processing. The development of high throughput technologies such as proteomics and interactomics has enabled the identification of protein interaction networks that mediate signal processing from inputs to appropriate outputs. Such networks can be depicted in graphical representations using nodes and edges allowing for the immediate visualization and analysis of the network's topology. Hubs are network elements characterized by many edges (often degree grade k ≥ 5) which confer a degree of topological importance to them. The review introduces the concept of networks, hubs and bottlenecks and describes four examples from plant science in more detail, namely hubs in the redox regulatory network of the chloroplast with ferredoxin, thioredoxin and peroxiredoxin, in mitogen activated protein (MAP) kinase signal processing, in photomorphogenesis with the COP9 signalosome, COP1 and CDD, and monomeric GTPase function. Some guidance is provided to appropriate internet resources, web repositories, databases and their use. Plant networks can be generated from existing public databases and this type of analysis is valuable in support of existing hypotheses, or to allow for the generation of new concepts or ideas. However, intensive manual curating of in silico networks is still always necessary.
Collapse
Affiliation(s)
- Karl-Josef Dietz
- Plant Biochemistry and Physiology, Bielefeld University, D-33501 Bielefeld, Germany.
| | | | | |
Collapse
|
76
|
Abstract
The predicted Arabidopsis interactome resource (PAIR, http://www.cls.zju.edu.cn/pair/), comprised of 5990 experimentally reported molecular interactions in Arabidopsis thaliana together with 145 494 predicted interactions, is currently the most comprehensive data set of the Arabidopsis interactome with high reliability. PAIR predicts interactions by a fine-tuned support vector machine model that integrates indirect evidences for interaction, such as gene co-expressions, domain interactions, shared GO annotations, co-localizations, phylogenetic profile similarities and homologous interactions in other organisms (interologs). These predictions were expected to cover 24% of the entire Arabidopsis interactome, and their reliability was estimated to be 44%. Two independent example data sets were used to rigorously validate the prediction accuracy. PAIR features a user-friendly query interface, providing rich annotation on the relationships between two proteins. A graphical interaction network browser has also been integrated into the PAIR web interface to facilitate mining of specific pathways.
Collapse
Affiliation(s)
- Mingzhi Lin
- Department of Bioinformatics and Institute of Biochemistry, Zhejiang University, Hangzhou, PR China
| | | | | |
Collapse
|
77
|
He F, Zhou Y, Zhang Z. Deciphering the Arabidopsis floral transition process by integrating a protein-protein interaction network and gene expression data. PLANT PHYSIOLOGY 2010; 153:1492-505. [PMID: 20530214 PMCID: PMC2923896 DOI: 10.1104/pp.110.153650] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2010] [Accepted: 06/03/2010] [Indexed: 05/18/2023]
Abstract
In a plant, the progression from vegetative growth to reproductive growth is called the floral transition. Over the past several decades, the floral transition has been shown to be determined not by a single gene but by a complicated gene network. This important biological process, however, has not been investigated at a genome-wide network level. We collected Arabidopsis (Arabidopsis thaliana) protein-protein interaction data from several public databases and compiled them into a genome-wide Arabidopsis interactome. Then, we integrated gene expression profiles during the Arabidopsis floral transition process into the established protein-protein interaction network to identify two types of anticorrelated modules associated with vegetative and reproductive growth. Generally, the vegetative modules are conserved in plants, while the reproductive modules are more specific to advanced plants. The existence of floral transition switches demonstrates that vegetative and reproductive processes might be coordinated by the interacting interface of these modules. Our work also provides many candidates for mediating the interactions between these modules, which may play important roles during the Arabidopsis vegetative/reproductive switch.
Collapse
|
78
|
Brandão MM, Silva-Filho MC. Evolutionary history of Arabidopsis thaliana aminoacyl-tRNA synthetase dual-targeted proteins. Mol Biol Evol 2010; 28:79-85. [PMID: 20624849 DOI: 10.1093/molbev/msq176] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Aminoacyl-transfer RNA (tRNA) synthetases (aaRS) are key players in translation and act early in protein synthesis by mediating the attachment of amino acids to their cognate tRNA molecules. In plants, protein synthesis may occur in three subcellular compartments (cytosol, mitochondria, and chloroplasts), which requires multiple versions of the protein to be correctly delivered to its proper destination. The organellar aaRS are nuclear encoded and equipped with targeting information at the N-terminal sequence, which enables them to be specifically translocated to their final location. Most of the aaRS families present organellar proteins that are dual targeted to mitochondria and chloroplasts. Here, we examine the dual targeting behavior of aaRS from an evolutionary perspective. Our results show that Arabidopsis thaliana aaRS sequences are a result of a horizontal gene transfer event from bacteria. However, there is no evident bias indicating one single ancestor (Cyanobacteria or Proteobacteria). The dual-targeted aaRS phylogenetic relationship was characterized into two different categories (paralogs and homologs) depending on the state recovered for both dual-targeted and cytosolic proteins. Taken together, our results suggest that the dual-targeted condition is a gain-of-function derived from gene duplication. Selection may have maintained the original function in at least one of the copies as the additional copies diverged.
Collapse
Affiliation(s)
- Marcelo M Brandão
- Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Piracicaba, SP, Brazil
| | | |
Collapse
|