1
|
Abstract
Precision medicine relies on validated biomarkers with which to better classify patients by their probable disease risk, prognosis and/or response to treatment. Although affordable 'omics'-based technology has enabled faster identification of putative biomarkers, the validation of biomarkers is still stymied by low statistical power and poor reproducibility of results. This Review summarizes the successes and challenges of using different types of molecule as biomarkers, using lung cancer as a key illustrative example. Efforts at the national level of several countries to tie molecular measurement of samples to patient data via electronic medical records are the future of precision medicine research.
Collapse
Affiliation(s)
- Ashley J Vargas
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Room 3068A, MSC 425, 837 Convent Drive, Bethesda, Maryland 20892-4258, USA
- Division of Cancer Prevention, National Cancer Institute, Rockville, Maryland 20850, USA
| | - Curtis C Harris
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Room 3068A, MSC 425, 837 Convent Drive, Bethesda, Maryland 20892-4258, USA
| |
Collapse
|
2
|
Implying Analytic Measures for Unravelling Rheumatoid Arthritis Significant Proteins Through Drug–Target Interaction. Interdiscip Sci 2015; 8:122-131. [DOI: 10.1007/s12539-015-0108-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/28/2014] [Accepted: 01/03/2015] [Indexed: 12/22/2022]
|
3
|
Singh S, Vennila JJ, Snijesh VP, George G, Sunny C. Implying analytic measures for unraveling rheumatoid arthritis significant proteins through drug target interaction. Interdiscip Sci 2015. [PMID: 25663118 DOI: 10.1007/s12539-014-0245-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/28/2014] [Accepted: 01/03/2015] [Indexed: 11/25/2022]
Abstract
Rheumatoid arthritis (RA) is a systemic auto-immune and inflammatory disease that mainly alters the synovial joints and ultimately leads to their destruction. The involvement of the immune system and its related cells is a basic trademark of auto-immune associated diseases. The present work focuses on network analysis and its functional characterization to predict novel targets for RA. The interactive model called as Rheumatoid Arthritis Drug-Target-Protein (RA-DTP) is built of 1727 nodes and 7954 edges followed the power law distribution. RADTP comprised of 20 islands, 55 modules and 123 sub modules. Good interactome coverage of target-protein was detected in Island 2 (Q-Score 0.875) which includes 673 molecules with 20 modules and 68 sub modules. The biological landscape of these modules was examined based on the participation molecules in specific cellular localization, molecular function and biological pathway with favourable p value. Functional characterization and pathway analysis through KEGG, Biocarta and Reactome also showed their involvement in relation to the immune system and inflammatory processes and biological processes such as cell signalling and communication, glucosamine metabolic process, Renin Angiotensin system, BCR signals, Galactose metabolism, MAPK signalling, Complement and Coagulation system and NGF signalling pathways. Traffic values and centrality parameters were applied as the selection criteria for identifying potential targets from the important hubs which resulted into FOS, KNG1, PTGDS, HSP90AA1, REN, POMC, FCER1G, IL6, ICAM1, SGK1, NOS3 and PLA2G4A. This approach provides an insight to experimental validation of these associations of potential targets for clinical value to find their effect on animal studies.
Collapse
Affiliation(s)
- Sachidanand Singh
- Department of Bioinformatics, School of Biotechnology and Health Sciences, Karunya University, Coimbatore, India,
| | | | | | | | | |
Collapse
|
4
|
Rito T, Deane CM, Reinert G. The importance of age and high degree, in protein-protein interaction networks. J Comput Biol 2012; 19:785-95. [PMID: 22697248 DOI: 10.1089/cmb.2012.0054] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Here we present an in-depth analysis of the protein age patterns found in the edge and triangle subgraphs of the yeast protein-protein interaction network (PIN). We assess their statistical significance both according to what would be expected by chance given the node frequencies found in the yeast PIN, and also, for the case of triangles, given the age frequencies observed in the currently available pairwise data. We find that pairwise interactions between Old proteins are over-represented even when controlling for high degree, and triangle interactions between Old proteins are over-represented even when controlling for pairwise interaction frequencies. There is evidence for negative selection of interactions between Middle-aged and Old proteins within triangles, despite pairwise Middle-Old interactions being common. Most triangles consist solely of vertices with high degree. Our findings point towards an architecture of the yeast PIN that is highly heterogeneous, having connected clumps which contain a large number of interacting Old proteins along with selective age-dependent interaction patterns. Supplementary Material is available online (www.liebertonline.com/cmb).
Collapse
Affiliation(s)
- Tiago Rito
- Department of Statistics, University of Oxford, Oxford United Kingdom.
| | | | | |
Collapse
|
5
|
Vinogradov AE. Large scale of human duplicate genes divergence. J Mol Evol 2012; 75:25-33. [PMID: 22922908 DOI: 10.1007/s00239-012-9516-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 08/03/2012] [Indexed: 01/25/2023]
Abstract
Proteome complexity increases in the evolution mostly by means of gene duplication followed by divergence. In this genome-scale study of human genome I show that density distribution of duplicate gene pairs along the axis of protein divergence between pair members forms two main peaks with a small peak and plateau before the first main peak. This picture indicates the existence of three evolutionary stages of duplicate gene evolution. The analysis of various functional parameters (gene expression level and breadth, transcription factor targets, protein interaction networks) suggests that subfunctionalization (partition of function) is a predominant mode of divergence in the first main peak, whereas neofunctionalization (acquiring of novel functions) prevails in the second main peak. The young duplicate pairs show a much higher expression level compared with singleton genes and more diverged duplicates, which indicates that requirement for high gene dosage is important for retention of duplicates just after the duplication event. Thus, a prevailing route of duplicate evolution seems to be the high gene dosage-subfunctionalization-neofunctionalization. This adaptationist model suggests that an organism is evolving in the direction of its most intensively used functions.
Collapse
|
6
|
Fung DCY, Li SS, Goel A, Hong SH, Wilkins MR. Visualization of the interactome: What are we looking at? Proteomics 2012; 12:1669-86. [DOI: 10.1002/pmic.201100454] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- David C. Y. Fung
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Simone S. Li
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Apurv Goel
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| | - Seok-Hee Hong
- School of Information Technologies; Faculty of Engineering and Information Technologies; The University of Sydney; New South Wales Australia
| | - Marc R. Wilkins
- New South Wales Systems Biology Initiative; and School of Biotechnology and Biomolecular Sciences; The University of New South Wales; New South Wales Australia
| |
Collapse
|
7
|
Mihalik Á, Csermely P. Heat shock partially dissociates the overlapping modules of the yeast protein-protein interaction network: a systems level model of adaptation. PLoS Comput Biol 2011; 7:e1002187. [PMID: 22022244 PMCID: PMC3192799 DOI: 10.1371/journal.pcbi.1002187] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2011] [Accepted: 07/24/2011] [Indexed: 11/18/2022] Open
Abstract
Network analysis became a powerful tool giving new insights to the understanding of cellular behavior. Heat shock, the archetype of stress responses, is a well-characterized and simple model of cellular dynamics. S. cerevisiae is an appropriate model organism, since both its protein-protein interaction network (interactome) and stress response at the gene expression level have been well characterized. However, the analysis of the reorganization of the yeast interactome during stress has not been investigated yet. We calculated the changes of the interaction-weights of the yeast interactome from the changes of mRNA expression levels upon heat shock. The major finding of our study is that heat shock induced a significant decrease in both the overlaps and connections of yeast interactome modules. In agreement with this the weighted diameter of the yeast interactome had a 4.9-fold increase in heat shock. Several key proteins of the heat shock response became centers of heat shock-induced local communities, as well as bridges providing a residual connection of modules after heat shock. The observed changes resemble to a 'stratus-cumulus' type transition of the interactome structure, since the unstressed yeast interactome had a globally connected organization, similar to that of stratus clouds, whereas the heat shocked interactome had a multifocal organization, similar to that of cumulus clouds. Our results showed that heat shock induces a partial disintegration of the global organization of the yeast interactome. This change may be rather general occurring in many types of stresses. Moreover, other complex systems, such as single proteins, social networks and ecosystems may also decrease their inter-modular links, thus develop more compact modules, and display a partial disintegration of their global structure in the initial phase of crisis. Thus, our work may provide a model of a general, system-level adaptation mechanism to environmental changes.
Collapse
Affiliation(s)
- Ágoston Mihalik
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
| | - Peter Csermely
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
- * E-mail:
| |
Collapse
|
8
|
Tasseff R, Nayak S, Song SO, Yen A, Varner JD. Modeling and analysis of retinoic acid induced differentiation of uncommitted precursor cells. Integr Biol (Camb) 2011; 3:578-91. [PMID: 21437295 PMCID: PMC3685823 DOI: 10.1039/c0ib00141d] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Manipulation of differentiation programs has therapeutic potential in a spectrum of human cancers and neurodegenerative disorders. In this study, we integrated computational and experimental methods to unravel the response of a lineage uncommitted precursor cell-line, HL-60, to Retinoic Acid (RA). HL-60 is a human myeloblastic leukemia cell-line used extensively to study human differentiation programs. Initially, we focused on the role of the BLR1 receptor in RA-induced differentiation and G1/0-arrest in HL-60. BLR1, a putative G protein-coupled receptor expressed following RA exposure, is required for RA-induced cell-cycle arrest and differentiation and causes persistent MAPK signaling. A mathematical model of RA-induced cell-cycle arrest and differentiation was formulated and tested against BLR1 wild-type (wt) knock-out and knock-in HL-60 cell-lines with and without RA. The current model described the dynamics of 729 proteins and protein complexes interconnected by 1356 interactions. An ensemble strategy was used to compensate for uncertain model parameters. The ensemble of HL-60 models recapitulated the positive feedback between BLR1 and MAPK signaling. The ensemble of models also correctly predicted Rb and p47phox regulation and the correlation between p21-CDK4-cyclin D formation and G1/0-arrest following exposure to RA. Finally, we investigated the robustness of the HL-60 network architecture to structural perturbations and generated experimentally testable hypotheses for future study. Taken together, the model presented here was a first step toward a systematic framework for analysis of programmed differentiation. These studies also demonstrated that mechanistic network modeling can help prioritize experimental directions by generating falsifiable hypotheses despite uncertainty.
Collapse
Affiliation(s)
- Ryan Tasseff
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca NY, 14853
| | - Satyaprakash Nayak
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca NY, 14853
| | - Sang Ok Song
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca NY, 14853
| | - Andrew Yen
- Department of Biomedical Sciences, Cornell University, Ithaca NY, 14853
| | - Jeffrey D. Varner
- Cornell University, 244 Olin Hall, Ithaca NY, 14853. Fax: 607 255 9166; Tel: 607 255 4258
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca NY, 14853
| |
Collapse
|
9
|
Azuaje F, Devaux Y, Wagner DR. Coordinated modular functionality and prognostic potential of a heart failure biomarker-driven interaction network. BMC SYSTEMS BIOLOGY 2010; 4:60. [PMID: 20462429 PMCID: PMC2890499 DOI: 10.1186/1752-0509-4-60] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2009] [Accepted: 05/12/2010] [Indexed: 01/28/2023]
Abstract
Background The identification of potentially relevant biomarkers and a deeper understanding of molecular mechanisms related to heart failure (HF) development can be enhanced by the implementation of biological network-based analyses. To support these efforts, here we report a global network of protein-protein interactions (PPIs) relevant to HF, which was characterized through integrative bioinformatic analyses of multiple sources of "omic" information. Results We found that the structural and functional architecture of this PPI network is highly modular. These network modules can be assigned to specialized processes, specific cellular regions and their functional roles tend to partially overlap. Our results suggest that HF biomarkers may be defined as key coordinators of intra- and inter-module communication. Putative biomarkers can, in general, be distinguished as "information traffic" mediators within this network. The top high traffic proteins are encoded by genes that are not highly differentially expressed across HF and non-HF patients. Nevertheless, we present evidence that the integration of expression patterns from high traffic genes may support accurate prediction of HF. We quantitatively demonstrate that intra- and inter-module functional activity may be controlled by a family of transcription factors known to be associated with the prevention of hypertrophy. Conclusion The systems-driven analysis reported here provides the basis for the identification of potentially novel biomarkers and understanding HF-related mechanisms in a more comprehensive and integrated way.
Collapse
Affiliation(s)
- Francisco Azuaje
- Laboratory of Cardiovascular Research, Centre de Recherche Public-Santé, L-1150 Luxembourg.
| | | | | |
Collapse
|
10
|
Gonçalves JP, Grãos M, Valente AX. POLAR MAPPER: a computational tool for integrated visualization of protein interaction networks and mRNA expression data. J R Soc Interface 2009; 6:881-96. [PMID: 19091689 PMCID: PMC2684442 DOI: 10.1098/rsif.2008.0407] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2008] [Accepted: 11/04/2008] [Indexed: 11/25/2022] Open
Abstract
Polar Mapper is a computational application for exposing the architecture of protein interaction networks. It facilitates the system-level analysis of mRNA expression data in the context of the underlying protein interaction network. Preliminary analysis of a human protein interaction network and comparison of the yeast oxidative stress and heat shock gene expression responses are addressed as case studies.
Collapse
Affiliation(s)
- Joana P. Gonçalves
- Unidade de Sistemas Biológicos, Biocant, 3060-197 Cantanhede, Portugal
- KDBIO Group, INESC-ID, 1000-029 Lisbon, Portugal
- IST, Technical University of Lisbon, 1169-047 Lisbon, Portugal
| | - Mário Grãos
- Unidade de Biologia Celular, Biocant, 3060-197 Cantanhede, Portugal
| | - André X.C.N. Valente
- Unidade de Sistemas Biológicos, Biocant, 3060-197 Cantanhede, Portugal
- Centro de Neurociências e Biologia Celular, Universidade de Coimbra, 3004-517 Coimbra, Portugal
| |
Collapse
|
11
|
Vinogradov AE, Anatskaya OV. Loss of protein interactions and regulatory divergence in yeast whole-genome duplicates. Genomics 2009; 93:534-42. [PMID: 19272438 DOI: 10.1016/j.ygeno.2009.02.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2008] [Revised: 02/26/2009] [Accepted: 02/27/2009] [Indexed: 11/19/2022]
Abstract
Whole-genome duplications are important for the growth of genome complexity. We investigated various factors involved in the evolution of yeast whole-genome duplicates (ohnologs) making emphasis on the analysis of protein interactions. We found that ohnologs have a lower number of protein interactions compared with small-scale duplicates and singletons (by about -40%). The loss of interactions was proportional to their initial number and independent of ohnolog position in the protein interaction network. A faster evolving member of an ohnolog pair has a lower number of interactions compared to its counterpart. The Gene Ontology mapping of non-overlapping and overlapping interactants of paired ohnologs reveals a sharp asymmetry in GO terms related to regulation. The fraction of these terms is much higher in non-overlapping interactants (compared to overlapping interactants and total dataset). Network clustering coefficient is lower in ohnologs, yet they show an increased density of protein interactions restricted within the whole ohnologs set. These facts suggest that subfunctionalization (or subneofunctionalization) reflected in the loss of protein interactions was a prevailing process in the divergence of ohnologs, which distinguishes them from small-scale duplicates. The loss of protein interactions was associated with the regulatory divergence between the members of an ohnolog pair. A small-scale modularity (reflected in clustering coefficient) probably was not important for ohnologs retention, yet a larger-scale modularity could be involved in their evolution.
Collapse
Affiliation(s)
- Alexander E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Ave. 4, St. Petersburg 194064, Russia.
| | | |
Collapse
|
12
|
Topological properties of protein interaction networks from a structural perspective. Biochem Soc Trans 2009; 36:1398-403. [PMID: 19021563 DOI: 10.1042/bst0361398] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Protein-protein interactions are usually shown as interaction networks (graphs), where the proteins are represented as nodes and the connections between the interacting proteins are shown as edges. The graph abstraction of protein interactions is crucial for understanding the global behaviour of the network. In this mini review, we summarize basic graph topological properties, such as node degree and betweenness, and their relation to essentiality and modularity of protein interactions. The classification of hub proteins into date and party hubs with distinct properties has significant implications for relating topological properties to the behaviour of the network. We emphasize that the integration of protein interface structure into interaction graph models provides a better explanation of hub proteins, and strengthens the relationship between the role of the hubs in the cell and their topological properties.
Collapse
|
13
|
Functional organization of the yeast proteome by a yeast interactome map. Proc Natl Acad Sci U S A 2009; 106:1490-5. [PMID: 19164585 DOI: 10.1073/pnas.0808624106] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It is hoped that comprehensive mapping of protein physical interactions will facilitate insights regarding both fundamental cell biology processes and the pathology of diseases. To fulfill this hope, good solutions to 2 issues will be essential: (i) how to obtain reliable interaction data in a high-throughput setting and (ii) how to structure interaction data in a meaningful form, amenable to and valuable for further biological research. In this article, we structure an interactome in terms of predicted permanent protein complexes and predicted transient, nongeneric interactions between these complexes. The interactome is generated by means of an associated computational algorithm, from raw high-throughput affinity purification/mass spectrometric interaction data. We apply our technique to the construction of an interactome for Saccharomyces cerevisiae, showing that it yields reliability typical of low-throughput experiments from high-throughput data. We discuss biological insights raised by this interactome including, via homology, a few related to human disease.
Collapse
|
14
|
Abstract
Prioritization, or ranking, of gene lists is becoming increasingly important for analyzing data generated from high-throughput assays like expression profiling and RNAi-based screening. This is especially the case when specific genes in a list need to be further validated using low-throughput experiments. In addition to gene set overlap enrichment methods, a complementary approach is to examine molecular interaction networks. These can provide putative functional insights based on gene connectivity, especially when many genes contain little or no annotation. For bench and computational biologists alike, using networks requires an informed selection of interaction data for network construction and strategies for managing network complexity. Moreover, graph theory and social network analysis methods can be used to isolate critical subnetworks and quantify network properties. Here, I discuss the basic components of networks, implications of their structure for functional interpretation, and common metrics used for prioritization. Although this is still an ongoing area of research, networks are providing new ways for gauging pathway impact in large-scale data sets.
Collapse
|
15
|
Kim WK, Marcotte EM. Age-dependent evolution of the yeast protein interaction network suggests a limited role of gene duplication and divergence. PLoS Comput Biol 2008; 4:e1000232. [PMID: 19043579 PMCID: PMC2583957 DOI: 10.1371/journal.pcbi.1000232] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 10/17/2008] [Indexed: 11/18/2022] Open
Abstract
Proteins interact in complex protein–protein interaction (PPI) networks whose topological properties—such as scale-free topology, hierarchical modularity, and dissortativity—have suggested models of network evolution. Currently preferred models invoke preferential attachment or gene duplication and divergence to produce networks whose topology matches that observed for real PPIs, thus supporting these as likely models for network evolution. Here, we show that the interaction density and homodimeric frequency are highly protein age–dependent in real PPI networks in a manner which does not agree with these canonical models. In light of these results, we propose an alternative stochastic model, which adds each protein sequentially to a growing network in a manner analogous to protein crystal growth (CG) in solution. The key ideas are (1) interaction probability increases with availability of unoccupied interaction surface, thus following an anti-preferential attachment rule, (2) as a network grows, highly connected sub-networks emerge into protein modules or complexes, and (3) once a new protein is committed to a module, further connections tend to be localized within that module. The CG model produces PPI networks consistent in both topology and age distributions with real PPI networks and is well supported by the spatial arrangement of protein complexes of known 3-D structure, suggesting a plausible physical mechanism for network evolution. Proteins function together forming stable protein complexes or transient interactions in various cellular processes, such as gene regulation and signaling. Here, we address the basic question of how these networks of interacting proteins evolve. This is an important problem, as the structures of such networks underlie important features of biological systems, such as functional modularity, error-tolerance, and stability. It is not yet known how these network architectures originate or what driving forces underlie the observed network structure. Several models have been proposed over the past decade—in particular, a “rich get richer” model (preferential attachment) and a model based upon gene duplication and divergence—often based only on network topologies. Here, we show that real yeast protein interaction networks show a unique age distribution among interacting proteins, which rules out these canonical models. In light of these results, we developed a simple, alternative model based on well-established physical principles, analogous to the process of growing protein crystals in solution. The model better explains many features of real PPI networks, including the network topologies, their characteristic age distributions, and the spatial distribution of subunits of differing ages within protein complexes, suggesting a plausible physical mechanism of network evolution.
Collapse
Affiliation(s)
- Wan Kyu Kim
- Center for Systems and Synthetic Biology, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, United States of America
| | - Edward M. Marcotte
- Center for Systems and Synthetic Biology, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, United States of America
- * E-mail:
| |
Collapse
|
16
|
Vinogradov AE. Modularity of cellular networks shows general center-periphery polarization. ACTA ACUST UNITED AC 2008; 24:2814-7. [PMID: 18953046 DOI: 10.1093/bioinformatics/btn555] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The modular biology is supposed to be a bridge from the molecular to the systems biology. Using a new approach, it is shown here that the protein interaction networks of yeast Saccharomyces cerevisiae and bacteria Escherichia coli consist of two large-scale modularity layers, central and peripheral, separated by a zone of depressed modularity. This finding based on the analysis of network topology is further supported by the discovery that there are many more Gene Ontology categories (terms) and KEGG biochemical pathways that are overrepresented in the central and peripheral layers than in the intermediate zone. The categories of the central layer are mostly related to nuclear information processing, regulation and cell cycle, whereas the peripheral layer is dealing with various metabolic and energetic processes, transport and cell communication. A similar center-periphery polarization of modularity is found in the protein domain networks ('built-in interactome') and in a powergrid (as a non-biological example). These data suggest a 'polarized modularity' model of cellular networks where the central layer seems to be regulatory and to use information storage of the nucleus, whereas the peripheral layer seems devoted to more specialized tasks and environmental interactions, with a complex 'bus' between the layers.
Collapse
|
17
|
Turanalp ME, Can T. Discovering functional interaction patterns in protein-protein interaction networks. BMC Bioinformatics 2008; 9:276. [PMID: 18547430 PMCID: PMC2442100 DOI: 10.1186/1471-2105-9-276] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2007] [Accepted: 06/11/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In recent years, a considerable amount of research effort has been directed to the analysis of biological networks with the availability of genome-scale networks of genes and/or proteins of an increasing number of organisms. A protein-protein interaction (PPI) network is a particular biological network which represents physical interactions between pairs of proteins of an organism. Major research on PPI networks has focused on understanding the topological organization of PPI networks, evolution of PPI networks and identification of conserved subnetworks across different species, discovery of modules of interaction, use of PPI networks for functional annotation of uncharacterized proteins, and improvement of the accuracy of currently available networks. RESULTS In this article, we map known functional annotations of proteins onto a PPI network in order to identify frequently occurring interaction patterns in the functional space. We propose a new frequent pattern identification technique, PPISpan, adapted specifically for PPI networks from a well-known frequent subgraph identification method, gSpan. Existing module discovery techniques either look for specific clique-like highly interacting protein clusters or linear paths of interaction. However, our goal is different; instead of single clusters or pathways, we look for recurring functional interaction patterns in arbitrary topologies. We have applied PPISpan on PPI networks of Saccharomyces cerevisiae and identified a number of frequently occurring functional interaction patterns. CONCLUSION With the help of PPISpan, recurring functional interaction patterns in an organism's PPI network can be identified. Such an analysis offers a new perspective on the modular organization of PPI networks. The complete list of identified functional interaction patterns is available at http://bioserver.ceng.metu.edu.tr/PPISpan/.
Collapse
Affiliation(s)
- Mehmet E Turanalp
- Department of Computer Engineering, Selcuk University, Alaaddin Keykubat Kampusu, 42075 Selcuklu, Konya, Turkey.
| | | |
Collapse
|
18
|
Cui J, Li P, Li G, Xu F, Zhao C, Li Y, Yang Z, Wang G, Yu Q, Li Y, Shi T. AtPID: Arabidopsis thaliana protein interactome database--an integrative platform for plant systems biology. Nucleic Acids Res 2008; 36:D999-1008. [PMID: 17962307 PMCID: PMC2238993 DOI: 10.1093/nar/gkm844] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2007] [Revised: 09/25/2007] [Accepted: 09/25/2007] [Indexed: 12/01/2022] Open
Abstract
Arabidopsis thaliana Protein Interactome Database (AtPID) is an object database that integrates data from several bioinformatics prediction methods and manually collected information from the literature. It contains data relevant to protein-protein interaction, protein subcellular location, ortholog maps, domain attributes and gene regulation. The predicted protein interaction data were obtained from ortholog interactome, microarray profiles, GO annotation, and conserved domain and genome contexts. This database holds 28,062 protein-protein interaction pairs with 23,396 pairs generated from prediction methods. Among the rest 4666 pairs, 3866 pairs of them involving 1875 proteins were manually curated from the literature and 800 pairs were from enzyme complexes in KEGG. In addition, subcellular location information of 5562 proteins is available. AtPID was built via an intuitive query interface that provides easy access to the important features of proteins. Through the incorporation of both experimental and computational methods, AtPID is a rich source of information for system-level understanding of gene function and biological processes in A. thaliana. Public access to the AtPID database is available at http://atpid.biosino.org/.
Collapse
Affiliation(s)
- Jian Cui
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Peng Li
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Guang Li
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Feng Xu
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Chen Zhao
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Yuhua Li
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Zhongnan Yang
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Guang Wang
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Qingbo Yu
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Yixue Li
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| | - Tieliu Shi
- College of Life Sciences, the Northeast Forestry University, Harbin, Heilongjiang 150040, Shanghai Information Center for Life Sciences, Chinese Academy of Sciences, Shanghai 200031, Bioinformatics Center, Key Lab of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Daqing Institute of Biotechnology, Northeast Forestry University, Daqing, Heilongjiang 163316, College of Life and Environmental Sciences, Shanghai Normal University, Shanghai 200234 and Bioinformatics Center, Shanghai University, Shanghai 200444, China
| |
Collapse
|
19
|
Korcsmáros T, Kovács IA, Szalay MS, Csermely P. Molecular chaperones: the modular evolution of cellular networks. J Biosci 2007; 32:441-6. [PMID: 17536163 DOI: 10.1007/s12038-007-0043-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Molecular chaperones play a prominent role in signaling and transcriptional regulatory networks of the cell. Recent advances uncovered that chaperones act as genetic buffers stabilizing the phenotype of various cells and organisms and may serve as potential regulators of evolvability. Chaperones have weak links, connect hubs, are in the overlaps of network modules and may uncouple these modules during stress,which gives an additional protection for the cell at the network-level. Moreover,after stress chaperones are essential to re-build inter-modular contacts by their low affinity sampling of the potential interaction partners in different modules. This opens the way to the chaperone-regulated modular evolution of cellular networks,and helps us to design novel therapeutic and anti-aging strategies.
Collapse
Affiliation(s)
- Tamás Korcsmáros
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
| | | | | | | |
Collapse
|
20
|
Zhang S, Jin G, Zhang XS, Chen L. Discovering functions and revealing mechanisms at molecular level from biological networks. Proteomics 2007; 7:2856-69. [PMID: 17703505 DOI: 10.1002/pmic.200700095] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
With the increasingly accumulated data from high-throughput technologies, study on biomolecular networks has become one of key focuses in systems biology and bioinformatics. In particular, various types of molecular networks (e.g., protein-protein interaction (PPI) network; gene regulatory network (GRN); metabolic network (MN); gene coexpression network (GCEN)) have been extensively investigated, and those studies demonstrate great potentials to discover basic functions and to reveal essential mechanisms for various biological phenomena, by understanding biological systems not at individual component level but at a system-wide level. Recent studies on networks have created very prolific researches on many aspects of living organisms. In this paper, we aim to review the recent developments on topics related to molecular networks in a comprehensive manner, with the special emphasis on the computational aspect. The contents of the survey cover global topological properties and local structural characteristics, network motifs, network comparison and query, detection of functional modules and network motifs, function prediction from network analysis, inferring molecular networks from biological data as well as representative databases and software tools.
Collapse
Affiliation(s)
- Shihua Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | | | | | | |
Collapse
|
21
|
Wang Z, Zhang J. In search of the biological significance of modular structures in protein networks. PLoS Comput Biol 2007; 3:e107. [PMID: 17542644 PMCID: PMC1885274 DOI: 10.1371/journal.pcbi.0030107] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2006] [Accepted: 04/26/2007] [Indexed: 12/02/2022] Open
Abstract
Many complex networks such as computer and social networks exhibit modular structures, where links between nodes are much denser within modules than between modules. It is widely believed that cellular networks are also modular, reflecting the relative independence and coherence of different functional units in a cell. While many authors have claimed that observations from the yeast protein–protein interaction (PPI) network support the above hypothesis, the observed structural modularity may be an artifact because the current PPI data include interactions inferred from protein complexes through approaches that create modules (e.g., assigning pairwise interactions among all proteins in a complex). Here we analyze the yeast PPI network including protein complexes (PIC network) and excluding complexes (PEC network). We find that both PIC and PEC networks show a significantly greater structural modularity than that of randomly rewired networks. Nonetheless, there is little evidence that the structural modules correspond to functional units, particularly in the PEC network. More disturbingly, there is no evolutionary conservation among yeast, fly, and nematode modules at either the whole-module or protein-pair level. Neither is there a correlation between the evolutionary or phylogenetic conservation of a protein and the extent of its participation in various modules. Using computer simulation, we demonstrate that a higher-than-expected modularity can arise during network growth through a simple model of gene duplication, without natural selection for modularity. Taken together, our results suggest the intriguing possibility that the structural modules in the PPI network originated as an evolutionary byproduct without biological significance. Many complex networks are naturally divided into communities or modules, where links within modules are much denser than those across modules. For example, human individuals belonging to the same ethnic groups interact more than those from different ethnic groups. Cellular functions are also organized in a highly modular manner, where each module is a discrete object composed of a group of tightly linked components and performs a relatively independent task. It is interesting to ask whether this modularity in cellular function arises from modularity in molecular interaction networks such as the transcriptional regulatory network and protein–protein interaction (PPI) network. We analyze the yeast PPI network and show that it is indeed significantly more modular than randomly rewired networks. However, we find little evidence that the structural modules correspond to functional units. We also fail to observe any evolutionary conservation among yeast, fly, and nematode PPI modules. We then show by computer simulation that modular structures can arise during network growth via a simple model of gene duplication, without natural selection for modularity. Thus, it appears that the structural modules in the PPI network may have originated as an evolutionary byproduct without much biological significance.
Collapse
Affiliation(s)
- Zhi Wang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
22
|
Current awareness on yeast. Yeast 2007. [DOI: 10.1002/yea.1323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|