101
|
Mechanisms of protein oligomerization, the critical role of insertions and deletions in maintaining different oligomeric states. Proc Natl Acad Sci U S A 2010; 107:20352-7. [PMID: 21048085 DOI: 10.1073/pnas.1012999107] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The main principles of protein-protein recognition are elucidated by the studies of homooligomers which in turn mediate and regulate gene expression, activity of enzymes, ion channels, receptors, and cell-cell adhesion processes. Here we explore oligomeric states of homologous proteins in various organisms to better understand the functional roles and evolutionary mechanisms of homooligomerization. We observe a great diversity in mechanisms controlling oligomerization and focus in our study on insertions and deletions in homologous proteins and how they enable or disable complex formation. We show that insertions and deletions which differentiate monomers and dimers have a significant tendency to be located on the interaction interfaces and about a quarter of all proteins studied and forty percent of enzymes have regions which mediate or disrupt the formation of oligomers. We suggest that relatively small insertions or deletions may have a profound effect on complex stability and/or specificity. Indeed removal of complex enabling regions from protein structures in many cases resulted in the complete or partial loss of stability. Moreover, we find that insertions and deletions modulating oligomerization have a lower aggregation propensity and contain a larger fraction of polar, charged residues, glycine and proline compared to conventional interfaces and protein surface. Most likely, these regions may mediate specific interactions, prevent nonspecific dysfunctional aggregation and preclude undesired interactions between close paralogs therefore separating their functional pathways. Last, we show how the presence or absence of insertions and deletions on interfaces might be of practical value in annotating protein oligomeric states.
Collapse
|
102
|
Abstract
Although the origin of mitochondria from the endosymbiosis of an α-proteobacterium is well established, the nature of the host cell, the metabolic complexity of the endosymbiont and the subsequent evolution of the proto-mitochondrion into all its current appearances are still the subject of discovery and sometimes debate. Here we review what has been inferred about the original composition and subsequent evolution of the mitochondrial proteome and essential mitochondrial systems. The evolutionary mosaic that currently constitutes mitochondrial proteomes contains (i) endosymbiotic proteins (15-45%), (ii) proteins without detectable orthologs outside the eukaryotic lineage (40%), and (iii) proteins that are derived from non-proteobacterial Bacteria, Bacteriophages and Archaea (15%, specifically multiple tRNA-modification proteins). Protein complexes are of endosymbiotic origin, but have greatly expanded with novel eukaryotic proteins; in contrast to mitochondrial enzymes that are both of proteobacterial and non-proteobacterial origin. This disparity is consistent with the complexity hypothesis, which argues that proteins that are a part of large, multi-subunit complexes are unlikely to undergo horizontal gene transfer. We observe that they neither change their subcellular compartments in the course of evolution, even when their genes do.
Collapse
Affiliation(s)
- Radek Szklarczyk
- Radboud University Nijmegen Medical Centre, CMBI/NCMLS, Nijmegen, The Netherlands
| | | |
Collapse
|
103
|
Fromer M, Linial M. Exposing the co-adaptive potential of protein-protein interfaces through computational sequence design. ACTA ACUST UNITED AC 2010; 26:2266-72. [PMID: 20679332 DOI: 10.1093/bioinformatics/btq412] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
MOTIVATION In nature, protein-protein interactions are constantly evolving under various selective pressures. Nonetheless, it is expected that crucial interactions are maintained through compensatory mutations between interacting proteins. Thus, many studies have used evolutionary sequence data to extract such occurrences of correlated mutation. However, this research is confounded by other evolutionary pressures that contribute to sequence covariance, such as common ancestry. RESULTS Here, we focus exclusively on the compensatory mutations deriving from physical protein interactions, by performing large-scale computational mutagenesis experiments for >260 protein-protein interfaces. We investigate the potential for co-adaptability present in protein pairs that are always found together in nature (obligate) and those that are occasionally in complex (transient). By modeling each complex both in bound and unbound forms, we find that naturally transient complexes possess greater relative capacity for correlated mutation than obligate complexes, even when differences in interface size are taken into account.
Collapse
Affiliation(s)
- Menachem Fromer
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | |
Collapse
|
104
|
Lovell SC, Robertson DL. An integrated view of molecular coevolution in protein-protein interactions. Mol Biol Evol 2010; 27:2567-75. [PMID: 20551042 DOI: 10.1093/molbev/msq144] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Protein-protein interactions effectively mediate molecular function. They are the result of specific interactions between protein interfaces and are maintained by the action of evolutionary pressure on the regions of the interacting proteins that contribute to binding. For the most part, selection restricts amino acid replacements, accounting for the conservation of binding interfaces. However, in some cases, change in one protein will be mitigated by compensatory change in its binding partner, maintaining function in the face of evolutionary change. There have been several attempts to use correlations in sequence evolution to predict interactions of proteins. Most commonly, these approaches use the entire sequence to identify correlations and so infer probable binding. However, other factors such as shared evolutionary history and similarities in the rates of evolution confound these whole-sequence-based approaches. Here, we discuss recent work on this topic and argue that both site-specific coevolutionary change and whole-sequence evolution contribute to evolutionary signals in sets of interacting proteins. We discuss the relative effects of both types of selection and how they might be identified. This permits an integrated view of protein-protein interactions, their evolution, and coevolution.
Collapse
Affiliation(s)
- Simon C Lovell
- Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, United Kingdom.
| | | |
Collapse
|
105
|
Pérez-Bercoff A, Makino T, McLysaght A. Duplicability of self-interacting human genes. BMC Evol Biol 2010; 10:160. [PMID: 20509897 PMCID: PMC2894830 DOI: 10.1186/1471-2148-10-160] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2009] [Accepted: 05/28/2010] [Indexed: 12/05/2022] Open
Abstract
Background There is increasing interest in the evolution of protein-protein interactions because this should ultimately be informative of the patterns of evolution of new protein functions within the cell. One model proposes that the evolution of new protein-protein interactions and protein complexes proceeds through the duplication of self-interacting genes. This model is supported by data from yeast. We examined the relationship between gene duplication and self-interaction in the human genome. Results We investigated the patterns of self-interaction and duplication among 34808 interactions encoded by 8881 human genes, and show that self-interacting proteins are encoded by genes with higher duplicability than genes whose proteins lack this type of interaction. We show that this result is robust against the system used to define duplicate genes. Finally we compared the presence of self-interactions amongst proteins whose genes have duplicated either through whole-genome duplication (WGD) or small-scale duplication (SSD), and show that the former tend to have more interactions in general. After controlling for age differences between the two sets of duplicates this result can be explained by the time since the gene duplication. Conclusions Genes encoding self-interacting proteins tend to have higher duplicability than proteins lacking self-interactions. Moreover these duplicate genes have more often arisen through whole-genome rather than small-scale duplication. Finally, self-interacting WGD genes tend to have more interaction partners in general in the PIN, which can be explained by their overall greater age. This work adds to our growing knowledge of the importance of contextual factors in gene duplicability.
Collapse
Affiliation(s)
- Asa Pérez-Bercoff
- Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin 2, Ireland
| | | | | |
Collapse
|
106
|
Blackman BK, Strasburg JL, Raduski AR, Michaels SD, Rieseberg LH. The role of recently derived FT paralogs in sunflower domestication. Curr Biol 2010; 20:629-35. [PMID: 20303265 DOI: 10.1016/j.cub.2010.01.059] [Citation(s) in RCA: 161] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Revised: 01/26/2010] [Accepted: 01/27/2010] [Indexed: 11/17/2022]
Abstract
Gene duplication provides an important source of genetic raw material for phenotypic diversification, but few studies have detailed the mechanisms through which duplications produce evolutionary novelty within species. Here, we investigate how a set of recently duplicated homologs of the floral inducer FLOWERING LOCUS T (FT) has contributed to sunflower domestication. We find that changes in expression of these duplicates are associated with differences in flowering behavior between wild and domesticated sunflower. In addition, we present genetic and functional evidence demonstrating that a frameshift mutation in one paralog, Helianthus annuus FT 1 (HaFT1), underlies a major QTL for flowering time and experienced a selective sweep during early domestication. Notably, this dominant-negative allele delays flowering through interference with action of another paralog, HaFT4. Together, these data reveal that changes affecting the expression, sequence, and gene interactions of HaFT paralogs have played key roles during sunflower domestication. Our findings also illustrate the important role that evolving interactions between new gene family members may play in fostering phenotypic change.
Collapse
|
107
|
Reid AJ, Ranea JA, Orengo CA. Comparative evolutionary analysis of protein complexes in E. coli and yeast. BMC Genomics 2010; 11:79. [PMID: 20122144 PMCID: PMC2837643 DOI: 10.1186/1471-2164-11-79] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2009] [Accepted: 02/01/2010] [Indexed: 11/17/2022] Open
Abstract
Background Proteins do not act in isolation; they frequently act together in protein complexes to carry out concerted cellular functions. The evolution of complexes is poorly understood, especially in organisms other than yeast, where little experimental data has been available. Results We generated accurate, high coverage datasets of protein complexes for E. coli and yeast in order to study differences in the evolution of complexes between these two species. We show that substantial differences exist in how complexes have evolved between these organisms. A previously proposed model of complex evolution identified complexes with cores of interacting homologues. We support findings of the relative importance of this mode of evolution in yeast, but find that it is much less common in E. coli. Additionally it is shown that those homologues which do cluster in complexes are involved in eukaryote-specific functions. Furthermore we identify correlated pairs of non-homologous domains which occur in multiple protein complexes. These were identified in both yeast and E. coli and we present evidence that these too may represent complex cores in yeast but not those of E. coli. Conclusions Our results suggest that there are differences in the way protein complexes have evolved in E. coli and yeast. Whereas some yeast complexes have evolved by recruiting paralogues, this is not apparent in E. coli. Furthermore, such complexes are involved in eukaryotic-specific functions. This implies that the increase in gene family sizes seen in eukaryotes in part reflects multiple family members being used within complexes. However, in general, in both E. coli and yeast, homologous domains are used in different complexes.
Collapse
Affiliation(s)
- Adam J Reid
- Research Department of Structural & Molecular Biology, University College London, London, WC1E 6BT, UK.
| | | | | |
Collapse
|
108
|
Ma L, Pang CNI, Li SS, Wilkins MR. Proteins Deleterious on Overexpression Are Associated with High Intrinsic Disorder, Specific Interaction Domains, and Low Abundance. J Proteome Res 2010; 9:1218-25. [DOI: 10.1021/pr900693e] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Liang Ma
- Systems Biology Initiative and School of Biotechnology and Biomolecular Sciences, University of New South Wales, NSW, Australia
| | - Chi Nam Ignatius Pang
- Systems Biology Initiative and School of Biotechnology and Biomolecular Sciences, University of New South Wales, NSW, Australia
| | - Simone S. Li
- Systems Biology Initiative and School of Biotechnology and Biomolecular Sciences, University of New South Wales, NSW, Australia
| | - Marc R. Wilkins
- Systems Biology Initiative and School of Biotechnology and Biomolecular Sciences, University of New South Wales, NSW, Australia
| |
Collapse
|
109
|
|
110
|
Dayhoff JE, Shoemaker BA, Bryant SH, Panchenko AR. Evolution of protein binding modes in homooligomers. J Mol Biol 2009; 395:860-70. [PMID: 19879880 DOI: 10.1016/j.jmb.2009.10.052] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2009] [Revised: 10/22/2009] [Accepted: 10/24/2009] [Indexed: 10/20/2022]
Abstract
The evolution of protein interactions cannot be deciphered without a detailed analysis of interaction interfaces and binding modes. We performed a large-scale study of protein homooligomers in terms of their symmetry, interface sizes, and conservation of binding modes. We also focused specifically on the evolution of protein binding modes from nine families of homooligomers and mapped 60 different binding modes and oligomerization states onto the phylogenetic trees of these families. We observed a significant tendency for the same binding modes to be clustered together and conserved within clades on phylogenetic trees; this trend is especially pronounced for close homologs with 70% sequence identity or higher. Some binding modes are conserved among very distant homologs, pointing to their ancient evolutionary origin, while others are very specific for a certain phylogenetic group. Moreover, we found that the most ancient binding modes have a tendency to involve symmetrical (isologous) homodimer binding arrangements with larger interfaces, while recently evolved binding modes more often exhibit asymmetrical arrangements and smaller interfaces.
Collapse
Affiliation(s)
- Judith E Dayhoff
- Department of Biochemistry and Molecular and Cellular Biology, Georgetown University, Washington, DC, USA
| | | | | | | |
Collapse
|
111
|
Ratmann O, Wiuf C, Pinney JW. From evidence to inference: probing the evolution of protein interaction networks. HFSP JOURNAL 2009; 3:290-306. [PMID: 20357887 DOI: 10.2976/1.3167215] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2009] [Revised: 05/30/2009] [Indexed: 01/06/2023]
Abstract
The evolutionary mechanisms by which protein interaction networks grow and change are beginning to be appreciated as a major factor shaping their present-day structures and properties. Starting with a consideration of the biases and errors inherent in our current views of these networks, we discuss the dangers of constructing evolutionary arguments from naïve analyses of network topology. We argue that progress in understanding the processes of network evolution is only possible when hypotheses are formulated as plausible evolutionary models and compared against the observed data within the framework of probabilistic modeling. The value of such models is expected to be greatly enhanced as they incorporate more of the details of the biophysical properties of interacting proteins, gene phylogeny, and measurement error and as more advanced methodologies emerge for model comparison and the inference of ancestral network states.
Collapse
|
112
|
Peregrín-Alvarez JM, Xiong X, Su C, Parkinson J. The Modular Organization of Protein Interactions in Escherichia coli. PLoS Comput Biol 2009; 5:e1000523. [PMID: 19798435 PMCID: PMC2739439 DOI: 10.1371/journal.pcbi.1000523] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2009] [Accepted: 08/27/2009] [Indexed: 11/19/2022] Open
Abstract
Escherichia coli serves as an excellent model for the study of fundamental cellular processes such as metabolism, signalling and gene expression. Understanding the function and organization of proteins within these processes is an important step towards a 'systems' view of E. coli. Integrating experimental and computational interaction data, we present a reliable network of 3,989 functional interactions between 1,941 E. coli proteins ( approximately 45% of its proteome). These were combined with a recently generated set of 3,888 high-quality physical interactions between 918 proteins and clustered to reveal 316 discrete modules. In addition to known protein complexes (e.g., RNA and DNA polymerases), we identified modules that represent biochemical pathways (e.g., nitrate regulation and cell wall biosynthesis) as well as batteries of functionally and evolutionarily related processes. To aid the interpretation of modular relationships, several case examples are presented, including both well characterized and novel biochemical systems. Together these data provide a global view of the modular organization of the E. coli proteome and yield unique insights into structural and evolutionary relationships in bacterial networks.
Collapse
Affiliation(s)
- José M. Peregrín-Alvarez
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
| | - Xuejian Xiong
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Chong Su
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - John Parkinson
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
113
|
Abraham AL, Pothier J, Rocha EPC. Alternative to homo-oligomerisation: the creation of local symmetry in proteins by internal amplification. J Mol Biol 2009; 394:522-34. [PMID: 19769988 DOI: 10.1016/j.jmb.2009.09.031] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2009] [Revised: 09/11/2009] [Accepted: 09/15/2009] [Indexed: 11/18/2022]
Abstract
The biologically active state of many proteins requires their prior homo-oligomerisation. Such complexes are typically symmetrical, a feature that has been proposed to increase their stability and facilitate the evolution of allosteric regulation. We wished to examine the possibility that similar structures and properties could arise from genetic amplifications leading to internal symmetrical repeats. For this, we identified internal structural repeats in a nonredundant Protein Data Bank subset. While testing if repeats in proteins tend to be symmetrical, we found that about half of the large internal repeats are symmetrical, most frequently around a rotation axis of 180 degrees . These repeats were most likely created by genetic amplification processes because they show significant sequence similarity. Symmetrical repeats tend to have a fixed number of copies corresponding to their rotational symmetry order, that is, two for 180 degrees rotation axis, whereas asymmetrical repeats are in longer proteins and show copy number variability. When possible, we confirmed that proteins with symmetrical repeats folding as an n-mer have homologues lacking the repeat with a higher oligomerisation number corresponding to the rotation symmetry order of the repeat. Phylogenetic analyses of these protein families suggest that typically, but not always, symmetrical repeats arise in one single event from proteins that are homo-oligomers. These results suggest that oligomerisation and amplification of internal sequences can interplay in evolutionary terms because they result in functional analogues when the latter exhibit rotational symmetry.
Collapse
Affiliation(s)
- Anne-Laure Abraham
- Atelier de BioInformatique, Université Pierre et Marie Curie-Paris 06, Paris, France.
| | | | | |
Collapse
|
114
|
Beltrao P, Trinidad JC, Fiedler D, Roguev A, Lim WA, Shokat KM, Burlingame AL, Krogan NJ. Evolution of phosphoregulation: comparison of phosphorylation patterns across yeast species. PLoS Biol 2009; 7:e1000134. [PMID: 19547744 PMCID: PMC2691599 DOI: 10.1371/journal.pbio.1000134] [Citation(s) in RCA: 161] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Accepted: 05/12/2009] [Indexed: 12/27/2022] Open
Abstract
Analysis of the phosphoproteomes and the gene interaction networks of divergent yeast species defines the relative contribution of changes in protein phosphorylation pathways to the generation of phenotypic diversity. The extent by which different cellular components generate phenotypic diversity is an ongoing debate in evolutionary biology that is yet to be addressed by quantitative comparative studies. We conducted an in vivo mass-spectrometry study of the phosphoproteomes of three yeast species (Saccharomyces cerevisiae, Candida albicans, and Schizosaccharomyces pombe) in order to quantify the evolutionary rate of change of phosphorylation. We estimate that kinase–substrate interactions change, at most, two orders of magnitude more slowly than transcription factor (TF)–promoter interactions. Our computational analysis linking kinases to putative substrates recapitulates known phosphoregulation events and provides putative evolutionary histories for the kinase regulation of protein complexes across 11 yeast species. To validate these trends, we used the E-MAP approach to analyze over 2,000 quantitative genetic interactions in S. cerevisiae and Sc. pombe, which demonstrated that protein kinases, and to a greater extent TFs, show lower than average conservation of genetic interactions. We propose therefore that protein kinases are an important source of phenotypic diversity. Natural selection at a population level requires phenotypic diversity, which at the molecular level arises by mutation of the genome of each individual. What kinds of changes at the level of the DNA are most important for the generation of phenotypic differences remains a fundamental question in evolutionary biology. One well-studied source of phenotypic diversity is mutation in gene regulatory regions that results in changes in gene expression, but what proportion of phenotypic diversity is due to such mutations is not entirely clear. We investigated the relative contribution to phenotypic diversity of mutations in protein-coding regions compared to mutations in gene regulatory sequences. Given the important regulatory role played by phosphorylation across biological systems, we focused on mutations in protein-coding regions that alter protein–protein interactions involved in the binding of kinases to their substrate proteins. We studied the evolution of this “phosphoregulation” by analyzing the in vivo complement of phosphorylated proteins (the “phosphoproteome”) in three highly diverged yeast species—the budding yeast Saccharomyces cerevisiae, the pathogenic yeast Candida albicans, and the fission yeast Schizosaccharomyces pombe—and integrating those data with existing data on thousands of known genetic interactions from S. cerevisiae and Sc. pombe. We show that kinase–substrate interactions are altered at a rate that is at most two orders of magnitude slower than the alteration of transcription factor (TF)–promoter interactions, whereas TFs and kinases both show a faster than average rate of functional divergence estimated by the cross-species analysis of genetic interactions. Our data provide a quantitative estimate of the relative frequencies of different kinds of functionally relevant mutations and demonstrate that, like mutations in gene regulatory regions, mutations that result in changes in kinase–substrate interactions are an important source of phenotypic diversity.
Collapse
Affiliation(s)
- Pedro Beltrao
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biosciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (PB); (NJK)
| | - Jonathan C. Trinidad
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Dorothea Fiedler
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- Howard Hughes Medical Institute, University of California San Francisco, San Francisco, California, United States of America
| | - Assen Roguev
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biosciences, University of California San Francisco, San Francisco, California, United States of America
| | - Wendell A. Lim
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- Howard Hughes Medical Institute, University of California San Francisco, San Francisco, California, United States of America
- Cell Propulsion Laboratory (a National Institutes of Health Nanomedicine Development Center), University of California San Francisco, San Francisco, California, United States of America
| | - Kevan M. Shokat
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- Howard Hughes Medical Institute, University of California San Francisco, San Francisco, California, United States of America
| | - Alma L. Burlingame
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Nevan J. Krogan
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biosciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (PB); (NJK)
| |
Collapse
|
115
|
Petrey D, Honig B. Is protein classification necessary? Toward alternative approaches to function annotation. Curr Opin Struct Biol 2009; 19:363-8. [PMID: 19269161 PMCID: PMC2745633 DOI: 10.1016/j.sbi.2009.02.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2009] [Accepted: 02/02/2009] [Indexed: 11/16/2022]
Abstract
The current nonredundant protein sequence database contains over seven million entries and the number of individual functional domains is significantly larger than this value. The vast quantity of data associated with these proteins poses enormous challenges to any attempt at function annotation. Classification of proteins into sequence and structural groups has been widely used as an approach to simplifying the problem. In this article we question such strategies. We describe how the multifunctionality and structural diversity of even closely related proteins confounds efforts to assign function on the basis of overall sequence or structural similarity. Rather, we suggest that strategies that avoid classification may offer a more robust approach to protein function annotation.
Collapse
Affiliation(s)
- Donald Petrey
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10032, USA
| | | |
Collapse
|
116
|
Yosef N, Kupiec M, Ruppin E, Sharan R. A complex-centric view of protein network evolution. Nucleic Acids Res 2009; 37:e88. [PMID: 19465379 PMCID: PMC2709590 DOI: 10.1093/nar/gkp414] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The recent availability of protein-protein interaction networks for several species makes it possible to study protein complexes in an evolutionary context. In this article, we present a novel network-based framework for reconstructing the evolutionary history of protein complexes. Our analysis is based on generalizing evolutionary measures for single proteins to the level of whole subnetworks, comprehensively considering a broad set of computationally derived complexes and accounting for both sequence and interaction changes. Specifically, we compute sets of orthologous complexes across species, and use these to derive evolutionary rate and age measures for protein complexes. We observe significant correlations between the evolutionary properties of a complex and those of its member proteins, suggesting that protein complexes form early in evolution and evolve as coherent units. Additionally, our approach enables us to directly quantify the extent to which gene duplication has played a role in the evolution of complexes. We find that about one quarter of the sets of orthologous complexes have originated from evolutionary cores of homodimers that underwent duplication and divergence, testifying to the important role of gene duplication in protein complex evolution.
Collapse
Affiliation(s)
- Nir Yosef
- The Blavatnik School of Computer Science, Department of Molecular Microbiology and Biotechnology and School of Medicine, Tel-Aviv University, Tel-Aviv 69978, Israel
| | | | | | | |
Collapse
|
117
|
Meng G, Fronzes R, Chandran V, Remaut H, Waksman G. Protein oligomerization in the bacterial outer membrane (Review). Mol Membr Biol 2009; 26:136-45. [PMID: 19225986 DOI: 10.1080/09687680802712422] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The formation of homo-oligomeric assemblies is a well-established characteristic of many soluble proteins and enzymes. Oligomerization has been shown to increase protein stability, allow allosteric cooperativity, shape reaction compartments and provide multivalent interaction sites in soluble proteins. In comparison, our understanding of the prevalence and reasons behind protein oligomerization in membrane proteins is relatively sparse. Recent progress in structural biology of bacterial outer membrane proteins has suggested that oligomerization may be as common and versatile as in soluble proteins. Here we review the current understanding of oligomerization in the bacterial outer membrane from a structural and functional point of view.
Collapse
Affiliation(s)
- Guoyu Meng
- Institute of Structural and Molecular Biology, University College London and Birkbeck College, London, UK
| | | | | | | | | |
Collapse
|
118
|
Abstract
Contemporary protein architectures can be regarded as molecular fossils, historical imprints that mark important milestones in the history of life. Whereas sequences change at a considerable pace, higher-order structures are constrained by the energetic landscape of protein folding, the exploration of sequence and structure space, and complex interactions mediated by the proteostasis and proteolytic machineries of the cell. The survey of architectures in the living world that was fuelled by recent structural genomic initiatives has been summarized in protein classification schemes, and the overall structure of fold space explored with novel bioinformatic approaches. However, metrics of general structural comparison have not yet unified architectural complexity using the 'shared and derived' tenet of evolutionary analysis. In contrast, a shift of focus from molecules to proteomes and a census of protein structure in fully sequenced genomes were able to uncover global evolutionary patterns in the structure of proteins. Timelines of discovery of architectures and functions unfolded episodes of specialization, reductive evolutionary tendencies of architectural repertoires in proteomes and the rise of modularity in the protein world. They revealed a biologically complex ancestral proteome and the early origin of the archaeal lineage. Studies also identified an origin of the protein world in enzymes of nucleotide metabolism harbouring the P-loop-containing triphosphate hydrolase fold and the explosive discovery of metabolic functions that recapitulated well-defined prebiotic shells and involved the recruitment of structures and functions. These observations have important implications for origins of modern biochemistry and diversification of life.
Collapse
|
119
|
Gibson TA, Goldberg DS. Questioning the ubiquity of neofunctionalization. PLoS Comput Biol 2009; 5:e1000252. [PMID: 19119408 PMCID: PMC2597716 DOI: 10.1371/journal.pcbi.1000252] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2008] [Accepted: 11/12/2008] [Indexed: 11/19/2022] Open
Abstract
Gene duplication provides much of the raw material from which functional diversity evolves. Two evolutionary mechanisms have been proposed that generate functional diversity: neofunctionalization, the de novo acquisition of function by one duplicate, and subfunctionalization, the partitioning of ancestral functions between gene duplicates. With protein interactions as a surrogate for protein functions, evidence of prodigious neofunctionalization and subfunctionalization has been identified in analyses of empirical protein interactions and evolutionary models of protein interactions. However, we have identified three phenomena that have contributed to neofunctionalization being erroneously identified as a significant factor in protein interaction network evolution. First, self-interacting proteins are underreported in interaction data due to biological artifacts and design limitations in the two most common high-throughput protein interaction assays. Second, evolutionary inferences have been drawn from paralog analysis without consideration for concurrent and subsequent duplication events. Third, the theoretical model of prodigious neofunctionalization is unable to reproduce empirical network clustering and relies on untenable parameter requirements. In light of these findings, we believe that protein interaction evolution is more persuasively characterized by subfunctionalization and self-interactions.
Collapse
Affiliation(s)
- Todd A Gibson
- Computational Bioscience Program, University of Colorado Denver, Aurora, Colorado, United States of America.
| | | |
Collapse
|
120
|
Sellerio A, Bassetti B, Isambert H, Cosentino Lagomarsino M. A comparative evolutionary study of transcription networks. The global role of feedback and hierachical structures. ACTA ACUST UNITED AC 2009; 5:170-9. [DOI: 10.1039/b815339f] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
121
|
Wong P, Althammer S, Hildebrand A, Kirschner A, Pagel P, Geissler B, Smialowski P, Blöchl F, Oesterheld M, Schmidt T, Strack N, Theis FJ, Ruepp A, Frishman D. An evolutionary and structural characterization of mammalian protein complex organization. BMC Genomics 2008; 9:629. [PMID: 19108706 PMCID: PMC2645396 DOI: 10.1186/1471-2164-9-629] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2008] [Accepted: 12/23/2008] [Indexed: 12/25/2022] Open
Abstract
Background We have recently released a comprehensive, manually curated database of mammalian protein complexes called CORUM. Combining CORUM with other resources, we assembled a dataset of over 2700 mammalian complexes. The availability of a rich information resource allows us to search for organizational properties concerning these complexes. Results As the complexity of a protein complex in terms of the number of unique subunits increases, we observed that the number of such complexes and the mean non-synonymous to synonymous substitution ratio of associated genes tend to decrease. Similarly, as the number of different complexes a given protein participates in increases, the number of such proteins and the substitution ratio of the associated gene also tends to decrease. These observations provide evidence relating natural selection and the organization of mammalian complexes. We also observed greater homogeneity in terms of predicted protein isoelectric points, secondary structure and substitution ratio in annotated versus randomly generated complexes. A large proportion of the protein content and interactions in the complexes could be predicted from known binary protein-protein and domain-domain interactions. In particular, we found that large proteins interact preferentially with much smaller proteins. Conclusion We observed similar trends in yeast and other data. Our results support the existence of conserved relations associated with the mammalian protein complexes.
Collapse
Affiliation(s)
- Philip Wong
- Helmholtz Center Munich-German Research Center for Environmental Health (GmbH), Institute of Bioinformatics and Systems Biology, Ingolstädter Landstrasse 1, Neuherberg, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
122
|
Szklarczyk R, Huynen MA, Snel B. Complex fate of paralogs. BMC Evol Biol 2008; 8:337. [PMID: 19094234 PMCID: PMC2628386 DOI: 10.1186/1471-2148-8-337] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 12/18/2008] [Indexed: 01/20/2023] Open
Abstract
Background Thanks to recent high coverage mass-spectrometry studies and reconstructed protein complexes, we are now in an unprecedented position to study the evolution of biological systems. Gene duplications, known to be a major source of innovation in evolution, can now be readily examined in the context of protein complexes. Results We observe that paralogs operating in the same complex fulfill different roles: mRNA dosage increase for more than a hundred cytosolic ribosomal proteins, mutually exclusive participation of at least 54 paralogs resulting in alternative forms of complexes, and 24 proteins contributing to bona fide structural growth. Inspection of paralogous proteins participating in two independent complexes shows that an ancient, pre-duplication protein functioned in both multi-protein assemblies and a gene duplication event allowed the respective copies to specialize and split their roles. Conclusion Variants with conditionally assembled, paralogous subunits likely have played a role in yeast's adaptation to anaerobic conditions. In a number of cases the gene duplication has given rise to one duplicate that is no longer part of a protein complex and shows an accelerated rate of evolution. Such genes could provide the raw material for the evolution of new functions.
Collapse
Affiliation(s)
- Radek Szklarczyk
- Centre for Molecular and Biomolecular Informatics, NCMLS, Radboud University Medical Centre, PO Box 9101, 6500 HB Nijmegen, the Netherlands.
| | | | | |
Collapse
|
123
|
Kim WK, Marcotte EM. Age-dependent evolution of the yeast protein interaction network suggests a limited role of gene duplication and divergence. PLoS Comput Biol 2008; 4:e1000232. [PMID: 19043579 PMCID: PMC2583957 DOI: 10.1371/journal.pcbi.1000232] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 10/17/2008] [Indexed: 11/18/2022] Open
Abstract
Proteins interact in complex protein–protein interaction (PPI) networks whose topological properties—such as scale-free topology, hierarchical modularity, and dissortativity—have suggested models of network evolution. Currently preferred models invoke preferential attachment or gene duplication and divergence to produce networks whose topology matches that observed for real PPIs, thus supporting these as likely models for network evolution. Here, we show that the interaction density and homodimeric frequency are highly protein age–dependent in real PPI networks in a manner which does not agree with these canonical models. In light of these results, we propose an alternative stochastic model, which adds each protein sequentially to a growing network in a manner analogous to protein crystal growth (CG) in solution. The key ideas are (1) interaction probability increases with availability of unoccupied interaction surface, thus following an anti-preferential attachment rule, (2) as a network grows, highly connected sub-networks emerge into protein modules or complexes, and (3) once a new protein is committed to a module, further connections tend to be localized within that module. The CG model produces PPI networks consistent in both topology and age distributions with real PPI networks and is well supported by the spatial arrangement of protein complexes of known 3-D structure, suggesting a plausible physical mechanism for network evolution. Proteins function together forming stable protein complexes or transient interactions in various cellular processes, such as gene regulation and signaling. Here, we address the basic question of how these networks of interacting proteins evolve. This is an important problem, as the structures of such networks underlie important features of biological systems, such as functional modularity, error-tolerance, and stability. It is not yet known how these network architectures originate or what driving forces underlie the observed network structure. Several models have been proposed over the past decade—in particular, a “rich get richer” model (preferential attachment) and a model based upon gene duplication and divergence—often based only on network topologies. Here, we show that real yeast protein interaction networks show a unique age distribution among interacting proteins, which rules out these canonical models. In light of these results, we developed a simple, alternative model based on well-established physical principles, analogous to the process of growing protein crystals in solution. The model better explains many features of real PPI networks, including the network topologies, their characteristic age distributions, and the spatial distribution of subunits of differing ages within protein complexes, suggesting a plausible physical mechanism of network evolution.
Collapse
Affiliation(s)
- Wan Kyu Kim
- Center for Systems and Synthetic Biology, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, United States of America
| | - Edward M. Marcotte
- Center for Systems and Synthetic Biology, Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, Texas, United States of America
- * E-mail:
| |
Collapse
|
124
|
Vera J, Millat T, Kolch W, Wolkenhauer O. Dynamics of receptor and protein transducer homodimerisation. BMC SYSTEMS BIOLOGY 2008; 2:92. [PMID: 18976473 PMCID: PMC2650691 DOI: 10.1186/1752-0509-2-92] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2007] [Accepted: 10/31/2008] [Indexed: 11/30/2022]
Abstract
Background Signalling pathways are complex systems in which not only simple monomeric molecules interact, but also more complex structures that include constitutive or induced protein assemblies. In particular, the hetero-and homo-dimerisation of proteins is a commonly encountered motif in signalling pathways. Several authors have suggested in recent times that dimerisation relates to a series of physical and biological outcomes used by the cell in the regulation of signal transduction. Results In this paper we investigate the role of homodimerisation in receptor-protein transducer interactions. Towards this end, mathematical modelling is used to analyse the features of such kind of interactions and to predict the behaviour of the system under different experimental conditions. A kinetic model in which the interaction between homodimers provokes a dual mechanism of activation (single and double protein transducer activation at the same time) is proposed. In addition, we analyse under which conditions the use of a power-law representation for the system is useful. Furthermore, we investigate the dynamical consequences of this dual mechanism and compare the performance of the system in different simulated experimental conditions. Conclusion The analysis of our mathematical model suggests that in receptor-protein interacting systems with dual mechanism there may be a shift between double and single activation in a way that intense double protein transducer activation could initiate and dominate the signal in the short term (getting a fast intense signal), while single protein activation could control the system in the medium and long term (when input signal is weaker and decreases slowly). Our investigation suggests that homodimerisation and oligomerisation are mechanisms used to enhance and regulate the dynamic properties of the initial steps in signalling pathways.
Collapse
Affiliation(s)
- Julio Vera
- University of Rostock, Rostock, Germany.
| | | | | | | |
Collapse
|
125
|
Vieira FG, Sánchez-Gracia A, Rozas J. Comparative genomic analysis of the odorant-binding protein family in 12 Drosophila genomes: purifying selection and birth-and-death evolution. Genome Biol 2008; 8:R235. [PMID: 18039354 PMCID: PMC2258175 DOI: 10.1186/gb-2007-8-11-r235] [Citation(s) in RCA: 136] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2007] [Revised: 06/08/2007] [Accepted: 11/08/2007] [Indexed: 01/18/2023] Open
Abstract
The comparative analysis of the odorant binding protein family in 12 Drosophila genomes allowed the identification of 595 putative family member genes and revealed insights into the evolution of this family in these species. Background Chemoreception is a widespread mechanism that is involved in critical biologic processes, including individual and social behavior. The insect peripheral olfactory system comprises three major multigene families: the olfactory receptor (Or), the gustatory receptor (Gr), and the odorant-binding protein (OBP) families. Members of the latter family establish the first contact with the odorants, and thus constitute the first step in the chemosensory transduction pathway. Results Comparative analysis of the OBP family in 12 Drosophila genomes allowed the identification of 595 genes that encode putative functional and nonfunctional members in extant species, with 43 gene gains and 28 gene losses (15 deletions and 13 pseudogenization events). The evolution of this family shows tandem gene duplication events, progressive divergence in DNA and amino acid sequence, and prevalence of pseudogenization events in external branches of the phylogenetic tree. We observed that the OBP arrangement in clusters is maintained across the Drosophila species and that purifying selection governs the evolution of the family; nevertheless, OBP genes differ in their functional constraints levels. Finally, we detect that the OBP repertoire evolves more rapidly in the specialist lineages of the Drosophila melanogaster group (D. sechellia and D. erecta) than in their closest generalists. Conclusion Overall, the evolution of the OBP multigene family is consistent with the birth-and-death model. We also found that members of this family exhibit different functional constraints, which is indicative of some functional divergence, and that they might be involved in some of the specialization processes that occurred through the diversification of the Drosophila genus.
Collapse
Affiliation(s)
- Filipe G Vieira
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Av, Diagonal 645, Barcelona 08028, Spain.
| | | | | |
Collapse
|
126
|
Makino T, McLysaght A. Interacting gene clusters and the evolution of the vertebrate immune system. Mol Biol Evol 2008; 25:1855-62. [PMID: 18573844 DOI: 10.1093/molbev/msn137] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Unraveling the "code" of genome structure is an important goal of genomics research. Colocalization of genes in eukaryotic genomes may facilitate preservation of favorable allele combinations between epistasic loci or coregulation of functionally related genes. However, the presence of interacting gene clusters in the human genome has remained unclear. We systematically searched the human genome for evidence of closely linked genes whose protein products interact. We find 83 pairs of interacting genes that are located within 1 Mbp in the human genome or 37 if we exclude hub proteins. This number of interacting gene clusters is significantly more than expected by chance and is not the result of tandem duplications. Furthermore, we find that these clusters are significantly more conserved across vertebrate (but not chordate) genomes than other pairs of genes located within 1 Mbp in the human genome. In many cases, the genes are both present but not clustered in older vertebrate lineages. These results suggest gene cluster creation along the human lineage. These clusters are not enriched for housekeeping genes, but we find a significant contribution from genes involved in "response to stimulus." Many of these genes are involved in the immune response, including, but not limited to, known clusters such as the major histocompatibility complex. That these clusters were formed contemporaneously with the origin of adaptive immunity within the vertebrate lineage suggests that novel evolutionary and regulatory constraints were associated with the operation of the immune system.
Collapse
Affiliation(s)
- Takashi Makino
- Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin, Ireland
| | | |
Collapse
|
127
|
Levy ED, Boeri Erba E, Robinson CV, Teichmann SA. Assembly reflects evolution of protein complexes. Nature 2008; 453:1262-5. [PMID: 18563089 DOI: 10.1038/nature06942] [Citation(s) in RCA: 327] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2007] [Accepted: 03/20/2008] [Indexed: 02/03/2023]
Abstract
A homomer is formed by self-interacting copies of a protein unit. This is functionally important, as in allostery, and structurally crucial because mis-assembly of homomers is implicated in disease. Homomers are widespread, with 50-70% of proteins with a known quaternary state assembling into such structures. Despite their prevalence, their role in the evolution of cellular machinery and the potential for their use in the design of new molecular machines, little is known about the mechanisms that drive formation of homomers at the level of evolution and assembly in the cell. Here we present an analysis of over 5,000 unique atomic structures and show that the quaternary structure of homomers is conserved in over 70% of protein pairs sharing as little as 30% sequence identity. Where quaternary structure is not conserved among the members of a protein family, a detailed investigation revealed well-defined evolutionary pathways by which proteins transit between different quaternary structure types. Furthermore, we show by perturbing subunit interfaces within complexes and by mass spectrometry analysis, that the (dis)assembly pathway mimics the evolutionary pathway. These data represent a molecular analogy to Haeckel's evolutionary paradigm of embryonic development, where an intermediate in the assembly of a complex represents a form that appeared in its own evolutionary history. Our model of self-assembly allows reliable prediction of evolution and assembly of a complex solely from its crystal structure.
Collapse
Affiliation(s)
- Emmanuel D Levy
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, UK.
| | | | | | | |
Collapse
|
128
|
Levy ED, Pereira-Leal JB. Evolution and dynamics of protein interactions and networks. Curr Opin Struct Biol 2008; 18:349-57. [DOI: 10.1016/j.sbi.2008.03.003] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2007] [Revised: 03/04/2008] [Accepted: 03/04/2008] [Indexed: 12/29/2022]
|
129
|
Taverner T, Hernández H, Sharon M, Ruotolo BT, Matak-Vinković D, Devos D, Russell RB, Robinson CV. Subunit architecture of intact protein complexes from mass spectrometry and homology modeling. Acc Chem Res 2008; 41:617-27. [PMID: 18314965 DOI: 10.1021/ar700218q] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Proteomic studies have yielded detailed lists of protein components. Relatively little is known, however, of interactions between proteins or of their spatial arrangement. To bridge this gap, we are developing a mass spectrometry approach based on intact protein complexes. By studying intact complexes, we show that we are able to not only determine the stoichiometry of all subunits present but also deduce interaction maps and topological arrangements of subunits. To construct an interaction network, we use tandem mass spectrometry to define peripheral subunits and partial denaturation in solution to generate series of subcomplexes. These subcomplexes are subsequently assigned using tandem mass spectrometry. To facilitate this assignment process, we have developed an iterative search algorithm (SUMMIT) to both assign protein subcomplexes and generate protein interaction networks. This software package not only allows us to construct the subunit architecture of protein assemblies but also allows us to explore the limitations and potential of our approach. Using series of hypothetical complexes, generated at random from protein assemblies containing between six and fourteen subunits, we highlight the significance of tandem mass spectrometry for defining subunits present. We also demonstrate the importance of pairwise interactions and the optimal numbers of subcomplexes required to assign networks with up to fourteen subunits. To illustrate application of our approach, we describe the overall architecture of two endogenous protein assemblies isolated from yeast at natural expression levels, the 19S proteasome lid and the RNA exosome. In constructing our models, we did not consider previous electron microscopy images but rather deduced the subunit architecture from series of subcomplexes and our network algorithm. The results show that the proteasome lid complex consists of a bicluster with two tetrameric lobes. The exosome lid, by contrast, is a six-membered ring with three additional bridging subunits that confer stability to the ring and with a large subunit located at the base. Significantly, by combining data from MS and homology modeling, we were able to construct an atomic model of the yeast exosome. In summary, the architectural and atomic models of both protein complexes described here have been produced in advance of high-resolution structural data and as such provide an initial model for testing hypotheses and planning future experiments. In the case of the yeast exosome, the atomic model is validated by comparison with the atomic structure from X-ray diffraction of crystals of the reconstituted human exosome, which is homologous to that of the yeast. Overall therefore this mass spectrometry and homology modeling approach has given significant insight into the structure of two previously intractable protein complexes and as such has broad application in structural biology.
Collapse
Affiliation(s)
- Thomas Taverner
- Department of Chemistry, Lensfield Road, University of Cambridge, Cambridge CB2 1EW, U.K
| | - Helena Hernández
- Department of Chemistry, Lensfield Road, University of Cambridge, Cambridge CB2 1EW, U.K
| | - Michal Sharon
- Department of Chemistry, Lensfield Road, University of Cambridge, Cambridge CB2 1EW, U.K
| | - Brandon T. Ruotolo
- Department of Chemistry, Lensfield Road, University of Cambridge, Cambridge CB2 1EW, U.K
| | | | - Damien Devos
- EMBL, Meyerhofstrasse 1, 69177 Heidelberg, Germany
| | | | - Carol V. Robinson
- Department of Chemistry, Lensfield Road, University of Cambridge, Cambridge CB2 1EW, U.K
| |
Collapse
|
130
|
Amoutzias GD, Robertson DL, Van de Peer Y, Oliver SG. Choose your partners: dimerization in eukaryotic transcription factors. Trends Biochem Sci 2008; 33:220-9. [PMID: 18406148 DOI: 10.1016/j.tibs.2008.02.002] [Citation(s) in RCA: 178] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2007] [Revised: 02/12/2008] [Accepted: 02/15/2008] [Indexed: 11/16/2022]
Abstract
In many eukaryotic transcription factor gene families, proteins require a physical interaction with an identical molecule or with another molecule within the same family to form a functional dimer and bind DNA. Depending on the choice of partner and the cellular context, each dimer triggers a sequence of regulatory events that lead to a particular cellular fate, for example, proliferation or differentiation. Recent syntheses of genomic and functional data reveal that partner choice is not random; instead, dimerization specificities, which are strongly linked to the evolution of the protein family, apply. Our focus is on understanding these interaction specificities, their functional consequences and how they evolved. This knowledge is essential for understanding gene regulation and designing a new generation of drugs.
Collapse
Affiliation(s)
- Grigoris D Amoutzias
- Department of Plant Systems Biology, VIB, Ghent University, Technologiepark 927, B-9052 Ghent, Belgium
| | | | | | | |
Collapse
|
131
|
Bushell KM, Söllner C, Schuster-Boeckler B, Bateman A, Wright GJ. Large-scale screening for novel low-affinity extracellular protein interactions. Genome Res 2008; 18:622-30. [PMID: 18296487 DOI: 10.1101/gr.7187808] [Citation(s) in RCA: 162] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Extracellular protein-protein interactions are essential for both intercellular communication and cohesion within multicellular organisms. Approximately a fifth of human genes encode membrane-tethered or secreted proteins, but they are largely absent from recent large-scale protein interaction datasets, making current interaction networks biased and incomplete. This discrepancy is due to the unsuitability of popular high-throughput methods to detect extracellular interactions because of the biochemical intractability of membrane proteins and their interactions. For example, cell surface proteins contain insoluble hydrophobic transmembrane regions, and their extracellular interactions are often highly transient, having half-lives of less than a second. To detect transient extracellular interactions on a large scale, we developed AVEXIS (avidity-based extracellular interaction screen), a high-throughput assay that overcomes these technical issues and can detect very transient interactions (half-lives <or= 0.1 sec) with a low false-positive rate. We used it to systematically screen for receptor-ligand pairs within the zebrafish immunoglobulin superfamily and identified novel ligands for both well-known and orphan receptors. Genes encoding receptor-ligand pairs were often clustered phylogenetically and expressed in the same or adjacent tissues, immediately implying their involvement in similar biological processes. Using AVEXIS, we have determined the first systematic low-affinity extracellular protein interaction network, supported by independent biological data. This technique will now allow large-scale extracellular protein interaction mapping in a broad range of experimental contexts.
Collapse
Affiliation(s)
- K Mark Bushell
- Cell Surface Signalling Laboratory, Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1HH, United Kingdom
| | | | | | | | | |
Collapse
|
132
|
Presser A, Elowitz MB, Kellis M, Kishony R. The evolutionary dynamics of the Saccharomyces cerevisiae protein interaction network after duplication. Proc Natl Acad Sci U S A 2008; 105:950-4. [PMID: 18199840 PMCID: PMC2242688 DOI: 10.1073/pnas.0707293105] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2007] [Indexed: 11/18/2022] Open
Abstract
Gene duplication is an important mechanism in the evolution of protein interaction networks. Duplications are followed by the gain and loss of interactions, rewiring the network at some unknown rate. Because rewiring is likely to change the distribution of network motifs within the duplicated interaction set, it should be possible to study network rewiring by tracking the evolution of these motifs. We have developed a mathematical framework that, together with duplication data from comparative genomic and proteomic studies, allows us to infer the connectivity of the preduplication network and the changes in connectivity over time. We focused on the whole-genome duplication (WGD) event in Saccharomyces cerevisiae. The model allowed us to predict the frequency of intergene interaction before WGD and the post duplication probabilities of interaction gain and loss. We find that the predicted frequency of self-interactions in the preduplication network is significantly higher than that observed in today's network. This could suggest a structural difference between the modern and ancestral networks, preferential addition or retention of interactions between ohnologs, or selective pressure to preserve duplicates of self-interacting proteins.
Collapse
Affiliation(s)
- Aviva Presser
- *School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Broad Institute, Cambridge, MA 02142
| | - Michael B. Elowitz
- Division of Biology and Department of Applied Physics, California Institute of Technology, Pasadena, CA 91125
| | - Manolis Kellis
- Broad Institute, Cambridge, MA 02142
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139; and
| | - Roy Kishony
- *School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115
| |
Collapse
|
133
|
Shimoda Y, Shinpo S, Kohara M, Nakamura Y, Tabata S, Sato S. A large scale analysis of protein-protein interactions in the nitrogen-fixing bacterium Mesorhizobium loti. DNA Res 2008; 15:13-23. [PMID: 18192278 PMCID: PMC2650630 DOI: 10.1093/dnares/dsm028] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Global viewing of protein–protein interactions (PPIs) is a useful way to assign biological roles to large numbers of proteins predicted by complete genome sequence. Here, we systematically analyzed PPIs in the nitrogen-fixing soil bacterium Mesorhizobium loti using a modified high-throughput yeast two-hybrid system. The aims of this study are primarily on the providing functional clues to M. loti proteins that are relevant to symbiotic nitrogen fixation and conserved in other rhizobium species, especially proteins with regulatory functions and unannotated proteins. By the screening of 1542 genes as bait, 3121 independent interactions involving 1804 proteins (24% of the total protein coding genes) were identified and each interaction was evaluated using an interaction generality (IG) measure and the general features of the interacting partners. Most PPIs detected in this study are novel interactions revealing potential functional relationships between genes for symbiotic nitrogen fixation and signal transduction. Furthermore, we have predicted the putative functions of unannotated proteins through their interactions with known proteins. The results described here represent new insight into protein network of M. loti and provide useful experimental clues to elucidate the biological function of rhizobial genes that can not be assigned directly from their genomic sequence.
Collapse
Affiliation(s)
- Yoshikazu Shimoda
- Kazusa DNA Research Institute, 2-6-7 Kazusa-kamatari, Kisarazu, Chiba 292-0818, Japan
| | | | | | | | | | | |
Collapse
|
134
|
Semple JI, Vavouri T, Lehner B. A simple principle concerning the robustness of protein complex activity to changes in gene expression. BMC SYSTEMS BIOLOGY 2008; 2:1. [PMID: 18171472 PMCID: PMC2242779 DOI: 10.1186/1752-0509-2-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/03/2007] [Accepted: 01/02/2008] [Indexed: 11/12/2022]
Abstract
Background The functions of a eukaryotic cell are largely performed by multi-subunit protein complexes that act as molecular machines or information processing modules in cellular networks. An important problem in systems biology is to understand how, in general, these molecular machines respond to perturbations. Results In yeast, genes that inhibit growth when their expression is reduced are strongly enriched amongst the subunits of multi-subunit protein complexes. This applies to both the core and peripheral subunits of protein complexes, and the subunits of each complex normally have the same loss-of-function phenotypes. In contrast, genes that inhibit growth when their expression is increased are not enriched amongst the core or peripheral subunits of protein complexes, and the behaviour of one subunit of a complex is not predictive for the other subunits with respect to over-expression phenotypes. Conclusion We propose the principle that the overall activity of a protein complex is in general robust to an increase, but not to a decrease in the expression of its subunits. This means that whereas phenotypes resulting from a decrease in gene expression can be predicted because they cluster on networks of protein complexes, over-expression phenotypes cannot be predicted in this way. We discuss the implications of these findings for understanding how cells are regulated, how they evolve, and how genetic perturbations connect to disease in humans.
Collapse
Affiliation(s)
- Jennifer I Semple
- EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), UPF, Dr, Aiguader 88, Barcelona 08003, Spain.
| | | | | |
Collapse
|
135
|
Rasteiro R, Pereira-Leal JB. Multiple domain insertions and losses in the evolution of the Rab prenylation complex. BMC Evol Biol 2007; 7:140. [PMID: 17705859 PMCID: PMC1994686 DOI: 10.1186/1471-2148-7-140] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2007] [Accepted: 08/17/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Rab proteins are regulators of vesicular trafficking, requiring a lipid modification for proper function, prenylation of C-terminal cysteines. This is catalysed by a complex of a catalytic heterodimer (Rab Geranylgeranyl Transferase - RabGGTase) and an accessory protein (Rab Escort Protein. REP). Components of this complex display domain insertions relative to paralogous proteins. The function of these inserted domains is unclear. RESULTS We profiled the domain architecture of the components of the Rab prenylation complex in evolution. We identified the orthologues of the components of the Rab prenylation machinery in 43 organisms, representing the crown eukaryotic groups. We characterize in detail the domain structure of all these components and the phylogenetic relationships between the individual domains. CONCLUSION We found different domain insertions in different taxa, in alpha-subunits of RGGTase and REP. Our results suggest that there were multiple insertions, expansions and contractions in the evolution of this prenylation complex.
Collapse
Affiliation(s)
- Rita Rasteiro
- Instituto Gulbenkian de Ciência, Apartado 14, P-2781-901 Oeiras Portugal
| | | |
Collapse
|