1
|
Xie J, Dong R, Zhu J, Lin H, Wang S, Lai L. MMFuncPhos: A Multi-Modal Learning Framework for Identifying Functional Phosphorylation Sites and Their Regulatory Types. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025:e2410981. [PMID: 39804866 DOI: 10.1002/advs.202410981] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2024] [Revised: 12/05/2024] [Indexed: 01/16/2025]
Abstract
Protein phosphorylation plays a crucial role in regulating a wide range of biological processes, and its dysregulation is strongly linked to various diseases. While many phosphorylation sites have been identified so far, their functionality and regulatory effects are largely unknown. Here, a deep learning model MMFuncPhos, based on a multi-modal deep learning framework, is developed to predict functional phosphorylation sites. MMFuncPhos outperforms existing functional phosphorylation site prediction approaches. EFuncType is further developed based on transfer learning to predict whether phosphorylation of a residue upregulates or downregulates enzyme activity for the first time. The functional phosphorylation sites predicted by MMFuncPhos and the regulatory types predicted by EFuncType align with experimental findings from several newly reported protein phosphorylation studies. The study contributes to the understanding of the functional regulatory mechanism of phosphorylation and provides valuable tools for precision medicine, enzyme engineering, and drug discovery. For user convenience, these two prediction models are integrated into a web server which can be accessed at http://pkumdl.cn:8000/mmfuncphos.
Collapse
Affiliation(s)
- Juan Xie
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Ruihan Dong
- PTN Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Jintao Zhu
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Haoyu Lin
- Peking-Tsinghua Center for Life Sciences at BNLMS, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China
| | - Shiwei Wang
- Peking University Chengdu Academy for Advanced Interdisciplinary Biotechnologies, Chengdu, Sichuan, 610213, China
| | - Luhua Lai
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
- Peking-Tsinghua Center for Life Sciences at BNLMS, College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China
- Peking University Chengdu Academy for Advanced Interdisciplinary Biotechnologies, Chengdu, Sichuan, 610213, China
- Research Unit of Drug Design Method, Chinese Academy of Medical Sciences (2021RU014), Beijing, 100871, China
| |
Collapse
|
2
|
Desai M, Chowdhury SR, Sun B. A quest for cytosolic sequons and their functions. Sci Rep 2024; 14:7736. [PMID: 38565583 PMCID: PMC10987669 DOI: 10.1038/s41598-024-57334-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 03/18/2024] [Indexed: 04/04/2024] Open
Abstract
Evolution shapes protein sequences for their functions. Here, we studied the moonlighting functions of the N-linked sequon NXS/T, where X is not P, in human nucleocytosolic proteins. By comparing membrane and secreted proteins in which sequons are well known for N-glycosylation, we discovered that cyto-sequons can participate in nucleic acid binding, particularly in zinc finger proteins. Our global studies further discovered that sequon occurrence is largely proportional to protein length. The contribution of sequons to protein functions, including both N-glycosylation and nucleic acid binding, can be regulated through their density as well as the biased usage between NXS and NXT. In proteins where other PTMs or structural features are rich, such as phosphorylation, transmembrane ɑ-helices, and disulfide bridges, sequon occurrence is scarce. The information acquired here should help understand the relationship between protein sequence and function and assist future protein design and engineering.
Collapse
Affiliation(s)
- Manthan Desai
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | | | - Bingyun Sun
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
- Department of Chemistry, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada.
| |
Collapse
|
3
|
High-throughput functional characterization of protein phosphorylation sites in yeast. Nat Biotechnol 2022; 40:382-390. [PMID: 34663920 PMCID: PMC7612524 DOI: 10.1038/s41587-021-01051-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 08/09/2021] [Indexed: 12/11/2022]
Abstract
Phosphorylation is a critical post-translational modification involved in the regulation of almost all cellular processes. However, fewer than 5% of thousands of recently discovered phosphosites have been functionally annotated. In this study, we devised a chemical genetic approach to study the functional relevance of phosphosites in Saccharomyces cerevisiae. We generated 474 yeast strains with mutations in specific phosphosites that were screened for fitness in 102 conditions, along with a gene deletion library. Of these phosphosites, 42% exhibited growth phenotypes, suggesting that these are more likely functional. We inferred their function based on the similarity of their growth profiles with that of gene deletions and validated a subset by thermal proteome profiling and lipidomics. A high fraction exhibited phenotypes not seen in the corresponding gene deletion, suggestive of a gain-of-function effect. For phosphosites conserved in humans, the severity of the yeast phenotypes is indicative of their human functional relevance. This high-throughput approach allows for functionally characterizing individual phosphosites at scale.
Collapse
|
4
|
Huckstep H, Fearnley LG, Davis MJ. Measuring pathway database coverage of the phosphoproteome. PeerJ 2021; 9:e11298. [PMID: 34113485 PMCID: PMC8162239 DOI: 10.7717/peerj.11298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
Protein phosphorylation is one of the best known post-translational mechanisms playing a key role in the regulation of cellular processes. Over 100,000 distinct phosphorylation sites have been discovered through constant improvement of mass spectrometry based phosphoproteomics in the last decade. However, data saturation is occurring and the bottleneck of assigning biologically relevant functionality to phosphosites needs to be addressed. There has been finite success in using data-driven approaches to reveal phosphosite functionality due to a range of limitations. The alternate, more suitable approach is making use of prior knowledge from literature-derived databases. Here, we analysed seven widely used databases to shed light on their suitability to provide functional insights into phosphoproteomics data. We first determined the global coverage of each database at both the protein and phosphosite level. We also determined how consistent each database was in its phosphorylation annotations compared to a global standard. Finally, we looked in detail at the coverage of each database over six experimental datasets. Our analysis highlights the relative strengths and weaknesses of each database, providing a guide in how each can be best used to identify biological mechanisms in phosphoproteomic data.
Collapse
Affiliation(s)
- Hannah Huckstep
- Division of Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Parkville, Victoria, Australia
| | - Liam G. Fearnley
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Parkville, Victoria, Australia
- Division of Population Health, Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
| | - Melissa J. Davis
- Division of Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Parkville, Victoria, Australia
- Department of Clinical Pathology, Faculty of Medicine, Dentistry and Health Sciences, University of Melbourne, Melbourne, Victoria, Australia
| |
Collapse
|
5
|
The evolutionary scaling of cellular traits imposed by the drift barrier. Proc Natl Acad Sci U S A 2020; 117:10435-10444. [PMID: 32345718 DOI: 10.1073/pnas.2000446117] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Owing to internal homeostatic mechanisms, cellular traits may experience long periods of stable selective pressures, during which the stochastic forces of drift and mutation conspire to generate variation. However, even in the face of invariant selection, the drift barrier defined by the genetic effective population size, which is negatively associated with organism size, can have a substantial influence on the location and dispersion of the long-term steady-state distribution of mean phenotypes. In addition, for multilocus traits, the multiplicity of alternative, functionally equivalent states can draw mean phenotypes away from selective optima, even in the absence of mutation bias. Using a framework for traits with an additive genetic basis, it is shown that 1) optimal phenotypic states may be only rarely achieved; 2) gradients of mean phenotypes with respect to organism size (i.e., allometric relationships) are likely to be molded by differences in the power of random genetic drift across the tree of life; and 3) for any particular set of population-genetic conditions, significant variation in mean phenotypes may exist among lineages exposed to identical selection pressures. These results provide a potentially useful framework for understanding numerous aspects of cellular diversification and illustrate the risks of interpreting such variation in a purely adaptive framework.
Collapse
|
6
|
Ochoa D, Jarnuczak AF, Viéitez C, Gehre M, Soucheray M, Mateus A, Kleefeldt AA, Hill A, Garcia-Alonso L, Stein F, Krogan NJ, Savitski MM, Swaney DL, Vizcaíno JA, Noh KM, Beltrao P. The functional landscape of the human phosphoproteome. Nat Biotechnol 2020; 38:365-373. [PMID: 31819260 PMCID: PMC7100915 DOI: 10.1038/s41587-019-0344-3] [Citation(s) in RCA: 259] [Impact Index Per Article: 51.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 11/05/2019] [Indexed: 12/18/2022]
Abstract
Protein phosphorylation is a key post-translational modification regulating protein function in almost all cellular processes. Although tens of thousands of phosphorylation sites have been identified in human cells, approaches to determine the functional importance of each phosphosite are lacking. Here, we manually curated 112 datasets of phospho-enriched proteins, generated from 104 different human cell types or tissues. We re-analyzed the 6,801 proteomics experiments that passed our quality control criteria, creating a reference phosphoproteome containing 119,809 human phosphosites. To prioritize functional sites, we used machine learning to identify 59 features indicative of proteomic, structural, regulatory or evolutionary relevance and integrate them into a single functional score. Our approach identifies regulatory phosphosites across different molecular mechanisms, processes and diseases, and reveals genetic susceptibilities at a genomic scale. Several regulatory phosphosites were experimentally validated, including identifying a role in neuronal differentiation for phosphosites in SMARCC2, a member of the SWI/SNF chromatin-remodeling complex.
Collapse
Affiliation(s)
- David Ochoa
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| | - Andrew F Jarnuczak
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Cristina Viéitez
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Maja Gehre
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Margaret Soucheray
- Gladstone Institute of Data Science and Biotechnology, J. David Gladstone Institutes, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology and the Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
| | - André Mateus
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Askar A Kleefeldt
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Anthony Hill
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Luz Garcia-Alonso
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Frank Stein
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Nevan J Krogan
- Gladstone Institute of Data Science and Biotechnology, J. David Gladstone Institutes, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology and the Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
| | - Mikhail M Savitski
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Danielle L Swaney
- Gladstone Institute of Data Science and Biotechnology, J. David Gladstone Institutes, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology and the Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
| | - Juan A Vizcaíno
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Kyung-Min Noh
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Pedro Beltrao
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.
| |
Collapse
|
7
|
A Theoretical Framework for Evolutionary Cell Biology. J Mol Biol 2020; 432:1861-1879. [PMID: 32087200 DOI: 10.1016/j.jmb.2020.02.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2019] [Revised: 01/20/2020] [Accepted: 02/04/2020] [Indexed: 11/24/2022]
Abstract
One of the last uncharted territories in evolutionary biology concerns the link with cell biology. Because all phenotypes ultimately derive from events at the cellular level, this connection is essential to building a mechanism-based theory of evolution. Given the impressive developments in cell biological methodologies at the structural and functional levels, the potential for rapid progress is great. The primary challenge for theory development is the establishment of a quantitative framework that transcends species boundaries. Two approaches to the problem are presented here: establishing the long-term steady-state distribution of mean phenotypes under specific regimes of mutation, selection, and drift and evaluating the energetic costs of cellular structures and functions. Although not meant to be the final word, these theoretical platforms harbor potential for generating insight into a diversity of unsolved problems, ranging from genome structure to cellular architecture to aspects of motility in organisms across the Tree of Life.
Collapse
|
8
|
McDonald M, Trost B, Napper S. Conservation of kinase-phosphorylation site pairings: Evidence for an evolutionarily dynamic phosphoproteome. PLoS One 2018; 13:e0202036. [PMID: 30106995 PMCID: PMC6091962 DOI: 10.1371/journal.pone.0202036] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 07/26/2018] [Indexed: 12/14/2022] Open
Abstract
Kinase-mediated protein phosphorylation is a central mechanism for regulation of cellular responses and phenotypes. While considerable information is available regarding the evolutionary relationships within the kinase family, as well as the evolutionary conservation of phosphorylation sites, each aspect of this partnership is typically considered in isolation, despite their clear functional relationship. Here, to offer a more holistic perspective on the evolution of protein phosphorylation, the conservation of protein phosphorylation sites is considered in the context of the conservation of the corresponding modifying kinases. Specifically, conservation of defined kinase-phosphorylation site pairings (KPSPs), as well as of each of the component parts (the kinase and the phosphorylation site), were examined across a range of species. As expected, greater evolutionary distance between species was generally associated with lower probability of KPSP conservation, and only a small fraction of KPSPs were maintained across all species, with the vast majority of KPSP losses due to the absence of the phosphorylation site. This supports a model in which a relatively stable kinome promotes the emergence of functional substrates from an evolutionarily malleable phosphoproteome.
Collapse
Affiliation(s)
- Megan McDonald
- Vaccine and Infectious Disease Organization-International Vaccine Research Center, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
- Department of Biochemistry, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Brett Trost
- Vaccine and Infectious Disease Organization-International Vaccine Research Center, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
- Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Scott Napper
- Vaccine and Infectious Disease Organization-International Vaccine Research Center, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
- Department of Biochemistry, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
- * E-mail:
| |
Collapse
|
9
|
Abrahams L, Hurst LD. Adenine Enrichment at the Fourth CDS Residue in Bacterial Genes Is Consistent with Error Proofing for +1 Frameshifts. Mol Biol Evol 2018; 34:3064-3080. [PMID: 28961919 PMCID: PMC5850271 DOI: 10.1093/molbev/msx223] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Beyond selection for optimal protein functioning, coding sequences (CDSs) are under selection at the RNA and DNA levels. Here, we identify a possible signature of “dual-coding,” namely extensive adenine (A) enrichment at bacterial CDS fourth sites. In 99.07% of studied bacterial genomes, fourth site A use is greater than expected given genomic A-starting codon use. Arguing for nucleotide level selection, A-starting serine and arginine second codons are heavily utilized when compared with their non-A starting synonyms. Several models have the ability to explain some of this trend. In part, A-enrichment likely reduces 5′ mRNA stability, promoting translation initiation. However T/U, which may also reduce stability, is avoided. Further, +1 frameshifts on the initiating ATG encode a stop codon (TGA) provided A is the fourth residue, acting either as a frameshift “catch and destroy” or a frameshift stop and adjust mechanism and hence implicated in translation initiation. Consistent with both, genomes lacking TGA stop codons exhibit weaker fourth site A-enrichment. Sequences lacking a Shine–Dalgarno sequence and those without upstream leader genes, that may be more error prone during initiation, have greater utilization of A, again suggesting a role in initiation. The frameshift correction model is consistent with the notion that many genomic features are error-mitigation factors and provides the first evidence for site-specific out of frame stop codon selection. We conjecture that the NTG universal start codon may have evolved as a consequence of TGA being a stop codon and the ability of NTGA to rapidly terminate or adjust a ribosome.
Collapse
Affiliation(s)
- Liam Abrahams
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
10
|
Evolution, dynamics and dysregulation of kinase signalling. Curr Opin Struct Biol 2018; 48:133-140. [DOI: 10.1016/j.sbi.2017.12.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Revised: 12/19/2017] [Accepted: 12/21/2017] [Indexed: 12/31/2022]
|
11
|
Evolutionary expansion and structural functionalism of the ancient family of profilin proteins. Gene 2017; 626:70-86. [PMID: 28501628 DOI: 10.1016/j.gene.2017.05.024] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Revised: 05/02/2017] [Accepted: 05/09/2017] [Indexed: 02/07/2023]
Abstract
Structure and functional similarities of a recent protein's orthologs with its ancient counterpart are largely determined by the configuration of evolutionary preservation of amino acids. The emergence of genome sequencing databases allowed dissecting the evolutionarily important gene families at a comprehensive and genome-wide scale. The profilin multi-gene family is an ancient, universal, and functionally diverged across kingdoms, which regulates various aspects of cellular development in both prokarya and eukarya, especially cell-wall maintenance through actin sequestering, nucleation and cytokinesis. We performed a meta-analysis of the evolutionary expansion, structural conservation, evolution of function motifs, and transcriptional biases of profilin proteins across kingdoms. An exhaustive search of various genome databases of cyanobacteria, fungi, animalia and plantae kingdoms revealed 172 paralogous/orthologous profilins those were phylogenetically clustered in various groups. Orthologous gene comparisons indicated that segmental and tandem duplication events under strong purifying selection are predominantly responsible for their convoluted structural divergences. Evidently, structural divergences were more prevalent in the paralogs than orthologs, and evolutionary variations in the exon/intron architecture were accomplished by 'exon/intron-gain' and insertion/deletion during sequence-exonization. Remarkably, temporal expression evolution of profilin paralogs/homeologs during cotton fiber domestication provides evolutionary impressions of the selection of highly diverged transcript abundance notably in the fiber morpho-evolution. These results provide global insights into the profilin evolution, their structural design across taxa; and their future utilization in translational research.
Collapse
|
12
|
Johnson JR, Santos SD, Johnson T, Pieper U, Strumillo M, Wagih O, Sali A, Krogan NJ, Beltrao P. Prediction of Functionally Important Phospho-Regulatory Events in Xenopus laevis Oocytes. PLoS Comput Biol 2015; 11:e1004362. [PMID: 26312481 PMCID: PMC4552029 DOI: 10.1371/journal.pcbi.1004362] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 05/27/2015] [Indexed: 01/10/2023] Open
Abstract
The African clawed frog Xenopus laevis is an important model organism for studies in developmental and cell biology, including cell-signaling. However, our knowledge of X. laevis protein post-translational modifications remains scarce. Here, we used a mass spectrometry-based approach to survey the phosphoproteome of this species, compiling a list of 2636 phosphosites. We used structural information and phosphoproteomic data for 13 other species in order to predict functionally important phospho-regulatory events. We found that the degree of conservation of phosphosites across species is predictive of sites with known molecular function. In addition, we predicted kinase-protein interactions for a set of cell-cycle kinases across all species. The degree of conservation of kinase-protein interactions was found to be predictive of functionally relevant regulatory interactions. Finally, using comparative protein structure models, we find that phosphosites within structured domains tend to be located at positions with high conformational flexibility. Our analysis suggests that a small class of phosphosites occurs in positions that have the potential to regulate protein conformation.
Collapse
Affiliation(s)
- Jeffrey R Johnson
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, California, United States of America
| | - Silvia D Santos
- Quantitative Cell Biology group, MRC Clinical Sciences Centre, Imperial College, London, United Kingdom
| | - Tasha Johnson
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, California, United States of America
| | - Ursula Pieper
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, University of California, San Francisco, San Francisco, California, United States of America; Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, University of California, San Francisco, San Francisco, California, United States of America
| | - Marta Strumillo
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany and European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| | - Omar Wagih
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany and European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, University of California, San Francisco, San Francisco, California, United States of America; Department of Pharmaceutical Chemistry, California Institute for Quantitative Biosciences, Byers Hall at Mission Bay, University of California, San Francisco, San Francisco, California, United States of America
| | - Nevan J Krogan
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, California, United States of America; Gladstone Institutes, San Francisco, California, United States of America
| | - Pedro Beltrao
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany and European Bioinformatics Institute (EMBL-EBI), Cambridge, United Kingdom; iBiMED and Department of Health Sciences, University of Aveiro, Aveiro, Portugal
| |
Collapse
|
13
|
Singh GP. Association between intrinsic disorder and serine/threonine phosphorylation in Mycobacterium tuberculosis. PeerJ 2015; 3:e724. [PMID: 25648268 PMCID: PMC4304846 DOI: 10.7717/peerj.724] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 12/21/2014] [Indexed: 01/28/2023] Open
Abstract
Serine/threonine phosphorylation is an important mechanism that is involved in the regulation of protein function. In eukaryotes, phosphorylation occurs predominantly in intrinsically disordered regions of proteins. Though serine/threonine phosphorylation and protein disorder are much less prevalent in prokaryotes, some bacteria have high levels of serine/threonine phosphorylation and disorder, including the medically important M. tuberculosis. Here I show that serine/threonine phosphorylation sites in M. tuberculosis are highly enriched in intrinsically disordered regions, indicating similarity in the substrate recognition mechanisms of eukaryotic and M. tuberculosis kinases. Serine/threonine phosphorylation has been linked to the pathogenicity and survival of M. tuberculosis. Thus, a better understanding of how its kinases recognize their substrates could have important implications in understanding and controlling the biology of this deadly pathogen. These results also indicate that the association between serine/threonine phosphorylation and disorder is not a feature restricted to eukaryotes.
Collapse
Affiliation(s)
- Gajinder Pal Singh
- School of Biotechnology, KIIT University , Patia, Bhubaneswar, Odisha , India
| |
Collapse
|
14
|
Systematic analysis of the in situ crosstalk of tyrosine modifications reveals no additional natural selection on multiply modified residues. Sci Rep 2014; 4:7331. [PMID: 25476580 PMCID: PMC4256647 DOI: 10.1038/srep07331] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Accepted: 11/11/2014] [Indexed: 12/16/2022] Open
Abstract
Recent studies have indicated that different post-translational modifications (PTMs) synergistically orchestrate specific biological processes by crosstalks. However, the preference of the crosstalk among different PTMs and the evolutionary constraint on the PTM crosstalk need further dissections. In this study, the in situ crosstalk at the same positions among three tyrosine PTMs including sulfation, nitration and phosphorylation were systematically analyzed. The experimentally identified sulfation, nitration and phosphorylation sites were collected and integrated with reliable predictions to perform large-scale analyses of in situ crosstalks. From the results, we observed that the in situ crosstalk between sulfation and nitration is significantly under-represented, whereas both sulfation and nitration prefer to co-occupy with phosphorylation at same tyrosines. Further analyses suggested that sulfation and nitration preferentially co-occur with phosphorylation at specific positions in proteins, and participate in distinct biological processes and functions. More interestingly, the long-term evolutionary analysis indicated that multi-PTM targeting tyrosines didn't show any higher conservation than singly modified ones. Also, the analysis of human genetic variations demonstrated that there is no additional functional constraint on inherited disease, cancer or rare mutations of multiply modified tyrosines. Taken together, our systematic analyses provided a better understanding of the in situ crosstalk among PTMs.
Collapse
|
15
|
Landry CR, Freschi L, Zarin T, Moses AM. Turnover of protein phosphorylation evolving under stabilizing selection. Front Genet 2014; 5:245. [PMID: 25101120 PMCID: PMC4107968 DOI: 10.3389/fgene.2014.00245] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 07/08/2014] [Indexed: 12/31/2022] Open
Abstract
Most proteins are regulated by posttranslational modifications and changes in these modifications contribute to evolutionary changes as well as to human diseases. Phosphorylation of serines, threonines, and tyrosines are the most common modifications identified to date in eukaryotic proteomes. While the mode of action and the function of most phosphorylation sites remain unknown, functional studies have shown that phosphorylation affects protein stability, localization and ability to interact. Two broad modes of action have been described for protein phosphorylation. The first mode corresponds to the canonical and qualitative view whereby single phosphorylation sites act as molecular switches that either turn on or off specific protein functions through direct or allosteric effects. The second mode is more akin to a rheostat than a switch. In this case, a group of phosphorylation sites in a given protein region contributes collectively to the modification of the protein, irrespective of the precise position of individual sites, through an aggregate property. Here we discuss these two types of regulation and examine how they affect the rate and patterns of protein phosphorylation evolution. We describe how the evolution of clusters of phosphorylation sites can be studied under the framework of complex traits evolution and stabilizing selection.
Collapse
Affiliation(s)
- Christian R Landry
- Département de Biologie, Université Laval Québec, QC, Canada ; Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval Québec, QC, Canada ; Network for Research on Protein Function, Structure, and Engineering (PROTEO), Univeristé Laval Québec, QC, Canada
| | - Luca Freschi
- Département de Biologie, Université Laval Québec, QC, Canada ; Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval Québec, QC, Canada ; Network for Research on Protein Function, Structure, and Engineering (PROTEO), Univeristé Laval Québec, QC, Canada
| | - Taraneh Zarin
- Department of Cell and Systems Biology, University of Toronto Toronto, ON, Canada
| | - Alan M Moses
- Department of Cell and Systems Biology, University of Toronto Toronto, ON, Canada ; Department of Ecology and Evolutionary Biology, University of Toronto Toronto, ON, Canada ; Center for Analysis of Genome Evolution and Function, University of Toronto Toronto, ON, Canada
| |
Collapse
|
16
|
Gray VE, Liu L, Nirankari R, Hornbeck PV, Kumar S. Signatures of natural selection on mutations of residues with multiple posttranslational modifications. Mol Biol Evol 2014; 31:1641-5. [PMID: 24739307 DOI: 10.1093/molbev/msu137] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Posttranslational modifications (PTMs) regulate molecular structures and functions of proteins by covalently binding to amino acids. Hundreds of thousands of PTMs have been reported for the human proteome, with multiple PTMs known to affect tens of thousands of lysine (K) residues. Our molecular evolutionary analyses show that K residues with multiple PTMs exhibit greater conservation than those with a single PTM, but the difference is rather small. In contrast, short-term evolutionary trends revealed in an analysis of human population variation exhibited a much larger difference. Lysine residues with three PTMs show 1.8-fold enrichment of Mendelian disease-associated variants when compared with K residues with two PTMs, with the latter showing 1.7-fold enrichment of these variants when compared with the K residues with one PTM. Rare polymorphisms in humans show a similar trend, which suggests much greater negative selection against mutations of K residues with multiple PTMs within population. Conversely, common polymorphisms are overabundant at unmodified K residues and at K residues with fewer PTMs. The observed difference between inter- and intraspecies patterns of purifying selection on residues with PTMs suggests extensive species-specific drifting of PTM positions. These results suggest that the functionality of a protein is likely conserved, without necessarily conserving the PTM positions over evolutionary time.
Collapse
Affiliation(s)
- Vanessa E Gray
- Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State University
| | - Li Liu
- Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State University
| | - Ronika Nirankari
- Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State University
| | | | - Sudhir Kumar
- Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State UniversitySchool of Life Sciences, Arizona State UniversityCenter for Genomic Medicine and Research, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
17
|
Li J, Jia J, Li H, Yu J, Sun H, He Y, Lv D, Yang X, Glocker MO, Ma L, Yang J, Li L, Li W, Zhang G, Liu Q, Li Y, Xie L. SysPTM 2.0: an updated systematic resource for post-translational modification. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau025. [PMID: 24705204 PMCID: PMC3975108 DOI: 10.1093/database/bau025] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Post-translational modifications (PTMs) of proteins play essential roles in almost all cellular processes, and are closely related to physiological activity and disease development of living organisms. The development of tandem mass spectrometry (MS/MS) has resulted in a rapid increase of PTMs identified on proteins from different species. The collection and systematic ordering of PTM data should provide invaluable information for understanding cellular processes and signaling pathways regulated by PTMs. For this original purpose we developed SysPTM, a systematic resource installed with comprehensive PTM data and a suite of web tools for annotation of PTMs in 2009. Four years later, there has been a significant advance with the generation of PTM data and, consequently, more sophisticated analysis requirements have to be met. Here we submit an updated version of SysPTM 2.0 (http://lifecenter.sgst.cn/SysPTM/), with almost doubled data content, enhanced web-based analysis tools of PTMBlast, PTMPathway, PTMPhylog, PTMCluster. Moreover, a new session SysPTM-H is constructed to graphically represent the combinatorial histone PTMs and dynamic regulation of histone modifying enzymes, and a new tool PTMGO is added for functional annotation and enrichment analysis. SysPTM 2.0 not only facilitates resourceful annotation of PTM sites but allows systematic investigation of PTM functions by the user. Database URL: http://lifecenter.sgst.cn/SysPTM/.
Collapse
Affiliation(s)
- Jing Li
- Key Laboratory of Biomedical Photonics of Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, P. R. China, Shanghai Center for Bioinformation Technology, Shanghai Institutes of Biomedicine, Shanghai Academy of Science and Technology, Shanghai 201203, P. R. China, Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, P. R. China, Department of Bioinformatics and Biostatistics, Shanghai Jiaotong University, Shanghai 200240, P. R. China, Key Laboratory of Systems Biology, Chinese Academy of Sciences, Shanghai 200031, P. R. China and Proteome Center Rostock, Department for Proteome Research, Institute of Immunology, University of Rostock, Rostock 18055, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Beltrao P, Bork P, Krogan NJ, van Noort V. Evolution and functional cross-talk of protein post-translational modifications. Mol Syst Biol 2013; 9:714. [PMID: 24366814 PMCID: PMC4019982 DOI: 10.1002/msb.201304521] [Citation(s) in RCA: 272] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 11/18/2013] [Accepted: 11/22/2013] [Indexed: 12/19/2022] Open
Abstract
Protein post-translational modifications (PTMs) allow the cell to regulate protein activity and play a crucial role in the response to changes in external conditions or internal states. Advances in mass spectrometry now enable proteome wide characterization of PTMs and have revealed a broad functional role for a range of different types of modifications. Here we review advances in the study of the evolution and function of PTMs that were spurred by these technological improvements. We provide an overview of studies focusing on the origin and evolution of regulatory enzymes as well as the evolutionary dynamics of modification sites. Finally, we discuss different mechanisms of altering protein activity via post-translational regulation and progress made in the large-scale functional characterization of PTM function.
Collapse
Affiliation(s)
- Pedro Beltrao
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)CambridgeUK
| | - Peer Bork
- Structural and Computational Biology UnitEuropean Molecular Biology LaboratoryHeidelbergGermany
- Max‐Delbruck‐Centre for Molecular MedicineBerlin‐BuchGermany
| | - Nevan J. Krogan
- Department of Cellular and Molecular PharmacologyUniversity of CaliforniaSan FranciscoCaliforniaUSA
- California Institute for Quantitative BiosciencesSan FranciscoCaliforniaUSA
- J. David Gladstone InstitutesSan FranciscoCaliforniaUSA
| | - Vera van Noort
- Structural and Computational Biology UnitEuropean Molecular Biology LaboratoryHeidelbergGermany
| |
Collapse
|
19
|
Schumacher J, Ramljak S, Asif AR, Schaffrath M, Zischler H, Herlyn H. Evolutionary conservation of mammalian sperm proteins associates with overall, not tyrosine, phosphorylation in human spermatozoa. J Proteome Res 2013; 12:5370-82. [PMID: 23919900 DOI: 10.1021/pr400228c] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
We investigated possible associations between sequence evolution of mammalian sperm proteins and their phosphorylation status in humans. As a reference, spermatozoa from three normozoospermic men were analyzed combining two-dimensional gel electrophoresis, immunoblotting, and mass spectrometry. We identified 99 sperm proteins (thereof 42 newly described) and determined the phosphorylation status for most of them. Sequence evolution was studied across six mammalian species using nonsynonymous/synonymous rate ratios (dN/dS) and amino acid distances. Site-specific purifying selection was assessed employing average ratios of evolutionary rates at phosphorylated versus nonphosphorylated amino acids (α). According to our data, mammalian sperm proteins do not show statistically significant sequence conservation difference, no matter if the human ortholog is a phosphoprotein with or without tyrosine (Y) phosphorylation. In contrast, overall phosphorylation of human sperm proteins, i.e., phosphorylation at serine (S), threonine (T), and/or Y residues, associates with above-average conservation of sequences. Complementary investigations suggest that numerous protein-protein interactants constrain sequence evolution of sperm phosphoproteins. Although our findings reject a special relevance of Y phosphorylation for sperm functioning, they still indicate that overall phosphorylation substantially contributes to proper functioning of sperm proteins. Hence, phosphorylated sperm proteins might be considered as prime candidates for diagnosis and treatment of reduced male fertility.
Collapse
Affiliation(s)
- Julia Schumacher
- Institute of Anthropology, University Mainz , Anselm-Franz-von-Bentzel-Weg 7, Mainz 55128, Germany
| | | | | | | | | | | |
Collapse
|
20
|
Levy ED, Michnick SW, Landry CR. Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information. Philos Trans R Soc Lond B Biol Sci 2012; 367:2594-606. [PMID: 22889910 DOI: 10.1098/rstb.2012.0078] [Citation(s) in RCA: 79] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
In eukaryotic cells, protein phosphorylation is an important and widespread mechanism used to regulate protein function. Yet, of the thousands of phosphosites identified to date, only a few hundred at best have a characterized function. It was recently shown that these functional sites are significantly more conserved than phosphosites of unknown function, stressing the importance of considering evolutionary conservation in assessing the global functional landscape of phosphosites. This leads us to review studies that examined the impact of phosphorylation on evolutionary conservation. While all these studies have shown that conservation is greater among phosphorylated sites compared with non-phosphorylated ones, the magnitude of this difference varies greatly. Further, not all studies have considered key factors that may influence the rate of phosphosite evolution. Such key factors are their localization in ordered or disordered regions, their stoichiometry or the abundance of their corresponding protein. Here we take into account all of these factors simultaneously, which reveals remarkable evolutionary patterns. First, while it is well established that protein conservation increases with abundance, we show that phosphosites partly follow an opposite trend. More precisely, Saccharomyces cerevisiae phosphosites present among abundant proteins are 1.5 times more likely to diverge in the closely related species Saccharomyces bayanus when compared with phosphosites present in the 5 per cent least abundant proteins. Second, we show that conservation is coupled to stoichiometry, whereby sites frequently phosphorylated are more conserved than those rarely phosphorylated. Finally, we provide a model of functional and noisy or 'accidental' phosphorylation that explains these observations.
Collapse
Affiliation(s)
- Emmanuel D Levy
- Département de Biochimie, Université de Montréal, Montréal, Québec, Canada.
| | | | | |
Collapse
|
21
|
Podder S, Ghosh TC. Evolutionary dynamics of human autoimmune disease genes and malfunctioned immunological genes. BMC Evol Biol 2012; 12:10. [PMID: 22276655 PMCID: PMC3347981 DOI: 10.1186/1471-2148-12-10] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2011] [Accepted: 01/25/2012] [Indexed: 02/01/2023] Open
Abstract
Background One of the main issues of molecular evolution is to divulge the principles in dictating the evolutionary rate differences among various gene classes. Immunological genes have received considerable attention in evolutionary biology as candidates for local adaptation and for studying functionally important polymorphisms. The normal structure and function of immunological genes will be distorted when they experience mutations leading to immunological dysfunctions. Results Here, we examined the fundamental differences between the genes which on mutation give rise to autoimmune or other immune system related diseases and the immunological genes that do not cause any disease phenotypes. Although the disease genes examined are analogous to non-disease genes in product, expression, function, and pathway affiliation, a statistically significant decrease in evolutionary rate has been found in autoimmune disease genes relative to all other immune related diseases and non-disease genes. Possible ways of accumulation of mutation in the three steps of the central dogma (DNA-mRNA-Protein) have been studied to trace the mutational effects predisposed to disease consequence and acquiring higher selection pressure. Principal Component Analysis and Multivariate Regression Analysis have established the predominant role of single nucleotide polymorphisms in guiding the evolutionary rate of immunological disease and non-disease genes followed by m-RNA abundance, paralogs number, fraction of phosphorylation residue, alternatively spliced exon, protein residue burial and protein disorder. Conclusions Our study provides an empirical insight into the etiology of autoimmune disease genes and other immunological diseases. The immediate utility of our study is to help in disease gene identification and may also help in medicinal improvement of immune related disease.
Collapse
|
22
|
Hagai T, Tóth-Petróczy Á, Azia A, Levy Y. The origins and evolution of ubiquitination sites. MOLECULAR BIOSYSTEMS 2012; 8:1865-77. [DOI: 10.1039/c2mb25052g] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
23
|
Zheng LL, Niu S, Hao P, Feng K, Cai YD, Li Y. Prediction of protein modification sites of pyrrolidone carboxylic acid using mRMR feature selection and analysis. PLoS One 2011; 6:e28221. [PMID: 22174779 PMCID: PMC3235115 DOI: 10.1371/journal.pone.0028221] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2011] [Accepted: 11/03/2011] [Indexed: 11/18/2022] Open
Abstract
Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor to predict the modification sites of PCA based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features that belonged to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum of MCC of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of the PCA formation and guide relevant experimental validations.
Collapse
Affiliation(s)
- Lu-Lu Zheng
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Huazhong University of Science and Technology, Wuhan, China
| | | | | | | | | | | |
Collapse
|
24
|
Nishi H, Hashimoto K, Panchenko AR. Phosphorylation in protein-protein binding: effect on stability and function. Structure 2011; 19:1807-15. [PMID: 22153503 PMCID: PMC3240861 DOI: 10.1016/j.str.2011.09.021] [Citation(s) in RCA: 223] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2011] [Revised: 09/21/2011] [Accepted: 09/24/2011] [Indexed: 01/23/2023]
Abstract
Posttranslational modifications offer a dynamic way to regulate protein activity, subcellular localization, and stability. Here we estimate the effect of phosphorylation on protein binding and function for different types of complexes from human proteome. We find that phosphorylation sites tend to be located on binding interfaces in heterooligomeric and weak transient homooligomeric complexes. Analysis of molecular mechanisms of phosphorylation shows that phosphorylation may modulate the strength of interactions directly on interfaces and that binding hotspots tend to be phosphorylated in heterooligomers. Although the majority of complexes do not show significant estimated stability differences upon phosphorylation or dephosphorylation, for about one-third of all complexes it causes relatively large changes in binding energy. We discuss the cases where phosphorylation mediates the complex formation and regulates the function. We show that phosphorylation sites are more likely to be evolutionary conserved than other interfacial residues.
Collapse
Affiliation(s)
- Hafumi Nishi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Kosuke Hashimoto
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Anna R. Panchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|