1
|
Li Y, Gao R, Liu S, Zhang H, Lv H, Lai H. PhosBERT: A self-supervised learning model for identifying phosphorylation sites in SARS-CoV-2-infected human cells. Methods 2024; 230:140-146. [PMID: 39179191 DOI: 10.1016/j.ymeth.2024.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 08/03/2024] [Accepted: 08/17/2024] [Indexed: 08/26/2024] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a single-stranded RNA virus, which mainly causes respiratory and enteric diseases and is responsible for the outbreak of coronavirus disease 19 (COVID-19). Numerous studies have demonstrated that SARS-CoV-2 infection will lead to a significant dysregulation of protein post-translational modification profile in human cells. The accurate recognition of phosphorylation sites in host cells will contribute to a deep understanding of the pathogenic mechanisms of SARS-CoV-2 and also help to screen drugs and compounds with antiviral potential. Therefore, there is a need to develop cost-effective and high-precision computational strategies for specifically identifying SARS-CoV-2-infected phosphorylation sites. In this work, we first implemented a custom neural network model (named PhosBERT) on the basis of a pre-trained protein language model of ProtBert, which was a self-supervised learning approach developed on the Bidirectional Encoder Representation from Transformers (BERT) architecture. PhosBERT was then trained and validated on serine (S) and threonine (T) phosphorylation dataset and tyrosine (Y) phosphorylation dataset with 5-fold cross-validation, respectively. Independent validation results showed that PhosBERT could identify S/T phosphorylation sites with high accuracy and AUC (area under the receiver operating characteristic) value of 81.9% and 0.896. The prediction accuracy and AUC value of Y phosphorylation sites reached up to 87.1% and 0.902. It indicated that the proposed model was of good prediction ability and stability and would provide a new approach for studying SARS-CoV-2 phosphorylation sites.
Collapse
Affiliation(s)
- Yong Li
- Sichuan Vocational College of Health and Rehabilitation, Zigong 643000, Sichuan, China
| | - Ru Gao
- The People's Hospital of Ya 'an, Ya'an 625000, Sichuan, China; The People's Hospital of Wenjiang Chengdu, Chengdu 611130, Sichuan, China
| | - Shan Liu
- The People's Hospital of Wenjiang Chengdu, Chengdu 611130, Sichuan, China
| | - Hongqi Zhang
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Hao Lv
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| | - Hongyan Lai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.
| |
Collapse
|
2
|
Wang S, Di Y, Yang Y, Salovska B, Li W, Hu L, Yin J, Shao W, Zhou D, Cheng J, Liu D, Yang H, Liu Y. PTMoreR-enabled cross-species PTM mapping and comparative phosphoproteomics across mammals. CELL REPORTS METHODS 2024; 4:100859. [PMID: 39255793 PMCID: PMC11440062 DOI: 10.1016/j.crmeth.2024.100859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 05/13/2024] [Accepted: 08/15/2024] [Indexed: 09/12/2024]
Abstract
To support PTM proteomic analysis and annotation in different species, we developed PTMoreR, a user-friendly tool that considers the surrounding amino acid sequences of PTM sites during BLAST, enabling a motif-centric analysis across species. By controlling sequence window similarity, PTMoreR can map phosphoproteomic results between any two species, perform site-level functional enrichment analysis, and generate kinase-substrate networks. We demonstrate that the majority of real P-sites in mice can be inferred from experimentally derived human P-sites with PTMoreR mapping. Furthermore, the compositions of 129 mammalian phosphoproteomes can also be predicted using PTMoreR. The method also identifies cross-species phosphorylation events that occur on proteins with an increased tendency to respond to the environmental factors. Moreover, the classic kinase motifs can be extracted across mammalian species, offering an evolutionary angle for refining current motifs. PTMoreR supports PTM proteomics in non-human species and facilitates quantitative phosphoproteomic analysis.
Collapse
Affiliation(s)
- Shisheng Wang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Yi Di
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Yin Yang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Barbora Salovska
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA
| | - Liqiang Hu
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Jiahui Yin
- Information Research Institute, Tongji University, Shanghai 200092, China
| | - Wenguang Shao
- State Key Laboratory of Microbial Metabolism, School of Life Science & Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Dong Zhou
- Department of Medicine, Division of Nephrology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
| | - Jingqiu Cheng
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Dan Liu
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China; State Key Laboratory of Respiratory Health and Multimorbidity, West China Hospital, Sichuan University, Chengdu 610041, China.
| | - Hao Yang
- Department of Pulmonary and Critical Care Medicine, Proteomics-Metabolomics Analysis Platform, and NHC Key Lab of Transplant Engineering and Immunology, West China Hospital, Sichuan University, Chengdu 610041, China.
| | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT 06516, USA; Department of Pharmacology, Yale University School of Medicine, New Haven, CT 06520, USA; Department of Biomedical Informatics & Data Science, Yale Univeristy School of Medicine, New Haven, CT 06510, USA.
| |
Collapse
|
3
|
Houles T, Yoon SO, Roux PP. The expanding landscape of canonical and non-canonical protein phosphorylation. Trends Biochem Sci 2024:S0968-0004(24)00191-9. [PMID: 39266329 DOI: 10.1016/j.tibs.2024.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 08/01/2024] [Accepted: 08/14/2024] [Indexed: 09/14/2024]
Abstract
Protein phosphorylation is a crucial regulatory mechanism in cell signaling, acting as a molecular switch that modulates protein function. Catalyzed by protein kinases and reversed by phosphoprotein phosphatases, it is essential in both normal physiological and pathological states. Recent advances have uncovered a vast and intricate landscape of protein phosphorylation that include histidine phosphorylation and more unconventional events, such as pyrophosphorylation and polyphosphorylation. Many questions remain about the true size of the phosphoproteome and, more importantly, its site-specific functional relevance. The involvement of unconventional actors such as pseudokinases and pseudophosphatases adds further complexity to be resolved. This review explores recent discoveries and ongoing challenges, highlighting the need for continued research to fully elucidate the roles and regulation of protein phosphorylation.
Collapse
Affiliation(s)
- Thibault Houles
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec, Canada; Institute of Molecular Genetics of Montpellier (IGMM), Université de Montpellier, CNRS, Montpellier, France.
| | - Sang-Oh Yoon
- Department of Physiology and Biophysics, College of Medicine, University of Illinois at Chicago, Chicago, IL 60612, USA
| | - Philippe P Roux
- Institute for Research in Immunology and Cancer (IRIC), Université de Montréal, Montreal, Quebec, Canada; Department of Pathology and Cell Biology, Faculty of Medicine, Université de Montréal, Montreal, Quebec, Canada.
| |
Collapse
|
4
|
Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. EMBO J 2024:10.1038/s44318-024-00200-7. [PMID: 39256561 DOI: 10.1038/s44318-024-00200-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 07/23/2024] [Accepted: 07/24/2024] [Indexed: 09/12/2024] Open
Abstract
The fidelity of signal transduction requires the binding of regulatory molecules to their cognate targets. However, the crowded cell interior risks off-target interactions between proteins that are functionally unrelated. How such off-target interactions impact fitness is not generally known. Here, we use Saccharomyces cerevisiae to inducibly express tyrosine kinases. Because yeast lacks bona fide tyrosine kinases, the resulting tyrosine phosphorylation is biologically spurious. We engineered 44 yeast strains each expressing a tyrosine kinase, and quantitatively analysed their phosphoproteomes. This analysis resulted in ~30,000 phosphosites mapping to ~3500 proteins. The number of spurious pY sites generated correlates strongly with decreased growth, and we predict over 1000 pY events to be deleterious. However, we also find that many of the spurious pY sites have a negligible effect on fitness, possibly because of their low stoichiometry. This result is consistent with our evolutionary analyses demonstrating a lack of phosphotyrosine counter-selection in species with tyrosine kinases. Our results suggest that, alongside the risk for toxicity, the cell can tolerate a large degree of non-functional crosstalk as interaction networks evolve.
Collapse
Affiliation(s)
- David Bradley
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexander Hogrebe
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rohan Dandage
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Ugo Dionne
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexis Chang
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Judit Villén
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
| | - Christian R Landry
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada.
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada.
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada.
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada.
- Department of Biology, Université Laval, Québec, QC, Canada.
| |
Collapse
|
5
|
Ramsbottom KA, Prakash A, Perez-Riverol Y, Camacho OM, Sun Z, Kundu DJ, Bowler-Barnett E, Martin M, Fan J, Chebotarov D, McNally KL, Deutsch EW, Vizcaíno JA, Jones AR. Meta-Analysis of Rice Phosphoproteomics Data to Understand Variation in Cell Signaling Across the Rice Pan-Genome. J Proteome Res 2024; 23:2518-2531. [PMID: 38810119 PMCID: PMC11232104 DOI: 10.1021/acs.jproteome.4c00187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Phosphorylation is the most studied post-translational modification, and has multiple biological functions. In this study, we have reanalyzed publicly available mass spectrometry proteomics data sets enriched for phosphopeptides from Asian rice (Oryza sativa). In total we identified 15,565 phosphosites on serine, threonine, and tyrosine residues on rice proteins. We identified sequence motifs for phosphosites, and link motifs to enrichment of different biological processes, indicating different downstream regulation likely caused by different kinase groups. We cross-referenced phosphosites against the rice 3,000 genomes, to identify single amino acid variations (SAAVs) within or proximal to phosphosites that could cause loss of a site in a given rice variety and clustered the data to identify groups of sites with similar patterns across rice family groups. The data has been loaded into UniProt Knowledge-Base─enabling researchers to visualize sites alongside other data on rice proteins, e.g., structural models from AlphaFold2, PeptideAtlas, and the PRIDE database─enabling visualization of source evidence, including scores and supporting mass spectra.
Collapse
Affiliation(s)
- Kerry A Ramsbottom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, United Kingdom
| | - Ananth Prakash
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Oscar Martin Camacho
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, United Kingdom
| | - Zhi Sun
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Deepti J Kundu
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Emily Bowler-Barnett
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Maria Martin
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Jun Fan
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Dmytro Chebotarov
- International Rice Research Institute, DAPO Box 7777, Manila 1301, Philippines
| | - Kenneth L McNally
- International Rice Research Institute, DAPO Box 7777, Manila 1301, Philippines
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Andrew R Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7BE, United Kingdom
| |
Collapse
|
6
|
Batie M, Fasanya T, Kenneth NS, Rocha S. Oxygen-regulated post-translation modifications as master signalling pathway in cells. EMBO Rep 2023; 24:e57849. [PMID: 37877678 DOI: 10.15252/embr.202357849] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/22/2023] [Accepted: 10/12/2023] [Indexed: 10/26/2023] Open
Abstract
Oxygen is essential for viability in mammalian organisms. However, cells are often exposed to changes in oxygen availability, due to either increased demand or reduced oxygen supply, herein called hypoxia. To be able to survive and/or adapt to hypoxia, cells activate a variety of signalling cascades resulting in changes to chromatin, gene expression, metabolism and viability. Cellular signalling is often mediated via post-translational modifications (PTMs), and this is no different in response to hypoxia. Many enzymes require oxygen for their activity and oxygen can directly influence several PTMS. Here, we review the direct impact of changes in oxygen availability on PTMs such as proline, asparagine, histidine and lysine hydroxylation, lysine and arginine methylation and cysteine dioxygenation, with a focus on mammalian systems. In addition, indirect hypoxia-dependent effects on phosphorylation, ubiquitination and sumoylation will also be discussed. Direct and indirect oxygen-regulated changes to PTMs are coordinated to achieve the cell's ultimate response to hypoxia. However, specific oxygen sensitivity and the functional relevance of some of the identified PTMs still require significant research.
Collapse
Affiliation(s)
- Michael Batie
- Department of Biochemistry, Cell and Systems Biology, Institute of Molecular Systems and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Temitope Fasanya
- Department of Biochemistry, Cell and Systems Biology, Institute of Molecular Systems and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Niall S Kenneth
- Department of Biochemistry, Cell and Systems Biology, Institute of Molecular Systems and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Sonia Rocha
- Department of Biochemistry, Cell and Systems Biology, Institute of Molecular Systems and Integrative Biology, University of Liverpool, Liverpool, UK
| |
Collapse
|
7
|
Ramsbottom KA, Prakash A, Riverol YP, Camacho OM, Sun Z, Kundu DJ, Bowler-Barnett E, Martin M, Fan J, Chebotarov D, McNally KL, Deutsch EW, Vizcaíno JA, Jones AR. A meta-analysis of rice phosphoproteomics data to understand variation in cell signalling across the rice pan-genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.17.567512. [PMID: 38014076 PMCID: PMC10680829 DOI: 10.1101/2023.11.17.567512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Phosphorylation is the most studied post-translational modification, and has multiple biological functions. In this study, we have re-analysed publicly available mass spectrometry proteomics datasets enriched for phosphopeptides from Asian rice (Oryza sativa). In total we identified 15,522 phosphosites on serine, threonine and tyrosine residues on rice proteins. We identified sequence motifs for phosphosites, and link motifs to enrichment of different biological processes, indicating different downstream regulation likely caused by different kinase groups. We cross-referenced phosphosites against the rice 3,000 genomes, to identify single amino acid variations (SAAVs) within or proximal to phosphosites that could cause loss of a site in a given rice variety. The data was clustered to identify groups of sites with similar patterns across rice family groups, for example those highly conserved in Japonica, but mostly absent in Aus type rice varieties - known to have different responses to drought. These resources can assist rice researchers to discover alleles with significantly different functional effects across rice varieties. The data has been loaded into UniProt Knowledge-Base - enabling researchers to visualise sites alongside other data on rice proteins e.g. structural models from AlphaFold2, PeptideAtlas and the PRIDE database - enabling visualisation of source evidence, including scores and supporting mass spectra.
Collapse
Affiliation(s)
- Kerry A Ramsbottom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, United Kingdom
| | - Ananth Prakash
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Yasset Perez Riverol
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Oscar Martin Camacho
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, United Kingdom
| | - Zhi Sun
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Deepti J. Kundu
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Emily Bowler-Barnett
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Maria Martin
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Jun Fan
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Dmytro Chebotarov
- International Rice Research Institute, DAPO 7777, Manila 1301, Philippines
| | - Kenneth L McNally
- International Rice Research Institute, DAPO 7777, Manila 1301, Philippines
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Andrew R Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, United Kingdom
| |
Collapse
|
8
|
Bradley D, Hogrebe A, Dandage R, Dubé AK, Leutert M, Dionne U, Chang A, Villén J, Landry CR. The fitness cost of spurious phosphorylation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.08.561337. [PMID: 37873463 PMCID: PMC10592693 DOI: 10.1101/2023.10.08.561337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The fidelity of signal transduction requires the binding of regulatory molecules to their cognate targets. However, the crowded cell interior risks off-target interactions between proteins that are functionally unrelated. How such off-target interactions impact fitness is not generally known, but quantifying this is required to understand the constraints faced by cell systems as they evolve. Here, we use the model organism S. cerevisiae to inducibly express tyrosine kinases. Because yeast lacks bona fide tyrosine kinases, most of the resulting tyrosine phosphorylation is spurious. This provides a suitable system to measure the impact of artificial protein interactions on fitness. We engineered 44 yeast strains each expressing a tyrosine kinase, and quantitatively analysed their phosphoproteomes. This analysis resulted in ~30,000 phosphosites mapping to ~3,500 proteins. Examination of the fitness costs in each strain revealed a strong correlation between the number of spurious pY sites and decreased growth. Moreover, the analysis of pY effects on protein structure and on protein function revealed over 1000 pY events that we predict to be deleterious. However, we also find that a large number of the spurious pY sites have a negligible effect on fitness, possibly because of their low stoichiometry. This result is consistent with our evolutionary analyses demonstrating a lack of phosphotyrosine counter-selection in species with bona fide tyrosine kinases. Taken together, our results suggest that, alongside the risk for toxicity, the cell can tolerate a large degree of non-functional crosstalk as interaction networks evolve.
Collapse
Affiliation(s)
- David Bradley
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexander Hogrebe
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Rohan Dandage
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexandre K Dubé
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Ugo Dionne
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| | - Alexis Chang
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Judit Villén
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Christian R Landry
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, QC, Canada
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
- Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université du Québec à Montréal, Montréal, QC, Canada
- Université Laval Big Data Research Center (BDRC_UL), Québec, QC, Canada
- Department of Biology, Université Laval, Québec, QC, Canada
| |
Collapse
|
9
|
Daly LA, Clarke CJ, Po A, Oswald SO, Eyers CE. Considerations for defining +80 Da mass shifts in mass spectrometry-based proteomics: phosphorylation and beyond. Chem Commun (Camb) 2023; 59:11484-11499. [PMID: 37681662 PMCID: PMC10521633 DOI: 10.1039/d3cc02909c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 08/21/2023] [Indexed: 09/09/2023]
Abstract
Post-translational modifications (PTMs) are ubiquitous and key to regulating protein function. Understanding the dynamics of individual PTMs and their biological roles requires robust characterisation. Mass spectrometry (MS) is the method of choice for the identification and quantification of protein modifications. This article focusses on the MS-based analysis of those covalent modifications that induce a mass shift of +80 Da, notably phosphorylation and sulfation, given the challenges associated with their discrimination and pinpointing the sites of modification on a polypeptide chain. Phosphorylation in particular is highly abundant, dynamic and can occur on numerous residues to invoke specific functions, hence robust characterisation is crucial to understanding biological relevance. Showcasing our work in the context of other developments in the field, we highlight approaches for enrichment and site localisation of phosphorylated (canonical and non-canonical) and sulfated peptides, as well as modification analysis in the context of intact proteins (top down proteomics) to explore combinatorial roles. Finally, we discuss the application of native ion-mobility MS to explore the effect of these PTMs on protein structure and ligand binding.
Collapse
Affiliation(s)
- Leonard A Daly
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Christopher J Clarke
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Allen Po
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Sally O Oswald
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| | - Claire E Eyers
- Centre for Proteome Research, Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, UK.
| |
Collapse
|
10
|
Kitamura N, Galligan JJ. A global view of the human post-translational modification landscape. Biochem J 2023; 480:1241-1265. [PMID: 37610048 PMCID: PMC10586784 DOI: 10.1042/bcj20220251] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/26/2023] [Accepted: 08/07/2023] [Indexed: 08/24/2023]
Abstract
Post-translational modifications (PTMs) provide a rapid response to stimuli, finely tuning metabolism and gene expression and maintain homeostasis. Advances in mass spectrometry over the past two decades have significantly expanded the list of known PTMs in biology and as instrumentation continues to improve, this list will surely grow. While many PTMs have been studied in detail (e.g. phosphorylation, acetylation), the vast majority lack defined mechanisms for their regulation and impact on cell fate. In this review, we will highlight the field of PTM research as it currently stands, discussing the mechanisms that dictate site specificity, analytical methods for their detection and study, and the chemical tools that can be leveraged to define PTM regulation. In addition, we will highlight the approaches needed to discover and validate novel PTMs. Lastly, this review will provide a starting point for those interested in PTM biology, providing a comprehensive list of PTMs and what is known regarding their regulation and metabolic origins.
Collapse
Affiliation(s)
- Naoya Kitamura
- Department of Pharmacology and College of Pharmacy, University of Arizona, Tucson, Arizona 85721, U.S.A
| | - James J. Galligan
- Department of Pharmacology and College of Pharmacy, University of Arizona, Tucson, Arizona 85721, U.S.A
| |
Collapse
|
11
|
Feng S, Sanford JA, Weber T, Hutchinson-Bunch CM, Dakup PP, Paurus VL, Attah K, Sauro HM, Qian WJ, Wiley HS. A Phosphoproteomics Data Resource for Systems-level Modeling of Kinase Signaling Networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.03.551714. [PMID: 37577496 PMCID: PMC10418157 DOI: 10.1101/2023.08.03.551714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Building mechanistic models of kinase-driven signaling pathways requires quantitative measurements of protein phosphorylation across physiologically relevant conditions, but this is rarely done because of the insensitivity of traditional technologies. By using a multiplexed deep phosphoproteome profiling workflow, we were able to generate a deep phosphoproteomics dataset of the EGFR-MAPK pathway in non-transformed MCF10A cells across physiological ligand concentrations with a time resolution of <12 min and in the presence and absence of multiple kinase inhibitors. An improved phosphosite mapping technique allowed us to reliably identify >46,000 phosphorylation sites on >6600 proteins, of which >4500 sites from 2110 proteins displayed a >2-fold increase in phosphorylation in response to EGF. This data was then placed into a cellular context by linking it to 15 previously published protein databases. We found that our results were consistent with much, but not all previously reported data regarding the activation and negative feedback phosphorylation of core EGFR-ERK pathway proteins. We also found that EGFR signaling is biphasic with substrates downstream of RAS/MAPK activation showing a maximum response at <3ng/ml EGF while direct substrates, such as HGS and STAT5B, showing no saturation. We found that RAS activation is mediated by at least 3 parallel pathways, two of which depend on PTPN11. There appears to be an approximately 4-minute delay in pathway activation at the step between RAS and RAF, but subsequent pathway phosphorylation was extremely rapid. Approximately 80 proteins showed a >2-fold increase in phosphorylation across all experiments and these proteins had a significantly higher median number of phosphorylation sites (~18) relative to total cellular phosphoproteins (~4). Over 60% of EGF-stimulated phosphoproteins were downstream of MAPK and included mediators of cellular processes such as gene transcription, transport, signal transduction and cytoskeletal arrangement. Their phosphorylation was either linear with respect to MAPK activation or biphasic, corresponding to the biphasic signaling seen at the level of the EGFR. This deep, integrated phosphoproteomics data resource should be useful in building mechanistic models of EGFR and MAPK signaling and for understanding how downstream responses are regulated.
Collapse
Affiliation(s)
- Song Feng
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - James A. Sanford
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Thomas Weber
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | | | - Panshak P. Dakup
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Vanessa L. Paurus
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Kwame Attah
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - Herbert M. Sauro
- Department of Bioengineering, University of Washington, Seattle, WA
| | - Wei-Jun Qian
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| | - H. Steven Wiley
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, WA 99352 USA
| |
Collapse
|
12
|
Griswold-Prenner I, Kashyap AK, Mazhar S, Hall ZW, Fazelinia H, Ischiropoulos H. Unveiling the human nitroproteome: Protein tyrosine nitration in cell signaling and cancer. J Biol Chem 2023; 299:105038. [PMID: 37442231 PMCID: PMC10413360 DOI: 10.1016/j.jbc.2023.105038] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 06/28/2023] [Accepted: 07/06/2023] [Indexed: 07/15/2023] Open
Abstract
Covalent amino acid modification significantly expands protein functional capability in regulating biological processes. Tyrosine residues can undergo phosphorylation, sulfation, adenylation, halogenation, and nitration. These posttranslational modifications (PTMs) result from the actions of specific enzymes: tyrosine kinases, tyrosyl-protein sulfotransferase(s), adenylate transferase(s), oxidoreductases, peroxidases, and metal-heme containing proteins. Whereas phosphorylation, sulfation, and adenylation modify the hydroxyl group of tyrosine, tyrosine halogenation and nitration target the adjacent carbon residues. Because aberrant tyrosine nitration has been associated with human disorders and with animal models of disease, we have created an updated and curated database of 908 human nitrated proteins. We have also analyzed this new resource to provide insight into the role of tyrosine nitration in cancer biology, an area that has not previously been considered in detail. Unexpectedly, we have found that 879 of the 1971 known sites of tyrosine nitration are also sites of phosphorylation suggesting an extensive role for nitration in cell signaling. Overall, the review offers several forward-looking opportunities for future research and new perspectives for understanding the role of tyrosine nitration in cancer biology.
Collapse
Affiliation(s)
| | | | | | - Zach W Hall
- Nitrase Therapeutics, Brisbane, California, USA
| | - Hossein Fazelinia
- Children's Hospital of Philadelphia Research Institute, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Harry Ischiropoulos
- Children's Hospital of Philadelphia Research Institute, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
13
|
Franciosa G, Locard-Paulet M, Jensen LJ, Olsen JV. Recent advances in kinase signaling network profiling by mass spectrometry. Curr Opin Chem Biol 2023; 73:102260. [PMID: 36657259 DOI: 10.1016/j.cbpa.2022.102260] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 12/13/2022] [Accepted: 12/14/2022] [Indexed: 01/19/2023]
Abstract
Mass spectrometry-based phosphoproteomics is currently the leading methodology for the study of global kinase signaling. The scientific community is continuously releasing technological improvements for sensitive and fast identification of phosphopeptides, and their accurate quantification. To interpret large-scale phosphoproteomics data, numerous bioinformatic resources are available that help understanding kinase network functional role in biological systems upon perturbation. Some of these resources are databases of phosphorylation sites, protein kinases and phosphatases; others are bioinformatic algorithms to infer kinase activity, predict phosphosite functional relevance and visualize kinase signaling networks. In this review, we present the latest experimental and bioinformatic tools to profile protein kinase signaling networks and provide examples of their application in biomedicine.
Collapse
Affiliation(s)
- Giulia Franciosa
- Proteomics Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Marie Locard-Paulet
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Lars J Jensen
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Jesper V Olsen
- Proteomics Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
14
|
Burnage SC, Bell J, Wan W, Kislenko E, Rurack K. Combining a hybrid chip and tube microfluidic system with fluorescent molecularly imprinted polymer (MIP) core-shell particles for the derivatisation, extraction, and detection of peptides with N-terminating phosphorylated tyrosine. LAB ON A CHIP 2023; 23:466-474. [PMID: 36655759 DOI: 10.1039/d2lc00955b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
The reliable identification and quantitation of phosphorylated amino acids, peptides and proteins is one of the key challenges in contemporary bioanalytical research, an area of particular interest when attempting to diagnose and treat diseases at an early stage. We have developed a synthetic probe for targeting phosphorylated amino acids, based on core-shell submicron-sized particles consisting of a silica core, coated with a molecularly imprinted polymer (MIP) shell. The MIP layer contains a fluorescent probe crosslinker which binds selectively to phosphorylated tyrosine (pY) moieties with a significant imprinting factor (IF) and responds with a "light-up" fluorescence signal. The bead-based ratiometric detection scheme has been successfully transferred to a microfluidic chip format and its applicability to rapid assays has been exemplarily shown by discriminating a pY-terminating oligopeptide against its non-phosphorylated counterpart. Such miniaturised devices could lead to an automated pY or pY N-terminated peptide measurement system in the future. The setup combines a modular microfluidic system for amino acid derivatisation, extraction (by micropillar co-flow) and selective adsorption and detection with the fluorescent MIP core-shell particle probes. A miniaturised optical assembly for low-light fluorescence measurements was also developed, based on miniaturised opto-electronic parts and optical fibres. The emission from the MIP particles upon binding of pY or pY N-terminated peptides could be monitored in real-time.
Collapse
Affiliation(s)
- Samual C Burnage
- Bundesanstalt für Materialforschung und -prüfung (BAM), Richard-Willstätter-Str. 11, 12489 Berlin, Germany.
| | - Jérémy Bell
- Bundesanstalt für Materialforschung und -prüfung (BAM), Richard-Willstätter-Str. 11, 12489 Berlin, Germany.
| | - Wei Wan
- Bundesanstalt für Materialforschung und -prüfung (BAM), Richard-Willstätter-Str. 11, 12489 Berlin, Germany.
| | - Evgeniia Kislenko
- Bundesanstalt für Materialforschung und -prüfung (BAM), Richard-Willstätter-Str. 11, 12489 Berlin, Germany.
| | - Knut Rurack
- Bundesanstalt für Materialforschung und -prüfung (BAM), Richard-Willstätter-Str. 11, 12489 Berlin, Germany.
| |
Collapse
|
15
|
Sámano-Sánchez H, Gibson TJ, Chemes LB. Using Linear Motif Database Resources to Identify SH2 Domain Binders. Methods Mol Biol 2023; 2705:153-197. [PMID: 37668974 DOI: 10.1007/978-1-0716-3393-9_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2023]
Abstract
The SH2-binding phosphotyrosine class of short linear motifs (SLiMs) are key conditional regulatory elements, particularly in signaling protein complexes beneath the cell's plasma membrane. In addition to transmitting cellular signaling information, they can also play roles in cellular hijack by invasive pathogens. Researchers can take advantage of bioinformatics tools and resources to predict the motifs at conserved phosphotyrosine residues in regions of intrinsically disordered protein. A candidate SH2-binding motif can be established and assigned to one or more of the SH2 domain subgroups. It is, however, not so straightforward to predict which SH2 domains are capable of binding the given candidate. This is largely due to the cooperative nature of the binding amino acids which enables poorer binding residues to be tolerated when the other residues are optimal. High-throughput peptide arrays are powerful tools used to derive SH2 domain-binding specificity, but they are unable to capture these cooperative effects and also suffer from other shortcomings. Tissue and cell type expression can help to restrict the list of available interactors: for example, some well-studied SH2 domain proteins are only present in the immune cell lineages. In this article, we provide a table of motif patterns and four bioinformatics strategies that introduce a range of tools that can be used in motif hunting in cellular and pathogen proteins. Experimental followup is essential to determine which SH2 domain/motif-containing proteins are the actual functional partners.
Collapse
Affiliation(s)
- Hugo Sámano-Sánchez
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
- Zhejiang University School of Medicine, International Campus, Zhejiang University, Haining, China
- Biomedical Sciences, Edinburgh Medical School, The University of Edinburgh, Edinburgh, UK
| | - Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Lucía B Chemes
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Martín, Argentina.
- Escuela de Bio y Nanotecnologías (EByN), Universidad Nacional de San Martín, San Martín, Argentina.
| |
Collapse
|
16
|
Gassaway BM, Li J, Rad R, Mintseris J, Mohler K, Levy T, Aguiar M, Beausoleil SA, Paulo JA, Rinehart J, Huttlin EL, Gygi SP. A multi-purpose, regenerable, proteome-scale, human phosphoserine resource for phosphoproteomics. Nat Methods 2022; 19:1371-1375. [PMID: 36280721 PMCID: PMC9847208 DOI: 10.1038/s41592-022-01638-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 09/06/2022] [Indexed: 01/21/2023]
Abstract
Mass-spectrometry-based phosphoproteomics has become indispensable for understanding cellular signaling in complex biological systems. Despite the central role of protein phosphorylation, the field still lacks inexpensive, regenerable, and diverse phosphopeptides with ground-truth phosphorylation positions. Here, we present Iterative Synthetically Phosphorylated Isomers (iSPI), a proteome-scale library of human-derived phosphoserine-containing phosphopeptides that is inexpensive, regenerable, and diverse, with precisely known positions of phosphorylation. We demonstrate possible uses of iSPI, including use as a phosphopeptide standard, a tool to evaluate and optimize phosphorylation-site localization algorithms, and a benchmark to compare performance across data analysis pipelines. We also present AScorePro, an updated version of the AScore algorithm specifically optimized for phosphorylation-site localization in higher energy fragmentation spectra, and the FLR viewer, a web tool for phosphorylation-site localization, to enable community use of the iSPI resource. iSPI and its associated data constitute a useful, multi-purpose resource for the phosphoproteomics community.
Collapse
Affiliation(s)
| | - Jiaming Li
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Ramin Rad
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Julian Mintseris
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Kyle Mohler
- Department of Cellular and Molecular Physiology and Systems Biology Institute, Yale Medical School, New Haven, CT, USA
| | - Tyler Levy
- Cell Signaling Technology, Danvers, MA, USA
| | | | | | - Joao A Paulo
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Jesse Rinehart
- Department of Cellular and Molecular Physiology and Systems Biology Institute, Yale Medical School, New Haven, CT, USA
| | - Edward L Huttlin
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
| | - Steven P Gygi
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
17
|
Perez-Riverol Y. Proteomic repository data submission, dissemination, and reuse: key messages. Expert Rev Proteomics 2022; 19:297-310. [PMID: 36529941 PMCID: PMC7614296 DOI: 10.1080/14789450.2022.2160324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022]
Abstract
INTRODUCTION The creation of ProteomeXchange data workflows in 2012 transformed the field of proteomics, consisting of the standardization of data submission and dissemination and enabling the widespread reanalysis of public MS proteomics data worldwide. ProteomeXchange has triggered a growing trend toward public dissemination of proteomics data, facilitating the assessment, reuse, comparative analyses, and extraction of new findings from public datasets. By 2022, the consortium is integrated by PRIDE, PeptideAtlas, MassIVE, jPOST, iProX, and Panorama Public. AREAS COVERED Here, we review and discuss the current ecosystem of resources, guidelines, and file formats for proteomics data dissemination and reanalysis. Special attention is drawn to new exciting quantitative and post-translational modification-oriented resources. The challenges and future directions on data depositions including the lack of metadata and cloud-based and high-performance software solutions for fast and reproducible reanalysis of the available data are discussed. EXPERT OPINION The success of ProteomeXchange and the amount of proteomics data available in the public domain have triggered the creation and/or growth of other protein knowledgebase resources. Data reuse is a leading, active, and evolving field; supporting the creation of new formats, tools, and workflows to rediscover and reshape the public proteomics data.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
18
|
Ramsbottom KA, Prakash A, Riverol YP, Camacho OM, Martin MJ, Vizcaíno JA, Deutsch EW, Jones AR. Method for Independent Estimation of the False Localization Rate for Phosphoproteomics. J Proteome Res 2022; 21:1603-1615. [PMID: 35640880 PMCID: PMC9251759 DOI: 10.1021/acs.jproteome.1c00827] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
![]()
Phosphoproteomic
methods are commonly employed to identify and
quantify phosphorylation sites on proteins. In recent years, various
tools have been developed, incorporating scores or statistics related
to whether a given phosphosite has been correctly identified or to
estimate the global false localization rate (FLR) within a given data
set for all sites reported. These scores have generally been calibrated
using synthetic datasets, and their statistical reliability on real
datasets is largely unknown, potentially leading to studies reporting
incorrectly localized phosphosites, due to inadequate statistical
control. In this work, we develop the concept of scoring modifications
on a decoy amino acid, that is, one that cannot be modified, to allow
for independent estimation of global FLR. We test a variety of amino
acids, on both synthetic and real data sets, demonstrating that the
selection can make a substantial difference to the estimated global
FLR. We conclude that while several different amino acids might be
appropriate, the most reliable FLR results were achieved using alanine
and leucine as decoys. We propose the use of a decoy amino acid to
control false reporting in the literature and in public databases
that re-distribute the data. Data are available via ProteomeXchange
with identifier PXD028840.
Collapse
Affiliation(s)
- Kerry A Ramsbottom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, U.K
| | - Ananth Prakash
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, U.K
| | - Yasset Perez Riverol
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, U.K
| | - Oscar Martin Camacho
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, U.K
| | - Maria-Jesus Martin
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, U.K
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge CB10 1SD, U.K
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Andrew R Jones
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, U.K
| |
Collapse
|