1
|
Fijalkowski I, Snauwaert V, Van Damme P. Proteins à la carte: riboproteogenomic exploration of bacterial N-terminal proteoform expression. mBio 2024; 15:e0033324. [PMID: 38511928 PMCID: PMC11005335 DOI: 10.1128/mbio.00333-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 02/28/2024] [Indexed: 03/22/2024] Open
Abstract
In recent years, it has become evident that the true complexity of bacterial proteomes remains underestimated. Gene annotation tools are known to propagate biases and overlook certain classes of truly expressed proteins, particularly proteoforms-protein isoforms arising from a single gene. Recent (re-)annotation efforts heavily rely on ribosome profiling by providing a direct readout of translation to fully describe bacterial proteomes. In this study, we employ a robust riboproteogenomic pipeline to conduct a systematic census of expressed N-terminal proteoform pairs, representing two isoforms encoded by a single gene raised by annotated and alternative translation initiation, in Salmonella. Intriguingly, conditional-dependent changes in relative utilization of annotated and alternative translation initiation sites (TIS) were observed in several cases. This suggests that TIS selection is subject to regulatory control, adding yet another layer of complexity to our understanding of bacterial proteomes. IMPORTANCE With the emerging theme of genes within genes comprising the existence of alternative open reading frames (ORFs) generated by translation initiation at in-frame start codons, mechanisms that control the relative utilization of annotated and alternative TIS need to be unraveled and our molecular understanding of resulting proteoforms broadened. Utilizing complementary ribosome profiling strategies to map ORF boundaries, we uncovered dual-encoding ORFs generated by in-frame TIS usage in Salmonella. Besides demonstrating that alternative TIS usage may generate proteoforms with different characteristics, such as differential localization and specialized function, quantitative aspects of conditional retapamulin-assisted ribosome profiling (Ribo-RET) translation initiation maps offer unprecedented insights into the relative utilization of annotated and alternative TIS, enabling the exploration of gene regulatory mechanisms that control TIS usage and, consequently, the translation of N-terminal proteoform pairs.
Collapse
Affiliation(s)
- Igor Fijalkowski
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Valdes Snauwaert
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Petra Van Damme
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| |
Collapse
|
2
|
Fuchs S, Engelmann S. Small proteins in bacteria - Big challenges in prediction and identification. Proteomics 2023; 23:e2200421. [PMID: 37609810 DOI: 10.1002/pmic.202200421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/24/2023]
Abstract
Proteins with up to 100 amino acids have been largely overlooked due to the challenges associated with predicting and identifying them using traditional methods. Recent advances in bioinformatics and machine learning, DNA sequencing, RNA and Ribo-seq technologies, and mass spectrometry (MS) have greatly facilitated the detection and characterisation of these elusive proteins in recent years. This has revealed their crucial role in various cellular processes including regulation, signalling and transport, as toxins and as folding helpers for protein complexes. Consequently, the systematic identification and characterisation of these proteins in bacteria have emerged as a prominent field of interest within the microbial research community. This review provides an overview of different strategies for predicting and identifying these proteins on a large scale, leveraging the power of these advanced technologies. Furthermore, the review offers insights into the future developments that may be expected in this field.
Collapse
Affiliation(s)
- Stephan Fuchs
- Genome Competence Center (MF1), Department MFI, Robert-Koch-Institut, Berlin, Germany
| | - Susanne Engelmann
- Institute for Microbiology, Technische Universität Braunschweig, Braunschweig, Germany
- Microbial Proteomics, Helmholtzzentrum für Infektionsforschung GmbH, Braunschweig, Germany
| |
Collapse
|
3
|
Fang Z, Wanigasekara MSK, Yepremyan A, Lam B, Thapa P, Foss FW, Chowdhury SM. Mass Spectrometry-Cleavable Protein N-Terminal Tagging Strategy for System-Level Protease Activity Profiling. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2022; 33:189-197. [PMID: 34928623 DOI: 10.1021/jasms.1c00350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Proteolysis is one of the most important protein post-translational modifications (PTMs) that influences the functions, activities, and structures of nearly all proteins during their lifetime. To facilitate the targeted identification of low-abundant proteolytic products, we devised a strategy incorporating a novel biotinylated reagent PFP (pentafluorophenyl)-Rink-biotin to specifically target, enrich and identify proteolytic N-termini. Within the PFP-Rink-biotin reagent, a mass spectrometry (MS)-cleavable feature was designed to assist in the unambiguous confirmation of the enriched proteolytic N-termini. The proof-of-concept study was performed with multiple standard proteins whose N-termini were successfully modified, enriched and identified by a signature ion (SI) in the MS/MS fragmentation, along with the determination of N-terminal peptide sequences by multistage tandem MS of the complementary fragment generated after the cleavage of MS-cleavable bond. For large-scale application, the enrichment and identification of protein N-termini from Escherichia coli cells were demonstrated, facilitated by an in-house developed NTermFinder bioinformatics workflow. We believe this approach will be beneficial in improving the confidence of identifying proteolytic substrates in a native cellular environment.
Collapse
Affiliation(s)
- Zixiang Fang
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Maheshika S K Wanigasekara
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Akop Yepremyan
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Brandon Lam
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Pawan Thapa
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Frank W Foss
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| | - Saiful M Chowdhury
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington Texas 76019, United States
| |
Collapse
|
4
|
Abstract
Modern genome-scale methods that identify new genes, such as proteogenomics and ribosome profiling, have revealed, to the surprise of many, that overlap in genes, open reading frames and even coding sequences is widespread and functionally integrated into prokaryotic, eukaryotic and viral genomes. In parallel, the constraints that overlapping regions place on genome sequences and their evolution can be harnessed in bioengineering to build more robust synthetic strains and constructs. With a focus on overlapping protein-coding and RNA-coding genes, this Review examines their discovery, topology and biogenesis in the context of their genome biology. We highlight exciting new uses for sequence overlap to control translation, compress synthetic genetic constructs, and protect against mutation.
Collapse
|
5
|
Fuchs S, Kucklick M, Lehmann E, Beckmann A, Wilkens M, Kolte B, Mustafayeva A, Ludwig T, Diwo M, Wissing J, Jänsch L, Ahrens CH, Ignatova Z, Engelmann S. Towards the characterization of the hidden world of small proteins in Staphylococcus aureus, a proteogenomics approach. PLoS Genet 2021; 17:e1009585. [PMID: 34061833 PMCID: PMC8195425 DOI: 10.1371/journal.pgen.1009585] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 06/11/2021] [Accepted: 05/07/2021] [Indexed: 01/08/2023] Open
Abstract
Small proteins play essential roles in bacterial physiology and virulence, however, automated algorithms for genome annotation are often not yet able to accurately predict the corresponding genes. The accuracy and reliability of genome annotations, particularly for small open reading frames (sORFs), can be significantly improved by integrating protein evidence from experimental approaches. Here we present a highly optimized and flexible bioinformatics workflow for bacterial proteogenomics covering all steps from (i) generation of protein databases, (ii) database searches and (iii) peptide-to-genome mapping to (iv) visualization of results. We used the workflow to identify high quality peptide spectrum matches (PSMs) for small proteins (≤ 100 aa, SP100) in Staphylococcus aureus Newman. Protein extracts from S. aureus were subjected to different experimental workflows for protein digestion and prefractionation and measured with highly sensitive mass spectrometers. In total, 175 proteins with up to 100 aa (SP100) were identified. Out of these 24 (ranging from 9 to 99 aa) were novel and not contained in the used genome annotation.144 SP100 are highly conserved and were found in at least 50% of the publicly available S. aureus genomes, while 127 are additionally conserved in other staphylococci. Almost half of the identified SP100 were basic, suggesting a role in binding to more acidic molecules such as nucleic acids or phospholipids.
Collapse
Affiliation(s)
- Stephan Fuchs
- Robert Koch Institute, Methodenentwicklung und Forschungsinfrastruktur (MF), Berlin, Germany
| | - Martin Kucklick
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Erik Lehmann
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Alexander Beckmann
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Maya Wilkens
- Robert Koch Institute, Methodenentwicklung und Forschungsinfrastruktur (MF), Berlin, Germany
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Baban Kolte
- University of Hamburg, Institute of Biochemistry and Molecular Biology, Hamburg, Germany
| | - Ayten Mustafayeva
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Tobias Ludwig
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Maurice Diwo
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| | - Josef Wissing
- Helmholtz Center for Infection Research GmbH, Cellular Proteomics, Braunschweig, Germany
| | - Lothar Jänsch
- Helmholtz Center for Infection Research GmbH, Cellular Proteomics, Braunschweig, Germany
| | - Christian H Ahrens
- Agroscope, Research Group Molecular Diagnostics, Genomics and Bioinformatics & SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Zoya Ignatova
- University of Hamburg, Institute of Biochemistry and Molecular Biology, Hamburg, Germany
| | - Susanne Engelmann
- University of Technical Sciences Braunschweig, Institute for Microbiology, Braunschweig, Germany
- Helmholtz Center for Infection Research GmbH, Microbial Proteomics, Braunschweig, Germany
| |
Collapse
|
6
|
Protein cleavage influences surface protein presentation in Mycoplasma pneumoniae. Sci Rep 2021; 11:6743. [PMID: 33762641 PMCID: PMC7990945 DOI: 10.1038/s41598-021-86217-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 02/23/2021] [Indexed: 01/31/2023] Open
Abstract
Mycoplasma pneumoniae is a significant cause of pneumonia and post infection sequelae affecting organ sites distant to the respiratory tract are common. It is also a model organism where extensive 'omics' studies have been conducted to gain insight into how minimal genome self-replicating organisms function. An N-terminome study undertaken here identified 4898 unique N-terminal peptides that mapped to 391 (56%) predicted M. pneumoniae proteins. True N-terminal sequences beginning with the initiating methionine (iMet) residue from the predicted Open Reading Frame (ORF) were identified for 163 proteins. Notably, almost half (317; 46%) of the ORFS derived from M. pneumoniae strain M129 are post-translationally modified, presumably by proteolytic processing, because dimethyl labelled neo-N-termini were characterised that mapped beyond the predicted N-terminus. An analysis of the N-terminome describes endoproteolytic processing events predominately targeting tryptic-like sites, though cleavages at negatively charged residues in P1' (D and E) with lysine or serine/alanine in P2' and P3' positions also occurred frequently. Surfaceome studies identified 160 proteins (23% of the proteome) to be exposed on the extracellular surface of M. pneumoniae. The two orthogonal methodologies used to characterise the surfaceome each identified the same 116 proteins, a 72% (116/160) overlap. Apart from lipoproteins, transporters, and adhesins, 93/160 (58%) of the surface proteins lack signal peptides and have well characterised, canonical functions in the cell. Of the 160 surface proteins identified, 134 were also targets of endo-proteolytic processing. These processing events are likely to have profound implications for how the host immune system recognises and responds to M. pneumoniae.
Collapse
|
7
|
Fijalkowska D, Fijalkowski I, Willems P, Van Damme P. Bacterial riboproteogenomics: the era of N-terminal proteoform existence revealed. FEMS Microbiol Rev 2021; 44:418-431. [PMID: 32386204 DOI: 10.1093/femsre/fuaa013] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 05/07/2020] [Indexed: 12/17/2022] Open
Abstract
With the rapid increase in the number of sequenced prokaryotic genomes, relying on automated gene annotation became a necessity. Multiple lines of evidence, however, suggest that current bacterial genome annotations may contain inconsistencies and are incomplete, even for so-called well-annotated genomes. We here discuss underexplored sources of protein diversity and new methodologies for high-throughput genome reannotation. The expression of multiple molecular forms of proteins (proteoforms) from a single gene, particularly driven by alternative translation initiation, is gaining interest as a prominent contributor to bacterial protein diversity. In consequence, riboproteogenomic pipelines were proposed to comprehensively capture proteoform expression in prokaryotes by the complementary use of (positional) proteomics and the direct readout of translated genomic regions using ribosome profiling. To complement these discoveries, tailored strategies are required for the functional characterization of newly discovered bacterial proteoforms.
Collapse
Affiliation(s)
- Daria Fijalkowska
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Igor Fijalkowski
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Patrick Willems
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| | - Petra Van Damme
- Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, B-9000 Ghent, Belgium
| |
Collapse
|
8
|
Misincorporation Proteomics Technologies: A Review. Proteomes 2021; 9:proteomes9010002. [PMID: 33494504 PMCID: PMC7924376 DOI: 10.3390/proteomes9010002] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 01/11/2021] [Accepted: 01/18/2021] [Indexed: 12/15/2022] Open
Abstract
Proteinopathies are diseases caused by factors that affect proteoform conformation. As such, a prevalent hypothesis is that the misincorporation of noncanonical amino acids into a proteoform results in detrimental structures. However, this hypothesis is missing proteomic evidence, specifically the detection of a noncanonical amino acid in a peptide sequence. This review aims to outline the current state of technology that can be used to investigate mistranslations and misincorporations whilst framing the pursuit as Misincorporation Proteomics (MiP). The current availability of technologies explored herein is mass spectrometry, sample enrichment/preparation, data analysis techniques, and the hyphenation of approaches. While many of these technologies show potential, our review reveals a need for further development and refinement of approaches is still required.
Collapse
|
9
|
Kaushal P, Lee C. N-terminomics - its past and recent advancements. J Proteomics 2020; 233:104089. [PMID: 33359939 DOI: 10.1016/j.jprot.2020.104089] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 07/22/2020] [Accepted: 12/20/2020] [Indexed: 02/06/2023]
Abstract
N-terminomics is a rapidly evolving branch of proteomics that encompasses the study of protein N-terminal sequence. A proteome-wide collection of such sequences has been widely used to understand the proteolytic cascades and in annotating the genome. Over the last two decades, various N-terminomic strategies have been developed for achieving high sensitivity, greater depth of coverage, and high-throughputness. We, in this review, cover how the field of N-terminomics has evolved to date, including discussion on various sample preparation and N-terminal peptide enrichment strategies. We also compare different N-terminomic methods and highlight their relative benefits and shortcomings in their implementation. In addition, an overview of the currently available bioinformatics tools and data analysis pipelines for the annotation of N-terminomic datasets is also included. SIGNIFICANCE: It has been recognized that proteins undergo several post-translational modifications (PTM), and a number of perturbed biological pathways are directly associated with modifications at the terminal sites of a protein. In this regard, N-terminomics can be applied to generate a proteome-wide landscape of mature N-terminal sequences, annotate their source of generation, and recognize their significance in the biological pathways. Besides, a system-wide study can be used to study complicated proteolytic machinery and protease cleavage patterns for potential therapeutic targets. Moreover, due to unprecedented improvements in the analytical methods and mass spectrometry instrumentation in recent times, the N-terminomic methodologies now offers an unparalleled ability to study proteoforms and their implications in clinical conditions. Such approaches can further be applied for the detection of low abundant proteoforms, annotation of non-canonical protein coding sites, identification of candidate disease biomarkers, and, last but not least, the discovery of novel drug targets.
Collapse
Affiliation(s)
- Prashant Kaushal
- Center for Theragnosis, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Republic of Korea
| | - Cheolju Lee
- Center for Theragnosis, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Republic of Korea; KHU-KIST Department of Converging Science and Technology, Kyung Hee University, 26 Kyunghee-daero, Dongdaemun-gu, Seoul 02447, Republic of Korea.
| |
Collapse
|
10
|
Ardern Z, Neuhaus K, Scherer S. Are Antisense Proteins in Prokaryotes Functional? Front Mol Biosci 2020; 7:187. [PMID: 32923454 PMCID: PMC7457138 DOI: 10.3389/fmolb.2020.00187] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2020] [Accepted: 07/16/2020] [Indexed: 12/16/2022] Open
Abstract
Many prokaryotic RNAs are transcribed from loci outside of annotated protein coding genes. Across bacterial species hundreds of short open reading frames antisense to annotated genes show evidence of both transcription and translation, for instance in ribosome profiling data. Determining the functional fraction of these protein products awaits further research, including insights from studies of molecular interactions and detailed evolutionary analysis. There are multiple lines of evidence, however, that many of these newly discovered proteins are of use to the organism. Condition-specific phenotypes have been characterized for a few. These proteins should be added to genome annotations, and the methods for predicting them standardized. Evolutionary analysis of these typically young sequences also may provide important insights into gene evolution. This research should be prioritized for its exciting potential to uncover large numbers of novel proteins with extremely diverse potential practical uses, including applications in synthetic biology and responding to pathogens.
Collapse
Affiliation(s)
- Zachary Ardern
- Chair for Microbial Ecology, Technical University of Munich, Munich, Germany
| | | | | |
Collapse
|
11
|
Clauwaert J, Menschaert G, Waegeman W. DeepRibo: a neural network for precise gene annotation of prokaryotes by combining ribosome profiling signal and binding site patterns. Nucleic Acids Res 2019; 47:e36. [PMID: 30753697 PMCID: PMC6451124 DOI: 10.1093/nar/gkz061] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 01/02/2019] [Accepted: 01/30/2019] [Indexed: 12/13/2022] Open
Abstract
Annotation of gene expression in prokaryotes often finds itself corrected due to small variations of the annotated gene regions observed between different (sub)-species. It has become apparent that traditional sequence alignment algorithms, used for the curation of genomes, are not able to map the full complexity of the genomic landscape. We present DeepRibo, a novel neural network utilizing features extracted from ribosome profiling information and binding site sequence patterns that shows to be a precise tool for the delineation and annotation of expressed genes in prokaryotes. The neural network combines recurrent memory cells and convolutional layers, adapting the information gained from both the high-throughput ribosome profiling data and ribosome binding translation initiation sequence region into one model. DeepRibo is designed as a single model trained on a variety of ribosome profiling experiments, used for the identification of open reading frames in prokaryotes without a priori knowledge of the translational landscape. Through extensive validation of the model trained on various sets of data, multiple species sequence similarity, mass spectrometry and Edman degradation verified proteins, the effectiveness of DeepRibo is highlighted.
Collapse
Affiliation(s)
- Jim Clauwaert
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| | - Gerben Menschaert
- Biobix, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 9000 Gent, Belgium
| |
Collapse
|
12
|
Kaspar JR, Walker AR. Expanding the Vocabulary of Peptide Signals in Streptococcus mutans. Front Cell Infect Microbiol 2019; 9:194. [PMID: 31245303 PMCID: PMC6563777 DOI: 10.3389/fcimb.2019.00194] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Accepted: 05/21/2019] [Indexed: 12/18/2022] Open
Abstract
Streptococci, including the dental pathogen Streptococcus mutans, undergo cell-to-cell signaling that is mediated by small peptides to control critical physiological functions such as adaptation to the environment, control of subpopulation behaviors and regulation of virulence factors. One such model pathway is the regulation of genetic competence, controlled by the ComRS signaling system and the peptide XIP. However, recent research in the characterization of this pathway has uncovered novel operons and peptides that are intertwined into its regulation. These discoveries, such as cell lysis playing a critical role in XIP release and importance of bacterial self-sensing during the signaling process, have caused us to reevaluate previous paradigms and shift our views on the true purpose of these signaling systems. The finding of new peptides such as the ComRS inhibitor XrpA and the peptides of the RcrRPQ operon also suggests there may be more peptides hidden in the genomes of streptococci that could play critical roles in the physiology of these organisms. In this review, we summarize the recent findings in S. mutans regarding the integration of other circuits into the ComRS signaling pathway, the true mode of XIP export, and how the RcrRPQ operon controls competence activation. We also look at how new technologies can be used to re-annotate the genome to find new open reading frames that encode peptide signals. Together, this summary of research will allow us to reconsider how we perceive these systems to behave and lead us to expand our vocabulary of peptide signals within the genus Streptococcus.
Collapse
Affiliation(s)
- Justin R. Kaspar
- Department of Oral Biology, University of Florida, Gainesville, FL, United States
| | | |
Collapse
|
13
|
Hurtado Silva M, Berry IJ, Strange N, Djordjevic SP, Padula MP. Terminomics Methodologies and the Completeness of Reductive Dimethylation: A Meta-Analysis of Publicly Available Datasets. Proteomes 2019; 7:proteomes7020011. [PMID: 30934878 PMCID: PMC6631386 DOI: 10.3390/proteomes7020011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 03/22/2019] [Accepted: 03/25/2019] [Indexed: 12/30/2022] Open
Abstract
Methods for analyzing the terminal sequences of proteins have been refined over the previous decade; however, few studies have evaluated the quality of the data that have been produced from those methodologies. While performing global N-terminal labelling on bacteria, we observed that the labelling was not complete and investigated whether this was a common occurrence. We assessed the completeness of labelling in a selection of existing, publicly available N-terminomics datasets and empirically determined that amine-based labelling chemistry does not achieve complete labelling and potentially has issues with labelling amine groups at sequence-specific residues. This finding led us to conduct a thorough review of the historical literature that showed that this is not an unexpected finding, with numerous publications reporting incomplete labelling. These findings have implications for the quantitation of N-terminal peptides and the biological interpretations of these data.
Collapse
Affiliation(s)
- Mariella Hurtado Silva
- Proteomics Core Facility and School of Life Sciences, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
| | - Iain J Berry
- Proteomics Core Facility and School of Life Sciences, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
- The ithree Institute, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
| | - Natalie Strange
- Proteomics Core Facility and School of Life Sciences, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
| | - Steven P Djordjevic
- The ithree Institute, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
| | - Matthew P Padula
- Proteomics Core Facility and School of Life Sciences, Faculty of Science, University of Technology Sydney, Broadway NSW 2007, Australia.
| |
Collapse
|
14
|
Abstract
Genetic coding in bacteria largely operates via the "one gene-one protein" paradigm. However, the peculiarities of the mRNA structure, the versatility of the genetic code, and the dynamic nature of translation sometimes allow organisms to deviate from the standard rules of protein encoding. Bacteria can use several unorthodox modes of translation to express more than one protein from a single mRNA cistron. One such alternative path is the use of additional translation initiation sites within the gene. Proteins whose translation is initiated at different start sites within the same reading frame will differ in their N termini but will have identical C-terminal segments. On the other hand, alternative initiation of translation in a register different from the frame dictated by the primary start codon will yield a protein whose sequence is entirely different from the one encoded in the main frame. The use of internal mRNA codons as translation start sites is controlled by the nucleotide sequence and the mRNA folding. The proteins of the alternative proteome generated via the "genes-within-genes" strategy may carry important functions. In this review, we summarize the currently known examples of bacterial genes encoding more than one protein due to the utilization of additional translation start sites and discuss the known or proposed functions of the alternative polypeptides in relation to the main protein product of the gene. We also discuss recent proteome- and genome-wide approaches that will allow the discovery of novel translation initiation sites in a systematic fashion.
Collapse
|
15
|
N-terminome and proteogenomic analysis of the Methylobacterium extorquens DM4 reference strain for dichloromethane utilization. J Proteomics 2018; 179:131-139. [DOI: 10.1016/j.jprot.2018.03.012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Revised: 02/28/2018] [Accepted: 03/16/2018] [Indexed: 12/29/2022]
|
16
|
Berry IJ, Jarocki VM, Tacchi JL, Raymond BBA, Widjaja M, Padula MP, Djordjevic SP. N-terminomics identifies widespread endoproteolysis and novel methionine excision in a genome-reduced bacterial pathogen. Sci Rep 2017; 7:11063. [PMID: 28894154 PMCID: PMC5593965 DOI: 10.1038/s41598-017-11296-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 08/21/2017] [Indexed: 12/12/2022] Open
Abstract
Proteolytic processing alters protein function. Here we present the first systems-wide analysis of endoproteolysis in the genome-reduced pathogen Mycoplasma hyopneumoniae. 669 N-terminal peptides from 164 proteins were identified, demonstrating that functionally diverse proteins are processed, more than half of which 75 (53%) were accessible on the cell surface. Multiple cleavage sites were characterised, but cleavage with arginine in P1 predominated. Putative functions for a subset of cleaved fragments were assigned by affinity chromatography using heparin, actin, plasminogen and fibronectin as bait. Binding affinity was correlated with the number of cleavages in a protein, indicating that novel binding motifs are exposed, and protein disorder increases, after a cleavage event. Glyceraldehyde 3-phosphate dehydrogenase was used as a model protein to demonstrate this. We define the rules governing methionine excision, show that several aminopeptidases are involved, and propose that through processing, genome-reduced organisms can expand protein function.
Collapse
Affiliation(s)
- Iain J Berry
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Veronica M Jarocki
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Jessica L Tacchi
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Benjamin B A Raymond
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Michael Widjaja
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Matthew P Padula
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia
| | - Steven P Djordjevic
- The ithree institute, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia. .,Proteomics Core Facility, University of Technology Sydney, PO Box 123, Broadway, NSW, 2007, Australia.
| |
Collapse
|
17
|
Giess A, Jonckheere V, Ndah E, Chyżyńska K, Van Damme P, Valen E. Ribosome signatures aid bacterial translation initiation site identification. BMC Biol 2017; 15:76. [PMID: 28854918 PMCID: PMC5576327 DOI: 10.1186/s12915-017-0416-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2017] [Accepted: 08/09/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND While methods for annotation of genes are increasingly reliable, the exact identification of translation initiation sites remains a challenging problem. Since the N-termini of proteins often contain regulatory and targeting information, developing a robust method for start site identification is crucial. Ribosome profiling reads show distinct patterns of read length distributions around translation initiation sites. These patterns are typically lost in standard ribosome profiling analysis pipelines, when reads from footprints are adjusted to determine the specific codon being translated. RESULTS Utilising these signatures in combination with nucleotide sequence information, we build a model capable of predicting translation initiation sites and demonstrate its high accuracy using N-terminal proteomics. Applying this to prokaryotic translatomes, we re-annotate translation initiation sites and provide evidence of N-terminal truncations and extensions of previously annotated coding sequences. These re-annotations are supported by the presence of structural and sequence-based features next to N-terminal peptide evidence. Finally, our model identifies 61 novel genes previously undiscovered in the Salmonella enterica genome. CONCLUSIONS Signatures within ribosome profiling read length distributions can be used in combination with nucleotide sequence information to provide accurate genome-wide identification of translation initiation sites.
Collapse
Affiliation(s)
- Adam Giess
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, 5020, Norway
| | - Veronique Jonckheere
- VIB-UGent Center for Medical Biotechnology, B-9000, Ghent, Belgium.,Department of Biochemistry, Ghent University, B-9000, Ghent, Belgium
| | - Elvis Ndah
- VIB-UGent Center for Medical Biotechnology, B-9000, Ghent, Belgium.,Department of Biochemistry, Ghent University, B-9000, Ghent, Belgium.,Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, B-9000, Ghent, Belgium
| | - Katarzyna Chyżyńska
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, 5020, Norway
| | - Petra Van Damme
- VIB-UGent Center for Medical Biotechnology, B-9000, Ghent, Belgium. .,Department of Biochemistry, Ghent University, B-9000, Ghent, Belgium.
| | - Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, 5020, Norway. .,Sars International Centre for Marine Molecular Biology, University of Bergen, 5008, Bergen, Norway.
| |
Collapse
|
18
|
Tholey A, Becker A. Top-down proteomics for the analysis of proteolytic events - Methods, applications and perspectives. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2017; 1864:2191-2199. [PMID: 28711385 DOI: 10.1016/j.bbamcr.2017.07.002] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Revised: 07/07/2017] [Accepted: 07/09/2017] [Indexed: 02/06/2023]
Abstract
Mass spectrometry based proteomics is an indispensable tool for almost all research areas relevant for the understanding of proteolytic processing, ranging from the identification of substrates, products and cleavage sites up to the analysis of structural features influencing protease activity. The majority of methods for these studies are based on bottom-up proteomics performing analysis at peptide level. As this approach is characterized by a number of pitfalls, e.g. loss of molecular information, there is an ongoing effort to establish top-down proteomics, performing separation and MS analysis both at intact protein level. We briefly introduce major approaches of bottom-up proteomics used in the field of protease research and highlight the shortcomings of these methods. We then discuss the present state-of-the-art of top-down proteomics. Together with the discussion of known challenges we show the potential of this approach and present a number of successful applications of top-down proteomics in protease research. This article is part of a Special Issue entitled: Proteolysis as a Regulatory Event in Pathophysiology edited by Stefan Rose-John.
Collapse
Affiliation(s)
- Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany.
| | - Alexander Becker
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|
19
|
Marshall NC, Finlay BB, Overall CM. Sharpening Host Defenses during Infection: Proteases Cut to the Chase. Mol Cell Proteomics 2017; 16:S161-S171. [PMID: 28179412 PMCID: PMC5393396 DOI: 10.1074/mcp.o116.066456] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Revised: 02/03/2017] [Indexed: 01/14/2023] Open
Abstract
The human immune system consists of an intricate network of tightly controlled pathways, where proteases are essential instigators and executioners at multiple levels. Invading microbial pathogens also encode proteases that have evolved to manipulate and dysregulate host proteins, including host proteases during the course of disease. The identification of pathogen proteases as well as their substrates and mechanisms of action have empowered significant developments in therapeutics for infectious diseases. Yet for many pathogens, there remains a great deal to be discovered. Recently, proteomic techniques have been developed that can identify proteolytically processed proteins across the proteome. These “degradomics” approaches can identify human substrates of microbial proteases during infection in vivo and expose the molecular-level changes that occur in the human proteome during infection as an operational network to develop hypotheses for further research as well as new therapeutics. This Perspective Article reviews how proteases are utilized during infection by both the human host and invading bacterial pathogens, including archetypal virulence-associated microbial proteases, such as the Clostridia spp. botulinum and tetanus neurotoxins. We highlight the potential knowledge that degradomics studies of host–pathogen interactions would uncover, as well as how degradomics has been successfully applied in similar contexts, including use with a viral protease. We review how microbial proteases have been targeted in current therapeutic approaches and how microbial proteases have shaped and even contributed to human therapeutics beyond infectious disease. Finally, we discuss how, moving forward, degradomics research can greatly contribute to our understanding of how microbial pathogens cause disease in vivo and lead to the identification of novel substrates in vivo, and the development of improved therapeutics to counter these pathogens.
Collapse
Affiliation(s)
- Natalie C Marshall
- From the ‡Department of Microbiology & Immunology.,§Michael Smith Laboratories
| | - B Brett Finlay
- From the ‡Department of Microbiology & Immunology.,§Michael Smith Laboratories.,¶Department of Biochemistry & Molecular Biology
| | - Christopher M Overall
- ¶Department of Biochemistry & Molecular Biology, .,**Department of Oral Biological & Medical Sciences, Centre for Blood Research, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
20
|
Paes JA, Virginio VG, Cancela M, Leal FMA, Borges TJ, Jaeger N, Bonorino C, Schrank IS, Ferreira HB. Pro-apoptotic effect of a Mycoplasma hyopneumoniae putative type I signal peptidase on PK(15) swine cells. Vet Microbiol 2017; 201:170-176. [PMID: 28284605 DOI: 10.1016/j.vetmic.2017.01.024] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 12/08/2016] [Accepted: 01/18/2017] [Indexed: 10/20/2022]
Abstract
Mycoplasma hyopneumoniae is an economically significant swine pathogen that causes porcine enzootic pneumonia (PEP). Important processes for swine infection by M. hyopneumoniae depend on cell surface proteins, many of which are secreted by secretion pathways not completely elucidated so far. A putative type I signal peptidase (SPase I), a possible component of a putative Sec-dependent pathway, was annotated as a product of the sipS gene in the pathogenic M. hyopneumoniae 7448 genome. This M. hyopneumoniae putative SPase I (MhSPase I) displays only 14% and 23% of sequence identity/similarity to Escherichia coli bona fide SPase I, and, in complementation assays performed with a conditional E. coli SPase I mutant, only a partial restoration of growth was achieved with the heterologous expression of a recombinant MhSPase I (rMhSPase I). Considering the putative surface location of MhSPase I and its previously demonstrated capacity to induce a strong humoral response, we then assessed its potential to elicit a cellular and possible immunomodulatory response. In assays for immunogenicity assessment, rMhSPase I unexpectedly showed a cytotoxic effect on murine splenocytes. This cytotoxic effect was further confirmed using the swine epithelial PK(15) cell line in MTT and annexin V-flow cytometry assays, which showed that rMhSPase I induces apoptosis in a dose dependent-way. It was also demonstrated that this pro-apoptotic effect of rMhSPase I involves activation of a caspase-3 cascade. The potential relevance of the rMhSPase I pro-apoptotic effect for M. hyopneumoniae-host interactions in the context of PEP is discussed.
Collapse
Affiliation(s)
- Jéssica A Paes
- Laboratório de Genômica Estrutural e Funcional, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil
| | - Veridiana G Virginio
- Laboratório de Genômica Estrutural e Funcional, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil
| | - Martín Cancela
- Laboratório de Biologia Molecular de Cestódeos, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil
| | - Fernanda M A Leal
- Laboratório de Genômica Estrutural e Funcional, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil
| | - Thiago J Borges
- Laboratório de Imunologia Celular, Instituto de Pesquisas Biomédicas, Pontifícia Universidade Católica do Rio Grande do Sul, Brazil
| | - Natália Jaeger
- Laboratório de Imunologia Celular, Instituto de Pesquisas Biomédicas, Pontifícia Universidade Católica do Rio Grande do Sul, Brazil
| | - Cristina Bonorino
- Laboratório de Imunologia Celular, Instituto de Pesquisas Biomédicas, Pontifícia Universidade Católica do Rio Grande do Sul, Brazil
| | - Irene S Schrank
- Laboratório de Microrganismos Diazotróficos, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil
| | - Henrique B Ferreira
- Laboratório de Genômica Estrutural e Funcional, Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Brazil.
| |
Collapse
|
21
|
Abstract
Omics approaches have become popular in biology as powerful discovery tools, and currently gain in interest for diagnostic applications. Establishing the accurate genome sequence of any organism is easy, but the outcome of its annotation by means of automatic pipelines remains imprecise. Some protein-encoding genes may be missed as soon as they are specific and poorly conserved in a given taxon, while important to explain the specific traits of the organism. Translational starts are also poorly predicted in a relatively important number of cases, thus impacting the protein sequence database used in proteomics, comparative genomics, and systems biology. The use of high-throughput proteomics data to improve genome annotation is an attractive option to obtain a more comprehensive molecular picture of a given organism. Here, protocols for reannotating prokaryote genomes are described based on shotgun proteomics and derivatization of protein N-termini with a positively charged reagent coupled to high-resolution tandem mass spectrometry.
Collapse
|
22
|
Müller SA, Scilabra SD, Lichtenthaler SF. Proteomic Substrate Identification for Membrane Proteases in the Brain. Front Mol Neurosci 2016; 9:96. [PMID: 27790089 PMCID: PMC5062031 DOI: 10.3389/fnmol.2016.00096] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 09/21/2016] [Indexed: 12/26/2022] Open
Abstract
Cell-cell communication in the brain is controlled by multiple mechanisms, including proteolysis. Membrane-bound proteases generate signaling molecules from membrane-bound precursor proteins and control the length and function of cell surface membrane proteins. These proteases belong to different families, including members of the “a disintegrin and metalloprotease” (ADAM), the beta-site amyloid precursor protein cleaving enzymes (BACE), membrane-type matrix metalloproteases (MT-MMP) and rhomboids. Some of these proteases, in particular ADAM10 and BACE1 have been shown to be essential not only for the correct development of the mammalian brain, but also for myelination and maintaining neuronal connections in the adult nervous system. Additionally, these proteases are considered as drug targets for brain diseases, including Alzheimer’s disease (AD), schizophrenia and cancer. Despite their biomedical relevance, the molecular functions of these proteases in the brain have not been explored in much detail, as little was known about their substrates. This has changed with the recent development of novel proteomic methods which allow to identify substrates of membrane-bound proteases from cultured cells, primary neurons and other primary brain cells and even in vivo from minute amounts of mouse cerebrospinal fluid (CSF). This review summarizes the recent advances and highlights the strengths of the individual proteomic methods. Finally, using the example of the Alzheimer-related proteases BACE1, ADAM10 and γ-secretase, as well as ADAM17 and signal peptide peptidase like 3 (SPPL3), we illustrate how substrate identification with novel methods is instrumental in elucidating broad physiological functions of these proteases in the brain and other organs.
Collapse
Affiliation(s)
- Stephan A Müller
- German Center for Neurodegenerative Diseases (DZNE)Munich, Germany; Neuroproteomics, Klinikum rechts der Isar, Technische Universität MünchenMunich, Germany
| | - Simone D Scilabra
- German Center for Neurodegenerative Diseases (DZNE)Munich, Germany; Neuroproteomics, Klinikum rechts der Isar, Technische Universität MünchenMunich, Germany
| | - Stefan F Lichtenthaler
- German Center for Neurodegenerative Diseases (DZNE)Munich, Germany; Neuroproteomics, Klinikum rechts der Isar, Technische Universität MünchenMunich, Germany; Institute for Advanced Study, Technische Universität MunichGarching, Germany; Munich Cluster for Systems Neurology (SyNergy)Munich, Germany
| |
Collapse
|
23
|
Canut H, Albenne C, Jamet E. Post-translational modifications of plant cell wall proteins and peptides: A survey from a proteomics point of view. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2016; 1864:983-90. [PMID: 26945515 DOI: 10.1016/j.bbapap.2016.02.022] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 02/12/2016] [Accepted: 02/24/2016] [Indexed: 12/21/2022]
Abstract
Plant cell wall proteins (CWPs) and peptides are important players in cell walls contributing to their assembly and their remodeling during development and in response to environmental constraints. Since the rise of proteomics technologies at the beginning of the 2000's, the knowledge of CWPs has greatly increased leading to the discovery of new CWP families and to the description of the cell wall proteomes of different organs of many plants. Conversely, cell wall peptidomics data are still lacking. In addition to the identification of CWPs and peptides by mass spectrometry (MS) and bioinformatics, proteomics has allowed to describe their post-translational modifications (PTMs). At present, the best known PTMs consist in proteolytic cleavage, N-glycosylation, hydroxylation of P residues into hydroxyproline residues (O), O-glycosylation and glypiation. In this review, the methods allowing the capture of the modified proteins based on the specific properties of their PTMs as well as the MS technologies used for their characterization are briefly described. A focus is done on proteolytic cleavage leading to protein maturation or release of signaling peptides and on O-glycosylation. Some new technologies, like top-down proteomics and terminomics, are described. They aim at a finer description of proteoforms resulting from PTMs or degradation mechanisms. This article is part of a Special Issue entitled: Plant Proteomics--a bridge between fundamental processes and crop production, edited by Dr. Hans-Peter Mock.
Collapse
Affiliation(s)
- Hervé Canut
- Université de Toulouse, CNRS, UPS, 24 chemin de Borde Rouge, Auzeville, BP42617, 31326 Castanet Tolosan, France
| | - Cécile Albenne
- Université de Toulouse, CNRS, UPS, 24 chemin de Borde Rouge, Auzeville, BP42617, 31326 Castanet Tolosan, France
| | - Elisabeth Jamet
- Université de Toulouse, CNRS, UPS, 24 chemin de Borde Rouge, Auzeville, BP42617, 31326 Castanet Tolosan, France.
| |
Collapse
|