1. Beals J, Hu H, Li X. A survey of experimental and computational identification of small proteins. Brief Bioinform 2024; 25:bbae345. [PMID: 39007598 PMCID: PMC11247407 DOI: 10.1093/bib/bbae345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2024] [Revised: 05/27/2024] [Accepted: 07/02/2024] [Indexed: 07/16/2024] Open
Abstract
Small proteins (SPs) are typically characterized as eukaryotic proteins shorter than 100 amino acids and prokaryotic proteins shorter than 50 amino acids. Historically, they were disregarded because of the arbitrary size thresholds to define proteins. However, recent research has revealed the existence of many SPs and their crucial roles. Despite this, the identification of SPs and the elucidation of their functions are still in their infancy. To pave the way for future SP studies, we briefly introduce the limitations and advancements in experimental techniques for SP identification. We then provide an overview of available computational tools for SP identification, their constraints, and their evaluation. Additionally, we highlight existing resources for SP research. This survey aims to initiate further exploration into SPs and encourage the development of more sophisticated computational tools for SP identification in prokaryotes and microbiomes.
Affiliation(s)
- Joshua Beals
  - Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
- Haiyan Hu
  - Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
- Xiaoman Li
  - Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
2. Genth J, Schäfer K, Cassidy L, Graspeuntner S, Rupp J, Tholey A. Identification of proteoforms of short open reading frame-encoded peptides in Blautia producta under different cultivation conditions. Microbiol Spectr 2023; 11:e0252823. [PMID: 37782090 PMCID: PMC10715070 DOI: 10.1128/spectrum.02528-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Accepted: 08/14/2023] [Indexed: 10/03/2023] Open
Abstract
IMPORTANCE: The identification of short open reading frame-encoded peptides (SEP) and different proteoforms in single cultures of gut microbes offers new insights into a largely neglected part of the microbial proteome landscape. This is of particular importance as SEP provide various predicted functions, such as acting as antimicrobial peptides, maintaining cell homeostasis under stress conditions, or even contributing to the virulence pattern. They are, thus, taking a poorly understood role in structure and function of microbial networks in the human body. A better understanding of SEP in the context of human health requires a precise understanding of the abundance of SEP both in commensal microbes as well as pathogens. For the gut beneficial B. producta, we demonstrate the importance of specific environmental conditions for biosynthesis of SEP expanding previous findings about their role in microbial interactions.
Affiliation(s)
- Jerome Genth
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
- Kathrin Schäfer
  - Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
- Liam Cassidy
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
- Simon Graspeuntner
  - Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
  - German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
- Jan Rupp
  - Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
  - German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
- Andreas Tholey
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
3. Fuchs S, Engelmann S. Small proteins in bacteria - Big challenges in prediction and identification. Proteomics 2023; 23:e2200421. [PMID: 37609810 DOI: 10.1002/pmic.202200421] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/31/2023] [Revised: 08/03/2023] [Accepted: 08/10/2023] [Indexed: 08/24/2023]
Abstract
Proteins with up to 100 amino acids have been largely overlooked due to the challenges associated with predicting and identifying them using traditional methods. Recent advances in bioinformatics and machine learning, DNA sequencing, RNA and Ribo-seq technologies, and mass spectrometry (MS) have greatly facilitated the detection and characterisation of these elusive proteins in recent years. This has revealed their crucial role in various cellular processes including regulation, signalling and transport, as toxins and as folding helpers for protein complexes. Consequently, the systematic identification and characterisation of these proteins in bacteria have emerged as a prominent field of interest within the microbial research community. This review provides an overview of different strategies for predicting and identifying these proteins on a large scale, leveraging the power of these advanced technologies. Furthermore, the review offers insights into the future developments that may be expected in this field.
Affiliation(s)
- Stephan Fuchs
  - Genome Competence Center (MF1), Department MFI, Robert-Koch-Institut, Berlin, Germany
- Susanne Engelmann
  - Institute for Microbiology, Technische Universität Braunschweig, Braunschweig, Germany
  - Microbial Proteomics, Helmholtzzentrum für Infektionsforschung GmbH, Braunschweig, Germany
4. Brantl S, Ul Haq I. Small proteins in Gram-positive bacteria. FEMS Microbiol Rev 2023; 47:fuad064. [PMID: 38052429 PMCID: PMC10730256 DOI: 10.1093/femsre/fuad064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2023] [Revised: 11/27/2023] [Accepted: 12/04/2023] [Indexed: 12/07/2023] Open
Abstract
Small proteins comprising less than 100 amino acids have been often ignored in bacterial genome annotations. About 10 years ago, focused efforts started to investigate whole peptidomes, which resulted in the discovery of a multitude of small proteins, but only a number of them have been characterized in detail. Generally, small proteins can be either membrane or cytosolic proteins. The latter interact with larger proteins, RNA or even metal ions. Here, we summarize our current knowledge on small proteins from Gram-positive bacteria with a special emphasis on the model organism Bacillus subtilis. Our examples include membrane-bound toxins of type I toxin-antitoxin systems, proteins that block the assembly of higher order structures, regulate sporulation or modulate the RNA degradosome. We do not consider antimicrobial peptides. Furthermore, we present methods for the identification and investigation of small proteins.
Affiliation(s)
- Sabine Brantl
  - AG Bakteriengenetik, Matthias-Schleiden-Institut, Friedrich-Schiller-Universität Jena, Philosophenweg 12, Jena D-07743, Germany
- Inam Ul Haq
  - AG Bakteriengenetik, Matthias-Schleiden-Institut, Friedrich-Schiller-Universität Jena, Philosophenweg 12, Jena D-07743, Germany
5. Jiang R, Rempel DL, Gross ML. Toward a MALDI in-source decay (ISD) method for top-down analysis of protein footprinting. Eur J Mass Spectrom (Chichester) 2023; 29:292-302. [PMID: 37750197 PMCID: PMC11092977 DOI: 10.1177/14690667231202695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/27/2023]
Abstract
Irreversible protein footprinting is a mass spectrometry-based approach in which solvent-accessible sites of a protein are modified to assess high-order protein structure. Structural insights can be gained by determining the position and extents of modification. The usual approach to obtain the "footprint" is to analyze the protein through bottom-up LC-MS/MS. In this approach, the proteins are digested to yield a mixture of peptides that are then separated by LC before locating the modification sites by MS/MS. This process consumes substantial amounts of time and is difficult to accelerate for applications that require quick and high-throughput analysis. Here, we describe employing matrix-assisted laser desorption/ionization (MALDI) in-source decay (ISD) to analyze a footprinted small test protein (ubiquitin) via a top-down approach. Matrix-assisted laser desorption/ionization is easily adapted for high-throughput analysis, and top-down strategies can avoid lengthy proteolysis and LC separation. We optimized the method with model peptides and then demonstrated its feasibility on ubiquitin submitted to two types of footprinting. We found that MALDI ISD can produce a comprehensive set of fragment ions for small proteins, affording footprinting information in a fast manner and giving results that agree with the established methods, and serve as a rough measure of protein solvent accessibility. To assist in the implementation of the MALDI approach, we developed a method of processing top-down ISD data.
Affiliation(s)
- Ruidong Jiang
  - Department of Chemistry, Washington University in St Louis, St Louis, MO, USA
- Don L Rempel
  - Department of Chemistry, Washington University in St Louis, St Louis, MO, USA
- Michael L Gross
  - Department of Chemistry, Washington University in St Louis, St Louis, MO, USA
6. Cassidy L, Kaulich PT, Tholey A. Proteoforms expand the world of microproteins and short open reading frame-encoded peptides. iScience 2023; 26:106069. [PMID: 36818287 PMCID: PMC9929600 DOI: 10.1016/j.isci.2023.106069] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Indexed: 01/28/2023] Open
Abstract
Microproteins and short open reading frame-encoded peptides (SEPs) can, like all proteins, carry numerous posttranslational modifications. Together with posttranscriptional processes, this leads to a high number of possible distinct protein molecules, the proteoforms, out of a limited number of genes. The identification, quantification, and molecular characterization of proteoforms pose special challenges to established analytical approaches, which are mainly based on bottom-up proteomics (BUP). While BUP methods are powerful, proteins have to be inferred rather than directly identified, which hampers the detection of proteoforms. An alternative approach is top-down proteomics (TDP), which allows the identification of intact proteoforms. This perspective article provides a brief overview of modified microproteins and SEPs, introduces the proteoform terminology, and compares present BUP and TDP workflows, highlighting their major advantages and caveats. Necessary future developments in TDP to fully accentuate its potential for proteoform-centric analytics of microproteins and SEPs are discussed.
Affiliation(s)
- Liam Cassidy
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Philipp T. Kaulich
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
- Andreas Tholey (corresponding author)
  - Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, 24105 Kiel, Germany
7. Applications of MALDI-MS/MS-Based Proteomics in Biomedical Research. Molecules 2022; 27:6196. [PMID: 36234736 PMCID: PMC9570737 DOI: 10.3390/molecules27196196] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 08/30/2022] [Revised: 09/14/2022] [Accepted: 09/15/2022] [Indexed: 11/22/2022] Open
Abstract
Matrix-assisted laser desorption/ionization (MALDI) mass spectrometry (MS) is one of the most widely used techniques in proteomics to achieve structural identification and characterization of proteins and peptides, including their variety of proteoforms due to post-translational modifications (PTMs) or protein–protein interactions (PPIs). MALDI-MS and MALDI tandem mass spectrometry (MS/MS) have been developed as analytical techniques to study small and large molecules, offering picomole to femtomole sensitivity and enabling the direct analysis of biological samples, such as biofluids, solid tissues, tissue/cell homogenates, and cell culture lysates, with a minimized procedure of sample preparation. In the last decades, structural identification of peptides and proteins achieved by MALDI-MS/MS has helped researchers and clinicians to decipher molecular function, biological process, cellular component, and related pathways of the gene products as well as their involvement in pathogenesis of diseases. In this review, we highlight the applications of MALDI ionization source and tandem approaches for MS for analyzing biomedically relevant peptides and proteins. Furthermore, one of the most relevant applications of MALDI-MS/MS is to provide “molecular pictures”, which offer in situ information about molecular weight proteins without labeling of potential targets. Histology-directed MALDI-mass spectrometry imaging (MSI) uses MALDI-ToF/ToF or other MALDI tandem mass spectrometers for accurate sequence analysis of peptide biomarkers and biologically active compounds directly in tissues, to assure complementary and essential spatial data compared with those obtained by LC-ESI-MS/MS technique.