51
|
Signal-BNF: a Bayesian network fusing approach to predict signal peptides. J Biomed Biotechnol 2012; 2012:492174. [PMID: 23118510 PMCID: PMC3481251 DOI: 10.1155/2012/492174] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2012] [Revised: 09/09/2012] [Accepted: 09/09/2012] [Indexed: 11/29/2022] Open
Abstract
A signal peptide is a short peptide chain that directs the transport of a protein and has become the crucial vehicle in finding new drugs or reprogramming cells for gene therapy. As the avalanche of new protein sequences generated in the postgenomic era, the challenge of identifying new signal sequences has become even more urgent and critical in biomedical engineering. In this paper, we propose a novel predictor called Signal-BNF to predict the N-terminal signal peptide as well as its cleavage site based on Bayesian reasoning network. Signal-BNF is formed by fusing the results of different Bayesian classifiers which used different feature datasets as its input through weighted voting system. Experiment results show that Signal-BNF is superior to the popular online predictors such as Signal-3L and PrediSi. Signal-BNF is featured by high prediction accuracy that may serve as a useful tool for further investigating many unclear details regarding the molecular mechanism of the zip code protein-sorting system in cells.
Collapse
|
52
|
Szabo Z, Pohlschroder M. Diversity and subcellular distribution of archaeal secreted proteins. Front Microbiol 2012; 3:207. [PMID: 22783239 PMCID: PMC3387779 DOI: 10.3389/fmicb.2012.00207] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 05/21/2012] [Indexed: 12/12/2022] Open
Abstract
Secreted proteins make up a significant percentage of a prokaryotic proteome and play critical roles in important cellular processes such as polymer degradation, nutrient uptake, signal transduction, cell wall biosynthesis, and motility. The majority of archaeal proteins are believed to be secreted either in an unfolded conformation via the universally conserved Sec pathway or in a folded conformation via the Twin arginine transport (Tat) pathway. Extensive in vivo and in silico analyses of N-terminal signal peptides that target proteins to these pathways have led to the development of computational tools that not only predict Sec and Tat substrates with high accuracy but also provide information about signal peptide processing and targeting. Predictions therefore include indications as to whether a substrate is a soluble secreted protein, a membrane or cell wall anchored protein, or a surface structure subunit, and whether it is targeted for post-translational modification such as glycosylation or the addition of a lipid. The use of these in silico tools, in combination with biochemical and genetic analyses of transport pathways and their substrates, has resulted in improved predictions of the subcellular localization of archaeal secreted proteins, allowing for a more accurate annotation of archaeal proteomes, and has led to the identification of potential adaptations to extreme environments, as well as phyla-specific pathways among the archaea. A more comprehensive understanding of the transport pathways used and post-translational modifications of secreted archaeal proteins will also facilitate the identification and heterologous expression of commercially valuable archaeal enzymes.
Collapse
|
53
|
Paramasivam N, Linke D. ClubSub-P: Cluster-Based Subcellular Localization Prediction for Gram-Negative Bacteria and Archaea. Front Microbiol 2011; 2:218. [PMID: 22073040 PMCID: PMC3210502 DOI: 10.3389/fmicb.2011.00218] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 10/12/2011] [Indexed: 12/17/2022] Open
Abstract
The subcellular localization (SCL) of proteins provides important clues to their function in a cell. In our efforts to predict useful vaccine targets against Gram-negative bacteria, we noticed that misannotated start codons frequently lead to wrongly assigned SCLs. This and other problems in SCL prediction, such as the relatively high false-positive and false-negative rates of some tools, can be avoided by applying multiple prediction tools to groups of homologous proteins. Here we present ClubSub-P, an online database that combines existing SCL prediction tools into a consensus pipeline from more than 600 proteomes of fully sequenced microorganisms. On top of the consensus prediction at the level of single sequences, the tool uses clusters of homologous proteins from Gram-negative bacteria and from Archaea to eliminate false-positive and false-negative predictions. ClubSub-P can assign the SCL of proteins from Gram-negative bacteria and Archaea with high precision. The database is searchable, and can easily be expanded using either new bacterial genomes or new prediction tools as they become available. This will further improve the performance of the SCL prediction, as well as the detection of misannotated start codons and other annotation errors. ClubSub-P is available online at http://toolkit.tuebingen.mpg.de/clubsubp/
Collapse
Affiliation(s)
- Nagarajan Paramasivam
- Department I Protein Evolution, Max Planck Institute for Developmental Biology Tübingen, Germany
| | | |
Collapse
|
54
|
Proteomic characterization of cellular and molecular processes that enable the Nanoarchaeum equitans--Ignicoccus hospitalis relationship. PLoS One 2011; 6:e22942. [PMID: 21826220 PMCID: PMC3149612 DOI: 10.1371/journal.pone.0022942] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2011] [Accepted: 07/01/2011] [Indexed: 12/05/2022] Open
Abstract
Nanoarchaeum equitans, the only cultured representative of the Nanoarchaeota, is dependent on direct physical contact with its host, the hyperthermophile Ignicoccus hospitalis. The molecular mechanisms that enable this relationship are unknown. Using whole-cell proteomics, differences in the relative abundance of >75% of predicted protein-coding genes from both Archaea were measured to identify the specific response of I. hospitalis to the presence of N. equitans on its surface. A purified N. equitans sample was also analyzed for evidence of interspecies protein transfer. The depth of cellular proteome coverage achieved here is amongst the highest reported for any organism. Based on changes in the proteome under the specific conditions of this study, I. hospitalis reacts to N. equitans by curtailing genetic information processing (replication, transcription) in lieu of intensifying its energetic, protein processing and cellular membrane functions. We found no evidence of significant Ignicoccus biosynthetic enzymes being transported to N. equitans. These results suggest that, under laboratory conditions, N. equitans diverts some of its host's metabolism and cell cycle control to compensate for its own metabolic shortcomings, thus appearing to be entirely dependent on small, transferable metabolites and energetic precursors from I. hospitalis.
Collapse
|
55
|
Worm P, Stams AJM, Cheng X, Plugge CM. Growth- and substrate-dependent transcription of formate dehydrogenase and hydrogenase coding genes in Syntrophobacter fumaroxidans and Methanospirillum hungatei. MICROBIOLOGY-SGM 2010; 157:280-289. [PMID: 20884694 DOI: 10.1099/mic.0.043927-0] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Transcription of genes coding for formate dehydrogenases (fdh genes) and hydrogenases (hyd genes) in Syntrophobacter fumaroxidans and Methanospirillum hungatei was studied following growth under different conditions. Under all conditions tested, all fdh and hyd genes were transcribed. However, transcription levels of the individual genes varied depending on the substrate and growth conditions. Our results strongly suggest that in syntrophically grown S. fumaroxidans cells, the [FeFe]-hydrogenase (encoded by Sfum_844-46), FDH1 (Sfum_2703-06) and Hox (Sfum_2713-16) may confurcate electrons from NADH and ferredoxin to protons and carbon dioxide to produce hydrogen and formate, respectively. Based on bioinformatic analysis, a membrane-integrated energy-converting [NiFe]-hydrogenase (Mhun_1741-46) of M. hungatei might be involved in the energy-dependent reduction of CO(2) to formylmethanofuran. The best candidates for F(420)-dependent N(5),N(10)-methyl-H(4) MPT and N(5),N(10),-methylene-H(4)MPT reduction are the cytoplasmic [NiFe]-hydrogenase and FDH1. 16S rRNA ratios indicate that in one of the triplicate co-cultures of S. fumaroxidans and M. hungatei, less energy was available for S. fumaroxidans. This led to enhanced transcription of genes coding for the Rnf-complex (Sfum_2694-99) and of several fdh and hyd genes. The Rnf-complex probably reoxidized NADH with ferredoxin reduction, followed by ferredoxin oxidation by the induced formate dehydrogenases and hydrogenases.
Collapse
Affiliation(s)
- Petra Worm
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| | - Alfons J M Stams
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| | - Xu Cheng
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| | - Caroline M Plugge
- Laboratory of Microbiology, Wageningen University, Dreijenplein 10, 6703 HB Wageningen, The Netherlands
| |
Collapse
|
56
|
Bagos PG, Nikolaou EP, Liakopoulos TD, Tsirigos KD. Combined prediction of Tat and Sec signal peptides with hidden Markov models. ACTA ACUST UNITED AC 2010; 26:2811-7. [PMID: 20847219 DOI: 10.1093/bioinformatics/btq530] [Citation(s) in RCA: 169] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Computational prediction of signal peptides is of great importance in computational biology. In addition to the general secretory pathway (Sec), Bacteria, Archaea and chloroplasts possess another major pathway that utilizes the Twin-Arginine translocase (Tat), which recognizes longer and less hydrophobic signal peptides carrying a distinctive pattern of two consecutive Arginines (RR) in the n-region. A major functional differentiation between the Sec and Tat export pathways lies in the fact that the former translocates secreted proteins unfolded through a protein-conducting channel, whereas the latter translocates completely folded proteins using an unknown mechanism. The purpose of this work is to develop a novel method for predicting and discriminating Sec from Tat signal peptides at better accuracy. RESULTS We report the development of a novel method, PRED-TAT, which is capable of discriminating Sec from Tat signal peptides and predicting their cleavage sites. The method is based on Hidden Markov Models and possesses a modular architecture suitable for both Sec and Tat signal peptides. On an independent test set of experimentally verified Tat signal peptides, PRED-TAT clearly outperforms the previously proposed methods TatP and TATFIND, whereas, when evaluated as a Sec signal peptide predictor compares favorably to top-scoring predictors such as SignalP and Phobius. The method is freely available for academic users at http://www.compgen.org/tools/PRED-TAT/.
Collapse
Affiliation(s)
- Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Central Greece, Papasiopoulou 2-4, Lamia 35100, Greece.
| | | | | | | |
Collapse
|
57
|
Shaping the archaeal cell envelope. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2010; 2010:608243. [PMID: 20671907 PMCID: PMC2910488 DOI: 10.1155/2010/608243] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Accepted: 05/29/2010] [Indexed: 12/12/2022]
Abstract
Although archaea have a similar cellular organization as other prokaryotes, the lipid composition of their membranes and their cell surface is unique. Here we discuss recent developments in our understanding of the archaeal protein secretion mechanisms, the assembly of macromolecular cell surface structures, and the release of S-layer-coated vesicles from the archaeal membrane.
Collapse
|
58
|
Łabaj PP, Leparc GG, Bardet AF, Kreil G, Kreil DP. Single amino acid repeats in signal peptides. FEBS J 2010; 277:3147-57. [DOI: 10.1111/j.1742-4658.2010.07720.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
59
|
The Sulfolobus rod-shaped virus 2 encodes a prominent structural component of the unique virion release system in Archaea. Virology 2010; 404:1-4. [PMID: 20488501 DOI: 10.1016/j.virol.2010.04.020] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Revised: 03/08/2010] [Accepted: 04/20/2010] [Indexed: 10/19/2022]
Abstract
Recently a unique mechanism of virion release was discovered in Archaea, different from lysis and egress systems of bacterial and eukaryotic viruses. It involves formation of pyramidal structures on the host cell surface that rupture the S-layer and by opening outwards, create apertures through which mature virions escape the cell. Here we present results of a protein analysis of Sulfolobus islandicus cells infected with the rudivirus SIRV2, which enable us to postulate SIRV2-encoded protein P98 as the major constituent of these exceptional cellular ultrastructures.
Collapse
|
60
|
Goudenège D, Avner S, Lucchetti-Miganeh C, Barloy-Hubler F. CoBaltDB: Complete bacterial and archaeal orfeomes subcellular localization database and associated resources. BMC Microbiol 2010; 10:88. [PMID: 20331850 PMCID: PMC2850352 DOI: 10.1186/1471-2180-10-88] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 03/23/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The functions of proteins are strongly related to their localization in cell compartments (for example the cytoplasm or membranes) but the experimental determination of the sub-cellular localization of proteomes is laborious and expensive. A fast and low-cost alternative approach is in silico prediction, based on features of the protein primary sequences. However, biologists are confronted with a very large number of computational tools that use different methods that address various localization features with diverse specificities and sensitivities. As a result, exploiting these computer resources to predict protein localization accurately involves querying all tools and comparing every prediction output; this is a painstaking task. Therefore, we developed a comprehensive database, called CoBaltDB, that gathers all prediction outputs concerning complete prokaryotic proteomes. DESCRIPTION The current version of CoBaltDB integrates the results of 43 localization predictors for 784 complete bacterial and archaeal proteomes (2.548.292 proteins in total). CoBaltDB supplies a simple user-friendly interface for retrieving and exploring relevant information about predicted features (such as signal peptide cleavage sites and transmembrane segments). Data are organized into three work-sets ("specialized tools", "meta-tools" and "additional tools"). The database can be queried using the organism name, a locus tag or a list of locus tags and may be browsed using numerous graphical and text displays. CONCLUSIONS With its new functionalities, CoBaltDB is a novel powerful platform that provides easy access to the results of multiple localization tools and support for predicting prokaryotic protein localizations with higher confidence than previously possible. CoBaltDB is available at http://www.umr6026.univ-rennes1.fr/english/home/research/basic/software/cobalten.
Collapse
Affiliation(s)
- David Goudenège
- CNRS UMR 6026, ICM, Equipe B@SIC, Université de Rennes 1, Campus de Beaulieu, 35042 Rennes, France
| | | | | | | |
Collapse
|
61
|
Impact of molecular hydrogen on chalcopyrite bioleaching by the extremely thermoacidophilic archaeon Metallosphaera sedula. Appl Environ Microbiol 2010; 76:2668-72. [PMID: 20190092 DOI: 10.1128/aem.02016-09] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Hydrogen served as a competitive inorganic energy source, impacting the CuFeS(2) bioleaching efficiency of the extremely thermoacidophilic archaeon Metallosphaera sedula. Open reading frames encoding key terminal oxidase and electron transport chain components were triggered by CuFeS(2). Evidence of heterotrophic metabolism was noted after extended periods of bioleaching, presumably related to cell lysis.
Collapse
|
62
|
Magidovich H, Yurist-Doutsch S, Konrad Z, Ventura VV, Dell A, Hitchen PG, Eichler J. AglP is a S-adenosyl-L-methionine-dependent methyltransferase that participates in the N-glycosylation pathway of Haloferax volcanii. Mol Microbiol 2010; 76:190-9. [PMID: 20149102 DOI: 10.1111/j.1365-2958.2010.07090.x] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
While pathways for N-glycosylation in Eukarya and Bacteria have been solved, considerably less is known of this post-translational modification in Archaea. In the halophilic archaeon Haloferax volcanii, proteins encoded by the agl genes are involved in the assembly and attachment of a pentasaccharide to select asparagine residues of the S-layer glycoprotein. AglP, originally identified based on the proximity of its encoding gene to other agl genes whose products were shown to participate in N-glycosylation, was proposed, based on sequence homology, to serve as a methyltransferase. In the present report, gene deletion and mass spectrometry were employed to reveal that AglP is responsible for adding a 14 Da moiety to a hexuronic acid found at position four of the pentasaccharide decorating the Hfx. volcanii S-layer glycoprotein. Subsequent purification of a tagged version of AglP and development of an in vitro assay to test the function of the protein confirmed that AglP is a S-adenosyl-L-methionine-dependent methyltransferase.
Collapse
Affiliation(s)
- Hilla Magidovich
- Department of Life Sciences, Ben Gurion University, Beersheva, Israel
| | | | | | | | | | | | | |
Collapse
|
63
|
Choo KH, Tan TW, Ranganathan S. A comprehensive assessment of N-terminal signal peptides prediction methods. BMC Bioinformatics 2009; 10 Suppl 15:S2. [PMID: 19958512 PMCID: PMC2788353 DOI: 10.1186/1471-2105-10-s15-s2] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Background Amino-terminal signal peptides (SPs) are short regions that guide the targeting of secretory proteins to the correct subcellular compartments in the cell. They are cleaved off upon the passenger protein reaching its destination. The explosive growth in sequencing technologies has led to the deposition of vast numbers of protein sequences necessitating rapid functional annotation techniques, with subcellular localization being a key feature. Of the myriad software prediction tools developed to automate the task of assigning the SP cleavage site of these new sequences, we review here, the performance and reliability of commonly used SP prediction tools. Results The available signal peptide data has been manually curated and organized into three datasets representing eukaryotes, Gram-positive and Gram-negative bacteria. These datasets are used to evaluate thirteen prediction tools that are publicly available. SignalP (both the HMM and ANN versions) maintains consistency and achieves the best overall accuracy in all three benchmarking experiments, ranging from 0.872 to 0.914 although other prediction tools are narrowing the performance gap. Conclusion The majority of the tools evaluated in this study encounter no difficulty in discriminating between secretory and non-secretory proteins. The challenge clearly remains with pinpointing the correct SP cleavage site. The composite scoring schemes employed by SignalP may help to explain its accuracy. Prediction task is divided into a number of separate steps, thus allowing each score to tackle a particular aspect of the prediction.
Collapse
Affiliation(s)
- Khar Heng Choo
- Institute for Infocomm Research, 1 Fusionopolis Way, #21-01 Connexis, Singapore.
| | | | | |
Collapse
|
64
|
Ellen AF, Albers SV, Driessen AJM. Comparative study of the extracellular proteome of Sulfolobus species reveals limited secretion. Extremophiles 2009; 14:87-98. [PMID: 19957093 PMCID: PMC2797410 DOI: 10.1007/s00792-009-0290-y] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2009] [Accepted: 11/10/2009] [Indexed: 01/01/2023]
Abstract
Although a large number of potentially secreted proteins can be predicted on the basis of genomic distribution of signal sequence-bearing proteins, protein secretion in Archaea has barely been studied. A proteomic inventory and comparison of the growth medium proteins in three hyperthermoacidophiles, i.e., Sulfolobus solfataricus, S. acidocaldarius and S. tokodaii, indicates that only few proteins are freely secreted into the growth medium and that the majority originates from cell envelope bound forms. In S. acidocaldarius both cell-associated and secreted alpha-amylase activities are detected. Inactivation of the amyA gene resulted in a complete loss of activity, suggesting that the same protein is responsible for the a-amylase activity at both locations. It is concluded that protein secretion in Sulfolobus is a limited process, and it is suggested that the S-layer may act as a barrier for the free diffusion of folded proteins into the medium.
Collapse
Affiliation(s)
- Albert F Ellen
- Department of Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Kerklaan 30, 9751 NN Haren, The Netherlands
| | | | | |
Collapse
|
65
|
Different minimal signal peptide lengths recognized by the archaeal prepilin-like peptidases FlaK and PibD. J Bacteriol 2009; 191:6732-40. [PMID: 19717585 DOI: 10.1128/jb.00673-09] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In Archaea, the preflagellin peptidase (a type IV prepilin-like peptidase designated FlaK in Methanococcus voltae and Methanococcus maripaludis) is the enzyme that cleaves the N-terminal signal peptide from preflagellins. In methanogens and several other archaeal species, the typical flagellin signal peptide length is 11 to 12 amino acids, while in other archaea preflagellins possess extremely short signal peptides. A systematic approach to address the signal peptide length requirement for preflagellin processing is presented in this study. M. voltae preflagellin FlaB2 proteins with signal peptides 3 to 12 amino acids in length were generated and used as a substrate in an in vitro assay utilizing M. voltae membranes as an enzyme source. Processing by FlaK was observed in FlaB2 proteins containing signal peptides shortened to 5 amino acids; signal peptides 4 or 3 amino acids in length were unprocessed. In the case of Sulfolobus solfataricus, where the preflagellin peptidase PibD has broader substrate specificity, some predicted substrates have predicted signal peptides as short as 3 amino acids. Interestingly, the shorter signal peptides of the various mutant FlaB2 proteins not processed by FlaK were processed by PibD, suggesting that some archaeal preflagellin peptidases are likely adapted toward cleaving shorter signal peptides. The functional complementation of signal peptidase activity by FlaK and PibD in an M. maripaludis DeltaflaK mutant indicated that processing of preflagellins was detected by complementation with either FlaK or PibD, yet only FlaK-complemented cells were flagellated. This suggested that a block in an assembly step subsequent to signal peptide removal occurred in the PibD complementation.
Collapse
|