1
|
Liu J, Parrish JR, Hines J, Mansfield L, Finley RL. A proteome-wide screen of Campylobacter jejuni using protein microarrays identifies novel and conformational antigens. PLoS One 2019; 14:e0210351. [PMID: 30633767 PMCID: PMC6329530 DOI: 10.1371/journal.pone.0210351] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 12/20/2018] [Indexed: 02/07/2023] Open
Abstract
Campylobacter jejuni (C. jejuni) is a foodborne intestinal pathogen and major cause of gastroenteritis worldwide. C. jejuni proteins that are immunogenic have been sought for their potential use in the development of biomarkers, diagnostic assays, or subunit vaccines for humans or livestock. To identify new immunogenic C. jejuni proteins, we used a native protein microarray approach. A protein chip, with over 1400 individually purified GST-tagged C. jejuni proteins, representing over 86% of the proteome, was constructed to screen for antibody titers present in test sera raised against whole C. jejuni cells. Dual detection of GST signals was incorporated as a way of normalizing the variation of protein concentrations contributing to the antibody staining intensities. We detected strong signals to 102 C. jejuni antigens. In addition to antigens recognized by antiserum raised against C. jejuni, parallel experiments were conducted to identify antigens cross-reactive to antiserum raised against various serotypes of E. coli or Salmonella or to healthy human sera. This led to the identification of 34 antigens specifically recognized by the C. jejuni antiserum, only four of which were previously known. The chip approach also allowed identification of conformational antigens. We demonstrate in the case of Cj1621 that antigen signals are lost to denaturing conditions commonly used in other approaches to identify immunogens. Antigens identified in this study include those possessing sequence features indicative of cell surface localization, as well as those that do not. Together, our results indicate that the unbiased chip-based screen can help reveal the full repertoire of host antibodies against microbial proteomes.
Collapse
Affiliation(s)
- Jiayou Liu
- Center for Molecular Medicine & Genetics, Wayne State University School of Medicine, Detroit, Michigan, United States of America
| | - Jodi R Parrish
- Center for Molecular Medicine & Genetics, Wayne State University School of Medicine, Detroit, Michigan, United States of America
| | - Julie Hines
- Center for Molecular Medicine & Genetics, Wayne State University School of Medicine, Detroit, Michigan, United States of America
| | - Linda Mansfield
- Department of Microbiology & Molecular Genetics, Michigan State University, East Lansing, Michigan, United States of America
| | - Russell L Finley
- Center for Molecular Medicine & Genetics, Wayne State University School of Medicine, Detroit, Michigan, United States of America.,Department of Microbiology, Immunology, and Biochemistry Wayne State University School of Medicine, Detroit, Michigan, United States of America
| |
Collapse
|
2
|
RepEx: A web server to extract sequence repeats from protein and DNA sequences. Comput Biol Chem 2018; 78:424-430. [PMID: 30598392 DOI: 10.1016/j.compbiolchem.2018.12.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Accepted: 12/25/2018] [Indexed: 11/20/2022]
Abstract
Evolution builds up new genetic material from existing ones, not in random, but in highly ordered and eloquent patterns. Most of these sequence repeats are revelatory of valuable information contributing to areas of disease research and function of macromolecules, to name a few. In the age of next generation genome sequencing, rapid and efficient extraction of all unbiased sequence repeats from macromolecules is the need of the hour. In view of this reckoning, an online web-based computing server, RepEx, has been developed to extract and display all possible repeats for DNA and protein sequences. Apart from exact or identical repeats, the server has been designed adeptly to identify and extract degenerate, inverted, everted and mirror repeats from both DNA and protein sequences. The server has striking output displays, featuring interactive graphs and comprehensive output files. In addition, RepEx has been accoutered with an easy-to-use interface and search filters to facilitate a user-defined query or search and is freely available and accessible via the World Wide Web at http://bioserver2.physics.iisc.ac.in/RepEx/.
Collapse
|
3
|
Kazemian HB, Yusuf SA, White K. Signal peptide discrimination and cleavage site identification using SVM and NN. Comput Biol Med 2014; 45:98-110. [PMID: 24480169 DOI: 10.1016/j.compbiomed.2013.11.017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2013] [Revised: 10/28/2013] [Accepted: 11/22/2013] [Indexed: 10/25/2022]
Abstract
About 15% of all proteins in a genome contain a signal peptide (SP) sequence, at the N-terminus, that targets the protein to intracellular secretory pathways. Once the protein is targeted correctly in the cell, the SP is cleaved, releasing the mature protein. Accurate prediction of the presence of these short amino-acid SP chains is crucial for modelling the topology of membrane proteins, since SP sequences can be confused with transmembrane domains due to similar composition of hydrophobic amino acids. This paper presents a cascaded Support Vector Machine (SVM)-Neural Network (NN) classification methodology for SP discrimination and cleavage site identification. The proposed method utilises a dual phase classification approach using SVM as a primary classifier to discriminate SP sequences from Non-SP. The methodology further employs NNs to predict the most suitable cleavage site candidates. In phase one, a SVM classification utilises hydrophobic propensities as a primary feature vector extraction using symmetric sliding window amino-acid sequence analysis for discrimination of SP and Non-SP. In phase two, a NN classification uses asymmetric sliding window sequence analysis for prediction of cleavage site identification. The proposed SVM-NN method was tested using Uni-Prot non-redundant datasets of eukaryotic and prokaryotic proteins with SP and Non-SP N-termini. Computer simulation results demonstrate an overall accuracy of 0.90 for SP and Non-SP discrimination based on Matthews Correlation Coefficient (MCC) tests using SVM. For SP cleavage site prediction, the overall accuracy is 91.5% based on cross-validation tests using the novel SVM-NN model.
Collapse
Affiliation(s)
| | | | - K White
- London Metropolitan University, UK
| |
Collapse
|
4
|
Neumann S, Fuchs A, Hummel B, Frishman D. Classification of α-helical membrane proteins using predicted helix architectures. PLoS One 2013; 8:e77491. [PMID: 24204844 PMCID: PMC3808409 DOI: 10.1371/journal.pone.0077491] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 08/26/2013] [Indexed: 11/24/2022] Open
Abstract
Despite significant methodological advances in protein structure determination high-resolution structures of membrane proteins are still rare, leaving sequence-based predictions as the only option for exploring the structural variability of membrane proteins at large scale. Here, a new structural classification approach for α-helical membrane proteins is introduced based on the similarity of predicted helix interaction patterns. Its application to proteins with known 3D structure showed that it is able to reliably detect structurally similar proteins even in the absence of any sequence similarity, reproducing the SCOP and CATH classifications with a sensitivity of 65% at a specificity of 90%. We applied the new approach to enhance our comprehensive structural classification of α-helical membrane proteins (CAMPS), which is primarily based on sequence and topology similarity, in order to find protein clusters that describe the same fold in the absence of sequence similarity. The total of 151 helix architectures were delineated for proteins with more than four transmembrane segments. Interestingly, we observed that proteins with 8 and more transmembrane helices correspond to fewer different architectures than proteins with up to 7 helices, suggesting that in large membrane proteins the evolutionary tendency to re-use already available folds is more pronounced.
Collapse
Affiliation(s)
- Sindy Neumann
- Department of Genome Oriented Bioinformatics, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
| | - Angelika Fuchs
- pRED, Pharma Research and Early Development, pRED Informatics, Roche Diagnostics GmbH, Penzberg, Germany
| | - Barbara Hummel
- Department of Urology/Women’s Hospital and Center for Clinical Research, University of Freiburg Medical Center, Freiburg, Germany
| | - Dmitrij Frishman
- Department of Genome Oriented Bioinformatics, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
- Helmholtz Center Munich - German Research Center for Environmental Health (GmbH), Institute of Bioinformatics and Systems Biology, Neuherberg, Germany
- * E-mail:
| |
Collapse
|
5
|
Mathias RA, Chen YS, Kapp EA, Greening DW, Mathivanan S, Simpson RJ. Triton X-114 phase separation in the isolation and purification of mouse liver microsomal membrane proteins. Methods 2011; 54:396-406. [DOI: 10.1016/j.ymeth.2011.01.006] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Revised: 01/17/2011] [Accepted: 01/19/2011] [Indexed: 11/29/2022] Open
|
6
|
Mathias RA, Chen YS, Goode RJA, Kapp EA, Mathivanan S, Moritz RL, Zhu HJ, Simpson RJ. Tandem application of cationic colloidal silica and Triton X-114 for plasma membrane protein isolation and purification: towards developing an MDCK protein database. Proteomics 2011; 11:1238-53. [PMID: 21337516 DOI: 10.1002/pmic.201000591] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2010] [Revised: 12/03/2010] [Accepted: 12/13/2010] [Indexed: 11/07/2022]
Abstract
Plasma membrane (PM) proteins are attractive therapeutic targets because of their accessibility to drugs. Although genes encoding PM proteins represent 20-30% of eukaryotic genomes, a detailed characterisation of their encoded proteins is underrepresented, due, to their low copy number and the inherent difficulties in their isolation and purification as a consequence of their high hydrophobicity. We describe here a strategy that combines two orthogonal methods to isolate and purify PM proteins from Madin Darby canine kidney (MDCK) cells. In this two-step method, we first used cationic colloidal silica (CCS) to isolate adherent (Ad) and non-adherent (nAd) PM fractions, and then subjected each fraction to Triton X-114 (TX-114) phase partitioning to further enrich for hydrophobic proteins. While CCS alone identified 255/757 (34%) membrane proteins, CCS/TX-114 in combination yielded 453/745 (61%). Strikingly, of those proteins unique to CCS/TX-114, 277/393 (70%) had membrane annotation. Further characterisation of the CCS/TX-114 data set using Uniprot and transmembrane hidden Markov model revealed that 306/745 (41%) contained one or more transmembrane domains (TMDs), including proteins with 25 and 17 TMDs. Of the remaining proteins in the data set, 69/439 (16%) are known to contain lipid modifications. Of all membrane proteins identified, 93 had PM origin, including proteins that mediate cell adhesion, modulate transmembrane ion transport, and cell-cell communication. These studies reveal that the application of CCS to first isolate Ad and nAd PM fractions, followed by their detergent-phase TX-114 partitioning, to be a powerful method to isolate low-abundance PM proteins, and a useful adjunct for in-depth cell surface proteome analyses.
Collapse
Affiliation(s)
- Rommel A Mathias
- Ludwig Institute for Cancer Research, Parkville, Victoria, Australia
| | | | | | | | | | | | | | | |
Collapse
|
7
|
Influence of assignment on the prediction of transmembrane helices in protein structures. Amino Acids 2010; 39:1241-54. [DOI: 10.1007/s00726-010-0559-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2009] [Accepted: 03/08/2010] [Indexed: 02/01/2023]
|
8
|
Lee HJ, Kwon MS, Lee EY, Cho SY, Paik YK. Establishment of a PF2D-MS/MS platform for rapid profiling and semiquantitative analysis of membrane protein biomarkers. Proteomics 2008; 8:2168-77. [PMID: 18528841 DOI: 10.1002/pmic.200701022] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Current proteome profiling techniques have identified relatively few mammalian membrane proteins despite their numerous important functions. To establish a standard throughput-potential profiling platform for membrane proteins, Triton X-100-solubilized rat liver microsomal proteins were separated on a 2-D separation system (2-D liquid phase fractionation (PF2D)) in two different pH ranges (4.0-8.5 and 7.0-10.5). This system produced 182 proteins with more than two transmembrane domain (TMD), including 16 TMDs with high confidence. Comparative 2-D liquid maps with high resolution and reproducibility have been constructed for liver microsome from the phenobarbital (PB) treated rats. PF2D was also found to be useful for the semiquantification of some representative cytochrome P450 family proteins (e.g., cytochrome P450 2B2) that were induced by PB treatment compared with untreated controls. Thus, the combination of both high-detection capacity and rapid preliminary semiquantification in a PF2D platform could become a standard system for the routine analysis of membrane proteins.
Collapse
Affiliation(s)
- Hyoung-Joo Lee
- Department of Biochemistry, Yonsei Proteome Research Center and Biomedical Proteome Research Center, Yonsei University, Sudaemoon-Ku, Seoul, Korea
| | | | | | | | | |
Collapse
|
9
|
|
10
|
Baumgärtner M, Kärst U, Gerstel B, Loessner M, Wehland J, Jänsch L. Inactivation of Lgt allows systematic characterization of lipoproteins from Listeria monocytogenes. J Bacteriol 2006; 189:313-24. [PMID: 17041050 PMCID: PMC1797373 DOI: 10.1128/jb.00976-06] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Lipoprotein anchoring in bacteria is mediated by the prolipoprotein diacylglyceryl transferase (Lgt), which catalyzes the transfer of a diacylglyceryl moiety to the prospective N-terminal cysteine of the mature lipoprotein. Deletion of the lgt gene in the gram-positive pathogen Listeria monocytogenes (i) impairs intracellular growth of the bacterium in different eukaryotic cell lines and (ii) leads to increased release of lipoproteins into the culture supernatant. Comparative extracellular proteome analyses of the EGDe wild-type strain and the Delta lgt mutant provided systematic insight into the relative expression of lipoproteins. Twenty-six of the 68 predicted lipoproteins were specifically released into the extracellular proteome of the Delta lgt strain, and this proved that deletion of lgt is an excellent approach for experimental verification of listerial lipoproteins. Consequently, we generated Delta lgt Delta prfA double mutants to detect lipoproteins belonging to the main virulence regulon that is controlled by PrfA. Overall, we identified three lipoproteins whose extracellular levels are regulated and one lipoprotein that is posttranslationally modified depending on PrfA. It is noteworthy that in contrast to previous studies of Escherichia coli, we unambiguously demonstrated that lipidation by Lgt is not a prerequisite for activity of the lipoprotein-specific signal peptidase II (Lsp) in Listeria.
Collapse
Affiliation(s)
- Maja Baumgärtner
- Department of Cell Biology, Helmholtz Centre for Infection Research (HZI), D-38124 Braunschweig, Germany
| | | | | | | | | | | |
Collapse
|
11
|
Thornton J, Blakey D, Scanlon E, Merrick M. The ammonia channel protein AmtB from Escherichia coli is a polytopic membrane protein with a cleavable signal peptide. FEMS Microbiol Lett 2006; 258:114-20. [PMID: 16630265 DOI: 10.1111/j.1574-6968.2006.00202.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
The Escherichia coli ammonia channel protein, AmtB, is a homotrimeric polytopic inner membrane protein in which each subunit has 11 transmembrane helices. We have shown that the structural gene amtB encodes a preprotein with a signal peptide that is cleaved off to produce a topology with the N-terminus in the periplasm and the C-terminus in the cytoplasm. Deletion of the signal peptide coding region results in significantly lower levels of AmtB accumulation in the membrane but modification of the signal peptidase cleavage site, leading to aberrant cleavage, does not prevent trimer formation and does not inactivate the protein. The presence of a signal peptide is apparently not a conserved feature of all prokaryotic Amt proteins. Comparison of predicted AmtB sequences suggests that while Amt proteins in Gram-negative organisms utilize a signal peptide, the homologous proteins in Gram-positive organisms do not.
Collapse
Affiliation(s)
- Jeremy Thornton
- Department of Molecular Microbiology, John Innes Centre, Norwich, UK
| | | | | | | |
Collapse
|
12
|
Dieterich G, Kärst U, Fischer E, Wehland J, Jänsch L. LEGER: knowledge database and visualization tool for comparative genomics of pathogenic and non-pathogenic Listeria species. Nucleic Acids Res 2006; 34:D402-6. [PMID: 16381897 PMCID: PMC1347434 DOI: 10.1093/nar/gkj071] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Listeria species are ubiquitous in the environment and often contaminate foods because they grow under conditions used for food preservation. Listeria monocytogenes, the human and animal pathogen, causes Listeriosis, an infection with a high mortality rate in risk groups such as immune-compromised individuals. Furthermore, L.monocytogenes is a model organism for the study of intracellular bacterial pathogens. The publication of its genome sequence and that of the non-pathogenic species Listeria innocua initiated numerous comparative studies and efforts to sequence all species comprising the genus. The Proteome database LEGER (http://leger2.gbf.de/cgi-bin/expLeger.pl) was developed to support functional genome analyses by combining information obtained by applying bioinformatics methods and from public databases to improve the original annotations. LEGER offers three unique key features: (i) it is the first comprehensive information system focusing on the functional assignment of genes and proteins; (ii) integrated visualization tools, KEGG pathway and Genome Viewer, alleviate the functional exploration of complex data; and (iii) LEGER presents results of systematic post-genome studies, thus facilitating analyses combining computational and experimental results. Moreover, LEGER provides an unpublished membrane proteome analysis of L.innocua and in total visualizes experimentally validated information about the subcellular localizations of 789 different listerial proteins.
Collapse
Affiliation(s)
- Guido Dieterich
- Department for Cell Biology, Research Centre for Biotechnology (GBF), Mascheroder Weg 1, 38124 Braunschweig, Germany
| | | | | | | | | |
Collapse
|
13
|
Monné M, Chan KW, Slotboom DJ, Kunji ERS. Functional expression of eukaryotic membrane proteins in Lactococcus lactis. Protein Sci 2005; 14:3048-56. [PMID: 16260761 PMCID: PMC2253241 DOI: 10.1110/ps.051689905] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The overproduction of eukaryotic membrane proteins is a major impediment in their structural and functional characterization. Here we have used the nisin-inducible expression system of Lactococcus lactis for the overproduction of 11 mitochondrial transport proteins from yeast. They were expressed at high levels in a functional state in the cytoplasmic membrane. The results also show that the level of expression is influenced by the N-terminal regions of the transporters. Expression levels were improved >10-fold either by replacing or truncating these regions or by adding lactococcal signal peptides. The observed expression levels are now compatible with a realistic exploration of crystallization conditions. The lactococcal expression system may be used for the high-throughput functional characterization of eukaryotic membrane proteins and structural genomics.
Collapse
Affiliation(s)
- Magnus Monné
- The Medical Research Council, Dunn Human Nutrition Unit, Hills Road, Wellcome Trust/MRC Building, Cambridge CB2 2XY, UK
| | | | | | | |
Collapse
|
14
|
Arai M, Fukushi T, Satake M, Shimizu T. A proteome-wide analysis of domain architectures of prokaryotic single-spanning transmembrane proteins. Comput Biol Chem 2005; 29:379-87. [PMID: 16213795 DOI: 10.1016/j.compbiolchem.2005.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2005] [Accepted: 08/08/2005] [Indexed: 11/24/2022]
Abstract
We performed a proteome-wide survey of the domain architectures in single-spanning transmembrane (TM) proteins (single-spannings) from 87 sequenced prokaryotic (Bacterial and Archaean) genomes by assigning Pfam domains to their N-tail and C-tail loops. Out of 14,625 single-spannings, 3,516 sequences have at least one domain assigned, and no domains were assigned to 7,850, with the remaining 3,259 with less reliable assignment. In the domain-assigned sequences, 3116 sequences are with at most two domains, and the other 400 sequences with more than two. The assigned domains distribute over 651 Pfam families, which account for 11.4% of the total Pfam-A families. Among the 651 families are mostly soluble-protein-originated ones, but only 21 families are unique to TM proteins. The occurrence frequency of the individual domain families follows a power-law, that is, 264 families occur only once, 106 just twice, and the families appeared more than 30 times are counted by only 39. It is found that the great majority of the sequences having one or two domains are of the type II topology with the C-tail loop containing domains on it. On the contrary, the N-tail loop of the same type topology seldom carries domains. Importantly, the assigned domains are always found on the tail loops longer than 60 residues, even for the small domains with less than 30 residues. There are still as many as 5,800 sequences without assigned domains in spite of having at least one long tail, on which no less than 1,000 novel domain families are expected most likely to lie concealed unknown yet. We also investigated the domain arrangement preference and the domain family combination patterns in 'singlets' (single-spannings with one assigned domain) and 'doublets' (with two domains).
Collapse
Affiliation(s)
- Masafumi Arai
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Japan
| | | | | | | |
Collapse
|
15
|
Pradel N, Decorps A, Ye C, Santini CL, Wu LF. YidC-dependent translocation of green fluorescence protein fused to the FliP cleavable signal peptide. Biochimie 2005; 87:191-6. [PMID: 15760712 DOI: 10.1016/j.biochi.2004.11.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2004] [Accepted: 11/05/2004] [Indexed: 11/17/2022]
Abstract
Escherichia coli FliP is a rare bacterial polytopic membrane protein synthesized with a cleavable, highly hydrophobic signal peptide. More hydrophilic Tat-dependent or Sec-dependent signal peptide is functionally capable of substituting for the FliP signal peptide, but a signal anchor of inner membrane protein fails to do so. To assess the intrinsic characteristics of the FliP signal peptide in mediating protein translocation, we fused it to green fluorescence protein and observed that the translocation of the chimera (FliPss-GFP) was dependent of Ffh, SecA, SecY and SecD. In addition, we showed for the first time the involvement of YidC in protein translocation across the inner membrane.
Collapse
Affiliation(s)
- Nathalie Pradel
- Laboratoire de Chimie Bactérienne, UPR9043, IBSM, CNRS, 31, chemin Joseph Aiguier, 13402 Marseille cedex 20, France
| | | | | | | | | |
Collapse
|
16
|
Nilsson J, Persson B, von Heijne G. Comparative analysis of amino acid distributions in integral membrane proteins from 107 genomes. Proteins 2005; 60:606-16. [PMID: 16028222 DOI: 10.1002/prot.20583] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
We have performed a comparative analysis of amino acid distributions in predicted integral membrane proteins from a total of 107 genomes. A procedure for identification of membrane spanning helices was optimized on a homology-reduced data set of 170 multi-spanning membrane proteins with experimentally determined topologies. The optimized method was then used for extraction of highly reliable partial topologies from all predicted membrane proteins in each genome, and the average biases in amino acid distributions between loops on opposite sides of the membrane were calculated. The results strongly support the notion that a biased distribution of Lys and Arg residues between cytoplasmic and extra-cytoplasmic segments (the positive-inside rule) is present in most if not all organisms.
Collapse
Affiliation(s)
- Johan Nilsson
- Center for Genomics and Bioinformatics, Karolinska Institutet, Stockholm, Sweden
| | | | | |
Collapse
|
17
|
Wehmhöner D, Dieterich G, Fischer E, Baumgärtner M, Wehland J, Jänsch L. “LANESPECTOR”, a tool for membrane proteome profiling based on sodium dodecyl sulfate-polyacrylamide gel electrophoresis/liquid chromatography - tandem mass spectrometry analysis: Application toListeria monocytogenes membrane proteins. Electrophoresis 2005; 26:2450-60. [PMID: 15966022 DOI: 10.1002/elps.200410348] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Proteomics is required to provide insight into any type of subproteome. While the workflow based on two-dimensional polyacrylamide gel electrophoresis (2-D PAGE) can be applied for many subproteomes and comprises well-established strategies for data presentation and data analysis, the comprehensive investigation of membrane proteomes remains a challenging task. We present a number of procedures that provide an insight into such systems. We have established a novel protocol for the efficient preparation of membrane fractions, which is used here for the human pathogen Listeria monocytogenes that overcomes difficulties associated with ribosomes. Subsequently, we have used the combination of sodium dodecyl sulfate (SDS)-PAGE and liquid chromatography-tandem mass spectrometry for the characterization of the membrane proteome. Three hundred and one different membrane proteins could be identified, including 70 proteins that exhibited 2-15 transmembrane domains. However, a remarkably high ratio of proteins was detected in gel sections that were not in accordance with their expected migration behavior during SDS-PAGE. Protein identifications based on MASCOT significance criteria could be shown to be of high quality and therefore could not be the explanation of this observation. Consequently we have developed LaneSpector, a general visualization tool that allows the systematic comparison between apparent and calculated protein masses, which is routinely applicable to any high-throughput approach using a mass-dependent separation dimension prior to LC-MS/MS. The detailed presentation of the LaneSpector plot promotes the validation of the analytical process and might help to reveal relevant biological processes such as proteolysis or other post-translational modifications.
Collapse
Affiliation(s)
- Dirk Wehmhöner
- Department of Cell Biology, GBF-German Research Centre for Biotechnology, Braunschweig, Germany
| | | | | | | | | | | |
Collapse
|
18
|
Trost M, Wehmhöner D, Kärst U, Dieterich G, Wehland J, Jänsch L. Comparative proteome analysis of secretory proteins from pathogenic and nonpathogenicListeriaspecies. Proteomics 2005; 5:1544-57. [PMID: 15838904 DOI: 10.1002/pmic.200401024] [Citation(s) in RCA: 134] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Extracellular proteins of bacterial pathogens play a crucial role in the infection of the host. Here we present the first comprehensive validation of the secretory subproteome of the Gram positive pathogen Listeria monocytogenes using predictive bioinformatic and experimental proteomic approaches. The previous original signal peptide (SP) prediction (Glaser et al., Science 2001, 294, 849-852) has been greatly improved by an in-depth analysis using seven different bioinformatic tools. Subsequent careful classification of the resulting data gives a probability dependent annotation of 121 putatively secreted proteins of which 45 are novel. Complementary proteomic analysis using both two-dimensional gel electrophoresis/matrix assisted laser desorption/ionization mass spectrometry and high performance liquid chromatography/electrospray ionization-mass spectrometry has identified 105 proteins in the culture supernatant of L. monocytogenes. Among these, we were able to detect all the currently known virulence factors with an SP showing the importance of this subproteome and demonstrating the reliability of the techniques used. The comparison between the L. monocytogenes wildtype and the nonpathogenic species Listeria innocua was performed to reveal proteins probably involved in pathogenicity and/or the adaptation to their respective lifestyles. In addition to the eight known virulence factors, all of which have no orthologous genes in L. innocua, eight additional proteins have been identified that exhibit the typical key feature defining the known listerial virulence factors. Further significant differences between the two species are evident in the group of cell wall and secretory proteins that warrant further study. Our investigation clearly demonstrates that the major difference between the pathogenic and nonpathogenic species, noted in the comparative genome analysis, manifests itself strongest in the secretome.
Collapse
Affiliation(s)
- Matthias Trost
- Department of Cell Biology, German Research Centre for Biotechnology, GBF, Braunschweig, Germany
| | | | | | | | | | | |
Collapse
|
19
|
Arai M, Okumura K, Satake M, Shimizu T. Proteome-wide functional classification and identification of prokaryotic transmembrane proteins by transmembrane topology similarity comparison. Protein Sci 2005; 13:2170-83. [PMID: 15273311 PMCID: PMC2279829 DOI: 10.1110/ps.04814404] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
We propose a new method for classifying and identifying transmembrane (TM) protein functions in proteome-scale by applying a single-linkage clustering method based on TM topology similarity, which is calculated simply from comparing the lengths of loop regions. In this study, we focused on 87 prokaryotic TM proteomes consisting of 31 proteobacteria, 22 gram-positive bacteria, 19 other bacteria, and 15 archaea. Prior to performing the clustering, we first categorized individual TM protein sequences as "known," "putative" (similar to "known" sequences), or "unknown" by using the homology search and the sequence similarity comparison against SWISS-PROT to assess the current status of the functional annotation of the TM proteomes based on sequence similarity only. More than three-quarters, that is, 75.7% of the TM protein sequences are functionally "unknown," with only 3.8% and 20.5% of them being classified as "known" and "putative," respectively. Using our clustering approach based on TM topology similarity, we succeeded in increasing the rate of TM protein sequences functionally classified and identified from 24.3% to 60.9%. Obtained clusters correspond well to functional superfamilies or families, and the functional classification and identification are successfully achieved by this approach. For example, in an obtained cluster of TM proteins with six TM segments, 109 sequences out of 119 sequences annotated as "ATP-binding cassette transporter" are properly included and 122 "unknown" sequences are also contained.
Collapse
Affiliation(s)
- Masafumi Arai
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561, Japan
| | | | | | | |
Collapse
|
20
|
Pradel N, Ye C, Wu LF. A cleavable signal peptide is required for the full function of the polytopic inner membrane protein FliP of Escherichia coli. Biochem Biophys Res Commun 2004; 319:1276-80. [PMID: 15194505 DOI: 10.1016/j.bbrc.2004.05.123] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2004] [Indexed: 11/19/2022]
Abstract
FliP is a rare bacterial polytopic membrane protein synthesized with a cleavable highly hydrophobic signal peptide. It is essential for flagellum assembly and for bacterial motility. In this study, we assessed specificity of signal peptide for the FliP function. Like the wild type FliP, two altered FliPs with more hydrophilic Tat- or Sec-dependent signal peptides were both able to restore the motility of the DeltafliP mutant. Therefore, the Tat- and the Sec-dependent signal peptides seemed to be compatible with the FliP function. Moreover, deletion of the FliP signal peptide or replacing it with the transmembrane segment of MotA severely impaired the FliP function. Together these results showed that a cleavable signal peptide is required for the full function of FliP.
Collapse
Affiliation(s)
- Nathalie Pradel
- Laboratoire de Chimie Bactérienne, UPR9043, IBSM, CNRS, 31, Chemin Joseph Aiguier, F-13402 Marseille cedex 20, France
| | | | | |
Collapse
|
21
|
Arai M, Mitsuke H, Ikeda M, Xia JX, Kikuchi T, Satake M, Shimizu T. ConPred II: a consensus prediction method for obtaining transmembrane topology models with high reliability. Nucleic Acids Res 2004; 32:W390-3. [PMID: 15215417 PMCID: PMC441518 DOI: 10.1093/nar/gkh380] [Citation(s) in RCA: 173] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
ConPred II (http://bioinfo.si.hirosaki-u.ac.jp/~ConPred2/) is a server for the prediction of transmembrane (TM) topology [i.e. the number of TM segments (TMSs), TMS positions and N-tail location] based on a consensus approach by combining the results of several proposed methods. The ConPred II system is constructed from ConPred_elite and ConPred_all (previously named ConPred), proposed earlier by our group. The prediction accuracy of ConPred_elite is almost 100%, which is achieved by sacrificing the prediction coverage (20-30%). ConPred_all predicts TM topologies for all the input sequences with accuracies improved by up to 11% over individual proposed methods. In the ConPred II system, the TM topology prediction of input TM protein sequences is executed following a two-step process: (i) input sequences are first run through the ConPred_elite program; (ii) sequences for which ConPred_elite does not give the TM topology are delivered to the ConPred_all program for TM topology prediction. Users can get access to the ConPred II system automatically by submitting sequences to the server. The ConPred II server will return the predicted TM topology models and graphical representations of their contents (hydropathy plots, helical wheel diagrams of predicted TMSs and snake-like diagrams).
Collapse
Affiliation(s)
- Masafumi Arai
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561, Japan
| | | | | | | | | | | | | |
Collapse
|
22
|
Shimizu T, Mitsuke H, Noto K, Arai M. Internal gene duplication in the evolution of prokaryotic transmembrane proteins. J Mol Biol 2004; 339:1-15. [PMID: 15123416 DOI: 10.1016/j.jmb.2004.03.048] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2003] [Revised: 03/20/2004] [Accepted: 03/20/2004] [Indexed: 10/26/2022]
Abstract
We investigated the evolution of transmembrane (TM) topology by detecting partial sequence repeats in TM protein sequences and analyzing them in detail. A total of 377 sequences that seem to have evolved by internal gene duplication events were found among 38,124 predicted TM protein sequences (except for single-spannings) from 87 prokaryotic genomes. Various types of internal duplication patterns were identified in these sequences. The majority of them are diploid-type (including quasi-diploid-type) duplication in which a primordial protein sequence was duplicated internally to become an extant TM protein with twice as many TM segments as the primordial one, and the remaining ones are partial duplications including triploid-type. The diploid-type repeats are recognized in many 8-tms, 10-tms and 12-tms TM protein sequences, suggesting the diploid-type duplication was a principle mechanism in the evolutionary development of these types of TM proteins. The "positive-inside" rule is satisfied in whole sequences of both 10-tms and 8-tms TM proteins and in both halves of 10-tms proteins while not necessarily in the second half of 8-tms proteins, providing fit examples of "internal divergent topology evolution" likely occurred after a diploid-type internal duplication event. From analyzing the partial duplication patterns, several evolutionary pathways were recognized for 6-tms TM proteins, i.e. from primordial 2-tms, 3-tms and 4-tms TM proteins to extant 6-tms proteins. Similarly, the duplication pattern analysis revealed plausible evolution scenarios that 7-tms TM proteins have arisen from 3-tms, 4-tms and 5-tms TM protein precursors via partial internal gene duplications.
Collapse
Affiliation(s)
- Toshio Shimizu
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561, Japan.
| | | | | | | |
Collapse
|
23
|
Käll L, Krogh A, Sonnhammer ELL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol 2004; 338:1027-36. [PMID: 15111065 DOI: 10.1016/j.jmb.2004.03.016] [Citation(s) in RCA: 1718] [Impact Index Per Article: 85.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2003] [Revised: 02/25/2004] [Accepted: 03/09/2004] [Indexed: 01/09/2023]
Abstract
An inherent problem in transmembrane protein topology prediction and signal peptide prediction is the high similarity between the hydrophobic regions of a transmembrane helix and that of a signal peptide, leading to cross-reaction between the two types of predictions. To improve predictions further, it is therefore important to make a predictor that aims to discriminate between the two classes. In addition, topology information can be gained when successfully predicting a signal peptide leading a transmembrane protein since it dictates that the N terminus of the mature protein must be on the non-cytoplasmic side of the membrane. Here, we present Phobius, a combined transmembrane protein topology and signal peptide predictor. The predictor is based on a hidden Markov model (HMM) that models the different sequence regions of a signal peptide and the different regions of a transmembrane protein in a series of interconnected states. Training was done on a newly assembled and curated dataset. Compared to TMHMM and SignalP, errors coming from cross-prediction between transmembrane segments and signal peptides were reduced substantially by Phobius. False classifications of signal peptides were reduced from 26.1% to 3.9% and false classifications of transmembrane helices were reduced from 19.0% to 7.7%. Phobius was applied to the proteomes of Homo sapiens and Escherichia coli. Here we also noted a drastic reduction of false classifications compared to TMHMM/SignalP, suggesting that Phobius is well suited for whole-genome annotation of signal peptides and transmembrane regions. The method is available at as well as at
Collapse
Affiliation(s)
- Lukas Käll
- Center for Genomics and Bioinformatics, Karolinska Institutet, SE-17 177 Stockholm, Sweden
| | | | | |
Collapse
|
24
|
Matsumoto T, Ishikawa S, Fukuda H, Kondo A. Construction of ethanol-tolerant yeast strains with combinatorial library-selected peptides. ACTA ACUST UNITED AC 2004. [DOI: 10.1016/j.molcatb.2003.12.023] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
25
|
Xia JX, Ikeda M, Shimizu T. ConPred_elite: a highly reliable approach to transmembrane topology predication. Comput Biol Chem 2004; 28:51-60. [PMID: 15022642 DOI: 10.1016/j.compbiolchem.2003.11.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The function of transmembrane (TM) proteins is closely correlated to their TM topology; large quantities of highly reliable TM topology data are becoming increasingly required. We present a new consensus approach for TM topology prediction (ConPred_elite) that can predict the whole topology with accuracies of 0.98 for prokaryotic and 0.95 for eukaryotic proteins on a dataset of experimentally-characterized TM topologies. The predicted yield on the dataset is 30.4% for prokaryotic and 21.5% for eukaryotic proteins. Applying ConPred_elite to predicted TM proteins extracted from 29 prokaryotic and 10 eukaryotic proteomes, we obtained 3871 and 7271 highly reliable TM topologies (yields, 19.8 and 13.3%), respectively. The predicted TM topology data may contribute to further research into a comprehensive functional classification and identification of TM proteins based on information of the topology.
Collapse
Affiliation(s)
- Jun-Xiong Xia
- Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561,Japan
| | | | | |
Collapse
|