1
|
Montezano D, Bernstein R, Copeland MM, Slusky JSG. General features of transmembrane beta barrels from a large database. Proc Natl Acad Sci U S A 2023; 120:e2220762120. [PMID: 37432995 PMCID: PMC10629564 DOI: 10.1073/pnas.2220762120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Accepted: 06/03/2023] [Indexed: 07/13/2023] Open
Abstract
Large datasets contribute new insights to subjects formerly investigated by exemplars. We used coevolution data to create a large, high-quality database of transmembrane β-barrels (TMBB). By applying simple feature detection on generated evolutionary contact maps, our method (IsItABarrel) achieves 95.88% balanced accuracy when discriminating among protein classes. Moreover, comparison with IsItABarrel revealed a high rate of false positives in previous TMBB algorithms. In addition to being more accurate than previous datasets, our database (available online) contains 1,938,936 bacterial TMBB proteins from 38 phyla, respectively, 17 and 2.2 times larger than the previous sets TMBB-DB and OMPdb. We anticipate that due to its quality and size, the database will serve as a useful resource where high-quality TMBB sequence data are required. We found that TMBBs can be divided into 11 types, three of which have not been previously reported. We find tremendous variance in proteome percentage among TMBB-containing organisms with some using 6.79% of their proteome for TMBBs and others using as little as 0.27% of their proteome. The distribution of the lengths of the TMBBs is suggestive of previously hypothesized duplication events. In addition, we find that the C-terminal β-signal varies among different classes of bacteria though its consensus sequence is LGLGYRF. However, this β-signal is only characteristic of prototypical TMBBs. The ten non-prototypical barrel types have other C-terminal motifs, and it remains to be determined if these alternative motifs facilitate TMBB insertion or perform any other signaling function.
Collapse
Affiliation(s)
- Daniel Montezano
- Computational Biology Program, University of Kansas, Lawrence, KS66045
| | - Rebecca Bernstein
- Computational Biology Program, University of Kansas, Lawrence, KS66045
| | | | - Joanna S. G. Slusky
- Computational Biology Program, University of Kansas, Lawrence, KS66045
- Department of Molecular Biosciences, University of Kansas, Lawrence, KS66045
| |
Collapse
|
2
|
Mulnaes D, Schott-Verdugo S, Koenig F, Gohlke H. TopProperty: Robust Metaprediction of Transmembrane and Globular Protein Features Using Deep Neural Networks. J Chem Theory Comput 2021; 17:7281-7289. [PMID: 34663069 DOI: 10.1021/acs.jctc.1c00685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Transmembrane proteins (TMPs) are critical components of cellular life. However, due to experimental challenges, the number of experimentally resolved TMP structures is severely underrepresented in databases compared to their cellular abundance. Prediction of (per-residue) features such as transmembrane topology, membrane exposure, secondary structure, and solvent accessibility can be a useful starting point for experimental design or protein structure prediction but often requires different computational tools for different features or types of proteins. We present TopProperty, a metapredictor that predicts all of these features for TMPs or globular proteins. TopProperty is trained on datasets without bias toward a high number of sequence homologs, and the predictions are significantly better than the evaluated state-of-the-art primary predictors on all quality metrics. TopProperty eliminates the need for protein type- or feature-tailored tools, specifically for TMPs. TopProperty is freely available as a web server and standalone at https://cpclab.uni-duesseldorf.de/topsuite/.
Collapse
Affiliation(s)
- Daniel Mulnaes
- Institut für Pharmazeutische und Medizinische Chemie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf 40225, Germany
| | - Stephan Schott-Verdugo
- John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC), Institute of Biological Information Processing (IBI-7: Structural Bioinformatics), and Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich GmbH, Wilhelm-Johnen-Str., Jülich 52425, Germany
| | - Filip Koenig
- Institut für Pharmazeutische und Medizinische Chemie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf 40225, Germany
| | - Holger Gohlke
- Institut für Pharmazeutische und Medizinische Chemie, Heinrich-Heine-Universität Düsseldorf, Düsseldorf 40225, Germany.,John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC), Institute of Biological Information Processing (IBI-7: Structural Bioinformatics), and Institute of Bio- and Geosciences (IBG-4: Bioinformatics), Forschungszentrum Jülich GmbH, Wilhelm-Johnen-Str., Jülich 52425, Germany
| |
Collapse
|
3
|
Roumia AF, Tsirigos KD, Theodoropoulou MC, Tamposis IA, Hamodrakas SJ, Bagos PG. OMPdb: A Global Hub of Beta-Barrel Outer Membrane Proteins. FRONTIERS IN BIOINFORMATICS 2021; 1:646581. [PMID: 36303794 PMCID: PMC9581022 DOI: 10.3389/fbinf.2021.646581] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2020] [Accepted: 03/18/2021] [Indexed: 11/14/2022] Open
Abstract
OMPdb (www.ompdb.org) was introduced as a database for β-barrel outer membrane proteins from Gram-negative bacteria in 2011 and then included 69,354 entries classified into 85 families. The database has been updated continuously using a collection of characteristic profile Hidden Markov Models able to discriminate between the different families of prokaryotic transmembrane β-barrels. The number of families has increased ultimately to a total of 129 families in the current, second major version of OMPdb. New additions have been made in parallel with efforts to update existing families and add novel families. Here, we present the upgrade of OMPdb, which from now on aims to become a global repository for all transmembrane β-barrel proteins, both eukaryotic and bacterial.
Collapse
Affiliation(s)
- Ahmed F. Roumia
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | | | | | - Ioannis A. Tamposis
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
| | - Stavros J. Hamodrakas
- Section of Cell Biology and Biophysics, Department of Biology, School of Sciences, National and Kapodistrian University of Athens, Athens, Greece
| | - Pantelis G. Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, Greece
- *Correspondence: Pantelis G. Bagos
| |
Collapse
|
4
|
Computational prediction of secreted proteins in gram-negative bacteria. Comput Struct Biotechnol J 2021; 19:1806-1828. [PMID: 33897982 PMCID: PMC8047123 DOI: 10.1016/j.csbj.2021.03.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 03/18/2021] [Accepted: 03/18/2021] [Indexed: 12/29/2022] Open
Abstract
Gram-negative bacteria harness multiple protein secretion systems and secrete a large proportion of the proteome. Proteins can be exported to periplasmic space, integrated into membrane, transported into extracellular milieu, or translocated into cytoplasm of contacting cells. It is important for accurate, genome-wide annotation of the secreted proteins and their secretion pathways. In this review, we systematically classified the secreted proteins according to the types of secretion systems in Gram-negative bacteria, summarized the known features of these proteins, and reviewed the algorithms and tools for their prediction.
Collapse
|
5
|
Madeo G, Savojardo C, Martelli PL, Casadio R. BetAware-Deep: An Accurate Web Server for Discrimination and Topology Prediction of Prokaryotic Transmembrane β-barrel Proteins. J Mol Biol 2020; 433:166729. [PMID: 33972021 DOI: 10.1016/j.jmb.2020.166729] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 11/27/2020] [Accepted: 11/30/2020] [Indexed: 11/25/2022]
Abstract
TransMembrane β-Barrel (TMBB) proteins located in the outer membranes of Gram-negative bacteria are crucial for many important biological processes and primary candidates as drug targets. Structure determination of TMBB proteins is challenging and hence computational methods devised for the analysis of TMBB proteins are important for complementing experimental approaches. Here, we present a novel web server called BetAware-Deep that is able to accurately identify the topology of TMBB proteins (i.e. the number and orientation of membrane-spanning segments along the protein sequence) and to discriminate them from other protein types. The method in BetAware-Deep defines new features by exploiting a non-canonical computation of the hydrophobic moment and by adopting sequence-profile weighting of the White&Wimley hydrophobicity scale. These features are processed using a two-step approach based on deep learning and probabilistic graphical models. BetAware-Deep has been trained on a dataset comprising 58 TMBBs and benchmarked on a novel set of 15 TMBB proteins. Results showed that BetAware-Deep outperforms two recently released state-of-the-art methods for topology prediction, predicting correct topologies of 10 out of 15 proteins. TMBB detection was also assessed on a larger dataset comprising 1009 TMBB proteins and 7571 non-TMBB proteins. Even in this benchmark, BetAware-Deep scored at the level of top-performing methods. A web server has been developed allowing users to analyze input protein sequences and providing topology prediction together with a rich set of information including a graphical representation of the residue-level annotations and prediction probabilities. BetAware-Deep is available at https://busca.biocomp.unibo.it/betaware2.
Collapse
Affiliation(s)
- Giovanni Madeo
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy
| | - Castrense Savojardo
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy
| | - Pier Luigi Martelli
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy.
| | - Rita Casadio
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy; Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), Bari, Italy
| |
Collapse
|
6
|
Roumia AF, Theodoropoulou MC, Tsirigos KD, Nielsen H, Bagos PG. Landscape of Eukaryotic Transmembrane Beta Barrel Proteins. J Proteome Res 2020; 19:1209-1221. [PMID: 32008325 DOI: 10.1021/acs.jproteome.9b00740] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Even though in the last few years several families of eukaryotic β-barrel outer membrane proteins have been discovered, their computational characterization and their annotation in public databases are far from complete. The PFAM database includes only very few characteristic profiles for these families, and in most cases, the profile hidden Markov models (pHMMs) have been trained using prokaryotic and eukaryotic proteins together. Here, we present for the first time a comprehensive computational analysis of eukaryotic transmembrane β-barrels. Twelve characteristic pHMMs were built, based on an extensive literature search, which can discriminate eukaryotic β-barrels from other classes of proteins (globular and bacterial β-barrel ones), as well as between mitochondrial and chloroplastic ones. We built eight novel profiles for the chloroplastic β-barrel families that are not present in the PFAM database and also updated the profile for the MDM10 family (PF12519) in the PFAM database and divide the porin family (PF01459) into two separate families, namely, VDAC and TOM40.
Collapse
Affiliation(s)
- Ahmed F Roumia
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| | | | - Konstantinos D Tsirigos
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark.,Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DK-2800 Kgs Lyngby, Denmark
| | - Henrik Nielsen
- Department of Health Technology, Section for Bioinformatics, Technical University of Denmark, DK-2800 Kgs Lyngby, Denmark
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35100 Lamia, Greece
| |
Collapse
|
7
|
Abstract
[Formula: see text]-Barrel membrane proteins ([Formula: see text]MPs) play important roles, but knowledge of their structures is limited. We have developed a method to predict their 3D structures. We predict strand registers and construct transmembrane (TM) domains of [Formula: see text]MPs accurately, including proteins for which no prediction has been attempted before. Our method also accurately predicts structures from protein families with a limited number of sequences and proteins with novel folds. An average main-chain rmsd of 3.48 Å is achieved between predicted and experimentally resolved structures of TM domains, which is a significant improvement ([Formula: see text]3 Å) over a recent study. For [Formula: see text]MPs with NMR structures, the deviation between predictions and experimentally solved structures is similar to the difference among the NMR structures, indicating excellent prediction accuracy. Moreover, we can now accurately model the extended [Formula: see text]-barrels and loops in non-TM domains, increasing the overall coverage of structure prediction by [Formula: see text]%. Our method is general and can be applied to genome-wide structural prediction of [Formula: see text]MPs.
Collapse
|
8
|
Abstract
Surface-exposed proteins of Gram-negative bacteria are represented by integral outer membrane beta-barrel proteins and lipoproteins. No computational methods exist for predicting surface-exposed lipoproteins, and therefore lipoprotein topology must be experimentally tested. This chapter describes three distinct but complementary methods for the detection of surface-exposed proteins: cell surface protein labeling, accessibility to extracellular protease and antibodies.
Collapse
|
9
|
Bersimis S, Sachlas A, Bagos PG. Discriminating membrane proteins using the joint distribution of length sums of success and failure runs. STAT METHOD APPL-GER 2017. [DOI: 10.1007/s10260-016-0370-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
10
|
Venko K, Roy Choudhury A, Novič M. Computational Approaches for Revealing the Structure of Membrane Transporters: Case Study on Bilitranslocase. Comput Struct Biotechnol J 2017; 15:232-242. [PMID: 28228927 PMCID: PMC5312651 DOI: 10.1016/j.csbj.2017.01.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Revised: 01/19/2017] [Accepted: 01/20/2017] [Indexed: 11/23/2022] Open
Abstract
The structural and functional details of transmembrane proteins are vastly underexplored, mostly due to experimental difficulties regarding their solubility and stability. Currently, the majority of transmembrane protein structures are still unknown and this present a huge experimental and computational challenge. Nowadays, thanks to X-ray crystallography or NMR spectroscopy over 3000 structures of membrane proteins have been solved, among them only a few hundred unique ones. Due to the vast biological and pharmaceutical interest in the elucidation of the structure and the functional mechanisms of transmembrane proteins, several computational methods have been developed to overcome the experimental gap. If combined with experimental data the computational information enables rapid, low cost and successful predictions of the molecular structure of unsolved proteins. The reliability of the predictions depends on the availability and accuracy of experimental data associated with structural information. In this review, the following methods are proposed for in silico structure elucidation: sequence-dependent predictions of transmembrane regions, predictions of transmembrane helix–helix interactions, helix arrangements in membrane models, and testing their stability with molecular dynamics simulations. We also demonstrate the usage of the computational methods listed above by proposing a model for the molecular structure of the transmembrane protein bilitranslocase. Bilitranslocase is bilirubin membrane transporter, which shares similar tissue distribution and functional properties with some of the members of the Organic Anion Transporter family and is the only member classified in the Bilirubin Transporter Family. Regarding its unique properties, bilitranslocase is a potentially interesting drug target.
Collapse
Affiliation(s)
- Katja Venko
- Department of Cheminformatics, National Institute of Chemistry, Ljubljana, Slovenia
| | - A Roy Choudhury
- Department of Cheminformatics, National Institute of Chemistry, Ljubljana, Slovenia
| | - Marjana Novič
- Department of Cheminformatics, National Institute of Chemistry, Ljubljana, Slovenia
| |
Collapse
|
11
|
Abstract
Transmembrane beta-barrels (TMBBs) constitute an important structural class of membrane proteins located in the outer membrane of gram-negative bacteria, and in the outer membrane of chloroplasts and mitochondria. They are involved in a wide variety of cellular functions and the prediction of their transmembrane topology, as well as their discrimination in newly sequenced genomes is of great importance as they are promising targets for antimicrobial drugs and vaccines. Several methods have been applied for the prediction of the transmembrane segments and the topology of beta barrel transmembrane proteins utilizing different algorithmic techniques. Hidden Markov Models (HMMs) have been efficiently used in the development of several computational methods used for this task. In this chapter we give a brief review of different available prediction methods for beta barrel transmembrane proteins pointing out sequence and structural features that should be incorporated in a prediction method. We then describe the procedure of the design and development of a Hidden Markov Model capable of predicting the transmembrane beta strands of TMBBs and discriminating them from globular proteins.
Collapse
Affiliation(s)
- Georgios N Tsaousis
- Department of Cell Biology and Biophysics, Faculty of Biology, National and Kapodistrian University of Athens, Panepistimiopolis, Athens, 15701, Greece
| | - Stavros J Hamodrakas
- Department of Cell Biology and Biophysics, Faculty of Biology, National and Kapodistrian University of Athens, Panepistimiopolis, Athens, 15701, Greece
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, Papasiopoulou 2-4, Lamia, 35100, Greece.
| |
Collapse
|
12
|
Tsirigos KD, Elofsson A, Bagos PG. PRED-TMBB2: improved topology prediction and detection of beta-barrel outer membrane proteins. Bioinformatics 2016; 32:i665-i671. [DOI: 10.1093/bioinformatics/btw444] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
|
13
|
Zhang L, Wang H, Yan L, Su L, Xu D. OMPcontact: An Outer Membrane Protein Inter-Barrel Residue Contact Prediction Method. J Comput Biol 2016; 24:217-228. [PMID: 27513917 DOI: 10.1089/cmb.2015.0236] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In the two transmembrane protein types, outer membrane proteins (OMPs) perform diverse important biochemical functions, including substrate transport and passive nutrient uptake and intake. Hence their 3D structures are expected to reveal these functions. Because experimental structures are scarce, predicted 3D structures are more adapted to OMP research instead, and the inter-barrel residue contact is becoming one of the most remarkable features, improving prediction accuracy by describing the structural information of OMPs. To predict OMP structures accurately, we explored an OMP inter-barrel residue contact prediction method: OMPcontact. Multiple OMP-specific features were integrated in the method, including residue evolutionary covariation, topology-based transmembrane segment relative residue position, OMP lipid layer accessibility, and residue evolution conservation. These features describe the properties of a residue pair in different respects: sequential, structural, evolutionary, and biochemical. Within a 3-residues slide window, a Support Vector Machine (SVM) could accurately determinate the inter-barrel contact residue pair using above features. A 5-fold cross-valuation process was applied in testing the OMPcontact performance against a non-redundant OMP set with 75 samples inside. The tests compared four evolutionary covariation methods and screen analyzed the adaptive ones for inter-barrel contact prediction. The results showed our method not only efficiently realized the prediction, but also scored the possibility for residue pairs reliably. This is expected to improve OMP tertiary structure prediction. Therefore, OMPcontact will be helpful in compiling a structural census of outer membrane protein.
Collapse
Affiliation(s)
- Li Zhang
- 1 School of Computer Science and Technology, Jilin University , Changchun, China .,4 School of Computer Science and Engineering, Changchun University of Technology , Changchun, China
| | - Han Wang
- 2 School of Computer Science and Information Technology, Northeast Normal University , Changchun, China
| | - Lun Yan
- 1 School of Computer Science and Technology, Jilin University , Changchun, China
| | - Lingtao Su
- 1 School of Computer Science and Technology, Jilin University , Changchun, China
| | - Dong Xu
- 3 Department of Computer Science, Christopher S. Bond Life Sciences Center, University of Missouri , Columbia, Missouri, U.S.A
| |
Collapse
|
14
|
Hayat S, Peters C, Shu N, Tsirigos KD, Elofsson A. Inclusion of dyad-repeat pattern improves topology prediction of transmembrane β-barrel proteins. Bioinformatics 2016; 32:1571-3. [PMID: 26794316 DOI: 10.1093/bioinformatics/btw025] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Accepted: 01/14/2016] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED : Accurate topology prediction of transmembrane β-barrels is still an open question. Here, we present BOCTOPUS2, an improved topology prediction method for transmembrane β-barrels that can also identify the barrel domain, predict the topology and identify the orientation of residues in transmembrane β-strands. The major novelty of BOCTOPUS2 is the use of the dyad-repeat pattern of lipid and pore facing residues observed in transmembrane β-barrels. In a cross-validation test on a benchmark set of 42 proteins, BOCTOPUS2 predicts the correct topology in 69% of the proteins, an improvement of more than 10% over the best earlier method (BOCTOPUS) and in addition, it produces significantly fewer erroneous predictions on non-transmembrane β-barrel proteins. AVAILABILITY AND IMPLEMENTATION BOCTOPUS2 webserver along with full dataset and source code is available at http://boctopus.bioinfo.se/ CONTACT : arne@bioinfo.se SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sikander Hayat
- Memorial Sloan Kettering Cancer Center, New York City, NY, USA
| | - Christoph Peters
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| | - Nanjiang Shu
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and Sweden Bioinformatics Infrastructure for Life Sciences (BILS), Stockholm University, Sweden
| | - Konstantinos D Tsirigos
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| | - Arne Elofsson
- Stockholm Bioinformatics Center, SciLifeLab, Swedish E-Science Research Center, Stockholm University, Stockholm, SE, 10691, Sweden and
| |
Collapse
|
15
|
All-atom 3D structure prediction of transmembrane β-barrel proteins from sequences. Proc Natl Acad Sci U S A 2015; 112:5413-8. [PMID: 25858953 DOI: 10.1073/pnas.1419956112] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Transmembrane β-barrels (TMBs) carry out major functions in substrate transport and protein biogenesis but experimental determination of their 3D structure is challenging. Encouraged by successful de novo 3D structure prediction of globular and α-helical membrane proteins from sequence alignments alone, we developed an approach to predict the 3D structure of TMBs. The approach combines the maximum-entropy evolutionary coupling method for predicting residue contacts (EVfold) with a machine-learning approach (boctopus2) for predicting β-strands in the barrel. In a blinded test for 19 TMB proteins of known structure that have a sufficient number of diverse homologous sequences available, this combined method (EVfold_bb) predicts hydrogen-bonded residue pairs between adjacent β-strands at an accuracy of ∼70%. This accuracy is sufficient for the generation of all-atom 3D models. In the transmembrane barrel region, the average 3D structure accuracy [template-modeling (TM) score] of top-ranked models is 0.54 (ranging from 0.36 to 0.85), with a higher (44%) number of residue pairs in correct strand-strand registration than in earlier methods (18%). Although the nonbarrel regions are predicted less accurately overall, the evolutionary couplings identify some highly constrained loop residues and, for FecA protein, the barrel including the structure of a plug domain can be accurately modeled (TM score = 0.68). Lower prediction accuracy tends to be associated with insufficient sequence information and we therefore expect increasing numbers of β-barrel families to become accessible to accurate 3D structure prediction as the number of available sequences increases.
Collapse
|
16
|
Yeats C, Dessailly BH, Glass EM, Fremont DH, Orengo CA. Target selection for structural genomics of infectious diseases. Methods Mol Biol 2014; 1140:35-51. [PMID: 24590707 DOI: 10.1007/978-1-4939-0354-2_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Abstract
This chapter describes the protocols used to identify, filter, and annotate potential protein targets from an organism associated with infectious diseases. Protocols often combine computational approaches for mining information in public databases or for checking whether the protein has already been targeted for structure determination, with manual strategies that examine the literature for information on the biological role of the protein or the experimental strategies that explore the effects of knocking out the protein. Publicly available computational tools have been cited as much as possible. Where these do not exist, the concepts underlying in-house tools developed for the Center for Structural Genomics of Infectious Diseases have been described.
Collapse
Affiliation(s)
- Corin Yeats
- Dept. of Structural and Molecular Biology, University College London, Gower Street, WC1E 6BT, London, UK
| | | | | | | | | |
Collapse
|
17
|
Kyrychenko A, Freites JA, He J, Tobias DJ, Wimley WC, Ladokhin AS. Structural plasticity in the topology of the membrane-interacting domain of HIV-1 gp41. Biophys J 2014; 106:610-20. [PMID: 24507601 DOI: 10.1016/j.bpj.2013.12.032] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2013] [Revised: 12/17/2013] [Accepted: 12/23/2013] [Indexed: 11/27/2022] Open
Abstract
We use a number of computational and experimental approaches to investigate the membrane topology of the membrane-interacting C-terminal domain of the HIV-1 gp41 fusion protein. Several putative transmembrane regions are identified using hydrophobicity analysis based on the Wimley-White scales, including the membrane-proximal external region (MPER). The MPER region is an important target for neutralizing anti-HIV monoclonal antibodies and is believed to have an interfacial topology in the membrane. To assess the possibility of a transmembrane topology of MPER, we examined the membrane interactions of a peptide corresponding to a 22-residue stretch of the MPER sequence (residues 662-683) using fluorescence spectroscopy and oriented circular dichroism. In addition to the previously reported interfacial location, we identify a stable transmembrane conformation of the peptide in synthetic lipid bilayers. All-atom molecular dynamics simulations of the MPER-derived peptide in a lipid bilayer demonstrate a stable helical structure with an average tilt of 24 degrees, with the five tryptophan residues sampling different environments inside the hydrocarbon core of the lipid bilayer, consistent with the observed spectral properties of intrinsic fluorescence. The degree of lipid bilayer penetration obtained by computer simulation was verified using depth-dependent fluorescence quenching of a selectively attached fluorescence probe. Overall, our data indicate that the MPER sequence can have at least two stable conformations in the lipid bilayer, interfacial and transmembrane, and suggest a possibility that external perturbations can switch the topology during physiological functioning.
Collapse
Affiliation(s)
- Alexander Kyrychenko
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas
| | - J Alfredo Freites
- Department of Chemistry, University of California, Irvine, California
| | - Jing He
- Department of Biochemistry, Tulane University School of Medicine, New Orleans, Louisiana
| | - Douglas J Tobias
- Department of Chemistry, University of California, Irvine, California
| | - William C Wimley
- Department of Biochemistry, Tulane University School of Medicine, New Orleans, Louisiana
| | - Alexey S Ladokhin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas.
| |
Collapse
|
18
|
Hausrath AC. Model for coupled insertion and folding of membrane-spanning proteins. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2014; 90:022707. [PMID: 25215758 DOI: 10.1103/physreve.90.022707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2014] [Indexed: 06/03/2023]
Abstract
Current understanding of the forces directing the folding of integral membrane proteins is very limited compared to the detailed picture available for water-soluble proteins. While mechanistic studies of the folding process in vitro have been conducted for only a small number of membrane proteins, the available evidence indicates that their folding process is thermodynamically driven like that of soluble proteins. In vivo, however, the majority of integral membrane proteins are installed in membranes by dedicated machinery, suggesting that the cellular systems may act to facilitate and regulate the spontaneous physical process of folding. Both the in vitro folding process and the in vivo pathway must navigate an energy landscape dominated by the energetically favorable burial of hydrophobic segments in the membrane interior and the opposition to folding due to the need for passage of polar segments across the membrane. This manuscript describes a simple, exactly solvable model which incorporates these essential features of membrane protein folding. The model is used to compare the folding time under conditions which depict both the in vitro and in vivo pathways. It is proposed that the cellular complexes responsible for insertion of membrane proteins act by lowering the energy barrier for passage of polar regions through the membrane, thereby allowing the chain to more rapidly achieve the folded state.
Collapse
Affiliation(s)
- Andrew C Hausrath
- Department of Chemistry and Biochemistry and Program in Applied Mathematics, University of Arizona, Tucson, Arizona 85721, USA
| |
Collapse
|
19
|
Hayat S, Elofsson A. Ranking models of transmembrane β-barrel proteins using Z-coordinate predictions. Bioinformatics 2013; 28:i90-6. [PMID: 22689784 PMCID: PMC3371865 DOI: 10.1093/bioinformatics/bts233] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Motivation: Transmembrane β-barrels exist in the outer membrane of gram-negative bacteria as well as in chloroplast and mitochondria. They are often involved in transport processes and are promising antimicrobial drug targets. Structures of only a few β-barrel protein families are known. Therefore, a method that could automatically generate such models would be valuable. The symmetrical arrangement of the barrels suggests that an approach based on idealized geometries may be successful. Results: Here, we present tobmodel; a method for generating 3D models of β-barrel transmembrane proteins. First, alternative topologies are obtained from the BOCTOPUS topology predictor. Thereafter, several 3D models are constructed by using different angles of the β-sheets. Finally, the best model is selected based on agreement with a novel predictor, ZPRED3, which predicts the distance from the center of the membrane for each residue, i.e. the Z-coordinate. The Z-coordinate prediction has an average error of 1.61 Å. Tobmodel predicts the correct topology for 75% of the proteins in the dataset which is a slight improvement over BOCTOPUS alone. More importantly, however, tobmodel provides a Cα template with an average RMSD of 7.24 Å from the native structure. Availability: Tobmodel is freely available as a web server at: http://tobmodel.cbr.su.se/. The datasets used for training and evaluations are also available from this site. Contact:arne@bioinfo.se
Collapse
Affiliation(s)
- Sikander Hayat
- Center for Biomembrane Research, Department of Biochemistry and Biophysics, Stockholm Bioinformatics Center, Science for Life Laboratory, Swedish E-science Research Center, Stockholm University, SE-10691 Stockholm, Sweden
| | | |
Collapse
|
20
|
BETAWARE: a machine-learning tool to detect and predict transmembrane beta-barrel proteins in prokaryotes. Bioinformatics 2013; 29:504-5. [DOI: 10.1093/bioinformatics/bts728] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
21
|
Tran VDT, Chassignet P, Steyaert JM. Supersecondary structure prediction of transmembrane beta-barrel proteins. Methods Mol Biol 2013; 932:277-294. [PMID: 22987359 DOI: 10.1007/978-1-62703-065-6_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
We introduce a graph-theoretic model for predicting the supersecondary structure of transmembrane β-barrel proteins--a particular class of proteins that performs diverse important functions but it is difficult to determine their structure with experimental methods. This ab initio model resolves the protein folding problem based on pseudo-energy minimization with the aid of a simple probabilistic filter. It also allows for determining structures whose barrel follows a given permutation on the arrangement of β-strands, and allows for rapidly discriminating the transmembrane β-barrels from other kinds of proteins. The model is fairly accurate, robust and can be run very efficiently on PC-like computers, thus proving useful for genome screening.
Collapse
Affiliation(s)
- Van Du T Tran
- Laboratory of Computer Science, Ecole Polytechnique, Palaiseau Cedex, France.
| | | | | |
Collapse
|
22
|
Nugent T, Jones DT. Membrane protein structural bioinformatics. J Struct Biol 2012; 179:327-37. [DOI: 10.1016/j.jsb.2011.10.008] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Accepted: 10/25/2011] [Indexed: 10/15/2022]
|
23
|
Abstract
MOTIVATION We previously reported the development of a highly accurate statistical algorithm for identifying β-barrel outer membrane proteins or transmembrane β-barrels (TMBBs), from genomic sequence data of Gram-negative bacteria (Freeman,T.C. and Wimley,W.C. (2010) Bioinformatics, 26, 1965-1974). We have now applied this identification algorithm to all available Gram-negative bacterial genomes (over 600 chromosomes) and have constructed a publicly available, searchable, up-to-date, database of all proteins in these genomes. RESULTS For each protein in the database, there is information on (i) β-barrel membrane protein probability for identification of β-barrels, (ii) β-strand and β-hairpin propensity for structure and topology prediction, (iii) signal sequence score because most TMBBs are secreted through the inner membrane translocon and, thus, have a signal sequence, and (iv) transmembrane α-helix predictions, for reducing false positive predictions. This information is sufficient for the accurate identification of most β-barrel membrane proteins in these genomes. In the database there are nearly 50 000 predicted TMBBs (out of 1.9 million total putative proteins). Of those, more than 15 000 are 'hypothetical' or 'putative' proteins, not previously identified as TMBBs. This wealth of genomic information is not available anywhere else. AVAILABILITY The TMBB genomic database is available at http://beta-barrel.tulane.edu/. CONTACT wwimley@tulane.edu.
Collapse
Affiliation(s)
- Thomas C Freeman
- Department of Biochemistry, Tulane University, New Orleans, LA 70112, USA
| | | |
Collapse
|
24
|
E-komon T, Burchmore R, Herzyk P, Davies R. Predicting the outer membrane proteome of Pasteurella multocida based on consensus prediction enhanced by results integration and manual confirmation. BMC Bioinformatics 2012; 13:63. [PMID: 22540951 PMCID: PMC3403877 DOI: 10.1186/1471-2105-13-63] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 04/27/2012] [Indexed: 01/26/2023] Open
Abstract
Background Outer membrane proteins (OMPs) of Pasteurella multocida have various functions related to virulence and pathogenesis and represent important targets for vaccine development. Various bioinformatic algorithms can predict outer membrane localization and discriminate OMPs by structure or function. The designation of a confident prediction framework by integrating different predictors followed by consensus prediction, results integration and manual confirmation will improve the prediction of the outer membrane proteome. Results In the present study, we used 10 different predictors classified into three groups (subcellular localization, transmembrane β-barrel protein and lipoprotein predictors) to identify putative OMPs from two available P. multocida genomes: those of avian strain Pm70 and porcine non-toxigenic strain 3480. Predicted proteins in each group were filtered by optimized criteria for consensus prediction: at least two positive predictions for the subcellular localization predictors, three for the transmembrane β-barrel protein predictors and one for the lipoprotein predictors. The consensus predicted proteins were integrated from each group into a single list of proteins. We further incorporated a manual confirmation step including a public database search against PubMed and sequence analyses, e.g. sequence and structural homology, conserved motifs/domains, functional prediction, and protein-protein interactions to enhance the confidence of prediction. As a result, we were able to confidently predict 98 putative OMPs from the avian strain genome and 107 OMPs from the porcine strain genome with 83% overlap between the two genomes. Conclusions The bioinformatic framework developed in this study has increased the number of putative OMPs identified in P. multocida and allowed these OMPs to be identified with a higher degree of confidence. Our approach can be applied to investigate the outer membrane proteomes of other Gram-negative bacteria.
Collapse
Affiliation(s)
- Teerasak E-komon
- Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Sir Graeme Davies Building, Glasgow G12 8QQ, UK
| | | | | | | |
Collapse
|
25
|
Tran VDT, Chassignet P, Sheikh S, Steyaert JM. A graph-theoretic approach for classification and structure prediction of transmembrane β-barrel proteins. BMC Genomics 2012; 13 Suppl 2:S5. [PMID: 22537300 PMCID: PMC3394416 DOI: 10.1186/1471-2164-13-s2-s5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Background Transmembrane β-barrel proteins are a special class of transmembrane proteins which play several key roles in human body and diseases. Due to experimental difficulties, the number of transmembrane β-barrel proteins with known structures is very small. Over the years, a number of learning-based methods have been introduced for recognition and structure prediction of transmembrane β-barrel proteins. Most of these methods emphasize on homology search rather than any biological or chemical basis. Results We present a novel graph-theoretic model for classification and structure prediction of transmembrane β-barrel proteins. This model folds proteins based on energy minimization rather than a homology search, avoiding any assumption on availability of training dataset. The ab initio model presented in this paper is the first method to allow for permutations in the structure of transmembrane proteins and provides more structural information than any known algorithm. The model is also able to recognize β-barrels by assessing the pseudo free energy. We assess the structure prediction on 41 proteins gathered from existing databases on experimentally validated transmembrane β-barrel proteins. We show that our approach is quite accurate with over 90% F-score on strands and over 74% F-score on residues. The results are comparable to other algorithms suggesting that our pseudo-energy model is close to the actual physical model. We test our classification approach and show that it is able to reject α-helical bundles with 100% accuracy and β-barrel lipocalins with 97% accuracy. Conclusions We show that it is possible to design models for classification and structure prediction for transmembrane β-barrel proteins which do not depend essentially on training sets but on combinatorial properties of the structures to be proved. These models are fairly accurate, robust and can be run very efficiently on PC-like computers. Such models are useful for the genome screening.
Collapse
Affiliation(s)
- Van Du T Tran
- INRIA AMIB Team, Laboratory of Computer Science (LIX), Ecole Polytechnique, 91128, Palaiseau CEDEX, France.
| | | | | | | |
Collapse
|
26
|
Hayat S, Elofsson A. BOCTOPUS: improved topology prediction of transmembrane β barrel proteins. Bioinformatics 2012; 28:516-22. [DOI: 10.1093/bioinformatics/btr710] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
27
|
Bishop CM, Wimley WC. Structural plasticity in self-assembling transmembrane β-sheets. Biophys J 2011; 101:828-36. [PMID: 21843473 DOI: 10.1016/j.bpj.2011.06.059] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2011] [Revised: 06/23/2011] [Accepted: 06/28/2011] [Indexed: 10/17/2022] Open
Abstract
Here we test the hypothesis that membrane-spanning β-sheets can exhibit structural plasticity in membranes due to their ability to shift hydrogen-bonding patterns. Transmembrane β-sheet forming peptides of the sequence AcWL(n), where n = 5, 6, or 7, which range from 21 to 27 Å in maximum length, were incorporated into bilayers made of phosphatidylcholine lipids with saturated acyl chains containing 14, 16, or 18 carbons, which are 36-50 Å in thickness. The effect of the peptide β-sheets on fluid- and gel-phase bilayers were studied with differential scanning calorimetry and circular dichroism spectroscopy. We show that AcWL₅ forms a stable, peptide-rich gel phase in all three lipids. The whole family of AcWL(n) peptides appears to form similarly stable, nonmembrane-disrupting β-sheets in all bilayer phases and thicknesses. Bilayers containing up to 20 mol % peptide, which is the maximum concentration tested, formed gel phases with melting temperatures that were equal to, or slightly higher than, the pure lipid transitions. Given the range of peptide lengths and bilayer thicknesses tested, these experiments show that the AcWL(n) family of membrane-inserted β-sheets exhibit remarkable structural plasticity in membranes.
Collapse
Affiliation(s)
- Christopher M Bishop
- Department of Biochemistry, Tulane University School of Medicine, New Orleans, Louisiana, USA
| | | |
Collapse
|
28
|
Computational studies of membrane proteins: models and predictions for biological understanding. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2011; 1818:927-41. [PMID: 22051023 DOI: 10.1016/j.bbamem.2011.09.026] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Revised: 09/22/2011] [Accepted: 09/26/2011] [Indexed: 01/26/2023]
Abstract
We discuss recent progresses in computational studies of membrane proteins based on physical models with parameters derived from bioinformatics analysis. We describe computational identification of membrane proteins and prediction of their topology from sequence, discovery of sequence and spatial motifs, and implications of these discoveries. The detection of evolutionary signal for understanding the substitution pattern of residues in the TM segments and for sequence alignment is also discussed. We further discuss empirical potential functions for energetics of inserting residues in the TM domain, for interactions between TM helices or strands, and their applications in predicting lipid-facing surfaces of the TM domain. Recent progresses in structure predictions of membrane proteins are also reviewed, with further discussions on calculation of ensemble properties such as melting temperature based on simplified state space model. Additional topics include prediction of oligomerization state of membrane proteins, identification of the interfaces for protein-protein interactions, and design of membrane proteins. This article is part of a Special Issue entitled: Protein Folding in Membranes.
Collapse
|
29
|
Savojardo C, Fariselli P, Casadio R. Improving the detection of transmembrane -barrel chains with N-to-1 extreme learning machines. Bioinformatics 2011; 27:3123-8. [DOI: 10.1093/bioinformatics/btr549] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
30
|
TMBHMM: A frequency profile based HMM for predicting the topology of transmembrane beta barrel proteins and the exposure status of transmembrane residues. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2011; 1814:664-70. [DOI: 10.1016/j.bbapap.2011.03.004] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Revised: 02/16/2011] [Accepted: 03/07/2011] [Indexed: 10/18/2022]
|
31
|
Hayat S, Park Y, Helms V. Statistical analysis and exposure status classification of transmembrane beta barrel residues. Comput Biol Chem 2011; 35:96-107. [PMID: 21531175 DOI: 10.1016/j.compbiolchem.2011.03.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2010] [Revised: 03/01/2011] [Accepted: 03/01/2011] [Indexed: 12/28/2022]
Abstract
Several computational methods exist for the identification of transmembrane beta barrel proteins (TMBs) from sequence. Some of these methods also provide the transmembrane (TM) boundaries of the putative TMBs. The aim of this study is to (1) derive the propensities of the TM residues to be exposed to the lipid bilayer and (2) to predict the exposure status (i.e. exposed to the bilayer or hidden in protein structure) of TMB residues. Three novel propensity scales namely, BTMC, BTMI and HTMI were derived for the TMB residues at the hydrophobic core region of the outer membrane (OM), the lipid-water interface regions of the OM, and for the helical membrane proteins (HMPs) residues at the lipid-water interface regions of the inner membrane (IM), respectively. Separate propensity scales were derived for monomeric and functionally oligomeric TMBs. The derived propensities reflect differing physico-chemical properties of the respective membrane bilayer regions and were employed in a computational method for the prediction of the exposure status of TMB residues. Based on the these propensities, the conservation indices and the frequency profile of the residues, the transmembrane residues were classified into buried/exposed with an accuracy of 77.91% and 80.42% for the residues at the membrane core and the interface regions, respectively. The correlation of the derived scales with different physico-chemical properties obtained from the AAIndex database are also discussed. Knowledge about the residue propensities and burial status will be useful in annotating putative TMBs with unknown structure.
Collapse
Affiliation(s)
- Sikander Hayat
- Center for Bioinformatics, Saarland University, Saarbruecken, Germany.
| | | | | |
Collapse
|
32
|
Outer membrane proteins can be simply identified using secondary structure element alignment. BMC Bioinformatics 2011; 12:76. [PMID: 21414186 PMCID: PMC3072342 DOI: 10.1186/1471-2105-12-76] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2010] [Accepted: 03/17/2011] [Indexed: 02/04/2023] Open
Abstract
Background Outer membrane proteins (OMPs) are frequently found in the outer membranes of gram-negative bacteria, mitochondria and chloroplasts and have been found to play diverse functional roles. Computational discrimination of OMPs from globular proteins and other types of membrane proteins is helpful to accelerate new genome annotation and drug discovery. Results Based on the observation that almost all OMPs consist of antiparallel β-strands in a barrel shape and that their secondary structure arrangements differ from those of other types of proteins, we propose a simple method called SSEA-OMP to identify OMPs using secondary structure element alignment. Through intensive benchmark experiments, the proposed SSEA-OMP method is better than some well-established OMP detection methods. Conclusions The major advantage of SSEA-OMP is its good prediction performance considering its simplicity. The web server implements the method is freely accessible at http://protein.cau.edu.cn/SSEA-OMP/index.html.
Collapse
|
33
|
Hadi-Alijanvand H, Rouhani M, Proctor EA, Dokholyan NV, Moosavi-Movahedi AA. A folding pathway-dependent score to recognize membrane proteins. PLoS One 2011; 6:e16778. [PMID: 21390303 PMCID: PMC3046963 DOI: 10.1371/journal.pone.0016778] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2010] [Accepted: 12/29/2010] [Indexed: 12/11/2022] Open
Abstract
While various approaches exist to study protein localization, it is still a challenge to predict where proteins localize. Here, we consider a mechanistic viewpoint for membrane localization. Taking into account the steps for the folding pathway of α-helical membrane proteins and relating biophysical parameters to each of these steps, we create a score capable of predicting the propensity for membrane localization and call it FP(3)mem. This score is driven from the principal component analysis (PCA) of the biophysical parameters related to membrane localization. FP(3)mem allows us to rationalize the colocalization of a number of channel proteins with the Cav1.2 channel by their fewer propensities for membrane localization.
Collapse
Affiliation(s)
| | - Maryam Rouhani
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Elizabeth A. Proctor
- Genetics Medicine, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Nikolay V. Dokholyan
- Genetics Medicine, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | | |
Collapse
|
34
|
Zhang H, Zhang T, Chen K, Kedarisetti KD, Mizianty MJ, Bao Q, Stach W, Kurgan L. Critical assessment of high-throughput standalone methods for secondary structure prediction. Brief Bioinform 2011; 12:672-88. [PMID: 21252072 DOI: 10.1093/bib/bbq088] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Sequence-based prediction of protein secondary structure (SS) enjoys wide-spread and increasing use for the analysis and prediction of numerous structural and functional characteristics of proteins. The lack of a recent comprehensive and large-scale comparison of the numerous prediction methods results in an often arbitrary selection of a SS predictor. To address this void, we compare and analyze 12 popular, standalone and high-throughput predictors on a large set of 1975 proteins to provide in-depth, novel and practical insights. We show that there is no universally best predictor and thus detailed comparative studies are needed to support informed selection of SS predictors for a given application. Our study shows that the three-state accuracy (Q3) and segment overlap (SOV3) of the SS prediction currently reach 82% and 81%, respectively. We demonstrate that carefully designed consensus-based predictors improve the Q3 by additional 2% and that homology modeling-based methods are significantly better by 1.5% Q3 than ab initio approaches. Our empirical analysis reveals that solvent exposed and flexible coils are predicted with a higher quality than the buried and rigid coils, while inverse is true for the strands and helices. We also show that longer helices are easier to predict, which is in contrast to longer strands that are harder to find. The current methods confuse 1-6% of strand residues with helical residues and vice versa and they perform poorly for residues in the β- bridge and 3(10)-helix conformations. Finally, we compare predictions of the standalone implementations of four well-performing methods with their corresponding web servers.
Collapse
Affiliation(s)
- Hua Zhang
- Zhejiang Gongshang University, Hangzhou, Zhejiang, P.R. China
| | | | | | | | | | | | | | | |
Collapse
|
35
|
In silico methods for identifying organellar and suborganellar targeting peptides in Arabidopsis chloroplast proteins and for predicting the topology of membrane proteins. Methods Mol Biol 2011; 774:243-80. [PMID: 21822844 DOI: 10.1007/978-1-61779-234-2_16] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Numerous experimental and in silico approaches have been developed for attempting to identify the -subcellular localisation of proteins. Approximately 2,000-4,000 proteins are thought to be targeted to plastids in plants, but a complete and unambiguous catalogue has yet to be drawn up. This article reviews the various prediction methods that identify plastid targeting sequences, and those that can help estimate location and topology within the plastid or plastid membranes. The most successful approaches are described in detail, with detailed notes to help avoid common pitfalls and advice on interpreting conflicting or ambiguous results. In most cases, it is best to try multiple approaches, and we also cover the powerful new integrated databases that provide a selected blend of experimental data and predictions.
Collapse
|
36
|
Abstract
Here, we review the current knowledge about the energetics of arginine insertion into the bilayer hydrocarbon core, and we discuss discrepancies between experimental and computational studies of the insertion process. While simulations suggest that it should be very costly to place arginine into the hydrocarbon core, experiments show that arginine is found there. Both types of studies suggest that arginine insertion into the bilayer involves substantial bilayer deformation, with multiple hydrogen bonds between the arginine guanidinium group and lipid polar groups. It is possible that the discrepancies concerning the insertion cost of arginine arise because simulations overestimate the cost associated with bilayer deformation and underestimate the ability of the bilayer to adapt to charged and polar groups. This is an active area of research, and there is no doubt that a consensus view of arginine in membranes will soon emerge.
Collapse
|
37
|
Tsirigos KD, Bagos PG, Hamodrakas SJ. OMPdb: a database of {beta}-barrel outer membrane proteins from Gram-negative bacteria. Nucleic Acids Res 2010; 39:D324-31. [PMID: 20952406 PMCID: PMC3013764 DOI: 10.1093/nar/gkq863] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We describe here OMPdb, which is currently the most complete and comprehensive collection of integral β-barrel outer membrane proteins from Gram-negative bacteria. The database currently contains 69,354 proteins, which are classified into 85 families, based mainly on structural and functional criteria. Although OMPdb follows the annotation scheme of Pfam, many of the families included in the database were not previously described or annotated in other publicly available databases. There are also cross-references to other databases, references to the literature and annotation for sequence features, like transmembrane segments and signal peptides. Furthermore, via the web interface, the user can not only browse the available data, but submit advanced text searches and run BLAST queries against the database protein sequences or domain searches against the collection of profile Hidden Markov Models that represent each family's domain organization as well. The database is freely accessible for academic users at http://bioinformatics.biol.uoa.gr/OMPdb and we expect it to be useful for genome-wide analyses, comparative genomics as well as for providing training and test sets for predictive algorithms regarding transmembrane β-barrels.
Collapse
Affiliation(s)
- Konstantinos D Tsirigos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens 15701, Greece
| | | | | |
Collapse
|
38
|
Freeman TC, Landry SJ, Wimley WC. The prediction and characterization of YshA, an unknown outer-membrane protein from Salmonella typhimurium. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2010; 1808:287-97. [PMID: 20863811 DOI: 10.1016/j.bbamem.2010.09.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Received: 06/12/2010] [Revised: 09/13/2010] [Accepted: 09/15/2010] [Indexed: 11/27/2022]
Abstract
We have developed an effective pathway for the prediction and characterization of novel transmembrane β-barrel proteins. The Freeman-Wimley algorithm, which is a highly accurate prediction method based on the physicochemical properties of experimentally characterized transmembrane β barrel (TMBB) structures, was used to predict TMBBs in the genome of Salmonella typhimurium LT2. The previously uncharacterized product of gene yshA was tested as a model for validating the algorithm. YshA is a highly conserved 230-residue protein that is predicted to have 10 transmembrane β-strands and an N-terminal signal sequence. All of the physicochemical and spectroscopic properties exhibited by YshA are consistent with the prediction that it is a TMBB. Specifically, recombinant YshA localizes to the outer membrane when expressed in Escherichia coli; YshA has a β-sheet-rich secondary structure with stable tertiary contacts in the presence of detergent micelles or when reconstituted into a lipid bilayer. When in a lipid bilayer, YshA forms a membrane-spanning pore with an effective radius of ~0.7nm. Taken together, these data substantiate the predictions made by the Freeman-Wimley algorithm by showing that YshA is a TMBB protein.
Collapse
Affiliation(s)
- Thomas C Freeman
- Department of Biochemistry, Tulane University Health Sciences Center, 1430 Tulane Ave SL-43, New Orleans, LA 70112, USA
| | | | | |
Collapse
|