1
|
Aleksandrova AA, Sarti E, Forrest LR. EncoMPASS: An encyclopedia of membrane proteins analyzed by structure and symmetry. Structure 2024; 32:492-504.e4. [PMID: 38367624 PMCID: PMC11251422 DOI: 10.1016/j.str.2024.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 01/09/2024] [Accepted: 01/10/2024] [Indexed: 02/19/2024]
Abstract
Protein structure determination and prediction, active site detection, and protein sequence alignment techniques all exploit information about protein structure and structural relationships. For membrane proteins, however, there is limited agreement among available online tools for highlighting and mapping such structural similarities. Moreover, no available resource provides a systematic overview of quaternary and internal symmetries, and their orientation relative to the membrane, despite the fact that these properties can provide key insights into membrane protein function and evolution. Here, we describe the Encyclopedia of Membrane Proteins Analyzed by Structure and Symmetry (EncoMPASS), a database for relating integral membrane proteins of known structure from the points of view of sequence, structure, and symmetry. EncoMPASS is accessible through a web interface, and its contents can be easily downloaded. This allows the user not only to focus on specific proteins, but also to study general properties of the structure and evolution of membrane proteins.
Collapse
Affiliation(s)
- Antoniya A Aleksandrova
- Computational Structural Biology Section, National Institutes of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Edoardo Sarti
- Computational Structural Biology Section, National Institutes of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lucy R Forrest
- Computational Structural Biology Section, National Institutes of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA.
| |
Collapse
|
2
|
Knight MJ, Hardy BJ, Wheeler GL, Curnow P. Computational modelling of diatom silicic acid transporters predicts a conserved fold with implications for their function and evolution. BIOCHIMICA ET BIOPHYSICA ACTA. BIOMEMBRANES 2023; 1865:184056. [PMID: 36191629 DOI: 10.1016/j.bbamem.2022.184056] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 09/16/2022] [Accepted: 09/20/2022] [Indexed: 11/27/2022]
Abstract
Diatoms are an important group of algae that can produce intricate silicified cell walls (frustules). The complex process of silicification involves a set of enigmatic integral membrane proteins that are thought to actively transport the soluble precursor of biosilica, dissolved silicic acid. Full-length silicic acid transporters are found widely across the diatoms while homologous shorter proteins have now been identified in a range of other organisms. It has been suggested that modern silicic acid transporters arose from the union of such partial sequences. Here, we present a computational study of the silicic acid transporters and related transporter-like sequences to help understand the structure, function and evolution of this class of membrane protein. The AlphaFold software predicts that all of the protein sequences studied here share a common fold in the membrane domain which is entirely different from the predicted folds of non-homologous silicic acid transporters from plants. Substrate docking reveals how conserved polar residues could interact with silicic acid at a central solvent-accessible binding site, consistent with an alternating access mechanism of transport. The structural conservation between these proteins supports a model where modern silicon transporters evolved from smaller ancestral proteins by gene fusion.
Collapse
Affiliation(s)
| | | | | | - Paul Curnow
- School of Biochemistry, University of Bristol, UK.
| |
Collapse
|
3
|
Symmetry of Post-Translational Modifications in a Human Enzyme. Symmetry (Basel) 2022. [DOI: 10.3390/sym14020212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Paraoxonase 2 (PON2) is a member of a small family of human lactonases. Recently, post-translational modifications (PTMs) of PON2 were highlighted, one of which involved the modulation of the enzyme activity. Furthermore, two important single nucleotide polymorphisms (SNPs) involved in type 2 diabetes and its consequences, were found to modulate the enzyme activity as well. The position on the PON2 structural model of both residues corresponding to SNPs and PTMs suggested a symmetry of the molecule. By sequence and structure superposition we were able to confirm this finding. The result will be discussed in light of the evolution of symmetry in biological molecules and their function.
Collapse
|
4
|
Shalit Y, Tuvi-Arad I. Side chain flexibility and the symmetry of protein homodimers. PLoS One 2020; 15:e0235863. [PMID: 32706779 PMCID: PMC7380632 DOI: 10.1371/journal.pone.0235863] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 06/24/2020] [Indexed: 01/22/2023] Open
Abstract
A comprehensive analysis of crystallographic data of 565 high-resolution protein homodimers comprised of over 250,000 residues suggests that amino acids form two groups that differ in their tendency to distort or symmetrize the structure of protein homodimers. Residues of the first group tend to distort the protein homodimer and generally have long or polar side chains. These include: Lys, Gln, Glu, Arg, Asn, Met, Ser, Thr and Asp. Residues of the second group contribute to protein symmetry and are generally characterized by short or aromatic side chains. These include: Ile, Pro, His, Val, Cys, Leu, Trp, Tyr, Phe, Ala and Gly. The distributions of the continuous symmetry measures of the proteins and the continuous chirality measures of their building blocks highlight the role of side chain geometry and the interplay between entropy and symmetry in dictating the conformational flexibility of proteins.
Collapse
Affiliation(s)
- Yaffa Shalit
- Department of Natural Sciences, The Open University of Israel, Raanana, Israel
| | - Inbal Tuvi-Arad
- Department of Natural Sciences, The Open University of Israel, Raanana, Israel
- * E-mail:
| |
Collapse
|
5
|
Sarti E, Aleksandrova AA, Ganta SK, Yavatkar AS, Forrest LR. EncoMPASS: an online database for analyzing structure and symmetry in membrane proteins. Nucleic Acids Res 2020; 47:D315-D321. [PMID: 30357403 PMCID: PMC6323976 DOI: 10.1093/nar/gky952] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 10/05/2018] [Indexed: 12/21/2022] Open
Abstract
The EncoMPASS online database (http://encompass.ninds.nih.gov) collects, organizes, and presents information about membrane proteins of known structure, emphasizing their structural similarities as well as their quaternary and internal symmetries. Unlike, e.g. SCOP, the EncoMPASS database does not aim for a strict classification of membrane proteins, but instead is organized as a protein chain-centric network of sequence and structural homologues. The online server for the EncoMPASS database provides tools for comparing the structural features of its entries, making it a useful resource for homology modeling and active site identification studies. The database can also be used for inferring functionality, which for membrane proteins often involves symmetry-related mechanisms. To this end, the online database also provides a comprehensive description of both the quaternary and internal symmetries in known membrane protein structures, with a particular focus on their orientation relative to the membrane.
Collapse
Affiliation(s)
- Edoardo Sarti
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Antoniya A Aleksandrova
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Srujan K Ganta
- Bioinformatics Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Amarendra S Yavatkar
- Bioinformatics Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lucy R Forrest
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
6
|
Pseudo-Symmetric Assembly of Protodomains as a Common Denominator in the Evolution of Polytopic Helical Membrane Proteins. J Mol Evol 2020; 88:319-344. [PMID: 32189026 PMCID: PMC7162841 DOI: 10.1007/s00239-020-09934-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 02/16/2020] [Indexed: 11/05/2022]
Abstract
The polytopic helical membrane proteome is dominated by proteins containing seven transmembrane helices (7TMHs). They cannot be grouped under a monolithic fold or superfold. However, a parallel structural analysis of folds around that magic number of seven in distinct protein superfamilies (SWEET, PnuC, TRIC, FocA, Aquaporin, GPCRs) reveals a common homology, not in their structural fold, but in their systematic pseudo-symmetric construction during their evolution. Our analysis leads to guiding principles of intragenic duplication and pseudo-symmetric assembly of ancestral transmembrane helical protodomains, consisting of 3 (or 4) helices. A parallel deconstruction and reconstruction of these domains provides a structural and mechanistic framework for their evolutionary paths. It highlights the conformational plasticity inherent to fold formation itself, the role of structural as well as functional constraints in shaping that fold, and the usefulness of protodomains as a tool to probe convergent vs divergent evolution. In the case of FocA vs. Aquaporin, this protodomain analysis sheds new light on their potential divergent evolution at the protodomain level followed by duplication and parallel evolution of the two folds. GPCR domains, whose function does not seem to require symmetry, nevertheless exhibit structural pseudo-symmetry. Their construction follows the same protodomain assembly as any other pseudo-symmetric protein suggesting their potential evolutionary origins. Interestingly, all the 6/7/8TMH pseudo-symmetric folds in this study also assemble as oligomeric forms in the membrane, emphasizing the role of symmetry in evolution, revealing self-assembly and co-evolution not only at the protodomain level but also at the domain level.
Collapse
|
7
|
Pagès G, Grudinin S. DeepSymmetry: using 3D convolutional networks for identification of tandem repeats and internal symmetries in protein structures. Bioinformatics 2019; 35:5113-5120. [PMID: 31161198 DOI: 10.1093/bioinformatics/btz454] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 04/16/2019] [Accepted: 05/29/2019] [Indexed: 01/31/2023] Open
Abstract
MOTIVATION Thanks to the recent advances in structural biology, nowadays 3D structures of various proteins are solved on a routine basis. A large portion of these structures contain structural repetitions or internal symmetries. To understand the evolution mechanisms of these proteins and how structural repetitions affect the protein function, we need to be able to detect such proteins very robustly. As deep learning is particularly suited to deal with spatially organized data, we applied it to the detection of proteins with structural repetitions. RESULTS We present DeepSymmetry, a versatile method based on 3D convolutional networks that detects structural repetitions in proteins and their density maps. Our method is designed to identify tandem repeat proteins, proteins with internal symmetries, symmetries in the raw density maps, their symmetry order and also the corresponding symmetry axes. Detection of symmetry axes is based on learning 6D Veronese mappings of 3D vectors, and the median angular error of axis determination is less than one degree. We demonstrate the capabilities of our method on benchmarks with tandem-repeated proteins and also with symmetrical assemblies. For example, we have discovered about 7800 putative tandem repeat proteins in the PDB. AVAILABILITY AND IMPLEMENTATION The method is available at https://team.inria.fr/nano-d/software/deepsymmetry. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and a Python code based on the TensorFlow framework for applying the DeepSymmetry model to these maps. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Guillaume Pagès
- Inria, Université Grenoble Alpes, CNRS, Grenoble INP, LJK, 38000 Grenoble, France
| | - Sergei Grudinin
- Inria, Université Grenoble Alpes, CNRS, Grenoble INP, LJK, 38000 Grenoble, France
| |
Collapse
|
8
|
Aleksandrova AA, Sarti E, Forrest LR. MemSTATS: A Benchmark Set of Membrane Protein Symmetries and Pseudosymmetries. J Mol Biol 2019; 432:597-604. [PMID: 31628944 DOI: 10.1016/j.jmb.2019.09.020] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 08/30/2019] [Accepted: 09/23/2019] [Indexed: 02/06/2023]
Abstract
In membrane proteins, symmetry and pseudosymmetry often have functional or evolutionary implications. However, available symmetry detection methods have not been tested systematically on this class of proteins because of the lack of an appropriate benchmark set. Here we present MemSTATS, a publicly available benchmark set of both quaternary- and internal-symmetries in membrane protein structures. The symmetries are described in terms of order, repeated elements, and orientation of the axis with respect to the membrane plane. Moreover, using MemSTATS, we compare the performance of four widely used symmetry detection algorithms and highlight specific challenges and areas for improvement in the future.
Collapse
Affiliation(s)
- Antoniya A Aleksandrova
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Edoardo Sarti
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Lucy R Forrest
- Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
9
|
Rajathei DM, Parthasarathy S, Selvaraj S. Identification and Analysis of Long Repeats of Proteins at the Domain Level. Front Bioeng Biotechnol 2019; 7:250. [PMID: 31649924 PMCID: PMC6795024 DOI: 10.3389/fbioe.2019.00250] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 09/16/2019] [Indexed: 12/27/2022] Open
Abstract
Amino acid repeats play an important role in the structure and function of proteins. Analysis of long repeats in protein sequences enables one to understand their abundance, structure and function in the protein universe. In the present study, amino acid repeats of length >50 (long repeats) were identified in a non-redundant set of UniProt sequences using the RADAR program. The underlying structures and functions of these long repeats were carried out using the Gene3D for structural domains, Pfam for functional domains and enzyme and non-enzyme functional classification for catalytic and binding of the proteins. From a structural perspective, these long repeats seem to predominantly occur in certain architectures such as sandwich, bundle, barrel, and roll and within these architectures abundant in the superfolds. The lengths of the repeats within each fold are not uniform exhibiting different structures for different functions. We also observed that long repeats are in the domain regions of the family and are involved in the function of the proteins. After grouping based on enzyme and non-enzyme classes, we observed the abundant occurrence of long repeats in specific catalytic and binding of the proteins. In this study, we have analyzed the occurrence of long repeats in the protein sequence universe apart from well-characterized short tandem repeats in sequences and their structures and functions of the proteins at the domain level. The present study suggests that long repeats may play an important role in the structure and function of domains of the proteins.
Collapse
Affiliation(s)
- David Mary Rajathei
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, India
| | - Subbiah Parthasarathy
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, India
| | - Samuel Selvaraj
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli, India
| |
Collapse
|
10
|
Tuvi-Arad I, Alon G. Improved algorithms for quantifying the near symmetry of proteins: complete side chains analysis. J Cheminform 2019; 11:39. [PMID: 31172379 PMCID: PMC6551912 DOI: 10.1186/s13321-019-0360-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 05/28/2019] [Indexed: 12/12/2022] Open
Abstract
Symmetry of proteins, an important source of their elegant structure and unique functions, is not as perfect as it may seem. In the framework of continuous symmetry, in which symmetry is no longer a binary yes/no property, such imperfections can be quantified and used as a global descriptor of the three-dimensional structure. We present an improved algorithm for calculating the continuous symmetry measure for proteins that takes into account their complete set of atoms including all side chains. Our method takes advantage of the protein sequence and the division into peptides in order to improve the accuracy and efficiency of the calculation over previous methods. The Hungarian algorithm is applied to solve the assignment problem and find the permutation that defines the symmetry operation. Analysis of the symmetry of several sets of protein homomers, with various degrees of rotational symmetry is presented. The new methodology lays the foundations for accurate, efficient and reliable large scale symmetry analysis of protein structure and can be used as a collective variable that describes changes of the protein geometry along various processes, both at the backbone level and for the complete protein structure.
Collapse
Affiliation(s)
- Inbal Tuvi-Arad
- Department of Natural Sciences, The Open University of Israel, 4353701, Raanana, Israel.
| | - Gil Alon
- Department of Mathematics and Computer Science, The Open University of Israel, 4353701, Raanana, Israel.
| |
Collapse
|
11
|
Bliven SE, Lafita A, Rose PW, Capitani G, Prlić A, Bourne PE. Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm. PLoS Comput Biol 2019; 15:e1006842. [PMID: 31009453 PMCID: PMC6504099 DOI: 10.1371/journal.pcbi.1006842] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 05/07/2019] [Accepted: 01/29/2019] [Indexed: 01/04/2023] Open
Abstract
Many proteins fold into highly regular and repetitive three dimensional structures. The analysis of structural patterns and repeated elements is fundamental to understand protein function and evolution. We present recent improvements to the CE-Symm tool for systematically detecting and analyzing the internal symmetry and structural repeats in proteins. In addition to the accurate detection of internal symmetry, the tool is now capable of i) reporting the type of symmetry, ii) identifying the smallest repeating unit, iii) describing the arrangement of repeats with transformation operations and symmetry axes, and iv) comparing the similarity of all the internal repeats at the residue level. CE-Symm 2.0 helps the user investigate proteins with a robust and intuitive sequence-to-structure analysis, with many applications in protein classification, functional annotation and evolutionary studies. We describe the algorithmic extensions of the method and demonstrate its applications to the study of interesting cases of protein evolution. Many protein structures show a great deal of regularity. Even within single polypeptide chains, about 25% of proteins contain self-similar repeating structures, which can be organized in ring-like symmetric arrangements or linear open repeats. The repeats are often related, and thus comparing the sequence and structure of repeats can give an idea as to the early evolutionary history of a protein family. Additionally, the conservation and divergence of repeats can lead to insights about the function of the proteins. This work describes CE-Symm 2.0, a tool for the analysis of protein symmetry. The method automatically detects internal symmetry in protein structures and produces a multiple alignment of structural repeats. The algorithm is able to detect the geometric relationships between the repeats, including cyclic, dihedral, and polyhedral symmetries, translational repeats, and cases where multiple symmetry operators are applicable in a hierarchical manner. These complex relationships can then be visualized in a graphical interface as a complete structure, as a superposition of repeats, or as a multiple alignment of the protein sequence. CE-Symm 2.0 can be systematically used for the automatic detection of internal symmetry in protein structures, or as an interactive tool for the analysis of structural repeats.
Collapse
Affiliation(s)
- Spencer E. Bliven
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- Institute of Applied Simulation, Zurich University of Applied Science, Wädenswil, Switzerland
- * E-mail: (SEB), (AL)
| | - Aleix Lafita
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
- * E-mail: (SEB), (AL)
| | - Peter W. Rose
- RCSB Protein Data Bank, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
- Structural Bioinformatics Laboratory, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
| | - Guido Capitani
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Andreas Prlić
- RCSB Protein Data Bank, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
| | - Philip E. Bourne
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| |
Collapse
|
12
|
Youkharibache P. Protodomains: Symmetry-Related Supersecondary Structures in Proteins and Self-Complementarity. Methods Mol Biol 2019; 1958:187-219. [PMID: 30945220 PMCID: PMC8323591 DOI: 10.1007/978-1-4939-9161-7_10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
We will consider in this chapter supersecondary structures (SSS) as a set of secondary structure elements (SSEs) found in protein domains. Some SSS arrangements/topologies have been consistently observed within known tertiary structural domains. We use them in the context of repeating supersecondary structures that self-assemble in a symmetric arrangement to form a domain. We call them protodomains (or protofolds). Protodomains are some of the most interesting and insightful SSSs. Within a given 3D protein domain/fold, recognizing such sets may give insights into a possible evolutionary process of duplication, fusion, and coevolution of these protodomains, pointing to possible original protogenes. On protein folding itself, pseudosymmetric domains may point to a "directed" assembly of pseudosymmetric protodomains, directed by the only fact that they are tethered together in a protein chain. On function, tertiary functional sites often occur at protodomain interfaces, as they often occur at domain-domain interfaces in quaternary arrangements.First, we will briefly review some lessons learned from a previously published census of pseudosymmetry in protein domains (Myers-Turnbull, D. et al., J Mol Biol. 426:2255-2268, 2014) to introduce protodomains/protofolds. We will observe that the most abundant and diversified folds, or superfolds, in the currently known protein structure universe are indeed pseudosymmetric. Then, we will learn by example and select a few domain representatives of important pseudosymmetric folds and chief among them the immunoglobulin (Ig) fold and go over a pseudosymmetry supersecondary structure (protodomain) analysis in tertiary and quaternary structures. We will point to currently available software tools to help in identifying pseudosymmetry, delineating protodomains, and see how the study of pseudosymmetry and the underlying supersecondary structures can enrich a structural analysis. This should potentially help in protein engineering, especially in the development of biologics and immunoengineering.
Collapse
|
13
|
Abstract
Repeating sequences generated from RNA gene fusions/ligations dominate ancient life, indicating central importance of building structural complexity in evolving biological systems. A simple and coherent story of life on earth is told from tracking repeating motifs that generate α/β proteins, 2-double-Ψ-β-barrel (DPBB) type RNA polymerases (RNAPs), general transcription factors (GTFs), and promoters. A general rule that emerges is that biological complexity that arises through generation of repeats is often bounded by solubility and closure (i.e., to form a pseudo-dimer or a barrel). Because the first DNA genomes were replicated by DNA template-dependent RNA synthesis followed by RNA template-dependent DNA synthesis via reverse transcriptase, the first DNA replication origins were initially 2-DPBB type RNAP promoters. A simplifying model for evolution of promoters/replication origins via repetition of core promoter elements is proposed. The model can explain why Pribnow boxes in bacterial transcription (i.e., (-12)TATAATG(-6)) so closely resemble TATA boxes (i.e., (-31)TATAAAAG(-24)) in archaeal/eukaryotic transcription. The evolution of anchor DNA sequences in bacterial (i.e., (-35)TTGACA(-30)) and archaeal (BRE(up); BRE for TFB recognition element) promoters is potentially explained. The evolution of BRE(down) elements of archaeal promoters is potentially explained.
Collapse
Affiliation(s)
- Zachary F Burton
- a Department of Biochemistry and Molecular Biology , Michigan State University , E. Lansing , MI , USA
| | - Kristopher Opron
- b Department of Mathematics , Michigan State University , E. Lansing , MI , USA
| | - Guowei Wei
- b Department of Mathematics , Michigan State University , E. Lansing , MI , USA
| | - James H Geiger
- c Department of Chemistry , Michigan State University , E. Lansing , MI , USA
| |
Collapse
|
14
|
Parallel-SymD: A Parallel Approach to Detect Internal Symmetry in Protein Domains. BIOMED RESEARCH INTERNATIONAL 2016; 2016:4628592. [PMID: 27747230 PMCID: PMC5056246 DOI: 10.1155/2016/4628592] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 08/25/2016] [Indexed: 11/24/2022]
Abstract
Internally symmetric proteins are proteins that have a symmetrical structure in their monomeric single-chain form. Around 10–15% of the protein domains can be regarded as having some sort of internal symmetry. In this regard, we previously published SymD (symmetry detection), an algorithm that determines whether a given protein structure has internal symmetry by attempting to align the protein to its own copy after the copy is circularly permuted by all possible numbers of residues. SymD has proven to be a useful algorithm to detect symmetry. In this paper, we present a new parallelized algorithm called Parallel-SymD for detecting symmetry of proteins on clusters of computers. The achieved speedup of the new Parallel-SymD algorithm scales well with the number of computing processors. Scaling is better for proteins with a larger number of residues. For a protein of 509 residues, a speedup of 63 was achieved on a parallel system with 100 processors.
Collapse
|
15
|
|
16
|
Abstract
Symmetry is a common feature among natural systems, including protein structures. A strong propensity toward symmetric architectures has long been recognized for water-soluble proteins, and this propensity has been rationalized from an evolutionary standpoint. Proteins residing in cellular membranes, however, have traditionally been less amenable to structural studies, and thus the prevalence and significance of symmetry in this important class of molecules is not as well understood. In the past two decades, researchers have made great strides in this area, and these advances have provided exciting insights into the range of architectures adopted by membrane proteins. These structural studies have revealed a similarly strong bias toward symmetric arrangements, which were often unexpected and which occurred despite the restrictions imposed by the membrane environment on the possible symmetry groups. Moreover, membrane proteins disproportionately contain internal structural repeats resulting from duplication and fusion of smaller segments. This article discusses the types and origins of symmetry in membrane proteins and the implications of symmetry for protein function.
Collapse
Affiliation(s)
- Lucy R Forrest
- Computational Structural Biology Group, Porter Neuroscience Center, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland 20852;
| |
Collapse
|
17
|
Alva V, Söding J, Lupas AN. A vocabulary of ancient peptides at the origin of folded proteins. eLife 2015; 4:e09410. [PMID: 26653858 PMCID: PMC4739770 DOI: 10.7554/elife.09410] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2015] [Accepted: 12/13/2015] [Indexed: 01/01/2023] Open
Abstract
The seemingly limitless diversity of proteins in nature arose from only a few thousand domain prototypes, but the origin of these themselves has remained unclear. We are pursuing the hypothesis that they arose by fusion and accretion from an ancestral set of peptides active as co-factors in RNA-dependent replication and catalysis. Should this be true, contemporary domains may still contain vestiges of such peptides, which could be reconstructed by a comparative approach in the same way in which ancient vocabularies have been reconstructed by the comparative study of modern languages. To test this, we compared domains representative of known folds and identified 40 fragments whose similarity is indicative of common descent, yet which occur in domains currently not thought to be homologous. These fragments are widespread in the most ancient folds and enriched for iron-sulfur- and nucleic acid-binding. We propose that they represent the observable remnants of a primordial RNA-peptide world.
Collapse
Affiliation(s)
- Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Johannes Söding
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| |
Collapse
|
18
|
Pellegrini M. Tandem Repeats in Proteins: Prediction Algorithms and Biological Role. Front Bioeng Biotechnol 2015; 3:143. [PMID: 26442257 PMCID: PMC4585158 DOI: 10.3389/fbioe.2015.00143] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 09/07/2015] [Indexed: 12/30/2022] Open
Abstract
Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.
Collapse
Affiliation(s)
- Marco Pellegrini
- Laboratory for Integrative Systems Medicine (LISM), Istituto di Informatica e Telematica, and Istituto di Fisiologia Clinica, Consiglio Nazionale delle Ricerche , Pisa , Italy
| |
Collapse
|
19
|
Leibly DJ, Arbing MA, Pashkov I, DeVore N, Waldo GS, Terwilliger TC, Yeates TO. A Suite of Engineered GFP Molecules for Oligomeric Scaffolding. Structure 2015; 23:1754-1768. [PMID: 26278175 DOI: 10.1016/j.str.2015.07.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Revised: 06/08/2015] [Accepted: 07/07/2015] [Indexed: 10/23/2022]
Abstract
Applications ranging from synthetic biology to protein crystallization could be advanced by facile systems for connecting multiple proteins together in predefined spatial relationships. One approach to this goal is to engineer many distinct assembly forms of a single carrier protein or scaffold, to which other proteins of interest can then be readily attached. In this work we chose GFP as a scaffold and engineered many alternative oligomeric forms, driven by either specific disulfide bond formation or metal ion addition. We generated a wide range of spatial arrangements of GFP subunits from 11 different oligomeric variants, and determined their X-ray structures in a total of 33 distinct crystal forms. Some of the oligomeric GFP variants show geometric polymorphism depending on conditions, while others show considerable geometric rigidity. Potential future applications of this system are discussed.
Collapse
Affiliation(s)
- David J Leibly
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Mark A Arbing
- UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Inna Pashkov
- UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA
| | - Natasha DeVore
- Bioscience Division, Los Alamos National Laboratory, MS M888, Los Alamos, NM 87545, USA
| | - Geoffrey S Waldo
- Bioscience Division, Los Alamos National Laboratory, MS M888, Los Alamos, NM 87545, USA
| | - Thomas C Terwilliger
- Bioscience Division, Los Alamos National Laboratory, MS M888, Los Alamos, NM 87545, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA.
| |
Collapse
|
20
|
Stamm M, Forrest LR. Structure alignment of membrane proteins: Accuracy of available tools and a consensus strategy. Proteins 2015; 83:1720-32. [PMID: 26178143 DOI: 10.1002/prot.24857] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Revised: 05/07/2015] [Accepted: 06/07/2015] [Indexed: 12/31/2022]
Abstract
Protein structure alignment methods are used for the detection of evolutionary and functionally related positions in proteins. A wide array of different methods are available, but the choice of the best method is often not apparent to the user. Several studies have assessed the alignment accuracy and consistency of structure alignment methods, but none of these explicitly considered membrane proteins, which are important targets for drug development and have distinct structural features. Here, we compared 13 widely used pairwise structural alignment methods on a test set of homologous membrane protein structures (called HOMEP3). Each pair of structures was aligned and the corresponding sequence alignment was used to construct homology models. The model accuracy compared to the known structures was assessed using scoring functions not incorporated in the tested structural alignment methods. The analysis shows that fragment-based approaches such as FR-TM-align are the most useful for aligning structures of membrane proteins. Moreover, fragment-based approaches are more suitable for comparison of protein structures that have undergone large conformational changes. Nevertheless, no method was clearly superior to all other methods. Additionally, all methods lack a measure to rate the reliability of a position within a structure alignment. To solve both of these problems, we propose a consensus-type approach, combining alignments from four different methods, namely FR-TM-align, DaliLite, MATT, and FATCAT. Agreement between the methods is used to assign confidence values to each position of the alignment. Overall, we conclude that there remains scope for the improvement of structural alignment methods for membrane proteins.
Collapse
Affiliation(s)
- Marcus Stamm
- Computational Structural Biology Group, Max Planck Institute of Biophysics, Frankfurt Am Main, Germany
| | - Lucy R Forrest
- Computational Structural Biology Group, Max Planck Institute of Biophysics, Frankfurt Am Main, Germany.,Computational Structural Biology Section, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
21
|
Balaji S. Internal symmetry in protein structures: prevalence, functional relevance and evolution. Curr Opin Struct Biol 2015; 32:156-66. [PMID: 26093245 DOI: 10.1016/j.sbi.2015.05.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Revised: 05/05/2015] [Accepted: 05/06/2015] [Indexed: 10/23/2022]
Abstract
Symmetry has been found at various levels of biological organization in the protein structural universe. Numerous evolutionary studies have proposed connections between internal symmetry within protein tertiary structures, quaternary associations and protein functions. Recent computational methods, such as SymD and CE-Symm, facilitate a large-scale detection of internal symmetry in protein structures. Based on the results from these methods, about 20% of SCOP folds, superfamilies and families are estimated to have structures with internal symmetry (Figure 1d). All-β and membrane proteins fold classes contain a relatively high number of unique instances of internal symmetry. In addition to the axis of symmetry, anecdotal evidence suggests that, the region of connection or contact between symmetric units could coincide with functionally relevant sites within a fold. General principles that underlie protein internal symmetry and their connections to protein structural integrity and functions remain to be elucidated.
Collapse
Affiliation(s)
- Santhanam Balaji
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom.
| |
Collapse
|
22
|
Patra D, Mishra P, Surolia A, Vijayan M. Structure, interactions and evolutionary implications of a domain-swapped lectin dimer from Mycobacterium smegmatis. Glycobiology 2014; 24:956-65. [DOI: 10.1093/glycob/cwu059] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
23
|
Loch JI, Molenda M, Kopeć M, Świątek S, Lewiński K. Structure of two crystal forms of sheep β‐lactoglobulin with EF‐loop in closed conformation. Biopolymers 2014; 101:886-94. [DOI: 10.1002/bip.22471] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Revised: 01/31/2014] [Accepted: 02/07/2014] [Indexed: 11/08/2022]
Affiliation(s)
- Joanna I. Loch
- Department of Crystal Chemistry and Crystal PhysicsFaculty of Chemistry Jagiellonian University in KrakówIngardena 3Kraków30–060 Poland
| | - Marta Molenda
- Department of Crystal Chemistry and Crystal PhysicsFaculty of Chemistry Jagiellonian University in KrakówIngardena 3Kraków30–060 Poland
| | - Magdalena Kopeć
- Department of Crystal Chemistry and Crystal PhysicsFaculty of Chemistry Jagiellonian University in KrakówIngardena 3Kraków30–060 Poland
| | - Sylwia Świątek
- Department of Crystal Chemistry and Crystal PhysicsFaculty of Chemistry Jagiellonian University in KrakówIngardena 3Kraków30–060 Poland
| | - Krzysztof Lewiński
- Department of Crystal Chemistry and Crystal PhysicsFaculty of Chemistry Jagiellonian University in KrakówIngardena 3Kraków30–060 Poland
| |
Collapse
|
24
|
Tai CH, Paul R, Dukka KC, Shilling JD, Lee B. SymD webserver: a platform for detecting internally symmetric protein structures. Nucleic Acids Res 2014; 42:W296-300. [PMID: 24799435 PMCID: PMC4086132 DOI: 10.1093/nar/gku364] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Internal symmetry of a protein structure is the pseudo-symmetry that a single protein chain sometimes exhibits. This is in contrast to the symmetry with which monomers are arranged in many multimeric protein complexes. SymD is a program that detects proteins with internal symmetry. It proved to be useful for analyzing protein structure, function and modeling. This web-based interactive tool was developed by implementing the SymD algorithm. To the best of our knowledge, SymD webserver is the first tool of its kind with which users can easily study the symmetry of the protein they are interested in by uploading the structure or retrieving it from databases. It uses the Galaxy platform to take advantage of its extensibility and displays the symmetry properties, the symmetry axis and the sequence alignment of the structures before and after the symmetry transformation via an interactive graphical visualization environment in any modern web browser. An Example Run video displays the workflow to help users navigate. SymD webserver is publicly available at http://symd.nci.nih.gov.
Collapse
Affiliation(s)
- Chin-Hsien Tai
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Rohit Paul
- Office of Information Technology, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | | | - Jeffery D Shilling
- Office of Information Technology, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | - Byungkook Lee
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
25
|
Myers-Turnbull D, Bliven SE, Rose PW, Aziz ZK, Youkharibache P, Bourne PE, Prlić A. Systematic detection of internal symmetry in proteins using CE-Symm. J Mol Biol 2014; 426:2255-68. [PMID: 24681267 DOI: 10.1016/j.jmb.2014.03.010] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 11/26/2022]
Abstract
Symmetry is an important feature of protein tertiary and quaternary structures that has been associated with protein folding, function, evolution, and stability. Its emergence and ensuing prevalence has been attributed to gene duplications, fusion events, and subsequent evolutionary drift in sequence. This process maintains structural similarity and is further supported by this study. To further investigate the question of how internal symmetry evolved, how symmetry and function are related, and the overall frequency of internal symmetry, we developed an algorithm, CE-Symm, to detect pseudo-symmetry within the tertiary structure of protein chains. Using a large manually curated benchmark of 1007 protein domains, we show that CE-Symm performs significantly better than previous approaches. We use CE-Symm to build a census of symmetry among domain superfamilies in SCOP and note that 18% of all superfamilies are pseudo-symmetric. Our results indicate that more domains are pseudo-symmetric than previously estimated. We establish a number of recurring types of symmetry-function relationships and describe several characteristic cases in detail. With the use of the Enzyme Commission classification, symmetry was found to be enriched in some enzyme classes but depleted in others. CE-Symm thus provides a methodology for a more complete and detailed study of the role of symmetry in tertiary protein structure [availability: CE-Symm can be run from the Web at http://source.rcsb.org/jfatcatserver/symmetry.jsp. Source code and software binaries are also available under the GNU Lesser General Public License (version 2.1) at https://github.com/rcsb/symmetry. An interactive census of domains identified as symmetric by CE-Symm is available from http://source.rcsb.org/jfatcatserver/scopResults.jsp].
Collapse
Affiliation(s)
- Douglas Myers-Turnbull
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Spencer E Bliven
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Peter W Rose
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA
| | - Zaid K Aziz
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA 92093, USA
| | | | - Philip E Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.
| | - Andreas Prlić
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
26
|
Tai CH, Bai H, Taylor TJ, Lee B. Assessment of template-free modeling in CASP10 and ROLL. Proteins 2013; 82 Suppl 2:57-83. [PMID: 24343678 DOI: 10.1002/prot.24470] [Citation(s) in RCA: 77] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2013] [Revised: 10/23/2013] [Accepted: 10/29/2013] [Indexed: 12/27/2022]
Abstract
We present the assessment of predictions for Template-Free Modeling in CASP10 and a report on the first ROLL experiment wherein predictions are collected year round for review at the regular CASP season. Models were first clustered so that duplicated or very similar ones were grouped together and represented by one model in the cluster. The representatives were then compared with targets using GDT_TS, QCS, and three additional superposition-independent score functions newly developed for CASP10. For each target, the top 15 representatives by each score were pooled to form the Top15Union set. All models in this set were visually inspected by four of us independently using the new plugin, EvalScore, which we developed with the UCSF Chimera group. The best models were selected for each target after extensive debate among the four examiners. Groups were ranked by the number of targets (hits) for which a group's model was selected as one of the best models. The Keasar group had most hits in both categories, with four of 19 FM and eight of 36 ROLL targets. The most successful prediction servers were QUARK from Zhang's group for FM category with three hits and Zhang-server for the ROLL category with seven hits. As observed in CASP9, many successful groups were not true "template-free" modelers but used remote templates and/or server models to obtain their winning models. The results of the first ROLL experiment were broadly similar to those of the CASP10 FM exercise.
Collapse
Affiliation(s)
- Chin-Hsien Tai
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, 20892
| | | | | | | |
Collapse
|
27
|
Rueda M, Orozco M, Totrov M, Abagyan R. BioSuper: a web tool for the superimposition of biomolecules and assemblies with rotational symmetry. BMC STRUCTURAL BIOLOGY 2013; 13:32. [PMID: 24330655 PMCID: PMC3924234 DOI: 10.1186/1472-6807-13-32] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 12/03/2013] [Indexed: 12/02/2022]
Abstract
Background Most of the proteins in the Protein Data Bank (PDB) are oligomeric complexes consisting of two or more subunits that associate by rotational or helical symmetries. Despite the myriad of superimposition tools in the literature, we could not find any able to account for rotational symmetry and display the graphical results in the web browser. Results BioSuper is a free web server that superimposes and calculates the root mean square deviation (RMSD) of protein complexes displaying rotational symmetry. To the best of our knowledge, BioSuper is the first tool of its kind that provides immediate interactive visualization of the graphical results in the browser, biomolecule generator capabilities, different levels of atom selection, sequence-dependent and structure-based superimposition types, and is the only web tool that takes into account the equivalence of atoms in side chains displaying symmetry ambiguity. BioSuper uses ICM program functionality as a core for the superimpositions and displays the results as text, HTML tables and 3D interactive molecular objects that can be visualized in the browser or in Android and iOS platforms with a free plugin. Conclusions BioSuper is a fast and functional tool that allows for pairwise superimposition of proteins and assemblies displaying rotational symmetry. The web server was created after our own frustration when attempting to superimpose flexible oligomers. We strongly believe that its user-friendly and functional design will be of great interest for structural and computational biologists who need to superimpose oligomeric proteins (or any protein). BioSuper web server is freely available to all users at http://ablab.ucsd.edu/BioSuper.
Collapse
Affiliation(s)
| | | | | | - Ruben Abagyan
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA.
| |
Collapse
|
28
|
Shen X, Chen S, Li G. Role for gene sequence, codon bias and mRNA folding energy in modulating structural symmetry of proteins. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2013; 2013:596-9. [PMID: 24109757 DOI: 10.1109/embc.2013.6609570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Structural symmetry in proteins is commonly observed in the majority of fundamental protein folds. Meanwhile, nascent polypeptide chains of proteins have the potential to start the co-translational folding process and this process can have drastic effects on protein structure. Thus we are interested in understanding mechanisms that gene adopts in specifying structural symmetry in proteins. In the present paper, we reveal that for two representative symmetric proteins from (aβ)8-barrel fold and beta-trefoil fold, intragenic symmetry is detected in the corresponding gene sequences. Codon bias and mRNA folding energy might be involved in mediating translation speed for the formation of structural symmetry: at least one major decrease in both codon bias and mRNA folding energy can be observed in the connecting region of the symmetric substructures along the codon sequence. Results suggest that gene duplication and fusion is responsible for structural symmetry in these proteins, and the usage of rare codons or higher order of secondary structure near the boundaries of symmetric substructures might be selected in order to slow down translation speed for effectively co-translational folding process of symmetric proteins.
Collapse
|
29
|
Structure of the proton-gated urea channel from the gastric pathogen Helicobacter pylori. Nature 2012; 493:255-8. [PMID: 23222544 PMCID: PMC3974264 DOI: 10.1038/nature11684] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2012] [Accepted: 10/18/2012] [Indexed: 12/14/2022]
Abstract
Half the world's population is chronically infected with Helicobacter pylori1, causing gastritis, ulcers and increased incidence of gastric adenocarcinoma2. Its proton-gated inner-membrane urea channel, HpUreI, is essential for survival in the acidic environment of the stomach3. The channel is closed at neutral pH and opens at acidic pH to allow rapid urea access to cytoplasmic urease4. Urease produces NH3 and CO2 that neutralize entering protons and thus buffer the periplasm to pH ∼6.1 even in gastric juice at pH <2.0. Here we report the structure of HpUreI, revealing six protomers assembled in a hexameric ring surrounding a central bilayer plug of ordered lipids. Each protomer encloses a channel formed by a twisted bundle of six transmembrane helices. The bundle defines a novel fold comprising a two-helix hairpin motif repeated three times around the central axis of the channel, without the inverted repeat of mammalian urea transporters. Both the channel and the protomer interface contain residues conserved in the AmiS/UreI superfamily, suggesting preservation of channel architecture and oligomeric state in this superfamily. Predominantly aromatic or aliphatic side chains line the entire channel and define two consecutive constriction sites in the middle of the channel. Mutation of Trp153 in the cytoplasmic constriction site to Ala or Phe reduces the selectivity for urea compared to thiourea, suggesting that solute interaction with Trp153 contributes specificity. The novel hexameric channel structure described here provides a new paradigm for permeation of urea and other small amide solutes in prokaryotes and archaea.
Collapse
|
30
|
Wang JJY, Bensmail H, Gao X. Multiple graph regularized protein domain ranking. BMC Bioinformatics 2012; 13:307. [PMID: 23157331 PMCID: PMC3583823 DOI: 10.1186/1471-2105-13-307] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 10/29/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Protein domain ranking is a fundamental task in structural biology. Most protein domain ranking methods rely on the pairwise comparison of protein domains while neglecting the global manifold structure of the protein domain database. Recently, graph regularized ranking that exploits the global structure of the graph defined by the pairwise similarities has been proposed. However, the existing graph regularized ranking methods are very sensitive to the choice of the graph model and parameters, and this remains a difficult problem for most of the protein domain ranking methods. RESULTS To tackle this problem, we have developed the Multiple Graph regularized Ranking algorithm, MultiG-Rank. Instead of using a single graph to regularize the ranking scores, MultiG-Rank approximates the intrinsic manifold of protein domain distribution by combining multiple initial graphs for the regularization. Graph weights are learned with ranking scores jointly and automatically, by alternately minimizing an objective function in an iterative algorithm. Experimental results on a subset of the ASTRAL SCOP protein domain database demonstrate that MultiG-Rank achieves a better ranking performance than single graph regularized ranking methods and pairwise similarity based ranking methods. CONCLUSION The problem of graph model and parameter selection in graph regularized protein domain ranking can be solved effectively by combining multiple graphs. This aspect of generalization introduces a new frontier in applying multiple graphs to solving protein domain ranking applications.
Collapse
Affiliation(s)
- Jim Jing-Yan Wang
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
| | | | | |
Collapse
|
31
|
Lee J, Blaber SI, Dubey VK, Blaber M. A polypeptide "building block" for the β-trefoil fold identified by "top-down symmetric deconstruction". J Mol Biol 2011; 407:744-63. [PMID: 21315087 DOI: 10.1016/j.jmb.2011.02.002] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Revised: 01/31/2011] [Accepted: 02/02/2011] [Indexed: 12/31/2022]
Abstract
Fibroblast growth factor-1, a member of the 3-fold symmetric β-trefoil fold, was subjected to a series of symmetric constraint mutations in a process termed "top-down symmetric deconstruction." The mutations enforced a cumulative exact 3-fold symmetry upon symmetrically equivalent positions within the protein and were combined with a stability screen. This process culminated in a β-trefoil protein with exact 3-fold primary-structure symmetry that exhibited excellent folding and stability properties. Subsequent fragmentation of the repeating primary-structure motif yielded a 42-residue polypeptide capable of spontaneous assembly as a homotrimer, producing a thermostable β-trefoil architecture. The results show that despite pronounced reduction in sequence complexity, pure symmetry in the design of a foldable, thermostable β-trefoil fold is possible. The top-down symmetric deconstruction approach provides a novel alternative means to successfully identify a useful polypeptide "building block" for subsequent "bottom-up" de novo design of target protein architecture.
Collapse
Affiliation(s)
- Jihun Lee
- Department of Biomedical Sciences, College of Medicine, Florida State University, Tallahassee, FL 32306-4300, USA
| | | | | | | |
Collapse
|
32
|
Han L, Monné M, Okumura H, Schwend T, Cherry AL, Flot D, Matsuda T, Jovine L. Insights into Egg Coat Assembly and Egg-Sperm Interaction from the X-Ray Structure of Full-Length ZP3. Cell 2010; 143:404-15. [DOI: 10.1016/j.cell.2010.09.041] [Citation(s) in RCA: 118] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2010] [Revised: 08/11/2010] [Accepted: 08/24/2010] [Indexed: 11/15/2022]
|