1
|
Maturana P, Tobar-Calfucoy E, Fuentealba M, Roversi P, Garratt R, Cabrera R. Crystal structure of the 6-phosphogluconate dehydrogenase from Gluconobacter oxydans reveals tetrameric 6PGDHs as the crucial intermediate in the evolution of structure and cofactor preference in the 6PGDH family. Wellcome Open Res 2021. [DOI: 10.12688/wellcomeopenres.16572.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Background: The enzyme 6-phosphogluconate dehydrogenase (6PGDH) is the central enzyme of the oxidative pentose phosphate pathway. Members of the 6PGDH family belong to different classes: either homodimeric enzymes assembled from long-chain subunits or homotetrameric ones assembled from short-chain subunits. Dimeric 6PGDHs bear an internal duplication absent in tetrameric 6PGDHs and distant homologues of the β-hydroxyacid dehydrogenase (βHADH) superfamily. Methods: We use X-ray crystallography to determine the structure of the apo form of the 6PGDH from Gluconobacter oxydans (Go6PGDH). We carried out a structural and phylogenetic analysis of short and long-chain 6PGDHs. We put forward an evolutionary hypothesis explaining the differences seen in oligomeric state vs. dinucleotide preference of the 6PGDH family. We determined the cofactor preference of Go6PGDH at different 6-phosphogluconate concentrations, characterizing the wild-type enzyme and three-point mutants of residues in the cofactor binding site of Go6PGDH. Results: The structural comparison suggests that the 6PG binding site initially evolved by exchanging C-terminal α-helices between subunits. An internal duplication event changed the quaternary structure of the enzyme from a tetrameric to a dimeric arrangement. The phylogenetic analysis suggests that 6PGDHs have spread from Bacteria to Archaea and Eukarya on multiple occasions by lateral gene transfer. Sequence motifs consistent with NAD+- and NADP+-specificity are found in the β2-α2 loop of dimeric and tetrameric 6PGDHs. Site-directed mutagenesis of Go6PGDH inspired by this analysis fully reverses dinucleotide preference. One of the mutants we engineered has the highest efficiency and specificity for NAD+ so far described for a 6PGDH. Conclusions: The family 6PGDH comprises dimeric and tetrameric members whose active sites are conformed by a C-terminal α-helix contributed from adjacent subunits. Dimeric 6PGDHs have evolved from the duplication-fusion of the tetrameric C-terminal domain before independent transitions of cofactor specificity. Changes in the conserved β2-α2 loop are crucial to modulate the cofactor specificity in Go6PGDH.
Collapse
|
2
|
Shalit Y, Tuvi-Arad I. Side chain flexibility and the symmetry of protein homodimers. PLoS One 2020; 15:e0235863. [PMID: 32706779 PMCID: PMC7380632 DOI: 10.1371/journal.pone.0235863] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 06/24/2020] [Indexed: 01/22/2023] Open
Abstract
A comprehensive analysis of crystallographic data of 565 high-resolution protein homodimers comprised of over 250,000 residues suggests that amino acids form two groups that differ in their tendency to distort or symmetrize the structure of protein homodimers. Residues of the first group tend to distort the protein homodimer and generally have long or polar side chains. These include: Lys, Gln, Glu, Arg, Asn, Met, Ser, Thr and Asp. Residues of the second group contribute to protein symmetry and are generally characterized by short or aromatic side chains. These include: Ile, Pro, His, Val, Cys, Leu, Trp, Tyr, Phe, Ala and Gly. The distributions of the continuous symmetry measures of the proteins and the continuous chirality measures of their building blocks highlight the role of side chain geometry and the interplay between entropy and symmetry in dictating the conformational flexibility of proteins.
Collapse
Affiliation(s)
- Yaffa Shalit
- Department of Natural Sciences, The Open University of Israel, Raanana, Israel
| | - Inbal Tuvi-Arad
- Department of Natural Sciences, The Open University of Israel, Raanana, Israel
- * E-mail:
| |
Collapse
|
3
|
Bliven SE, Lafita A, Rose PW, Capitani G, Prlić A, Bourne PE. Analyzing the symmetrical arrangement of structural repeats in proteins with CE-Symm. PLoS Comput Biol 2019; 15:e1006842. [PMID: 31009453 PMCID: PMC6504099 DOI: 10.1371/journal.pcbi.1006842] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 05/07/2019] [Accepted: 01/29/2019] [Indexed: 01/04/2023] Open
Abstract
Many proteins fold into highly regular and repetitive three dimensional structures. The analysis of structural patterns and repeated elements is fundamental to understand protein function and evolution. We present recent improvements to the CE-Symm tool for systematically detecting and analyzing the internal symmetry and structural repeats in proteins. In addition to the accurate detection of internal symmetry, the tool is now capable of i) reporting the type of symmetry, ii) identifying the smallest repeating unit, iii) describing the arrangement of repeats with transformation operations and symmetry axes, and iv) comparing the similarity of all the internal repeats at the residue level. CE-Symm 2.0 helps the user investigate proteins with a robust and intuitive sequence-to-structure analysis, with many applications in protein classification, functional annotation and evolutionary studies. We describe the algorithmic extensions of the method and demonstrate its applications to the study of interesting cases of protein evolution. Many protein structures show a great deal of regularity. Even within single polypeptide chains, about 25% of proteins contain self-similar repeating structures, which can be organized in ring-like symmetric arrangements or linear open repeats. The repeats are often related, and thus comparing the sequence and structure of repeats can give an idea as to the early evolutionary history of a protein family. Additionally, the conservation and divergence of repeats can lead to insights about the function of the proteins. This work describes CE-Symm 2.0, a tool for the analysis of protein symmetry. The method automatically detects internal symmetry in protein structures and produces a multiple alignment of structural repeats. The algorithm is able to detect the geometric relationships between the repeats, including cyclic, dihedral, and polyhedral symmetries, translational repeats, and cases where multiple symmetry operators are applicable in a hierarchical manner. These complex relationships can then be visualized in a graphical interface as a complete structure, as a superposition of repeats, or as a multiple alignment of the protein sequence. CE-Symm 2.0 can be systematically used for the automatic detection of internal symmetry in protein structures, or as an interactive tool for the analysis of structural repeats.
Collapse
Affiliation(s)
- Spencer E. Bliven
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- Institute of Applied Simulation, Zurich University of Applied Science, Wädenswil, Switzerland
- * E-mail: (SEB), (AL)
| | - Aleix Lafita
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
- * E-mail: (SEB), (AL)
| | - Peter W. Rose
- RCSB Protein Data Bank, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
- Structural Bioinformatics Laboratory, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
| | - Guido Capitani
- Laboratory of Biomolecular Research, Paul Scherrer Institute, Villigen, Switzerland
- Department of Biology, ETH Zurich, Zurich, Switzerland
| | - Andreas Prlić
- RCSB Protein Data Bank, San Diego Supercomputing Center, University of California San Diego, La Jolla, California, United States of America
| | - Philip E. Bourne
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, United States of America
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
| |
Collapse
|
4
|
Youkharibache P. Protodomains: Symmetry-Related Supersecondary Structures in Proteins and Self-Complementarity. Methods Mol Biol 2019; 1958:187-219. [PMID: 30945220 PMCID: PMC8323591 DOI: 10.1007/978-1-4939-9161-7_10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
We will consider in this chapter supersecondary structures (SSS) as a set of secondary structure elements (SSEs) found in protein domains. Some SSS arrangements/topologies have been consistently observed within known tertiary structural domains. We use them in the context of repeating supersecondary structures that self-assemble in a symmetric arrangement to form a domain. We call them protodomains (or protofolds). Protodomains are some of the most interesting and insightful SSSs. Within a given 3D protein domain/fold, recognizing such sets may give insights into a possible evolutionary process of duplication, fusion, and coevolution of these protodomains, pointing to possible original protogenes. On protein folding itself, pseudosymmetric domains may point to a "directed" assembly of pseudosymmetric protodomains, directed by the only fact that they are tethered together in a protein chain. On function, tertiary functional sites often occur at protodomain interfaces, as they often occur at domain-domain interfaces in quaternary arrangements.First, we will briefly review some lessons learned from a previously published census of pseudosymmetry in protein domains (Myers-Turnbull, D. et al., J Mol Biol. 426:2255-2268, 2014) to introduce protodomains/protofolds. We will observe that the most abundant and diversified folds, or superfolds, in the currently known protein structure universe are indeed pseudosymmetric. Then, we will learn by example and select a few domain representatives of important pseudosymmetric folds and chief among them the immunoglobulin (Ig) fold and go over a pseudosymmetry supersecondary structure (protodomain) analysis in tertiary and quaternary structures. We will point to currently available software tools to help in identifying pseudosymmetry, delineating protodomains, and see how the study of pseudosymmetry and the underlying supersecondary structures can enrich a structural analysis. This should potentially help in protein engineering, especially in the development of biologics and immunoengineering.
Collapse
|
5
|
Rogers DW, McConnell E, Miller EL, Greig D. Diminishing Returns on Intragenic Repeat Number Expansion in the Production of Signaling Peptides. Mol Biol Evol 2017; 34:3176-3185. [PMID: 28961820 PMCID: PMC5850478 DOI: 10.1093/molbev/msx243] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Signaling peptides enable communication between cells, both within and between individuals, and are therefore key to the control of complex physiological and behavioral responses. Since their small sizes prevent direct transmission to secretory pathways, these peptides are often produced as part of a larger polyprotein comprising precursors for multiple related or identical peptides; the physiological and behavioral consequences of this unusual gene structure are not understood. Here, we show that the number of mature-pheromone-encoding repeats in the yeast α-mating-factor gene MFα1 varies considerably between closely related isolates of both Saccharomyces cerevisiae and its sister species Saccharomyces paradoxus. Variation in repeat number has important phenotypic consequences: Increasing repeat number caused higher pheromone production and greater competitive mating success. However, the magnitude of the improvement decreased with increasing repeat number such that repeat amplification beyond that observed in natural isolates failed to generate more pheromone, and could actually reduce sexual fitness. We investigate multiple explanations for this pattern of diminishing returns and find that our results are most consistent with a translational trade-off: Increasing the number of encoded repeats results in more mature pheromone per translation event, but also generates longer transcripts thereby reducing the rate of translation—a phenomenon known as length-dependent translation. Length-dependent translation may be a powerful constraint on the evolution of genes encoding repetitive or modular proteins, with important physiological and behavioral consequences across eukaryotes.
Collapse
Affiliation(s)
- David W Rogers
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Ellen McConnell
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Eric L Miller
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Veterinary Medicine, Cambridge Veterinary School, University of Cambridge, Cambridge, United Kingdom
| | - Duncan Greig
- Experimental Evolution Research Group, Max Planck Institute for Evolutionary Biology, Plön, Germany.,Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| |
Collapse
|
6
|
Parallel-SymD: A Parallel Approach to Detect Internal Symmetry in Protein Domains. BIOMED RESEARCH INTERNATIONAL 2016; 2016:4628592. [PMID: 27747230 PMCID: PMC5056246 DOI: 10.1155/2016/4628592] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 08/25/2016] [Indexed: 11/24/2022]
Abstract
Internally symmetric proteins are proteins that have a symmetrical structure in their monomeric single-chain form. Around 10–15% of the protein domains can be regarded as having some sort of internal symmetry. In this regard, we previously published SymD (symmetry detection), an algorithm that determines whether a given protein structure has internal symmetry by attempting to align the protein to its own copy after the copy is circularly permuted by all possible numbers of residues. SymD has proven to be a useful algorithm to detect symmetry. In this paper, we present a new parallelized algorithm called Parallel-SymD for detecting symmetry of proteins on clusters of computers. The achieved speedup of the new Parallel-SymD algorithm scales well with the number of computing processors. Scaling is better for proteins with a larger number of residues. For a protein of 509 residues, a speedup of 63 was achieved on a parallel system with 100 processors.
Collapse
|
7
|
Beyrakhova KA, van Straaten K, Li L, Boniecki MT, Anderson DH, Cygler M. Structural and Functional Investigations of the Effector Protein LpiR1 from Legionella pneumophila. J Biol Chem 2016; 291:15767-77. [PMID: 27226543 DOI: 10.1074/jbc.m115.708701] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2015] [Indexed: 01/14/2023] Open
Abstract
Legionella pneumophila is a causative agent of a severe pneumonia, known as Legionnaires' disease. Legionella pathogenicity is mediated by specific virulence factors, called bacterial effectors, which are injected into the invaded host cell by the bacterial type IV secretion system. Bacterial effectors are involved in complex interactions with the components of the host cell immune and signaling pathways, which eventually lead to bacterial survival and replication inside the mammalian cell. Structural and functional studies of bacterial effectors are, therefore, crucial for elucidating the mechanisms of Legionella virulence. Here we describe the crystal structure of the LpiR1 (Lpg0634) effector protein and investigate the effects of its overexpression in mammalian cells. LpiR1 is an α-helical protein that consists of two similar domains aligned in an antiparallel fashion. The hydrophilic cleft between the domains might serve as a binding site for a potential host cell interaction partner. LpiR1 binds the phosphate group at a conserved site and is stabilized by Mn(2+), Ca(2+), or Mg(2+) ions. When overexpressed in mammalian cells, a GFP-LpiR1 fusion protein is localized in the cytoplasm. Intracellular signaling antibody array analysis revealed small changes in the phosphorylation state of several components of the Akt signaling pathway in HEK293T cells overexpressing LpiR1.
Collapse
Affiliation(s)
| | | | - Lei Li
- From the Department of Biochemistry and
| | | | - Deborah H Anderson
- Cancer Research, Saskatchewan Cancer Agency, University of Saskatchewan, Saskatoon, Saskatchewan S7N 5E5, Canada
| | | |
Collapse
|
8
|
Abstract
Symmetry is a common feature among natural systems, including protein structures. A strong propensity toward symmetric architectures has long been recognized for water-soluble proteins, and this propensity has been rationalized from an evolutionary standpoint. Proteins residing in cellular membranes, however, have traditionally been less amenable to structural studies, and thus the prevalence and significance of symmetry in this important class of molecules is not as well understood. In the past two decades, researchers have made great strides in this area, and these advances have provided exciting insights into the range of architectures adopted by membrane proteins. These structural studies have revealed a similarly strong bias toward symmetric arrangements, which were often unexpected and which occurred despite the restrictions imposed by the membrane environment on the possible symmetry groups. Moreover, membrane proteins disproportionately contain internal structural repeats resulting from duplication and fusion of smaller segments. This article discusses the types and origins of symmetry in membrane proteins and the implications of symmetry for protein function.
Collapse
Affiliation(s)
- Lucy R Forrest
- Computational Structural Biology Group, Porter Neuroscience Center, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, Maryland 20852;
| |
Collapse
|
9
|
Cahn JKB, Brinkmann-Chen S, Buller AR, Arnold FH. Artificial domain duplication replicates evolutionary history of ketol-acid reductoisomerases. Protein Sci 2015; 25:1241-8. [PMID: 26644020 DOI: 10.1002/pro.2852] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2015] [Accepted: 12/01/2015] [Indexed: 11/11/2022]
Abstract
The duplication of protein structural domains has been proposed as a common mechanism for the generation of new protein folds. A particularly interesting case is the class II ketol-acid reductoisomerase (KARI), which putatively arose from an ancestral class I KARI by duplication of the C-terminal domain and corresponding loss of obligate dimerization. As a result, the class II enzymes acquired a deeply embedded figure-of-eight knot. To test this evolutionary hypothesis we constructed a novel class II KARI by duplicating the C-terminal domain of a hyperthermostable class I KARI. The new protein is monomeric, as confirmed by gel filtration and X-ray crystallography, and has the deeply knotted class II KARI fold. Surprisingly, its catalytic activity is nearly unchanged from the parent KARI. This provides strong evidence in support of domain duplication as the mechanism for the evolution of the class II KARI fold and demonstrates the ability of domain duplication to generate topological novelty in a function-neutral manner.
Collapse
Affiliation(s)
- Jackson K B Cahn
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, 91125
| | - Sabine Brinkmann-Chen
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, 91125
| | - Andrew R Buller
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, 91125
| | - Frances H Arnold
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, 91125
| |
Collapse
|
10
|
Banach M, Prudhomme N, Carpentier M, Duprat E, Papandreou N, Kalinowska B, Chomilier J, Roterman I. Contribution to the prediction of the fold code: application to immunoglobulin and flavodoxin cases. PLoS One 2015; 10:e0125098. [PMID: 25915049 PMCID: PMC4411048 DOI: 10.1371/journal.pone.0125098] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 03/20/2015] [Indexed: 12/19/2022] Open
Abstract
Background Folding nucleus of globular proteins formation starts by the mutual interaction of a group of hydrophobic amino acids whose close contacts allow subsequent formation and stability of the 3D structure. These early steps can be predicted by simulation of the folding process through a Monte Carlo (MC) coarse grain model in a discrete space. We previously defined MIRs (Most Interacting Residues), as the set of residues presenting a large number of non-covalent neighbour interactions during such simulation. MIRs are good candidates to define the minimal number of residues giving rise to a given fold instead of another one, although their proportion is rather high, typically [15-20]% of the sequences. Having in mind experiments with two sequences of very high levels of sequence identity (up to 90%) but different folds, we combined the MIR method, which takes sequence as single input, with the “fuzzy oil drop” (FOD) model that requires a 3D structure, in order to estimate the residues coding for the fold. FOD assumes that a globular protein follows an idealised 3D Gaussian distribution of hydrophobicity density, with the maximum in the centre and minima at the surface of the “drop”. If the actual local density of hydrophobicity around a given amino acid is as high as the ideal one, then this amino acid is assigned to the core of the globular protein, and it is assumed to follow the FOD model. Therefore one obtains a distribution of the amino acids of a protein according to their agreement or rejection with the FOD model. Results We compared and combined MIR and FOD methods to define the minimal nucleus, or keystone, of two populated folds: immunoglobulin-like (Ig) and flavodoxins (Flav). The combination of these two approaches defines some positions both predicted as a MIR and assigned as accordant with the FOD model. It is shown here that for these two folds, the intersection of the predicted sets of residues significantly differs from random selection. It reduces the number of selected residues by each individual method and allows a reasonable agreement with experimentally determined key residues coding for the particular fold. In addition, the intersection of the two methods significantly increases the specificity of the prediction, providing a robust set of residues that constitute the folding nucleus.
Collapse
Affiliation(s)
- Mateusz Banach
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Krakow, Poland
| | - Nicolas Prudhomme
- Protein Structure Prediction group, IMPMC, UPMC & CNRS, Paris, France
| | - Mathilde Carpentier
- Protein Structure Prediction group, IMPMC, UPMC & CNRS, Paris, France
- RPBS, 35 rue Hélène Brion, 75013, Paris, France
| | - Elodie Duprat
- Protein Structure Prediction group, IMPMC, UPMC & CNRS, Paris, France
- RPBS, 35 rue Hélène Brion, 75013, Paris, France
| | - Nikolaos Papandreou
- Genetics Department, Agricultural University of Athens, Iera Odos 75, Athens, Greece
| | - Barbara Kalinowska
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Krakow, Poland
| | - Jacques Chomilier
- Protein Structure Prediction group, IMPMC, UPMC & CNRS, Paris, France
- RPBS, 35 rue Hélène Brion, 75013, Paris, France
- * E-mail: (JC); (IR)
| | - Irena Roterman
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, Krakow, Poland
- * E-mail: (JC); (IR)
| |
Collapse
|
11
|
Tai CH, Paul R, Dukka KC, Shilling JD, Lee B. SymD webserver: a platform for detecting internally symmetric protein structures. Nucleic Acids Res 2014; 42:W296-300. [PMID: 24799435 PMCID: PMC4086132 DOI: 10.1093/nar/gku364] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Internal symmetry of a protein structure is the pseudo-symmetry that a single protein chain sometimes exhibits. This is in contrast to the symmetry with which monomers are arranged in many multimeric protein complexes. SymD is a program that detects proteins with internal symmetry. It proved to be useful for analyzing protein structure, function and modeling. This web-based interactive tool was developed by implementing the SymD algorithm. To the best of our knowledge, SymD webserver is the first tool of its kind with which users can easily study the symmetry of the protein they are interested in by uploading the structure or retrieving it from databases. It uses the Galaxy platform to take advantage of its extensibility and displays the symmetry properties, the symmetry axis and the sequence alignment of the structures before and after the symmetry transformation via an interactive graphical visualization environment in any modern web browser. An Example Run video displays the workflow to help users navigate. SymD webserver is publicly available at http://symd.nci.nih.gov.
Collapse
Affiliation(s)
- Chin-Hsien Tai
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Rohit Paul
- Office of Information Technology, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | | | - Jeffery D Shilling
- Office of Information Technology, National Cancer Institute, National Institutes of Health, Rockville, MD 20850, USA
| | - Byungkook Lee
- Laboratory of Molecular Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
12
|
Myers-Turnbull D, Bliven SE, Rose PW, Aziz ZK, Youkharibache P, Bourne PE, Prlić A. Systematic detection of internal symmetry in proteins using CE-Symm. J Mol Biol 2014; 426:2255-68. [PMID: 24681267 DOI: 10.1016/j.jmb.2014.03.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 11/26/2022]
Abstract
Symmetry is an important feature of protein tertiary and quaternary structures that has been associated with protein folding, function, evolution, and stability. Its emergence and ensuing prevalence has been attributed to gene duplications, fusion events, and subsequent evolutionary drift in sequence. This process maintains structural similarity and is further supported by this study. To further investigate the question of how internal symmetry evolved, how symmetry and function are related, and the overall frequency of internal symmetry, we developed an algorithm, CE-Symm, to detect pseudo-symmetry within the tertiary structure of protein chains. Using a large manually curated benchmark of 1007 protein domains, we show that CE-Symm performs significantly better than previous approaches. We use CE-Symm to build a census of symmetry among domain superfamilies in SCOP and note that 18% of all superfamilies are pseudo-symmetric. Our results indicate that more domains are pseudo-symmetric than previously estimated. We establish a number of recurring types of symmetry-function relationships and describe several characteristic cases in detail. With the use of the Enzyme Commission classification, symmetry was found to be enriched in some enzyme classes but depleted in others. CE-Symm thus provides a methodology for a more complete and detailed study of the role of symmetry in tertiary protein structure [availability: CE-Symm can be run from the Web at http://source.rcsb.org/jfatcatserver/symmetry.jsp. Source code and software binaries are also available under the GNU Lesser General Public License (version 2.1) at https://github.com/rcsb/symmetry. An interactive census of domains identified as symmetric by CE-Symm is available from http://source.rcsb.org/jfatcatserver/scopResults.jsp].
Collapse
Affiliation(s)
- Douglas Myers-Turnbull
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Spencer E Bliven
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Peter W Rose
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA
| | - Zaid K Aziz
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA 92093, USA
| | | | - Philip E Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA.
| | - Andreas Prlić
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
13
|
Schaper E, Gascuel O, Anisimova M. Deep conservation of human protein tandem repeats within the eukaryotes. Mol Biol Evol 2014; 31:1132-48. [PMID: 24497029 PMCID: PMC3995336 DOI: 10.1093/molbev/msu062] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Tandem repeats (TRs) are a major element of protein sequences in all domains of life. They are particularly abundant in mammals, where by conservative estimates one in three proteins contain a TR. High generation-scale duplication and deletion rates were reported for nucleic TR units. However, it is not known whether protein TR units can also be frequently lost or gained providing a source of variation for rapid adaptation of protein function, or alternatively, tend to have conserved TR unit configurations over long evolutionary times. To obtain a systematic picture, we performed a proteome-wide analysis of the mode of evolution for human protein TRs. For this purpose, we propose a novel method for the detection of orthologous TRs based on circular profile hidden Markov models. For all detected TRs, we reconstructed bispecies TR unit phylogenies across 61 eukaryotes ranging from human to yeast. Moreover, we performed additional analyses to correlate functional and structural annotations of human TRs with their mode of evolution. Surprisingly, we find that the vast majority of human TRs are ancient, with TR unit number and order preserved intact since distant speciation events. For example, ≥61% of all human TRs have been strongly conserved at least since the root of all mammals, approximately 300 Ma. Further, we find no human protein TR that shows evidence for strong recent duplications and deletions. The results are in contrast to the high generation-scale mutability of nucleic TRs. Presumably, most protein TRs fold into stable and conserved structures that are indispensable for the function of the TR-containing protein. All of our data and results are available for download from http://www.atgc-montpellier.fr/TRE.
Collapse
Affiliation(s)
- Elke Schaper
- Department of Computer Science, ETH Zürich, Zürich, Switzerland
| | | | | |
Collapse
|
14
|
Khanchezar A, Béven L, Izadpanah K, Salehi M, Saillard C. Spiralin diversity within Iranian strains of Spiroplasma citri. Curr Microbiol 2013; 68:96-104. [PMID: 23995776 DOI: 10.1007/s00284-013-0437-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 07/15/2013] [Indexed: 10/26/2022]
Abstract
The first-cultured and most-studied spiroplasma is Spiroplasma citri, the causal agent of citrus stubborn disease, one of the three plant-pathogenic, sieve-tube-restricted, and leafhopper vector-transmitted mollicutes. In Iranian Fars province, S. citri cultures were obtained from stubborn affected citrus trees, sesame and safflower plants, and from the leafhopper vector Circulifer haematoceps. Spiralin gene sequences from different S. citri isolates were amplified by PCR, cloned, and sequenced. Phylogenetic trees based on spiralin gene sequence showed diversity and indicated the presence of three clusters among the S. citri strains. Comparison of the amino acid sequences of eleven spiralins from Iranian strains and those from the reference S. citri strain GII-3 (241 aa), Palmyre strain (242 aa), Spiroplasma kunkelii (240 aa), and Spiroplasma phoeniceum (237 aa) confirmed the conservation of general features of the protein. However, the spiralin of an S. citri isolate named Shiraz I comprised 346 amino acids and showed a large duplication of the region comprised between two short repeats previously identified in S. citri spiralins. We report in this paper the spiralin diversity in Spiroplasma strains from southern Iran and for the first time a partial internal duplication of the spiralin gene.
Collapse
Affiliation(s)
- Amin Khanchezar
- Plant Virology Research Centre (PVRC), College of Agriculture, Shiraz University, Shiraz, Iran
| | | | | | | | | |
Collapse
|
15
|
Swapna LS, Srikeerthana K, Srinivasan N. Extent of structural asymmetry in homodimeric proteins: prevalence and relevance. PLoS One 2012; 7:e36688. [PMID: 22629324 PMCID: PMC3358323 DOI: 10.1371/journal.pone.0036688] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Accepted: 04/11/2012] [Indexed: 11/21/2022] Open
Abstract
Most homodimeric proteins have symmetric structure. Although symmetry is known to confer structural and functional advantage, asymmetric organization is also observed. Using a non-redundant dataset of 223 high-resolution crystal structures of biologically relevant homodimers, we address questions on the prevalence and significance of asymmetry. We used two measures to quantify global and interface asymmetry, and assess the correlation of several molecular and structural parameters with asymmetry. We have identified rare cases (11/223) of biologically relevant homodimers with pronounced global asymmetry. Asymmetry serves as a means to bring about 2∶1 binding between the homodimer and another molecule; it also enables cellular signalling arising from asymmetric macromolecular ligands such as DNA. Analysis of these cases reveals two possible mechanisms by which possible infinite array formation is prevented. In case of homodimers associating via non-topologically equivalent surfaces in their tertiary structures, ligand-dependent mechanisms are used. For stable dimers binding via large surfaces, ligand-dependent structural change regulates polymerisation/depolymerisation; for unstable dimers binding via smaller surfaces that are not evolutionarily well conserved, dimerisation occurs only in the presence of the ligand. In case of homodimers associating via interaction surfaces with parts of the surfaces topologically equivalent in the tertiary structures, steric hindrance serves as the preventive mechanism of infinite array. We also find that homodimers exhibiting grossly symmetric organization rarely exhibit either perfect local symmetry or high local asymmetry. Binding of small ligands at the interface does not cause any significant variation in interface asymmetry. However, identification of biologically relevant interface asymmetry in grossly symmetric homodimers is confounded by the presence of similar small magnitude changes caused due to artefacts of crystallisation. Our study provides new insights regarding accommodation of asymmetry in homodimers.
Collapse
|
16
|
Szilágyi A, Zhang Y, Závodszky P. Intra-chain 3D segment swapping spawns the evolution of new multidomain protein architectures. J Mol Biol 2011; 415:221-35. [PMID: 22079367 DOI: 10.1016/j.jmb.2011.10.045] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Revised: 10/07/2011] [Accepted: 10/27/2011] [Indexed: 10/15/2022]
Abstract
Multidomain proteins form in evolution through the concatenation of domains, but structural domains may comprise multiple segments of the chain. In this work, we demonstrate that new multidomain architectures can evolve by an apparent three-dimensional swap of segments between structurally similar domains within a single-chain monomer. By a comprehensive structural search of the current Protein Data Bank (PDB), we identified 32 well-defined segment-swapped proteins (SSPs) belonging to 18 structural families. Nearly 13% of all multidomain proteins in the PDB may have a segment-swapped evolutionary precursor as estimated by more permissive searching criteria. The formation of SSPs can be explained by two principal evolutionary mechanisms: (i) domain swapping and fusion (DSF) and (ii) circular permutation (CP). By large-scale comparative analyses using structural alignment and hidden Markov model methods, it was found that the majority of SSPs have evolved via the DSF mechanism, and a much smaller fraction, via CP. Functional analyses further revealed that segment swapping, which results in two linkers connecting the domains, may impart directed flexibility to multidomain proteins and contributes to the development of new functions. Thus, inter-domain segment swapping represents a novel general mechanism by which new protein folds and multidomain architectures arise in evolution, and SSPs have structural and functional properties that make them worth defining as a separate group.
Collapse
Affiliation(s)
- András Szilágyi
- Institute of Enzymology, Hungarian Academy of Sciences, Karolina út 29, H-1113 Budapest, Hungary
| | | | | |
Collapse
|
17
|
Yadid I, Tawfik DS. Functional β-propeller lectins by tandem duplications of repetitive units. Protein Eng Des Sel 2010; 24:185-95. [DOI: 10.1093/protein/gzq053] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
18
|
Detecting internally symmetric protein structures. BMC Bioinformatics 2010; 11:303. [PMID: 20525292 PMCID: PMC2894822 DOI: 10.1186/1471-2105-11-303] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2010] [Accepted: 06/03/2010] [Indexed: 11/30/2022] Open
Abstract
Background Many functional proteins have a symmetric structure. Most of these are multimeric complexes, which are made of non-symmetric monomers arranged in a symmetric manner. However, there are also a large number of proteins that have a symmetric structure in the monomeric state. These internally symmetric proteins are interesting objects from the point of view of their folding, function, and evolution. Most algorithms that detect the internally symmetric proteins depend on finding repeating units of similar structure and do not use the symmetry information. Results We describe a new method, called SymD, for detecting symmetric protein structures. The SymD procedure works by comparing the structure to its own copy after the copy is circularly permuted by all possible number of residues. The procedure is relatively insensitive to symmetry-breaking insertions and deletions and amplifies positive signals from symmetry. It finds 70% to 80% of the TIM barrel fold domains in the ASTRAL 40 domain database and 100% of the beta-propellers as symmetric. More globally, 10% to 15% of the proteins in the ASTRAL 40 domain database may be considered symmetric according to this procedure depending on the precise cutoff value used to measure the degree of perfection of the symmetry. Symmetrical proteins occur in all structural classes and can have a closed, circular structure, a cylindrical barrel-like structure, or an open, helical structure. Conclusions SymD is a sensitive procedure for detecting internally symmetric protein structures. Using this procedure, we estimate that 10% to 15% of the known protein domains may be considered symmetric. We also report an initial, overall view of the types of symmetries and symmetric folds that occur in the protein domain structure universe.
Collapse
|