1
|
Panja AS, Maiti S, Bandyopadhyay B. Protein stability governed by its structural plasticity is inferred by physicochemical factors and salt bridges. Sci Rep 2020; 10:1822. [PMID: 32020026 PMCID: PMC7000726 DOI: 10.1038/s41598-020-58825-7] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 01/21/2020] [Indexed: 12/02/2022] Open
Abstract
Several organisms, specifically microorganisms survive in a wide range of harsh environments including extreme temperature, pH, and salt concentration. We analyzed systematically a large number of protein sequences with their structures to understand their stability and to discriminate extremophilic proteins from their non-extremophilic orthologs. Our results highlighted that the strategy for the packing of the protein core was influenced by the environmental stresses through substitutive structural events through better ionic interaction. Statistical analysis showed that a significant difference in number and composition of amino acid exist among them. The negative correlation of pairwise sequence alignments and structural alignments indicated that most of the extremophile and non-extremophile proteins didn’t contain any association for maintaining their functional stability. A significant numbers of salt bridges were noticed on the surface of the extremostable proteins. The Ramachandran plot data represented more occurrences of amino acids being present in helix and sheet regions of extremostable proteins. We also found that a significant number of small nonpolar amino acids and moderate number of charged amino acids like Arginine and Aspartic acid represented more nonplanar Omega angles in their peptide bond. Thus, extreme conditions may predispose amino acid composition including geometric variability for molecular adaptation of extremostable proteins against atmospheric variations and associated changes under natural selection pressure. The variation of amino acid composition and structural diversifications in proteins play a major role in evolutionary adaptation to mitigate climate change.
Collapse
Affiliation(s)
- Anindya S Panja
- Post Graduate Department of Biotechnology, Molecular informatics Laboratory, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, West Bengal, India.
| | - Smarajit Maiti
- Post Graduate Department of Biochemistry and Biotechnology, Cell and Molecular Therapeutics Laboratory, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, West Bengal, India
| | - Bidyut Bandyopadhyay
- Post Graduate Department of Biotechnology, Molecular informatics Laboratory, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, West Bengal, India
| |
Collapse
|
2
|
Dall E, Hollerweger JC, Dahms SO, Cui H, Häussermann K, Brandstetter H. Structural and functional analysis of cystatin E reveals enzymologically relevant dimer and amyloid fibril states. J Biol Chem 2018; 293:13151-13165. [PMID: 29967063 PMCID: PMC6109925 DOI: 10.1074/jbc.ra118.002154] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 06/26/2018] [Indexed: 12/26/2022] Open
Abstract
Protein activity is often regulated by altering the oligomerization state. One mechanism of multimerization involves domain swapping, wherein proteins exchange parts of their structures and thereby form long-lived dimers or multimers. Domain swapping has been specifically observed in amyloidogenic proteins, for example the cystatin superfamily of cysteine protease inhibitors. Cystatins are twin-headed inhibitors, simultaneously targeting the lysosomal cathepsins and legumain, with important roles in cancer progression and Alzheimer's disease. Although cystatin E is the most potent legumain inhibitor identified so far, nothing is known about its propensity to oligomerize. In this study, we show that conformational destabilization of cystatin E leads to the formation of a domain-swapped dimer with increased conformational stability. This dimer was active as a legumain inhibitor by forming a trimeric complex. By contrast, the binding sites toward papain-like proteases were buried within the cystatin E dimer. We also showed that the dimers could further convert to amyloid fibrils. Unexpectedly, cystatin E amyloid fibrils contained functional protein, which inhibited both legumain and papain-like enzymes. Fibril formation was further regulated by glycosylation. We speculate that cystatin amyloid fibrils might serve as a binding platform to stabilize the pH-sensitive legumain and cathepsins in the extracellular environment, contributing to their physiological and pathological functions.
Collapse
Affiliation(s)
- Elfriede Dall
- From the Department of Biosciences, University of Salzburg, A-5020 Salzburg, Austria and
| | - Julia C Hollerweger
- From the Department of Biosciences, University of Salzburg, A-5020 Salzburg, Austria and
| | - Sven O Dahms
- From the Department of Biosciences, University of Salzburg, A-5020 Salzburg, Austria and
| | - Haissi Cui
- the Center for Integrated Protein Science Munich, Technical University of Munich, D-85748 Munich, Germany
| | - Katharina Häussermann
- the Center for Integrated Protein Science Munich, Technical University of Munich, D-85748 Munich, Germany
| | - Hans Brandstetter
- From the Department of Biosciences, University of Salzburg, A-5020 Salzburg, Austria and
| |
Collapse
|
3
|
Raanan H, Pike DH, Moore EK, Falkowski PG, Nanda V. Modular origins of biological electron transfer chains. Proc Natl Acad Sci U S A 2018; 115:1280-1285. [PMID: 29358375 PMCID: PMC5819401 DOI: 10.1073/pnas.1714225115] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Oxidoreductases catalyze electron transfer reactions that ultimately provide the energy for life. A limited set of ancestral protein-metal modules are presumably the building blocks that evolved into this diverse protein family. However, the identity of these modules and their path to modern oxidoreductases is unknown. Using a comparative structural analysis approach, we identify a set of fundamental electron transfer modules that have evolved to form the extant oxidoreductases. Using transition metal-containing cofactors as fiducial markers, it is possible to cluster cofactor microenvironments into as few as four major modules: bacterial ferredoxin, cytochrome c, symerythrin, and plastocyanin-type folds. From structural alignments, it is challenging to ascertain whether modules evolved from a single common ancestor (homology) or arose by independent convergence on a limited set of structural forms (analogy). Additional insight into common origins is contained in the spatial adjacency network (SPAN), which is based on proximity of modules in oxidoreductases containing multiple cofactor electron transfer chains. Electron transfer chains within complex modern oxidoreductases likely evolved through repeated duplication and diversification of ancient modular units that arose in the Archean eon.
Collapse
Affiliation(s)
- Hagai Raanan
- Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ 08854
| | - Douglas H Pike
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ 08854
| | - Eli K Moore
- Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901
| | - Paul G Falkowski
- Environmental Biophysics and Molecular Ecology Program, Department of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ 08901;
- Department of Earth and Planetary Sciences, Rutgers University, New Brunswick, NJ 08901
| | - Vikas Nanda
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ 08854;
- Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ 08854
| |
Collapse
|
4
|
Abstract
Structural domains are believed to be modules within proteins that can fold and function independently. Some proteins show tandem repetitions of apparent modular structure that do not fold independently, but rather co-operate in stabilizing structural forms that comprise several repeat-units. For many natural repeat-proteins, it has been shown that weak energetic links between repeats lead to the breakdown of co-operativity and the appearance of folding sub-domains within an apparently regular repeat array. The quasi-1D architecture of repeat-proteins is crucial in detailing how the local energetic balances can modulate the folding dynamics of these proteins, which can be related to the physiological behaviour of these ubiquitous biological systems.
Collapse
|
5
|
Gutiérrez FI, Rodriguez-Valenzuela F, Ibarra IL, Devos DP, Melo F. Efficient and automated large-scale detection of structural relationships in proteins with a flexible aligner. BMC Bioinformatics 2016; 17:20. [PMID: 26732380 PMCID: PMC4702403 DOI: 10.1186/s12859-015-0866-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 12/21/2015] [Indexed: 12/01/2022] Open
Abstract
Background The total number of known three-dimensional protein structures is rapidly increasing. Consequently, the need for fast structural search against complete databases without a significant loss of accuracy is increasingly demanding. Recently, TopSearch, an ultra-fast method for finding rigid structural relationships between a query structure and the complete Protein Data Bank (PDB), at the multi-chain level, has been released. However, comparable accurate flexible structural aligners to perform efficient whole database searches of multi-domain proteins are not yet available. The availability of such a tool is critical for a sustainable boosting of biological discovery. Results Here we report on the development of a new method for the fast and flexible comparison of protein structure chains. The method relies on the calculation of 2D matrices containing a description of the three-dimensional arrangement of secondary structure elements (angles and distances). The comparison involves the matching of an ensemble of substructures through a nested-two-steps dynamic programming algorithm. The unique features of this new approach are the integration and trade-off balancing of the following: 1) speed, 2) accuracy and 3) global and semiglobal flexible structure alignment by integration of local substructure matching. The comparison, and matching with competitive accuracy, of one medium sized (250-aa) query structure against the complete PDB database (216,322 protein chains) takes about 8 min using an average desktop computer. The method is at least 2–3 orders of magnitude faster than other tested tools with similar accuracy. We validate the performance of the method for fold and superfamily assignment in a large benchmark set of protein structures. We finally provide a series of examples to illustrate the usefulness of this method and its application in biological discovery. Conclusions The method is able to detect partial structure matching, rigid body shifts, conformational changes and tolerates substantial structural variation arising from insertions, deletions and sequence divergence, as well as structural convergence of unrelated proteins. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0866-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Fernando I Gutiérrez
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile.,Centre for Organismal Studies (COS), Heidelberg University, Heidelberg, Germany
| | - Felipe Rodriguez-Valenzuela
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile
| | - Ignacio L Ibarra
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile.,Centro Andaluz de Biología del Desarrollo (CABD), Universidad Pablo de Olavide, Sevilla, Spain
| | - Damien P Devos
- Centre for Organismal Studies (COS), Heidelberg University, Heidelberg, Germany. .,Centro Andaluz de Biología del Desarrollo (CABD), Universidad Pablo de Olavide, Sevilla, Spain.
| | - Francisco Melo
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile.
| |
Collapse
|
6
|
Parra RG, Espada R, Verstraete N, Ferreiro DU. Structural and Energetic Characterization of the Ankyrin Repeat Protein Family. PLoS Comput Biol 2015; 11:e1004659. [PMID: 26691182 PMCID: PMC4687027 DOI: 10.1371/journal.pcbi.1004659] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 11/10/2015] [Indexed: 11/21/2022] Open
Abstract
Ankyrin repeat containing proteins are one of the most abundant solenoid folds. Usually implicated in specific protein-protein interactions, these proteins are readily amenable for design, with promising biotechnological and biomedical applications. Studying repeat protein families presents technical challenges due to the high sequence divergence among the repeating units. We developed and applied a systematic method to consistently identify and annotate the structural repetitions over the members of the complete Ankyrin Repeat Protein Family, with increased sensitivity over previous studies. We statistically characterized the number of repeats, the folding of the repeat-arrays, their structural variations, insertions and deletions. An energetic analysis of the local frustration patterns reveal the basic features underlying fold stability and its relation to the functional binding regions. We found a strong linear correlation between the conservation of the energetic features in the repeat arrays and their sequence variations, and discuss new insights into the organization and function of these ubiquitous proteins. Some natural proteins are formed with repetitions of similar amino acid stretches. Ankyrin-repeat proteins constitute one of the most abundant families of this class of proteins that serve as model systems to analyze how variations in sequences exert effects in structures and biological functions. We present an in-depth analysis of the ankyrin repeat protein family, characterizing the variations in the repeating arrays both at the structural and energetic level. We introduce a consistent annotation for the repeat characteristics and describe how the structural differences are related to the sequences by their underlying energetic signatures.
Collapse
Affiliation(s)
- R. Gonzalo Parra
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Rocío Espada
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Nina Verstraete
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
| | - Diego U. Ferreiro
- Protein Physiology Lab, Dep de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN, Buenos Aires, Argentina
- * E-mail:
| |
Collapse
|
7
|
Pellegrini M. Tandem Repeats in Proteins: Prediction Algorithms and Biological Role. Front Bioeng Biotechnol 2015; 3:143. [PMID: 26442257 PMCID: PMC4585158 DOI: 10.3389/fbioe.2015.00143] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 09/07/2015] [Indexed: 12/30/2022] Open
Abstract
Tandem repetitions in protein sequence and structure is a fascinating subject of research which has been a focus of study since the late 1990s. In this survey, we give an overview on the multi-faceted aspects of research on protein tandem repeats (PTR for short), including prediction algorithms, databases, early classification efforts, mechanisms of PTR formation and evolution, and synthetic PTR design. We also touch on the rather open issue of the relationship between PTR and flexibility (or disorder) in proteins. Detection of PTR either from protein sequence or structure data is challenging due to inherent high (biological) signal-to-noise ratio that is a key feature of this problem. As early in silico analytic tools have been key enablers for starting this field of study, we expect that current and future algorithmic and statistical breakthroughs will have a high impact on the investigations of the biological role of PTR.
Collapse
Affiliation(s)
- Marco Pellegrini
- Laboratory for Integrative Systems Medicine (LISM), Istituto di Informatica e Telematica, and Istituto di Fisiologia Clinica, Consiglio Nazionale delle Ricerche , Pisa , Italy
| |
Collapse
|
8
|
Machine Learnable Fold Space Representation based on Residue Cluster Classes. Comput Biol Chem 2015; 59 Pt A:1-7. [PMID: 26366526 DOI: 10.1016/j.compbiolchem.2015.07.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 07/17/2015] [Accepted: 07/25/2015] [Indexed: 11/21/2022]
Abstract
MOTIVATION Protein fold space is a conceptual framework where all possible protein folds exist and ideas about protein structure, function and evolution may be analyzed. Classification of protein folds in this space is commonly achieved by using similarity indexes and/or machine learning approaches, each with different limitations. RESULTS We propose a method for constructing a compact vector space model of protein fold space by representing each protein structure by its residues local contacts. We developed an efficient method to statistically test for the separability of points in a space and showed that our protein fold space representation is learnable by any machine-learning algorithm. AVAILABILITY An API is freely available at https://code.google.com/p/pyrcc/.
Collapse
|
9
|
Schafferhans A, Rost B. Taking structure searches to the next dimension. Structure 2015; 22:938-9. [PMID: 25007224 DOI: 10.1016/j.str.2014.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Structure comparisons are now the first step when a new experimental high-resolution protein structure has been determined. In this issue of Structure, Wiederstein and colleagues describe their latest tool for comparing structures, which gives us the unprecedented power to discover crucial structural connections between whole complexes of proteins in the full structural database in real time.
Collapse
Affiliation(s)
- Andrea Schafferhans
- Department of Informatics, Bioinformatics-I12, TUM, Boltzmannstrasse 3, 85748 Garching/Munich, Germany.
| | - Burkhard Rost
- Department of Informatics, Bioinformatics-I12, TUM, Boltzmannstrasse 3, 85748 Garching/Munich, Germany
| |
Collapse
|
10
|
Ben-Tal N, Kolodny R. Representation of the Protein Universe using Classifications, Maps, and Networks. Isr J Chem 2014. [DOI: 10.1002/ijch.201400001] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
11
|
Wiederstein M, Gruber M, Frank K, Melo F, Sippl MJ. Structure-based characterization of multiprotein complexes. Structure 2014; 22:1063-70. [PMID: 24954616 PMCID: PMC4087271 DOI: 10.1016/j.str.2014.05.005] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/02/2014] [Accepted: 05/05/2014] [Indexed: 01/22/2023]
Abstract
Multiprotein complexes govern virtually all cellular processes. Their 3D structures provide important clues to their biological roles, especially through structural correlations among protein molecules and complexes. The detection of such correlations generally requires comprehensive searches in databases of known protein structures by means of appropriate structure-matching techniques. Here, we present a high-speed structure search engine capable of instantly matching large protein oligomers against the complete and up-to-date database of biologically functional assemblies of protein molecules. We use this tool to reveal unseen structural correlations on the level of protein quaternary structure and demonstrate its general usefulness for efficiently exploring complex structural relationships among known protein assemblies.
Collapse
Affiliation(s)
- Markus Wiederstein
- Division of Structural Biology & Bioinformatics, Department of Molecular Biology, University of Salzburg, Hellbrunnerstraße 34, 5020 Salzburg, Austria.
| | - Markus Gruber
- Division of Structural Biology & Bioinformatics, Department of Molecular Biology, University of Salzburg, Hellbrunnerstraße 34, 5020 Salzburg, Austria
| | - Karl Frank
- Division of Structural Biology & Bioinformatics, Department of Molecular Biology, University of Salzburg, Hellbrunnerstraße 34, 5020 Salzburg, Austria
| | - Francisco Melo
- Departamento de Genetica Molecular y Microbiologia, Facultad de Ciencias Biologicas, Pontificia Universidad Catolica de Chile, Alameda 340, 8320000 Santiago, Chile; Molecular Bioinformatics Laboratory, Millennium Institute on Immunology and Immunotherapy, 8320000 Santiago, Chile
| | - Manfred J Sippl
- Division of Structural Biology & Bioinformatics, Department of Molecular Biology, University of Salzburg, Hellbrunnerstraße 34, 5020 Salzburg, Austria
| |
Collapse
|
12
|
Senn S, Nanda V, Falkowski P, Bromberg Y. Function-based assessment of structural similarity measurements using metal co-factor orientation. Proteins 2013; 82:648-56. [PMID: 24127252 DOI: 10.1002/prot.24442] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 09/17/2013] [Accepted: 09/26/2013] [Indexed: 12/20/2022]
Abstract
Structure comparison is widely used to quantify protein relationships. Although there are several approaches to calculate structural similarity, specifying significance thresholds for similarity metrics is difficult due to the inherent likeness of common secondary structure elements. In this study, metal co-factor location is used to assess the biological relevance of structural alignments. The distance between the centroids of bound co-factors adds a chemical and function-relevant constraint to the structural superimposition of two proteins. This additional dimension can be used to define cut-off values for discriminating valid and spurious alignments in large alignment sets. The hypothesis underlying our approach is that metal coordination sites constrain structural evolution, thus revealing functional relationships between distantly related proteins. A comparison of three related nitrogenases shows the sequence and fold constraints imposed on the protein structures up to 18 Å away from the centers of their bound metal clusters.
Collapse
Affiliation(s)
- Stefan Senn
- Environmental Biophysics and Molecular Ecology Program, Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, New Jersey, 08901
| | | | | | | |
Collapse
|
13
|
Kilian P, Valdes JJ, Lecina-Casas D, Chrudimský T, Růžek D. The variability of the large genomic segment of Ťahyňa orthobunyavirus and an all-atom exploration of its anti-viral drug resistance. INFECTION GENETICS AND EVOLUTION 2013; 20:304-11. [PMID: 24090866 DOI: 10.1016/j.meegid.2013.09.023] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Revised: 09/19/2013] [Accepted: 09/21/2013] [Indexed: 11/30/2022]
Abstract
Ťahyňa virus (TAHV), a member of the Bunyaviridae family (California complex), is an important but neglected human mosquito-borne pathogen. The virus genome is composed of three segments, i.e., small (S), medium (M), and large (L). Previous studies on genetic variability of viruses within the California complex were focused on S and M segments, but the L segment remains relatively unstudied. To assess the genetic variation and the relation to virus phenotype we analyzed the L segment sequences of biologically diverse TAHV strains isolated in the Czech Republic and Slovakia. Phylogenetic analysis covering all available sequences of the L segment of TAHV clearly revealed two distinguished lineages, tentatively named as "European" and "Asian". The L segment strains within the European lineage are highly conserved (identity 99.3%), whilst Asian strains are more genetically diverse (identity 97%). Based on sequence comparison with other bunyaviruses, several non-synonymous nucleotide substitutions unique for TAHV in the L segment were identified. We also identified specific residue substitutions in the endonuclease domain of TAHV compared with the La Crosse virus. Since the endonuclease domain of the La Crosse virus has been resolved, we employed an all energy landscape algorithm to analyze the ligand migration of a viral polymerase inhibitor. This allowed us to demonstrate, at the atomic level, that this viral polymerase inhibitor randomly explored the specific residue substitutions in the endonuclease domain of the TAHV L segment.
Collapse
Affiliation(s)
- Patrik Kilian
- Faculty of Science, University of South Bohemia, Branišovská 31, CZ-37005 České Budějovice, Czech Republic; Institute of Parasitology, Biology Centre of the Academy of Sciences of the Czech Republic, Branišovská 31, CZ-37005 České Budějovice, Czech Republic
| | | | | | | | | |
Collapse
|
14
|
Mariani V, Biasini M, Barbato A, Schwede T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics 2013; 29:2722-8. [PMID: 23986568 PMCID: PMC3799472 DOI: 10.1093/bioinformatics/btt473] [Citation(s) in RCA: 503] [Impact Index Per Article: 45.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Motivation: The assessment of protein structure prediction techniques requires objective criteria to measure the similarity between a computational model and the experimentally determined reference structure. Conventional similarity measures based on a global superposition of carbon α atoms are strongly influenced by domain motions and do not assess the accuracy of local atomic details in the model. Results: The Local Distance Difference Test (lDDT) is a superposition-free score that evaluates local distance differences of all atoms in a model, including validation of stereochemical plausibility. The reference can be a single structure, or an ensemble of equivalent structures. We demonstrate that lDDT is well suited to assess local model quality, even in the presence of domain movements, while maintaining good correlation with global measures. These properties make lDDT a robust tool for the automated assessment of structure prediction servers without manual intervention. Availability and implementation: Source code, binaries for Linux and MacOSX, and an interactive web server are available at http://swissmodel.expasy.org/lddt Contact:torsten.schwede@unibas.ch Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Valerio Mariani
- Biozentrum, Universität Basel, Klingelbergstrasse 50-70 and Computational Structural Biology, SIB Swiss Institute of Bioinformatics, 4056 Basel, Switzerland
| | | | | | | |
Collapse
|
15
|
Pauli I, dos Santos RN, Rostirolla DC, Martinelli LK, Ducati RG, Timmers LFSM, Basso LA, Santos DS, Guido RVC, Andricopulo AD, Norberto de Souza O. Discovery of new inhibitors of Mycobacterium tuberculosis InhA enzyme using virtual screening and a 3D-pharmacophore-based approach. J Chem Inf Model 2013; 53:2390-401. [PMID: 23889525 DOI: 10.1021/ci400202t] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Mycobacterium tuberculosis InhA (MtInhA) is an attractive enzyme to drug discovery efforts due to its validation as an effective biological target for tuberculosis therapy. In this work, two different virtual-ligand-screening approaches were applied in order to identify new InhA inhibitors' candidates from a library of ligands selected from the ZINC database. First, a 3-D pharmacophore model was built based on 36 available MtInhA crystal structures. By combining structure-based and ligand-based information, four pharmacophoric points were designed to select molecules able to satisfy the binding features of MtInhA substrate-binding cavity. The second approach consisted of using four well established docking programs, with different search algorithms, to compare the binding mode and score of the selected molecules from the aforementioned library. After detailed analyses of the results, six ligands were selected for in vitro analysis. Three of these molecules presented a satisfactory inhibitory activity with IC50 values ranging from 24 (±2) μM to 83 (±5) μM. The best compound presented an uncompetitive inhibition mode to NADH and 2-trans-dodecenoyl-CoA substrates, with Ki values of 24 (±3) μM and 20 (±2) μM, respectively. These molecules were not yet described as antituberculars or as InhA inhibitors, making its novelty interesting to start efforts on ligand optimization in order to identify new effective drugs against tuberculosis having InhA as a target. More studies are underway to dissect the discovered uncompetitive inhibitor interactions with MtInhA.
Collapse
Affiliation(s)
- Ivani Pauli
- Laboratório de Bioinformática, Modelagem e Simulação de Biossistemas - LABIO, Faculdade de Informática, PUCRS , Brazil
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Topham CM, Rouquier M, Tarrat N, André I. Adaptive Smith-Waterman residue match seeding for protein structural alignment. Proteins 2013; 81:1823-39. [DOI: 10.1002/prot.24327] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2013] [Revised: 04/22/2013] [Accepted: 05/15/2013] [Indexed: 12/30/2022]
Affiliation(s)
- Christopher M. Topham
- Université de Toulouse, INSA, UPS, INP, LISBP; 135 Avenue de Rangueil F-31077 Toulouse France
- CNRS, UMR5504; F-31400 Toulouse France
- INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés; F-31400 Toulouse France
| | - Mickaël Rouquier
- Université de Toulouse, INSA, UPS, INP, LISBP; 135 Avenue de Rangueil F-31077 Toulouse France
- CNRS, UMR5504; F-31400 Toulouse France
- INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés; F-31400 Toulouse France
| | - Nathalie Tarrat
- Université de Toulouse, INSA, UPS, INP, LISBP; 135 Avenue de Rangueil F-31077 Toulouse France
- CNRS, UMR5504; F-31400 Toulouse France
- INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés; F-31400 Toulouse France
| | - Isabelle André
- Université de Toulouse, INSA, UPS, INP, LISBP; 135 Avenue de Rangueil F-31077 Toulouse France
- CNRS, UMR5504; F-31400 Toulouse France
- INRA, UMR792 Ingénierie des Systèmes Biologiques et des Procédés; F-31400 Toulouse France
| |
Collapse
|
17
|
Parra RG, Espada R, Sánchez IE, Sippl MJ, Ferreiro DU. Detecting repetitions and periodicities in proteins by tiling the structural space. J Phys Chem B 2013; 117:12887-97. [PMID: 23758291 PMCID: PMC3807821 DOI: 10.1021/jp402105j] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
![]()
The
notion of energy landscapes provides conceptual tools for understanding
the complexities of protein folding and function. Energy landscape
theory indicates that it is much easier to find sequences that satisfy
the “Principle of Minimal Frustration” when the folded
structure is symmetric (Wolynes, P. G. Symmetry and the Energy Landscapes
of Biomolecules. Proc. Natl. Acad. Sci. U.S.A.1996, 93, 14249–14255). Similarly,
repeats and structural mosaics may be fundamentally related to landscapes
with multiple embedded funnels. Here we present analytical tools to
detect and compare structural repetitions in protein molecules. By
an exhaustive analysis of the distribution of structural repeats using
a robust metric, we define those portions of a protein molecule that
best describe the overall structure as a tessellation of basic units.
The patterns produced by such tessellations provide intuitive representations
of the repeating regions and their association toward higher order
arrangements. We find that some protein architectures can be described
as nearly periodic, while in others clear separations between repetitions
exist. Since the method is independent of amino acid sequence information,
we can identify structural units that can be encoded by a variety
of distinct amino acid sequences.
Collapse
Affiliation(s)
- R Gonzalo Parra
- Protein Physiology Lab, Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, UBA-CONICET-IQUIBICEN , Buenos Aires, Argentina
| | | | | | | | | |
Collapse
|
18
|
Affiliation(s)
- Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa 31905, Israel;
| | - Leonid Pereyaslavets
- Department of Structural Biology, Stanford University, Stanford, California 94305; ,
| | | | - Michael Levitt
- Department of Structural Biology, Stanford University, Stanford, California 94305; ,
| |
Collapse
|
19
|
Li SC. The difficulty of protein structure alignment under the RMSD. Algorithms Mol Biol 2013; 8:1. [PMID: 23286762 PMCID: PMC3599502 DOI: 10.1186/1748-7188-8-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 12/17/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Protein structure alignment is often modeled as the largest common point set (LCP) problem based on the Root Mean Square Deviation (RMSD), a measure commonly used to evaluate structural similarity. In the problem, each residue is represented by the coordinate of the Cαatom, and a structure is modeled as a sequence of 3D points. Out of two such sequences, one is to find two equal-sized subsequences of the maximum length, and a bijection between the points of the subsequences which gives an RMSD within a given threshold. The problem is considered to be difficult in terms of time complexity, but the reasons for its difficulty is not well-understood. Improving this time complexity is considered important in protein structure prediction and structural comparison, where the task of comparing very numerous structures is commonly encountered. RESULTS To study why the LCP problem is difficult, we define a natural variant of the problem, called the minimum aligned distance (MAD). In the MAD problem, the length of the subsequences to obtain is specified in the input; and instead of fulfilling a threshold, the RMSD between the points of the two subsequences is to be minimized. Our results show that the difficulty of the two problems does not lie solely in the combinatorial complexity of finding the optimal subsequences, or in the task of superimposing the structures. By placing a limit on the distance between consecutive points, and assuming that the points are specified as integral values, we show that both problems are equally difficult, in the sense that they are reducible to each other. In this case, both problems can be exactly solved in polynomial time, although the time complexity remains high. CONCLUSIONS We showed insights and techniques which we hope will lead to practical algorithms for the LCP problem for protein structures. The study identified two important factors in the problem's complexity: (1) The lack of a limit in the distance between the consecutive points of a structure; (2) The arbitrariness of the precision allowed in the input values. Both issues are of little practical concern for the purpose of protein structure alignment. When these factors are removed, the LCP problem is as hard as that of minimizing the RMSD (MAD problem), and can be solved exactly in polynomial time.
Collapse
|
20
|
Slater AW, Castellanos JI, Sippl MJ, Melo F. Towards the development of standardized methods for comparison, ranking and evaluation of structure alignments. Bioinformatics 2012; 29:47-53. [PMID: 23060612 DOI: 10.1093/bioinformatics/bts600] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Pairwise alignment of protein structures is a fundamental task in structural bioinformatics. There are numerous computer programs in the public domain that produce alignments for a given pair of protein structures, but the results obtained by the various programs generally differ substantially. Hence, in the application of such programs the question arises which of the alignment programs are the most trustworthy in the sense of overall performance, and which programs provide the best result for a given pair of proteins. The major problem in comparing, evaluating and judging alignment results is that there is no clear notion of the optimality of an alignment. As a consequence, the numeric criteria and scores reported by the individual structure alignment programs are largely incomparable. RESULTS Here we report on the development and application of a new approach for the evaluation of structure alignment results. The method uses the translation vector and rotation matrix to generate the superposition of two structures but discards the alignment reported by the individual programs. The optimal alignment is then generated in standardized form based on a suitably implemented dynamic programming algorithm where the length of the alignment is the single most informative parameter. We demonstrate that some of the most popular programs in protein structure research differ considerably in their overall performance. In particular, each of the programs investigated here produced in at least in one case the best and the worst alignment compared with all others. Hence, at the current state of development of structure comparison techniques, it is advisable to use several programs in parallel and to choose the optimal alignment in the way reported here. AVAILABILITY AND IMPLEMENTATION The computer software that implement the method described here is freely available at http://melolab.org/stovca.
Collapse
Affiliation(s)
- Alex W Slater
- Molecular Bioinformatics Laboratory, Millennium Institute on Immunology and Immunotherapy, Portugal 49, Santiago, CP 8330025, Chile
| | | | | | | |
Collapse
|
21
|
Kumar D, Gautam A, Hosur RV. A unified NMR strategy for high-throughput determination of backbone fold of small proteins. ACTA ACUST UNITED AC 2012; 13:201-12. [DOI: 10.1007/s10969-012-9144-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 09/18/2012] [Indexed: 11/30/2022]
|
22
|
Eibl C, Grigoriu S, Hessenberger M, Wenger J, Puehringer S, Pinheiro AS, Wagner RN, Proell M, Reed JC, Page R, Diederichs K, Peti W. Structural and functional analysis of the NLRP4 pyrin domain. Biochemistry 2012; 51:7330-41. [PMID: 22928810 PMCID: PMC3445046 DOI: 10.1021/bi3007059] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
NLRP4 is a member of the nucleotide-binding and leucine-rich
repeat
receptor (NLR) family of cytosolic receptors and a member of an inflammation
signaling cascade. Here, we present the crystal structure of the NLRP4
pyrin domain (PYD) at 2.3 Å resolution. The NLRP4 PYD is a member
of the death domain (DD) superfamily and adopts a DD fold consisting
of six α-helices tightly packed around a hydrophobic core, with
a highly charged surface that is typical of PYDs. Importantly, however,
we identified several differences between the NLRP4 PYD crystal structure
and other PYD structures that are significant enough to affect NLRP4
function and its interactions with binding partners. Notably, the
length of helix α3 and the α2−α3 connecting
loop in the NLRP4 PYD are unique among PYDs. The apoptosis-associated
speck-like protein containing a CARD (ASC) is an adaptor protein whose
interactions with a number of distinct PYDs are believed to be critical
for activation of the inflammatory response. Here, we use co-immunoprecipitation,
yeast two-hybrid, and nuclear magnetic resonance chemical shift perturbation
analysis to demonstrate that, despite being important for activation
of the inflammatory response and sharing several similarities with
other known ASC-interacting PYDs (i.e., ASC2), NLRP4 does not interact
with the adaptor protein ASC. Thus, we propose that the factors governing
homotypic PYD interactions are more complex than the currently accepted
model, which states that complementary charged surfaces are the main
determinants of PYD–PYD interaction specificity.
Collapse
Affiliation(s)
- Clarissa Eibl
- Department of Molecular Biology, University of Salzburg, 5020 Salzburg, Austria
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Tan KW, Jobichen C, Ong TC, Gao YF, Tiong YS, Wong KN, Chew FT, Sivaraman J, Mok YK. Crystal structure of Der f 7, a dust mite allergen from Dermatophagoides farinae. PLoS One 2012; 7:e44850. [PMID: 22970319 PMCID: PMC3435378 DOI: 10.1371/journal.pone.0044850] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2012] [Accepted: 08/07/2012] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Der f 7 is the group 7 allergen from the dust mite Dermatophagoides farinae, homologous to the major allergen Der p 7 from D. pteronyssinus. Monoclonal antibody that bind to residues Leu48 and Phe50 was found to inhibit IgE binding to residue Asp159, which is important for the cross-reactivity between Der f 7 and Der p 7. METHODOLOGY/PRINCIPAL FINDINGS Here, we report the crystal structure of Der f 7 that shows an elongated and curved molecule consisting of two anti-parallel β-sheets--one 4-stranded and the other 5-stranded--that wrap around a long C-terminal helix. The overall fold of Der f 7 is similar to Der p 7 but key difference was found in the β1-β2 loop region. In Der f 7, Leu48 and Phe50 are in close proximity to Asp159, explaining why monoclonal antibody binding to Leu48 and Phe50 can inhibit IgE binding to Asp159. Both Der f 7 and Der p 7 bind weakly to polymyxin B via a similar binding site that is formed by the N-terminal helix, the 4-stranded β-sheet and the C-terminal helix. The thermal stability of Der f 7 is significantly lower than that of Der p 7, and the stabilities of both allergens are highly depend on pH. CONCLUSION/SIGNIFICANCE Der f 7 is homologous to Der p 7 in terms of the amino acid sequence and overall 3D structure but with significant differences in the region proximal to the IgE epitope and in thermal stability. The crystal structure of Der f 7 provides a basis for studying the function and allergenicity of this group of allergens.
Collapse
Affiliation(s)
- Kang Wei Tan
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Chacko Jobichen
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Tan Ching Ong
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Yun Feng Gao
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Yuen Sung Tiong
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Kang Ning Wong
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Fook Tim Chew
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - J. Sivaraman
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Yu Keung Mok
- Department of Biological Sciences, National University of Singapore, Singapore, Singapore
- * E-mail:
| |
Collapse
|
24
|
Tan KW, Ong TC, Gao YF, Tiong YS, Wong KN, Chew FT, Mok YK. NMR structure and IgE epitopes of Blo t 21, a major dust mite allergen from Blomia tropicalis. J Biol Chem 2012; 287:34776-85. [PMID: 22887997 DOI: 10.1074/jbc.m112.348730] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Blo t 21 is a paralogue of the group 5 allergen, Blo t 5, a major allergen from the dust mite Blomia tropicalis. Blo t 21 has moderate sequence identity (40.7%) to Blo t 5 and low to moderate cross-reactivity to Blo t 5. In B. tropicalis, the most prevalent and allergenic allergens are in the order of Blo t 21, Blo t 5, and Blo t 7. Here, we determined the NMR solution structure of Blo t 21, which represents the first structure of the group 21 dust mite allergen. The structure of Blo t 21 closely resembles the structures of Blo t 5 and Der p 5, comprising three anti-parallel α-helices arranged in a helical bundle. Using site-directed mutagenesis and specific IgE binding ELISA, Blo t 21 was found to contain both conserved and unique charged IgE epitope residues at the L2 loop region and on helix α3. Cross-inhibition assays confirmed that Blo t 21 has a low to moderate cross-reactivity with Blo t 5 and Der p 5 and represents a novel group of major allergen in B. tropicalis. In addition to group 5 allergens, Blo t 21 has also a low to moderate cross-reactivity with group 21 allergens from Dermatophagoides mites, confirming that B. tropicalis is a major and distinct source of dust mite allergens.
Collapse
Affiliation(s)
- Kang Wei Tan
- Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore
| | | | | | | | | | | | | |
Collapse
|
25
|
Analyzing of expression of novel polypeptide complexes consisting of Shiga toxin B subunit and Adherence Fimbriae of Escherichia coli based on in silico modeling. J Mol Model 2012; 18:4131-9. [PMID: 22527278 DOI: 10.1007/s00894-012-1414-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2011] [Accepted: 03/26/2012] [Indexed: 10/28/2022]
Abstract
Enterohemorrhagic (EHEC) and enteroaggregative (EAEC) are two pathotypes of diarrheagenic Escherichia coli. EAEC strains express adhesins called aggregate adherence fimbriae (AAFs) which the bacteria use to adhere to intestinal mucosa. EHEC virulence factor is Shiga toxin which belongs to the AB5 toxin family. B subunit, the nontoxic part of Shiga toxin (StxB), forms a homo pentamer and is responsible for binding to target cells. StxB has recently been proven to have adjuvant activity. In the current study we fused StxB encoding gene to 3' end of genes encoding two variants of AAFs, i.e., AAF/I and AAF/II. The in silico studies on tertiary structure and biochemical characteristics of Shiga toxin A subunit (StxA) revealed more resemblance to AAF/II than AAF/I. The constructs were prepared in a way that StxB could imitate its natural structure (pentamer formation) and its position (C-terminus) in the native toxin complex. The expression of these constructs showed the formation of AAF/II-B as a protein complex but with lower molecular mass than its expected size. In contrast, the AAF/I-B complex was not formed. Overall, the results of in silico studies and expression experiments together revealed that despite AAF/II-B expression, StxB failed to form pentamer. Therefore the observed protein complex has lower molecular mass. Since StxB is bound to AAF/II through disulfide bond, this bond prevents pentamer formation of StxB. However, due to the lack of disulfide bond between AAF/I and StxB, no protein complex is formed, thus StxB maintains its pentamer structure.
Collapse
|
26
|
Real space refinement of crystal structures with canonical distributions of electrons. Structure 2012; 19:1739-43. [PMID: 22153496 PMCID: PMC3234344 DOI: 10.1016/j.str.2011.10.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2011] [Revised: 09/28/2011] [Accepted: 10/10/2011] [Indexed: 11/21/2022]
Abstract
Recurring groups of atoms in molecules are surrounded by specific canonical distributions of electrons. Deviations from these distributions reveal unrealistic molecular geometries. Here, we show how canonical electron densities can be combined with classical electron densities derived from X-ray diffraction experiments to drive the real space refinement of crystal structures. The refinement process generally yields superior molecular models with reduced excess electron densities and improved stereochemistry without compromising the agreement between molecular models and experimental data.
Collapse
|
27
|
Sippl MJ, Wiederstein M. Detection of spatial correlations in protein structures and molecular complexes. Structure 2012; 20:718-28. [PMID: 22483118 PMCID: PMC3320710 DOI: 10.1016/j.str.2012.01.024] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2011] [Revised: 01/09/2012] [Accepted: 01/31/2012] [Indexed: 10/28/2022]
Abstract
Protein structures are frequently related by spectacular and often surprising similarities. Structural correlations among protein chains are routinely detected by various structure-matching techniques, but the comparison of oligomers and molecular complexes is largely uncharted territory. Here we solve the structure-matching problem for oligomers and large molecular aggregates, including the largest molecular complexes known today. We provide several challenging examples that cannot be handled by conventional structure-matching techniques and we report on a number of remarkable correlations. The examples cover the cell-puncturing device of bacteriophage T4, the secretion system of P. aeruginosa, members of the dehydrogenase family, DNA clamps, ferredoxin iron-storage cages, and virus capsids.
Collapse
Affiliation(s)
- Manfred J Sippl
- Division of Bioinformatics, Department of Molecular Biology, University of Salzburg, Hellbrunnerstraße 34, 5020 Salzburg, Austria.
| | | |
Collapse
|
28
|
Computer-based annotation of putative AraC/XylS-family transcription factors of known structure but unknown function. J Biomed Biotechnol 2012; 2012:103132. [PMID: 22505803 PMCID: PMC3312330 DOI: 10.1155/2012/103132] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2011] [Revised: 12/09/2011] [Accepted: 12/13/2011] [Indexed: 12/12/2022] Open
Abstract
Currently, about 20 crystal structures per day are released and deposited in the Protein Data Bank. A significant fraction of these structures is produced by research groups associated with the structural genomics consortium. The biological function of many of these proteins is generally unknown or not validated by experiment. Therefore, a growing need for functional prediction of protein structures has emerged. Here we present an integrated bioinformatics method that combines sequence-based relationships and three-dimensional (3D) structural similarity of transcriptional regulators with computer prediction of their cognate DNA binding sequences. We applied this method to the AraC/XylS family of transcription factors, which is a large family of transcriptional regulators found in many bacteria controlling the expression of genes involved in diverse biological functions. Three putative new members of this family with known 3D structure but unknown function were identified for which a probable functional classification is provided. Our bioinformatics analyses suggest that they could be involved in plant cell wall degradation (Lin2118 protein from Listeria innocua, PDB code 3oou), symbiotic nitrogen fixation (protein from Chromobacterium violaceum, PDB code 3oio), and either metabolism of plant-derived biomass or nitrogen fixation (protein from Rhodopseudomonas palustris, PDB code 3mn2).
Collapse
|
29
|
Metlitzky M, Puehringer S, J. Fisher S. Crystal structure of PqqB from <i>Pseudomonas putida</i> at 2.2 Å resolution. ACTA ACUST UNITED AC 2012. [DOI: 10.4236/jbpc.2012.32023] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
30
|
Cortines JR, Weigele PR, Gilcrease EB, Casjens SR, Teschke CM. Decoding bacteriophage P22 assembly: identification of two charged residues in scaffolding protein responsible for coat protein interaction. Virology 2011; 421:1-11. [PMID: 21974803 DOI: 10.1016/j.virol.2011.09.005] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Revised: 08/15/2011] [Accepted: 09/08/2011] [Indexed: 11/17/2022]
Abstract
Proper assembly of viruses must occur through specific interactions between capsid proteins. Many double-stranded DNA viruses and bacteriophages require internal scaffolding proteins to assemble their coat proteins into icosahedral capsids. The 303 amino acid bacteriophage P22 scaffolding protein is mostly helical, and its C-terminal helix-turn-helix (HTH) domain binds to the coat protein during virion assembly, directing the formation of an intermediate structure called the procapsid. The interaction between coat and scaffolding protein HTH domain is electrostatic, but the amino acids that form the protein-protein interface have yet to be described. In the present study, we used alanine scanning mutagenesis of charged surface residues of the C-terminal HTH domain of scaffolding protein. We have determined that P22 scaffolding protein residues R293 and K296 are crucial for binding to coat protein and that the neighboring charges are not essential but do modulate the affinity between the two proteins.
Collapse
Affiliation(s)
- Juliana R Cortines
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA
| | | | | | | | | |
Collapse
|
31
|
Gelly JC, Joseph AP, Srinivasan N, de Brevern AG. iPBA: a tool for protein structure comparison using sequence alignment strategies. Nucleic Acids Res 2011; 39:W18-23. [PMID: 21586582 PMCID: PMC3125758 DOI: 10.1093/nar/gkr333] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
With the immense growth in the number of available protein structures, fast and accurate structure comparison has been essential. We propose an efficient method for structure comparison, based on a structural alphabet. Protein Blocks (PBs) is a widely used structural alphabet with 16 pentapeptide conformations that can fairly approximate a complete protein chain. Thus a 3D structure can be translated into a 1D sequence of PBs. With a simple Needleman–Wunsch approach and a raw PB substitution matrix, PB-based structural alignments were better than many popular methods. iPBA web server presents an improved alignment approach using (i) specialized PB Substitution Matrices (SM) and (ii) anchor-based alignment methodology. With these developments, the quality of ∼88% of alignments was improved. iPBA alignments were also better than DALI, MUSTANG and GANGSTA+ in >80% of the cases. The webserver is designed to for both pairwise comparisons and database searches. Outputs are given as sequence alignment and superposed 3D structures displayed using PyMol and Jmol. A local alignment option for detecting subs-structural similarity is also embedded. As a fast and efficient ‘sequence-based’ structure comparison tool, we believe that it will be quite useful to the scientific community. iPBA can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/ipba/.
Collapse
Affiliation(s)
- Jean-Christophe Gelly
- INSERM, UMR-S 665, Dynamique des Structures et Interactions des Macromolécules Biologiques, Université Paris Diderot-Paris 7, Institut National de la Transfusion Sanguine, 6, rue Alexandre Cabanel, 75739 Paris cedex 15, France
| | | | | | | |
Collapse
|
32
|
Crystal structure of the heteromolecular chaperone, AscE-AscG, from the type III secretion system in Aeromonas hydrophila. PLoS One 2011; 6:e19208. [PMID: 21559439 PMCID: PMC3084799 DOI: 10.1371/journal.pone.0019208] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2011] [Accepted: 03/22/2011] [Indexed: 12/27/2022] Open
Abstract
Background The putative needle complex subunit AscF forms a ternary complex with the chaperones AscE and AscG in the type III secretion system of Aeromonas hydrophila so as to avoid premature assembly. Previously, we demonstrated that the C-terminal region of AscG (residues 62–116) in the hetero-molecular chaperone, AscE-AscG, is disordered and susceptible to limited protease digestion. Methodology/Principal Findings Here, we report the crystal structure of the ordered AscG1–61 region in complex with AscE at 2.4 Å resolution. Helices α2 and α3 of AscE in the AscE-AscG1–61 complex assumes a helix-turn-helix conformation in an anti-parallel fashion similar to that in apo AscE. However, in the presence of AscG, an additional N-terminal helix α1 in AscE (residues 4–12) is observed. PscG or YscG in the crystal structures of PscE-PscF-PscG or YscE-YscF-YscG, respectively, assumes a typical tetratricopeptide repeat (TPR) fold with three TPR repeats and one C-terminal capping helix. By comparison, AscG in AscE-AscG1–61 comprises three anti-parallel helices that resembles the N-terminal TPR repeats in the corresponding region of PscG or YscG in PscE-PscF-PscG or YscE-YscF-YscG. Thermal denaturation of AscE-AscG and AscE-AscG1–61 complexes demonstrates that the C-terminal disordered region does not contribute to the thermal stability of the overall complex. Conclusion/Significance The N-terminal region of the AscG in the AscE-AscG complex is ordered and assumes a structure similar to those in the corresponding regions of PscE-PscG-PscF or YscE-YscF-YscG complexes. While the C-terminal region of AscG in the AscE-AscG complex is disordered and will assume its structure only in the presence of the substrate AscF. We hypothesize that AscE act as a chaperone of the chaperone to keep AscG in a stable but partially disordered state for interaction with AscF.
Collapse
|
33
|
Shen YF, Li B, Liu ZP. Protein structure alignment based on internal coordinates. Interdiscip Sci 2010; 2:308-19. [DOI: 10.1007/s12539-010-0019-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2008] [Revised: 01/05/2010] [Accepted: 01/06/2010] [Indexed: 10/18/2022]
|
34
|
Geric Stare B, Fouville D, Širca S, Gallot A, Urek G, Grenier E. Molecular variability and evolution of the pectate lyase (pel-2) parasitism gene in cyst nematodes parasitizing different solanaceous plants. J Mol Evol 2010; 72:169-81. [PMID: 21153407 DOI: 10.1007/s00239-010-9413-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2010] [Accepted: 11/16/2010] [Indexed: 11/28/2022]
Abstract
While pectate lyases are major parasitism factors in plant-parasitic nematodes, there is little information on the variability of these genes within species and their utility as pathotype or host range molecular markers. We have analysed polymorphisms of pectate lyase 2 (pel-2) gene, which degrades the unesterified polygalacturonate (pectate) of the host cell-wall, in the genus Globodera. Molecular variability of the pel-2 gene and the predicted protein was evaluated in populations of G. rostochiensis, G. pallida, G. "mexicana" and G. tabacum. Seventy eight pel-2 sequences were obtained and aligned. Point mutations were observed at 373 positions, 57% of these affect the coding part of the gene and produce 129 aa replacements. The observed polymorphism does not correlate either to the pathotypes proposed in potato cyst nematodes (PCN) or the subspecies described in tobacco cyst nematodes. The trees reveal a topology different from the admitted species topology as G. rostochiensis and G. pallida sequences are more similar to each other than to G. tabacum. Species-specific sites, potentially applicable for identification, and sites distinguishing PCN from tobacco cyst nematodes, were identified. As both G. rostochiensis and G. pallida display the same host range, but distinct from G. tabacum, which cannot parasitize potato plants, it is tempting to speculate that pel-2 genes polymorphism may be implicated in this adaptation, a view supported by the fact that no active pectate lyase 2 was found in G. "mexicana", a close relative of G. pallida that is unable to develop on cultivated potato varieties.
Collapse
Affiliation(s)
- Barbara Geric Stare
- Plant Protection Department, Agricultural Institute of Slovenia, Hacquetova 17, 1001, Ljubljana, Slovenia,
| | | | | | | | | | | |
Collapse
|
35
|
Prlic A, Bliven S, Rose PW, Bluhm WF, Bizon C, Godzik A, Bourne PE. Pre-calculated protein structure alignments at the RCSB PDB website. Bioinformatics 2010; 26:2983-5. [PMID: 20937596 PMCID: PMC3003546 DOI: 10.1093/bioinformatics/btq572] [Citation(s) in RCA: 149] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Summary: With the continuous growth of the RCSB Protein Data Bank (PDB), providing an up-to-date systematic structure comparison of all protein structures poses an ever growing challenge. Here, we present a comparison tool for calculating both 1D protein sequence and 3D protein structure alignments. This tool supports various applications at the RCSB PDB website. First, a structure alignment web service calculates pairwise alignments. Second, a stand-alone application runs alignments locally and visualizes the results. Third, pre-calculated 3D structure comparisons for the whole PDB are provided and updated on a weekly basis. These three applications allow users to discover novel relationships between proteins available either at the RCSB PDB or provided by the user. Availability and Implementation: A web user interface is available at http://www.rcsb.org/pdb/workbench/workbench.do. The source code is available under the LGPL license from http://www.biojava.org. A source bundle, prepared for local execution, is available from http://source.rcsb.org Contact:andreas@sdsc.edu; pbourne@ucsd.edu
Collapse
Affiliation(s)
- Andreas Prlic
- San Diego Supercomputer Center, University of California San Diego, 9500 Gilman Drive, Mailcode 0505 La Jolla, CA 92093-0505, USA.
| | | | | | | | | | | | | |
Collapse
|
36
|
Coudevylle N, Geist L, Hötzinger M, Hartl M, Kontaxis G, Bister K, Konrat R. The v-myc-induced Q83 lipocalin is a siderocalin. J Biol Chem 2010; 285:41646-52. [PMID: 20826777 DOI: 10.1074/jbc.m110.123331] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Siderocalins are atypical lipocalins able to capture siderophores with high affinity. They contribute to the innate immune response by interfering with bacterial siderophore-mediated iron uptake but are also involved in numerous physiological processes such as inflammation, iron delivery, tissue differentiation, and cancer progression. The Q83 lipocalin was originally identified based on its overexpression in quail embryo fibroblasts transformed by the v-myc oncogene. We show here that Q83 is a siderocalin, binding the siderophore enterobactin with an affinity and mode of binding nearly identical to that of neutrophil gelatinase-associated lipocalin (NGAL), the prototypical siderocalin. This strengthens the role of siderocalins in cancer progression and inflammation. In addition, we also present the solution structure of Q83 in complex with intact enterobactin and a detailed analysis of the Q83 binding mode, including mutagenesis of the critical residues involved in enterobactin binding. These data provide a first insight into the molecular details of siderophore binding and delineate the common molecular properties defining the siderocalin protein family.
Collapse
Affiliation(s)
- Nicolas Coudevylle
- Department of Structural and Computational Biology, Max F Perutz Laboratories, University of Vienna, 1030 Vienna, Austria.
| | | | | | | | | | | | | |
Collapse
|
37
|
Stivala AD, Stuckey PJ, Wirth AI. Fast and accurate protein substructure searching with simulated annealing and GPUs. BMC Bioinformatics 2010; 11:446. [PMID: 20813068 PMCID: PMC2944279 DOI: 10.1186/1471-2105-11-446] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2010] [Accepted: 09/03/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Searching a database of protein structures for matches to a query structure, or occurrences of a structural motif, is an important task in structural biology and bioinformatics. While there are many existing methods for structural similarity searching, faster and more accurate approaches are still required, and few current methods are capable of substructure (motif) searching. RESULTS We developed an improved heuristic for tableau-based protein structure and substructure searching using simulated annealing, that is as fast or faster and comparable in accuracy, with some widely used existing methods. Furthermore, we created a parallel implementation on a modern graphics processing unit (GPU). CONCLUSIONS The GPU implementation achieves up to 34 times speedup over the CPU implementation of tableau-based structure search with simulated annealing, making it one of the fastest available methods. To the best of our knowledge, this is the first application of a GPU to the protein structural search problem.
Collapse
Affiliation(s)
- Alex D Stivala
- Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia
| | - Peter J Stuckey
- Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia
- National ICT Australia Victoria Laboratory at The University of Melbourne, Victoria 3010, Australia
| | - Anthony I Wirth
- Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia
| |
Collapse
|
38
|
Zhang ZH, Bharatham K, Sherman WA, Mihalek I. deconSTRUCT: general purpose protein database search on the substructure level. Nucleic Acids Res 2010; 38:W590-4. [PMID: 20522512 PMCID: PMC2896154 DOI: 10.1093/nar/gkq489] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
deconSTRUCT webserver offers an interface to a protein database search engine, usable for a general purpose detection of similar protein (sub)structures. Initially, it deconstructs the query structure into its secondary structure elements (SSEs) and reassembles the match to the target by requiring a (tunable) degree of similarity in the direction and sequential order of SSEs. Hierarchical organization and judicious use of the information about protein structure enables deconSTRUCT to achieve the sensitivity and specificity of the established search engines at orders of magnitude increased speed, without tying up irretrievably the substructure information in the form of a hash. In a post-processing step, a match on the level of the backbone atoms is constructed. The results presented to the user consist of the list of the matched SSEs, the transformation matrix for rigid superposition of the structures and several ways of visualization, both downloadable and implemented as a web-browser plug-in. The server is available at http://epsf.bmad.bii.a-star.edu.sg/struct_server.html.
Collapse
Affiliation(s)
- Zong Hong Zhang
- Bioinformatics Institute 30 Biopolis Street, #07-01 Matrix, Singapore
| | | | | | | |
Collapse
|
39
|
Konagurthu AS, Reboul CF, Schmidberger JW, Irving JA, Lesk AM, Stuckey PJ, Whisstock JC, Buckle AM. MUSTANG-MR structural sieving server: applications in protein structural analysis and crystallography. PLoS One 2010; 5:e10048. [PMID: 20386610 PMCID: PMC2850368 DOI: 10.1371/journal.pone.0010048] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2009] [Accepted: 03/16/2010] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND A central tenet of structural biology is that related proteins of common function share structural similarity. This has key practical consequences for the derivation and analysis of protein structures, and is exploited by the process of "molecular sieving" whereby a common core is progressively distilled from a comparison of two or more protein structures. This paper reports a novel web server for "sieving" of protein structures, based on the multiple structural alignment program MUSTANG. METHODOLOGY/PRINCIPAL FINDINGS "Sieved" models are generated from MUSTANG-generated multiple alignment and superpositions by iteratively filtering out noisy residue-residue correspondences, until the resultant correspondences in the models are optimally "superposable" under a threshold of RMSD. This residue-level sieving is also accompanied by iterative elimination of the poorly fitting structures from the input ensemble. Therefore, by varying the thresholds of RMSD and the cardinality of the ensemble, multiple sieved models are generated for a given multiple alignment and superposition from MUSTANG. To aid the identification of structurally conserved regions of functional importance in an ensemble of protein structures, Lesk-Hubbard graphs are generated, plotting the number of residue correspondences in a superposition as a function of its corresponding RMSD. The conserved "core" (or typically active site) shows a linear trend, which becomes exponential as divergent parts of the structure are included into the superposition. CONCLUSIONS The application addresses two fundamental problems in structural biology: first, the identification of common substructures among structurally related proteins--an important problem in characterization and prediction of function; second, generation of sieved models with demonstrated uses in protein crystallographic structure determination using the technique of Molecular Replacement.
Collapse
Affiliation(s)
- Arun S. Konagurthu
- NICTA Victoria Research Laboratory at The University of Melbourne, The University of Melbourne, Melbourne, Australia
- Department of Computer Science and Software Engineering, The University of Melbourne, Victoria, Australia
| | - Cyril F. Reboul
- Department of Biochemistry and Molecular Biology, Monash University, Victoria, Australia
- ARC Centre of Excellence in Structural and Functional Microbial Genomics, Monash University, Victoria, Australia
| | - Jason W. Schmidberger
- Department of Biochemistry and Molecular Biology, Monash University, Victoria, Australia
| | - James A. Irving
- Department of Biochemistry and Molecular Biology, Monash University, Victoria, Australia
| | - Arthur M. Lesk
- The Huck Institute for Genomics, Proteomics, and Bioinformatics, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Peter J. Stuckey
- NICTA Victoria Research Laboratory at The University of Melbourne, The University of Melbourne, Melbourne, Australia
- Department of Computer Science and Software Engineering, The University of Melbourne, Victoria, Australia
| | - James C. Whisstock
- Department of Biochemistry and Molecular Biology, Monash University, Victoria, Australia
- ARC Centre of Excellence in Structural and Functional Microbial Genomics, Monash University, Victoria, Australia
| | - Ashley M. Buckle
- Department of Biochemistry and Molecular Biology, Monash University, Victoria, Australia
- * E-mail:
| |
Collapse
|
40
|
Frank K, Gruber M, Sippl MJ. COPS Benchmark: interactive analysis of database search methods. Bioinformatics 2010; 26:574-5. [DOI: 10.1093/bioinformatics/btp712] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
41
|
Cooke HA, Guenther EL, Luo Y, Shen B, Bruner SD. Molecular basis of substrate promiscuity for the SAM-dependent O-methyltransferase NcsB1, involved in the biosynthesis of the enediyne antitumor antibiotic neocarzinostatin. Biochemistry 2009; 48:9590-8. [PMID: 19702337 DOI: 10.1021/bi901257q] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The small molecule component of chromoprotein enediyne antitumor antibiotics is biosynthesized through a convergent route, incorporating amino acid, polyketide, and carbohydrate building blocks around a central enediyne hydrocarbon core. The naphthoic acid moiety of the enediyne neocarzinostatin plays key roles in the biological activity of the natural product by interacting with both the carrier protein and duplex DNA at the site of action. We have previously described the in vitro characterization of an S-adenosylmethionine-dependent O-methyltransferase (NcsB1) in the neocarzinostatin biosynthetic pathway [Luo, Y., Lin, S., Zhang, J., Cooke, H. A., Bruner, S. D., and Shen, B. (2008) J. Biol. Chem. 283, 14694-14702]. Here we provide a structural basis for NcsB1 activity, illustrating that the enzyme shares an overall architecture with a large family of S-adenosylmethionine-dependent proteins. In addition, NcsB1 represents the first enzyme to be structurally characterized in the biosynthetic pathway of neocarzinostatin. By cocrystallizing the enzyme with various combinations of the cofactor and substrate analogues, details of the active site structure have been established. Changes in subdomain orientation were observed via comparison of structures in the presence and absence of substrate, suggesting that reorientation of the enzyme is involved in binding of the substrate. In addition, residues important for substrate discrimination were predicted and probed through site-directed mutagenesis and in vitro biochemical characterization.
Collapse
Affiliation(s)
- Heather A Cooke
- Department of Chemistry, Merkert Chemistry Center, Boston College, Chestnut Hill, Massachusetts 02467, USA
| | | | | | | | | |
Collapse
|
42
|
Konrat R. The protein meta-structure: a novel concept for chemical and molecular biology. Cell Mol Life Sci 2009; 66:3625-39. [PMID: 19690801 PMCID: PMC11115628 DOI: 10.1007/s00018-009-0117-0] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2009] [Revised: 07/20/2009] [Accepted: 07/21/2009] [Indexed: 12/01/2022]
Abstract
The ultimate goal of bioinformatics or computational chemical biology is the sequence-based prediction of protein functionality. However, due to the degeneracy of the primary sequence code there is no unambiguous relationship. The degeneracy can be partly lifted by going to higher levels of abstraction and, for example, incorporating 3D structural information. However, sometimes even at this conceptual level functional ambiguities often remain. Here a novel conceptual framework is described (the protein meta-structure). At this level of abstraction, the protein structure is viewed as an intricate network of interacting residues. This novel conception offers unique possibilities for chemical (molecular) biology, structural genomics and drug discovery. In this review some prototypical applications will be presented that serve to illustrate the potential of the methodology.
Collapse
Affiliation(s)
- Robert Konrat
- Department of Structural and Computational Biology, Max F. Perutz Laboratories, University of Vienna, Vienna Biocenter Campus 5, 1030, Vienna, Austria.
| |
Collapse
|
43
|
Wassermann AM, Geppert H, Bajorath J. Ligand Prediction for Orphan Targets Using Support Vector Machines and Various Target-Ligand Kernels Is Dominated by Nearest Neighbor Effects. J Chem Inf Model 2009; 49:2155-67. [DOI: 10.1021/ci9002624] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Anne Mai Wassermann
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Hanna Geppert
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| |
Collapse
|
44
|
Sippl MJ. Fold space unlimited. Curr Opin Struct Biol 2009; 19:312-20. [DOI: 10.1016/j.sbi.2009.03.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2009] [Revised: 02/16/2009] [Accepted: 03/16/2009] [Indexed: 11/25/2022]
|
45
|
Kumar D, Misra JR, Kumar A, Chugh J, Sharma S, Hosur RV. NMR-derived solution structure of SUMO fromDrosophila melanogaster(dSmt3). Proteins 2009; 75:1046-50. [DOI: 10.1002/prot.22389] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
46
|
Hasegawa H, Holm L. Advances and pitfalls of protein structural alignment. Curr Opin Struct Biol 2009; 19:341-8. [PMID: 19481444 DOI: 10.1016/j.sbi.2009.04.003] [Citation(s) in RCA: 303] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2009] [Accepted: 04/16/2009] [Indexed: 11/30/2022]
Abstract
Structure comparison opens a window into the distant past of protein evolution, which has been unreachable by sequence comparison alone. With 55,000 entries in the Protein Data Bank and about 500 new structures added each week, automated processing, comparison, and classification are necessary. A variety of methods use different representations, scoring functions, and optimization algorithms, and they generate contradictory results even for moderately distant structures. Sequence mutations, insertions, and deletions are accommodated by plastic deformations of the common core, retaining the precise geometry of the active site, and peripheral regions may refold completely. Therefore structure comparison methods that allow for flexibility and plasticity generate the most biologically meaningful alignments. Active research directions include both the search for fold invariant features and the modeling of structural transitions in evolution. Advances have been made in algorithmic robustness, multiple alignment, and speeding up database searches.
Collapse
Affiliation(s)
- Hitomi Hasegawa
- Institute of Biotechnology, University of Helsinki, P.O. Box 56 (Viikinkaari 5), 00014 University of Helsinki, Finland
| | | |
Collapse
|
47
|
Suhrer SJ, Wiederstein M, Gruber M, Sippl MJ. COPS--a novel workbench for explorations in fold space. Nucleic Acids Res 2009; 37:W539-44. [PMID: 19465386 PMCID: PMC2703906 DOI: 10.1093/nar/gkp411] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The COPS (Classification Of Protein Structures) web server provides access to the complete repertoire of known protein structures and protein structural domains. The COPS classification encodes pairwise structural similarities as quantified metric relationships. The resulting metrical structure is mapped to a hierarchical tree, which is largely equivalent to the structure of a file browser. Exploiting this relationship we implemented the Fold Space Navigator, a tool that makes navigation in fold space as convenient as browsing through a file system. Moreover, pairwise structural similarities among the domains can be visualized and inspected instantaneously. COPS is updated weekly and stays concurrent with the PDB repository. The server also exposes the COPS classification pipeline. Newly determined structures uploaded to the server are chopped into domains, the locations of the new domains in the classification tree are determined, and their neighborhood can be immediately explored through the Fold Space Navigator. The COPS web server is accessible at http://cops.services.came.sbg.ac.at/.
Collapse
Affiliation(s)
- Stefan J Suhrer
- Center of Applied Molecular Engineering, Division of Bioinformatics, University of Salzburg, Hellbrunnerstrasse 34, 5020 Salzburg, Austria
| | | | | | | |
Collapse
|
48
|
Does distant homology with Evf reveal a lipid binding site in Bacillus thuringiensis cytolytic toxins? FEBS Lett 2009; 583:1555-60. [PMID: 19409387 DOI: 10.1016/j.febslet.2009.04.038] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2009] [Revised: 04/22/2009] [Accepted: 04/23/2009] [Indexed: 11/23/2022]
Abstract
The Cry and Cyt classes of insecticidal toxins derived from the sporulating bacterium Bacillus thuringiensis are valuable substitutes for synthetic pesticides in agricultural contexts. Crystal structures and many biochemical data have provided insights into their molecular mechanisms, generally thought to involve oligomerization and pore formation, but have not localised the site on Cyt toxins responsible for selective binding of phospholipids containing unsaturated fatty acids. Here, distant homology between the structure of Cyt toxins and Erwinia virulence factor (Evf) is demonstrated which, along with sequence conservation analysis, allows a putative lipid binding site to be localised in the toxins.
Collapse
|
49
|
Dessailly BH, Redfern OC, Cuff A, Orengo CA. Exploiting structural classifications for function prediction: towards a domain grammar for protein function. Curr Opin Struct Biol 2009; 19:349-56. [PMID: 19398323 DOI: 10.1016/j.sbi.2009.03.009] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2009] [Revised: 02/17/2009] [Accepted: 03/16/2009] [Indexed: 12/28/2022]
Abstract
The ability to assign function to proteins has become a major bottleneck for comprehensively understanding cellular mechanisms at the molecular level. Here we discuss the extent to which structural domain classifications can help in deciphering the complex relationship between the functions of proteins and their sequences and structures. Structural classifications are particularly helpful in understanding the mosaic manner in which new proteins and functions emerge through evolution. This is partly because they provide reliable and concrete domain definitions and enable the detection of very remote structural similarities and homologies. It is also because structural data can illuminate more clearly the mechanisms by which a broader functional repertoire can emerge during evolution.
Collapse
Affiliation(s)
- Benoît H Dessailly
- Department of Structural and Molecular Biology, University College London, London WC1E 6BT, United Kingdom
| | | | | | | |
Collapse
|
50
|
Lawton TJ, Sayavedra-Soto LA, Arp DJ, Rosenzweig AC. Crystal structure of a two-domain multicopper oxidase: implications for the evolution of multicopper blue proteins. J Biol Chem 2009; 284:10174-80. [PMID: 19224923 PMCID: PMC2665071 DOI: 10.1074/jbc.m900179200] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2009] [Revised: 02/06/2009] [Indexed: 11/06/2022] Open
Abstract
The two-domain multicopper oxidases are proposed to be key intermediates in the evolution of three-domain multicopper oxidases. A number of two-domain multicopper oxidases have been identified from genome sequences and are classified as type A, type B, or type C on the basis of the predicted location of the type 1 copper center. The crystal structure of blue copper oxidase, a type C two-domain multicopper oxidase from Nitrosomonas europaea, has been determined to 1.9 A resolution. Blue copper oxidase is a trimer, of which each subunit comprises two cupredoxin domains. Each subunit houses a type 1 copper site in domain 1 and a type 2/type 3 trinuclear copper cluster at the subunit-subunit interface. The coordination geometry at the trinuclear copper site is consistent with reduction of the copper ions. Although the overall architecture of blue copper oxidase is similar to nitrite reductases, detailed structural alignments show that the fold and domain orientation more closely resemble the three-domain multicopper oxidases. These observations have important implications for the evolution of nitrite reductases and multicopper oxidases.
Collapse
Affiliation(s)
- Thomas J Lawton
- Departments of Biochemistry, Molecular Biology, and Cell Biology and of Chemistry, Northwestern University, Evanston, Illinois 60208, USA
| | | | | | | |
Collapse
|