1
|
Blekos K, Chairetakis K, Lynch I, Marcoulaki E. Principles and requirements for nanomaterial representations to facilitate machine processing and cooperation with nanoinformatics tools. J Cheminform 2023; 15:44. [PMID: 37046286 PMCID: PMC10099932 DOI: 10.1186/s13321-022-00669-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 12/10/2022] [Indexed: 04/14/2023] Open
Abstract
Efficient and machine-readable representations are needed to accurately identify, validate and communicate information of chemical structures. Many such representations have been developed (as, for example, the Simplified Molecular-Input Line-Entry System and the IUPAC International Chemical Identifier), each offering advantages specific to various use-cases. Representation of the multi-component structures of nanomaterials (NMs), though, remains out of scope for all the currently available standards, as the nature of NMs sets new challenges on formalizing the encoding of their structure, interactions and environmental parameters. In this work we identify a set of principles that a NM representation should adhere to in order to provide "machine-friendly" encodings of NMs, i.e. encodings that facilitate machine processing and cooperation with nanoinformatics tools. We illustrate our principles by showing how the recently introduced InChI-based NM representation, might be augmented, in principle, to also encode morphology and mixture properties, distributions of properties, and also to capture auxiliary information and allow data reuse.
Collapse
Affiliation(s)
- Kostas Blekos
- Institute of Nuclear and Radiological Sciences and Technology, Energy and Safety, National Centre for Scientific Research "Demokritos", 15341, Agia Paraskevi, Greece
| | - Kostas Chairetakis
- Institute of Nuclear and Radiological Sciences and Technology, Energy and Safety, National Centre for Scientific Research "Demokritos", 15341, Agia Paraskevi, Greece
| | - Iseult Lynch
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Effie Marcoulaki
- Institute of Nuclear and Radiological Sciences and Technology, Energy and Safety, National Centre for Scientific Research "Demokritos", 15341, Agia Paraskevi, Greece.
| |
Collapse
|
2
|
Duarte IF, Caio J, Moedas MF, Rodrigues LA, Leandro AP, Rivera IA, Silva MFB. Dihydrolipoamide dehydrogenase, pyruvate oxidation, and acetylation-dependent mechanisms intersecting drug iatrogenesis. Cell Mol Life Sci 2021; 78:7451-7468. [PMID: 34718827 PMCID: PMC11072406 DOI: 10.1007/s00018-021-03996-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 09/27/2021] [Accepted: 10/15/2021] [Indexed: 10/19/2022]
Abstract
In human metabolism, pyruvate dehydrogenase complex (PDC) is one of the most intricate and large multimeric protein systems representing a central hub for cellular homeostasis. The worldwide used antiepileptic drug valproic acid (VPA) may potentially induce teratogenicity or a mild to severe hepatic toxicity, where the underlying mechanisms are not completely understood. This work aims to clarify the mechanisms that intersect VPA-related iatrogenic effects to PDC-associated dihydrolipoamide dehydrogenase (DLD; E3) activity. DLD is also a key enzyme of α-ketoglutarate dehydrogenase, branched-chain α-keto acid dehydrogenase, α-ketoadipate dehydrogenase, and the glycine decarboxylase complexes. The molecular effects of VPA will be reviewed underlining the data that sustain a potential interaction with DLD. The drug-associated effects on lipoic acid-related complexes activity may induce alterations on the flux of metabolites through tricarboxylic acid cycle, branched-chain amino acid oxidation, glycine metabolism and other cellular acetyl-CoA-connected reactions. The biotransformation of VPA involves its complete β-oxidation in mitochondria causing an imbalance on energy homeostasis. The drug consequences as histone deacetylase inhibitor and thus gene expression modulator have also been recognized. The mitochondrial localization of PDC is unequivocal, but its presence and function in the nucleus were also demonstrated, generating acetyl-CoA, crucial for histone acetylation. Bridging metabolism and epigenetics, this review gathers the evidence of VPA-induced interference with DLD or PDC functions, mainly in animal and cellular models, and highlights the uncharted in human. The consequences of this interaction may have significant impact either in mitochondrial or in nuclear acetyl-CoA-dependent processes.
Collapse
Affiliation(s)
- I F Duarte
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
| | - J Caio
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
| | - M F Moedas
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
- Division of Molecular Metabolism, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
| | - L A Rodrigues
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
| | - A P Leandro
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
- Department of Biochemistry and Human Biology, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
| | - I A Rivera
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
- Department of Biochemistry and Human Biology, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal
| | - M F B Silva
- The Research Institute for Medicines (iMed.ULisboa), Metabolism and Genetics Group, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal.
- Department of Biochemistry and Human Biology, Faculty of Pharmacy, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003, Lisboa, Portugal.
| |
Collapse
|
3
|
Poverennaya EV, Kiseleva OI, Ivanov AS, Ponomarenko EA. Methods of Computational Interactomics for Investigating Interactions of Human Proteoforms. BIOCHEMISTRY (MOSCOW) 2020; 85:68-79. [PMID: 32079518 DOI: 10.1134/s000629792001006x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Human genome contains ca. 20,000 protein-coding genes that could be translated into millions of unique protein species (proteoforms). Proteoforms coded by a single gene often have different functions, which implies different protein partners. By interacting with each other, proteoforms create a network reflecting the dynamics of cellular processes in an organism. Perturbations of protein-protein interactions change the network topology, which often triggers pathological processes. Studying proteoforms is a relatively new research area in proteomics, and this is why there are comparatively few experimental studies on the interaction of proteoforms. Bioinformatics tools can facilitate such studies by providing valuable complementary information to the experimental data and, in particular, expanding the possibilities of the studies of proteoform interactions.
Collapse
Affiliation(s)
| | - O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | - A S Ivanov
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | | |
Collapse
|
4
|
Chojnowski G, Choudhury K, Heuser P, Sobolev E, Pereira J, Oezugurel U, Lamzin VS. The use of local structural similarity of distant homologues for crystallographic model building from a molecular-replacement solution. Acta Crystallogr D Struct Biol 2020; 76:248-260. [PMID: 32133989 PMCID: PMC7057216 DOI: 10.1107/s2059798320000455] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 01/14/2020] [Indexed: 12/18/2022] Open
Abstract
The performance of automated protein model building usually decreases with resolution, mainly owing to the lower information content of the experimental data. This calls for a more elaborate use of the available structural information about macromolecules. Here, a new method is presented that uses structural homologues to improve the quality of protein models automatically constructed using ARP/wARP. The method uses local structural similarity between deposited models and the model being built, and results in longer main-chain fragments that in turn can be more reliably docked to the protein sequence. The application of the homology-based model extension method to the example of a CFA synthase at 2.7 Å resolution resulted in a more complete model with almost all of the residues correctly built and docked to the sequence. The method was also evaluated on 1493 molecular-replacement solutions at a resolution of 4.0 Å and better that were submitted to the ARP/wARP web service for model building. A significant improvement in the completeness and sequence coverage of the built models has been observed.
Collapse
Affiliation(s)
- Grzegorz Chojnowski
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Koushik Choudhury
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Joana Pereira
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Umut Oezugurel
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Victor S. Lamzin
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| |
Collapse
|
5
|
Leite WC, Penteado RF, Gomes F, Iulek J, Etto RM, Saab SC, Steffens MBR, Galvão CW. MAW point mutation impairs H. Seropedicae RecA ATP hydrolysis and DNA repair without inducing large conformational changes in its structure. PLoS One 2019; 14:e0214601. [PMID: 30998678 PMCID: PMC6472873 DOI: 10.1371/journal.pone.0214601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 03/17/2019] [Indexed: 11/18/2022] Open
Abstract
RecA is a multifunctional protein that plays a central role in DNA repair in bacteria. The structural Make ATP Work motif (MAW) is proposed to control the ATPase activity of RecA. In the present work, we report the biochemical activity and structural effects of the L53Q mutation at the MAW motif of the RecA protein from H. seropedicae (HsRecA L53Q). In vitro studies showed that HsRecA L53Q can bind ADP, ATP, and ssDNA, as does wild-type RecA. However, the ATPase and DNA-strand exchange activities were completely lost. In vivo studies showed that the expression of HsRecA L53Q in E. coli recA1 does not change its phenotype when cells were challenged with MMS and UV. Molecular dynamics simulations showed the L53Q point mutation did not cause large conformational changes in the HsRecA structure. However, there is a difference on dynamical cross-correlation movements of the residues involved in contacts within the ATP binding site and regions that hold the DNA binding sites. Additionally, a new hydrogen bond, formed between Q53 and T49, was hypothesized to allow an independent motion of the MAW motif from the hydrophobic core, what could explain the observed loss of activity of HsRecA L53Q.
Collapse
Affiliation(s)
- Wellington C. Leite
- Department of Physics, State University of Ponta Grossa (UEPG), Ponta Grossa,Paraná, Brazil
- * E-mail: (WCL); .(CWG)
| | - Renato F. Penteado
- Department of Chemistry, State University of Ponta Grossa (UEPG), Ponta Grossa, Paraná, Brazil
| | - Fernando Gomes
- Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, São Paulo, Brazil
| | - Jorge Iulek
- Department of Chemistry, State University of Ponta Grossa (UEPG), Ponta Grossa, Paraná, Brazil
| | - Rafael M. Etto
- Department of Chemistry, State University of Ponta Grossa (UEPG), Ponta Grossa, Paraná, Brazil
| | - Sérgio C. Saab
- Department of Physics, State University of Ponta Grossa (UEPG), Ponta Grossa,Paraná, Brazil
| | - Maria B. R. Steffens
- Department of Biochemistry and Molecular Biology, Federal University of Paraná, Curitiba, Paraná, Brazil
| | - Carolina W. Galvão
- Department of Structural and Molecular Biology and Genetics, State University of Ponta Grossa (UEPG), Ponta Grossa, Paraná, Brazil
- * E-mail: (WCL); .(CWG)
| |
Collapse
|
6
|
Mir S, Alhroub Y, Anyango S, Armstrong DR, Berrisford JM, Clark AR, Conroy MJ, Dana JM, Deshpande M, Gupta D, Gutmanas A, Haslam P, Mak L, Mukhopadhyay A, Nadzirin N, Paysan-Lafosse T, Sehnal D, Sen S, Smart OS, Varadi M, Kleywegt GJ, Velankar S. PDBe: towards reusable data delivery infrastructure at protein data bank in Europe. Nucleic Acids Res 2018; 46:D486-D492. [PMID: 29126160 PMCID: PMC5753225 DOI: 10.1093/nar/gkx1070] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 10/14/2017] [Accepted: 10/26/2017] [Indexed: 01/12/2023] Open
Abstract
The Protein Data Bank in Europe (PDBe, pdbe.org) is actively engaged in the deposition, annotation, remediation, enrichment and dissemination of macromolecular structure data. This paper describes new developments and improvements at PDBe addressing three challenging areas: data enrichment, data dissemination and functional reusability. New features of the PDBe Web site are discussed, including a context dependent menu providing links to raw experimental data and improved presentation of structures solved by hybrid methods. The paper also summarizes the features of the LiteMol suite, which is a set of services enabling fast and interactive 3D visualization of structures, with associated experimental maps, annotations and quality assessment information. We introduce a library of Web components which can be easily reused to port data and functionality available at PDBe to other services. We also introduce updates to the SIFTS resource which maps PDB data to other bioinformatics resources, and the PDBe REST API.
Collapse
Affiliation(s)
- Saqib Mir
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Younes Alhroub
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David R Armstrong
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - John M Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alice R Clark
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Matthew J Conroy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jose M Dana
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mandar Deshpande
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Deepti Gupta
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Aleksandras Gutmanas
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Pauline Haslam
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Lora Mak
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Abhik Mukhopadhyay
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Typhaine Paysan-Lafosse
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- InterPro, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Sehnal
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic
| | - Sanchayita Sen
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oliver S Smart
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gerard J Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Welcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
7
|
Leite WC, Galvão CW, Saab SC, Iulek J, Etto RM, Steffens MBR, Chitteni-Pattu S, Stanage T, Keck JL, Cox MM. Structural and Functional Studies of H. seropedicae RecA Protein - Insights into the Polymerization of RecA Protein as Nucleoprotein Filament. PLoS One 2016; 11:e0159871. [PMID: 27447485 PMCID: PMC4957752 DOI: 10.1371/journal.pone.0159871] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 07/08/2016] [Indexed: 11/18/2022] Open
Abstract
The bacterial RecA protein plays a role in the complex system of DNA damage repair. Here, we report the functional and structural characterization of the Herbaspirillum seropedicae RecA protein (HsRecA). HsRecA protein is more efficient at displacing SSB protein from ssDNA than Escherichia coli RecA protein. HsRecA also promotes DNA strand exchange more efficiently. The three dimensional structure of HsRecA-ADP/ATP complex has been solved to 1.7 Å resolution. HsRecA protein contains a small N-terminal domain, a central core ATPase domain and a large C-terminal domain, that are similar to homologous bacterial RecA proteins. Comparative structural analysis showed that the N-terminal polymerization motif of archaeal and eukaryotic RecA family proteins are also present in bacterial RecAs. Reconstruction of electrostatic potential from the hexameric structure of HsRecA-ADP/ATP revealed a high positive charge along the inner side, where ssDNA is bound inside the filament. The properties of this surface may explain the greater capacity of HsRecA protein to bind ssDNA, forming a contiguous nucleoprotein filament, displace SSB and promote DNA exchange relative to EcRecA. Our functional and structural analyses provide insight into the molecular mechanisms of polymerization of bacterial RecA as a helical nucleoprotein filament.
Collapse
Affiliation(s)
- Wellington C. Leite
- Department of Physics, Ponta Grossa State University (UEPG), Av. Carlos Cavalcanti, 4748, CEP. 84.030–900, Ponta Grossa, PR, Brazil
- * E-mail: (MC); (WL)
| | - Carolina W. Galvão
- Department of Structural and Molecular Biology and Genetics, Ponta Grossa State University (UEPG), CEP 84030–900, Ponta Grossa, PR, Brazil
| | - Sérgio C. Saab
- Department of Physics, Ponta Grossa State University (UEPG), Av. Carlos Cavalcanti, 4748, CEP. 84.030–900, Ponta Grossa, PR, Brazil
| | - Jorge Iulek
- Department of Chemistry, Ponta Grossa State University (UEPG), CEP 84030–900, Ponta Grossa, PR, Brazil
| | - Rafael M. Etto
- Department of Chemistry, Ponta Grossa State University (UEPG), CEP 84030–900, Ponta Grossa, PR, Brazil
| | - Maria B. R. Steffens
- Department of Biochemistry and Molecular Biology, Federal University of Parana, CEP 81531–980 Curitiba, Brazil
| | - Sindhu Chitteni-Pattu
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI, 53706–1544, United States of America
| | - Tyler Stanage
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI, 53706–1544, United States of America
| | - James L. Keck
- Department of Biomolecular Chemistry, University of Wisconsin School of Medicine and Public Health, Madison, WI, 53706, United States of America
| | - Michael M. Cox
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI, 53706–1544, United States of America
- * E-mail: (MC); (WL)
| |
Collapse
|
8
|
Velankar S, van Ginkel G, Alhroub Y, Battle GM, Berrisford JM, Conroy MJ, Dana JM, Gore SP, Gutmanas A, Haslam P, Hendrickx PMS, Lagerstedt I, Mir S, Fernandez Montecelo MA, Mukhopadhyay A, Oldfield TJ, Patwardhan A, Sanz-García E, Sen S, Slowley RA, Wainwright ME, Deshpande MS, Iudin A, Sahni G, Salavert Torres J, Hirshberg M, Mak L, Nadzirin N, Armstrong DR, Clark AR, Smart OS, Korir PK, Kleywegt GJ. PDBe: improved accessibility of macromolecular structure data from PDB and EMDB. Nucleic Acids Res 2016; 44:D385-95. [PMID: 26476444 PMCID: PMC4702783 DOI: 10.1093/nar/gkv1047] [Citation(s) in RCA: 103] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2015] [Accepted: 10/01/2015] [Indexed: 12/30/2022] Open
Abstract
The Protein Data Bank in Europe (http://pdbe.org) accepts and annotates depositions of macromolecular structure data in the PDB and EMDB archives and enriches, integrates and disseminates structural information in a variety of ways. The PDBe website has been redesigned based on an analysis of user requirements, and now offers intuitive access to improved and value-added macromolecular structure information. Unique value-added information includes lists of reviews and research articles that cite or mention PDB entries as well as access to figures and legends from full-text open-access publications that describe PDB entries. A powerful new query system not only shows all the PDB entries that match a given query, but also shows the 'best structures' for a given macromolecule, ligand complex or sequence family using data-quality information from the wwPDB validation reports. A PDBe RESTful API has been developed to provide unified access to macromolecular structure data available in the PDB and EMDB archives as well as value-added annotations, e.g. regarding structure quality and up-to-date cross-reference information from the SIFTS resource. Taken together, these new developments facilitate unified access to macromolecular structure data in an intuitive way for non-expert users and support expert users in analysing macromolecular structure data.
Collapse
Affiliation(s)
- Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Glen van Ginkel
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Younes Alhroub
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Gary M Battle
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - John M Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Matthew J Conroy
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jose M Dana
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Swanand P Gore
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Aleksandras Gutmanas
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Pauline Haslam
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Pieter M S Hendrickx
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ingvar Lagerstedt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Saqib Mir
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Manuel A Fernandez Montecelo
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Abhik Mukhopadhyay
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Thomas J Oldfield
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Ardan Patwardhan
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Eduardo Sanz-García
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sanchayita Sen
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Robert A Slowley
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Michael E Wainwright
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mandar S Deshpande
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Andrii Iudin
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Gaurav Sahni
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jose Salavert Torres
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Miriam Hirshberg
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Lora Mak
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - David R Armstrong
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Alice R Clark
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Oliver S Smart
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Paul K Korir
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Gerard J Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| |
Collapse
|
9
|
Vuong H, Cheng F, Lin CC, Zhao Z. Functional consequences of somatic mutations in cancer using protein pocket-based prioritization approach. Genome Med 2014; 6:81. [PMID: 25360158 PMCID: PMC4213513 DOI: 10.1186/s13073-014-0081-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 10/03/2014] [Indexed: 12/12/2022] Open
Abstract
Background Recently, a number of large-scale cancer genome sequencing projects have generated a large volume of somatic mutations; however, identifying the functional consequences and roles of somatic mutations in tumorigenesis remains a major challenge. Researchers have identified that protein pocket regions play critical roles in the interaction of proteins with small molecules, enzymes, and nucleic acid. As such, investigating the features of somatic mutations in protein pocket regions provides a promising approach to identifying new genotype-phenotype relationships in cancer. Methods In this study, we developed a protein pocket-based computational approach to uncover the functional consequences of somatic mutations in cancer. We mapped 1.2 million somatic mutations across 36 cancer types from the COSMIC database and The Cancer Genome Atlas (TCGA) onto the protein pocket regions of over 5,000 protein three-dimensional structures. We further integrated cancer cell line mutation profiles and drug pharmacological data from the Cancer Cell Line Encyclopedia (CCLE) onto protein pocket regions in order to identify putative biomarkers for anticancer drug responses. Results We found that genes harboring protein pocket somatic mutations were significantly enriched in cancer driver genes. Furthermore, genes harboring pocket somatic mutations tended to be highly co-expressed in a co-expressed protein interaction network. Using a statistical framework, we identified four putative cancer genes (RWDD1, NCF1, PLEK, and VAV3), whose expression profiles were associated with overall poor survival rates in melanoma, lung, or colorectal cancer patients. Finally, genes harboring protein pocket mutations were more likely to be drug-sensitive or drug-resistant. In a case study, we illustrated that the BAX gene was associated with the sensitivity of three anticancer drugs (midostaurin, vinorelbine, and tipifarnib). Conclusions This study provides novel insights into the functional consequences of somatic mutations during tumorigenesis and for anticancer drug responses. The computational approach used might be beneficial to the study of somatic mutations in the era of cancer precision medicine. Electronic supplementary material The online version of this article (doi:10.1186/s13073-014-0081-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Huy Vuong
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, 2525 West End Avenue, Suite 600, Nashville, TN 37203 USA
| | - Feixiong Cheng
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, 2525 West End Avenue, Suite 600, Nashville, TN 37203 USA
| | - Chen-Ching Lin
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, 2525 West End Avenue, Suite 600, Nashville, TN 37203 USA
| | - Zhongming Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, 2525 West End Avenue, Suite 600, Nashville, TN 37203 USA ; Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN 37232 USA ; Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN 37232 USA ; Center for Quantitative Sciences, Vanderbilt University Medical Center, Nashville, TN 37232 USA
| |
Collapse
|
10
|
Inhester T, Rarey M. Protein-ligand interaction databases: advanced tools to mine activity data and interactions on a structural level. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2014. [DOI: 10.1002/wcms.1192] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Therese Inhester
- Center for Bioinformatics; University of Hamburg; Hamburg Germany
| | - Matthias Rarey
- Center for Bioinformatics; University of Hamburg; Hamburg Germany
| |
Collapse
|
11
|
Hou Y, Li J, Li Y, Dong Z, Xia Q, Yuan YA. Crystal structure of Bombyx mori arylphorins reveals a 3:3 heterohexamer with multiple papain cleavage sites. Protein Sci 2014; 23:735-46. [PMID: 24639361 PMCID: PMC4093950 DOI: 10.1002/pro.2457] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Revised: 03/11/2014] [Accepted: 03/11/2014] [Indexed: 01/07/2023]
Abstract
In holometabolous insects, the accumulation and utilization of storage proteins (SPs), including arylphorins and methionine-rich proteins, are critical for the insect metamorphosis. SPs function as amino acids reserves, which are synthesized in fat body, secreted into the larval hemolymph and taken up by fat body shortly before pupation. However, the detailed molecular mechanisms of digestion and utilization of SPs during development are largely unknown. Here, we report the crystal structure of Bombyx mori arylphorins at 2.8 Å, which displays a heterohexameric structural arrangement formed by trimerization of dimers comprising two structural similar arylphorins. Our limited proteolysis assay and microarray data strongly suggest that papain-like proteases are the major players for B. mori arylphorins digestion in vitro and in vivo. Consistent with the biochemical data, dozens of papain cleavage sites are mapped on the surface of the heterohexameric structure of B. mori arylphorins. Hence, our results provide the insightful information to understand the metamorphosis of holometabolous insects at molecular level.
Collapse
Affiliation(s)
- Yong Hou
- State Key Laboratory of Silkworm Genome Biology, College of Biotechnology, Southwest UniversityBeibei, Chongqing, 400715, China,Department of Biological Sciences and Center for Bioimaging Sciences, National University of SingaporeSingapore, 117543, Singapore,SWU-NUS Joint Laboratory in Structural Genomics, Southwest UniversityBeibei, Chongqing, 400715, China
| | - Jianwei Li
- State Key Laboratory of Silkworm Genome Biology, College of Biotechnology, Southwest UniversityBeibei, Chongqing, 400715, China
| | - Yi Li
- State Key Laboratory of Silkworm Genome Biology, College of Biotechnology, Southwest UniversityBeibei, Chongqing, 400715, China
| | - Zhaoming Dong
- State Key Laboratory of Silkworm Genome Biology, College of Biotechnology, Southwest UniversityBeibei, Chongqing, 400715, China
| | - Qingyou Xia
- State Key Laboratory of Silkworm Genome Biology, College of Biotechnology, Southwest UniversityBeibei, Chongqing, 400715, China,SWU-NUS Joint Laboratory in Structural Genomics, Southwest UniversityBeibei, Chongqing, 400715, China
| | - Y Adam Yuan
- Department of Biological Sciences and Center for Bioimaging Sciences, National University of SingaporeSingapore, 117543, Singapore,SWU-NUS Joint Laboratory in Structural Genomics, Southwest UniversityBeibei, Chongqing, 400715, China,National University of Singapore (Suzhou) Research InstituteJiangsu, 215123, China,*Correspondence to: Y. Adam Yuan, Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117543, Singapore. E-mail:
| |
Collapse
|
12
|
Espinosa O, Mitsopoulos K, Hakas J, Pearl F, Zvelebil M. Deriving a mutation index of carcinogenicity using protein structure and protein interfaces. PLoS One 2014; 9:e84598. [PMID: 24454733 PMCID: PMC3893166 DOI: 10.1371/journal.pone.0084598] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Accepted: 11/16/2013] [Indexed: 11/29/2022] Open
Abstract
With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/.
Collapse
Affiliation(s)
- Octavio Espinosa
- Breakthrough Breast Cancer Research Centre, Institute of Cancer Research, London, United Kingdom
| | - Konstantinos Mitsopoulos
- Breakthrough Breast Cancer Research Centre, Institute of Cancer Research, London, United Kingdom
| | - Jarle Hakas
- Breakthrough Breast Cancer Research Centre, Institute of Cancer Research, London, United Kingdom
| | - Frances Pearl
- UK Cancer Therapeutics Unit, The Institute of Cancer Research, London, United Kingdom
- Translational Drug Discovery Group, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Marketa Zvelebil
- Breakthrough Breast Cancer Research Centre, Institute of Cancer Research, London, United Kingdom
| |
Collapse
|
13
|
Garbutt CC, Bangalore PV, Kannar P, Mukhtar MS. Getting to the edge: protein dynamical networks as a new frontier in plant-microbe interactions. FRONTIERS IN PLANT SCIENCE 2014; 5:312. [PMID: 25071795 PMCID: PMC4074768 DOI: 10.3389/fpls.2014.00312] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Accepted: 06/11/2014] [Indexed: 05/18/2023]
Abstract
A systems perspective on diverse phenotypes, mechanisms of infection, and responses to environmental stresses can lead to considerable advances in agriculture and medicine. A significant promise of systems biology within plants is the development of disease-resistant crop varieties, which would maximize yield output for food, clothing, building materials, and biofuel production. A systems or "-omics" perspective frames the next frontier in the search for enhanced knowledge of plant network biology. The functional understanding of network structure and dynamics is vital to expanding our knowledge of how the intercellular communication processes are executed. This review article will systematically discuss various levels of organization of systems biology beginning with the building blocks termed "-omes" and ending with complex transcriptional and protein-protein interaction networks. We will also highlight the prevailing computational modeling approaches of biological regulatory network dynamics. The latest developments in the "-omics" approach will be reviewed and discussed to underline and highlight novel technologies and research directions in plant network biology.
Collapse
Affiliation(s)
- Cassandra C. Garbutt
- Department of Biology, The University of Alabama at BirminghamBirmingham, AL, USA
| | - Purushotham V. Bangalore
- Department of Computer and Information Sciences, The University of Alabama at BirminghamBirmingham, AL, USA
| | - Pegah Kannar
- Department of Biology, The University of Alabama at BirminghamBirmingham, AL, USA
| | - M. S. Mukhtar
- Department of Biology, The University of Alabama at BirminghamBirmingham, AL, USA
- Nutrition Obesity Research Center, The University of Alabama at BirminghamBirmingham, AL, USA
- *Correspondence: M. S. Mukhtar, Department of Biology, The University of Alabama at Birmingham, Campbell Hall 369, 1300 University Boulevard, Birmingham, AL 35294-1170, USA e-mail:
| |
Collapse
|
14
|
Gutmanas A, Alhroub Y, Battle GM, Berrisford JM, Bochet E, Conroy MJ, Dana JM, Fernandez Montecelo MA, van Ginkel G, Gore SP, Haslam P, Hatherley R, Hendrickx PMS, Hirshberg M, Lagerstedt I, Mir S, Mukhopadhyay A, Oldfield TJ, Patwardhan A, Rinaldi L, Sahni G, Sanz-García E, Sen S, Slowley RA, Velankar S, Wainwright ME, Kleywegt GJ. PDBe: Protein Data Bank in Europe. Nucleic Acids Res 2013; 42:D285-91. [PMID: 24288376 PMCID: PMC3965016 DOI: 10.1093/nar/gkt1180] [Citation(s) in RCA: 105] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The Protein Data Bank in Europe (pdbe.org) is a founding member of the Worldwide PDB consortium (wwPDB; wwpdb.org) and as such is actively engaged in the deposition, annotation, remediation and dissemination of macromolecular structure data through the single global archive for such data, the PDB. Similarly, PDBe is a member of the EMDataBank organisation (emdatabank.org), which manages the EMDB archive for electron microscopy data. PDBe also develops tools that help the biomedical science community to make effective use of the data in the PDB and EMDB for their research. Here we describe new or improved services, including updated SIFTS mappings to other bioinformatics resources, a new browser for the PDB archive based on Gene Ontology (GO) annotation, updates to the analysis of Nuclear Magnetic Resonance-derived structures, redesigned search and browse interfaces, and new or updated visualisation and validation tools for EMDB entries.
Collapse
Affiliation(s)
- Aleksandras Gutmanas
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Lagerstedt I, Moore WJ, Patwardhan A, Sanz-García E, Best C, Swedlow JR, Kleywegt GJ. Web-based visualisation and analysis of 3D electron-microscopy data from EMDB and PDB. J Struct Biol 2013; 184:173-81. [PMID: 24113529 PMCID: PMC3898923 DOI: 10.1016/j.jsb.2013.09.021] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Revised: 09/24/2013] [Accepted: 09/25/2013] [Indexed: 11/25/2022]
Abstract
The Protein Data Bank in Europe (PDBe) has developed web-based tools for the visualisation and analysis of 3D electron microscopy (3DEM) structures in the Electron Microscopy Data Bank (EMDB) and Protein Data Bank (PDB). The tools include: (1) a volume viewer for 3D visualisation of maps, tomograms and models, (2) a slice viewer for inspecting 2D slices of tomographic reconstructions, and (3) visual analysis pages to facilitate analysis and validation of maps, tomograms and models. These tools were designed to help non-experts and experts alike to get some insight into the content and assess the quality of 3DEM structures in EMDB and PDB without the need to install specialised software or to download large amounts of data from these archives. The technical challenges encountered in developing these tools, as well as the more general considerations when making archived data available to the user community through a web interface, are discussed.
Collapse
Affiliation(s)
- Ingvar Lagerstedt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - William J. Moore
- Centre for Gene Regulation and Expression, College of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, United Kingdom
| | - Ardan Patwardhan
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Eduardo Sanz-García
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Christoph Best
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Jason R. Swedlow
- Centre for Gene Regulation and Expression, College of Life Sciences, University of Dundee, Dow Street, Dundee DD1 5EH, United Kingdom
| | - Gerard J. Kleywegt
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, United Kingdom
| |
Collapse
|
16
|
Gutmanas A, Oldfield TJ, Patwardhan A, Sen S, Velankar S, Kleywegt GJ. The role of structural bioinformatics resources in the era of integrative structural biology. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:710-21. [PMID: 23633580 PMCID: PMC3640467 DOI: 10.1107/s0907444913001157] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Accepted: 01/11/2013] [Indexed: 11/10/2022]
Abstract
The history and the current state of the PDB and EMDB archives is briefly described, as well as some of the challenges that they face. It seems natural that the role of structural biology archives will change from being a pure repository of historic data into becoming an indispensable resource for the wider biomedical community. As part of this transformation, it will be necessary to validate the biomacromolecular structure data and ensure the highest possible quality for the archive holdings, to combine structural data from different spatial scales into a unified resource and to integrate structural data with functional, genetic and taxonomic data as well as other information available in bioinformatics resources. Some recent developments and plans to address these challenges at PDBe are presented.
Collapse
Affiliation(s)
- Aleksandras Gutmanas
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Thomas J. Oldfield
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Ardan Patwardhan
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Sanchayita Sen
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Sameer Velankar
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Gerard J. Kleywegt
- Protein Data Bank in Europe, EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| |
Collapse
|
17
|
Hendrickx PMS, Gutmanas A, Kleywegt GJ. Vivaldi: visualization and validation of biomacromolecular NMR structures from the PDB. Proteins 2013. [PMID: 23180575 PMCID: PMC3618379 DOI: 10.1002/prot.24213] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We describe Vivaldi (VIsualization and VALidation DIsplay; http://pdbe.org/vivaldi), a web-based service for the analysis, visualization, and validation of NMR structures in the Protein Data Bank (PDB). Vivaldi provides access to model coordinates and several types of experimental NMR data using interactive visualization tools, augmented with structural annotations and model-validation information. The service presents information about the modeled NMR ensemble, validation of experimental chemical shifts, residual dipolar couplings, distance and dihedral angle constraints, as well as validation scores based on empirical knowledge and databases. Vivaldi was designed for both expert NMR spectroscopists and casual non-expert users who wish to obtain a better grasp of the information content and quality of NMR structures in the public archive. © Proteins 2013. © 2012 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Pieter M S Hendrickx
- Protein Data Bank in Europe, EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | | |
Collapse
|
18
|
Nucleotide binding architecture for secreted cytotoxic endoribonucleases. Biochimie 2012; 95:1087-97. [PMID: 23274129 DOI: 10.1016/j.biochi.2012.12.015] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Accepted: 12/13/2012] [Indexed: 12/20/2022]
Abstract
Vertebrate secreted RNases are small cationic protein endowed with an endoribonuclease activity that belong to the RNase A superfamily and display diverse cytotoxic activities. In an effort to unravel their mechanism of action, we have analysed their nucleotide binding recognition patterns. General shared features with other nucleotide binding proteins were deduced from overall statistics on the available structure complexes at the Protein Data Bank and compared with the particularities of selected representative endoribonuclease families. Results were compared with other endoribonuclease representative families and with the overall protein-nucleotide interaction features. Preferred amino acids and atom types involved in pair bonding interactions were identified, defining the spatial motives for phosphate, base and ribose building blocks. Together with the conserved catalytic triad at the active site, variability was observed for secondary binding subsites that may contribute to the proper substrate alignment and could explain the distinct substrate preference patterns. Highly conserved binding patterns were identified for the pyrimidine and purine subsites at the main and secondary base subsites. Particular substitution could be ascribed to specific adenine or guanine specificities. Distribution of evolutionary conserved residues were compared to search for the structure determinants that underlie their diverse catalytic efficiency and those that may account for putative physiological substrate targets or other non-catalytic biological activities that contribute to the antipathogen role of the RNases involved in the host defence system. A side by side comparison with another endoribonuclease superfamily of secreted cytotoxic proteins, the microbial RNases, was carried on to analyse the common features and peculiarities that rule their substrate recognition. The data provides the structural basis for the development of applied therapies targeting cellular nucleotide polymers.
Collapse
|
19
|
Velankar S, Dana JM, Jacobsen J, van Ginkel G, Gane PJ, Luo J, Oldfield TJ, O'Donovan C, Martin MJ, Kleywegt GJ. SIFTS: Structure Integration with Function, Taxonomy and Sequences resource. Nucleic Acids Res 2012. [PMID: 23203869 PMCID: PMC3531078 DOI: 10.1093/nar/gks1258] [Citation(s) in RCA: 174] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
The Structure Integration with Function, Taxonomy and Sequences resource (SIFTS; http://pdbe.org/sifts) is a close collaboration between the Protein Data Bank in Europe (PDBe) and UniProt. The two teams have developed a semi-automated process for maintaining up-to-date cross-reference information to UniProt entries, for all protein chains in the PDB entries present in the UniProt database. This process is carried out for every weekly PDB release and the information is stored in the SIFTS database. The SIFTS process includes cross-references to other biological resources such as Pfam, SCOP, CATH, GO, InterPro and the NCBI taxonomy database. The information is exported in XML format, one file for each PDB entry, and is made available by FTP. Many bioinformatics resources use SIFTS data to obtain cross-references between the PDB and other biological databases so as to provide their users with up-to-date information.
Collapse
Affiliation(s)
- Sameer Velankar
- Protein Data Bank in Europe, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Smart OS, Womack TO, Flensburg C, Keller P, Paciorek W, Sharff A, Vonrhein C, Bricogne G. Exploiting structure similarity in refinement: automated NCS and target-structure restraints in BUSTER. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:368-80. [PMID: 22505257 PMCID: PMC3322596 DOI: 10.1107/s0907444911056058] [Citation(s) in RCA: 485] [Impact Index Per Article: 40.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Accepted: 12/28/2011] [Indexed: 12/05/2023]
Abstract
Maximum-likelihood X-ray macromolecular structure refinement in BUSTER has been extended with restraints facilitating the exploitation of structural similarity. The similarity can be between two or more chains within the structure being refined, thus favouring NCS, or to a distinct 'target' structure that remains fixed during refinement. The local structural similarity restraints (LSSR) approach considers all distances less than 5.5 Å between pairs of atoms in the chain to be restrained. For each, the difference from the distance between the corresponding atoms in the related chain is found. LSSR applies a restraint penalty on each difference. A functional form that reaches a plateau for large differences is used to avoid the restraints distorting parts of the structure that are not similar. Because LSSR are local, there is no need to separate out domains. Some restraint pruning is still necessary, but this has been automated. LSSR have been available to academic users of BUSTER since 2009 with the easy-to-use -autoncs and -target target.pdb options. The use of LSSR is illustrated in the re-refinement of PDB entries 5rnt, where -target enables the correct ligand-binding structure to be found, and 1osg, where -autoncs contributes to the location of an additional copy of the cyclic peptide ligand.
Collapse
Affiliation(s)
- Oliver S Smart
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AX, England.
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Gore S, Velankar S, Kleywegt GJ. Implementing an X-ray validation pipeline for the Protein Data Bank. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:478-83. [PMID: 22505268 PMCID: PMC3322607 DOI: 10.1107/s0907444911050359] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2011] [Accepted: 11/23/2011] [Indexed: 11/10/2022]
Abstract
There is an increasing realisation that the quality of the biomacromolecular structures deposited in the Protein Data Bank (PDB) archive needs to be assessed critically using established and powerful validation methods. The Worldwide Protein Data Bank (wwPDB) organization has convened several Validation Task Forces (VTFs) to advise on the methods and standards that should be used to validate all of the entries already in the PDB as well as all structures that will be deposited in the future. The recommendations of the X-ray VTF are currently being implemented in a software pipeline. Here, ongoing work on this pipeline is briefly described as well as ways in which validation-related information could be presented to users of structural data.
Collapse
Affiliation(s)
- Swanand Gore
- Protein Data Bank in Europe (PDBe), EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| | - Gerard J. Kleywegt
- Protein Data Bank in Europe (PDBe), EMBL–EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, England
| |
Collapse
|
22
|
Weatheritt RJ, Luck K, Petsalaki E, Davey NE, Gibson TJ. The identification of short linear motif-mediated interfaces within the human interactome. ACTA ACUST UNITED AC 2012; 28:976-82. [PMID: 22328783 PMCID: PMC3315716 DOI: 10.1093/bioinformatics/bts072] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Eukaryotic proteins are highly modular, containing multiple interaction interfaces that mediate binding to a network of regulators and effectors. Recent advances in high-throughput proteomics have rapidly expanded the number of known protein-protein interactions (PPIs); however, the molecular basis for the majority of these interactions remains to be elucidated. There has been a growing appreciation of the importance of a subset of these PPIs, namely those mediated by short linear motifs (SLiMs), particularly the canonical and ubiquitous SH2, SH3 and PDZ domain-binding motifs. However, these motif classes represent only a small fraction of known SLiMs and outside these examples little effort has been made, either bioinformatically or experimentally, to discover the full complement of motif instances. RESULTS In this article, interaction data are analysed to identify and characterize an important subset of PPIs, those involving SLiMs binding to globular domains. To do this, we introduce iELM, a method to identify interactions mediated by SLiMs and add molecular details of the interaction interfaces to both interacting proteins. The method identifies SLiM-mediated interfaces from PPI data by searching for known SLiM-domain pairs. This approach was applied to the human interactome to identify a set of high-confidence putative SLiM-mediated PPIs. AVAILABILITY iELM is freely available at http://elmint.embl.de CONTACT toby.gibson@embl.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- R J Weatheritt
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | | | | | | | | |
Collapse
|
23
|
Hamming OJ, Kang L, Svensson A, Karlsen JL, Rahbek-Nielsen H, Paludan SR, Hjorth SA, Bondensgaard K, Hartmann R. Crystal structure of interleukin-21 receptor (IL-21R) bound to IL-21 reveals that sugar chain interacting with WSXWS motif is integral part of IL-21R. J Biol Chem 2012; 287:9454-60. [PMID: 22235133 DOI: 10.1074/jbc.m111.311084] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
IL-21 is a class I cytokine that exerts pleiotropic effects on both innate and adaptive immune responses. It signals through a heterodimeric receptor complex consisting of the IL-21 receptor (IL-21R) and the common γ-chain. A hallmark of the class I cytokine receptors is the class I cytokine receptor signature motif (WSXWS). The exact role of this motif has not been determined yet; however, it has been implicated in diverse functions, including ligand binding, receptor internalization, proper folding, and export, as well as signal transduction. Furthermore, the WXXW motif is known to be a consensus sequence for C-mannosylation. Here, we present the crystal structure of IL-21 bound to IL-21R and reveal that the WSXWS motif of IL-21R is C-mannosylated at the first tryptophan. We furthermore demonstrate that a sugar chain bridges the two fibronectin domains that constitute the extracellular domain of IL-21R and anchors at the WSXWS motif through an extensive hydrogen bonding network, including mannosylation. The glycan thus transforms the V-shaped receptor into an A-frame. This finding offers a novel structural explanation of the role of the class I cytokine signature motif.
Collapse
Affiliation(s)
- Ole J Hamming
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus C, Denmark
| | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Velankar S, Alhroub Y, Best C, Caboche S, Conroy MJ, Dana JM, Fernandez Montecelo MA, van Ginkel G, Golovin A, Gore SP, Gutmanas A, Haslam P, Hendrickx PMS, Heuson E, Hirshberg M, John M, Lagerstedt I, Mir S, Newman LE, Oldfield TJ, Patwardhan A, Rinaldi L, Sahni G, Sanz-García E, Sen S, Slowley R, Suarez-Uruena A, Swaminathan GJ, Symmons MF, Vranken WF, Wainwright M, Kleywegt GJ. PDBe: Protein Data Bank in Europe. Nucleic Acids Res 2011; 40:D445-52. [PMID: 22110033 PMCID: PMC3245096 DOI: 10.1093/nar/gkr998] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The Protein Data Bank in Europe (PDBe; pdbe.org) is a partner in the Worldwide PDB organization (wwPDB; wwpdb.org) and as such actively involved in managing the single global archive of biomacromolecular structure data, the PDB. In addition, PDBe develops tools, services and resources to make structure-related data more accessible to the biomedical community. Here we describe recently developed, extended or improved services, including an animated structure-presentation widget (PDBportfolio), a widget to graphically display the coverage of any UniProt sequence in the PDB (UniPDB), chemistry- and taxonomy-based PDB-archive browsers (PDBeXplore), and a tool for interactive visualization of NMR structures, corresponding experimental data as well as validation and analysis results (Vivaldi).
Collapse
Affiliation(s)
- S Velankar
- Protein Data Bank in Europe, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|