1
|
Wang W, Atherton P, Kreft M, Te Molder L, van der Poel S, Hoekman L, Celie P, Joosten RP, Fässler R, Perrakis A, Sonnenberg A. Caskin2 is a novel talin and Abi1-binding protein that promotes cell motility. J Cell Sci 2024:jcs.262116. [PMID: 38587458 DOI: 10.1242/jcs.262116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Accepted: 03/19/2024] [Indexed: 04/09/2024] Open
Abstract
Talin couples the actomyosin cytoskeleton to integrins and transmits tension to the extracellular matrix. Talin also interacts with numerous additional proteins capable of modulating the actin-integrin linkage and thus downstream mechanosignaling cascades. Here, we demonstrate that the scaffold protein Caskin2 interacts directly with the R8 domain of talin through its C-terminal LD motif. Caskin2 also associates with the WAVE Regulatory Complex to promote cell migration in an Abi1-dependent manner. Furthermore, we demonstrate that the Caskin2-Abi1 interaction is regulated by growth factor-induced phosphorylation of Caskin2 on serine 878. In MCF7 and UACC893 cells, which contain an amplification of CASKIN2, Caskin2 localizes in plasma membrane-associated plaques and around focal adhesions in CMSCs. Taken together, our results identify Caskin2 as a novel talin-binding protein that may not only connect integrin-mediated adhesion to actin polymerization, but could also play a role in crosstalk between integrins and microtubules.
Collapse
Affiliation(s)
- Wei Wang
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Paul Atherton
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
- Department of Molecular and Clinical Cancer Medicine, Institute of Systems, Molecular and Integrative Biology, The University of Liverpool, Liverpool, L69 7BE, UK
| | - Maaike Kreft
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Lisa Te Molder
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Sabine van der Poel
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Liesbeth Hoekman
- Proteomics Facility, The Netherlands Cancer Institute, The Netherlands
| | - Patrick Celie
- Oncode Institute and Division of Biochemistry, The Netherlands Cancer Institute, The Netherlands
| | - Robbie P Joosten
- Oncode Institute and Division of Biochemistry, The Netherlands Cancer Institute, The Netherlands
| | - Reinhard Fässler
- Department of Molecular Medicine, Max Planck Institute of Biochemistry, Germany
| | | | - Arnoud Sonnenberg
- Division of Cell Biology, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| |
Collapse
|
2
|
de Vries I, Ammerlaan D, Heidebrecht T, Celie PH, Geerke DP, Joosten RP, Perrakis A. Distant sequence regions of JBP1 contribute to J-DNA binding. Life Sci Alliance 2023; 6:e202302150. [PMID: 37328191 PMCID: PMC10276184 DOI: 10.26508/lsa.202302150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/18/2023] Open
Abstract
Base-J (β-D-glucopyranosyloxymethyluracil) is a modified DNA nucleotide that replaces 1% of thymine in kinetoplastid flagellates. The biosynthesis and maintenance of base-J depends on the base-J-binding protein 1 (JBP1) that has a thymidine hydroxylase domain and a J-DNA-binding domain (JDBD). How the thymidine hydroxylase domain synergizes with the JDBD to hydroxylate thymine in specific genomic sites, maintaining base-J during semi-conservative DNA replication, remains unclear. Here, we present a crystal structure of the JDBD including a previously disordered DNA-contacting loop and use it as starting point for molecular dynamics simulations and computational docking studies to propose recognition models for JDBD binding to J-DNA. These models guided mutagenesis experiments, providing additional data for docking, which reveals a binding mode for JDBD onto J-DNA. This model, together with the crystallographic structure of the TET2 JBP1-homologue in complex with DNA and the AlphaFold model of full-length JBP1, allowed us to hypothesize that the flexible JBP1 N-terminus contributes to DNA-binding, which we confirmed experimentally. Α high-resolution JBP1:J-DNA complex, which must involve conformational changes, would however need to be determined experimentally to further understand this unique underlying molecular mechanism that ensures replication of epigenetic information.
Collapse
Affiliation(s)
- Ida de Vries
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Danique Ammerlaan
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Tatjana Heidebrecht
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Patrick Hn Celie
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Daan P Geerke
- Department of Chemistry and Pharmaceutical Sciences, Amsterdam Institute of Molecular and Life Sciences (AIMMS) and Amsterdam Center for Multiscale Modeling (ACMM), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Robbie P Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| |
Collapse
|
3
|
Dialpuri JS, Bagdonas H, Atanasova M, Schofield LC, Hekkelman ML, Joosten RP, Agirre J. Analysis and validation of overall N-glycan conformation in Privateer. Acta Crystallogr D Struct Biol 2023:S2059798323003510. [PMID: 37219590 DOI: 10.1107/s2059798323003510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023] Open
Abstract
The oligosaccharides in N-glycosylation provide key structural and functional contributions to a glycoprotein. These contributions are dependent on the composition and overall conformation of the glycans. The Privateer software allows structural biologists to evaluate and improve the atomic structures of carbohydrates, including N-glycans; this software has recently been extended to check glycan composition through the use of glycomics data. Here, a broadening of the scope of the software to analyse and validate the overall conformation of N-glycans is presented, focusing on a newly compiled set of glycosidic linkage torsional preferences harvested from a curated set of glycoprotein models.
Collapse
Affiliation(s)
- Jordan S Dialpuri
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Haroldas Bagdonas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Lucy C Schofield
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Maarten L Hekkelman
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Robbie P Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| |
Collapse
|
4
|
Agirre J, Atanasova M, Bagdonas H, Ballard CB, Baslé A, Beilsten-Edmands J, Borges RJ, Brown DG, Burgos-Mármol JJ, Berrisford JM, Bond PS, Caballero I, Catapano L, Chojnowski G, Cook AG, Cowtan KD, Croll TI, Debreczeni JÉ, Devenish NE, Dodson EJ, Drevon TR, Emsley P, Evans G, Evans PR, Fando M, Foadi J, Fuentes-Montero L, Garman EF, Gerstel M, Gildea RJ, Hatti K, Hekkelman ML, Heuser P, Hoh SW, Hough MA, Jenkins HT, Jiménez E, Joosten RP, Keegan RM, Keep N, Krissinel EB, Kolenko P, Kovalevskiy O, Lamzin VS, Lawson DM, Lebedev AA, Leslie AGW, Lohkamp B, Long F, Malý M, McCoy AJ, McNicholas SJ, Medina A, Millán C, Murray JW, Murshudov GN, Nicholls RA, Noble MEM, Oeffner R, Pannu NS, Parkhurst JM, Pearce N, Pereira J, Perrakis A, Powell HR, Read RJ, Rigden DJ, Rochira W, Sammito M, Sánchez Rodríguez F, Sheldrick GM, Shelley KL, Simkovic F, Simpkin AJ, Skubak P, Sobolev E, Steiner RA, Stevenson K, Tews I, Thomas JMH, Thorn A, Valls JT, Uski V, Usón I, Vagin A, Velankar S, Vollmar M, Walden H, Waterman D, Wilson KS, Winn MD, Winter G, Wojdyr M, Yamashita K. The CCP4 suite: integrative software for macromolecular crystallography. Acta Crystallogr D Struct Biol 2023; 79:449-461. [PMID: 37259835 PMCID: PMC10233625 DOI: 10.1107/s2059798323003595] [Citation(s) in RCA: 84] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 06/02/2023] Open
Abstract
The Collaborative Computational Project No. 4 (CCP4) is a UK-led international collective with a mission to develop, test, distribute and promote software for macromolecular crystallography. The CCP4 suite is a multiplatform collection of programs brought together by familiar execution routines, a set of common libraries and graphical interfaces. The CCP4 suite has experienced several considerable changes since its last reference article, involving new infrastructure, original programs and graphical interfaces. This article, which is intended as a general literature citation for the use of the CCP4 software suite in structure determination, will guide the reader through such transformations, offering a general overview of the new features and outlining future developments. As such, it aims to highlight the individual programs that comprise the suite and to provide the latest references to them for perusal by crystallographers around the world.
Collapse
Affiliation(s)
- Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Haroldas Bagdonas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Charles B. Ballard
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Arnaud Baslé
- Biosciences Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - James Beilsten-Edmands
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Rafael J. Borges
- The Center of Medicinal Chemistry (CQMED), Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Av. Dr. André Tosello 550, 13083-886 Campinas, Brazil
| | - David G. Brown
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - J. Javier Burgos-Mármol
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - John M. Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Paul S. Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Lucrezia Catapano
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Atlanta G. Cook
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King’s Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Kevin D. Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tristan I. Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Altos Labs, Portway Building, Granta Park, Great Abington, Cambridge CB21 6GP, United Kingdom
| | - Judit É. Debreczeni
- Discovery Sciences, R&D BioPharmaceuticals, AstraZeneca, Darwin Building, Cambridge Science Park, Milton Road, Cambridge CB4 0WG, United Kingdom
| | - Nicholas E. Devenish
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Eleanor J. Dodson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tarik R. Drevon
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Paul Emsley
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Gwyndaf Evans
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Phil R. Evans
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Maria Fando
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - James Foadi
- Department of Mathematical Sciences, University of Bath, Bath, United Kingdom
| | - Luis Fuentes-Montero
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Elspeth F. Garman
- Department of Biochemistry, University of Oxford, Dorothy Crowfoot Hodgkin Building, Oxford OX1 3QU, United Kingdom
| | - Markus Gerstel
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Richard J. Gildea
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Kaushik Hatti
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Maarten L. Hekkelman
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Michael A. Hough
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Huw T. Jenkins
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Robbie P. Joosten
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Ronan M. Keegan
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Nicholas Keep
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck College, London WC1E 7HX, United Kingdom
| | - Eugene B. Krissinel
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Petr Kolenko
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
| | - Oleg Kovalevskiy
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Victor S. Lamzin
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - David M. Lawson
- Department of Biochemistry and Metabolism, John Innes Centre, Norwich NR4 7UH, United Kingdom
| | - Andrey A. Lebedev
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Andrew G. W. Leslie
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Bernhard Lohkamp
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, SE-171 77 Stockholm, Sweden
| | - Fei Long
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin Malý
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Airlie J. McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Stuart J. McNicholas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Ana Medina
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Claudia Millán
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - James W. Murray
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Garib N. Murshudov
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robert A. Nicholls
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin E. M. Noble
- Translational and Clinical Research Institute, Newcastle University, Paul O’Gorman Building, Medical School, Framlington Place, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Robert Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Navraj S. Pannu
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - James M. Parkhurst
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Nicholas Pearce
- Department of Physics, Chemistry and Biology (IFM), Linköping University, SE-581 83 Linköping, Sweden
| | - Joana Pereira
- Biozentrum and SIB Swiss Institute of Bioinformatics, University of Basel, 4056 Basel, Switzerland
| | - Anastassis Perrakis
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Harold R. Powell
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - William Rochira
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Massimo Sammito
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Discovery Centre, Biologics Engineering, AstraZeneca, Biomedical Campus, 1 Francis Crick Avenue, Trumpington, Cambridge CB2 0AA, United Kingdom
| | - Filomeno Sánchez Rodríguez
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - George M. Sheldrick
- Department of Structural Chemistry, Georg-August-Universität Göttingen, Tammannstrasse 4, 37077 Göttingen, Germany
| | - Kathryn L. Shelley
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Felix Simkovic
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Adam J. Simpkin
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - Pavol Skubak
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Roberto A. Steiner
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
- Department of Biomedical Sciences, University of Padova, Italy
| | - Kyle Stevenson
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Ivo Tews
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Jens M. H. Thomas
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Andrea Thorn
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Josep Triviño Valls
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Ville Uski
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Alexei Vagin
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Melanie Vollmar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Helen Walden
- School of Molecular Biosciences, College of Medical Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - David Waterman
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Keith S. Wilson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Martyn D. Winn
- Scientific Computing Department, Science and Technology Facilities Council, Didcot OX11 0FA, United Kingdom
| | - Graeme Winter
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited (United Kingdom), Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Keitaro Yamashita
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
5
|
Hekkelman ML, de Vries I, Joosten RP, Perrakis A. AlphaFill: enriching AlphaFold models with ligands and cofactors. Nat Methods 2023; 20:205-213. [PMID: 36424442 PMCID: PMC9911346 DOI: 10.1038/s41592-022-01685-y] [Citation(s) in RCA: 110] [Impact Index Per Article: 110.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 10/18/2022] [Indexed: 11/27/2022]
Abstract
Artificial intelligence-based protein structure prediction approaches have had a transformative effect on biomolecular sciences. The predicted protein models in the AlphaFold protein structure database, however, all lack coordinates for small molecules, essential for molecular structure or function: hemoglobin lacks bound heme; zinc-finger motifs lack zinc ions essential for structural integrity and metalloproteases lack metal ions needed for catalysis. Ligands important for biological function are absent too; no ADP or ATP is bound to any of the ATPases or kinases. Here we present AlphaFill, an algorithm that uses sequence and structure similarity to 'transplant' such 'missing' small molecules and ions from experimentally determined structures to predicted protein models. The algorithm was successfully validated against experimental structures. A total of 12,029,789 transplants were performed on 995,411 AlphaFold models and are available together with associated validation metrics in the alphafill.eu databank, a resource to help scientists make new hypotheses and design targeted experiments.
Collapse
Affiliation(s)
- Maarten L. Hekkelman
- grid.430814.a0000 0001 0674 1393Oncode Institute and Department of Biochemistry, The Netherlands Cancer Institute, Amsterdam, the Netherlands
| | - Ida de Vries
- grid.430814.a0000 0001 0674 1393Oncode Institute and Department of Biochemistry, The Netherlands Cancer Institute, Amsterdam, the Netherlands
| | - Robbie P. Joosten
- grid.430814.a0000 0001 0674 1393Oncode Institute and Department of Biochemistry, The Netherlands Cancer Institute, Amsterdam, the Netherlands
| | - Anastassis Perrakis
- Oncode Institute and Department of Biochemistry, The Netherlands Cancer Institute, Amsterdam, the Netherlands.
| |
Collapse
|
6
|
Agirre J, Joosten RP, Roseman A. Model building and beyond. Acta Crystallogr D Struct Biol 2022; 78:1192-1193. [PMID: 36189739 PMCID: PMC9527763 DOI: 10.1107/s2059798322009330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
An introduction to the Proceedings of the 2020 CCP4 Study Weekend on model building which are available as a virtual issue at https://journals.iucr.org/special_issues/2020/CCP42020/.
Collapse
|
7
|
Westbrook JD, Young JY, Shao C, Feng Z, Guranovic V, Lawson CL, Vallat B, Adams PD, Berrisford JM, Bricogne G, Diederichs K, Joosten RP, Keller P, Moriarty NW, Sobolev OV, Velankar S, Vonrhein C, Waterman DG, Kurisu G, Berman HM, Burley SK, Peisach E. PDBx/mmCIF Ecosystem: Foundational Semantic Tools for Structural Biology. J Mol Biol 2022; 434:167599. [PMID: 35460671 DOI: 10.1016/j.jmb.2022.167599] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/31/2022] [Accepted: 04/13/2022] [Indexed: 02/07/2023]
Abstract
PDBx/mmCIF, Protein Data Bank Exchange (PDBx) macromolecular Crystallographic Information Framework (mmCIF), has become the data standard for structural biology. With its early roots in the domain of small-molecule crystallography, PDBx/mmCIF provides an extensible data representation that is used for deposition, archiving, remediation, and public dissemination of experimentally determined three-dimensional (3D) structures of biological macromolecules by the Worldwide Protein Data Bank (wwPDB, wwpdb.org). Extensions of PDBx/mmCIF are similarly used for computed structure models by ModelArchive (modelarchive.org), integrative/hybrid structures by PDB-Dev (pdb-dev.wwpdb.org), small angle scattering data by Small Angle Scattering Biological Data Bank SASBDB (sasbdb.org), and for models computed generated with the AlphaFold 2.0 deep learning software suite (alphafold.ebi.ac.uk). Community-driven development of PDBx/mmCIF spans three decades, involving contributions from researchers, software and methods developers in structural sciences, data repository providers, scientific publishers, and professional societies. Having a semantically rich and extensible data framework for representing a wide range of structural biology experimental and computational results, combined with expertly curated 3D biostructure data sets in public repositories, accelerates the pace of scientific discovery. Herein, we describe the architecture of the PDBx/mmCIF data standard, tools used to maintain representations of the data standard, governance, and processes by which data content standards are extended, plus community tools/software libraries available for processing and checking the integrity of PDBx/mmCIF data. Use cases exemplify how the members of the Worldwide Protein Data Bank have used PDBx/mmCIF as the foundation for its pipeline for delivering Findable, Accessible, Interoperable, and Reusable (FAIR) data to many millions of users worldwide.
Collapse
Affiliation(s)
- John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA
| | - Jasmine Y Young
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Chenghua Shao
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Vladimir Guranovic
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Catherine L Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Paul D Adams
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Department of Bioengineering, University of California at Berkeley, Berkeley, CA 94720, USA
| | - John M Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gerard Bricogne
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | | | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, the Netherlands; Oncode Institute, 3521 AL Utrecht, the Netherlands. https://www.twitter.com/Robbie_Joosten
| | - Peter Keller
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Clemens Vonrhein
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | - David G Waterman
- UKRI-STFC Rutherford Appleton Laboratory, Didcot OX11 0FA, UK; CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, UK. https://www.twitter.com/upintheair
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; The Bridge Institute, Michelson Center for Convergent Bioscience, University of Southern California, Los Angeles, CA, USA
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA.
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
8
|
Atanasova M, Nicholls RA, Joosten RP, Agirre J. Updated restraint dictionaries for carbohydrates in the pyranose form. Acta Crystallogr D Struct Biol 2022; 78:455-465. [PMID: 35362468 PMCID: PMC8972801 DOI: 10.1107/s2059798322001103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 01/31/2022] [Indexed: 11/10/2022] Open
Abstract
Restraint dictionaries are used during macromolecular structure refinement to encapsulate intramolecular connectivity and geometric information. These dictionaries allow previously determined `ideal' values of features such as bond lengths, angles and torsions to be used as restraint targets. During refinement, restraints influence the model to adopt a conformation that agrees with prior observation. This is especially important when refining crystal structures of glycosylated proteins, as their resolutions tend to be worse than those of nonglycosylated proteins. Pyranosides, the overwhelming majority component in all forms of protein glycosylation, often display conformational errors in crystal structures. Whilst many of these flaws usually relate to model building, refinement issues may also have their root in suboptimal restraint dictionaries. In order to avoid subsequent misinterpretation and to improve the quality of all pyranose monosaccharide entries in the CCP4 Monomer Library, new dictionaries with improved ring torsion restraints, coordinates reflecting the lowest-energy ring pucker and updated geometry have been produced and evaluated. These new dictionaries are now part of the CCP4 Monomer Library and will be released with CCP4 version 8.0.
Collapse
Affiliation(s)
- Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robbie P. Joosten
- Biochemistry Department, Netherlands Cancer Institute, Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| |
Collapse
|
9
|
Joosten RP, Nicholls RA, Agirre J. Towards consistency in geometry restraints for carbohydrates in the pyranose form: modern dictionary generators reviewed. Curr Med Chem 2021; 29:1193-1207. [PMID: 34477506 PMCID: PMC7612510 DOI: 10.2174/0929867328666210902140754] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/03/2021] [Accepted: 08/07/2021] [Indexed: 11/23/2022]
Abstract
Macromolecular restrained refinement is nowadays the most used method for improving the agreement between an atomic structural model and experimental data. Restraint dictionaries, a key tool behind the success of the method, allow fine-tuning geometric properties such as distances and angles between atoms beyond simplistic expectations. Dictionary generators can provide restraint target estimates derived from different sources, from fully theoretical to experimental and any combination in between. Carbohydrates are stereochemically complex biomolecules and, in their pyranose form, have clear conformational preferences. As such, they pose unique problems to dictionary generators and in the course of this study, require special attention from software developers. Functional differences between restraint generators will be discussed, as well as the process of achieving consistent results with different software designs. The study will conclude a set of practical considerations, as well as recommendations for the generation of new restraint dictionaries, using the improved software alternatives discussed.
Collapse
Affiliation(s)
- Robbie P Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam. Netherlands
| | - Robert A Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH. United Kingdom
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, YO10 5DD. United Kingdom
| |
Collapse
|
10
|
de Vries I, Kwakman T, Lu XJ, Hekkelman ML, Deshpande M, Velankar S, Perrakis A, Joosten RP. New restraints and validation approaches for nucleic acid structures in PDB-REDO. Acta Crystallogr D Struct Biol 2021; 77:1127-1141. [PMID: 34473084 PMCID: PMC8411979 DOI: 10.1107/s2059798321007610] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 07/26/2021] [Indexed: 11/10/2022] Open
Abstract
The quality of macromolecular structure models crucially depends on refinement and validation targets, which optimally describe the expected chemistry. Commonly used software for these two procedures has been designed and developed in a protein-centric manner, resulting in relatively few established features for the refinement and validation of nucleic acid-containing structure models. Here, new nucleic acid-specific approaches implemented in PDB-REDO are described, including a new restraint model using noncovalent geometries (base-pair hydrogen bonding and base-pair stacking) as refinement targets. New validation routines are also presented, including a metric for Watson-Crick base-pair geometry normality (ZbpG). Applying the PDB-REDO pipeline with the new restraint model to the whole Protein Data Bank (PDB) demonstrates an overall positive effect on the quality of nucleic acid-containing structure models. Finally, we discuss examples of improvements in the geometry of specific nucleic acid structures in the PDB. The new PDB-REDO models and pipeline are available at https://pdb-redo.eu/.
Collapse
Affiliation(s)
- Ida de Vries
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Tim Kwakman
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Xiang-Jun Lu
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Maarten L. Hekkelman
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Mandar Deshpande
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
| | - Anastassis Perrakis
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Robbie P. Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
11
|
Nicholls RA, Wojdyr M, Joosten RP, Catapano L, Long F, Fischer M, Emsley P, Murshudov GN. The missing link: covalent linkages in structural models. Acta Crystallogr D Struct Biol 2021; 77:727-745. [PMID: 34076588 PMCID: PMC8171067 DOI: 10.1107/s2059798321003934] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/13/2021] [Indexed: 11/10/2022] Open
Abstract
Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifying the correct chemistry, obtaining an appropriate restraint dictionary and ensuring its correct application. The analysis presented herein assesses the extent of problems involving covalent linkages in the Protein Data Bank (PDB). Not only will this facilitate the remediation of existing models, but also, more importantly, it will inform and thus improve the quality of future linkages. By considering linkages of known type in the CCP4 Monomer Library (CCP4-ML), failure to model a covalent linkage is identified to result in inaccurate (systematically longer) interatomic distances. Scanning the PDB for proximal atom pairs that do not have a corresponding type in the CCP4-ML reveals a large number of commonly occurring types of unannotated potential linkages; in general, these may or may not be covalently linked. Manual consideration of the most commonly occurring cases identifies a number of genuine classes of covalent linkages. The recent expansion of the CCP4-ML is discussed, which has involved the addition of over 16 000 and the replacement of over 11 000 component dictionaries using AceDRG. As part of this effort, the CCP4-ML has also been extended using AceDRG link dictionaries for the aforementioned linkage types identified in this analysis. This will facilitate the identification of such linkage types in future modelling efforts, whilst concurrently easing the process involved in their application. The need for a universal standard for maintaining link records corresponding to covalent linkages, and references to the associated dictionaries used during modelling and refinement, following deposition to the PDB is emphasized. The importance of correctly modelling covalent linkages is demonstrated using a case study, which involves the covalent linkage of an inhibitor to the main protease in various viral species, including SARS-CoV-2. This example demonstrates the importance of properly modelling covalent linkages using a comprehensive restraint dictionary, as opposed to just using a single interatomic distance restraint or failing to model the covalent linkage at all.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited, Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Robbie P. Joosten
- Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Lucrezia Catapano
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcus Fischer
- Chemical Biology and Therapeutics and Structural Biology, St Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105-3678, USA
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
12
|
Nicholls RA, Joosten RP, Long F, Wojdyr M, Lebedev A, Krissinel E, Catapano L, Fischer M, Emsley P, Murshudov GN. Modelling covalent linkages in CCP4. Acta Crystallogr D Struct Biol 2021; 77:712-726. [PMID: 34076587 PMCID: PMC8171069 DOI: 10.1107/s2059798321001753] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 02/12/2021] [Indexed: 11/10/2022] Open
Abstract
In this contribution, the current protocols for modelling covalent linkages within the CCP4 suite are considered. The mechanism used for modelling covalent linkages is reviewed: the use of dictionaries for describing changes to stereochemistry as a result of the covalent linkage and the application of link-annotation records to structural models to ensure the correct treatment of individual instances of covalent linkages. Previously, linkage descriptions were lacking in quality compared with those of contemporary component dictionaries. Consequently, AceDRG has been adapted for the generation of link dictionaries of the same quality as for individual components. The approach adopted by AceDRG for the generation of link dictionaries is outlined, which includes associated modifications to the linked components. A number of tools to facilitate the practical modelling of covalent linkages available within the CCP4 suite are described, including a new restraint-dictionary accumulator, the Make Covalent Link tool and AceDRG interface in Coot, the 3D graphical editor JLigand and the mechanisms for dealing with covalent linkages in the CCP4i2 and CCP4 Cloud environments. These integrated solutions streamline and ease the covalent-linkage modelling workflow, seamlessly transferring relevant information between programs. Current recommended practice is elucidated by means of instructive practical examples. By summarizing the different approaches to modelling linkages that are available within the CCP4 suite, limitations and potential pitfalls that may be encountered are highlighted in order to raise awareness, with the intention of improving the quality of future modelled covalent linkages in macromolecular complexes.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robbie P. Joosten
- Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited, Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Andrey Lebedev
- CCP4, STFC Rutherford Appleton Laboratory, Chilton, Didcot OX11 0QX, United Kingdom
| | - Eugene Krissinel
- CCP4, STFC Rutherford Appleton Laboratory, Chilton, Didcot OX11 0QX, United Kingdom
| | - Lucrezia Catapano
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Marcus Fischer
- Chemical Biology and Therapeutics and Structural Biology, St Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105-3678, USA
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
13
|
van Beusekom B, Damaskos G, Hekkelman ML, Salgado-Polo F, Hiruma Y, Perrakis A, Joosten RP. LAHMA: structure analysis through local annotation of homology-matched amino acids. Acta Crystallogr D Struct Biol 2021; 77:28-40. [PMID: 33404523 PMCID: PMC7787103 DOI: 10.1107/s2059798320014473] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Accepted: 10/30/2020] [Indexed: 11/11/2022] Open
Abstract
Comparison of homologous structure models is a key step in analyzing protein structure. With a wealth of homologous structures, comparison becomes a tedious process, and often only a small (user-biased) selection of data is used. A multitude of structural superposition algorithms are then typically used to visualize the structures together in 3D and to compare them. Here, the Local Annotation of Homology-Matched Amino acids (LAHMA) website (https://lahma.pdb-redo.eu) is presented, which compares any structure model with all of its close homologs from the PDB-REDO databank. LAHMA displays structural features in sequence space, allowing users to uncover differences between homologous structure models that can be analyzed for their relevance to chemistry or biology. LAHMA visualizes numerous structural features, also allowing one-click comparison of structure-quality plots (for example the Ramachandran plot) and `in-browser' structural visualization of 3D models.
Collapse
Affiliation(s)
- Bart van Beusekom
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - George Damaskos
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Maarten L. Hekkelman
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Fernando Salgado-Polo
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Yoshitaka Hiruma
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Robbie P. Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
14
|
Lange J, Baakman C, Pistorius A, Krieger E, Hooft R, Joosten RP, Vriend G. Facilities that make the PDB data collection more powerful. Protein Sci 2019; 29:330-344. [PMID: 31724231 PMCID: PMC6933850 DOI: 10.1002/pro.3788] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 11/11/2019] [Indexed: 01/13/2023]
Abstract
We describe a series of databases and tools that directly or indirectly support biomedical research on macromolecules, with focus on their applicability in protein structure bioinformatics research. DSSP, that determines secondary structures of proteins, has been updated to work well with extremely large structures in multiple formats. The PDBREPORT database that lists anomalies in protein structures has been remade to remove many small problems. These reports are now available as PDF-formatted files with a computer-readable summary. The VASE software has been added to analyze and visualize HSSP multiple sequence alignments for protein structures. The Lists collection of databases has been extended with a series of databases, most noticeably with a database that gives each protein structure a grade for usefulness in protein structure bioinformatics projects. The PDB-REDO collection of reanalyzed and re-refined protein structures that were solved by X-ray crystallography has been improved by dealing better with sugar residues and with hydrogen bonds, and adding many missing surface loops. All academic software underlying these protein structure bioinformatics applications and databases are now publicly accessible, either directly from the authors or from the GitHub software repository.
Collapse
Affiliation(s)
- Joanna Lange
- Bio-Prodict, Nijmegen, The Netherlands.,Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, Nijmegen, The Netherlands
| | - Coos Baakman
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, Nijmegen, The Netherlands
| | - Arthur Pistorius
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, Nijmegen, The Netherlands
| | - Elmar Krieger
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, Nijmegen, The Netherlands
| | - Rob Hooft
- Department of Computer Science, Dutch Techcentre for Life Sciences (DTL), Amsterdam, The Netherlands.,Department of Computer Science, Vrije Universiteit Amsterdam (VU), Amsterdam, The Netherlands
| | - Robbie P Joosten
- Biochemistry department, Netherlands Cancer Institute (NKI), Amsterdam, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, Nijmegen, The Netherlands.,Baco Institute of Protein Science (BIPS), Mindoro, Philippines
| |
Collapse
|
15
|
van Beusekom B, Wezel N, Hekkelman ML, Perrakis A, Emsley P, Joosten RP. Building and rebuilding N-glycans in protein structure models. Acta Crystallogr D Struct Biol 2019; 75:416-425. [PMID: 30988258 PMCID: PMC6465985 DOI: 10.1107/s2059798319003875] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 03/20/2019] [Indexed: 01/16/2023]
Abstract
Carbohydrates are automatically built and rebuilt using Coot in the PDB-REDO pipeline. N-Glycosylation is one of the most common post-translational modifications and is implicated in, for example, protein folding and interaction with ligands and receptors. N-Glycosylation trees are complex structures of linked carbohydrate residues attached to asparagine residues. While carbohydrates are typically modeled in protein structures, they are often incomplete or have the wrong chemistry. Here, new tools are presented to automatically rebuild existing glycosylation trees, to extend them where possible, and to add new glycosylation trees if they are missing from the model. The method has been incorporated in the PDB-REDO pipeline and has been applied to build or rebuild 16 452 carbohydrate residues in 11 651 glycosylation trees in 4498 structure models, and is also available from the PDB-REDO web server. With better modeling of N-glycosylation, the biological function of this important modification can be better and more easily understood.
Collapse
Affiliation(s)
- Bart van Beusekom
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Natasja Wezel
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Maarten L Hekkelman
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Paul Emsley
- MRC Laboratory for Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, England
| | - Robbie P Joosten
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
16
|
Morris C, Andreetto P, Banci L, Bonvin AMJJ, Chojnowski G, Cano LD, Carazo JM, Conesa P, Daenke S, Damaskos G, Giachetti A, Haley NEC, Hekkelman ML, Heuser P, Joosten RP, Kouřil D, Křenek A, Kulhánek T, Lamzin VS, Nadzirin N, Perrakis A, Rosato A, Sanderson F, Segura J, Schaarschmidt J, Sobolev E, Traldi S, Trellet ME, Velankar S, Verlato M, Winn M. West-Life: A Virtual Research Environment for structural biology. J Struct Biol X 2019; 1:100006. [PMID: 32647812 PMCID: PMC7337051 DOI: 10.1016/j.yjsbx.2019.100006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Data processing and data management services for structural biology. Enhancements to existing web services for structure solution and analysis. New pipelines to link these services into more complex higher-level workflows. New data management facilities. Making the benefits of European e-Infrastructures more accessible to structural biologists.
The West-Life project (https://about.west-life.eu/) is a Horizon 2020 project funded by the European Commission to provide data processing and data management services for the international community of structural biologists, and in particular to support integrative experimental approaches within the field of structural biology. It has developed enhancements to existing web services for structure solution and analysis, created new pipelines to link these services into more complex higher-level workflows, and added new data management facilities. Through this work it has striven to make the benefits of European e-Infrastructures more accessible to life-science researchers in general and structural biologists in particular.
Collapse
Affiliation(s)
| | | | - Lucia Banci
- Magnetic Resonance Center, University of Florence, Italy
| | | | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, c/o DESY, Notkestr. 85, 22607 Hamburg, Germany
| | | | | | | | | | - George Damaskos
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | | | | | - Maarten L Hekkelman
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestr. 85, 22607 Hamburg, Germany
| | - Robbie P Joosten
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | | | | | | | - Victor S Lamzin
- European Molecular Biology Laboratory, c/o DESY, Notkestr. 85, 22607 Hamburg, Germany
| | - Nurul Nadzirin
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | - Anastassis Perrakis
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Antonio Rosato
- Magnetic Resonance Center, University of Florence, Italy
| | | | | | | | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestr. 85, 22607 Hamburg, Germany
| | | | | | - Sameer Velankar
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | | | | |
Collapse
|
17
|
Roorda JC, Joosten RP, Perrakis A, Hiruma Y. A crystal structure of the human protein kinase Mps1 reveals an ordered conformation of the activation loop. Proteins 2019; 87:348-352. [PMID: 30582207 PMCID: PMC6590424 DOI: 10.1002/prot.25651] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 12/18/2018] [Accepted: 12/19/2018] [Indexed: 11/11/2022]
Abstract
Monopolar spindle 1 (Mps1) is a dual-specificity protein kinase, orchestrating faithful chromosome segregation during mitosis. All reported structures of the Mps1 kinase adopt the hallmarks of an inactive conformation, which includes a mostly disordered activation loop. Here, we present a 2.4 Å resolution crystal structure of an "extended" version of the Mps1 kinase domain, which shows an ordered activation loop. However, the other structural characteristics of an active kinase are not present. Our structure shows that the Mps1 activation loop can fit to the ATP binding pocket and interferes with ATP, but less so with inhibitors binding, partly explain the potency of various Mps1 inhibitors.
Collapse
Affiliation(s)
- Jacomina C Roorda
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Robbie P Joosten
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Yoshitaka Hiruma
- Division of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| |
Collapse
|
18
|
van Beusekom B, Heidebrecht T, Adamopoulos A, Fish A, Pardon E, Steyaert J, Joosten RP, Perrakis A. Characterization and structure determination of a llama-derived nanobody targeting the J-base binding protein 1. Acta Crystallogr F Struct Biol Commun 2018; 74:690-695. [PMID: 30387773 PMCID: PMC6213982 DOI: 10.1107/s2053230x18010282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Accepted: 07/16/2018] [Indexed: 01/06/2023] Open
Abstract
J-base binding protein 1 (JBP1) contributes to the biosynthesis and maintenance of base J (β-D-glucosylhydroxymethyluracil), a modification of thymidine confined to some protozoa. Camelid (llama) single-domain antibody fragments (nanobodies) targeting JBP1 were produced for use as crystallization chaperones. Surface plasmon resonance screening identified Nb6 as a strong binder, recognizing JBP1 with a 1:1 stoichiometry and high affinity (Kd = 30 nM). Crystallization trials of JBP1 in complex with Nb6 yielded crystals that diffracted to 1.47 Å resolution. However, the dimensions of the asymmetric unit and molecular replacement with a nanobody structure clearly showed that the crystals of the expected complex with JBP1 were of the nanobody alone. Nb6 crystallizes in space group P31 with two molecules in the asymmetric unit; its crystal structure was refined to a final resolution of 1.64 Å. Ensemble refinement suggests that in the ligand-free state one of the complementarity-determining regions (CDRs) is flexible, while the other two adopt well defined conformations.
Collapse
Affiliation(s)
- Bart van Beusekom
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Tatjana Heidebrecht
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Athanassios Adamopoulos
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Alexander Fish
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Els Pardon
- VIB–VUB Center for Structural Biology, VIB, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Jan Steyaert
- VIB–VUB Center for Structural Biology, VIB, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Robbie P. Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
19
|
van Beusekom B, Joosten K, Hekkelman ML, Joosten RP, Perrakis A. Homology-based loop modeling yields more complete crystallographic protein structures. IUCrJ 2018; 5:585-594. [PMID: 30224962 PMCID: PMC6126648 DOI: 10.1107/s2052252518010552] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Accepted: 07/23/2018] [Indexed: 06/08/2023]
Abstract
Inherent protein flexibility, poor or low-resolution diffraction data or poorly defined electron-density maps often inhibit the building of complete structural models during X-ray structure determination. However, recent advances in crystallographic refinement and model building often allow completion of previously missing parts. This paper presents algorithms that identify regions missing in a certain model but present in homologous structures in the Protein Data Bank (PDB), and 'graft' these regions of interest. These new regions are refined and validated in a fully automated procedure. Including these developments in the PDB-REDO pipeline has enabled the building of 24 962 missing loops in the PDB. The models and the automated procedures are publicly available through the PDB-REDO databank and webserver. More complete protein structure models enable a higher quality public archive but also a better understanding of protein function, better comparison between homologous structures and more complete data mining in structural bioinformatics projects.
Collapse
Affiliation(s)
- Bart van Beusekom
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, The Netherlands
| | - Krista Joosten
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, The Netherlands
| | - Maarten L. Hekkelman
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, The Netherlands
| | - Robbie P. Joosten
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, The Netherlands
| | - Anastassis Perrakis
- Department of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066CX, The Netherlands
| |
Collapse
|
20
|
Abstract
Glycosylation is one of the most common forms of protein post-translational modification, but is also the most complex. Dealing with glycoproteins in structure model building, refinement, validation and PDB deposition is more error-prone than dealing with nonglycosylated proteins owing to limitations of the experimental data and available software tools. Also, experimentalists are typically less experienced in dealing with carbohydrate residues than with amino-acid residues. The results of the reannotation and re-refinement by PDB-REDO of 8114 glycoprotein structure models from the Protein Data Bank are analyzed. The positive aspects of 3620 reannotations and subsequent refinement, as well as the remaining challenges to obtaining consistently high-quality carbohydrate models, are discussed.
Collapse
Affiliation(s)
- Bart van Beusekom
- Division of Biochemistry, Netherlands Cancer Insitute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Thomas Lütteke
- Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Frankfurter Strasse 100, 35392 Giessen, Germany
| | - Robbie P. Joosten
- Division of Biochemistry, Netherlands Cancer Insitute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
21
|
Read RJ, Oeffner RD, Joosten RP. On the information content of X-ray diffraction data. Acta Crystallogr A Found Adv 2018. [DOI: 10.1107/s010876731809699x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|
22
|
van Beusekom B, Touw WG, Tatineni M, Somani S, Rajagopal G, Luo J, Gilliland GL, Perrakis A, Joosten RP. Homology-based hydrogen bond information improves crystallographic structures in the PDB. Protein Sci 2018; 27:798-808. [PMID: 29168245 PMCID: PMC5818736 DOI: 10.1002/pro.3353] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 11/16/2017] [Accepted: 11/16/2017] [Indexed: 01/06/2023]
Abstract
The Protein Data Bank (PDB) is the global archive for structural information on macromolecules, and a popular resource for researchers, teachers, and students, amassing more than one million unique users each year. Crystallographic structure models in the PDB (more than 100,000 entries) are optimized against the crystal diffraction data and geometrical restraints. This process of crystallographic refinement typically ignored hydrogen bond (H-bond) distances as a source of information. However, H-bond restraints can improve structures at low resolution where diffraction data are limited. To improve low-resolution structure refinement, we present methods for deriving H-bond information either globally from well-refined high-resolution structures from the PDB-REDO databank, or specifically from on-the-fly constructed sets of homologous high-resolution structures. Refinement incorporating HOmology DErived Restraints (HODER), improves geometrical quality and the fit to the diffraction data for many low-resolution structures. To make these improvements readily available to the general public, we applied our new algorithms to all crystallographic structures in the PDB: using massively parallel computing, we constructed a new instance of the PDB-REDO databank (https://pdb-redo.eu). This resource is useful for researchers to gain insight on individual structures, on specific protein families (as we demonstrate with examples), and on general features of protein structure using data mining approaches on a uniformly treated dataset.
Collapse
Affiliation(s)
- Bart van Beusekom
- Department of BiochemistryNetherlands Cancer Institute, Plesmanlaan 121Amsterdam1066 CXThe Netherlands
| | - Wouter G. Touw
- Department of BiochemistryNetherlands Cancer Institute, Plesmanlaan 121Amsterdam1066 CXThe Netherlands
| | - Mahidhar Tatineni
- San Diego Supercomputer Center, University of California, San Diego, 9500 Gilman DriveLa JollaCalifornia92093‐0505
| | - Sandeep Somani
- Discovery Sciences, Janssen R&D LLCSpring HousePennsylvania
| | | | - Jinquan Luo
- Janssen BioTherapeutics, Janssen R&D LLCSpring HousePennsylvania
| | | | - Anastassis Perrakis
- Department of BiochemistryNetherlands Cancer Institute, Plesmanlaan 121Amsterdam1066 CXThe Netherlands
| | - Robbie P. Joosten
- Department of BiochemistryNetherlands Cancer Institute, Plesmanlaan 121Amsterdam1066 CXThe Netherlands
| |
Collapse
|
23
|
Delbart F, Brams M, Gruss F, Noppen S, Peigneur S, Boland S, Chaltin P, Brandao-Neto J, von Delft F, Touw WG, Joosten RP, Liekens S, Tytgat J, Ulens C. An allosteric binding site of the α7 nicotinic acetylcholine receptor revealed in a humanized acetylcholine-binding protein. J Biol Chem 2017; 293:2534-2545. [PMID: 29237730 PMCID: PMC5818190 DOI: 10.1074/jbc.m117.815316] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2017] [Revised: 10/24/2017] [Indexed: 11/06/2022] Open
Abstract
Nicotinic acetylcholine receptors (nAChRs) belong to the family of pentameric ligand-gated ion channels and mediate fast excitatory transmission in the central and peripheral nervous systems. Among the different existing receptor subtypes, the homomeric α7 nAChR has attracted considerable attention because of its possible implication in several neurological and psychiatric disorders, including cognitive decline associated with Alzheimer's disease or schizophrenia. Allosteric modulators of ligand-gated ion channels are of particular interest as therapeutic agents, as they modulate receptor activity without affecting normal fluctuations of synaptic neurotransmitter release. Here, we used X-ray crystallography and surface plasmon resonance spectroscopy of α7-acetylcholine-binding protein (AChBP), a humanized chimera of a snail AChBP, which has 71% sequence similarity with the extracellular ligand-binding domain of the human α7 nAChR, to investigate the structural determinants of allosteric modulation. We extended previous observations that an allosteric site located in the vestibule of the receptor offers an attractive target for receptor modulation. We introduced seven additional humanizing mutations in the vestibule-located binding site of AChBP to improve its suitability as a model for studying allosteric binding. Using a fragment-based screening approach, we uncovered an allosteric binding site located near the β8-β9 loop, which critically contributes to coupling ligand binding to channel opening in human α7 nAChR. This work expands our understanding of the topology of allosteric binding sites in AChBP and, by extrapolation, in the human α7 nAChR as determined by electrophysiology measurements. Our insights pave the way for drug design strategies targeting nAChRs involved in ion channel-mediated disorders.
Collapse
Affiliation(s)
- Florian Delbart
- From the Department of Cellular and Molecular Medicine, Laboratory of Structural Neurobiology, Faculty of Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Marijke Brams
- From the Department of Cellular and Molecular Medicine, Laboratory of Structural Neurobiology, Faculty of Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Fabian Gruss
- From the Department of Cellular and Molecular Medicine, Laboratory of Structural Neurobiology, Faculty of Medicine, KU Leuven, 3000 Leuven, Belgium
| | - Sam Noppen
- the Department of Microbiology and Immunology, Laboratory of Virology and Chemotherapy, Rega Institute for Medical Research, KU Leuven, 3000 Leuven, Belgium
| | - Steve Peigneur
- the Laboratory of Toxicology and Pharmacology, Faculty of Pharmaceutical Sciences, KU Leuven, 3000 Leuven, Belgium
| | - Sandro Boland
- the Center for Innovation and Stimulation of Drug Discovery Leuven, Cistim Leuven vzw, 3001 Heverlee, Belgium
| | - Patrick Chaltin
- the Center for Innovation and Stimulation of Drug Discovery Leuven, Cistim Leuven vzw, 3001 Heverlee, Belgium.,the Center for Innovation and Stimulation of Drug Discovery Leuven and Center for Drug Design and Discovery, KU Leuven, 3001 Heverlee, Belgium
| | - Jose Brandao-Neto
- Diamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot OX11 0QX, United Kingdom, and
| | - Frank von Delft
- Diamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot OX11 0QX, United Kingdom, and
| | - Wouter G Touw
- the Division of Biochemistry, Netherlands Cancer Institute, 1066CX Amsterdam, The Netherlands
| | - Robbie P Joosten
- the Division of Biochemistry, Netherlands Cancer Institute, 1066CX Amsterdam, The Netherlands
| | - Sandra Liekens
- the Department of Microbiology and Immunology, Laboratory of Virology and Chemotherapy, Rega Institute for Medical Research, KU Leuven, 3000 Leuven, Belgium
| | - Jan Tytgat
- the Laboratory of Toxicology and Pharmacology, Faculty of Pharmaceutical Sciences, KU Leuven, 3000 Leuven, Belgium
| | - Chris Ulens
- From the Department of Cellular and Molecular Medicine, Laboratory of Structural Neurobiology, Faculty of Medicine, KU Leuven, 3000 Leuven, Belgium,
| |
Collapse
|
24
|
Hiruma Y, Koch A, Hazraty N, Tsakou F, Medema RH, Joosten RP, Perrakis A. Understanding inhibitor resistance in Mps1 kinase through novel biophysical assays and structures. J Biol Chem 2017; 292:14496-14504. [PMID: 28726638 DOI: 10.1074/jbc.m117.783555] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 07/06/2017] [Indexed: 01/09/2023] Open
Abstract
Monopolar spindle 1 (Mps1/TTK) is a protein kinase essential in mitotic checkpoint signaling, preventing anaphase until all chromosomes are properly attached to spindle microtubules. Mps1 has emerged as a potential target for cancer therapy, and a variety of compounds have been developed to inhibit its kinase activity. Mutations in the catalytic domain of Mps1 that give rise to inhibitor resistance, but retain catalytic activity and do not display cross-resistance to other Mps1 inhibitors, have been described. Here we characterize the interactions of two such mutants, Mps1 C604Y and C604W, which raise resistance to two closely related compounds, NMS-P715 and its derivative Cpd-5, but not to the well characterized Mps1 inhibitor, reversine. We show that estimates of the IC50 (employing a novel specific and efficient assay that utilizes a fluorescently labeled substrate) and the binding affinity (KD ) indicate that, in both mutants, Cpd-5 should be better tolerated than the closely related NMS-P715. To gain further insight, we determined the crystal structure of the Mps1 kinase mutants bound to Cpd-5 and NMS-P715 and compared the binding modes of Cpd-5, NMS-P715, and reversine. The difference in steric hindrance between Tyr/Trp604 and the trifluoromethoxy moiety of NMS-P715, the methoxy moiety of Cpd-5, and complete absence of such a group in reversine, account for differences we observe in vitro Our analysis enforces the notion that inhibitors targeting Mps1 drug-resistant mutations can emerge as a feasible intervention strategy based on existing scaffolds, if the clinical need arises.
Collapse
Affiliation(s)
| | - Andre Koch
- Cell Biology, Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands
| | | | | | - René H Medema
- Cell Biology, Netherlands Cancer Institute, 1066 CX Amsterdam, The Netherlands
| | | | | |
Collapse
|
25
|
Abstract
Glycoproteins and protein-carbohydrate complexes in the worldwide Protein Data Bank (wwPDB) can be an excellent source of information for glycoscientists. Unfortunately, a rather large number of errors and inconsistencies is found in the glycan moieties of these 3D structures. This review illustrates frequent problems of carbohydrate moieties in wwPDB entries, such as nomenclature issues, incorrect N-glycan core structures, missing or erroneous linkages, or poor glycan geometry, and describes the carbohydrate-specific validation tools that are designed to identify such problems. Recommendations how to avoid these issues or how to rectify incorrect structures are also given.
Collapse
Affiliation(s)
- Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Thomas Lütteke
- Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Frankfurter Str. 100, 35392 Giessen, Germany.
| |
Collapse
|
26
|
Hiruma Y, Koch A, Dharadhar S, Joosten RP, Perrakis A. Structural basis of reversine selectivity in inhibiting Mps1 more potently than aurora B kinase. Proteins 2016; 84:1761-1766. [PMID: 27699881 DOI: 10.1002/prot.25174] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Revised: 09/20/2016] [Accepted: 09/24/2016] [Indexed: 12/13/2022]
Abstract
Monopolar spindle 1 (Mps1, also known as TTK) is a protein kinase crucial for ensuring that cell division progresses to anaphase only after all chromosomes are connected to spindle microtubules. Incomplete chromosomal attachment leads to abnormal chromosome counts in the daughter cells (aneuploidy), a condition common in many solid cancers. Therefore Mps1 is an established target in cancer therapy. Mps1 kinase inhibitors include reversine (2-(4-morpholinoanilino)-6-cyclohexylaminopurine), a promiscuous compound first recognized as an inhibitor of the Aurora B mitotic kinase. Here, we present the 3.0-Å resolution crystal structure of the Mps1 kinase domain bound to reversine. Structural comparison of reversine bound to Mps1 and Aurora B, indicates a similar binding pose for the purine moiety of reversine making three conserved hydrogen bonds to the protein main chain, explaining the observed promiscuity of this inhibitor. The cyclohexyl and morpholinoaniline moieties of reversine however, have more extensive contacts with the protein in Mps1 than in Aurora B. This is reflected both in structure-based docking energy calculations, and in new experimental data we present here, that both confirm that the affinity of reversine towards Mps1 is about two orders of magnitude higher than towards Aurora B. Thus, our data provides detailed structural understanding of the existing literature that argues reversine inhibits Mps1 more efficiently than Aurora B based on biochemical and in-cell assays. Proteins 2016; 84:1761-1766. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yoshitaka Hiruma
- Division of Biochemistry, Netherlands Cancer Institute, 1066CX, Amsterdam, The Netherlands
| | - Andre Koch
- Division of Cell Biology, Netherlands Cancer Institute, 1066CX, Amsterdam, The Netherlands
| | - Shreya Dharadhar
- Division of Biochemistry, Netherlands Cancer Institute, 1066CX, Amsterdam, The Netherlands
| | - Robbie P Joosten
- Division of Biochemistry, Netherlands Cancer Institute, 1066CX, Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Division of Biochemistry, Netherlands Cancer Institute, 1066CX, Amsterdam, The Netherlands
| |
Collapse
|
27
|
Touw WG, van Beusekom B, Evers JMG, Vriend G, Joosten RP. Validation and correction of Zn-Cys xHis y complexes. Acta Crystallogr D Struct Biol 2016; 72:1110-1118. [PMID: 27710932 PMCID: PMC5053137 DOI: 10.1107/s2059798316013036] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 08/12/2016] [Indexed: 11/10/2022] Open
Abstract
Many crystal structures in the Protein Data Bank contain zinc ions in a geometrically distorted tetrahedral complex with four Cys and/or His ligands. A method is presented to automatically validate and correct these zinc complexes. Analysis of the corrected zinc complexes shows that the average Zn-Cys distances and Cys-Zn-Cys angles are a function of the number of cysteines and histidines involved. The observed trends can be used to develop more context-sensitive targets for model validation and refinement.
Collapse
Affiliation(s)
- Wouter G. Touw
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Bart van Beusekom
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Jochem M. G. Evers
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| | - Robbie P. Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
28
|
Hausmann J, Keune WJ, Hipgrave Ederveen AL, van Zeijl L, Joosten RP, Perrakis A. Structural snapshots of the catalytic cycle of the phosphodiesterase Autotaxin. J Struct Biol 2016; 195:199-206. [PMID: 27268273 DOI: 10.1016/j.jsb.2016.06.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Revised: 05/17/2016] [Accepted: 06/03/2016] [Indexed: 01/01/2023]
Abstract
Autotaxin (ATX) is a secreted phosphodiesterase that produces the signalling lipid lysophosphatidic acid (LPA). The bimetallic active site of ATX is structurally related to the alkaline phosphatase superfamily. Here, we present a new crystal structure of ATX in complex with orthovanadate (ATX-VO5), which binds the Oγ nucleophile of Thr209 and adopts a trigonal bipyramidal conformation, following the nucleophile attack onto the substrate. We have now a portfolio of ATX structures we discuss as intermediates of the catalytic mechanism: the new ATX-VO5 structure; a unique structure where the nucleophile Thr209 is phosphorylated (ATX-pThr). Comparing these to a complex with the LPA product (ATX-LPA) and with a complex with a phosphate ion (ATX-PO4), that represent the Michaelis complex of the reaction, we observe movements of Thr209, changes in the relative displacement of the zinc ions, and a water molecule that likely fulfils the second nucleophilic attack. We propose that ATX follows the associative two-step in-line displacement mechanism.
Collapse
Affiliation(s)
- Jens Hausmann
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.
| | - Willem-Jan Keune
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Agnes L Hipgrave Ederveen
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Leonie van Zeijl
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Robbie P Joosten
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.
| |
Collapse
|
29
|
Dym O, Song W, Felder C, Roth E, Shnyrov V, Ashani Y, Xu Y, Joosten RP, Weiner L, Sussman JL, Silman I. The impact of crystallization conditions on structure-based drug design: A case study on the methylene blue/acetylcholinesterase complex. Protein Sci 2016; 25:1096-114. [PMID: 26990888 DOI: 10.1002/pro.2923] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 03/07/2016] [Indexed: 11/05/2022]
Abstract
Structure-based drug design utilizes apoprotein or complex structures retrieved from the PDB. >57% of crystallographic PDB entries were obtained with polyethylene glycols (PEGs) as precipitant and/or as cryoprotectant, but <6% of these report presence of individual ethyleneglycol oligomers. We report a case in which ethyleneglycol oligomers' presence in a crystal structure markedly affected the bound ligand's position. Specifically, we compared the positions of methylene blue and decamethonium in acetylcholinesterase complexes obtained using isomorphous crystals precipitated with PEG200 or ammonium sulfate. The ligands' positions within the active-site gorge in complexes obtained using PEG200 are influenced by presence of ethyleneglycol oligomers in both cases bound to W84 at the gorge's bottom, preventing interaction of the ligand's proximal quaternary group with its indole. Consequently, both ligands are ∼3.0Å further up the gorge than in complexes obtained using crystals precipitated with ammonium sulfate, in which the quaternary groups make direct π-cation interactions with the indole. These findings have implications for structure-based drug design, since data for ligand-protein complexes with polyethylene glycol as precipitant may not reflect the ligand's position in its absence, and could result in selecting incorrect drug discovery leads. Docking methylene blue into the structure obtained with PEG200, but omitting the ethyleneglycols, yields results agreeing poorly with the crystal structure; excellent agreement is obtained if they are included. Many proteins display features in which precipitants might lodge. It will be important to investigate presence of precipitants in published crystal structures, and whether it has resulted in misinterpreting electron density maps, adversely affecting drug design.
Collapse
Affiliation(s)
- Orly Dym
- Israel Structural Proteomics Center, Weizmann Institute of Science, Rehovot, 76100, Israel.,Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Wanling Song
- CAS Key Laboratory of Receptor Research, Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences (CAS), Shanghai (22), China
| | - Clifford Felder
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Esther Roth
- Department of Neurobiology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Valery Shnyrov
- Department of Biochemistry and Molecular Biology, Universidad de Salamanca, Salamanca, 37007, Spain
| | - Yacov Ashani
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Yechun Xu
- CAS Key Laboratory of Receptor Research, Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences (CAS), Shanghai (22), China
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, CX, 1066, the Netherlands
| | - Lev Weiner
- Department of Chemical Research Support, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Joel L Sussman
- Israel Structural Proteomics Center, Weizmann Institute of Science, Rehovot, 76100, Israel.,Department of Structural Biology, Weizmann Institute of Science, Rehovot, 76100, Israel
| | - Israel Silman
- Department of Neurobiology, Weizmann Institute of Science, Rehovot, 76100, Israel
| |
Collapse
|
30
|
Touw WG, Joosten RP, Vriend G. New Biological Insights from Better Structure Models. J Mol Biol 2016; 428:1375-1393. [PMID: 26869101 DOI: 10.1016/j.jmb.2016.02.002] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Revised: 01/04/2016] [Accepted: 02/01/2016] [Indexed: 02/01/2023]
Abstract
Structure validation is a key component of all steps in the structure determination process, from structure building, refinement, deposition, and evaluation all the way to post-deposition optimisation of structures in the Protein Data Bank (PDB) by re-refinement and re-building. Today, many aspects of protein structures are understood better than 10years ago, and combined with improved software and more computing power, the automated PDB_REDO procedure can significantly improve about 85% of all X-ray structures ever deposited in the PDB. We review structure validation, structure improvement, and a series of validation resources and facilities that give access to improved PDB files and to reports on the quality of the original and the improved structures. Post-deposition optimisation generally leads to improved protein structures and a series of examples will illustrate how that, in turn, leads to improved or even novel biological insights.
Collapse
Affiliation(s)
- Wouter G Touw
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands.
| |
Collapse
|
31
|
Abstract
The use of macromolecular structures is widespread for a variety of applications, from teaching protein structure principles all the way to ligand optimization in drug development. Applying data mining techniques on these experimentally determined structures requires a highly uniform, standardized structural data source. The Protein Data Bank (PDB) has evolved over the years toward becoming the standard resource for macromolecular structures. However, the process selecting the data most suitable for specific applications is still very much based on personal preferences and understanding of the experimental techniques used to obtain these models. In this chapter, we will first explain the challenges with data standardization, annotation, and uniformity in the PDB entries determined by X-ray crystallography. We then discuss the specific effect that crystallographic data quality and model optimization methods have on structural models and how validation tools can be used to make informed choices. We also discuss specific advantages of using the PDB_REDO databank as a resource for structural data. Finally, we will provide guidelines on how to select the most suitable protein structure models for detailed analysis and how to select a set of structure models suitable for data mining.
Collapse
Affiliation(s)
- Bart van Beusekom
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Anastassis Perrakis
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX, Amsterdam, The Netherlands.
| |
Collapse
|
32
|
|
33
|
Touw WG, Joosten RP, Vriend G. Detection of trans-cis flips and peptide-plane flips in protein structures. ACTA ACUST UNITED AC 2015; 71:1604-14. [PMID: 26249342 PMCID: PMC4528797 DOI: 10.1107/s1399004715008263] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2015] [Accepted: 04/27/2015] [Indexed: 11/13/2022]
Abstract
A method is presented to detect peptide bonds that need either a trans–cis flip or a peptide-plane flip. A coordinate-based method is presented to detect peptide bonds that need correction either by a peptide-plane flip or by a trans–cis inversion of the peptide bond. When applied to the whole Protein Data Bank, the method predicts 4617 trans–cis flips and many thousands of hitherto unknown peptide-plane flips. A few examples are highlighted for which a correction of the peptide-plane geometry leads to a correction of the understanding of the structure–function relation. All data, including 1088 manually validated cases, are freely available and the method is available from a web server, a web-service interface and through WHAT_CHECK.
Collapse
Affiliation(s)
- Wouter G Touw
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Center, Geert Grooteplein-Zuid 26-28, 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
34
|
Abstract
We present a series of databanks (http://swift.cmbi.ru.nl/gv/facilities/) that hold information that is computationally derived from Protein Data Bank (PDB) entries and that might augment macromolecular structure studies. These derived databanks run parallel to the PDB, i.e. they have one entry per PDB entry. Several of the well-established databanks such as HSSP, PDBREPORT and PDB_REDO have been updated and/or improved. The software that creates the DSSP databank, for example, has been rewritten to better cope with π-helices. A large number of databanks have been added to aid computational structural biology; some examples are lists of residues that make crystal contacts, lists of contacting residues using a series of contact definitions or lists of residue accessibilities. PDB files are not the optimal presentation of the underlying data for many studies. We therefore made a series of databanks that hold PDB files in an easier to use or more consistent representation. The BDB databank holds X-ray PDB files with consistently represented B-factors. We also added several visualization tools to aid the users of our databanks.
Collapse
Affiliation(s)
- Wouter G Touw
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands
| | - Coos Baakman
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands
| | - Jon Black
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands
| | - Tim A H te Beek
- Bio-Prodict BV, Nieuwe Marktstraat 54E, 6511 AA Nijmegen, The Netherlands
| | - E Krieger
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands
| | - Robbie P Joosten
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics, CMBI, Radboud university medical center, Geert Grooteplein Zuid 26-28 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
35
|
Joosten RP, Long F, Murshudov GN, Perrakis A. The PDB_REDO server for macromolecular structure model optimization. IUCrJ 2014; 1:213-20. [PMID: 25075342 PMCID: PMC4107921 DOI: 10.1107/s2052252514009324] [Citation(s) in RCA: 574] [Impact Index Per Article: 57.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Accepted: 04/25/2014] [Indexed: 05/18/2023]
Abstract
The refinement and validation of a crystallographic structure model is the last step before the coordinates and the associated data are submitted to the Protein Data Bank (PDB). The success of the refinement procedure is typically assessed by validating the models against geometrical criteria and the diffraction data, and is an important step in ensuring the quality of the PDB public archive [Read et al. (2011 ▶), Structure, 19, 1395-1412]. The PDB_REDO procedure aims for 'constructive validation', aspiring to consistent and optimal refinement parameterization and pro-active model rebuilding, not only correcting errors but striving for optimal interpretation of the electron density. A web server for PDB_REDO has been implemented, allowing thorough, consistent and fully automated optimization of the refinement procedure in REFMAC and partial model rebuilding. The goal of the web server is to help practicing crystallo-graphers to improve their model prior to submission to the PDB. For this, additional steps were implemented in the PDB_REDO pipeline, both in the refinement procedure, e.g. testing of resolution limits and k-fold cross-validation for small test sets, and as new validation criteria, e.g. the density-fit metrics implemented in EDSTATS and ligand validation as implemented in YASARA. Innovative ways to present the refinement and validation results to the user are also described, which together with auto-generated Coot scripts can guide users to subsequent model inspection and improvement. It is demonstrated that using the server can lead to substantial improvement of structure models before they are submitted to the PDB.
Collapse
Affiliation(s)
- Robbie P. Joosten
- Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - Fei Long
- Structural Studies Division, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0HQ, England
| | - Garib N. Murshudov
- Structural Studies Division, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0HQ, England
| | - Anastassis Perrakis
- Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| |
Collapse
|
36
|
Rimsa V, Eadsforth TC, Joosten RP, Hunter WN. High-resolution structure of the M14-type cytosolic carboxypeptidase from Burkholderia cenocepacia refined exploiting PDB_REDO strategies. ACTA ACUST UNITED AC 2014; 70:279-89. [PMID: 24531462 PMCID: PMC3940198 DOI: 10.1107/s1399004713026801] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 09/30/2013] [Indexed: 01/01/2023]
Abstract
The structure of a bacterial M14-family carboxypeptidase determined exploiting microfocus synchrotron radiation and highly automated refinement protocols reveals its potential to act as a polyglutamylase. A potential cytosolic metallocarboxypeptidase from Burkholderia cenocepacia has been crystallized and a synchrotron-radiation microfocus beamline allowed the acquisition of diffraction data to 1.9 Å resolution. The asymmetric unit comprises a tetramer containing over 1500 amino acids, and the high-throughput automated protocols embedded in PDB_REDO were coupled with model–map inspections in refinement. This approach has highlighted the value of such protocols for efficient analyses. The subunit is constructed from two domains. The N-terminal domain has previously only been observed in cytosolic carboxypeptidase (CCP) proteins. The C-terminal domain, which carries the Zn2+-containing active site, serves to classify this protein as a member of the M14D subfamily of carboxypeptidases. Although eukaryotic CCPs possess deglutamylase activity and are implicated in processing modified tubulin, the function and substrates of the bacterial family members remain unknown. The B. cenocepacia protein did not display deglutamylase activity towards a furylacryloyl glutamate derivative, a potential substrate. Residues previously shown to coordinate the divalent cation and that contribute to peptide-bond cleavage in related enzymes such as bovine carboxypeptidase are conserved. The location of a conserved basic patch in the active site adjacent to the catalytic Zn2+, where an acetate ion is identified, suggests recognition of the carboxy-terminus in a similar fashion to other carboxypeptidases. However, there are significant differences that indicate the recognition of substrates with different properties. Of note is the presence of a lysine in the S1′ recognition subsite that suggests specificity towards an acidic substrate.
Collapse
Affiliation(s)
- Vadim Rimsa
- Division of Biological Chemistry and Drug Discovery, College of Life Sciences, University of Dundee, Dundee DD1 5EH, Scotland
| | - Thomas C Eadsforth
- Division of Biological Chemistry and Drug Discovery, College of Life Sciences, University of Dundee, Dundee DD1 5EH, Scotland
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | - William N Hunter
- Division of Biological Chemistry and Drug Discovery, College of Life Sciences, University of Dundee, Dundee DD1 5EH, Scotland
| |
Collapse
|
37
|
Joosten RP, Soueidan H, Wessels LFA, Perrakis A. Timely deposition of macromolecular structures is necessary for peer review. Acta Crystallogr D Biol Crystallogr 2013; 69:2293-5. [PMID: 24311569 PMCID: PMC3852646 DOI: 10.1107/s0907444913024621] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2013] [Accepted: 09/03/2013] [Indexed: 11/10/2022]
Abstract
Most of the macromolecular structures in the Protein Data Bank (PDB), which are used daily by thousands of educators and scientists alike, are determined by X-ray crystallography. It was examined whether the crystallographic models and data were deposited to the PDB at the same time as the publications that describe them were submitted for peer review. This condition is necessary to ensure pre-publication validation and the quality of the PDB public archive. It was found that a significant proportion of PDB entries were submitted to the PDB after peer review of the corresponding publication started, and many were only submitted after peer review had ended. It is argued that clear description of journal policies and effective policing is important for pre-publication validation, which is key in ensuring the quality of the PDB and of peer-reviewed literature.
Collapse
Affiliation(s)
- Robbie P Joosten
- Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | | | | | | |
Collapse
|
38
|
Cereto-Massagué A, Ojeda MJ, Joosten RP, Valls C, Mulero M, Salvado MJ, Arola-Arnal A, Arola L, Garcia-Vallvé S, Pujadas G. The good, the bad and the dubious: VHELIBS, a validation helper for ligands and binding sites. J Cheminform 2013; 5:36. [PMID: 23895374 PMCID: PMC3733808 DOI: 10.1186/1758-2946-5-36] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Accepted: 07/18/2013] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Many Protein Data Bank (PDB) users assume that the deposited structural models are of high quality but forget that these models are derived from the interpretation of experimental data. The accuracy of atom coordinates is not homogeneous between models or throughout the same model. To avoid basing a research project on a flawed model, we present a tool for assessing the quality of ligands and binding sites in crystallographic models from the PDB. RESULTS The Validation HElper for LIgands and Binding Sites (VHELIBS) is software that aims to ease the validation of binding site and ligand coordinates for non-crystallographers (i.e., users with little or no crystallography knowledge). Using a convenient graphical user interface, it allows one to check how ligand and binding site coordinates fit to the electron density map. VHELIBS can use models from either the PDB or the PDB_REDO databank of re-refined and re-built crystallographic models. The user can specify threshold values for a series of properties related to the fit of coordinates to electron density (Real Space R, Real Space Correlation Coefficient and average occupancy are used by default). VHELIBS will automatically classify residues and ligands as Good, Dubious or Bad based on the specified limits. The user is also able to visually check the quality of the fit of residues and ligands to the electron density map and reclassify them if needed. CONCLUSIONS VHELIBS allows inexperienced users to examine the binding site and the ligand coordinates in relation to the experimental data. This is an important step to evaluate models for their fitness for drug discovery purposes such as structure-based pharmacophore development and protein-ligand docking experiments.
Collapse
Affiliation(s)
- Adrià Cereto-Massagué
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - María José Ojeda
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, Amsterdam 1066 CX, The Netherlands
| | - Cristina Valls
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - Miquel Mulero
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - M Josepa Salvado
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - Anna Arola-Arnal
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
| | - Lluís Arola
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
- Centre Tecnològic de Nutrició i Salut (CTNS), TECNIO, CEICS, Avinguda Universitat 1, Reus, Catalonia 43204, Spain
| | - Santiago Garcia-Vallvé
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
- Centre Tecnològic de Nutrició i Salut (CTNS), TECNIO, CEICS, Avinguda Universitat 1, Reus, Catalonia 43204, Spain
| | - Gerard Pujadas
- Grup de Recerca en Nutrigenòmica, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, Campus de Sescelades, C/ Marceŀlí Domingo s/n, Tarragona, Catalonia 43007, Spain
- Centre Tecnològic de Nutrició i Salut (CTNS), TECNIO, CEICS, Avinguda Universitat 1, Reus, Catalonia 43204, Spain
| |
Collapse
|
39
|
Jansen S, Perrakis A, Ulens C, Winkler C, Andries M, Joosten RP, Van Acker M, Luyten FP, Moolenaar WH, Bollen M. Structure of NPP1, an ectonucleotide pyrophosphatase/phosphodiesterase involved in tissue calcification. Structure 2012; 20:1948-59. [PMID: 23041369 DOI: 10.1016/j.str.2012.09.001] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2012] [Revised: 08/25/2012] [Accepted: 09/02/2012] [Indexed: 10/27/2022]
Abstract
Ectonucleotide pyrophosphatase/phosphodiesterase-1 (NPP1) converts extracellular nucleotides into inorganic pyrophosphate, whereas its close relative NPP2/autotaxin hydrolyzes lysophospholipids. NPP1 regulates calcification in mineralization-competent tissues, and a lack of NPP1 function underlies calcification disorders. Here, we show that NPP1 forms homodimers via intramembrane disulfide bonding, but is also processed intracellularly to a secreted monomer. The structure of secreted NPP1 reveals a characteristic bimetallic active site and a nucleotide-binding groove, but it lacks the lipid-binding pocket and open tunnel present in NPP2. A loop adjacent to the nucleotide-binding site, which is disordered in NPP2, is well ordered in NPP1 and might promote nucleotide binding. Remarkably, the N-terminal somatomedin B-like domains of NPP1, unlike those in NPP2, are flexible and do not contact the catalytic domain. Our results provide a structural basis for the nucleotide pyrophosphatase activity of NPP1 and help to understand how disease-causing mutations may affect NPP1 structure and function.
Collapse
Affiliation(s)
- Silvia Jansen
- Laboratory of Biosignaling and Therapeutics, Department of Cellular and Molecular Medicine, University of Leuven, 3000 Leuven, Belgium
| | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Joosten RP, Joosten K, Murshudov GN, Perrakis A. PDB_REDO: constructive validation, more than just looking for errors. Acta Crystallogr D Biol Crystallogr 2012; 68:484-96. [PMID: 22505269 PMCID: PMC3322608 DOI: 10.1107/s0907444911054515] [Citation(s) in RCA: 170] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/03/2011] [Accepted: 12/18/2011] [Indexed: 11/17/2022]
Abstract
Developments of the PDB_REDO procedure that combine re-refinement and rebuilding within a unique decision-making framework to improve structures in the PDB are presented. PDB_REDO uses a variety of existing and custom-built software modules to choose an optimal refinement protocol (e.g. anisotropic, isotropic or overall B-factor refinement, TLS model) and to optimize the geometry versus data-refinement weights. Next, it proceeds to rebuild side chains and peptide planes before a final optimization round. PDB_REDO works fully automatically without the need for intervention by a crystallographic expert. The pipeline was tested on 12 000 PDB entries and the great majority of the test cases improved both in terms of crystallographic criteria such as R(free) and in terms of widely accepted geometric validation criteria. It is concluded that PDB_REDO is useful to update the otherwise `static' structures in the PDB to modern crystallographic standards. The publically available PDB_REDO database provides better model statistics and contributes to better refinement and validation targets.
Collapse
Affiliation(s)
- Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.
| | | | | | | |
Collapse
|
41
|
Read RJ, Adams PD, Arendall WB, Brunger AT, Emsley P, Joosten RP, Kleywegt GJ, Krissinel EB, Lütteke T, Otwinowski Z, Perrakis A, Richardson JS, Sheffler WH, Smith JL, Tickle IJ, Vriend G, Zwart PH. A new generation of crystallographic validation tools for the protein data bank. Structure 2012; 19:1395-412. [PMID: 22000512 PMCID: PMC3195755 DOI: 10.1016/j.str.2011.08.006] [Citation(s) in RCA: 332] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 08/02/2011] [Accepted: 08/02/2011] [Indexed: 11/26/2022]
Abstract
This report presents the conclusions of the X-ray Validation Task Force of the worldwide Protein Data Bank (PDB). The PDB has expanded massively since current criteria for validation of deposited structures were adopted, allowing a much more sophisticated understanding of all the components of macromolecular crystals. The size of the PDB creates new opportunities to validate structures by comparison with the existing database, and the now-mandatory deposition of structure factors creates new opportunities to validate the underlying diffraction data. These developments highlighted the need for a new assessment of validation criteria. The Task Force recommends that a small set of validation data be presented in an easily understood format, relative to both the full PDB and the applicable resolution class, with greater detail available to interested users. Most importantly, we recommend that referees and editors judging the quality of structural experiments have access to a concise summary of well-established quality indicators.
Collapse
Affiliation(s)
- Randy J Read
- CIMR, University of Cambridge, Cambridge CB2 0XY, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Abstract
MOTIVATION Macromolecular crystal structures in the Protein Data Bank (PDB) are a key source of structural insight into biological processes. These structures, some >30 years old, were constructed with methods of their era. With PDB_REDO, we aim to automatically optimize these structures to better fit their corresponding experimental data, passing the benefits of new methods in crystallography on to a wide base of non-crystallographer structure users. RESULTS We developed new algorithms to allow automatic rebuilding and remodeling of main chain peptide bonds and side chains in crystallographic electron density maps, and incorporated these and further enhancements in the PDB_REDO procedure. Applying the updated PDB_REDO to the oldest, but also to some of the newest models in the PDB, corrects existing modeling errors and brings these models to a higher quality, as judged by standard validation methods. AVAILABILITY AND IMPLEMENTATION The PDB_REDO database and links to all software are available at http://www.cmbi.ru.nl/pdb_redo. CONTACT r.joosten@nki.nl; a.perrakis@nki.nl SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Robbie P Joosten
- Department of Biochemistry, NKI, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands.
| | | | | | | | | |
Collapse
|
43
|
Heidebrecht T, Christodoulou E, Chalmers MJ, Jan S, Ter Riet B, Grover RK, Joosten RP, Littler D, van Luenen H, Griffin PR, Wentworth P, Borst P, Perrakis A. The structural basis for recognition of base J containing DNA by a novel DNA binding domain in JBP1. Nucleic Acids Res 2011; 39:5715-28. [PMID: 21415010 PMCID: PMC3141245 DOI: 10.1093/nar/gkr125] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The J-binding protein 1 (JBP1) is essential for biosynthesis and maintenance of DNA base-J (β-d-glucosyl-hydroxymethyluracil). Base-J and JBP1 are confined to some pathogenic protozoa and are absent from higher eukaryotes, prokaryotes and viruses. We show that JBP1 recognizes J-containing DNA (J-DNA) through a 160-residue domain, DB-JBP1, with 10 000-fold preference over normal DNA. The crystal structure of DB-JBP1 revealed a helix-turn-helix variant fold, a 'helical bouquet' with a 'ribbon' helix encompassing the amino acids responsible for DNA binding. Mutation of a single residue (Asp525) in the ribbon helix abrogates specificity toward J-DNA. The same mutation renders JBP1 unable to rescue the targeted deletion of endogenous JBP1 genes in Leishmania and changes its distribution in the nucleus. Based on mutational analysis and hydrogen/deuterium-exchange mass-spectrometry data, a model of JBP1 bound to J-DNA was constructed and validated by small-angle X-ray scattering data. Our results open new possibilities for targeted prevention of J-DNA recognition as a therapeutic intervention for parasitic diseases.
Collapse
Affiliation(s)
- Tatjana Heidebrecht
- Division of Biochemistry, The Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Joosten RP, te Beek TAH, Krieger E, Hekkelman ML, Hooft RWW, Schneider R, Sander C, Vriend G. A series of PDB related databases for everyday needs. Nucleic Acids Res 2010; 39:D411-9. [PMID: 21071423 PMCID: PMC3013697 DOI: 10.1093/nar/gkq1105] [Citation(s) in RCA: 516] [Impact Index Per Article: 36.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
The Protein Data Bank (PDB) is the world-wide repository of macromolecular structure information. We present a series of databases that run parallel to the PDB. Each database holds one entry, if possible, for each PDB entry. DSSP holds the secondary structure of the proteins. PDBREPORT holds reports on the structure quality and lists errors. HSSP holds a multiple sequence alignment for all proteins. The PDBFINDER holds easy to parse summaries of the PDB file content, augmented with essentials from the other systems. PDB_REDO holds re-refined, and often improved, copies of all structures solved by X-ray. WHY_NOT summarizes why certain files could not be produced. All these systems are updated weekly. The data sets can be used for the analysis of properties of protein structures in areas ranging from structural genomics, to cancer biology and protein design.
Collapse
Affiliation(s)
- Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
45
|
|
46
|
Joosten RP, Salzemann J, Bloch V, Stockinger H, Berglund AC, Blanchet C, Bongcam-Rudloff E, Combet C, Da Costa AL, Deleage G, Diarena M, Fabbretti R, Fettahi G, Flegel V, Gisel A, Kasam V, Kervinen T, Korpelainen E, Mattila K, Pagni M, Reichstadt M, Breton V, Tickle IJ, Vriend G. PDB_REDO: automated re-refinement of X-ray structure models in the PDB. J Appl Crystallogr 2009; 42:376-384. [PMID: 22477769 PMCID: PMC3246819 DOI: 10.1107/s0021889809008784] [Citation(s) in RCA: 169] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Accepted: 03/10/2009] [Indexed: 11/24/2022] Open
Abstract
Structural biology, homology modelling and rational drug design require accurate three-dimensional macromolecular coordinates. However, the coordinates in the Protein Data Bank (PDB) have not all been obtained using the latest experimental and computational methods. In this study a method is presented for automated re-refinement of existing structure models in the PDB. A large-scale benchmark with 16 807 PDB entries showed that they can be improved in terms of fit to the deposited experimental X-ray data as well as in terms of geometric quality. The re-refinement protocol uses TLS models to describe concerted atom movement. The resulting structure models are made available through the PDB_REDO databank (http://www.cmbi.ru.nl/pdb_redo/). Grid computing techniques were used to overcome the computational requirements of this endeavour.
Collapse
Affiliation(s)
- Robbie P. Joosten
- Centre for Molecular and Biomolecular Informatics, NCMLS, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Jean Salzemann
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Vincent Bloch
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Heinz Stockinger
- Swiss Institute of Bioinformatics, Vital-IT Group, Lausanne, Switzerland
| | | | - Christophe Blanchet
- IBCP, CNRS Université de Lyon1, IFR128 BioSciences Lyon-Gerland, Lyon, France
| | | | - Christophe Combet
- IBCP, CNRS Université de Lyon1, IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Ana L. Da Costa
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Gilbert Deleage
- IBCP, CNRS Université de Lyon1, IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Matteo Diarena
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Roberto Fabbretti
- Swiss Institute of Bioinformatics, Vital-IT Group, Lausanne, Switzerland
| | - Géraldine Fettahi
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Volker Flegel
- Swiss Institute of Bioinformatics, Vital-IT Group, Lausanne, Switzerland
| | - Andreas Gisel
- Institute for Biomedical Technologies Bari, CNR, Bari, Italy
| | - Vinod Kasam
- Fraunhofer-Institute for Algorithms and Scientific Computing, Sankt Augustin, Germany
| | - Timo Kervinen
- CSC – The Finnish IT Center for Science, Espoo, Finland
| | | | - Kimmo Mattila
- CSC – The Finnish IT Center for Science, Espoo, Finland
| | - Marco Pagni
- Swiss Institute of Bioinformatics, Vital-IT Group, Lausanne, Switzerland
| | - Matthieu Reichstadt
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | - Vincent Breton
- CNRS/IN2P3, Laboratoire de Physique Corpusculaire, Université Blaize Pascal, Clermont-Ferrand, France
| | | | - Gert Vriend
- Centre for Molecular and Biomolecular Informatics, NCMLS, Radboud University Medical Center, Nijmegen, The Netherlands
| |
Collapse
|
47
|
Joosten RP, Womack T, Vriend G, Bricogne G. Re-refinement from deposited X-ray data can deliver improved models for most PDB entries. Acta Crystallogr D Biol Crystallogr 2009; 65:176-85. [PMID: 19171973 PMCID: PMC2631631 DOI: 10.1107/s0907444908037591] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2008] [Accepted: 11/12/2008] [Indexed: 11/17/2022]
Abstract
The deposition of X-ray data along with the customary structural models defining PDB entries makes it possible to apply large-scale re-refinement protocols to these entries, thus giving users the benefit of improvements in X-ray methods that have occurred since the structure was deposited. Automated gradient refinement is an effective method to achieve this goal, but real-space intervention is most often required in order to adequately address problems detected by structure-validation software. In order to improve the existing protocol, automated re-refinement was combined with structure validation and difference-density peak analysis to produce a catalogue of problems in PDB entries that are amenable to automatic correction. It is shown that re-refinement can be effective in producing improvements, which are often associated with the systematic use of the TLS parameterization of B factors, even for relatively new and high-resolution PDB entries, while the accompanying manual or semi-manual map analysis and fitting steps show good prospects for eventual automation. It is proposed that the potential for simultaneous improvements in methods and in re-refinement results be further encouraged by broadening the scope of depositions to include refinement metadata and ultimately primary rather than reduced X-ray data.
Collapse
Affiliation(s)
- Robbie P. Joosten
- CMBI, NCMLS, Radboud University Nijmegen Medical Centre, PO Box 9101, 6500 HB Nijmegen, The Netherlands
| | - Thomas Womack
- Global Phasing, Sheraton House, Castle Park, Cambridge CB3 0AX, England
| | - Gert Vriend
- CMBI, NCMLS, Radboud University Nijmegen Medical Centre, PO Box 9101, 6500 HB Nijmegen, The Netherlands
| | - Gérard Bricogne
- Global Phasing, Sheraton House, Castle Park, Cambridge CB3 0AX, England
| |
Collapse
|
48
|
|
49
|
Smith BJ, Huyton T, Joosten RP, McKimm-Breschkin JL, Zhang JG, Luo CS, Lou MZ, Labrou NE, Garrett TPJ. Structure of a calcium-deficient form of influenza virus neuraminidase: implications for substrate binding. Acta Crystallogr D Biol Crystallogr 2006; 62:947-52. [PMID: 16929094 DOI: 10.1107/s0907444906020063] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2006] [Accepted: 05/29/2006] [Indexed: 11/10/2022]
Abstract
The X-ray structure of influenza virus neuraminidase (NA) isolated from whale, subtype N9, has been determined at 2.2 A resolution and contains a tetrameric protein in the asymmetric unit. In structures of NA determined previously, a calcium ion is observed to coordinate amino acids near the substrate-binding site. In three of the NA monomers determined here this calcium is absent, resulting in structural alterations near the substrate-binding site. These changes affect the conformation of residues that participate in several key interactions between the enzyme and substrate and provide at a molecular level the basis of the structural and functional role of calcium in substrate and inhibitor binding. Several sulfate ions were identified in complex with the protein. These are located in the active site, occupying the space reserved for the substrate (sialic acid) carboxylate, and in positions leading away from the substrate-binding site. These sites offer a new opportunity for the design of inhibitors of influenza virus NA.
Collapse
Affiliation(s)
- Brian J Smith
- The Walter and Eliza Hall Institute of Medical Research, 1G Royal Parade, Parkville, Victoria 3050, Australia.
| | | | | | | | | | | | | | | | | |
Collapse
|