1
|
Szikszai M, Magnus M, Sanghi S, Kadyan S, Bouatta N, Rivas E. RNA3DB: A structurally-dissimilar dataset split for training and benchmarking deep learning models for RNA structure prediction. J Mol Biol 2024; 436:168552. [PMID: 38552946 PMCID: PMC11377173 DOI: 10.1016/j.jmb.2024.168552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 03/19/2024] [Accepted: 03/22/2024] [Indexed: 04/09/2024]
Abstract
With advances in protein structure prediction thanks to deep learning models like AlphaFold, RNA structure prediction has recently received increased attention from deep learning researchers. RNAs introduce substantial challenges due to the sparser availability and lower structural diversity of the experimentally resolved RNA structures in comparison to protein structures. These challenges are often poorly addressed by the existing literature, many of which report inflated performance due to using training and testing sets with significant structural overlap. Further, the most recent Critical Assessment of Structure Prediction (CASP15) has shown that deep learning models for RNA structure are currently outperformed by traditional methods. In this paper we present RNA3DB, a dataset of structured RNAs, derived from the Protein Data Bank (PDB), that is designed for training and benchmarking deep learning models. The RNA3DB method arranges the RNA 3D chains into distinct groups (Components) that are non-redundant both with regard to sequence as well as structure, providing a robust way of dividing training, validation, and testing sets. Any split of these structurally-dissimilar Components are guaranteed to produce test and validations sets that are distinct by sequence and structure from those in the training set. We provide the RNA3DB dataset, a particular train/test split of the RNA3DB Components (in an approximate 70/30 ratio) that will be updated periodically. We also provide the RNA3DB methodology along with the source-code, with the goal of creating a reproducible and customizable tool for producing structurally-dissimilar dataset splits for structural RNAs.
Collapse
Affiliation(s)
- Marcell Szikszai
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Marcin Magnus
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Siddhant Sanghi
- Department of Systems Biology, Columbia University, New York 10027, NY, USA; College of Biological Sciences, UC Davis, Davis 95616, CA, USA
| | - Sachin Kadyan
- Department of Systems Biology, Columbia University, New York 10027, NY, USA
| | - Nazim Bouatta
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston 02115, MA, USA
| | - Elena Rivas
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| |
Collapse
|
2
|
Del Conte A, Camagni GF, Clementel D, Minervini G, Monzon AM, Ferrari C, Piovesan D, Tosatto SE. RING 4.0: faster residue interaction networks with novel interaction types across over 35,000 different chemical structures. Nucleic Acids Res 2024; 52:W306-W312. [PMID: 38686797 PMCID: PMC11223866 DOI: 10.1093/nar/gkae337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 04/09/2024] [Accepted: 04/19/2024] [Indexed: 05/02/2024] Open
Abstract
Residue interaction networks (RINs) are a valuable approach for representing contacts in protein structures. RINs have been widely used in various research areas, including the analysis of mutation effects, domain-domain communication, catalytic activity, and molecular dynamics simulations. The RING server is a powerful tool to calculate non-covalent molecular interactions based on geometrical parameters, providing high-quality and reliable results. Here, we introduce RING 4.0, which includes significant enhancements for identifying both covalent and non-covalent bonds in protein structures. It now encompasses seven different interaction types, with the addition of π-hydrogen, halogen bonds and metal ion coordination sites. The definitions of all available bond types have also been refined and RING can now process the complete PDB chemical component dictionary (over 35000 different molecules) which provides atom names and covalent connectivity information for all known ligands. Optimization of the software has improved execution time by an order of magnitude. The RING web server has been redesigned to provide a more engaging and interactive user experience, incorporating new visualization tools. Users can now visualize all types of interactions simultaneously in the structure viewer and network component. The web server, including extensive help and tutorials, is available from URL: https://ring.biocomputingup.it/.
Collapse
Affiliation(s)
- Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Giorgia F Camagni
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Damiano Clementel
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | | | | - Carlo Ferrari
- Department of Information Engineering, University of Padova, Padova, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | |
Collapse
|
3
|
Waterhouse AM, Studer G, Robin X, Bienert S, Tauriello G, Schwede T. The structure assessment web server: for proteins, complexes and more. Nucleic Acids Res 2024; 52:W318-W323. [PMID: 38634802 PMCID: PMC11223858 DOI: 10.1093/nar/gkae270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/21/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024] Open
Abstract
The 'structure assessment' web server is a one-stop shop for interactive evaluation and benchmarking of structural models of macromolecular complexes including proteins and nucleic acids. A user-friendly web dashboard links sequence with structure information and results from a variety of state-of-the-art tools, which facilitates the visual exploration and evaluation of structure models. The dashboard integrates stereochemistry information, secondary structure information, global and local model quality assessment of the tertiary structure of comparative protein models, as well as prediction of membrane location. In addition, a benchmarking mode is available where a model can be compared to a reference structure, providing easy access to scores that have been used in recent CASP experiments and CAMEO. The structure assessment web server is available at https://swissmodel.expasy.org/assess.
Collapse
Affiliation(s)
- Andrew M Waterhouse
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Gabriel Studer
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Xavier Robin
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Computational Structural Biology, Basel, Switzerland
| |
Collapse
|
4
|
Bugnon M, Röhrig UF, Goullieux M, Perez MS, Daina A, Michielin O, Zoete V. SwissDock 2024: major enhancements for small-molecule docking with Attracting Cavities and AutoDock Vina. Nucleic Acids Res 2024; 52:W324-W332. [PMID: 38686803 PMCID: PMC11223881 DOI: 10.1093/nar/gkae300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 03/19/2024] [Accepted: 04/19/2024] [Indexed: 05/02/2024] Open
Abstract
Drug discovery aims to identify potential therapeutic compounds capable of modulating the activity of specific biological targets. Molecular docking can efficiently support this process by predicting binding interactions between small molecules and macromolecular targets and potentially accelerating screening campaigns. SwissDock is a computational tool released in 2011 as part of the SwissDrugDesign project, providing a free web-based service for small-molecule docking after automatized preparation of ligands and targets. Here, we present the latest version of SwissDock, in which EADock DSS has been replaced by two state-of-the-art docking programs, i.e. Attracting Cavities and AutoDock Vina. AutoDock Vina provides faster docking predictions, while Attracting Cavities offers more accurate results. Ligands can be imported in various ways, including as files, SMILES notation or molecular sketches. Targets can be imported as PDB files or identified by their PDB ID. In addition, advanced search options are available both for ligands and targets, giving users automatized access to widely-used databases. The web interface has been completely redesigned for interactive submission and analysis of docking results. Moreover, we developed a user-friendly command-line access which, in addition to all options of the web site, also enables covalent ligand docking with Attracting Cavities. The new version of SwissDock is freely available at https://www.swissdock.ch/.
Collapse
Affiliation(s)
- Marine Bugnon
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Ute F Röhrig
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Mathilde Goullieux
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Marta A S Perez
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Antoine Daina
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Olivier Michielin
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Department of Oncology, Geneva University Hospital (HUG), CH-1205 Geneva, Switzerland
| | - Vincent Zoete
- Molecular Modeling Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Department of Oncology UNIL-CHUV, Ludwig Institute for Cancer Research, Lausanne Branch, University of Lausanne, CH-1015 Lausanne, Switzerland
| |
Collapse
|
5
|
Lawson CL, Kryshtafovych A, Pintilie GD, Burley SK, Černý J, Chen VB, Emsley P, Gobbi A, Joachimiak A, Noreng S, Prisant MG, Read RJ, Richardson JS, Rohou AL, Schneider B, Sellers BD, Shao C, Sourial E, Williams CI, Williams CJ, Yang Y, Abbaraju V, Afonine PV, Baker ML, Bond PS, Blundell TL, Burnley T, Campbell A, Cao R, Cheng J, Chojnowski G, Cowtan KD, DiMaio F, Esmaeeli R, Giri N, Grubmüller H, Hoh SW, Hou J, Hryc CF, Hunte C, Igaev M, Joseph AP, Kao WC, Kihara D, Kumar D, Lang L, Lin S, Maddhuri Venkata Subramaniya SR, Mittal S, Mondal A, Moriarty NW, Muenks A, Murshudov GN, Nicholls RA, Olek M, Palmer CM, Perez A, Pohjolainen E, Pothula KR, Rowley CN, Sarkar D, Schäfer LU, Schlicksup CJ, Schröder GF, Shekhar M, Si D, Singharoy A, Sobolev OV, Terashi G, Vaiana AC, Vedithi SC, Verburgt J, Wang X, Warshamanage R, Winn MD, Weyand S, Yamashita K, Zhao M, Schmid MF, Berman HM, Chiu W. Outcomes of the EMDataResource cryo-EM Ligand Modeling Challenge. Nat Methods 2024; 21:1340-1348. [PMID: 38918604 DOI: 10.1038/s41592-024-02321-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Accepted: 05/24/2024] [Indexed: 06/27/2024]
Abstract
The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein-nucleic acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: Escherichia coli beta-galactosidase with inhibitor, SARS-CoV-2 virus RNA-dependent RNA polymerase with covalently bound nucleotide analog and SARS-CoV-2 virus ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. The quality of submitted ligand models and surrounding atoms were analyzed by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics and contact scores. A composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.
Collapse
Affiliation(s)
- Catherine L Lawson
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.
| | | | - Grigore D Pintilie
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
| | - Stephen K Burley
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA
- RCSB Protein Data Bank and San Diego Supercomputer Center, University of California San Diego, La Jolla, CA, USA
| | - Jiří Černý
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, Czech Republic
| | - Vincent B Chen
- Department of Biochemistry, Duke University, Durham, NC, USA
| | - Paul Emsley
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Alberto Gobbi
- Discovery Chemistry, Genentech Inc., San Francisco, CA, USA
- , Berlin, Germany
| | - Andrzej Joachimiak
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL, USA
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Sigrid Noreng
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
- Protein Science, Septerna, South San Francisco, CA, USA
| | | | - Randy J Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | - Alexis L Rohou
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
| | - Bohdan Schneider
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, Czech Republic
| | - Benjamin D Sellers
- Discovery Chemistry, Genentech Inc., San Francisco, CA, USA
- Computational Chemistry, Vilya, South San Francisco, CA, USA
| | - Chenghua Shao
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | - Ying Yang
- Structural Biology, Genentech Inc., South San Francisco, CA, USA
| | - Venkat Abbaraju
- RCSB Protein Data Bank and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Pavel V Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul S Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Burnley
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Arthur Campbell
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | - K D Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Reza Esmaeeli
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Helmut Grubmüller
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Jie Hou
- Department of Computer Science, Saint Louis University, St. Louis, MO, USA
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Carola Hunte
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS-Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany
| | - Maxim Igaev
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Agnel P Joseph
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Wei-Chun Kao
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS-Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg, Germany
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Dilip Kumar
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA
- Trivedi School of Biosciences, Ashoka University, Sonipat, India
| | - Lijun Lang
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
- The Chinese University of Hong Kong, Hong Kong, China
| | - Sean Lin
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Sumit Mittal
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
- National Renewable Energy Laboratory (NREL), Golden, CO, USA
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Andrew Muenks
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | - Robert A Nicholls
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Mateusz Olek
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
- Electron Bio-Imaging Centre, Diamond Light Source, Harwell Science and Innovation Campus, Didcot, UK
| | - Colin M Palmer
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Emmi Pohjolainen
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Karunakar R Pothula
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- MSU-DOE Plant Research Laboratory, East Lansing, MI, USA
- School of Molecular Sciences, Arizona State University, Tempe, AZ, USA
| | - Luisa U Schäfer
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | - Christopher J Schlicksup
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Gunnar F Schröder
- Institute of Biological Information Processing (IBI-7, Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
- Physics Department, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Mrinal Shekhar
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Dong Si
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Andrea C Vaiana
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Nature's Toolbox (NTx), Rio Rancho, NM, USA
| | | | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Xiao Wang
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | | | - Martyn D Winn
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Simone Weyand
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Minglei Zhao
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Michael F Schmid
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| | - Helen M Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Wah Chiu
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA.
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA.
| |
Collapse
|
6
|
Licciardello G, Doppler M, Sicher C, Bueschl C, Ruso D, Schuhmacher R, Perazzolli M. Metabolic changes in tomato plants caused by psychrotolerant Antarctic endophytic bacteria might be implicated in cold stress mitigation. PHYSIOLOGIA PLANTARUM 2024; 176:e14352. [PMID: 38764037 DOI: 10.1111/ppl.14352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 05/06/2024] [Indexed: 05/21/2024]
Abstract
Climate change is responsible for mild winters and warm springs that can induce premature plant development, increasing the risk of exposure to cold stress with a severe reduction in plant growth. Tomato plants are sensitive to cold stress and beneficial microorganisms can increase their tolerance. However, scarce information is available on mechanisms stimulated by bacterial endophytes in tomato plants against cold stress. This study aimed to clarify metabolic changes stimulated by psychrotolerant endophytic bacteria in tomato plants exposed to cold stress and annotate compounds possibly associated with cold stress mitigation. Tomato seeds were inoculated with two bacterial endophytes isolated from Antarctic Colobanthus quitensis plants (Ewingella sp. S1.OA.A_B6 and Pseudomonas sp. S2.OTC.A_B10) or with Paraburkholderia phytofirmans PsJN, while mock-inoculated seeds were used as control. The metabolic composition of tomato plants was analyzed immediately after cold stress exposure (4°C for seven days) or after two and four days of recovery at 25°C. Under cold stress, the content of malondialdehyde, phenylalanine, ferulic acid, and p-coumaric acid was lower in bacterium-inoculated compared to mock-inoculated plants, indicating a reduction of lipid peroxidation and the stimulation of phenolic compound metabolism. The content of two phenolic compounds, five putative phenylalanine-derived dipeptides, and three further phenylalanine-derived compounds was higher in bacterium-inoculated compared to mock-inoculated samples under cold stress. Thus, psychrotolerant endophytic bacteria can reprogram polyphenol metabolism and stimulate the accumulation of secondary metabolites, like 4-hydroxybenzoic and salicylic acid, which are presumably involved in cold stress mitigation, and phenylalanine-derived dipeptides possibly involved in plant stress responses.
Collapse
Affiliation(s)
- Giorgio Licciardello
- Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Trento, Italy
- Institute of Bioanalytics and Agro-Metabolomics, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Tulln, Austria
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Trento, Italy
| | - Maria Doppler
- Institute of Bioanalytics and Agro-Metabolomics, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Tulln, Austria
- Core Facility Bioactive Molecules: Screening and Analysis, University of Natural Resources and Life Sciences, Tulln, Austria
| | - Carmela Sicher
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Trento, Italy
| | - Christoph Bueschl
- Institute of Bioanalytics and Agro-Metabolomics, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Tulln, Austria
| | - David Ruso
- Institute of Bioanalytics and Agro-Metabolomics, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Tulln, Austria
| | - Rainer Schuhmacher
- Institute of Bioanalytics and Agro-Metabolomics, Department of Agrobiotechnology (IFA-Tulln), University of Natural Resources and Life Sciences, Tulln, Austria
| | - Michele Perazzolli
- Center Agriculture Food Environment (C3A), University of Trento, San Michele all'Adige, Trento, Italy
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Trento, Italy
| |
Collapse
|
7
|
Szikszai M, Magnus M, Sanghi S, Kadyan S, Bouatta N, Rivas E. RNA3DB: A structurally-dissimilar dataset split for training and benchmarking deep learning models for RNA structure prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.30.578025. [PMID: 38352531 PMCID: PMC10862857 DOI: 10.1101/2024.01.30.578025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
With advances in protein structure prediction thanks to deep learning models like AlphaFold, RNA structure prediction has recently received increased attention from deep learning researchers. RNAs introduce substantial challenges due to the sparser availability and lower structural diversity of the experimentally resolved RNA structures in comparison to protein structures. These challenges are often poorly addressed by the existing literature, many of which report inflated performance due to using training and testing sets with significant structural overlap. Further, the most recent Critical Assessment of Structure Prediction (CASP15) has shown that deep learning models for RNA structure are currently outperformed by traditional methods. In this paper we present RNA3DB, a dataset of structured RNAs, derived from the Protein Data Bank (PDB), that is designed for training and benchmarking deep learning models. The RNA3DB method arranges the RNA 3D chains into distinct groups (Components) that are non-redundant both with regard to sequence as well as structure, providing a robust way of dividing training, validation, and testing sets. Any split of these structurally-dissimilar Components are guaranteed to produce test and validations sets that are distinct by sequence and structure from those in the training set. We provide the RNA3DB dataset, a particular train/test split of the RNA3DB Components (in an approximate 70/30 ratio) that will be updated periodically. We also provide the RNA3DB methodology along with the source-code, with the goal of creating a reproducible and customizable tool for producing structurally-dissimilar dataset splits for structural RNAs.
Collapse
Affiliation(s)
- Marcell Szikszai
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Marcin Magnus
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Siddhant Sanghi
- Department of Systems Biology, Columbia University, New York, 10027, NY, USA
- College of Biological Sciences, UC Davis, Davis, 95616, CA, USA
| | - Sachin Kadyan
- Department of Systems Biology, Columbia University, New York, 10027, NY, USA
| | - Nazim Bouatta
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, 02115, MA, USA
| | - Elena Rivas
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, 02138, MA, USA
| |
Collapse
|
8
|
Rodríguez-Salazar CA, van Tol S, Mailhot O, Gonzalez-Orozco M, Galdino GT, Warren AN, Teruel N, Behera P, Afreen KS, Zhang L, Juelich TL, Smith JK, Zylber MI, Freiberg AN, Najmanovich RJ, Giraldo MI, Rajsbaum R. Ebola virus VP35 interacts non-covalently with ubiquitin chains to promote viral replication. PLoS Biol 2024; 22:e3002544. [PMID: 38422166 PMCID: PMC10942258 DOI: 10.1371/journal.pbio.3002544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 03/15/2024] [Accepted: 02/09/2024] [Indexed: 03/02/2024] Open
Abstract
Ebolavirus (EBOV) belongs to a family of highly pathogenic viruses that cause severe hemorrhagic fever in humans. EBOV replication requires the activity of the viral polymerase complex, which includes the cofactor and Interferon antagonist VP35. We previously showed that the covalent ubiquitination of VP35 promotes virus replication by regulating interactions with the polymerase complex. In addition, VP35 can also interact non-covalently with ubiquitin (Ub); however, the function of this interaction is unknown. Here, we report that VP35 interacts with free (unanchored) K63-linked polyUb chains. Ectopic expression of Isopeptidase T (USP5), which is known to degrade unanchored polyUb chains, reduced VP35 association with Ub and correlated with diminished polymerase activity in a minigenome assay. Using computational methods, we modeled the VP35-Ub non-covalent interacting complex, identified the VP35-Ub interacting surface, and tested mutations to validate the interface. Docking simulations identified chemical compounds that can block VP35-Ub interactions leading to reduced viral polymerase activity. Treatment with the compounds reduced replication of infectious EBOV in cells and in vivo in a mouse model. In conclusion, we identified a novel role of unanchored polyUb in regulating Ebola virus polymerase function and discovered compounds that have promising anti-Ebola virus activity.
Collapse
Affiliation(s)
- Carlos A. Rodríguez-Salazar
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
- Molecular Biology and Virology Laboratory, Faculty of Medicine and Health Sciences, Corporación Universitaria Empresarial Alexander von Humboldt, Armenia, Colombia
| | - Sarah van Tol
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Olivier Mailhot
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Maria Gonzalez-Orozco
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Gabriel T. Galdino
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Abbey N. Warren
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey, United States of America
| | - Natalia Teruel
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Padmanava Behera
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey, United States of America
| | - Kazi Sabrina Afreen
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey, United States of America
| | - Lihong Zhang
- Department of Pathology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Terry L. Juelich
- Department of Pathology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Jennifer K. Smith
- Department of Pathology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - María Inés Zylber
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Alexander N. Freiberg
- Department of Pathology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Rafael J. Najmanovich
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Maria I. Giraldo
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Ricardo Rajsbaum
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston, Texas, United States of America
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey, United States of America
| |
Collapse
|
9
|
Lawson CL, Kryshtafovych A, Pintilie GD, Burley SK, Černý J, Chen VB, Emsley P, Gobbi A, Joachimiak A, Noreng S, Prisant M, Read RJ, Richardson JS, Rohou AL, Schneider B, Sellers BD, Shao C, Sourial E, Williams CI, Williams CJ, Yang Y, Abbaraju V, Afonine PV, Baker ML, Bond PS, Blundell TL, Burnley T, Campbell A, Cao R, Cheng J, Chojnowski G, Cowtan KD, DiMaio F, Esmaeeli R, Giri N, Grubmüller H, Hoh SW, Hou J, Hryc CF, Hunte C, Igaev M, Joseph AP, Kao WC, Kihara D, Kumar D, Lang L, Lin S, Maddhuri Venkata Subramaniya SR, Mittal S, Mondal A, Moriarty NW, Muenks A, Murshudov GN, Nicholls RA, Olek M, Palmer CM, Perez A, Pohjolainen E, Pothula KR, Rowley CN, Sarkar D, Schäfer LU, Schlicksup CJ, Schröder GF, Shekhar M, Si D, Singharoy A, Sobolev OV, Terashi G, Vaiana AC, Vedithi SC, Verburgt J, Wang X, Warshamanage R, Winn MD, Weyand S, Yamashita K, Zhao M, Schmid MF, Berman HM, Chiu W. Outcomes of the EMDataResource Cryo-EM Ligand Modeling Challenge. RESEARCH SQUARE 2024:rs.3.rs-3864137. [PMID: 38343795 PMCID: PMC10854310 DOI: 10.21203/rs.3.rs-3864137/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]
Abstract
The EMDataResource Ligand Model Challenge aimed to assess the reliability and reproducibility of modeling ligands bound to protein and protein/nucleic-acid complexes in cryogenic electron microscopy (cryo-EM) maps determined at near-atomic (1.9-2.5 Å) resolution. Three published maps were selected as targets: E. coli beta-galactosidase with inhibitor, SARS-CoV-2 RNA-dependent RNA polymerase with covalently bound nucleotide analog, and SARS-CoV-2 ion channel ORF3a with bound lipid. Sixty-one models were submitted from 17 independent research groups, each with supporting workflow details. We found that (1) the quality of submitted ligand models and surrounding atoms varied, as judged by visual inspection and quantification of local map quality, model-to-map fit, geometry, energetics, and contact scores, and (2) a composite rather than a single score was needed to assess macromolecule+ligand model quality. These observations lead us to recommend best practices for assessing cryo-EM structures of liganded macromolecules reported at near-atomic resolution.
Collapse
Affiliation(s)
- Catherine L. Lawson
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | - Grigore D. Pintilie
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
| | - Stephen K. Burley
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Rutgers Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ USA
- San Diego Supercomputer Center, University of California San Diego, La Jolla, CA USA
| | - Jiří Černý
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, CZ
| | | | - Paul Emsley
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Alberto Gobbi
- Discovery Chemistry, Genentech Inc, South San Francisco, USA
| | - Andrzej Joachimiak
- Structural Biology Center, X-ray Science Division, Argonne National Laboratory, Argonne, IL, USA
| | - Sigrid Noreng
- Structural Biology, Genentech Inc, South San Francisco, USA
| | | | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | | | - Bohdan Schneider
- Institute of Biotechnology, Czech Academy of Sciences, Vestec, CZ
| | | | - Chenghua Shao
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | | | | | | | - Ying Yang
- Structural Biology, Genentech Inc, South San Francisco, USA
| | - Venkat Abbaraju
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Pavel V. Afonine
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew L. Baker
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Paul S. Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Burnley
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Arthur Campbell
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, USA
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | | | - Kevin D. Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Frank DiMaio
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Reza Esmaeeli
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nabin Giri
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
| | - Helmut Grubmüller
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
| | - Jie Hou
- Department of Computer Science, Saint Louis University, St. Louis, MO, USA
| | - Corey F. Hryc
- Department of Biochemistry and Molecular Biology, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Carola Hunte
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS - Centre for Integrative Biological Signalling Studies, University of Freiburg, 79104 Freiburg, Germany
| | - Maxim Igaev
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Agnel P. Joseph
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Wei-Chun Kao
- Institute of Biochemistry and Molecular Biology, ZBMZ, Faculty of Medicine and CIBSS - Centre for Integrative Biological Signalling Studies, University of Freiburg, 79104 Freiburg, Germany
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, USA
| | - Dilip Kumar
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA
| | - Lijun Lang
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Sean Lin
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Sumit Mittal
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
- School of Advanced Sciences and Languages, VIT Bhopal University, Bhopal, India
| | - Arup Mondal
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Nigel W. Moriarty
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Andrew Muenks
- Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | - Mateusz Olek
- York Structural Biology Laboratory, Department of Chemistry, University of York, York, UK
- Electron Bio-Imaging Centre, Diamond Light Source, Harwell Science and Innovation Campus, Didcot, UK
| | - Colin M. Palmer
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Emmi Pohjolainen
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
| | - Karunakar R. Pothula
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Luisa U. Schäfer
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
| | - Christopher J. Schlicksup
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Gunnar F. Schröder
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry) and Jülich Centre for Structural Biology (JuStruct), Forschungszentrum Jülich, Jülich, Germany
- Physics Department, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Mrinal Shekhar
- Center for Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Dong Si
- Division of Computing & Software Systems, University of Washington, Bothell, WA, USA
| | | | - Oleg V. Sobolev
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Genki Terashi
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Andrea C. Vaiana
- Theoretical and Computational Biophysics Department, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Nature’s Toolbox (NTx), Rio Rancho, NM, USA
| | | | - Jacob Verburgt
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | - Xiao Wang
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA
| | | | - Martyn D. Winn
- Scientific Computing Department, UKRI Science and Technology Facilities Council, Research Complex at Harwell, Didcot, UK
| | - Simone Weyand
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Minglei Zhao
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
| | - Michael F. Schmid
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| | - Helen M. Berman
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Wah Chiu
- Departments of Bioengineering and of Microbiology and Immunology, Stanford University, Stanford, CA, USA
- Division of Cryo-EM and Bioimaging, SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
| |
Collapse
|
10
|
Flachsenberg F, Ehrt C, Gutermuth T, Rarey M. Redocking the PDB. J Chem Inf Model 2024; 64:219-237. [PMID: 38108627 DOI: 10.1021/acs.jcim.3c01573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Molecular docking is a standard technique in structure-based drug design (SBDD). It aims to predict the 3D structure of a small molecule in the binding site of a receptor (often a protein). Despite being a common technique, it often necessitates multiple tools and involves manual steps. Here, we present the JAMDA preprocessing and docking workflow that is easy to use and allows fully automated docking. We evaluate the JAMDA docking workflow on binding sites extracted from the complete PDB and derive key factors determining JAMDA's docking performance. With that, we try to remove most of the bias due to manual intervention and provide a realistic estimate of the redocking performance of our JAMDA preprocessing and docking workflow for any PDB structure. On this large PDBScan22 data set, our JAMDA workflow finds a pose with an RMSD of at most 2 Å to the crystal ligand on the top rank for 30.1% of the structures. When applying objective structure quality filters to the PDBScan22 data set, the success rate increases to 61.8%. Given the prepared structures from the JAMDA preprocessing pipeline, both JAMDA and the widely used AutoDock Vina perform comparably on this filtered data set (the PDBScan22-HQ data set).
Collapse
Affiliation(s)
- Florian Flachsenberg
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Christiane Ehrt
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Torben Gutermuth
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| |
Collapse
|
11
|
Lawson CL, Berman H, Chen L, Vallat B, Zirbel C. The Nucleic Acid Knowledgebase: a new portal for 3D structural information about nucleic acids. Nucleic Acids Res 2024; 52:D245-D254. [PMID: 37953312 PMCID: PMC10767938 DOI: 10.1093/nar/gkad957] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/02/2023] [Accepted: 10/16/2023] [Indexed: 11/14/2023] Open
Abstract
The Nucleic Acid Knowledgebase (nakb.org) is a new data resource, updated weekly, for experimentally determined 3D structures containing DNA and/or RNA nucleic acid polymers and their biological assemblies. NAKB indexes nucleic acid-containing structures derived from all major structure determination methods (X-ray, NMR and EM), including all held by the Protein Data Bank (PDB). As the planned successor to the Nucleic Acid Database (NDB), NAKB's design preserves all functionality of the NDB and provides novel nucleic acid-centric content, including structural and functional annotations, as well as annotations from and links to external resources. A variety of custom interactive tools have been developed to enable rapid exploration and drill-down of NAKB's content.
Collapse
Affiliation(s)
- Catherine L Lawson
- Institute for Quantitative Biomedicine, Rutgers, State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Li Chen
- Institute for Quantitative Biomedicine, Rutgers, State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Brinda Vallat
- Institute for Quantitative Biomedicine, Rutgers, State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Craig L Zirbel
- Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| |
Collapse
|
12
|
Kunnakkattu IR, Choudhary P, Pravda L, Nadzirin N, Smart OS, Yuan Q, Anyango S, Nair S, Varadi M, Velankar S. PDBe CCDUtils: an RDKit-based toolkit for handling and analysing small molecules in the Protein Data Bank. J Cheminform 2023; 15:117. [PMID: 38042830 PMCID: PMC10693035 DOI: 10.1186/s13321-023-00786-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/17/2023] [Indexed: 12/04/2023] Open
Abstract
While the Protein Data Bank (PDB) contains a wealth of structural information on ligands bound to macromolecules, their analysis can be challenging due to the large amount and diversity of data. Here, we present PDBe CCDUtils, a versatile toolkit for processing and analysing small molecules from the PDB in PDBx/mmCIF format. PDBe CCDUtils provides streamlined access to all the metadata for small molecules in the PDB and offers a set of convenient methods to compute various properties using RDKit, such as 2D depictions, 3D conformers, physicochemical properties, scaffolds, common fragments, and cross-references to small molecule databases using UniChem. The toolkit also provides methods for identifying all the covalently attached chemical components in a macromolecular structure and calculating similarity among small molecules. By providing a broad range of functionality, PDBe CCDUtils caters to the needs of researchers in cheminformatics, structural biology, bioinformatics and computational chemistry.
Collapse
Affiliation(s)
- Ibrahim Roshan Kunnakkattu
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Preeti Choudhary
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Lukas Pravda
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Oliver S Smart
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Qi Yuan
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stephen Anyango
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sreenath Nair
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
13
|
Mejia‐Rodriguez D, Kim H, Sadler N, Li X, Bohutskyi P, Valiev M, Qian W, Cheung MS. PTM-Psi: A python package to facilitate the computational investigation of post-translational modification on protein structures and their impacts on dynamics and functions. Protein Sci 2023; 32:e4822. [PMID: 37902126 PMCID: PMC10659954 DOI: 10.1002/pro.4822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 10/21/2023] [Accepted: 10/25/2023] [Indexed: 10/31/2023]
Abstract
Post-translational modification (PTM) of a protein occurs after it has been synthesized from its genetic template, and involves chemical modifications of the protein's specific amino acid residues. Despite of the central role played by PTM in regulating molecular interactions, particularly those driven by reversible redox reactions, it remains challenging to interpret PTMs in terms of protein dynamics and function because there are numerous combinatorially enormous means for modifying amino acids in response to changes in the protein environment. In this study, we provide a workflow that allows users to interpret how perturbations caused by PTMs affect a protein's properties, dynamics, and interactions with its binding partners based on inferred or experimentally determined protein structure. This Python-based workflow, called PTM-Psi, integrates several established open-source software packages, thereby enabling the user to infer protein structure from sequence, develop force fields for non-standard amino acids using quantum mechanics, calculate free energy perturbations through molecular dynamics simulations, and score the bound complexes via docking algorithms. Using the S-nitrosylation of several cysteines on the GAP2 protein as an example, we demonstrated the utility of PTM-Psi for interpreting sequence-structure-function relationships derived from thiol redox proteomics data. We demonstrate that the S-nitrosylated cysteine that is exposed to the solvent indirectly affects the catalytic reaction of another buried cysteine over a distance in GAP2 protein through the movement of the two ligands. Our workflow tracks the PTMs on residues that are responsive to changes in the redox environment and lays the foundation for the automation of molecular and systems biology modeling.
Collapse
Affiliation(s)
- Daniel Mejia‐Rodriguez
- Physical Sciences Division, Physical and Computational Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Hoshin Kim
- Physical Sciences Division, Physical and Computational Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Natalie Sadler
- Biological Sciences Division, Earth and Biological Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Xiaolu Li
- Biological Sciences Division, Earth and Biological Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Pavlo Bohutskyi
- Biological Sciences Division, Earth and Biological Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
- Biological Systems EngineeringWashington State UniversityRichlandWashingtonUSA
| | - Marat Valiev
- Physical Sciences Division, Physical and Computational Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Wei‐Jun Qian
- Biological Sciences Division, Earth and Biological Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
| | - Margaret S. Cheung
- Physical Sciences Division, Physical and Computational Sciences Directorate, Pacific Northwest National LaboratoryRichlandWashingtonUSA
- Environmental Molecular Sciences LaboratoryRichlandWashingtonUSA
- University of WashingtonSeattleWashingtonUSA
| |
Collapse
|
14
|
Leemann M, Sagasta A, Eberhardt J, Schwede T, Robin X, Durairaj J. Automated benchmarking of combined protein structure and ligand conformation prediction. Proteins 2023; 91:1912-1924. [PMID: 37885318 DOI: 10.1002/prot.26605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 09/15/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023]
Abstract
The prediction of protein-ligand complexes (PLC), using both experimental and predicted structures, is an active and important area of research, underscored by the inclusion of the Protein-Ligand Interaction category in the latest round of the Critical Assessment of Protein Structure Prediction experiment CASP15. The prediction task in CASP15 consisted of predicting both the three-dimensional structure of the receptor protein as well as the position and conformation of the ligand. This paper addresses the challenges and proposed solutions for devising automated benchmarking techniques for PLC prediction. The reliability of experimentally solved PLC as ground truth reference structures is assessed using various validation criteria. Similarity of PLC to previously released complexes are employed to judge PLC diversity and the difficulty of a PLC as a prediction target. We show that the commonly used PDBBind time-split test-set is inappropriate for comprehensive PLC evaluation, with state-of-the-art tools showing conflicting results on a more representative and high quality dataset constructed for benchmarking purposes. We also show that redocking on crystal structures is a much simpler task than docking into predicted protein models, demonstrated by the two PLC-prediction-specific scoring metrics created. Finally, we introduce a fully automated pipeline that predicts PLC and evaluates the accuracy of the protein structure, ligand pose, and protein-ligand interactions.
Collapse
Affiliation(s)
- Michèle Leemann
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Ander Sagasta
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Jerome Eberhardt
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Xavier Robin
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Janani Durairaj
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
15
|
Meng EC, Goddard TD, Pettersen EF, Couch GS, Pearson ZJ, Morris JH, Ferrin TE. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci 2023; 32:e4792. [PMID: 37774136 PMCID: PMC10588335 DOI: 10.1002/pro.4792] [Citation(s) in RCA: 330] [Impact Index Per Article: 330.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 09/20/2023] [Accepted: 09/23/2023] [Indexed: 10/01/2023]
Abstract
Advances in computational tools for atomic model building are leading to accurate models of large molecular assemblies seen in electron microscopy, often at challenging resolutions of 3-4 Å. We describe new methods in the UCSF ChimeraX molecular modeling package that take advantage of machine-learning structure predictions, provide likelihood-based fitting in maps, and compute per-residue scores to identify modeling errors. Additional model-building tools assist analysis of mutations, post-translational modifications, and interactions with ligands. We present the latest ChimeraX model-building capabilities, including several community-developed extensions. ChimeraX is available free of charge for noncommercial use at https://www.rbvi.ucsf.edu/chimerax.
Collapse
Affiliation(s)
- Elaine C. Meng
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Thomas D. Goddard
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Eric F. Pettersen
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Greg S. Couch
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Zach J. Pearson
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - John H. Morris
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Thomas E. Ferrin
- Department of Pharmaceutical ChemistryUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| |
Collapse
|
16
|
Olla S, Siguri C, Fais A, Era B, Fantini MC, Di Petrillo A. Inhibitory Effect of Quercetin on Oxidative Endogen Enzymes: A Focus on Putative Binding Modes. Int J Mol Sci 2023; 24:15391. [PMID: 37895071 PMCID: PMC10607112 DOI: 10.3390/ijms242015391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 10/13/2023] [Accepted: 10/16/2023] [Indexed: 10/29/2023] Open
Abstract
Oxidative stress is defined as an imbalance between the production of free radicals and reactive oxygen species (ROS) and the ability of the body to neutralize them by anti-oxidant defense systems. Cells can produce ROS during physiological processes, but excessive ROS can lead to non-specific and irreversible damage to biological molecules, such as DNA, lipids, and proteins. Mitochondria mainly produce endogenous ROS during both physiological and pathological conditions. Enzymes like nicotinamide adenine dinucleotide phosphate oxidase (NOX), xanthine oxidase (XO), lipoxygenase (LOX), myeloperoxidase (MPO), and monoamine oxidase (MAO) contribute to this process. The body has enzymatic and non-enzymatic defense systems to neutralize ROS. The intake of bioactive phenols, like quercetin (Que), can protect against pro-oxidative damage by quenching ROS through a non-enzymatic system. In this study, we evaluate the ability of Que to target endogenous oxidant enzymes involved in ROS production and explore the mechanisms of action underlying its anti-oxidant properties. Que can act as a free radical scavenger by donating electrons through the negative charges in its phenolic and ketone groups. Additionally, it can effectively inhibit the activity of several endogenous oxidative enzymes by binding them with high affinity and specificity. Que had the best molecular docking results with XO, followed by MAO-A, 5-LOX, NOX, and MPO. Que's binding to these enzymes was confirmed by subsequent molecular dynamics, revealing different stability phases depending on the enzyme bound. The 500 ns simulation showed a net evolution of binding for NOX and MPO. These findings suggest that Que has potential as a natural therapy for diseases related to oxidative stress.
Collapse
Affiliation(s)
- Stefania Olla
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Cittadella Universitaria di Monserrato, 09042 Monserrato, Italy;
| | - Chiara Siguri
- Istituto di Ricerca Genetica e Biomedica, Consiglio Nazionale delle Ricerche, Cittadella Universitaria di Monserrato, 09042 Monserrato, Italy;
| | - Antonella Fais
- Department of Life and Environmental Sciences, University of Cagliari, 09042 Monserrato, Italy; (A.F.); (B.E.)
| | - Benedetta Era
- Department of Life and Environmental Sciences, University of Cagliari, 09042 Monserrato, Italy; (A.F.); (B.E.)
| | - Massimo Claudio Fantini
- Department of Medical Science and Public Health, University of Cagliari, 09042 Monserrato, Italy;
| | - Amalia Di Petrillo
- Department of Medical Science and Public Health, University of Cagliari, 09042 Monserrato, Italy;
| |
Collapse
|
17
|
Jalili F, Jalili C, Jalalvand AR, Salari N, Pourmotabbed A, Adibi H. Synthesis, characterization and hypolipidemic effects of urazine derivatives on rat: Study of molecular modeling and enzyme inhibition. Bioorg Chem 2023; 139:106681. [PMID: 37385105 DOI: 10.1016/j.bioorg.2023.106681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 04/08/2023] [Accepted: 06/12/2023] [Indexed: 07/01/2023]
Abstract
The prevalence of hyperlipidemia has increased dramatically worldwide. It is a major public health threat, characterized by the presence of an abnormal lipid profile, primarily with elevated serum total cholesterol (TC), low-density lipoprotein (LDL), very low-density lipoprotein (VLDL) levels, and reduced high-density lipoprotein (HDL) level. Genetic factors, dietary and lifestyle habits play important roles in hyperlipidemia. It can increase the risk of chronic metabolic disorders, such as obesity, cardiovascular disease, and type II diabetes. The main objective of the present study was to evaluate the effect of urazine derivatives on serum triglyceride, cholesterol, LDL, HDL, and nitric oxide (NO) levels in high-fat diet (HFD)-induced hyperlipidemic rats. Synthetic compounds were prepared and confirmed by spectroscopic methods. Then, 88 male Sprague-Dawley rats were divided into 11 groups: control, HFD-treated group, HFD plus atorvastatin-treated group, and HFD plus 8 synthetic compounds-treated groups. The body weight, triglyceride, cholesterol, LDL, HDL, and NO levels were measured. The data with p < 0.05 were considered significant. The results indicated that HFD significantly increased cholesterol, triglyceride, and LDL levels and decreased NO concentration and HDL level compared to the control group (p < 0.05). However, HFD plus urazine derivatives significantly decreased NO, cholesterol, and triglyceride levels and increased HDL levels compared to the HFD-treated group (p < 0.05). Urazine derivatives may improve liver dysfunction in HFD-induced hyperlipidemic rats by modulation of detoxification enzymes and their anti-oxidant effects and also blood lipid profile.
Collapse
Affiliation(s)
- Faramarz Jalili
- Student Research Committee, Kermanshah University of Medical Sciences, Kermanshah, Iran; Graduate Studies Student, School of Health Administration, Dalhousie University, Halifax, NS, Canada
| | - Cyrus Jalili
- Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Ali R Jalalvand
- Research Center of Oils and Fats, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Nader Salari
- Sleep Disorders Research Center, Health Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Ali Pourmotabbed
- Deparment Of Physiology, School of Medicine, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Hadi Adibi
- Pharmaceutical Sciences Research Center, Health Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran.
| |
Collapse
|
18
|
Seidel T, Permann C, Wieder O, Kohlbacher SM, Langer T. High-Quality Conformer Generation with CONFORGE: Algorithm and Performance Assessment. J Chem Inf Model 2023; 63:5549-5570. [PMID: 37624145 PMCID: PMC10498443 DOI: 10.1021/acs.jcim.3c00563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Indexed: 08/26/2023]
Abstract
Knowledge of the putative bound-state conformation of a molecule is an essential prerequisite for the successful application of many computer-aided drug design methods that aim to assess or predict its capability to bind to a particular target receptor. An established approach to predict bioactive conformers in the absence of receptor structure information is to sample the low-energy conformational space of the investigated molecules and derive representative conformer ensembles that can be expected to comprise members closely resembling possible bound-state ligand conformations. The high relevance of such conformer generation functionality led to the development of a wide panel of dedicated commercial and open-source software tools throughout the last decades. Several published benchmarking studies have shown that open-source tools usually lag behind their commercial competitors in many key aspects. In this work, we introduce the open-source conformer ensemble generator CONFORGE, which aims at delivering state-of-the-art performance for all types of organic molecules in drug-like chemical space. The ability of CONFORGE and several well-known commercial and open-source conformer ensemble generators to reproduce experimental 3D structures as well as their computational efficiency and robustness has been assessed thoroughly for both typical drug-like molecules and macrocyclic structures. For small molecules, CONFORGE clearly outperformed all other tested open-source conformer generators and performed at least equally well as the evaluated commercial generators in terms of both processing speed and accuracy. In the case of macrocyclic structures, CONFORGE achieved the best average accuracy among all benchmarked generators, with RDKit's generator coming close in second place.
Collapse
Affiliation(s)
- Thomas Seidel
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Christian Permann
- NeGeMac
Research Platform, Department of Pharmaceutical Sciences, Division
of Pharmaceutical Chemistry, University
of Vienna, Josef-Holaubek-Platz
2, 1090 Vienna, Austria
| | - Oliver Wieder
- Christian
Doppler Laboratory for Molecular Informatics in the Biosciences, Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Stefan M. Kohlbacher
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| | - Thierry Langer
- Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
- NeGeMac
Research Platform, Department of Pharmaceutical Sciences, Division
of Pharmaceutical Chemistry, University
of Vienna, Josef-Holaubek-Platz
2, 1090 Vienna, Austria
- Christian
Doppler Laboratory for Molecular Informatics in the Biosciences, Department
of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria
| |
Collapse
|
19
|
Goßen J, Ribeiro RP, Bier D, Neumaier B, Carloni P, Giorgetti A, Rossetti G. AI-based identification of therapeutic agents targeting GPCRs: introducing ligand type classifiers and systems biology. Chem Sci 2023; 14:8651-8661. [PMID: 37592985 PMCID: PMC10430665 DOI: 10.1039/d3sc02352d] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 07/20/2023] [Indexed: 08/19/2023] Open
Abstract
Identifying ligands targeting G protein coupled receptors (GPCRs) with novel chemotypes other than the physiological ligands is a challenge for in silico screening campaigns. Here we present an approach that identifies novel chemotype ligands by combining structural data with a random forest agonist/antagonist classifier and a signal-transduction kinetic model. As a test case, we apply this approach to identify novel antagonists of the human adenosine transmembrane receptor type 2A, an attractive target against Parkinson's disease and cancer. The identified antagonists were tested here in a radio ligand binding assay. Among those, we found a promising ligand whose chemotype differs significantly from all so-far reported antagonists, with a binding affinity of 310 ± 23.4 nM. Thus, our protocol emerges as a powerful approach to identify promising ligand candidates with novel chemotypes while preserving antagonistic potential and affinity in the nanomolar range.
Collapse
Affiliation(s)
- Jonas Goßen
- Institute for Computational Biomedicine (INM-9/IAS-5) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
- Faculty of Mathematics, Computer Science and Natural Sciences RWTH Aachen University Aachen Germany
| | - Rui Pedro Ribeiro
- Institute for Computational Biomedicine (INM-9/IAS-5) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
| | - Dirk Bier
- Institute of Neuroscience and Medicine, Nuclear Chemistry (INM-5), Forschungszentrum Jülich GmbH Wilhelm-Johnen-Straße 52428 Jülich Germany
| | - Bernd Neumaier
- Institute of Neuroscience and Medicine, Nuclear Chemistry (INM-5), Forschungszentrum Jülich GmbH Wilhelm-Johnen-Straße 52428 Jülich Germany
- Institute of Radiochemistry and Experimental Molecular Imaging, University of Cologne, Faculty of Medicine and University Hospital Cologne Kerpener Straße 62 50937 Cologne Germany
| | - Paolo Carloni
- Institute for Computational Biomedicine (INM-9/IAS-5) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
- Faculty of Mathematics, Computer Science and Natural Sciences RWTH Aachen University Aachen Germany
- JARA-Institut Molecular Neuroscience and Neuroimaging (INM-11) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
| | - Alejandro Giorgetti
- Institute for Computational Biomedicine (INM-9/IAS-5) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
- Department of Biotechnology University of Verona Verona Italy
| | - Giulia Rossetti
- Institute for Computational Biomedicine (INM-9/IAS-5) Forschungszentrum Jülich Wilhelm-Johnen-Straße 52428 Jülich Germany
- Jülich Supercomputing Centre (JSC) Forschungszentrum Jülich Jülich Germany
- Department of Neurology University Hospital Aachen (UKA), RWTH Aachen University Aachen Germany
| |
Collapse
|
20
|
Vallat B, Tauriello G, Bienert S, Haas J, Webb BM, Žídek A, Zheng W, Peisach E, Piehl DW, Anischanka I, Sillitoe I, Tolchard J, Varadi M, Baker D, Orengo C, Zhang Y, Hoch JC, Kurisu G, Patwardhan A, Velankar S, Burley SK, Sali A, Schwede T, Berman HM, Westbrook JD. ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models. J Mol Biol 2023; 435:168021. [PMID: 36828268 PMCID: PMC10293049 DOI: 10.1016/j.jmb.2023.168021] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/15/2023] [Accepted: 02/16/2023] [Indexed: 02/24/2023]
Abstract
ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled by solely computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange / macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally-determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Gerardo Tauriello
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Stefan Bienert
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Juergen Haas
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Benjamin M Webb
- Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94157, USA
| | | | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Dennis W Piehl
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Ivan Anischanka
- Department of Biochemistry, and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Ian Sillitoe
- Department of Structural and Molecular Biology, UCL, London, UK
| | - James Tolchard
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Mihaly Varadi
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - David Baker
- Department of Biochemistry, and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | | | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jeffrey C Hoch
- Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, University of Connecticut, Farmington, CT 06030, USA
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Ardan Patwardhan
- Electron Microscopy Data Bank, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Sameer Velankar
- AlphaFold Protein Structure Database, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK; Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, the Quantitative Biosciences Institute (QBI), and the Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94157, USA. https://twitter.com/salilab_ucsf
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland; Computational Structural Biology, SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA
| |
Collapse
|
21
|
Rodríguez-Salazar CA, van Tol S, Mailhot O, Galdino G, Teruel N, Zhang L, Warren AN, González-Orozco M, Freiberg AN, Najmanovich RJ, Giraldo MI, Rajsbaum R. Ebola Virus VP35 Interacts Non-Covalently with Ubiquitin Chains to Promote Viral Replication Creating New Therapeutic Opportunities. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.14.549057. [PMID: 37503276 PMCID: PMC10369991 DOI: 10.1101/2023.07.14.549057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Ebolavirus (EBOV) belongs to a family of highly pathogenic viruses that cause severe hemorrhagic fever in humans. EBOV replication requires the activity of the viral polymerase complex, which includes the co-factor and Interferon antagonist VP35. We previously showed that the covalent ubiquitination of VP35 promotes virus replication by regulating interactions with the polymerase complex. In addition, VP35 can also interact non-covalently with ubiquitin (Ub); however, the function of this interaction is unknown. Here, we report that VP35 interacts with free (unanchored) K63-linked polyUb chains. Ectopic expression of Isopeptidase T (USP5), which is known to degrade unanchored polyUb chains, reduced VP35 association with Ub and correlated with diminished polymerase activity in a minigenome assay. Using computational methods, we modeled the VP35-Ub non-covalent interacting complex, identified the VP35-Ub interacting surface and tested mutations to validate the interface. Docking simulations identified chemical compounds that can block VP35-Ub interactions leading to reduced viral polymerase activity that correlated with reduced replication of infectious EBOV. In conclusion, we identified a novel role of unanchored polyUb in regulating Ebola virus polymerase function and discovered compounds that have promising anti-Ebola virus activity.
Collapse
Affiliation(s)
- Carlos A. Rodríguez-Salazar
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
- Molecular Biology and Virology Laboratory, Faculty of Medicine and Health Sciences, Corporación Universitaria Empresarial Alexander von Humboldt, Armenia 630003, Colombia
| | - Sarah van Tol
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
| | - Olivier Mailhot
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Gabriel Galdino
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Natalia Teruel
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - Lihong Zhang
- Department of Pathology, University of Texas Medical Branch, Galveston 77555, Texas, USA
| | - Abbey N. Warren
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey 07103
| | - María González-Orozco
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
| | - Alexander N. Freiberg
- Department of Pathology, University of Texas Medical Branch, Galveston 77555, Texas, USA
| | - Rafael J. Najmanovich
- Department of Pharmacology and Physiology, Faculty of Medicine, Université de Montréal, Montreal, Canada
| | - María I. Giraldo
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
| | - Ricardo Rajsbaum
- Department of Microbiology and Immunology, University of Texas Medical Branch, Galveston 77555, Texas, USA
- Center for Virus-Host-Innate Immunity and Department of Medicine; Rutgers Biomedical and Health Sciences, Institute for Infectious and Inflammatory Diseases, Rutgers University, Newark, New Jersey 07103
| |
Collapse
|
22
|
Kunzmann P, Müller TD, Greil M, Krumbach JH, Anter JM, Bauer D, Islam F, Hamacher K. Biotite: new tools for a versatile Python bioinformatics library. BMC Bioinformatics 2023; 24:236. [PMID: 37277726 PMCID: PMC10243083 DOI: 10.1186/s12859-023-05345-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 05/18/2023] [Indexed: 06/07/2023] Open
Abstract
BACKGROUND Biotite is a program library for sequence and structural bioinformatics written for the Python programming language. It implements widely used computational methods into a consistent and accessible package. This allows for easy combination of various data analysis, modeling and simulation methods. RESULTS This article presents major functionalities introduced into Biotite since its original publication. The fields of application are shown using concrete examples. We show that the computational performance of Biotite for bioinformatics tasks is comparable to individual, special purpose software systems specifically developed for the respective single task. CONCLUSIONS The results show that Biotite can be used as program library to either answer specific bioinformatics questions and simultaneously allow the user to write entire, self-contained software applications with sufficient performance for general application.
Collapse
Affiliation(s)
- Patrick Kunzmann
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany.
| | - Tom David Müller
- Department of Computer Science, Eberhard Karls University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | | | - Jan Hendrik Krumbach
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Jacob Marcel Anter
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Daniel Bauer
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Faisal Islam
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| | - Kay Hamacher
- Computational Biology and Simulation, Technical University of Darmstadt, Schnittspahnstraße 2, 64287, Darmstadt, Germany
| |
Collapse
|
23
|
Agirre J, Atanasova M, Bagdonas H, Ballard CB, Baslé A, Beilsten-Edmands J, Borges RJ, Brown DG, Burgos-Mármol JJ, Berrisford JM, Bond PS, Caballero I, Catapano L, Chojnowski G, Cook AG, Cowtan KD, Croll TI, Debreczeni JÉ, Devenish NE, Dodson EJ, Drevon TR, Emsley P, Evans G, Evans PR, Fando M, Foadi J, Fuentes-Montero L, Garman EF, Gerstel M, Gildea RJ, Hatti K, Hekkelman ML, Heuser P, Hoh SW, Hough MA, Jenkins HT, Jiménez E, Joosten RP, Keegan RM, Keep N, Krissinel EB, Kolenko P, Kovalevskiy O, Lamzin VS, Lawson DM, Lebedev AA, Leslie AGW, Lohkamp B, Long F, Malý M, McCoy AJ, McNicholas SJ, Medina A, Millán C, Murray JW, Murshudov GN, Nicholls RA, Noble MEM, Oeffner R, Pannu NS, Parkhurst JM, Pearce N, Pereira J, Perrakis A, Powell HR, Read RJ, Rigden DJ, Rochira W, Sammito M, Sánchez Rodríguez F, Sheldrick GM, Shelley KL, Simkovic F, Simpkin AJ, Skubak P, Sobolev E, Steiner RA, Stevenson K, Tews I, Thomas JMH, Thorn A, Valls JT, Uski V, Usón I, Vagin A, Velankar S, Vollmar M, Walden H, Waterman D, Wilson KS, Winn MD, Winter G, Wojdyr M, Yamashita K. The CCP4 suite: integrative software for macromolecular crystallography. Acta Crystallogr D Struct Biol 2023; 79:449-461. [PMID: 37259835 PMCID: PMC10233625 DOI: 10.1107/s2059798323003595] [Citation(s) in RCA: 156] [Impact Index Per Article: 156.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 04/19/2023] [Indexed: 06/02/2023] Open
Abstract
The Collaborative Computational Project No. 4 (CCP4) is a UK-led international collective with a mission to develop, test, distribute and promote software for macromolecular crystallography. The CCP4 suite is a multiplatform collection of programs brought together by familiar execution routines, a set of common libraries and graphical interfaces. The CCP4 suite has experienced several considerable changes since its last reference article, involving new infrastructure, original programs and graphical interfaces. This article, which is intended as a general literature citation for the use of the CCP4 software suite in structure determination, will guide the reader through such transformations, offering a general overview of the new features and outlining future developments. As such, it aims to highlight the individual programs that comprise the suite and to provide the latest references to them for perusal by crystallographers around the world.
Collapse
Affiliation(s)
- Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Haroldas Bagdonas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Charles B. Ballard
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Arnaud Baslé
- Biosciences Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - James Beilsten-Edmands
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Rafael J. Borges
- The Center of Medicinal Chemistry (CQMED), Center for Molecular Biology and Genetic Engineering (CBMEG), University of Campinas (UNICAMP), Av. Dr. André Tosello 550, 13083-886 Campinas, Brazil
| | - David G. Brown
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - J. Javier Burgos-Mármol
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - John M. Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Paul S. Bond
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Lucrezia Catapano
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Atlanta G. Cook
- The Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, The King’s Buildings, Edinburgh EH9 3BF, United Kingdom
| | - Kevin D. Cowtan
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tristan I. Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Altos Labs, Portway Building, Granta Park, Great Abington, Cambridge CB21 6GP, United Kingdom
| | - Judit É. Debreczeni
- Discovery Sciences, R&D BioPharmaceuticals, AstraZeneca, Darwin Building, Cambridge Science Park, Milton Road, Cambridge CB4 0WG, United Kingdom
| | - Nicholas E. Devenish
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Eleanor J. Dodson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Tarik R. Drevon
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Paul Emsley
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Gwyndaf Evans
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Phil R. Evans
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Maria Fando
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - James Foadi
- Department of Mathematical Sciences, University of Bath, Bath, United Kingdom
| | - Luis Fuentes-Montero
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Elspeth F. Garman
- Department of Biochemistry, University of Oxford, Dorothy Crowfoot Hodgkin Building, Oxford OX1 3QU, United Kingdom
| | - Markus Gerstel
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Richard J. Gildea
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Kaushik Hatti
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Maarten L. Hekkelman
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Philipp Heuser
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Soon Wen Hoh
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Michael A. Hough
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Huw T. Jenkins
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Robbie P. Joosten
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Ronan M. Keegan
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Nicholas Keep
- Department of Biological Sciences, Institute of Structural and Molecular Biology, Birkbeck College, London WC1E 7HX, United Kingdom
| | - Eugene B. Krissinel
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Petr Kolenko
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
| | - Oleg Kovalevskiy
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Victor S. Lamzin
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - David M. Lawson
- Department of Biochemistry and Metabolism, John Innes Centre, Norwich NR4 7UH, United Kingdom
| | - Andrey A. Lebedev
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Andrew G. W. Leslie
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Bernhard Lohkamp
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, SE-171 77 Stockholm, Sweden
| | - Fei Long
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin Malý
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Břehová 7, 115 19 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Průmyslová 55, 252 50 Vestec, Czech Republic
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Airlie J. McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Stuart J. McNicholas
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Ana Medina
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Claudia Millán
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - James W. Murray
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Garib N. Murshudov
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robert A. Nicholls
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Martin E. M. Noble
- Translational and Clinical Research Institute, Newcastle University, Paul O’Gorman Building, Medical School, Framlington Place, Newcastle upon Tyne NE2 4HH, United Kingdom
| | - Robert Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Navraj S. Pannu
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - James M. Parkhurst
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Rosalind Franklin Institute, Harwell Science and Innovation Campus, Didcot OX11 0QS, United Kingdom
| | - Nicholas Pearce
- Department of Physics, Chemistry and Biology (IFM), Linköping University, SE-581 83 Linköping, Sweden
| | - Joana Pereira
- Biozentrum and SIB Swiss Institute of Bioinformatics, University of Basel, 4056 Basel, Switzerland
| | - Anastassis Perrakis
- Oncode Institute and Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Harold R. Powell
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - William Rochira
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Massimo Sammito
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2 0XY, United Kingdom
- Discovery Centre, Biologics Engineering, AstraZeneca, Biomedical Campus, 1 Francis Crick Avenue, Trumpington, Cambridge CB2 0AA, United Kingdom
| | - Filomeno Sánchez Rodríguez
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - George M. Sheldrick
- Department of Structural Chemistry, Georg-August-Universität Göttingen, Tammannstrasse 4, 37077 Göttingen, Germany
| | - Kathryn L. Shelley
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Felix Simkovic
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Adam J. Simpkin
- Laboratoires Servier SAS Institut de Recherches, Croissy-sur-Seine, France
| | - Pavol Skubak
- Department of Infectious Diseases, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
| | - Egor Sobolev
- European Molecular Biology Laboratory, c/o DESY, Notkestrasse 85, 22607 Hamburg, Germany
| | - Roberto A. Steiner
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
- Department of Biomedical Sciences, University of Padova, Italy
| | - Kyle Stevenson
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Ivo Tews
- Biological Sciences, Institute for Life Sciences, University of Southampton, Southampton SO17 1BJ, United Kingdom
| | - Jens M. H. Thomas
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Andrea Thorn
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, 22761 Hamburg, Germany
| | - Josep Triviño Valls
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
| | - Ville Uski
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona Science Park, Helix Building, Baldiri Reixac 15, 08028 Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Alexei Vagin
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Melanie Vollmar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Helen Walden
- School of Molecular Biosciences, College of Medical Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - David Waterman
- STFC, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
- CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, United Kingdom
| | - Keith S. Wilson
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Martyn D. Winn
- Scientific Computing Department, Science and Technology Facilities Council, Didcot OX11 0FA, United Kingdom
| | - Graeme Winter
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited (United Kingdom), Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Keitaro Yamashita
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
24
|
Yamashita K, Wojdyr M, Long F, Nicholls RA, Murshudov GN. GEMMI and Servalcat restrain REFMAC5. Acta Crystallogr D Struct Biol 2023; 79:368-373. [PMID: 37158197 PMCID: PMC10167671 DOI: 10.1107/s2059798323002413] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 03/13/2023] [Indexed: 05/10/2023] Open
Abstract
Macromolecular refinement uses experimental data together with prior chemical knowledge (usually digested into geometrical restraints) to optimally fit an atomic structural model into experimental data, while ensuring that the model is chemically plausible. In the CCP4 suite this chemical knowledge is stored in a Monomer Library, which comprises a set of restraint dictionaries. To use restraints in refinement, the model is analysed and template restraints from the dictionary are used to infer (i) restraints between concrete atoms and (ii) the positions of riding hydrogen atoms. Recently, this mundane process has been overhauled. This was also an opportunity to enhance the Monomer Library with new features, resulting in a small improvement in REFMAC5 refinement. Importantly, the overhaul of this part of CCP4 has increased flexibility and eased experimentation, opening up new possibilities.
Collapse
Affiliation(s)
- Keitaro Yamashita
- MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Trumpington, Cambridge CB2 0QH, United Kingdom
| | | | - Fei Long
- MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Trumpington, Cambridge CB2 0QH, United Kingdom
| | - Robert A Nicholls
- MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Trumpington, Cambridge CB2 0QH, United Kingdom
| | - Garib N Murshudov
- MRC Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Avenue, Trumpington, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
25
|
Wang S, Luan J, Chen L, Liu H, Li W, Wang J. Computational characteristics of the structure-activity relationship of inhibitors targeting Pks13-TE domain. Comput Biol Chem 2023; 104:107864. [PMID: 37028177 DOI: 10.1016/j.compbiolchem.2023.107864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/22/2023] [Accepted: 04/01/2023] [Indexed: 04/09/2023]
Abstract
Multiple studies have established the Pks13-TE domain as a promising target for anti-tuberculosis drug development. However, recent findings have revealed that the lead compound currently in the pipeline for Pks13-TE has significant cardiotoxicity issues. Given the pressing need for new chemical structures for Pks13-TE inhibitors, this study aims to provide a detailed understanding of the Pks13-TE domain binding site through the application of computational chemical biology techniques. Our results highlight the size and shape of the Pks13-TE domain binding pocket, key residues including Asp1644, Asn1640, Phe1670, and Tyr1674 within the pocket, and inhibitor pharmacophore characteristics such as aromatic ring sites, positively charged sites, and hydrogen bond donors. To our knowledge, these simulation results are novel and contribute to the discovery of next-generation Pks13-TE inhibitors without similar prior studies.
Collapse
Affiliation(s)
- Shizun Wang
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Jiasi Luan
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Medical Devices, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Lu Chen
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Haihan Liu
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Weixia Li
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, China
| | - Jian Wang
- Key Laboratory of Structure-Based Drug Design & Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, China.
| |
Collapse
|
26
|
Kuschert S, Stroet M, Chin YKY, Conibear AC, Jia X, Lee T, Bartling CRO, Strømgaard K, Güntert P, Rosengren KJ, Mark AE, Mobli M. Facilitating the structural characterisation of non-canonical amino acids in biomolecular NMR. MAGNETIC RESONANCE (GOTTINGEN, GERMANY) 2023; 4:57-72. [PMID: 37904802 PMCID: PMC10583272 DOI: 10.5194/mr-4-57-2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 02/07/2023] [Indexed: 11/01/2023]
Abstract
Peptides and proteins containing non-canonical amino acids (ncAAs) are a large and important class of biopolymers. They include non-ribosomally synthesised peptides, post-translationally modified proteins, expressed or synthesised proteins containing unnatural amino acids, and peptides and proteins that are chemically modified. Here, we describe a general procedure for generating atomic descriptions required to incorporate ncAAs within popular NMR structure determination software such as CYANA, CNS, Xplor-NIH and ARIA. This procedure is made publicly available via the existing Automated Topology Builder (ATB) server (https://atb.uq.edu.au, last access: 17 February 2023) with all submitted ncAAs stored in a dedicated database. The described procedure also includes a general method for linking of side chains of amino acids from CYANA templates. To ensure compatibility with other systems, atom names comply with IUPAC guidelines. In addition to describing the workflow, 3D models of complex natural products generated by CYANA are presented, including vancomycin. In order to demonstrate the manner in which the templates for ncAAs generated by the ATB can be used in practice, we use a combination of CYANA and CNS to solve the structure of a synthetic peptide designed to disrupt Alzheimer-related protein-protein interactions. Automating the generation of structural templates for ncAAs will extend the utility of NMR spectroscopy to studies of more complex biomolecules, with applications in the rapidly growing fields of synthetic biology and chemical biology. The procedures we outline can also be used to standardise the creation of structural templates for any amino acid and thus have the potential to impact structural biology more generally.
Collapse
Affiliation(s)
- Sarah Kuschert
- Centre for Advanced Imaging, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Martin Stroet
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Yanni Ka-Yan Chin
- Centre for Advanced Imaging, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Anne Claire Conibear
- Institute of Applied Synthetic Chemistry, Technische Universität Wien, Getreidemarkt 9/163, Wien 1060, Vienna, Austria
| | - Xinying Jia
- Centre for Advanced Imaging, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Thomas Lee
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | | | - Kristian Strømgaard
- Department of Drug Design and Pharmacology, University of Copenhagen, Universitetsparken 2, 2100 Copenhagen, Denmark
| | - Peter Güntert
- Laboratory of Physical Chemistry, ETH Zürich, 8093 Zurich, Switzerland
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt, 60438 Frankfurt am Main, Germany
- Department of Chemistry, Tokyo Metropolitan University, Hachiōji, Tokyo 192-0397, Japan
| | - Karl Johan Rosengren
- School of Biomedical Sciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Alan Edward Mark
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Mehdi Mobli
- Centre for Advanced Imaging, Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
27
|
Katari SK, Pasala C, Nalamolu RM, Bitla AR, Umamaheswari A. In silico trials to design potent inhibitors against matrilysin (MMP-7). J Biomol Struct Dyn 2022; 40:11851-11862. [PMID: 34405760 DOI: 10.1080/07391102.2021.1965032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The study deals with structure-based rational drug design against the chief zinc-rely endopeptidase called matrilysin (MMP-7) that is involved in inflammatory and metastasis process of several carcinomas. Hyperactivated matrilysin of human was targeted, because of its hydrolytic actions on extracellular matrix (ECM) protein components constitutes fibrillar collagens, gelatins, fibronectins and it also activates zymogen forms of vital matrix metalloproteinases (gelatinase A-MMP-2 and B-MMP-9) responsible for ECM destruction in many cancers. In the present work, e-pharmacophores were generated for the respective five co-crystal structures of human matrilysin by mapping ligand's pharmacophoric features. During the lead-optimization campaign, the five e-pharmacophores-based shape screening against an in-house library of >21 million compounds created a dataset of 5000 structural analogs. The subsequent three different docking strategies, including rigid-receptor docking, quantum-polarized-ligand docking, induced-fit docking and free energy binding calculations resulted four leads as novel and potent MMP-7 binders. These four leads were observed with good pharmacological features and good receiver operating characteristics curve metrics (ROC: 0.93) in post-docking evaluations against five existing co-crystal inhibitors and 1000 decoy molecules with MMP-7. Moreover, stability and dynamics behavior of matrilysin-lead1 complex and matrilysin-cocrystal ligand (TQJ) complex were analyzed in natural physiological milieu of 1000 ns or 1 µs molecular dynamics simulations. Lead1-MMP-7 complex was found with an average Cα root-mean-square deviation (RMSD) of 2.35 Å, average ligand root-mean-square fluctuations (RMSF) of 0.66 Å and the strong metallic interactions with E220, a key residue for proteolytic action thereby hinders ECM proteolysis that in turn can halt metastatic cancerous condition.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Sudheer Kumar Katari
- Department of Bioinformatics, Bioinformatics Centre, Sri Venkateswara Institute of Medical Sciences University, Tirupati, Andhra Pradesh, India
| | - Chiranjeevi Pasala
- Department of Bioinformatics, Bioinformatics Centre, Sri Venkateswara Institute of Medical Sciences University, Tirupati, Andhra Pradesh, India
| | - Ravina Madhulitha Nalamolu
- Department of Bioinformatics, Bioinformatics Centre, Sri Venkateswara Institute of Medical Sciences University, Tirupati, Andhra Pradesh, India
| | - Aparna R Bitla
- Department of Biochemistry, Sri Venkateswara Institute of Medical Sciences University, Tirupati, Andhra Pradesh, India
| | - Amineni Umamaheswari
- Department of Bioinformatics, Bioinformatics Centre, Sri Venkateswara Institute of Medical Sciences University, Tirupati, Andhra Pradesh, India
| |
Collapse
|
28
|
Abstract
Covalent inhibition has emerged as a promising orthogonal approach for drug discovery, despite the significant challenge in achieving target specificity. To facilitate the structure-based rational design of target-specific covalent modulators, we developed an integrated computational protocol to curate covalent binders from the RCSB Protein Data Bank (PDB). Starting from the macromolecular crystallographic information files (mmCIF) in the PDB archive, covalent bond records, which indicate the side chain modification of amino acid residue by a covalent binder, were collected and cleaned. Then, residue-binder adducts, which are products of chemical reactions between targeted residues and covalent binders, were recovered with the help of the Chemical Component Dictionary in PDB. Finally, several strategies were employed to curate the pre-reaction forms of covalent binders from the adducts. Our curated CovBinderInPDB database contains 7375 covalent modifications in which 2189 unique covalent binders target nine types of amino acid residues (Cys, Lys, Ser, Asp, Glu, His, Met, Thr, and Tyr) from 3555 complex structures of 1170 unique protein chains. This database would set a solid foundation for developing and benchmarking computational strategies for covalent modulator design and is freely accessible at https://yzhang.hpc.nyu.edu/CovBinderInPDB.
Collapse
Affiliation(s)
- Xiao-Kang Guo
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Yingkai Zhang
- Department of Chemistry, New York University, New York, New York 10003, United States, Simons Center for Computational Physical Chemistry, New York University, New York, New York 10003, United States, NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China,
| |
Collapse
|
29
|
Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, Craig PA, Crichlow GV, Dalenberg K, Duarte JM, Dutta S, Fayazi M, Feng Z, Flatt JW, Ganesan SJ, Ghosh S, Goodsell DS, Green RK, Guranovic V, Henry J, Hudson BP, Khokhriakov I, Lawson CL, Liang Y, Lowe R, Peisach E, Persikova I, Piehl DW, Rose Y, Sali A, Segura J, Sekharan M, Shao C, Vallat B, Voigt M, Webb B, Westbrook JD, Whetstone S, Young JY, Zalevsky A, Zardecki C. RCSB Protein Data bank: Tools for visualizing and understanding biological macromolecules in 3D. Protein Sci 2022; 31:e4482. [PMID: 36281733 PMCID: PMC9667899 DOI: 10.1002/pro.4482] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 10/17/2022] [Accepted: 10/19/2022] [Indexed: 12/14/2022]
Abstract
Now in its 52nd year of continuous operations, the Protein Data Bank (PDB) is the premiere open-access global archive housing three-dimensional (3D) biomolecular structure data. It is jointly managed by the Worldwide Protein Data Bank (wwPDB) partnership. The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) is funded by the National Science Foundation, National Institutes of Health, and US Department of Energy and serves as the US data center for the wwPDB. RCSB PDB is also responsible for the security of PDB data in its role as wwPDB-designated Archive Keeper. Every year, RCSB PDB serves tens of thousands of depositors of 3D macromolecular structure data (coming from macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction). The RCSB PDB research-focused web portal (RCSB.org) makes PDB data available at no charge and without usage restrictions to many millions of PDB data consumers around the world. The RCSB PDB training, outreach, and education web portal (PDB101.RCSB.org) serves nearly 700 K educators, students, and members of the public worldwide. This invited Tools Issue contribution describes how RCSB PDB (i) is organized; (ii) works with wwPDB partners to process new depositions; (iii) serves as the wwPDB-designated Archive Keeper; (iv) enables exploration and 3D visualization of PDB data via RCSB.org; and (v) supports training, outreach, and education via PDB101.RCSB.org. New tools and features at RCSB.org are presented using examples drawn from high-resolution structural studies of proteins relevant to treatment of human cancers by targeting immune checkpoints.
Collapse
Affiliation(s)
- Stephen K. Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New Jersey, Rutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
- Department of Chemistry and Chemical Biology, RutgersThe State University of New JerseyPiscatawayNew JerseyUSA
| | - Charmi Bhikadiya
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Chunxiao Bi
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Sebastian Bittrich
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Henry Chao
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Li Chen
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Paul A. Craig
- School of Chemistry and Materials ScienceRochester Institute of TechnologyRochesterNew YorkUSA
| | - Gregg V. Crichlow
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Kenneth Dalenberg
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jose M. Duarte
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Shuchismita Dutta
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New Jersey, Rutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
| | - Maryam Fayazi
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Justin W. Flatt
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Sai J. Ganesan
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic SciencesQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Pharmaceutical ChemistryQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
| | - Sutapa Ghosh
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - David S. Goodsell
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New Jersey, Rutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
- Department of Integrative Structural and Computational BiologyThe Scripps Research InstituteLa JollaCaliforniaUSA
| | - Rachel Kramer Green
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Vladimir Guranovic
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jeremy Henry
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Brian P. Hudson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Igor Khokhriakov
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Catherine L. Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Yuhe Liang
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Robert Lowe
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Irina Persikova
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Dennis W. Piehl
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Yana Rose
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic SciencesQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Pharmaceutical ChemistryQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
| | - Joan Segura
- Research Collaboratory for Structural Bioinformatics Protein Data BankSan Diego Supercomputer Center, University of CaliforniaLa JollaCaliforniaUSA
| | - Monica Sekharan
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Chenghua Shao
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Maria Voigt
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Benjamin Webb
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic SciencesQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Pharmaceutical ChemistryQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
| | - John D. Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Shamara Whetstone
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jasmine Y. Young
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Arthur Zalevsky
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic SciencesQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Pharmaceutical ChemistryQuantitative Biosciences Institute, University of CaliforniaSan FranciscoCaliforniaUSA
| | - Christine Zardecki
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| |
Collapse
|
30
|
Westbrook JD, Young JY, Shao C, Feng Z, Guranovic V, Lawson CL, Vallat B, Adams PD, Berrisford JM, Bricogne G, Diederichs K, Joosten RP, Keller P, Moriarty NW, Sobolev OV, Velankar S, Vonrhein C, Waterman DG, Kurisu G, Berman HM, Burley SK, Peisach E. PDBx/mmCIF Ecosystem: Foundational Semantic Tools for Structural Biology. J Mol Biol 2022; 434:167599. [PMID: 35460671 DOI: 10.1016/j.jmb.2022.167599] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/31/2022] [Accepted: 04/13/2022] [Indexed: 02/07/2023]
Abstract
PDBx/mmCIF, Protein Data Bank Exchange (PDBx) macromolecular Crystallographic Information Framework (mmCIF), has become the data standard for structural biology. With its early roots in the domain of small-molecule crystallography, PDBx/mmCIF provides an extensible data representation that is used for deposition, archiving, remediation, and public dissemination of experimentally determined three-dimensional (3D) structures of biological macromolecules by the Worldwide Protein Data Bank (wwPDB, wwpdb.org). Extensions of PDBx/mmCIF are similarly used for computed structure models by ModelArchive (modelarchive.org), integrative/hybrid structures by PDB-Dev (pdb-dev.wwpdb.org), small angle scattering data by Small Angle Scattering Biological Data Bank SASBDB (sasbdb.org), and for models computed generated with the AlphaFold 2.0 deep learning software suite (alphafold.ebi.ac.uk). Community-driven development of PDBx/mmCIF spans three decades, involving contributions from researchers, software and methods developers in structural sciences, data repository providers, scientific publishers, and professional societies. Having a semantically rich and extensible data framework for representing a wide range of structural biology experimental and computational results, combined with expertly curated 3D biostructure data sets in public repositories, accelerates the pace of scientific discovery. Herein, we describe the architecture of the PDBx/mmCIF data standard, tools used to maintain representations of the data standard, governance, and processes by which data content standards are extended, plus community tools/software libraries available for processing and checking the integrity of PDBx/mmCIF data. Use cases exemplify how the members of the Worldwide Protein Data Bank have used PDBx/mmCIF as the foundation for its pipeline for delivering Findable, Accessible, Interoperable, and Reusable (FAIR) data to many millions of users worldwide.
Collapse
Affiliation(s)
- John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA
| | - Jasmine Y Young
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Chenghua Shao
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Vladimir Guranovic
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Catherine L Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Paul D Adams
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Department of Bioengineering, University of California at Berkeley, Berkeley, CA 94720, USA
| | - John M Berrisford
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Gerard Bricogne
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | | | - Robbie P Joosten
- Department of Biochemistry, Netherlands Cancer Institute, Amsterdam, the Netherlands; Oncode Institute, 3521 AL Utrecht, the Netherlands. https://www.twitter.com/Robbie_Joosten
| | - Peter Keller
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | - Nigel W Moriarty
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Oleg V Sobolev
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Clemens Vonrhein
- Global Phasing Ltd, Sheraton House, Castle Park, Cambridge CB3 0AK, UK
| | - David G Waterman
- UKRI-STFC Rutherford Appleton Laboratory, Didcot OX11 0FA, UK; CCP4, Research Complex at Harwell, Rutherford Appleton Laboratory, Didcot OX11 0FA, UK. https://www.twitter.com/upintheair
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, Suita, Osaka 565-0871, Japan
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; The Bridge Institute, Michelson Center for Convergent Bioscience, University of Southern California, Los Angeles, CA, USA
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, CA 92093, USA.
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
31
|
dos Santos IV, Borges RS, Silva GM, de Lima LR, Bastos RS, Ramos RS, Silva LB, da Silva CHTP, dos Santos CBR. Hierarchical Virtual Screening Based on Rocaglamide Derivatives to Discover New Potential Anti-Skin Cancer Agents. Front Mol Biosci 2022; 9:836572. [PMID: 35720115 PMCID: PMC9201829 DOI: 10.3389/fmolb.2022.836572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 01/26/2022] [Indexed: 12/04/2022] Open
Abstract
Skin Cancer (SC) is among the most common type of cancers worldwide. The search for SC therapeutics using molecular modeling strategies as well as considering natural plant-derived products seems to be a promising strategy. The phytochemical Rocaglamide A (Roc-A) and its derivatives rise as an interesting set of reference compounds due to their in vitro cytotoxic activity with SC cell lines. In view of this, we performed a hierarchical virtual screening study considering Roc-A and its derivatives, with the aim to find new chemical entities with potential activity against SC. For this, we selected 15 molecules (Roc-A and 14 derivatives) and initially used them in docking studies to predict their interactions with Checkpoint kinase 1 (Chk1) as a target for SC. This allowed us to compile and use them as a training set to build robust pharmacophore models, validated by Pearson’s correlation (p) values and hierarchical cluster analysis (HCA), subsequentially submitted to prospective virtual screening using the Molport® database. Outputted compounds were then selected considering their similarities to Roc-A, followed by analyses of predicted toxicity and pharmacokinetic properties as well as of consensus molecular docking using three software. 10 promising compounds were selected and analyzed in terms of their properties and structural features and, also, considering their previous reports in literature. In this way, the 10 promising virtual hits found in this work may represent potential anti-SC agents and further investigations concerning their biological tests shall be conducted.
Collapse
Affiliation(s)
- Igor V.F. dos Santos
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Biotechnology and Biodiversity-Network BIONORTE, Federal University of Amapá, Macapá, Brazil
| | - Rosivaldo S. Borges
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Medicinal Chemistry and Molecular Modeling, Federal University of Pará, Belém, Brazil
| | - Guilherme M. Silva
- Computational Laboratory of Pharmaceutical Chemistry, School of Pharmaceutical Sciences of Ribeirão Preto - Universidade de São Paulo, Ribeirão Preto, Brazil
- Departamento de Química, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto - Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Lúcio R. de Lima
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Medicinal Chemistry and Molecular Modeling, Federal University of Pará, Belém, Brazil
| | - Ruan S. Bastos
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Medicinal Chemistry and Molecular Modeling, Federal University of Pará, Belém, Brazil
| | - Ryan S. Ramos
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Biotechnology and Biodiversity-Network BIONORTE, Federal University of Amapá, Macapá, Brazil
| | - Luciane B. Silva
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Medicinal Chemistry and Molecular Modeling, Federal University of Pará, Belém, Brazil
| | - Carlos H. T. P. da Silva
- Computational Laboratory of Pharmaceutical Chemistry, School of Pharmaceutical Sciences of Ribeirão Preto - Universidade de São Paulo, Ribeirão Preto, Brazil
- Departamento de Química, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto - Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Cleydson B. R. dos Santos
- Modeling and Computational Chemistry Laboratory, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Biotechnology and Biodiversity-Network BIONORTE, Federal University of Amapá, Macapá, Brazil
- Graduate Program in Medicinal Chemistry and Molecular Modeling, Federal University of Pará, Belém, Brazil
- *Correspondence: Cleydson B. R. dos Santos,
| |
Collapse
|
32
|
Clementel D, Del Conte A, Monzon AM, Camagni GF, Minervini G, Piovesan D, Tosatto SCE. RING 3.0: fast generation of probabilistic residue interaction networks from structural ensembles. Nucleic Acids Res 2022; 50:W651-W656. [PMID: 35554554 PMCID: PMC9252747 DOI: 10.1093/nar/gkac365] [Citation(s) in RCA: 67] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/15/2022] [Accepted: 04/30/2022] [Indexed: 12/18/2022] Open
Abstract
Residue interaction networks (RINs) are used to represent residue contacts in protein structures. Thanks to the advances in network theory, RINs have been proved effective as an alternative to coordinate data in the analysis of complex systems. The RING server calculates high quality and reliable non-covalent molecular interactions based on geometrical parameters. Here, we present the new RING 3.0 version extending the previous functionality in several ways. The underlying software library has been re-engineered to improve speed by an order of magnitude. RING now also supports the mmCIF format and provides typed interactions for the entire PDB chemical component dictionary, including nucleic acids. Moreover, RING now employs probabilistic graphs, where multiple conformations (e.g. NMR or molecular dynamics ensembles) are mapped as weighted edges, opening up new ways to analyze structural data. The web interface has been expanded to include a simultaneous view of the RIN alongside a structure viewer, with both synchronized and clickable. Contact evolution across models (or time) is displayed as a heatmap and can help in the discovery of correlating interaction patterns. The web server, together with an extensive help and tutorial, is available from URL: https://ring.biocomputingup.it/.
Collapse
Affiliation(s)
- Damiano Clementel
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | | | - Giorgia F Camagni
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Giovanni Minervini
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| |
Collapse
|
33
|
Firouzi R, Ashouri M, Karimi‐Jafari MH. Structural insights into the substrate‐binding site of main protease for the structure‐based COVID‐19 drug discovery. Proteins 2022; 90:1090-1101. [DOI: 10.1002/prot.26318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 01/30/2022] [Accepted: 01/31/2022] [Indexed: 11/06/2022]
Affiliation(s)
- Rohoullah Firouzi
- Department of Physical Chemistry Chemistry and Chemical Engineering Research Center of Iran Tehran Iran
| | - Mitra Ashouri
- Department of Physical Chemistry, School of Chemistry, College of Science University of Tehran Tehran Iran
| | | |
Collapse
|
34
|
Wang Z, Ji H. Characterization of Hydrophilic α-Helical Hot Spots on the Protein-Protein Interaction Interfaces for the Design of α-Helix Mimetics. J Chem Inf Model 2022; 62:1873-1890. [PMID: 35385659 DOI: 10.1021/acs.jcim.1c01556] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The cooperativity index, Kc, was developed to examine the binding synergy between hot spots of the ligand-protein. For the first time, the convergence of the side-chain spatial arrangements of hydrophilic α-helical hot spots Thr, Tyr, Asp, Asn, Ser, Cys, and His in protein-protein interaction (PPI) complex structures was disclosed and quantified by developing novel clustering models. In-depth analyses revealed the driving force for the protein-protein binding conformation convergence of hydrophilic α-helical hot spots. This observation allows deriving pharmacophore models to design new mimetics for hydrophilic α-helical hot spots. A computational protocol was developed to search amino acid analogues and small-molecule mimetics for each hydrophilic α-helical hot spot. As a pilot study, diverse building blocks of commercially available nonstandard L-type α-amino acids and the phenyl ring-containing small-molecule fragments were obtained, which serve as a fragment collection to mimic hydrophilic α-helical hot spots for the improvement of binding affinity, selectivity, physicochemical properties, and synthesis accessibility of α-helix mimetics.
Collapse
Affiliation(s)
- Zhen Wang
- Drug Discovery Department, H. Lee Moffitt Cancer Center & Research Institute, 12902 Magnolia Drive, Tampa, Florida 33612-9497, United States.,Departments of Chemistry and Oncologic Sciences, University of South Florida, Tampa, Florida 33620-9497, United States
| | - Haitao Ji
- Drug Discovery Department, H. Lee Moffitt Cancer Center & Research Institute, 12902 Magnolia Drive, Tampa, Florida 33612-9497, United States.,Departments of Chemistry and Oncologic Sciences, University of South Florida, Tampa, Florida 33620-9497, United States
| |
Collapse
|
35
|
Atanasova M, Nicholls RA, Joosten RP, Agirre J. Updated restraint dictionaries for carbohydrates in the pyranose form. Acta Crystallogr D Struct Biol 2022; 78:455-465. [PMID: 35362468 PMCID: PMC8972801 DOI: 10.1107/s2059798322001103] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 01/31/2022] [Indexed: 11/10/2022] Open
Abstract
Restraint dictionaries are used during macromolecular structure refinement to encapsulate intramolecular connectivity and geometric information. These dictionaries allow previously determined `ideal' values of features such as bond lengths, angles and torsions to be used as restraint targets. During refinement, restraints influence the model to adopt a conformation that agrees with prior observation. This is especially important when refining crystal structures of glycosylated proteins, as their resolutions tend to be worse than those of nonglycosylated proteins. Pyranosides, the overwhelming majority component in all forms of protein glycosylation, often display conformational errors in crystal structures. Whilst many of these flaws usually relate to model building, refinement issues may also have their root in suboptimal restraint dictionaries. In order to avoid subsequent misinterpretation and to improve the quality of all pyranose monosaccharide entries in the CCP4 Monomer Library, new dictionaries with improved ring torsion restraints, coordinates reflecting the lowest-energy ring pucker and updated geometry have been produced and evaluated. These new dictionaries are now part of the CCP4 Monomer Library and will be released with CCP4 version 8.0.
Collapse
Affiliation(s)
- Mihaela Atanasova
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| | - Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robbie P. Joosten
- Biochemistry Department, Netherlands Cancer Institute, Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, United Kingdom
| |
Collapse
|
36
|
Kunzmann P, Anter JM, Hamacher K. Adding hydrogen atoms to molecular models via fragment superimposition. Algorithms Mol Biol 2022; 17:7. [PMID: 35351165 PMCID: PMC8966362 DOI: 10.1186/s13015-022-00215-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 03/10/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Most experimentally determined structures of biomolecules lack annotated hydrogen positions due to their low electron density. However, thorough structure analysis and simulations require knowledge about the positions of hydrogen atoms. Existing methods for their prediction are either limited to a certain range of molecules or only work effectively on small compounds. RESULTS We present a novel algorithm that compiles fragments of molecules with known hydrogen atom positions into a library. Using this library the method is able to predict hydrogen positions for molecules with similar moieties. We show that the method is able to accurately assign hydrogen atoms to most organic compounds including biomacromolecules, if a sufficiently large library is used. CONCLUSIONS We bundled the algorithm into the open-source Python package and command line program Hydride. Since usually no additional parametrization is necessary for the problem at hand, the software works out-of-box for a wide range of molecular systems usually within a few seconds of computation time. Hence, we believe that Hydride could be a valuable tool for structural biologists and biophysicists alike.
Collapse
|
37
|
Simplified quality assessment for small-molecule ligands in the Protein Data Bank. Structure 2022; 30:252-262.e4. [PMID: 35026162 PMCID: PMC8849442 DOI: 10.1016/j.str.2021.10.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 09/14/2021] [Accepted: 10/06/2021] [Indexed: 02/05/2023]
Abstract
More than 70% of the experimentally determined macromolecular structures in the Protein Data Bank (PDB) contain small-molecule ligands. Quality indicators of ∼643,000 ligands present in ∼106,000 PDB X-ray crystal structures have been analyzed. Ligand quality varies greatly with regard to goodness of fit between ligand structure and experimental data, deviations in bond lengths and angles from known chemical structures, and inappropriate interatomic clashes between the ligand and its surroundings. Based on principal component analysis, correlated quality indicators of ligand structure have been aggregated into two largely orthogonal composite indicators measuring goodness of fit to experimental data and deviation from ideal chemical structure. Ranking of the composite quality indicators across the PDB archive enabled construction of uniformly distributed composite ranking score. This score is implemented at RCSB.org to compare chemically identical ligands in distinct PDB structures with easy-to-interpret two-dimensional ligand quality plots, allowing PDB users to quickly assess ligand structure quality and select the best exemplars.
Collapse
|
38
|
What Features of Ligands Are Relevant to the Opening of Cryptic Pockets in Drug Targets? INFORMATICS 2022. [DOI: 10.3390/informatics9010008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Small-molecule drug design aims to identify inhibitors that can specifically bind to a functionally important region on the target, i.e., an active site of an enzyme. Identification of potential binding pockets is typically based on static three-dimensional structures. However, small molecules may induce and select a dynamic binding pocket that is not visible in the apo protein, which presents a well-recognized challenge for structure-based drug discovery. Here, we assessed whether it is possible to identify features in molecules, which we refer to as inducers, that can induce the opening of cryptic pockets. The volume change between apo and bound protein conformations was used as a metric to differentiate chemical features in inducers vs. non-inducers. Based on the dataset of holo–apo pairs, classification models were built to determine an optimum threshold. The model analysis suggested that inducers preferred to be more hydrophobic and aromatic. The impact of sulfur was ambiguous, while phosphorus and halogen atoms were overrepresented in inducers. The fragment analysis showed that small changes in the structures of molecules can strongly affect the potential to induce a cryptic pocket. This analysis and developed model can be used to design inducers that can potentially open cryptic pockets for undruggable proteins.
Collapse
|
39
|
Burley SK, Bhikadiya C, Bi C, Bittrich S, Chen L, Crichlow GV, Duarte JM, Dutta S, Fayazi M, Feng Z, Flatt JW, Ganesan SJ, Goodsell DS, Ghosh S, Kramer Green R, Guranovic V, Henry J, Hudson BP, Lawson CL, Liang Y, Lowe R, Peisach E, Persikova I, Piehl DW, Rose Y, Sali A, Segura J, Sekharan M, Shao C, Vallat B, Voigt M, Westbrook JD, Whetstone S, Young JY, Zardecki C. RCSB Protein Data Bank: Celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D. Protein Sci 2022; 31:187-208. [PMID: 34676613 PMCID: PMC8740825 DOI: 10.1002/pro.4213] [Citation(s) in RCA: 76] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 10/12/2021] [Accepted: 10/12/2021] [Indexed: 01/03/2023]
Abstract
The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), funded by the US National Science Foundation, National Institutes of Health, and Department of Energy, has served structural biologists and Protein Data Bank (PDB) data consumers worldwide since 1999. RCSB PDB, a founding member of the Worldwide Protein Data Bank (wwPDB) partnership, is the US data center for the global PDB archive housing biomolecular structure data. RCSB PDB is also responsible for the security of PDB data, as the wwPDB-designated Archive Keeper. Annually, RCSB PDB serves tens of thousands of three-dimensional (3D) macromolecular structure data depositors (using macromolecular crystallography, nuclear magnetic resonance spectroscopy, electron microscopy, and micro-electron diffraction) from all inhabited continents. RCSB PDB makes PDB data available from its research-focused RCSB.org web portal at no charge and without usage restrictions to millions of PDB data consumers working in every nation and territory worldwide. In addition, RCSB PDB operates an outreach and education PDB101.RCSB.org web portal that was used by more than 800,000 educators, students, and members of the public during calendar year 2020. This invited Tools Issue contribution describes (i) how the archive is growing and evolving as new experimental methods generate ever larger and more complex biomolecular structures; (ii) the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources; and (iii) new tools and features for 3D structure analysis and visualization made available during the past year via the RCSB.org web portal.
Collapse
Affiliation(s)
- Stephen K. Burley
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New JerseyRutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
- Department of Chemistry and Chemical BiologyRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Charmi Bhikadiya
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Chunxiao Bi
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Sebastian Bittrich
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Li Chen
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Gregg V. Crichlow
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jose M. Duarte
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Shuchismita Dutta
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New JerseyRutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
| | - Maryam Fayazi
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Justin W. Flatt
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Sai J. Ganesan
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, Quantitative Biosciences InstituteUniversity of CaliforniaSan FranciscoCaliforniaUSA
| | - David S. Goodsell
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New JerseyRutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
- Department of Integrative Structural and Computational BiologyThe Scripps Research InstituteLa JollaCaliforniaUSA
| | - Sutapa Ghosh
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Rachel Kramer Green
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Vladimir Guranovic
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jeremy Henry
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Brian P. Hudson
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Catherine L. Lawson
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Yuhe Liang
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Robert Lowe
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Irina Persikova
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Dennis W. Piehl
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Yana Rose
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Andrej Sali
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, Quantitative Biosciences InstituteUniversity of CaliforniaSan FranciscoCaliforniaUSA
| | - Joan Segura
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer CenterUniversity of CaliforniaLa JollaCaliforniaUSA
| | - Monica Sekharan
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Chenghua Shao
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Maria Voigt
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - John D. Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Cancer Institute of New JerseyRutgers, The State University of New JerseyNew BrunswickNew JerseyUSA
| | - Shamara Whetstone
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Jasmine Y. Young
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| | - Christine Zardecki
- Research Collaboratory for Structural Bioinformatics Protein Data BankRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
- Institute for Quantitative BiomedicineRutgers, The State University of New JerseyPiscatawayNew JerseyUSA
| |
Collapse
|
40
|
Liu Y, Biczysko M, Moriarty NW. A radical approach to radicals. Acta Crystallogr D Struct Biol 2022; 78:43-51. [PMID: 34981760 DOI: 10.1107/s2059798321010809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 10/18/2021] [Indexed: 11/10/2022] Open
Abstract
Nitroxide radicals are characterized by a long-lived spin-unpaired electronic ground state and are strongly sensitive to their chemical surroundings. Combined with electron paramagnetic resonance spectroscopy, these electronic features have led to the widespread application of nitroxide derivatives as spin labels for use in studying protein structure and dynamics. Site-directed spin labelling requires the incorporation of nitroxides into the protein structure, leading to a new protein-ligand molecular model. However, in protein crystallographic refinement nitroxides are highly unusual molecules with an atypical chemical composition. Because macromolecular crystallography is almost entirely agnostic to chemical radicals, their structural information is generally less accurate or even erroneous. In this work, proteins that contain an example of a radical compound (Chemical Component Dictionary ID MTN) from the nitroxide family were re-refined by defining its ideal structural parameters based on quantum-chemical calculations. The refinement results show that this procedure improves the MTN ligand geometries, while at the same time retaining higher agreement with experimental data.
Collapse
Affiliation(s)
- Youjia Liu
- International Center for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People's Republic of China
| | - Malgorzata Biczysko
- International Center for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People's Republic of China
| | - Nigel W Moriarty
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720-8235, USA
| |
Collapse
|
41
|
BEHZADI PAYAM, GAJDÁCS MÁRIÓ. Worldwide Protein Data Bank (wwPDB): A virtual treasure for research in biotechnology. Eur J Microbiol Immunol (Bp) 2021; 11:77-86. [PMID: 34908533 PMCID: PMC8830413 DOI: 10.1556/1886.2021.00020] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Accepted: 11/23/2021] [Indexed: 12/25/2022] Open
Abstract
The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RSCB PDB) provides a wide range of digital data regarding biology and biomedicine. This huge internet resource involves a wide range of important biological data, obtained from experiments around the globe by different scientists. The Worldwide Protein Data Bank (wwPDB) represents a brilliant collection of 3D structure data associated with important and vital biomolecules including nucleic acids (RNAs and DNAs) and proteins. Moreover, this database accumulates knowledge regarding function and evolution of biomacromolecules which supports different disciplines such as biotechnology. 3D structure, functional characteristics and phylogenetic properties of biomacromolecules give a deep understanding of the biomolecules' characteristics. An important advantage of the wwPDB database is the data updating time, which is done every week. This updating process helps users to have the newest data and information for their projects. The data and information in wwPDB can be a great support to have an accurate imagination and illustrations of the biomacromolecules in biotechnology. As demonstrated by the SARS-CoV-2 pandemic, rapidly reliable and accessible biological data for microbiology, immunology, vaccinology, and drug development are critical to address many healthcare-related challenges that are facing humanity. The aim of this paper is to introduce the readers to wwPDB, and to highlight the importance of this database in biotechnology, with the expectation that the number of scientists interested in the utilization of Protein Data Bank's resources will increase substantially in the coming years.
Collapse
Affiliation(s)
- PAYAM BEHZADI
- Department of Microbiology, College of Basic Sciences, Shahr-e-Qods Branch, Islamic Azad University, Tehran, 37541-374, Iran
| | - MÁRIÓ GAJDÁCS
- Department of Oral Biology and Experimental Dental Research, Faculty of Dentistry, University of Szeged, 6720, Szeged, Hungary,*Corresponding author. Tel.: +36-62-342-532. E-mail:
| |
Collapse
|
42
|
Vallat B, Webb B, Fayazi M, Voinea S, Tangmunarunkit H, Ganesan SJ, Lawson CL, Westbrook JD, Kesselman C, Sali A, Berman HM. New system for archiving integrative structures. Acta Crystallogr D Struct Biol 2021; 77:1486-1496. [PMID: 34866606 PMCID: PMC8647179 DOI: 10.1107/s2059798321010871] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 10/19/2021] [Indexed: 11/30/2022] Open
Abstract
Structures of many complex biological assemblies are increasingly determined using integrative approaches, in which data from multiple experimental methods are combined. A standalone system, called PDB-Dev, has been developed for archiving integrative structures and making them publicly available. Here, the data standards and software tools that support PDB-Dev are described along with the new and updated components of the PDB-Dev data-collection, processing and archiving infrastructure. Following the FAIR (Findable, Accessible, Interoperable and Reusable) principles, PDB-Dev ensures that the results of integrative structure determinations are freely accessible to everyone.
Collapse
Affiliation(s)
- Brinda Vallat
- RCSB PDB, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Benjamin Webb
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, California, USA
| | - Maryam Fayazi
- RCSB PDB, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Serban Voinea
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Hongsuda Tangmunarunkit
- Information Sciences Institute, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Sai J. Ganesan
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, California, USA
| | - Catherine L. Lawson
- RCSB PDB, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - John D. Westbrook
- RCSB PDB, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Carl Kesselman
- RCSB PDB, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Andrej Sali
- Department of Bioengineering and Therapeutic Sciences, Department of Pharmaceutical Chemistry, and California Institute for Quantitative Biosciences, University of California at San Francisco, San Francisco, California, USA
| | - Helen M. Berman
- Department of Chemistry and Chemical Biology and Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| |
Collapse
|
43
|
Shao C, Feng Z, Westbrook JD, Peisach E, Berrisford J, Ikegawa Y, Kurisu G, Velankar S, Burley SK, Young JY. Modernized uniform representation of carbohydrate molecules in the Protein Data Bank. Glycobiology 2021; 31:1204-1218. [PMID: 33978738 PMCID: PMC8457362 DOI: 10.1093/glycob/cwab039] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/05/2021] [Accepted: 04/25/2021] [Indexed: 12/12/2022] Open
Abstract
Since 1971, the Protein Data Bank (PDB) has served as the single global archive for experimentally determined 3D structures of biological macromolecules made freely available to the global community according to the FAIR principles of Findability-Accessibility-Interoperability-Reusability. During the first 50 years of continuous PDB operations, standards for data representation have evolved to better represent rich and complex biological phenomena. Carbohydrate molecules present in more than 14,000 PDB structures have recently been reviewed and remediated to conform to a new standardized format. This machine-readable data representation for carbohydrates occurring in the PDB structures and the corresponding reference data improves the findability, accessibility, interoperability and reusability of structural information pertaining to these molecules. The PDB Exchange MacroMolecular Crystallographic Information File data dictionary now supports (i) standardized atom nomenclature that conforms to International Union of Pure and Applied Chemistry-International Union of Biochemistry and Molecular Biology (IUPAC-IUBMB) recommendations for carbohydrates, (ii) uniform representation of branched entities for oligosaccharides, (iii) commonly used linear descriptors of carbohydrates developed by the glycoscience community and (iv) annotation of glycosylation sites in proteins. For the first time, carbohydrates in PDB structures are consistently represented as collections of standardized monosaccharides, which precisely describe oligosaccharide structures and enable improved carbohydrate visualization, structure validation, robust quantitative and qualitative analyses, search for dendritic structures and classification. The uniform representation of carbohydrate molecules in the PDB described herein will facilitate broader usage of the resource by the glycoscience community and researchers studying glycoproteins.
Collapse
Affiliation(s)
- Chenghua Shao
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ 08903, USA
| | - Ezra Peisach
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - John Berrisford
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Yasuyo Ikegawa
- Protein Data Bank Japan (PDBj), Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - Genji Kurisu
- Protein Data Bank Japan (PDBj), Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - Sameer Velankar
- Protein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
- Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ 08903, USA
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, La Jolla, San Diego, CA 92093, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Jasmine Y Young
- Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB), Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
44
|
Revisiting the Proposition of Binding Pockets and Bioactive Poses for GSK-3β Allosteric Modulators Addressed to Neurodegenerative Diseases. Int J Mol Sci 2021; 22:ijms22158252. [PMID: 34361017 PMCID: PMC8348340 DOI: 10.3390/ijms22158252] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 05/14/2021] [Accepted: 05/18/2021] [Indexed: 02/07/2023] Open
Abstract
Glycogen synthase kinase-3 beta (GSK-3β) is an enzyme pertinently linked to neurodegenerative diseases since it is associated with the regulation of key neuropathological features in the central nervous system. Among the different kinds of inhibitors of this kinase, the allosteric ones stand out due to their selective and subtle modulation, lowering the chance of producing side effects. The mechanism of GSK-3β allosteric modulators may be considered still vague in terms of elucidating a well-defined binding pocket and a bioactive pose for them. In this context, we propose to reinvestigate and reinforce such knowledge by the application of an extensive set of in silico methodologies, such as cavity detection, ligand 3D shape analysis and docking (with robust validation of corresponding protocols), and molecular dynamics. The results here obtained were consensually consistent in furnishing new structural data, in particular by providing a solid bioactive pose of one of the most representative GSK-3β allosteric modulators. We further applied this to the prospect for new compounds by ligand-based virtual screening and analyzed the potential of the two obtained virtual hits by quantum chemical calculations. All potential hits achieved will be subsequently tested by in vitro assays in order to validate our approaches as well as to unveil novel chemical entities as GSK-3β allosteric modulators.
Collapse
|
45
|
Croitoru A, Park SJ, Kumar A, Lee J, Im W, MacKerell AD, Aleksandrov A. Additive CHARMM36 Force Field for Nonstandard Amino Acids. J Chem Theory Comput 2021; 17:3554-3570. [PMID: 34009984 DOI: 10.1021/acs.jctc.1c00254] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Nonstandard amino acids are both abundant in nature, where they play a key role in various cellular processes, and can be synthesized in laboratories, for example, for the manufacture of a range of pharmaceutical agents. In this work, we have extended the additive all-atom CHARMM36 and CHARMM General force field (CGenFF) to a large set of 333 nonstandard amino acids. These include both amino acids with nonstandard side chains, such as post-translationally modified and artificial amino acids, as well as amino acids with modified backbone groups, such as chromophores composed of several amino acids. Model compounds representative of the nonstandard amino acids were parametrized for protonation states that are likely at the physiological pH of 7 and, for some more common residues, in both d- and l-stereoisomers. Considering all protonation, tautomeric, and stereoisomeric forms, a total of 406 nonstandard amino acids were parametrized. Emphasis was placed on the quality of both intra- and intermolecular parameters. Partial charges were derived using quantum mechanical (QM) data on model compound dipole moments, electrostatic potentials, and interactions with water. Optimization of all intramolecular parameters, including torsion angle parameters, was performed against information from QM adiabatic potential energy surface (PES) scans. Special emphasis was put on the quality of terms corresponding to PES around rotatable dihedral angles. Validation of the force field was based on molecular dynamics simulations of 20 protein complexes containing different nonstandard amino acids. Overall, the presented parameters will allow for computational studies of a wide range of proteins containing nonstandard amino acids, including natural and artificial residues.
Collapse
Affiliation(s)
- Anastasia Croitoru
- Laboratoire d'Optique et Biosciences (CNRS UMR7645, INSERM U1182), Ecole Polytechnique, Institut Polytechnique de Paris, F-91128 Palaiseau, France
| | - Sang-Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Anmol Kumar
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, 20 Penn Street, Baltimore, Maryland 21201, United States
| | - Jumin Lee
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Alexander D MacKerell
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, 20 Penn Street, Baltimore, Maryland 21201, United States
| | - Alexey Aleksandrov
- Laboratoire d'Optique et Biosciences (CNRS UMR7645, INSERM U1182), Ecole Polytechnique, Institut Polytechnique de Paris, F-91128 Palaiseau, France
| |
Collapse
|
46
|
Nicholls RA, Wojdyr M, Joosten RP, Catapano L, Long F, Fischer M, Emsley P, Murshudov GN. The missing link: covalent linkages in structural models. Acta Crystallogr D Struct Biol 2021; 77:727-745. [PMID: 34076588 PMCID: PMC8171067 DOI: 10.1107/s2059798321003934] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/13/2021] [Indexed: 11/10/2022] Open
Abstract
Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifying the correct chemistry, obtaining an appropriate restraint dictionary and ensuring its correct application. The analysis presented herein assesses the extent of problems involving covalent linkages in the Protein Data Bank (PDB). Not only will this facilitate the remediation of existing models, but also, more importantly, it will inform and thus improve the quality of future linkages. By considering linkages of known type in the CCP4 Monomer Library (CCP4-ML), failure to model a covalent linkage is identified to result in inaccurate (systematically longer) interatomic distances. Scanning the PDB for proximal atom pairs that do not have a corresponding type in the CCP4-ML reveals a large number of commonly occurring types of unannotated potential linkages; in general, these may or may not be covalently linked. Manual consideration of the most commonly occurring cases identifies a number of genuine classes of covalent linkages. The recent expansion of the CCP4-ML is discussed, which has involved the addition of over 16 000 and the replacement of over 11 000 component dictionaries using AceDRG. As part of this effort, the CCP4-ML has also been extended using AceDRG link dictionaries for the aforementioned linkage types identified in this analysis. This will facilitate the identification of such linkage types in future modelling efforts, whilst concurrently easing the process involved in their application. The need for a universal standard for maintaining link records corresponding to covalent linkages, and references to the associated dictionaries used during modelling and refinement, following deposition to the PDB is emphasized. The importance of correctly modelling covalent linkages is demonstrated using a case study, which involves the covalent linkage of an inhibitor to the main protease in various viral species, including SARS-CoV-2. This example demonstrates the importance of properly modelling covalent linkages using a comprehensive restraint dictionary, as opposed to just using a single interatomic distance restraint or failing to model the covalent linkage at all.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited, Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Robbie P. Joosten
- Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Lucrezia Catapano
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcus Fischer
- Chemical Biology and Therapeutics and Structural Biology, St Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105-3678, USA
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
47
|
Nicholls RA, Joosten RP, Long F, Wojdyr M, Lebedev A, Krissinel E, Catapano L, Fischer M, Emsley P, Murshudov GN. Modelling covalent linkages in CCP4. Acta Crystallogr D Struct Biol 2021; 77:712-726. [PMID: 34076587 PMCID: PMC8171069 DOI: 10.1107/s2059798321001753] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 02/12/2021] [Indexed: 11/10/2022] Open
Abstract
In this contribution, the current protocols for modelling covalent linkages within the CCP4 suite are considered. The mechanism used for modelling covalent linkages is reviewed: the use of dictionaries for describing changes to stereochemistry as a result of the covalent linkage and the application of link-annotation records to structural models to ensure the correct treatment of individual instances of covalent linkages. Previously, linkage descriptions were lacking in quality compared with those of contemporary component dictionaries. Consequently, AceDRG has been adapted for the generation of link dictionaries of the same quality as for individual components. The approach adopted by AceDRG for the generation of link dictionaries is outlined, which includes associated modifications to the linked components. A number of tools to facilitate the practical modelling of covalent linkages available within the CCP4 suite are described, including a new restraint-dictionary accumulator, the Make Covalent Link tool and AceDRG interface in Coot, the 3D graphical editor JLigand and the mechanisms for dealing with covalent linkages in the CCP4i2 and CCP4 Cloud environments. These integrated solutions streamline and ease the covalent-linkage modelling workflow, seamlessly transferring relevant information between programs. Current recommended practice is elucidated by means of instructive practical examples. By summarizing the different approaches to modelling linkages that are available within the CCP4 suite, limitations and potential pitfalls that may be encountered are highlighted in order to raise awareness, with the intention of improving the quality of future modelled covalent linkages in macromolecular complexes.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Robbie P. Joosten
- Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited, Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Andrey Lebedev
- CCP4, STFC Rutherford Appleton Laboratory, Chilton, Didcot OX11 0QX, United Kingdom
| | - Eugene Krissinel
- CCP4, STFC Rutherford Appleton Laboratory, Chilton, Didcot OX11 0QX, United Kingdom
| | - Lucrezia Catapano
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Marcus Fischer
- Chemical Biology and Therapeutics and Structural Biology, St Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105-3678, USA
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
48
|
Debio-0932, a second generation oral Hsp90 inhibitor, induces apoptosis in MCF-7 and MDA-MB-231 cell lines. Mol Biol Rep 2021; 48:3439-3449. [PMID: 33999319 DOI: 10.1007/s11033-021-06392-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2021] [Accepted: 04/29/2021] [Indexed: 12/09/2022]
Abstract
Heat shock protein 90 (Hsp90) is a key chaperone that is abnormally expressed in cancer cells, and therefore, designing novel compounds to inhibit chaperone activities of the Hsp90 is a promising therapeutic approach for cancer drug discovery. Debio-0932 is a second-generation Hsp90 inhibitor that exhibited promising anticancer activity against a wide variety of cancer types with a strong binding affinity for Hsp90 and high oral bioavailability. Anticancer activities of the Debio-0932 were tested in MCF-7 and MDA-MB-231 cell lines. Molecular docking results indicated that Debio-0932 was selectively bound to the ATP binding pocket of the Hsp90 with an estimated free energy of binding - 7.24 kcal/mol. Antiproliferative activity of Debio-0932 was determined by XTT assay and Debio-0932 exhibited a cytotoxic effect on MCF-7 and MDA-MB-231 cells in a time and dose-depended manner. Apoptosis inducer role of Debio-0932 was evaluated in MCF-7 and MDA-MB-231 cells with fluorometric apoptosis/necrosis detection kit. Treatment with Debio-0932 stimulated apoptosis in both breast cancer cell lines. mRNA and protein expression levels of Bax, Bcl-2 and Casp-9 were determined in MCF-7 and MDA-MB-231 cells by RT-PCR and Western blotting respectively. Debio-0932 stimulated the down-regulation of anti-apoptotic protein Bcl-2 and the up-regulation of apoptotic protein Bax and cleavage of Casp-9 in cancer cells. Moreover, the anti-invasive potential of Debio-0932 was evaluated in endothelial cells (HUVEC) by wound-healing assay. Debio-0932 decreased the migration of HUVEC cells as compared to the control group. These results indicate that Debio-0932 is a promising compound to treat triple-negative breast cancer and hormone receptor-positive breast cancer, and their metastases.
Collapse
|
49
|
Feng Z, Westbrook JD, Sala R, Smart OS, Bricogne G, Matsubara M, Yamada I, Tsuchiya S, Aoki-Kinoshita KF, Hoch JC, Kurisu G, Velankar S, Burley SK, Young JY. Enhanced validation of small-molecule ligands and carbohydrates in the Protein Data Bank. Structure 2021; 29:393-400.e1. [PMID: 33657417 PMCID: PMC8026741 DOI: 10.1016/j.str.2021.02.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 01/22/2021] [Accepted: 02/11/2021] [Indexed: 12/19/2022]
Abstract
The Worldwide Protein Data Bank (wwPDB) has provided validation reports based on recommendations from community Validation Task Forces for structures in the PDB since 2013. To further enhance validation of small molecules as recommended from the 2016 Ligand Validation Workshop, wwPDB, Global Phasing Ltd., and the Noguchi Institute, recently formed a public/private partnership to incorporate some of their software tools into the wwPDB validation package. Augmented wwPDB validation report features include: two-dimensional (2D) diagrams of small-molecule ligands and carbohydrates, highlighting geometric validation outcomes; 2D topological diagrams of oligosaccharides present in branched entities generated using 2D Symbol Nomenclature for Glycan representation; and views of 3D electron density maps for ligands and carbohydrates, illustrating the goodness-of-fit between the atomic structure and experimental data (X-ray crystallographic structures only). These improvements will impact confidence in ligand conformation and ligand-macromolecular interactions that will aid in understanding biochemical function and contribute to small-molecule drug discovery.
Collapse
Affiliation(s)
- Zukang Feng
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - John D Westbrook
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Raul Sala
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Oliver S Smart
- Global Phasing Ltd., Sheraton House, Castle Park, Cambridge CB3 0AX, UK
| | - Gérard Bricogne
- Global Phasing Ltd., Sheraton House, Castle Park, Cambridge CB3 0AX, UK
| | - Masaaki Matsubara
- The Noguchi Institute, 1-9-7, Kaga, Itabashi-ku, Tokyo 173-0003, Japan
| | - Issaku Yamada
- The Noguchi Institute, 1-9-7, Kaga, Itabashi-ku, Tokyo 173-0003, Japan
| | | | - Kiyoko F Aoki-Kinoshita
- Faculty of Science and Engineering, Soka University, 1-236 Tangi-machi, Hachioji-shi, Tokyo 192-8577, Japan; Glycan & Life Systems Integration Center, Soka University, 1-236 Tangi-machi, Hachioji-shi, Tokyo 192-8577, Japan
| | - Jeffrey C Hoch
- Biological Magnetic Resonance Data Bank, Department of Molecular Biology and Biophysics, University of Connecticut, UConn Health, 263 Farmington Avenue, Farmington, CT 06030-3305, USA
| | - Genji Kurisu
- Protein Data Bank Japan, Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita-shi, Osaka 565-0871, Japan
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Stephen K Burley
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA; Rutgers Cancer Institute of New Jersey, Robert Wood Johnson Medical School, New Brunswick, NJ 08903, USA; Research Collaboratory for Structural Bioinformatics Protein Data Bank, San Diego Supercomputer Center, University of California, San Diego, La Jolla, CA 92093, USA; Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Jasmine Y Young
- Research Collaboratory for Structural Bioinformatics Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA.
| |
Collapse
|
50
|
Topham CM, Smith JC. Peptide nucleic acid Hoogsteen strand linker design for major groove recognition of DNA thymine bases. J Comput Aided Mol Des 2021; 35:355-369. [PMID: 33624202 DOI: 10.1007/s10822-021-00375-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/03/2021] [Indexed: 10/22/2022]
Abstract
Sequence-specific targeting of double-stranded DNA and non-coding RNA via triple-helix-forming peptide nucleic acids (PNAs) has attracted considerable attention in therapeutic, diagnostic and nanotechnological fields. An E-base (3-oxo-2,3-dihydropyridazine), attached to the polyamide backbone of a PNA Hoogsteen strand by a side-chain linker molecule, is typically used in the hydrogen bond recognition of the 4-oxo group of thymine and uracil nucleic acid bases in the major groove. We report on the application of quantum chemical computational methods, in conjunction with spatial constraints derived from the experimental structure of a homopyrimidine PNA·DNA-PNA hetero-triplex, to investigate the influence of linker flexibility on binding interactions of the E-base with thymine and uracil bases in geometry-optimised model systems. Hydrogen bond formation between the N2 E-base atom and target pyrimidine base 4-oxo groups in model systems containing a β-alanine linker (J Am Chem Soc 119:11116, 1997) was found to incur significant internal strain energy and the potential disruption of intra-stand aromatic base stacking interactions in an oligomeric context. In geometry-optimised model systems containing a 3-trans olefin linker (Bioorg Med Chem Lett 14:1551, 2004) the E-base swung out away from the target pyrimidine bases into the solvent. These findings are in qualitative agreement with calorimetric measurements in hybridisation experiments at T-A and U-A inversion sites. In contrast, calculations on a novel 2-cis olefin linker design indicate that it could permit simultaneous E-base hydrogen bonding with the thymine 4-oxo group, circumvention and solvent screening of the thymine 5-methyl group, and maintenance of triplex intra-stand base stacking interactions.
Collapse
Affiliation(s)
- Christopher M Topham
- Molecular Forces Consulting, 24 Avenue Jacques Besse, 81500, Lavaur, France.
- Computational Molecular Biophysics, IWR Der Universität Heidelberg, Im Neuenheimer Feld 368, 69120, Heidelberg, Germany.
- Center for Molecular Biophysics, University of Tennessee / Oak Ridge National Laboratory, P.O.Box 2008, Oak Ridge, TN, 37831-6309, USA.
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, M407 Walters Life Sciences, 1414 Cumberland Avenue, Knoxville, TN, 37996, USA.
| | - Jeremy C Smith
- Computational Molecular Biophysics, IWR Der Universität Heidelberg, Im Neuenheimer Feld 368, 69120, Heidelberg, Germany
- Center for Molecular Biophysics, University of Tennessee / Oak Ridge National Laboratory, P.O.Box 2008, Oak Ridge, TN, 37831-6309, USA
- Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, M407 Walters Life Sciences, 1414 Cumberland Avenue, Knoxville, TN, 37996, USA
| |
Collapse
|