1
|
Liebschner D, Moriarty NW, Poon BK, Adams PD. In situ ligand restraints from quantum-mechanical methods. Acta Crystallogr D Struct Biol 2023; 79:100-110. [PMID: 36762856 PMCID: PMC9912925 DOI: 10.1107/s2059798323000025] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 01/02/2023] [Indexed: 01/21/2023] Open
Abstract
In macromolecular crystallographic structure refinement, ligands present challenges for the generation of geometric restraints due to their large chemical variability, their possible novel nature and their specific interaction with the binding pocket of the protein. Quantum-mechanical approaches are useful for providing accurate ligand geometries, but can be plagued by the number of minima in flexible molecules. In an effort to avoid these issues, the Quantum Mechanical Restraints (QMR) procedure optimizes the ligand geometry in situ, thus accounting for the influence of the macromolecule on the local energy minima of the ligand. The optimized ligand geometry is used to generate target values for geometric restraints during the crystallographic refinement. As demonstrated using a sample of >2330 ligand instances in >1700 protein-ligand models, QMR restraints generally result in lower deviations from the target stereochemistry compared with conventionally generated restraints. In particular, the QMR approach provides accurate torsion restraints for ligands and other entities.
Collapse
Affiliation(s)
- Dorothee Liebschner
- Molecular Biosciences and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nigel W. Moriarty
- Molecular Biosciences and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Billy K. Poon
- Molecular Biosciences and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Paul D. Adams
- Molecular Biosciences and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
2
|
Joosten RP, Nicholls RA, Agirre J. Towards consistency in geometry restraints for carbohydrates in the pyranose form: modern dictionary generators reviewed. Curr Med Chem 2021; 29:1193-1207. [PMID: 34477506 PMCID: PMC7612510 DOI: 10.2174/0929867328666210902140754] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 08/03/2021] [Accepted: 08/07/2021] [Indexed: 11/23/2022]
Abstract
Macromolecular restrained refinement is nowadays the most used method for improving the agreement between an atomic structural model and experimental data. Restraint dictionaries, a key tool behind the success of the method, allow fine-tuning geometric properties such as distances and angles between atoms beyond simplistic expectations. Dictionary generators can provide restraint target estimates derived from different sources, from fully theoretical to experimental and any combination in between. Carbohydrates are stereochemically complex biomolecules and, in their pyranose form, have clear conformational preferences. As such, they pose unique problems to dictionary generators and in the course of this study, require special attention from software developers. Functional differences between restraint generators will be discussed, as well as the process of achieving consistent results with different software designs. The study will conclude a set of practical considerations, as well as recommendations for the generation of new restraint dictionaries, using the improved software alternatives discussed.
Collapse
Affiliation(s)
- Robbie P Joosten
- Oncode Institute and Division of Biochemistry, Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam. Netherlands
| | - Robert A Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH. United Kingdom
| | - Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, YO10 5DD. United Kingdom
| |
Collapse
|
3
|
Nicholls RA, Wojdyr M, Joosten RP, Catapano L, Long F, Fischer M, Emsley P, Murshudov GN. The missing link: covalent linkages in structural models. Acta Crystallogr D Struct Biol 2021; 77:727-745. [PMID: 34076588 PMCID: PMC8171067 DOI: 10.1107/s2059798321003934] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/13/2021] [Indexed: 11/10/2022] Open
Abstract
Covalent linkages between constituent blocks of macromolecules and ligands have been subject to inconsistent treatment during the model-building, refinement and deposition process. This may stem from a number of sources, including difficulties with initially detecting the covalent linkage, identifying the correct chemistry, obtaining an appropriate restraint dictionary and ensuring its correct application. The analysis presented herein assesses the extent of problems involving covalent linkages in the Protein Data Bank (PDB). Not only will this facilitate the remediation of existing models, but also, more importantly, it will inform and thus improve the quality of future linkages. By considering linkages of known type in the CCP4 Monomer Library (CCP4-ML), failure to model a covalent linkage is identified to result in inaccurate (systematically longer) interatomic distances. Scanning the PDB for proximal atom pairs that do not have a corresponding type in the CCP4-ML reveals a large number of commonly occurring types of unannotated potential linkages; in general, these may or may not be covalently linked. Manual consideration of the most commonly occurring cases identifies a number of genuine classes of covalent linkages. The recent expansion of the CCP4-ML is discussed, which has involved the addition of over 16 000 and the replacement of over 11 000 component dictionaries using AceDRG. As part of this effort, the CCP4-ML has also been extended using AceDRG link dictionaries for the aforementioned linkage types identified in this analysis. This will facilitate the identification of such linkage types in future modelling efforts, whilst concurrently easing the process involved in their application. The need for a universal standard for maintaining link records corresponding to covalent linkages, and references to the associated dictionaries used during modelling and refinement, following deposition to the PDB is emphasized. The importance of correctly modelling covalent linkages is demonstrated using a case study, which involves the covalent linkage of an inhibitor to the main protease in various viral species, including SARS-CoV-2. This example demonstrates the importance of properly modelling covalent linkages using a comprehensive restraint dictionary, as opposed to just using a single interatomic distance restraint or failing to model the covalent linkage at all.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcin Wojdyr
- Global Phasing Limited, Sheraton House, Castle Park, Cambridge CB3 0AX, United Kingdom
| | - Robbie P. Joosten
- Netherlands Cancer Institute, Plesmanlaan 121, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, The Netherlands
| | - Lucrezia Catapano
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
- Randall Centre for Cell and Molecular Biophysics, Faculty of Life Sciences and Medicine, King’s College London, London SE1 9RT, United Kingdom
| | - Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Marcus Fischer
- Chemical Biology and Therapeutics and Structural Biology, St Jude Children’s Research Hospital, 262 Danny Thomas Place, Memphis, TN 38105-3678, USA
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
4
|
van Zundert GCP, Moriarty NW, Sobolev OV, Adams PD, Borrelli KW. Macromolecular refinement of X-ray and cryoelectron microscopy structures with Phenix/OPLS3e for improved structure and ligand quality. Structure 2021; 29:913-921.e4. [PMID: 33823127 DOI: 10.1016/j.str.2021.03.011] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 01/21/2021] [Accepted: 03/12/2021] [Indexed: 11/30/2022]
Abstract
With the advent of the resolution revolution in cryoelectron microscopy (cryo-EM), low-resolution refinement is common, and likewise increases the need for a reliable force field. Here, we report on the incorporation of the OPLS3e force field with the VSGB2.1 solvation model in the structure determination package Phenix. Our results show significantly improved structure quality and reduced ligand strain at lower resolution for X-ray refinement. For refinement of cryo-EM-based structures, we find comparable quality structures, goodness-of-fit, and reduced ligand strain. We also show how structure quality and ligand strain are related to the map-model cross-correlation as a function of data weight, and how that can detect overfitting. Signs of overfitting are found in over half of our cryo-EM dataset, which can be remedied by a re-refinement at a lower data weight. Finally, a start-to-end script for refining structures with Phenix/OPLS3e is available in the Schrödinger 2020-3 distribution.
Collapse
Affiliation(s)
| | - Nigel W Moriarty
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Oleg V Sobolev
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Paul D Adams
- Molecular Biosciences and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Department of Bioengineering, University of California at Berkeley, Berkeley, CA 94720, USA
| | | |
Collapse
|
5
|
Dittrich B. On modelling disordered crystal structures through restraints from molecule-in-cluster computations, and distinguishing static and dynamic disorder. IUCRJ 2021; 8:305-318. [PMID: 33708406 PMCID: PMC7924241 DOI: 10.1107/s2052252521000531] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 01/14/2021] [Indexed: 06/12/2023]
Abstract
Distinguishing disorder into static and dynamic based on multi-temperature X-ray or neutron diffraction experiments is the current state of the art, but is only descriptive, not predictive. Here, several disordered structures are revisited from the Cambridge Crystallographic Data Center 'drug subset', the Cambridge Structural Database and own earlier work, where experimental intensities of Bragg diffraction data were available. Using the molecule-in-cluster approach, structures with distinguishable conformations were optimized separately, as extracted from available or generated disorder models of the respective disordered crystal structures. Re-combining these 'archetype structures' by restraining positional and constraining displacement parameters for conventional least-squares refinement, based on the optimized geometries, then often achieves a superior fit to the experimental diffraction data compared with relying on experimental information alone. It also simplifies and standardizes disorder refinement. Ten example structures were analysed. It is observed that energy differences between separate disorder conformations are usually within a small energy window of RT (T = crystallization temperature). Further computations classify disorder into static or dynamic, using single experiments performed at one single temperature, and this was achieved for propionamide.
Collapse
Affiliation(s)
- Birger Dittrich
- Novartis Campus, Novartis Pharma AG, Postfach, Basel, CH-4002, Switzerland
- Mathematisch-Naturwissenschaftliche Fakultät, Universität Zürich, Winterthurerstrasse 190, Zürich, CH-8057, Switzerland
| |
Collapse
|
6
|
Plaza-Garrido M, Salinas-Garcia MC, Alba-Elena D, Martínez JC, Camara-Artigas A. Lysozyme crystals dyed with bromophenol blue: where has the dye gone? ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY 2020; 76:845-856. [PMID: 32876060 DOI: 10.1107/s2059798320008803] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 06/30/2020] [Indexed: 11/10/2022]
Abstract
Protein crystals can easily be coloured by adding dyes to their mother liquor, but most structures of these protein-dye complexes remain unsolved. Here, structures of lysozyme in complex with bromophenol blue obtained by soaking orthorhombic and tetragonal crystals in a saturated solution of the dye at different pH values from 5.0 to 7.5 are reported. Two different binding sites can be found in the lysozyme-bromophenol blue crystals: binding site I is located near the amino- and carboxyl-termini, while binding site II is located adjacent to helices α1 (residues 4-15) and α3 (residues 88-100). In the orthorhombic crystals soaked at pH 7.0, binding of the dye takes place in both sites without significant changes in the unit cell. However, soaking tetragonal crystals with bromophenol blue results in two different complexes. Crystals soaked at pH 5.5 (HEWL-T1) show a single dye molecule bound to site II, and the crystals belong to space group P43212 without significant changes in the unit cell (a = b = 78.50, c = 37.34 Å). On the other hand, crystals soaked at pH 6.5 in the presence of imidazole (HEWL-T2) show up to eight molecules of the dye bound to site II, and display changes in space group (P212121) and unit cell (a = 38.00, b = 76.65, c = 84.86 Å). In all of the structures, the dye molecules are placed at the surface of the protein near to positively charged residues accessible through the main solvent channels of the crystal. Differences in the arrangement of the dye molecules at the surface of the protein suggest that the binding is not specific and is mainly driven by electrostatic interactions.
Collapse
Affiliation(s)
- Marina Plaza-Garrido
- Department of Chemistry and Physics, Agrifood Campus of International Excellence (ceiA3) and CIAMBITAL, University of Almería, Carretera de Sacramento s/n, 04120 Almería, Spain
| | - M Carmen Salinas-Garcia
- Department of Chemistry and Physics, Agrifood Campus of International Excellence (ceiA3) and CIAMBITAL, University of Almería, Carretera de Sacramento s/n, 04120 Almería, Spain
| | - Daniel Alba-Elena
- Department of Chemistry and Physics, Agrifood Campus of International Excellence (ceiA3) and CIAMBITAL, University of Almería, Carretera de Sacramento s/n, 04120 Almería, Spain
| | - Jose C Martínez
- Department of Physical Chemistry and Institute of Biotechnology, Faculty of Sciences, University of Granada, 18071 Granada, Spain
| | - Ana Camara-Artigas
- Department of Chemistry and Physics, Agrifood Campus of International Excellence (ceiA3) and CIAMBITAL, University of Almería, Carretera de Sacramento s/n, 04120 Almería, Spain
| |
Collapse
|
7
|
Wu C, Li B, Zhang Y, Chen T, Chen C, Jiang W, Wang Q, Chen T. Intranasal delivery of paeoniflorin nanocrystals for brain targeting. Asian J Pharm Sci 2019; 15:326-335. [PMID: 32636950 PMCID: PMC7327772 DOI: 10.1016/j.ajps.2019.11.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 08/27/2019] [Accepted: 11/07/2019] [Indexed: 11/18/2022] Open
Abstract
Paeoniflorin (PA) is an anti-Parkinson Chinese medicine with inferior bioavailability and difficulty in delivery to the brain. This research is to develop an efficacious PA nanocrystal formulation (PA-NCs) that is suitable for intranasal administration to treat Parkinson's disease (PD). PA-NCs were fabricated through an antisolvent precipitation method using TPGS as the stabilizer. The rod-shaped PA-NCs had particle size of 139.6 ± 1.3 nm and zeta potential of −23.2 ± 0.529 mV. A molecular dynamics simulation indicated that van der Waals forces are the primary drivers of interactions between PA and TPGS. In the ex vivo nasal mucosa permeation assay, the cumulative drug release at 24 h was 87.14% ± 5.34%, which was significantly higher than that of free PA. PA-NCs exhibited substantially improved cellular uptake as well as permeability on Calu-3 cells as compared to PA alone. FRET imaging analysis demonstrated that intact NCs could be internalized into Calu-3 cells. Moreover, PA-NCs conferred desirable protective effect against MPP+-induced SH-SY5Y cellular damage. Pharmacokinetic studies revealed a higher PA concentration in the brain following intranasal delivery of PA-NCs. In summary, the intranasal administration of PA-NCs is a promising treatment strategy for PD.
Collapse
Affiliation(s)
- Chaoyin Wu
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Benyue Li
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Yi Zhang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Tingting Chen
- Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou 510530, China
| | - Chuangrong Chen
- Science and Technology Department, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Wei Jiang
- Department of Radiology, Sun Yet-sen Memorial Hospital, Sun Yet-sen University, Guangzhou 510120, China
| | - Qi Wang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Tongkai Chen
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| |
Collapse
|
8
|
Taylor R, Wood PA. A Million Crystal Structures: The Whole Is Greater than the Sum of Its Parts. Chem Rev 2019; 119:9427-9477. [PMID: 31244003 DOI: 10.1021/acs.chemrev.9b00155] [Citation(s) in RCA: 127] [Impact Index Per Article: 25.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The founding in 1965 of what is now called the Cambridge Structural Database (CSD) has reaped dividends in numerous and diverse areas of chemical research. Each of the million or so crystal structures in the database was solved for its own particular reason, but collected together, the structures can be reused to address a multitude of new problems. In this Review, which is focused mainly on the last 10 years, we chronicle the contribution of the CSD to research into molecular geometries, molecular interactions, and molecular assemblies and demonstrate its value in the design of biologically active molecules and the solid forms in which they are delivered. Its potential in other commercially relevant areas is described, including gas storage and delivery, thin films, and (opto)electronics. The CSD also aids the solution of new crystal structures. Because no scientific instrument is without shortcomings, the limitations of CSD research are assessed. We emphasize the importance of maintaining database quality: notwithstanding the arrival of big data and machine learning, it remains perilous to ignore the principle of garbage in, garbage out. Finally, we explain why the CSD must evolve with the world around it to ensure it remains fit for purpose in the years ahead.
Collapse
Affiliation(s)
- Robin Taylor
- Cambridge Crystallographic Data Centre , 12 Union Road , Cambridge CB2 1EZ , United Kingdom
| | - Peter A Wood
- Cambridge Crystallographic Data Centre , 12 Union Road , Cambridge CB2 1EZ , United Kingdom
| |
Collapse
|
9
|
Borbulevych O, Martin RI, Westerhoff LM. High-throughput quantum-mechanics/molecular-mechanics (ONIOM) macromolecular crystallographic refinement with PHENIX/DivCon: the impact of mixed Hamiltonian methods on ligand and protein structure. Acta Crystallogr D Struct Biol 2018; 74:1063-1077. [PMID: 30387765 PMCID: PMC6213575 DOI: 10.1107/s2059798318012913] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 09/12/2018] [Indexed: 12/28/2022] Open
Abstract
Conventional macromolecular crystallographic refinement relies on often dubious stereochemical restraints, the preparation of which often requires human validation for unusual species, and on rudimentary energy functionals that are devoid of nonbonding effects owing to electrostatics, polarization, charge transfer or even hydrogen bonding. While this approach has served the crystallographic community for decades, as structure-based drug design/discovery (SBDD) has grown in prominence it has become clear that these conventional methods are less rigorous than they need to be in order to produce properly predictive protein-ligand models, and that the human intervention that is required to successfully treat ligands and other unusual chemistries found in SBDD often precludes high-throughput, automated refinement. Recently, plugins to the Python-based Hierarchical ENvironment for Integrated Xtallography (PHENIX) crystallographic platform have been developed to augment conventional methods with the in situ use of quantum mechanics (QM) applied to ligand(s) along with the surrounding active site(s) at each step of refinement [Borbulevych et al. (2014), Acta Cryst D70, 1233-1247]. This method (Region-QM) significantly increases the accuracy of the X-ray refinement process, and this approach is now used, coupled with experimental density, to accurately determine protonation states, binding modes, ring-flip states, water positions and so on. In the present work, this approach is expanded to include a more rigorous treatment of the entire structure, including the ligand(s), the associated active site(s) and the entire protein, using a fully automated, mixed quantum-mechanics/molecular-mechanics (QM/MM) Hamiltonian recently implemented in the DivCon package. This approach was validated through the automatic treatment of a population of 80 protein-ligand structures chosen from the Astex Diverse Set. Across the entire population, this method results in an average 3.5-fold reduction in ligand strain and a 4.5-fold improvement in MolProbity clashscore, as well as improvements in Ramachandran and rotamer outlier analyses. Overall, these results demonstrate that the use of a structure-wide QM/MM Hamiltonian exhibits improvements in the local structural chemistry of the ligand similar to Region-QM refinement but with significant improvements in the overall structure beyond the active site.
Collapse
Affiliation(s)
- Oleg Borbulevych
- QuantumBio Inc., 2790 West College Avenue, State College, PA 16801, USA
| | - Roger I. Martin
- QuantumBio Inc., 2790 West College Avenue, State College, PA 16801, USA
| | | |
Collapse
|
10
|
Shabalin IG, Porebski PJ, Minor W. Refining the macromolecular model - achieving the best agreement with the data from X-ray diffraction experiment. CRYSTALLOGR REV 2018; 24:236-262. [PMID: 30416256 DOI: 10.1080/0889311x.2018.1521805] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Refinement of macromolecular X-ray crystal structures involves using complex software with hundreds of different settings. The complexity of underlying concepts and the sheer amount sof instructions may make it difficult for less experienced crystallographers to achieve optimal results in their refinements. This tutorial review offers guidelines for choosing the best settings for the reciprocal-space refinement of macromolecular models and provides practical tips for manual model correction. To help aspiring crystallographers navigate the process, some of the most practically important concepts of protein structure refinement are described. Among the topics covered are the use and purpose of R-free, geometrical restraints, restraints on atomic displacement parameters (ADPs), refinement weights, various parametrizations of ADPs (full anisotropic refinement and TLS), and omit maps. We also give practical tips for manual model correction in Coot, modelling of side-chains with poor or missing density, and ligand identification, fitting, and refinement.
Collapse
Affiliation(s)
- Ivan G Shabalin
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, United States.,Center for Structural Genomics of Infectious Diseases (CSGID), Charlottesville, VA, 22908, United States
| | - Przemyslaw J Porebski
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, United States.,Center for Structural Genomics of Infectious Diseases (CSGID), Charlottesville, VA, 22908, United States
| | - Wladek Minor
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22908, United States.,Center for Structural Genomics of Infectious Diseases (CSGID), Charlottesville, VA, 22908, United States
| |
Collapse
|
11
|
Agirre J. Strategies for carbohydrate model building, refinement and validation. Acta Crystallogr D Struct Biol 2017; 73:171-186. [PMID: 28177313 PMCID: PMC5297920 DOI: 10.1107/s2059798316016910] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 10/21/2016] [Indexed: 12/19/2022] Open
Abstract
Sugars are the most stereochemically intricate family of biomolecules and present substantial challenges to anyone trying to understand their nomenclature, reactions or branched structures. Current crystallographic programs provide an abstraction layer allowing inexpert structural biologists to build complete protein or nucleic acid model components automatically either from scratch or with little manual intervention. This is, however, still not generally true for sugars. The need for carbohydrate-specific building and validation tools has been highlighted a number of times in the past, concomitantly with the introduction of a new generation of experimental methods that have been ramping up the production of protein-sugar complexes and glycoproteins for the past decade. While some incipient advances have been made to address these demands, correctly modelling and refining carbohydrates remains a challenge. This article will address many of the typical difficulties that a structural biologist may face when dealing with carbohydrates, with an emphasis on problem solving in the resolution range where X-ray crystallography and cryo-electron microscopy are expected to overlap in the next decade.
Collapse
Affiliation(s)
- Jon Agirre
- York Structural Biology Laboratory, Department of Chemistry, University of York, York YO10 5DD, England
| |
Collapse
|
12
|
Abstract
Crystal structures of protein-ligand complexes are often used to infer biology and inform structure-based drug discovery. Hence, it is important to build accurate, reliable models of ligands that give confidence in the interpretation of the respective protein-ligand complex. This paper discusses key stages in the ligand-fitting process, including ligand binding-site identification, ligand description and conformer generation, ligand fitting, refinement and subsequent validation. The CCP4 suite contains a number of software tools that facilitate this task: AceDRG for the creation of ligand descriptions and conformers, Lidia and JLigand for two-dimensional and three-dimensional ligand editing and visual analysis, Coot for density interpretation, ligand fitting, analysis and validation, and REFMAC5 for macromolecular refinement. In addition to recent advancements in automatic carbohydrate building in Coot (LO/Carb) and ligand-validation tools (FLEV), the release of the CCP4i2 GUI provides an integrated solution that streamlines the ligand-fitting workflow, seamlessly passing results from one program to the next. The ligand-fitting process is illustrated using instructive practical examples, including problematic cases such as post-translational modifications, highlighting the need for careful analysis and rigorous validation.
Collapse
Affiliation(s)
- Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| |
Collapse
|
13
|
Long F, Nicholls RA, Emsley P, Gražulis S, Merkys A, Vaitkus A, Murshudov GN. AceDRG: a stereochemical description generator for ligands. Acta Crystallogr D Struct Biol 2017; 73:112-122. [PMID: 28177307 PMCID: PMC5297914 DOI: 10.1107/s2059798317000067] [Citation(s) in RCA: 221] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 01/03/2017] [Indexed: 11/11/2022] Open
Abstract
The program AceDRG is designed for the derivation of stereochemical information about small molecules. It uses local chemical and topological environment-based atom typing to derive and organize bond lengths and angles from a small-molecule database: the Crystallography Open Database (COD). Information about the hybridization states of atoms, whether they belong to small rings (up to seven-membered rings), ring aromaticity and nearest-neighbour information is encoded in the atom types. All atoms from the COD have been classified according to the generated atom types. All bonds and angles have also been classified according to the atom types and, in a certain sense, bond types. Derived data are tabulated in a machine-readable form that is freely available from CCP4. AceDRG can also generate stereochemical information, provided that the basic bonding pattern of a ligand is known. The basic bonding pattern is perceived from one of the computational chemistry file formats, including SMILES, mmCIF, SDF MOL and SYBYL MOL2 files. Using the bonding chemistry, atom types, and bond and angle tables generated from the COD, AceDRG derives the `ideal' bond lengths, angles, plane groups, aromatic rings and chirality information, and writes them to an mmCIF file that can be used by the refinement program REFMAC5 and the model-building program Coot. Other refinement and model-building programs such as PHENIX and BUSTER can also use these files. AceDRG also generates one or more coordinate sets corresponding to the most favourable conformation(s) of a given ligand. AceDRG employs RDKit for chemistry perception and for initial conformation generation, as well as for the interpretation of SMILES strings, SDF MOL and SYBYL MOL2 files.
Collapse
Affiliation(s)
- Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Saulius Gražulis
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Andrius Merkys
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Antanas Vaitkus
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| |
Collapse
|
14
|
Bax B, Chung CW, Edge C. Getting the chemistry right: protonation, tautomers and the importance of H atoms in biological chemistry. Acta Crystallogr D Struct Biol 2017; 73:131-140. [PMID: 28177309 PMCID: PMC5297916 DOI: 10.1107/s2059798316020283] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2016] [Accepted: 12/21/2016] [Indexed: 11/13/2023] Open
Abstract
There are more H atoms than any other type of atom in an X-ray crystal structure of a protein-ligand complex, but as H atoms only have one electron they diffract X-rays weakly and are `hard to see'. The positions of many H atoms can be inferred by our chemical knowledge, and such H atoms can be added with confidence in `riding positions'. For some chemical groups, however, there is more ambiguity over the possible hydrogen placements, for example hydroxyls and groups that can exist in multiple protonation states or tautomeric forms. This ambiguity is far from rare, since about 25% of drugs have more than one tautomeric form. This paper focuses on the most common, `prototropic', tautomers, which are isomers that readily interconvert by the exchange of an H atom accompanied by the switch of a single and an adjacent double bond. Hydrogen-exchange rates and different protonation states of compounds (e.g. buffers) are also briefly discussed. The difference in heavy (non-H) atom positions between two tautomers can be small, and careful refinement of all possible tautomers may single out the likely bound ligand tautomer. Experimental methods to determine H-atom positions, such as neutron crystallography, are often technically challenging. Therefore, chemical knowledge and computational approaches are frequently used in conjugation with experimental data to deduce the bound tautomer state. Proton movement is a key feature of many enzymatic reactions, so understanding the orchestration of hydrogen/proton motion is of critical importance to biological chemistry. For example, structural studies have suggested that, just as a chemist may use heat, some enzymes use directional movement to protonate specific O atoms on phosphates to catalyse phosphotransferase reactions. To inhibit `wriggly' enzymes that use movement to effect catalysis, it may be advantageous to have inhibitors that can maintain favourable contacts by adopting different tautomers as the enzyme `wriggles'.
Collapse
Affiliation(s)
- Ben Bax
- Structural Biology, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
- Platform Technology and Science, GlaxoSmithKline, Medicines Research Centre, Gunnels Wood Road, Stevenage SG1 2NY, England
| | - Chun-wa Chung
- Platform Technology and Science, GlaxoSmithKline, Medicines Research Centre, Gunnels Wood Road, Stevenage SG1 2NY, England
| | - Colin Edge
- Platform Technology and Science, GlaxoSmithKline, Medicines Research Centre, Gunnels Wood Road, Stevenage SG1 2NY, England
| |
Collapse
|
15
|
Long F, Nicholls RA, Emsley P, Gražulis S, Merkys A, Vaitkus A, Murshudov GN. Validation and extraction of molecular-geometry information from small-molecule databases. Acta Crystallogr D Struct Biol 2017; 73:103-111. [PMID: 28177306 PMCID: PMC5297913 DOI: 10.1107/s2059798317000079] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 01/03/2017] [Indexed: 11/18/2022] Open
Abstract
A freely available small-molecule structure database, the Crystallography Open Database (COD), is used for the extraction of molecular-geometry information on small-molecule compounds. The results are used for the generation of new ligand descriptions, which are subsequently used by macromolecular model-building and structure-refinement software. To increase the reliability of the derived data, and therefore the new ligand descriptions, the entries from this database were subjected to very strict validation. The selection criteria made sure that the crystal structures used to derive atom types, bond and angle classes are of sufficiently high quality. Any suspicious entries at a crystal or molecular level were removed from further consideration. The selection criteria included (i) the resolution of the data used for refinement (entries solved at 0.84 Å resolution or higher) and (ii) the structure-solution method (structures must be from a single-crystal experiment and all atoms of generated molecules must have full occupancies), as well as basic sanity checks such as (iii) consistency between the valences and the number of connections between atoms, (iv) acceptable bond-length deviations from the expected values and (v) detection of atomic collisions. The derived atom types and bond classes were then validated using high-order moment-based statistical techniques. The results of the statistical analyses were fed back to fine-tune the atom typing. The developed procedure was repeated four times, resulting in fine-grained atom typing, bond and angle classes. The procedure will be repeated in the future as and when new entries are deposited in the COD. The whole procedure can also be applied to any source of small-molecule structures, including the Cambridge Structural Database and the ZINC database.
Collapse
Affiliation(s)
- Fei Long
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Robert A. Nicholls
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Paul Emsley
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| | - Saulius Gražulis
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Andrius Merkys
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Antanas Vaitkus
- Institute of Biotechnology, Saulėtekio al. 7, LT-10257 Vilnius, Lithuania
| | - Garib N. Murshudov
- Structural Studies, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England
| |
Collapse
|