1
|
Patro LPP, Rathinavelan T. STRIDER: Steric hindrance and metal coordination identifier. Comput Biol Chem 2022; 98:107686. [DOI: 10.1016/j.compbiolchem.2022.107686] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 04/17/2022] [Accepted: 04/18/2022] [Indexed: 11/03/2022]
|
2
|
Fowler NJ, Sljoka A, Williamson MP. A method for validating the accuracy of NMR protein structures. Nat Commun 2020; 11:6321. [PMID: 33339822 PMCID: PMC7749147 DOI: 10.1038/s41467-020-20177-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 11/13/2020] [Indexed: 01/13/2023] Open
Abstract
We present a method that measures the accuracy of NMR protein structures. It compares random coil index [RCI] against local rigidity predicted by mathematical rigidity theory, calculated from NMR structures [FIRST], using a correlation score (which assesses secondary structure), and an RMSD score (which measures overall rigidity). We test its performance using: structures refined in explicit solvent, which are much better than unrefined structures; decoy structures generated for 89 NMR structures; and conventional predictors of accuracy such as number of restraints per residue, restraint violations, energy of structure, ensemble RMSD, Ramachandran distribution, and clashscore. Restraint violations and RMSD are poor measures of accuracy. Comparisons of NMR to crystal structures show that secondary structure is equally accurate, but crystal structures are typically too rigid in loops, whereas NMR structures are typically too floppy overall. We show that the method is a useful addition to existing measures of accuracy.
Collapse
Affiliation(s)
- Nicholas J Fowler
- Dept of Molecular Biology and Biotechnology, University of Sheffield, Sheffield, UK
| | - Adnan Sljoka
- RIKEN Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo, 103-0027, Japan.
- Dept of Chemistry, University of Toronto, UTM, 3359 Mississauga Road North, Mississauga, ON, L5L 1C6, Canada.
| | - Mike P Williamson
- Dept of Molecular Biology and Biotechnology, University of Sheffield, Sheffield, UK.
| |
Collapse
|
3
|
Allain F, Mareuil F, Ménager H, Nilges M, Bardiaux B. ARIAweb: a server for automated NMR structure calculation. Nucleic Acids Res 2020; 48:W41-W47. [PMID: 32383755 PMCID: PMC7319541 DOI: 10.1093/nar/gkaa362] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 04/14/2020] [Accepted: 04/28/2020] [Indexed: 11/13/2022] Open
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is a method of choice to study the dynamics and determine the atomic structure of macromolecules in solution. The standalone program ARIA (Ambiguous Restraints for Iterative Assignment) for automated assignment of nuclear Overhauser enhancement (NOE) data and structure calculation is well established in the NMR community. To ultimately provide a perfectly transparent and easy to use service, we designed an online user interface to ARIA with additional functionalities. Data conversion, structure calculation setup and execution, followed by interactive visualization of the generated 3D structures are all integrated in ARIAweb and freely accessible at https://ariaweb.pasteur.fr.
Collapse
Affiliation(s)
- Fabrice Allain
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry, CNRS UMR 3528, Institut Pasteur, Paris, 75015, France
| | - Fabien Mareuil
- Bioinformatics and Biostatistics Hub, Department of Computational Biology, CNRS USR 3756, Institut Pasteur, Paris, 75015, France
| | - Hervé Ménager
- Bioinformatics and Biostatistics Hub, Department of Computational Biology, CNRS USR 3756, Institut Pasteur, Paris, 75015, France
| | - Michael Nilges
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry, CNRS UMR 3528, Institut Pasteur, Paris, 75015, France
| | - Benjamin Bardiaux
- Structural Bioinformatics Unit, Department of Structural Biology and Chemistry, CNRS UMR 3528, Institut Pasteur, Paris, 75015, France
| |
Collapse
|
4
|
Wozniak PP, Pelc J, Skrzypecki M, Vriend G, Kotulska M. Bio-knowledge-based filters improve residue-residue contact prediction accuracy. Bioinformatics 2019; 34:3675-3683. [PMID: 29850768 DOI: 10.1093/bioinformatics/bty416] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Accepted: 05/19/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation Residue-residue contact prediction through direct coupling analysis has reached impressive accuracy, but yet higher accuracy will be needed to allow for routine modelling of protein structures. One way to improve the prediction accuracy is to filter predicted contacts using knowledge about the particular protein of interest or knowledge about protein structures in general. Results We focus on the latter and discuss a set of filters that can be used to remove false positive contact predictions. Each filter depends on one or a few cut-off parameters for which the filter performance was investigated. Combining all filters while using default parameters resulted for a test set of 851 protein domains in the removal of 29% of the predictions of which 92% were indeed false positives. Availability and implementation All data and scripts are available at http://comprec-lin.iiar.pwr.edu.pl/FPfilter/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P P Wozniak
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - J Pelc
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - M Skrzypecki
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| | - G Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - M Kotulska
- Department of Biomedical Engineering, Faculty of Fundamental Problems of Technology, Wroclaw University of Science and Technology, Wroclaw, Poland
| |
Collapse
|
5
|
Jiang F, Wu HN, Kang W, Wu YD. Developments and Applications of Coil-Library-Based Residue-Specific Force Fields for Molecular Dynamics Simulations of Peptides and Proteins. J Chem Theory Comput 2019; 15:2761-2773. [DOI: 10.1021/acs.jctc.8b00794] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Fan Jiang
- Laboratory of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
| | - Hao-Nan Wu
- Laboratory of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
| | - Wei Kang
- Laboratory of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
- College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| | - Yun-Dong Wu
- Laboratory of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
- College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China
| |
Collapse
|
6
|
Ehrt C, Brinkjost T, Koch O. A benchmark driven guide to binding site comparison: An exhaustive evaluation using tailor-made data sets (ProSPECCTs). PLoS Comput Biol 2018; 14:e1006483. [PMID: 30408032 PMCID: PMC6224041 DOI: 10.1371/journal.pcbi.1006483] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2018] [Accepted: 09/02/2018] [Indexed: 11/24/2022] Open
Abstract
The automated comparison of protein-ligand binding sites provides useful insights into yet unexplored site similarities. Various stages of computational and chemical biology research can benefit from this knowledge. The search for putative off-targets and the establishment of polypharmacological effects by comparing binding sites led to promising results for numerous projects. Although many cavity comparison methods are available, a comprehensive analysis to guide the choice of a tool for a specific application is wanting. Moreover, the broad variety of binding site modeling approaches, comparison algorithms, and scoring metrics impedes this choice. Herein, we aim to elucidate strengths and weaknesses of binding site comparison methodologies. A detailed benchmark study is the only possibility to rationalize the selection of appropriate tools for different scenarios. Specific evaluation data sets were developed to shed light on multiple aspects of binding site comparison. An assembly of all applied benchmark sets (ProSPECCTs–Protein Site Pairs for the Evaluation of Cavity Comparison Tools) is made available for the evaluation and optimization of further and still emerging methods. The results indicate the importance of such analyses to facilitate the choice of a methodology that complies with the requirements of a specific scientific challenge. Binding site similarities are useful in the context of promiscuity prediction, drug repurposing, the analysis of protein-ligand and protein-protein complexes, function prediction, and further fields of general interest in chemical biology and biochemistry. Many years of research have led to the development of a multitude of methods for binding site analysis and comparison. On the one hand, their availability supports research. On the other hand, the huge number of methods hampers the efficient selection of a specific tool. Our research is dedicated to the analysis of different cavity comparison tools. We use several binding site data sets to establish guidelines which can be applied to ensure a successful application of comparison methods by circumventing potential pitfalls.
Collapse
Affiliation(s)
- Christiane Ehrt
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Dortmund, Germany
| | - Tobias Brinkjost
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Dortmund, Germany
- Department of Computer Science, TU Dortmund University, Dortmund, Germany
| | - Oliver Koch
- Faculty of Chemistry and Chemical Biology, TU Dortmund University, Dortmund, Germany
- * E-mail: ,
| |
Collapse
|
7
|
Sala D, Musiani F, Rosato A. Application of Molecular Dynamics to the Investigation of Metalloproteins Involved in Metal Homeostasis. Eur J Inorg Chem 2018. [DOI: 10.1002/ejic.201800602] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Davide Sala
- Magnetic Resonance Center (CERM); University of Florence; Via Luigi Sacconi 6 50019 Sesto Fiorentino Italy
| | - Francesco Musiani
- Laboratory of Bioinorganic Chemistry; Department of Pharmacy and Biotechnology; University of Bologna; Viale Giuseppe Fanin 40, I 40127 Bologna Italy
| | - Antonio Rosato
- Magnetic Resonance Center (CERM); University of Florence; Via Luigi Sacconi 6 50019 Sesto Fiorentino Italy
- Consorzio Interuniversitario di Risonanze Magnetiche di Metallo Proteine; Via Luigi Sacconi 6 50019 Sesto Fiorentino Italy
- Department of Chemistry; University of Florence; Via della Lastruccia 3 50019 Sesto Fiorentino Italy
| |
Collapse
|
8
|
Unraveling the meaning of chemical shifts in protein NMR. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2017; 1865:1564-1576. [PMID: 28716441 DOI: 10.1016/j.bbapap.2017.07.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Revised: 06/29/2017] [Accepted: 07/07/2017] [Indexed: 12/14/2022]
Abstract
Chemical shifts are among the most informative parameters in protein NMR. They provide wealth of information about protein secondary and tertiary structure, protein flexibility, and protein-ligand binding. In this report, we review the progress in interpreting and utilizing protein chemical shifts that has occurred over the past 25years, with a particular focus on the large body of work arising from our group and other Canadian NMR laboratories. More specifically, this review focuses on describing, assessing, and providing some historical context for various chemical shift-based methods to: (1) determine protein secondary and super-secondary structure; (2) derive protein torsion angles; (3) assess protein flexibility; (4) predict residue accessible surface area; (5) refine 3D protein structures; (6) determine 3D protein structures and (7) characterize intrinsically disordered proteins. This review also briefly covers some of the methods that we previously developed to predict chemical shifts from 3D protein structures and/or protein sequence data. It is hoped that this review will help to increase awareness of the considerable utility of NMR chemical shifts in structural biology and facilitate more widespread adoption of chemical-shift based methods by the NMR spectroscopists, structural biologists, protein biophysicists, and biochemists worldwide. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman.
Collapse
|
9
|
Abstract
Halogen bonding (X-bonding) has attracted notable attention among noncovalent interactions. This highly directional attraction between a halogen atom and an electron donor has been exploited in knowledge-based drug design. A great deal of information has been gathered about X-bonds in protein-ligand complexes, as opposed to nucleic acid complexes. Here we provide a thorough analysis of nucleic acid complexes containing either halogenated building blocks or halogenated ligands. We analyzed close contacts between halogens and electron-rich moieties. The phosphate backbone oxygen is clearly the most common halogen acceptor. We identified 21 X-bonds within known structures of nucleic acid complexes. A vast majority of the X-bonds is formed by halogenated nucleobases, such as bromouridine, and feature excellent geometries. Noncovalent ligands have been found to form only interactions with suboptimal interaction geometries. Hence, the first X-bonded nucleic acid binder remains to be discovered.
Collapse
Affiliation(s)
- Michal H Kolář
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences , Flemingovo nam. 2, 16610 Prague, Czech Republic
| | - Oriana Tabarrini
- Department of Pharmaceutical Sciences, University of Perugia , Via del Liceo 1, I-06123 Perugia, Italy
| |
Collapse
|
10
|
NMR-based automated protein structure determination. Arch Biochem Biophys 2017; 628:24-32. [PMID: 28263718 DOI: 10.1016/j.abb.2017.02.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Revised: 02/18/2017] [Accepted: 02/28/2017] [Indexed: 11/21/2022]
Abstract
NMR spectra analysis for protein structure determination can now in many cases be performed by automated computational methods. This overview of the computational methods for NMR protein structure analysis presents recent automated methods for signal identification in multidimensional NMR spectra, sequence-specific resonance assignment, collection of conformational restraints, and structure calculation, as implemented in the CYANA software package. These algorithms are sufficiently reliable and integrated into one software package to enable the fully automated structure determination of proteins starting from NMR spectra without manual interventions or corrections at intermediate steps, with an accuracy of 1-2 Å backbone RMSD in comparison with manually solved reference structures.
Collapse
|
11
|
Wozniak PP, Vriend G, Kotulska M. Correlated mutations select misfolded from properly folded proteins. Bioinformatics 2017; 33:1497-1504. [PMID: 28203707 DOI: 10.1093/bioinformatics/btx013] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2016] [Accepted: 01/11/2017] [Indexed: 11/14/2022] Open
Affiliation(s)
- P P Wozniak
- Faculty of Fundamental Problems of Technology, Department of Biomedical Engineering, Wrocław University of Science and Technology, Wrocław, Poland
| | - G Vriend
- Centre for Molecular and Biomolecular Informatics, Radboud University Medical Centre, Nijmegen, The Netherlands
| | - M Kotulska
- Faculty of Fundamental Problems of Technology, Department of Biomedical Engineering, Wrocław University of Science and Technology, Wrocław, Poland
| |
Collapse
|
12
|
Núñez Miguel R, Sanders J, Furmaniak J, Rees Smith B. Glycosylation pattern analysis of glycoprotein hormones and their receptors. J Mol Endocrinol 2017; 58:25-41. [PMID: 27875255 DOI: 10.1530/jme-16-0169] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Accepted: 11/13/2016] [Indexed: 11/08/2022]
Abstract
We have studied glycosylation patterns in glycoprotein hormones (GPHs) and glycoprotein hormone receptor (GPHR) extracellular domains (ECD) from different species to identify areas not glycosylated that could be involved in intermolecular or intramolecular interactions. Comparative models of the structure of the TSHR ECD in complex with TSH and in complex with TSHR autoantibodies (M22, stimulating and K1-70, blocking) were obtained based on the crystal structures of the FSH-FSHR ECD, M22-TSHR leucine-rich repeat domain (LRD) and K1-70-TSHR LRD complexes. The glycosylation sites of the GPHRs and GPHs from all species studied were mapped on the model of the human TSH TSHR ECD complex. The areas on the surfaces of GPHs that are known to interact with their receptors are not glycosylated and two areas free from glycosylation, not involved in currently known interactions, have been identified. The concave faces of GPHRs leucine-rich repeats 3-7 are free from glycosylation, consistent with known interactions with the hormones. In addition, four other non-glycosylated areas have been identified, two located on the receptors' convex surfaces, one in the long loop of the hinge regions and one at the C-terminus of the extracellular domains. Experimental evidence suggests that the non-glycosylated areas identified on the hormones and receptors are likely to be involved in forming intramolecular or intermolecular interactions.
Collapse
|
13
|
Sala D, Giachetti A, Luchinat C, Rosato A. A protocol for the refinement of NMR structures using simultaneously pseudocontact shift restraints from multiple lanthanide ions. JOURNAL OF BIOMOLECULAR NMR 2016; 66:175-185. [PMID: 27771862 DOI: 10.1007/s10858-016-0065-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2016] [Accepted: 09/29/2016] [Indexed: 06/06/2023]
Abstract
The binding of paramagnetic metal ions to proteins produces a number of different effects on the NMR spectra of the system. In particular, when the magnetic susceptibility of the metal ion is anisotropic, pseudocontact shifts (PCSs) arise and can be easily measured. They constitute very useful restraints for the solution structure determination of metal-binding proteins. In this context, there has been great interest in the use of lanthanide(III) ions to induce PCSs in diamagnetic proteins, e.g. through the replacement native calcium(II) ions. By preparing multiple samples in each of which a different ion of the lanthanide series is introduced, it is possible to obtain multiple independent PCS datasets that can be used synergistically to generate protein structure ensembles (typically called bundles). For typical NMR-based determination of protein structure, it is necessary to perform an energetic refinement of such initial bundles to obtain final structures whose geometric quality is suitable for deposition in the PDB. This can be conveniently done by using restrained molecular dynamics simulations (rMD) in explicit solvent. However, there are no available protocols for rMD using multiple PCS datasets as part of the restraints. In this work, we extended the PCS module of the AMBER MD package to handle multiple datasets and tuned a previously developed protocol for NMR structure refinement to achieve consistent convergence with PCS restraints. Test calculations with real experimental data show that this new implementation delivers the expected improvement of protein geometry, resulting in final structures that are of suitable quality for deposition. Furthermore, we observe that also initial structures generated only with traditional restraints can be successfully refined using traditional and PCS restraints simultaneously.
Collapse
Affiliation(s)
- Davide Sala
- Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy
| | - Andrea Giachetti
- Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy
| | - Claudio Luchinat
- Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy.
- Department of Chemistry, University of Florence, Via della Lastruccia 3, 50019, Sesto Fiorentino, Italy.
| | - Antonio Rosato
- Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy.
- Department of Chemistry, University of Florence, Via della Lastruccia 3, 50019, Sesto Fiorentino, Italy.
| |
Collapse
|
14
|
Geng H, Jiang F, Wu YD. Accurate Structure Prediction and Conformational Analysis of Cyclic Peptides with Residue-Specific Force Fields. J Phys Chem Lett 2016; 7:1805-10. [PMID: 27128113 DOI: 10.1021/acs.jpclett.6b00452] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Cyclic peptides (CPs) are promising candidates for drugs, chemical biology tools, and self-assembling nanomaterials. However, the development of reliable and accurate computational methods for their structure prediction has been challenging. Here, 20 all-trans CPs of 5-12 residues selected from Cambridge Structure Database have been simulated using replica-exchange molecular dynamics with four different force fields. Our recently developed residue-specific force fields RSFF1 and RSFF2 can correctly identify the crystal-like conformations of more than half CPs as the most populated conformation. The RSFF2 performs the best, which consistently predicts the crystal structures of 17 out of 20 CPs with rmsd < 1.1 Å. We also compared the backbone (ϕ, ψ) sampling of residues in CPs with those in short linear peptides and in globular proteins. In general, unlike linear peptides, CPs have local conformational free energies and entropies quite similar to globular proteins.
Collapse
Affiliation(s)
- Hao Geng
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen 518055, China
| | - Fan Jiang
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen 518055, China
| | - Yun-Dong Wu
- Laboratory of Computational Chemistry and Drug Design, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School , Shenzhen 518055, China
- College of Chemistry and Molecular Engineering, Peking University , Beijing 100871, China
| |
Collapse
|
15
|
Li W, Schaeffer RD, Otwinowski Z, Grishin NV. Estimation of Uncertainties in the Global Distance Test (GDT_TS) for CASP Models. PLoS One 2016; 11:e0154786. [PMID: 27149620 PMCID: PMC4858170 DOI: 10.1371/journal.pone.0154786] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 04/19/2016] [Indexed: 11/19/2022] Open
Abstract
The Critical Assessment of techniques for protein Structure Prediction (or CASP) is a community-wide blind test experiment to reveal the best accomplishments of structure modeling. Assessors have been using the Global Distance Test (GDT_TS) measure to quantify prediction performance since CASP3 in 1998. However, identifying significant score differences between close models is difficult because of the lack of uncertainty estimations for this measure. Here, we utilized the atomic fluctuations caused by structure flexibility to estimate the uncertainty of GDT_TS scores. Structures determined by nuclear magnetic resonance are deposited as ensembles of alternative conformers that reflect the structural flexibility, whereas standard X-ray refinement produces the static structure averaged over time and space for the dynamic ensembles. To recapitulate the structural heterogeneous ensemble in the crystal lattice, we performed time-averaged refinement for X-ray datasets to generate structural ensembles for our GDT_TS uncertainty analysis. Using those generated ensembles, our study demonstrates that the time-averaged refinements produced structure ensembles with better agreement with the experimental datasets than the averaged X-ray structures with B-factors. The uncertainty of the GDT_TS scores, quantified by their standard deviations (SDs), increases for scores lower than 50 and 70, with maximum SDs of 0.3 and 1.23 for X-ray and NMR structures, respectively. We also applied our procedure to the high accuracy version of GDT-based score and produced similar results with slightly higher SDs. To facilitate score comparisons by the community, we developed a user-friendly web server that produces structure ensembles for NMR and X-ray structures and is accessible at http://prodata.swmed.edu/SEnCS. Our work helps to identify the significance of GDT_TS score differences, as well as to provide structure ensembles for estimating SDs of any scores.
Collapse
Affiliation(s)
- Wenlin Li
- Department of Biochemistry and Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390–9050, United States of America
| | - R. Dustin Schaeffer
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390–9050, United States of America
| | - Zbyszek Otwinowski
- Department of Biochemistry and Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390–9050, United States of America
| | - Nick V. Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390–9050, United States of America
- Department of Biochemistry and Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390–9050, United States of America
- * E-mail:
| |
Collapse
|
16
|
Mareuil F, Malliavin TE, Nilges M, Bardiaux B. Improved reliability, accuracy and quality in automated NMR structure calculation with ARIA. JOURNAL OF BIOMOLECULAR NMR 2015; 62:425-438. [PMID: 25861734 PMCID: PMC4569677 DOI: 10.1007/s10858-015-9928-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 04/03/2015] [Indexed: 05/30/2023]
Abstract
In biological NMR, assignment of NOE cross-peaks and calculation of atomic conformations are critical steps in the determination of reliable high-resolution structures. ARIA is an automated approach that performs NOE assignment and structure calculation in a concomitant manner in an iterative procedure. The log-harmonic shape for distance restraint potential and the Bayesian weighting of distance restraints, recently introduced in ARIA, were shown to significantly improve the quality and the accuracy of determined structures. In this paper, we propose two modifications of the ARIA protocol: (1) the softening of the force field together with adapted hydrogen radii, which is meaningful in the context of the log-harmonic potential with Bayesian weighting, (2) a procedure that automatically adjusts the violation tolerance used in the selection of active restraints, based on the fitting of the structure to the input data sets. The new ARIA protocols were fine-tuned on a set of eight protein targets from the CASD-NMR initiative. As a result, the convergence problems previously observed for some targets was resolved and the obtained structures exhibited better quality. In addition, the new ARIA protocols were applied for the structure calculation of ten new CASD-NMR targets in a blind fashion, i.e. without knowing the actual solution. Even though optimisation of parameters and pre-filtering of unrefined NOE peak lists were necessary for half of the targets, ARIA consistently and reliably determined very precise and highly accurate structures for all cases. In the context of integrative structural biology, an increasing number of experimental methods are used that produce distance data for the determination of 3D structures of macromolecules, stressing the importance of methods that successfully make use of ambiguous and noisy distance data.
Collapse
Affiliation(s)
- Fabien Mareuil
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Institut Pasteur, 25-28 rue du Dr Roux, 75724, Paris Cedex 15, France
- Cellule d'Informatique pour la Biologie, Institut Pasteur, 25-28 rue du Dr Roux, 75724, Paris Cedex 15, France
| | - Thérèse E Malliavin
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Institut Pasteur, 25-28 rue du Dr Roux, 75724, Paris Cedex 15, France
| | - Michael Nilges
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Institut Pasteur, 25-28 rue du Dr Roux, 75724, Paris Cedex 15, France
| | - Benjamin Bardiaux
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Institut Pasteur, 25-28 rue du Dr Roux, 75724, Paris Cedex 15, France.
| |
Collapse
|
17
|
Zhu T, Zhang JZH, He X. Correction of erroneously packed protein's side chains in the NMR structure based on ab initio chemical shift calculations. Phys Chem Chem Phys 2015; 16:18163-9. [PMID: 25052367 DOI: 10.1039/c4cp02553a] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this work, protein side chain (1)H chemical shifts are used as probes to detect and correct side-chain packing errors in protein's NMR structures through structural refinement. By applying the automated fragmentation quantum mechanics/molecular mechanics (AF-QM/MM) method for ab initio calculation of chemical shifts, incorrect side chain packing was detected in the NMR structures of the Pin1 WW domain. The NMR structure is then refined by using molecular dynamics simulation and the polarized protein-specific charge (PPC) model. The computationally refined structure of the Pin1 WW domain is in excellent agreement with the corresponding X-ray structure. In particular, the use of the PPC model yields a more accurate structure than that using the standard (nonpolarizable) force field. For comparison, some of the widely used empirical models for chemical shift calculations are unable to correctly describe the relationship between the particular proton chemical shift and protein structures. The AF-QM/MM method can be used as a powerful tool for protein NMR structure validation and structural flaw detection.
Collapse
Affiliation(s)
- Tong Zhu
- State Key Laboratory of Precision Spectroscopy, Institute of Theoretical and Computational Science, East China Normal University, Shanghai, 200062, China.
| | | | | |
Collapse
|
18
|
Buchner L, Güntert P. Systematic evaluation of combined automated NOE assignment and structure calculation with CYANA. JOURNAL OF BIOMOLECULAR NMR 2015; 62:81-95. [PMID: 25796507 DOI: 10.1007/s10858-015-9921-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 03/16/2015] [Indexed: 05/07/2023]
Abstract
The automated assignment of NOESY cross peaks has become a fundamental technique for NMR protein structure analysis. A widely used algorithm for this purpose is implemented in the program CYANA. It has been used for a large number of structure determinations of proteins in solution but a systematic evaluation of its performance has not yet been reported. In this paper we systematically analyze the reliability of combined automated NOESY assignment and structure calculation with CYANA under a variety of conditions on the basis of the experimental NMR data sets of ten proteins. To evaluate the robustness of the algorithm, the original high-quality experimental data sets were modified in different ways to simulate the effect of data imperfections, i.e. incomplete or erroneous chemical shift assignments, missing NOESY cross peaks, inaccurate peak positions, inaccurate peak intensities, lower dimensionality NOESY spectra, and higher tolerances for the matching of chemical shifts and peak positions. The results show that the algorithm is remarkably robust with regard to imperfections of the NOESY peak lists and the chemical shift tolerances but susceptible to lacking or erroneous resonance assignments, in particular for nuclei that are involved in many NOESY cross peaks.
Collapse
Affiliation(s)
- Lena Buchner
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, and Frankfurt Institute of Advanced Studies, Goethe University Frankfurt am Main, Max-von-Laue-Str. 9, 60438, Frankfurt am Main, Germany
| | | |
Collapse
|
19
|
Cassioli A, Bardiaux B, Bouvier G, Mucherino A, Alves R, Liberti L, Nilges M, Lavor C, Malliavin TE. An algorithm to enumerate all possible protein conformations verifying a set of distance constraints. BMC Bioinformatics 2015; 16:23. [PMID: 25627244 PMCID: PMC4384350 DOI: 10.1186/s12859-015-0451-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 01/05/2015] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND The determination of protein structures satisfying distance constraints is an important problem in structural biology. Whereas the most common method currently employed is simulated annealing, there have been other methods previously proposed in the literature. Most of them, however, are designed to find one solution only. RESULTS In order to explore exhaustively the feasible conformational space, we propose here an interval Branch-and-Prune algorithm (iBP) to solve the Distance Geometry Problem (DGP) associated to protein structure determination. This algorithm is based on a discretization of the problem obtained by recursively constructing a search space having the structure of a tree, and by verifying whether the generated atomic positions are feasible or not by making use of pruning devices. The pruning devices used here are directly related to features of protein conformations. CONCLUSIONS We described the new algorithm iBP to generate protein conformations satisfying distance constraints, that would potentially allows a systematic exploration of the conformational space. The algorithm iBP has been applied on three α-helical peptides.
Collapse
Affiliation(s)
| | - Benjamin Bardiaux
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Guillaume Bouvier
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | | | - Rafael Alves
- LIX, Ecole Polytechnique, Palaiseau, 91128, France.
| | - Leo Liberti
- LIX, Ecole Polytechnique, Palaiseau, 91128, France. .,IBM TJ Watson Research Center, NY Yorktown Heights, 10598, USA.
| | - Michael Nilges
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Carlile Lavor
- University of Campinas (IMECC-UNICAMP), Campinas-SP, 13083-859, Brasil.
| | - Thérèse E Malliavin
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| |
Collapse
|
20
|
Buchner L, Güntert P. Increased reliability of nuclear magnetic resonance protein structures by consensus structure bundles. Structure 2015; 23:425-34. [PMID: 25579816 DOI: 10.1016/j.str.2014.11.014] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 11/17/2014] [Accepted: 11/17/2014] [Indexed: 11/17/2022]
Abstract
Nuclear magnetic resonance (NMR) structures are represented by bundles of conformers calculated from different randomized initial structures using identical experimental input data. The spread among these conformers indicates the precision of the atomic coordinates. However, there is as yet no reliable measure of structural accuracy, i.e., how close NMR conformers are to the "true" structure. Instead, the precision of structure bundles is widely (mis)interpreted as a measure of structural quality. Attempts to increase precision often overestimate accuracy by tight bundles of high precision but much lower accuracy. To overcome this problem, we introduce a protocol for NMR structure determination with the software package CYANA, which produces, like the traditional method, bundles of conformers in agreement with a common set of conformational restraints but with a realistic precision that is, throughout a variety of proteins and NMR data sets, a much better estimate of structural accuracy than the precision of conventional structure bundles.
Collapse
Affiliation(s)
- Lena Buchner
- Institute of Biophysical Chemistry and Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt, Max-von-Laue-Straße 9, 60438 Frankfurt am Main, Germany
| | - Peter Güntert
- Institute of Biophysical Chemistry and Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt, Max-von-Laue-Straße 9, 60438 Frankfurt am Main, Germany; Frankfurt Institute of Advanced Studies, Goethe University Frankfurt, Ruth-Moufang-Straße 1, 60438 Frankfurt am Main, Germany; Department of Chemistry, Graduate School of Science and Engineering, Tokyo Metropolitan University, 1-1 Minami-Ohsawa, Hachioji, Tokyo 192-0397, Japan.
| |
Collapse
|
21
|
Mishra SH, Harden BJ, Frueh DP. A 3D time-shared NOESY experiment designed to provide optimal resolution for accurate assignment of NMR distance restraints in large proteins. JOURNAL OF BIOMOLECULAR NMR 2014; 60:265-74. [PMID: 25381567 PMCID: PMC4245328 DOI: 10.1007/s10858-014-9873-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 10/31/2014] [Indexed: 05/21/2023]
Abstract
Structure determination of proteins by solution NMR has become an established method, but challenges increase steeply with the size of proteins. Notably, spectral crowding and signal overlap impair the analysis of cross-peaks in NOESY spectra that provide distance restraints for structural models. An optimal spectral resolution can alleviate overlap but requires prohibitively long experimental time with existing methods. Here we present a time-shared 3D experiment optimized for large proteins that provides ¹⁵N and ¹³C dispersed NOESY spectra in a single measurement. NOESY correlations appear in the detected dimension and hence benefit from the highest resolution achievable of all dimensions without increase in experimental time. By design, this experiment is inherently optimal for non-uniform sampling acquisition when compared to current alternatives. Thus, ¹⁵N and ¹³C dispersed NOESY spectra with ultra-high resolution in all dimensions were acquired in parallel within about 4 days instead of 80 days for a 52 kDa monomeric protein at a concentration of 350 μM.
Collapse
Affiliation(s)
- Subrata H Mishra
- Department of Biophysics & Biophysical Chemistry, Johns Hopkins University School of Medicine, 701 Hunterian, 725 North Wolfe St., Baltimore, MD 21205, USA; Fax: 410-955-0637, Phone: 410-614-4719
| | - Bradley J Harden
- Department of Biophysics & Biophysical Chemistry, Johns Hopkins University School of Medicine, 701 Hunterian, 725 North Wolfe St., Baltimore, MD 21205, USA; Fax: 410-955-0637, Phone: 410-614-4719
| | - Dominique P Frueh
- Department of Biophysics & Biophysical Chemistry, Johns Hopkins University School of Medicine, 701 Hunterian, 725 North Wolfe St., Baltimore, MD 21205, USA; Fax: 410-955-0637, Phone: 410-614-4719
| |
Collapse
|
22
|
Vranken WF. NMR structure validation in relation to dynamics and structure determination. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2014; 82:27-38. [PMID: 25444697 DOI: 10.1016/j.pnmrs.2014.08.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 08/14/2014] [Accepted: 08/14/2014] [Indexed: 06/04/2023]
Abstract
NMR spectroscopy is a key technique for understanding the behaviour of proteins, especially highly dynamic proteins that adopt multiple conformations in solution. Overall, protein structures determined from NMR spectroscopy data constitute just over 10% of the Protein Data Bank archive. This review covers the validation of these NMR protein structures, but rather than describing currently available methodology, it focuses on concepts that are important for understanding where and how validation is most relevant. First, the inherent characteristics of the protein under study have an influence on quality and quantity of the distinct types of data that can be acquired from NMR experiments. Second, these NMR data are necessarily transformed into a model for use in a structure calculation protocol, and the protein structures that result from this reflect the types of NMR data used as well as the protein characteristics. The validation of NMR protein structures should therefore take account, wherever possible, of the inherent behavioural characteristics of the protein, the types of available NMR data, and the calculation protocol. These concepts are discussed in the context of 'knowledge based' and 'model versus data' validation, with suggestions for questions to ask and different validation categories to consider. The principal aim of this review is to stimulate discussion and to help the reader understand the relationships between the above elements in order to make informed decisions on which validation approaches are the most relevant in particular cases.
Collapse
Affiliation(s)
- Wim F Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Department of Structural Biology, VIB, 1050 Brussels, Belgium; Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, La Plaine Campus, Triomflaan, BC Building, 6th Floor, CP 263, 1050 Brussels, Belgium.
| |
Collapse
|
23
|
Sripakdeevong P, Cevec M, Chang AT, Erat MC, Ziegeler M, Zhao Q, Fox GE, Gao X, Kennedy SD, Kierzek R, Nikonowicz EP, Schwalbe H, Sigel RKO, Turner DH, Das R. Structure determination of noncanonical RNA motifs guided by ¹H NMR chemical shifts. Nat Methods 2014; 11:413-6. [PMID: 24584194 PMCID: PMC3985481 DOI: 10.1038/nmeth.2876] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 01/06/2014] [Indexed: 12/31/2022]
Abstract
Structured noncoding RNAs underlie fundamental cellular processes, but determining their three-dimensional structures remains challenging. We demonstrate that integrating ¹H NMR chemical shift data with Rosetta de novo modeling can be used to consistently determine high-resolution RNA structures. On a benchmark set of 23 noncanonical RNA motifs, including 11 'blind' targets, chemical-shift Rosetta for RNA (CS-Rosetta-RNA) recovered experimental structures with high accuracy (0.6-2.0 Å all-heavy-atom r.m.s. deviation) in 18 cases.
Collapse
Affiliation(s)
| | - Mirko Cevec
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Andrew T Chang
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Michèle C Erat
- 1] Department of Biochemistry, University of Oxford, Oxford, UK. [2] Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Melanie Ziegeler
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Qin Zhao
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - George E Fox
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Xiaolian Gao
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, New York, USA
| | - Ryszard Kierzek
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Edward P Nikonowicz
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Harald Schwalbe
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Roland K O Sigel
- Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Douglas H Turner
- Department of Chemistry, University of Rochester, Rochester, New York, USA
| | - Rhiju Das
- 1] Biophysics Program, Stanford University, Stanford, California, USA. [2] Department of Biochemistry, Stanford University, Stanford, California, USA. [3] Department of Physics, Stanford University, Stanford, California, USA
| |
Collapse
|
24
|
Vuister GW, Fogh RH, Hendrickx PMS, Doreleijers JF, Gutmanas A. An overview of tools for the validation of protein NMR structures. JOURNAL OF BIOMOLECULAR NMR 2014; 58:259-285. [PMID: 23877928 DOI: 10.1007/s10858-013-9750-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Accepted: 06/04/2013] [Indexed: 06/02/2023]
Abstract
Biomolecular structures at atomic resolution present a valuable resource for the understanding of biology. NMR spectroscopy accounts for 11% of all structures in the PDB repository. In response to serious problems with the accuracy of some of the NMR-derived structures and in order to facilitate proper analysis of the experimental models, a number of program suites are available. We discuss nine of these tools in this review: PROCHECK-NMR, PSVS, GLM-RMSD, CING, Molprobity, Vivaldi, ResProx, NMR constraints analyzer and QMEAN. We evaluate these programs for their ability to assess the structural quality, restraints and their violations, chemical shifts, peaks and the handling of multi-model NMR ensembles. We document both the input required by the programs and output they generate. To discuss their relative merits we have applied the tools to two representative examples from the PDB: a small, globular monomeric protein (Staphylococcal nuclease from S. aureus, PDB entry 2kq3) and a small, symmetric homodimeric protein (a region of human myosin-X, PDB entry 2lw9).
Collapse
Affiliation(s)
- Geerten W Vuister
- Department of Biochemistry, School of Biological Sciences, University of Leicester, Henry Wellcome Building, Lancaster Road, Leicester, LE1 9HN, UK,
| | | | | | | | | |
Collapse
|
25
|
Vila JA, Arnautova YA. 13C Chemical Shifts in Proteins: A Rich Source of Encoded Structural Information. COMPUTATIONAL METHODS TO STUDY THE STRUCTURE AND DYNAMICS OF BIOMOLECULES AND BIOMOLECULAR PROCESSES 2014. [PMCID: PMC7121069 DOI: 10.1007/978-3-642-28554-7_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Despite the formidable progress in Nuclear Magnetic Resonance (NMR) spectroscopy, quality assessment of NMR-derived structures remains as an important problem. Thus, validation of protein structures is essential for the spectroscopists, since it could enable them to detect structural flaws and potentially guide their efforts in further refinement. Moreover, availability of accurate and efficient validation tools would help molecular biologists and computational chemists to evaluate quality of available experimental structures and to select a protein model which is the most suitable for a given scientific problem. The 13Cα nuclei are ubiquitous in proteins, moreover, their shieldings are easily obtainable from NMR experiments and represent a rich source of encoded structural information that makes 13Cα chemical shifts an attractive candidate for use in computational methods aimed at determination and validation of protein structures. In this chapter, the basis of a novel methodology of computing, at the quantum chemical level of theory, the 13Cα shielding for the amino acid residues in proteins is described. We also identify and examine the main factors affecting the 13Cα-shielding computation. Finally, we illustrate how the information encoded in the 13C chemical shifts can be used for a number of applications, viz., from protein structure prediction of both α-helical and β-sheet conformations, to determination of the fraction of the tautomeric forms of the imidazole ring of histidine in proteins as a function of pH or to accurate detection of structural flaws, at a residue-level, in NMR-determined protein models.
Collapse
|
26
|
Validation of metal-binding sites in macromolecular structures with the CheckMyMetal web server. Nat Protoc 2013; 9:156-70. [PMID: 24356774 DOI: 10.1038/nprot.2013.172] [Citation(s) in RCA: 227] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Metals have vital roles in both the mechanism and architecture of biological macromolecules. Yet structures of metal-containing macromolecules in which metals are misidentified and/or suboptimally modeled are abundant in the Protein Data Bank (PDB). This shows the need for a diagnostic tool to identify and correct such modeling problems with metal-binding environments. The CheckMyMetal (CMM) web server (http://csgid.org/csgid/metal_sites/) is a sophisticated, user-friendly web-based method to evaluate metal-binding sites in macromolecular structures using parameters derived from 7,350 metal-binding sites observed in a benchmark data set of 2,304 high-resolution crystal structures. The protocol outlines how the CMM server can be used to detect geometric and other irregularities in the structures of metal-binding sites, as well as how it can alert researchers to potential errors in metal assignment. The protocol also gives practical guidelines for correcting problematic sites by modifying the metal-binding environment and/or redefining metal identity in the PDB file. Several examples where this has led to meaningful results are described in the ANTICIPATED RESULTS section. CMM was designed for a broad audience--biomedical researchers studying metal-containing proteins and nucleic acids--but it is equally well suited for structural biologists validating new structures during modeling or refinement. The CMM server takes the coordinates of a metal-containing macromolecule structure in the PDB format as input and responds within a few seconds for a typical protein structure with 2-5 metal sites and a few hundred amino acids.
Collapse
|
27
|
Martin OA, Arnautova YA, Icazatti AA, Scheraga HA, Vila JA. Physics-based method to validate and repair flaws in protein structures. Proc Natl Acad Sci U S A 2013; 110:16826-31. [PMID: 24082119 PMCID: PMC3801053 DOI: 10.1073/pnas.1315525110] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A method that makes use of information provided by the combination of (13)C(α) and (13)C(β) chemical shifts, computed at the density functional level of theory, enables one to (i) validate, at the residue level, conformations of proteins and detect backbone or side-chain flaws by taking into account an ensemble average of chemical shifts over all of the conformations used to represent a protein, with a sensitivity of ∼90%; and (ii) provide a set of (χ1/χ2) torsional angles that leads to optimal agreement between the observed and computed (13)C(α) and (13)C(β) chemical shifts. The method has been incorporated into the CheShift-2 protein validation Web server. To test the reliability of the provided set of (χ1/χ2) torsional angles, the side chains of all reported conformations of five NMR-determined protein models were refined by a simple routine, without using NOE-based distance restraints. The refinement of each of these five proteins leads to optimal agreement between the observed and computed (13)C(α) and (13)C(β) chemical shifts for ∼94% of the flaws, on average, without introducing a significantly large number of violations of the NOE-based distance restraints for a distance range ≤ 0.5 , in which the largest number of distance violations occurs. The results of this work suggest that use of the provided set of (χ1/χ2) torsional angles together with other observables, such as NOEs, should lead to a fast and accurate refinement of the side-chain conformations of protein models.
Collapse
Affiliation(s)
- Osvaldo A. Martin
- Instituto de Matemática Aplicada San Luis, Consejo Nacional de Investigaciones Científicas y Técnicas de Argentina, Departamento de Física, Universidad Nacional de San Luis, 5700 San Luis, Argentina
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853; and
| | | | - Alejandro A. Icazatti
- Instituto de Matemática Aplicada San Luis, Consejo Nacional de Investigaciones Científicas y Técnicas de Argentina, Departamento de Física, Universidad Nacional de San Luis, 5700 San Luis, Argentina
| | - Harold A. Scheraga
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853; and
| | - Jorge A. Vila
- Instituto de Matemática Aplicada San Luis, Consejo Nacional de Investigaciones Científicas y Técnicas de Argentina, Departamento de Física, Universidad Nacional de San Luis, 5700 San Luis, Argentina
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853; and
| |
Collapse
|
28
|
Tejero R, Snyder D, Mao B, Aramini JM, Montelione GT. PDBStat: a universal restraint converter and restraint analysis software package for protein NMR. JOURNAL OF BIOMOLECULAR NMR 2013; 56:337-51. [PMID: 23897031 PMCID: PMC3932191 DOI: 10.1007/s10858-013-9753-7] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Accepted: 06/11/2013] [Indexed: 05/20/2023]
Abstract
The heterogeneous array of software tools used in the process of protein NMR structure determination presents organizational challenges in the structure determination and validation processes, and creates a learning curve that limits the broader use of protein NMR in biology. These challenges, including accurate use of data in different data formats required by software carrying out similar tasks, continue to confound the efforts of novices and experts alike. These important issues need to be addressed robustly in order to standardize protein NMR structure determination and validation. PDBStat is a C/C++ computer program originally developed as a universal coordinate and protein NMR restraint converter. Its primary function is to provide a user-friendly tool for interconverting between protein coordinate and protein NMR restraint data formats. It also provides an integrated set of computational methods for protein NMR restraint analysis and structure quality assessment, relabeling of prochiral atoms with correct IUPAC names, as well as multiple methods for analysis of the consistency of atomic positions indicated by their convergence across a protein NMR ensemble. In this paper we provide a detailed description of the PDBStat software, and highlight some of its valuable computational capabilities. As an example, we demonstrate the use of the PDBStat restraint converter for restrained CS-Rosetta structure generation calculations, and compare the resulting protein NMR structure models with those generated from the same NMR restraint data using more traditional structure determination methods. These results demonstrate the value of a universal restraint converter in allowing the use of multiple structure generation methods with the same restraint data for consensus analysis of protein NMR structures and the underlying restraint data.
Collapse
Affiliation(s)
- Roberto Tejero
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
- Departamento de Quίmica Fίsica, Universidad de Valencia, Avenida Dr. Moliner 50 46100 Burjassot, Valencia, SPAIN
| | - David Snyder
- Department of Chemistry, William Paterson University, 300 Pompton Road Wayne, New Jersey 07470, USA
| | - Binchen Mao
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
| | - James M. Aramini
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
- To whom correspondence should be addressed: Prof. Gaetano T. Montelione CABM, Rutgers University 679 Hoes Lane Piscataway, NJ 08854-5638 Phone: 732-235-5321
| |
Collapse
|
29
|
Schieborr U, Sreeramulu S, Elshorst B, Maurer M, Saxena K, Stehle T, Kudlinzki D, Gande SL, Schwalbe H. MOTOR: model assisted software for NMR structure determination. Proteins 2013; 81:2007-22. [PMID: 23852655 DOI: 10.1002/prot.24361] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Revised: 06/26/2013] [Accepted: 06/28/2013] [Indexed: 11/06/2022]
Abstract
Eukaryotic proteins with important biological function can be partially unstructured, conformational flexible, or heterogenic. Crystallization trials often fail for such proteins. In NMR spectroscopy, parts of the polypeptide chain undergoing dynamics in unfavorable time regimes cannot be observed. De novo NMR structure determination is seriously hampered when missing signals lead to an incomplete chemical shift assignment resulting in an information content of the NOE data insufficient to determine the structure ab initio. We developed a new protein structure determination strategy for such cases based on a novel NOE assignment strategy utilizing a number of model structures but no explicit reference structure as it is used for bootstrapping like algorithms. The software distinguishes in detail between consistent and mutually exclusive pairs of possible NOE assignments on the basis of different precision levels of measured chemical shifts searching for a set of maximum number of consistent NOE assignments in agreement with 3D space. Validation of the method using the structure of the low molecular-weight-protein tyrosine phosphatase A (MptpA) showed robust results utilizing protein structures with 30-45% sequence identity and 70% of the chemical shift assignments. About 60% of the resonance assignments are sufficient to identify those structural models with highest conformational similarity to the real structure. The software was benchmarked by de novo solution structures of fibroblast growth factor 21 (FGF21) and the extracellular fibroblast growth factor receptor domain FGFR4 D2, which both failed in crystallization trials and in classical NMR structure determination.
Collapse
Affiliation(s)
- Ulrich Schieborr
- Johann Wolfgang Goethe-University Frankfurt, Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance, Max-von-Laue-Str. 7, 60438, Frankfurt am Main, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Doreleijers JF, Sousa da Silva AW, Krieger E, Nabuurs SB, Spronk CAEM, Stevens TJ, Vranken WF, Vriend G, Vuister GW. CING: an integrated residue-based structure validation program suite. JOURNAL OF BIOMOLECULAR NMR 2012; 54:267-83. [PMID: 22986687 PMCID: PMC3483101 DOI: 10.1007/s10858-012-9669-7] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2012] [Accepted: 08/31/2012] [Indexed: 05/03/2023]
Abstract
We present a suite of programs, named CING for Common Interface for NMR Structure Generation that provides for a residue-based, integrated validation of the structural NMR ensemble in conjunction with the experimental restraints and other input data. External validation programs and new internal validation routines compare the NMR-derived models with empirical data, measured chemical shifts, distance- and dihedral restraints and the results are visualized in a dynamic Web 2.0 report. A red-orange-green score is used for residues and restraints to direct the user to those critiques that warrant further investigation. Overall green scores below ~20 % accompanied by red scores over ~50 % are strongly indicative of poorly modelled structures. The publically accessible, secure iCing webserver ( https://nmr.le.ac.uk ) allows individual users to upload the NMR data and run a CING validation analysis.
Collapse
Affiliation(s)
- Jurgen F. Doreleijers
- CMBI, Radboud University Medical Centre, Geert Grooteplein 26-28, 6525 GA Nijmegen, The Netherlands
| | | | - Elmar Krieger
- YASARA Biosciences GmbH, Wagramer Strasse 25/3/45, 1220 Vienna, Austria
| | - Sander B. Nabuurs
- CMBI, Radboud University Medical Centre, Geert Grooteplein 26-28, 6525 GA Nijmegen, The Netherlands
| | | | - Tim J. Stevens
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA UK
| | - Wim F. Vranken
- Department of Structural Biology, VIB, Building E, 4th Floor, Pleinlaan 2, 1050 Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Building E, 4th Floor, Pleinlaan 2, 1050 Brussels, Belgium
| | - Gert Vriend
- CMBI, Radboud University Medical Centre, Geert Grooteplein 26-28, 6525 GA Nijmegen, The Netherlands
| | - Geerten W. Vuister
- Department of Biochemistry, University of Leicester, Henry Wellcome Building, Lancaster Road, Leicester, LE1 9HN UK
| |
Collapse
|
31
|
Wang X. Solution structure of decorin-binding protein A from Borrelia burgdorferi. Biochemistry 2012; 51:8353-62. [PMID: 22985470 DOI: 10.1021/bi3007093] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Decorin-binding protein A (DBPA) is an important lipoprotein from the bacterium Borrelia burgdorferi, the causative agent of Lyme disease. The absence of DBPA drastically reduces the pathogenic potential of the bacterium, and biochemical evidence indicates DBPA's interactions with the glycosaminoglycan (GAG) portion of decorin are crucial to its function. We have determined the solution structure of DBPA and studied DBPA's interactions with various forms of GAGs. DBPA is determined to be a helical bundle protein consisting of five helices held together by a strong hydrophobic core. The structure also possesses a basic patch formed by portions of two helices and two flexible linkers. Low-molecular mass heparin-induced chemical shift perturbations for residues in the region as well as increases in signal intensities of select residues in their presence confirm residues in the pocket are perturbed by heparin binding. Dermatan sulfate fragments, the dominant GAG type found on decorin, were shown to have lower affinity than heparin but are still capable of binding DBPA.
Collapse
Affiliation(s)
- Xu Wang
- Department of Chemistry and Biochemistry, Arizona State University, Tempe, AZ 85287, USA.
| |
Collapse
|
32
|
Vila JA, Sue SC, Fraser JS, Scheraga HA, Dyson HJ. CheShift-2 resolves a local inconsistency between two X-ray crystal structures. JOURNAL OF BIOMOLECULAR NMR 2012; 54:193-198. [PMID: 22945426 PMCID: PMC3471536 DOI: 10.1007/s10858-012-9663-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Accepted: 08/17/2012] [Indexed: 06/01/2023]
Abstract
Since chemical shifts provide important and relatively accessible information about protein structure in solution, a Web server, CheShift-2, was developed for structure interrogation, based on a quantum mechanics database of (13)C( α ) chemical shifts. We report the application of CheShift-2 to a local inconsistency between two X-ray crystal structures (PDB IDs 1IKN and 1NFI) of the complex between the p65/p50 heterodimer of NFκB and its inhibitor IκBα. The availability of NMR resonance assignments that included the region of the inconsistency provided an opportunity for independent validation of the CheShift-2 server. Application of the server showed that the (13)C( α ) chemical shifts measured for the Gly270-Pro281 sequence close to the C-terminus of IκBα were unequivocally consistent with the backbone structure modeled in the 1IKN structure, and were inconsistent with the 1NFI structure. Previous NOE measurements had demonstrated that the position of a tryptophan ring in the region immediately N-terminal in this region was not consistent with either structure. Subsequent recalculation of the local structure in this region, based on the electron density of the deposited structure factors for 1IKN, confirmed that the local backbone structure was best modeled by 1IKN, but that the rotamer of Trp258 is consistent with the 1NFI structure, including the presence of a hydrogen bond between the ring NεH of Trp258 and the backbone carbonyl group of Gln278. The consensus between all of these measures suggests that the CheShift-2 server operates well under circumstances in which backbone chemical shifts are available but where local plasticity may render X-ray structural data ambiguous.
Collapse
Affiliation(s)
- Jorge A Vila
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA
| | | | | | | | | |
Collapse
|
33
|
Berjanskii M, Zhou J, Liang Y, Lin G, Wishart DS. Resolution-by-proxy: a simple measure for assessing and comparing the overall quality of NMR protein structures. JOURNAL OF BIOMOLECULAR NMR 2012; 53:167-180. [PMID: 22678091 DOI: 10.1007/s10858-012-9637-2] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Accepted: 02/15/2012] [Indexed: 06/01/2023]
Abstract
In protein X-ray crystallography, resolution is often used as a good indicator of structural quality. Diffraction resolution of protein crystals correlates well with the number of X-ray observables that are used in structure generation and, therefore, with protein coordinate errors. In protein NMR, there is no parameter identical to X-ray resolution. Instead, resolution is often used as a synonym of NMR model quality. Resolution of NMR structures is often deduced from ensemble precision, torsion angle normality and number of distance restraints per residue. The lack of common techniques to assess the resolution of X-ray and NMR structures complicates the comparison of structures solved by these two methods. This problem is sometimes approached by calculating "equivalent resolution" from structure quality metrics. However, existing protocols do not offer a comprehensive assessment of protein structure as they calculate equivalent resolution from a relatively small number (<5) of protein parameters. Here, we report a development of a protocol that calculates equivalent resolution from 25 measurable protein features. This new method offers better performance (correlation coefficient of 0.92, mean absolute error of 0.28 Å) than existing predictors of equivalent resolution. Because the method uses coordinate data as a proxy for X-ray diffraction data, we call this measure "Resolution-by-Proxy" or ResProx. We demonstrate that ResProx can be used to identify under-restrained, poorly refined or inaccurate NMR structures, and can discover structural defects that the other equivalent resolution methods cannot detect. The ResProx web server is available at http://www.resprox.ca.
Collapse
Affiliation(s)
- Mark Berjanskii
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada
| | | | | | | | | |
Collapse
|
34
|
Twomey EC, Wei Y. High-definition NMR structure of PED/PEA-15 death effector domain reveals details of key polar side chain interactions. Biochem Biophys Res Commun 2012; 424:141-6. [PMID: 22732408 DOI: 10.1016/j.bbrc.2012.06.091] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2012] [Accepted: 06/16/2012] [Indexed: 11/17/2022]
Abstract
Death effector domain (DED) proteins constitute a subfamily of the large death domain superfamily that is primarily involved in apoptosis pathways. DED structures have characteristic side chain-side chain interactions among polar residues on the protein surface, forming a network of hydrogen bonds and salt bridges. The polar interaction network is functionally important in promoting protein-protein interactions by maintaining optimal side chain orientations. We have refined the solution DED structure of the PED/PEA-15 protein, a representative member of DED subfamily, using traditional NMR restraints with the addition of residual dipolar coupling (RDC) restraints from two independent alignment media, and employed the explicit solvent refinement protocol. The newly refined DED structure of PED/PEA-15 possesses higher structural quality as indicated by WHAT IF Z-scores, with most significant improvement in the backbone conformation normality quality factor. This higher quality DED structure of PED/PEA-15 leads to the identification of a number of key polar side chain interactions, which are not typically observed in NMR protein structures. The elucidation of polar side chain interactions is a key step towards the understanding of protein-protein interactions involving the death domain superfamily. The NMR structures with extensive details of protein structural features are thereby termed high-definition (HD) NMR structures.
Collapse
Affiliation(s)
- Edward C Twomey
- Department of Chemistry and Biochemistry, Seton Hall University, South Orange, NJ 07094-2646, USA
| | | |
Collapse
|
35
|
Rosato A, Aramini JM, Arrowsmith C, Bagaria A, Baker D, Cavalli A, Doreleijers JF, Eletsky A, Giachetti A, Guerry P, Gutmanas A, Güntert P, He Y, Herrmann T, Huang YJ, Jaravine V, Jonker HRA, Kennedy MA, Lange OF, Liu G, Malliavin TE, Mani R, Mao B, Montelione GT, Nilges M, Rossi P, van der Schot G, Schwalbe H, Szyperski TA, Vendruscolo M, Vernon R, Vranken WF, Vries SD, Vuister GW, Wu B, Yang Y, Bonvin AMJJ. Blind testing of routine, fully automated determination of protein structures from NMR data. Structure 2012; 20:227-36. [PMID: 22325772 DOI: 10.1016/j.str.2012.01.002] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2011] [Revised: 12/19/2011] [Accepted: 01/03/2012] [Indexed: 11/16/2022]
Abstract
The protocols currently used for protein structure determination by nuclear magnetic resonance (NMR) depend on the determination of a large number of upper distance limits for proton-proton pairs. Typically, this task is performed manually by an experienced researcher rather than automatically by using a specific computer program. To assess whether it is indeed possible to generate in a fully automated manner NMR structures adequate for deposition in the Protein Data Bank, we gathered 10 experimental data sets with unassigned nuclear Overhauser effect spectroscopy (NOESY) peak lists for various proteins of unknown structure, computed structures for each of them using different, fully automatic programs, and compared the results to each other and to the manually solved reference structures that were not available at the time the data were provided. This constitutes a stringent "blind" assessment similar to the CASP and CAPRI initiatives. This study demonstrates the feasibility of routine, fully automated protein structure determination by NMR.
Collapse
Affiliation(s)
- Antonio Rosato
- Magnetic Resonance Center, University of Florence, 50019 Sesto Fiorentino, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Sahakyan AB, Cavalli A, Vranken WF, Vendruscolo M. Protein Structure Validation Using Side-Chain Chemical Shifts. J Phys Chem B 2012; 116:4754-9. [DOI: 10.1021/jp2122054] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Aleksandr B. Sahakyan
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge
CB2 1EW, U.K
| | - Andrea Cavalli
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge
CB2 1EW, U.K
| | - Wim F. Vranken
- Department
of Structural Biology,
VIB and Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| | - Michele Vendruscolo
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge
CB2 1EW, U.K
| |
Collapse
|
37
|
Abstract
Summary: The differences between observed and predicted 13Cα chemical shifts can be used as a sensitive probe with which to detect possible local flaws in protein structures. For this reason, we previously introduced CheShift, a Web server for protein structure validation. Now, we present CheShift-2 in which a graphical user interface is implemented to render such local flaws easily visible. A series of applications to 15 ensembles of conformations illustrate the ability of CheShift-2 to locate the main structural flaws rapidly and accurately on a per-residue basis. Since accuracy plays a central role in CheShift predictions, the treatment of histidine (His) is investigated here by exploring which form of His should be used in CheShift-2. Availability:CheShift-2 is free of charge for academic use and can be accessed from www.cheshift.com Contact:has5@cornell.edu; jv84@cornell.edu Supplementary information:Supplementary data are available at the Bioinformatics online.
Collapse
Affiliation(s)
- Osvaldo A Martin
- Universidad Nacional de San Luis, IMASL-CONICET, Ejército de Los Andes, San Luis, Argentina
| | | | | |
Collapse
|
38
|
Bagaria A, Jaravine V, Huang YJ, Montelione GT, Güntert P. Protein structure validation by generalized linear model root-mean-square deviation prediction. Protein Sci 2012; 21:229-38. [PMID: 22113924 DOI: 10.1002/pro.2007] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Revised: 11/17/2011] [Accepted: 11/19/2011] [Indexed: 01/03/2023]
Abstract
Large-scale initiatives for obtaining spatial protein structures by experimental or computational means have accentuated the need for the critical assessment of protein structure determination and prediction methods. These include blind test projects such as the critical assessment of protein structure prediction (CASP) and the critical assessment of protein structure determination by nuclear magnetic resonance (CASD-NMR). An important aim is to establish structure validation criteria that can reliably assess the accuracy of a new protein structure. Various quality measures derived from the coordinates have been proposed. A universal structural quality assessment method should combine multiple individual scores in a meaningful way, which is challenging because of their different measurement units. Here, we present a method based on a generalized linear model (GLM) that combines diverse protein structure quality scores into a single quantity with intuitive meaning, namely the predicted coordinate root-mean-square deviation (RMSD) value between the present structure and the (unavailable) "true" structure (GLM-RMSD). For two sets of structural models from the CASD-NMR and CASP projects, this GLM-RMSD value was compared with the actual accuracy given by the RMSD value to the corresponding, experimentally determined reference structure from the Protein Data Bank (PDB). The correlation coefficients between actual (model vs. reference from PDB) and predicted (model vs. "true") heavy-atom RMSDs were 0.69 and 0.76, for the two datasets from CASD-NMR and CASP, respectively, which is considerably higher than those for the individual scores (-0.24 to 0.68). The GLM-RMSD can thus predict the accuracy of protein structures more reliably than individual coordinate-based quality scores.
Collapse
Affiliation(s)
- Anurag Bagaria
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, and Frankfurt Institute of Advanced Studies, Goethe University Frankfurt, Frankfurt am Main, Germany
| | | | | | | | | |
Collapse
|
39
|
Abstract
In solution or solid-state, determining the three-dimensional structure of biomolecules by Nuclear -Magnetic Resonance (NMR) normally requires the collection of distance information. The interpretation of the spectra containing this distance information is a critical step in an NMR structure determination. In this chapter, we present the Ambiguous Restraints for Iterative Assignment (ARIA) program for automated cross-peak assignment and determination of macromolecular structure from solution and solid-state NMR experiments. While the program was initially designed for the assignment of nuclear Overhauser effect (NOE) resonances, it has been extended to the interpretation of magic-angle spinning (MAS) solid-state NMR data. This chapter first details the concepts and procedures carried out by the program. Then, we describe both the general strategy for structure determination with ARIA 2.3 and practical aspects of the technique. ARIA 2.3 includes all recent developments. such as an extended integration of the Collaborative Computing Project for the NMR community (CCPN), the incorporation of the log-harmonic distance restraint potential and an automated treatment of symmetric oligomers.
Collapse
Affiliation(s)
- Benjamin Bardiaux
- NMR-supported Structural Biology, Leibnitz-Institutfür Molekulare Pharmakologie (FMP), Berlin, Germany.
| | | | | |
Collapse
|
40
|
Bertini I, Case DA, Ferella L, Giachetti A, Rosato A. A Grid-enabled web portal for NMR structure refinement with AMBER. ACTA ACUST UNITED AC 2011; 27:2384-90. [PMID: 21757462 DOI: 10.1093/bioinformatics/btr415] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION The typical workflow for NMR structure determination involves collecting thousands of conformational restraints, calculating a bundle of 20-40 conformers in agreement with them and refining the energetics of these conformers. The structure calculation step employs simulated annealing based on molecular dynamics (MD) simulations with very simplified force fields. The value of refining the calculated conformers using restrained MD (rMD) simulations with state-of-art force fields is documented. This refinement however presents various subtleties, from the proper formatting of conformational restraints to the definition of suitable protocols. RESULTS We describe a web interface to set up and run calculations with the AMBER package, which we called AMPS-NMR (AMBER-based Portal Server for NMR structures). The interface allows the refinement of NMR structures through rMD. Some predefined protocols are provided for this purpose, which can be personalized; it is also possible to create an entirely new protocol. AMPS-NMR can handle various restraint types. Standard rMD refinement in explicit water of the structures of three different proteins are shown as examples. AMPS-NMR additionally includes a workspace for the user to store different calculations. As an ancillary service, a web interface to AnteChamber is available, enabling the calculation of force field parameters for organic molecules such as ligands in protein-ligand adducts. AVAILABILITY AND IMPLEMENTATION AMPS-NMR is embedded within the NMR services of the WeNMR project and is available at http://py-enmr.cerm.unifi.it/access/index/amps-nmr; its use requires registration with a digital certificate. CONTACT ivanobertini@cerm.unifi.it SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ivano Bertini
- Magnetic Resonance Center (CERM), University of Florence, Via L. Sacconi 6, Italy.
| | | | | | | | | |
Collapse
|
41
|
Frank A, Onila I, Möller HM, Exner TE. Toward the quantum chemical calculation of nuclear magnetic resonance chemical shifts of proteins. Proteins 2011; 79:2189-202. [PMID: 21557322 DOI: 10.1002/prot.23041] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Revised: 03/01/2011] [Accepted: 03/13/2011] [Indexed: 11/09/2022]
Abstract
Despite the many protein structures solved successfully by nuclear magnetic resonance (NMR) spectroscopy, quality control of NMR structures is still by far not as well established and standardized as in crystallography. Therefore, there is still the need for new, independent, and unbiased evaluation tools to identify problematic parts and in the best case also to give guidelines that how to fix them. We present here, quantum chemical calculations of NMR chemical shifts for many proteins based on our fragment-based quantum chemical method: the adjustable density matrix assembler (ADMA). These results show that (13)C chemical shifts of reasonable accuracy can be obtained that can already provide a powerful measure for the structure validation. (1)H and even more (15)N chemical shifts deviate more strongly from experiment due to the insufficient treatment of solvent effects and conformational averaging.
Collapse
Affiliation(s)
- Andrea Frank
- Department of Chemistry and Zukunftskolleg, University of Konstanz, Konstanz D-78457, Germany
| | | | | | | |
Collapse
|
42
|
Breukels V, Konijnenberg A, Nabuurs SM, Doreleijers JF, Kovalevskaya NV, Vuister GW. Overview on the use of NMR to examine protein structure. CURRENT PROTOCOLS IN PROTEIN SCIENCE 2011; Chapter 17:Unit17.5. [PMID: 21488042 DOI: 10.1002/0471140864.ps1705s64] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Any protein structure determination process contains several steps, starting from obtaining a suitable sample, then moving on to acquiring data and spectral assignment, and lastly to the final steps of structure determination and validation. This unit describes all of these steps, starting with the basic physical principles behind NMR and some of the most commonly measured and observed phenomena such as chemical shift, scalar and residual coupling, and the nuclear Overhauser effect. Then, in somewhat more detail, the process of spectral assignment and structure elucidation is explained. Furthermore, the use of NMR to study protein-ligand interaction, protein dynamics, or protein folding is described.
Collapse
Affiliation(s)
- Vincent Breukels
- Protein Biophysics, Institute for Molecules and Materials, Radboud University Nijmegen, Nijmegen, The Netherlands
| | | | | | | | | | | |
Collapse
|
43
|
Abstract
Around half of all protein structures solved nowadays using solution-state nuclear magnetic resonance (NMR) spectroscopy have been because of automated data analysis. The pervasiveness of computational approaches in general hides, however, a more nuanced view in which the full variety and richness of the field appears. This review is structured around a comparison of methods associated with three NMR observables: classical nuclear Overhauser effect (NOE) constraint gathering in contrast with more recent chemical shift and residual dipole coupling (RDC) based protocols. In each case, the emphasis is placed on the latest research, covering mainly the past 5 years. By describing both general concepts and representative programs, the objective is to map out a field in which--through the very profusion of approaches--it is all too easy to lose one's bearings.
Collapse
|
44
|
Lee HW, Wylie G, Bansal S, Wang X, Barb AW, Macnaughtan MA, Ertekin A, Montelione GT, Prestegard JH. Three-dimensional structure of the weakly associated protein homodimer SeR13 using RDCs and paramagnetic surface mapping. Protein Sci 2011; 19:1673-85. [PMID: 20589905 DOI: 10.1002/pro.447] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The traditional NMR-based method for determining oligomeric protein structure usually involves distinguishing and assigning intra- and intersubunit NOEs. This task becomes challenging when determining symmetric homo-dimer structures because NOE cross-peaks from a given pair of protons occur at the same position whether intra- or intersubunit in origin. While there are isotope-filtering strategies for distinguishing intra from intermolecular NOE interactions in these cases, they are laborious and often prove ineffectual in cases of weak dimers, where observation of intermolecular NOEs is rare. Here, we present an efficient procedure for weak dimer structure determination based on residual dipolar couplings (RDCs), chemical shift changes upon dilution, and paramagnetic surface perturbations. This procedure is applied to the Northeast Structural Genomics Consortium protein target, SeR13, a negatively charged Staphylococcus epidermidis dimeric protein (K(d) 3.4 ± 1.4 mM) composed of 86 amino acids. A structure determination for the monomeric form using traditional NMR methods is presented, followed by a dimer structure determination using docking under orientation constraints from RDCs data, and scoring under residue pair potentials and shape-based predictions of RDCs. Validation using paramagnetic surface perturbation and chemical shift perturbation data acquired on sample dilution is also presented. The general utility of the dimer structure determination procedure and the possible relevance of SeR13 dimer formation are discussed.
Collapse
Affiliation(s)
- Hsiau-Wei Lee
- Complex Carbohydrate Research Center, Northeast Structural Genomics Consortium, The University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Wang X, Lee HW, Liu Y, Prestegard JH. Structural NMR of protein oligomers using hybrid methods. J Struct Biol 2011; 173:515-29. [PMID: 21074622 PMCID: PMC3040251 DOI: 10.1016/j.jsb.2010.11.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2010] [Revised: 10/03/2010] [Accepted: 11/04/2010] [Indexed: 11/19/2022]
Abstract
Solving structures of native oligomeric protein complexes using traditional high-resolution NMR techniques remains challenging. However, increased utilization of computational platforms, and integration of information from less traditional NMR techniques with data from other complementary biophysical methods, promises to extend the boundary of NMR-applicable targets. This article reviews several of the techniques capable of providing less traditional and complementary structural information. In particular, the use of orientational constraints coming from residual dipolar couplings and residual chemical shift anisotropy offsets are shown to simplify the construction of models for oligomeric complexes, especially in cases of weak homo-dimers. Combining this orientational information with interaction site information supplied by computation, chemical shift perturbation, paramagnetic surface perturbation, cross-saturation and mass spectrometry allows high resolution models of the complexes to be constructed with relative ease. Non-NMR techniques, such as mass spectrometry, EPR and small angle X-ray scattering, are also expected to play increasingly important roles by offering alternative methods of probing the overall shape of the complex. Computational platforms capable of integrating information from multiple sources in the modeling process are also discussed in the article. And finally a new, detailed example on the determination of a chemokine tetramer structure will be used to illustrate how a non-traditional approach to oligomeric structure determination works in practice.
Collapse
Affiliation(s)
- Xu Wang
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602. USA
| | - Hsiau-Wei Lee
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602. USA
| | - Yizhou Liu
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602. USA
| | - James H. Prestegard
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602. USA
| |
Collapse
|
46
|
Bordner AJ. Orientation-dependent backbone-only residue pair scoring functions for fixed backbone protein design. BMC Bioinformatics 2010; 11:192. [PMID: 20398384 PMCID: PMC2874805 DOI: 10.1186/1471-2105-11-192] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Accepted: 04/16/2010] [Indexed: 11/24/2022] Open
Abstract
Background Empirical scoring functions have proven useful in protein structure modeling. Most such scoring functions depend on protein side chain conformations. However, backbone-only scoring functions do not require computationally intensive structure optimization and so are well suited to protein design, which requires fast score evaluation. Furthermore, scoring functions that account for the distinctive relative position and orientation preferences of residue pairs are expected to be more accurate than those that depend only on the separation distance. Results Residue pair scoring functions for fixed backbone protein design were derived using only backbone geometry. Unlike previous studies that used spherical harmonics to fit 2D angular distributions, Gaussian Mixture Models were used to fit the full 3D (position only) and 6D (position and orientation) distributions of residue pairs. The performance of the 1D (residue separation only), 3D, and 6D scoring functions were compared by their ability to identify correct threading solutions for a non-redundant benchmark set of protein backbone structures. The threading accuracy was found to steadily increase with increasing dimension, with the 6D scoring function achieving the highest accuracy. Furthermore, the 3D and 6D scoring functions were shown to outperform side chain-dependent empirical potentials from three other studies. Next, two computational methods that take advantage of the speed and pairwise form of these new backbone-only scoring functions were investigated. The first is a procedure that exploits available sequence data by averaging scores over threading solutions for homologs. This was evaluated by applying it to the challenging problem of identifying interacting transmembrane alpha-helices and found to further improve prediction accuracy. The second is a protein design method for determining the optimal sequence for a backbone structure by applying Belief Propagation optimization using the 6D scoring functions. The sensitivity of this method to backbone structure perturbations was compared with that of fixed-backbone all-atom modeling by determining the similarities between optimal sequences for two different backbone structures within the same protein family. The results showed that the design method using 6D scoring functions was more robust to small variations in backbone structure than the all-atom design method. Conclusions Backbone-only residue pair scoring functions that account for all six relative degrees of freedom are the most accurate and including the scores of homologs further improves the accuracy in threading applications. The 6D scoring function outperformed several side chain-dependent potentials while avoiding time-consuming and error prone side chain structure prediction. These scoring functions are particularly useful as an initial filter in protein design problems before applying all-atom modeling.
Collapse
|
47
|
Vila JA, Scheraga HA. Assessing the accuracy of protein structures by quantum mechanical computations of 13C(alpha) chemical shifts. Acc Chem Res 2009; 42:1545-53. [PMID: 19572703 PMCID: PMC3396562 DOI: 10.1021/ar900068s] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Two major techniques have been used to determine the three-dimensional structures of proteins: X-ray diffraction and NMR spectroscopy. In particular, the validation of NMR-derived protein structures is one of the most challenging problems in NMR spectroscopy. Therefore, researchers have proposed a plethora of methods to determine the accuracy and reliability of protein structures. Despite these proposals, there is a growing need for more sophisticated, physics-based structure validation methods. This approach will enable us to (a) characterize the "quality" of the NMR-derived ensemble as a whole by a single parameter, (b) unambiguously identify flaws in the sequence at a residue level, and (c) provide precise information, such as sets of backbone and side-chain torsional angles, that we can use to detect local flaws. Rather than reviewing all of the existing validation methods, this Account describes the contributions of our research group toward a solution of the long-standing problem of both global and local structure validation of NMR-derived protein structures. We emphasize a recently introduced physics-based methodology that makes use of observed and computed (13)C(alpha) chemical shifts (at the density functional theory (DFT) level of theory) for an accurate validation of protein structures in solution and in crystals. By assessing the ability of computed (13)C(alpha) chemical shifts to reproduce observed (13)C(alpha) chemical shifts of a single structure or ensemble of structures in solution and in crystals, we accomplish a global validation by using the conformationally averaged root-mean-square deviation, ca-rmsd, as a scoring function. In addition, the method enables us to provide local validation by identifying a set of individual amino acid conformations for which the computed and observed (13)C(alpha) chemical shifts do not agree within a certain error range and may represent a nonreliable fold of the protein model. Although it is computationally intensive, our validation method has several advantages, which we illustrate through a series of applications. This method makes use of the (13)C(alpha) chemical shifts, not shielding, that are ubiquitous to proteins and can be computed precisely from the phi, psi, and chi torsional angles. There is no need for a priori knowledge of the oligomeric state of the protein, and no knowledge-based information or additional NMR data are required. The primary limitation at this point is the computational cost of such calculations. However, we anticipate that enhancements in the speed of calculating these chemical shifts coupled with the ever-increasing computational power should soon make this a standard method accessible to the general NMR community.
Collapse
Affiliation(s)
- Jorge A. Vila
- Baker Laboratory of Chemistry and Chemical Biology, Cornell
University, Ithaca NY, 14853-1301, USA
- Universidad Nacional de San Luis, IMASL-CONICET,
Ejército de Los Andes 950-5700 San Luis-Argentina
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell
University, Ithaca NY, 14853-1301, USA
| |
Collapse
|
48
|
Ikeya T, Takeda M, Yoshida H, Terauchi T, Jee JG, Kainosho M, Güntert P. Automated NMR structure determination of stereo-array isotope labeled ubiquitin from minimal sets of spectra using the SAIL-FLYA system. JOURNAL OF BIOMOLECULAR NMR 2009; 44:261-72. [PMID: 19597942 DOI: 10.1007/s10858-009-9339-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 06/24/2009] [Indexed: 05/05/2023]
Abstract
Stereo-array isotope labeling (SAIL) has been combined with the fully automated NMR structure determination algorithm FLYA to determine the three-dimensional structure of the protein ubiquitin from different sets of input NMR spectra. SAIL provides a complete stereo- and regio-specific pattern of stable isotopes that results in sharper resonance lines and reduced signal overlap, without information loss. Here we show that as a result of the superior quality of the SAIL NMR spectra, reliable, fully automated analyses of the NMR spectra and structure calculations are possible using fewer input spectra than with conventional uniformly 13C/15N-labeled proteins. FLYA calculations with SAIL ubiquitin, using a single three-dimensional "through-bond" spectrum (and 2D HSQC spectra) in addition to the 13C-edited and 15N-edited NOESY spectra for conformational restraints, yielded structures with an accuracy of 0.83-1.15 A for the backbone RMSD to the conventionally determined solution structure of SAIL ubiquitin. NMR structures can thus be determined almost exclusively from the NOESY spectra that yield the conformational restraints, without the need to record many spectra only for determining intermediate, auxiliary data of the chemical shift assignments. The FLYA calculations for this report resulted in 252 ubiquitin structure bundles, obtained with different input data but identical structure calculation and refinement methods. These structures cover the entire range from highly accurate structures to seriously, but not trivially, wrong structures, and thus constitute a valuable database for the substantiation of structure validation methods.
Collapse
Affiliation(s)
- Teppei Ikeya
- Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt am Main, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany
| | | | | | | | | | | | | |
Collapse
|
49
|
Sippl MJ. Fold space unlimited. Curr Opin Struct Biol 2009; 19:312-20. [DOI: 10.1016/j.sbi.2009.03.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2009] [Revised: 02/16/2009] [Accepted: 03/16/2009] [Indexed: 11/25/2022]
|
50
|
Bardiaux B, Bernard A, Rieping W, Habeck M, Malliavin TE, Nilges M. Influence of different assignment conditions on the determination of symmetric homodimeric structures with ARIA. Proteins 2009; 75:569-85. [DOI: 10.1002/prot.22268] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|