1
|
REDCRAFT: A computational platform using residual dipolar coupling NMR data for determining structures of perdeuterated proteins in solution. PLoS Comput Biol 2021; 17:e1008060. [PMID: 33524015 PMCID: PMC7877757 DOI: 10.1371/journal.pcbi.1008060] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 02/11/2021] [Accepted: 01/05/2021] [Indexed: 01/10/2023] Open
Abstract
Nuclear Magnetic Resonance (NMR) spectroscopy is one of the three primary experimental means of characterizing macromolecular structures, including protein structures. Structure determination by solution NMR spectroscopy has traditionally relied heavily on distance restraints derived from nuclear Overhauser effect (NOE) measurements. While structure determination of proteins from NOE-based restraints is well understood and broadly used, structure determination from Residual Dipolar Couplings (RDCs) is relatively less well developed. Here, we describe the new features of the protein structure modeling program REDCRAFT and focus on the new Adaptive Decimation (AD) feature. The AD plays a critical role in improving the robustness of REDCRAFT to missing or noisy data, while allowing structure determination of larger proteins from less data. In this report we demonstrate the successful application of REDCRAFT in structure determination of proteins ranging in size from 50 to 145 residues using experimentally collected data, and of larger proteins (145 to 573 residues) using simulated RDC data. In both cases, REDCRAFT uses only RDC data that can be collected from perdeuterated proteins. Finally, we compare the accuracy of structure determination from RDCs alone with traditional NOE-based methods for the structurally novel PF.2048.1 protein. The RDC-based structure of PF.2048.1 exhibited 1.0 Å BB-RMSD with respect to a high-quality NOE-based structure. Although optimal strategies would include using RDC data together with chemical shift, NOE, and other NMR data, these studies provide proof-of-principle for robust structure determination of largely-perdeuterated proteins from RDC data alone using REDCRAFT. Residual Dipolar Couplings have the potential to improve the accuracy and reduce the time needed to characterize protein structures. In addition, RDC data have been demonstrated to concurrently elucidate structure of proteins, provide assignment of resonances, and characterize the internal dynamics of proteins. Given all the advantages associated with the study of proteins from RDC data, based on the statistics provided by the Protein Databank (PDB), surprisingly only 124 proteins (out of nearly 150,000 proteins) have utilized RDCs as part of their structure determination. Even a smaller subset of these proteins (approximately 7) have utilized RDCs as the primary source of data for structure determination. One key factor in the use of RDCs is the challenging computational and analytical aspects of this source of data. In this report, we demonstrate the success of the REDCRAFT software package in structure determination of proteins using RDC data that can be collected from small and large proteins in a routine fashion. REDCRAFT accomplishes the challenging task of structure determination from RDCs by introducing a unique search and optimization technique that is both robust and computationally tractable. Structure determination from routinely collectable RDC data using REDCRAFT can complement existing methods to provide faster and more accurate studies of larger and more complex protein structures by NMR spectroscopy in solution state.
Collapse
|
2
|
Simin M, Irausquin S, Cole CA, Valafar H. Improvements to REDCRAFT: a software tool for simultaneous characterization of protein backbone structure and dynamics from residual dipolar couplings. JOURNAL OF BIOMOLECULAR NMR 2014; 60:241-264. [PMID: 25403759 DOI: 10.1007/s10858-014-9871-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Accepted: 10/30/2014] [Indexed: 06/04/2023]
Abstract
Within the past two decades, there has been an increase in the acquisition of residual dipolar couplings (RDC) for investigations of biomolecular structures. Their use however is still not as widely adopted as the traditional methods of structure determination by NMR, despite their potential for extending the limits in studies that examine both the structure and dynamics of biomolecules. This is in part due to the difficulties associated with the analysis of this information-rich data type. The software analysis tool REDCRAFT was previously introduced to address some of these challenges. Here we describe and evaluate a number of additional features that have been incorporated in order to extend its computational and analytical capabilities. REDCRAFT's more traditional enhancements integrate a modified steric collision term, as well as structural refinement in the rotamer space. Other, non-traditional improvements include: the filtering of viable structures based on relative order tensor estimates, decimation of the conformational space based on structural similarity, and forward/reverse folding of proteins. Utilizing REDCRAFT's newest features we demonstrate de-novo folding of proteins 1D3Z and 1P7E to within less than 1.6 Å of the corresponding X-ray structures, using as many as four RDCs per residue and as little as two RDCs per residue, in two alignment media. We also show the successful folding of a structure to less than 1.6 Å of the X-ray structure using {C(i-1)-N(i), N(i)-H(i), and C(i-1)-H(i)} RDCs in one alignment medium, and only {N(i)-H(i)} in the second alignment medium (a set of data which can be collected on deuterated samples). The program is available for download from our website at http://ifestos.cse.sc.edu .
Collapse
Affiliation(s)
- Mikhail Simin
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, 29208, USA
| | | | | | | |
Collapse
|
3
|
Miao X, Mukhopadhyay R, Valafar H. Estimation of relative order tensors, and reconstruction of vectors in space using unassigned RDC data and its application. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2008; 194:202-11. [PMID: 18692422 PMCID: PMC2669903 DOI: 10.1016/j.jmr.2008.07.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2008] [Revised: 06/27/2008] [Accepted: 07/02/2008] [Indexed: 05/11/2023]
Abstract
Advances in NMR instrumentation and pulse sequence design have resulted in easier acquisition of Residual Dipolar Coupling (RDC) data. However, computational and theoretical analysis of this type of data has continued to challenge the international community of investigators because of their complexity and rich information content. Contemporary use of RDC data has required a-priori assignment, which significantly increases the overall cost of structural analysis. This article introduces a novel algorithm that utilizes unassigned RDC data acquired from multiple alignment media (nD-RDC, n3) for simultaneous extraction of the relative order tensor matrices and reconstruction of the interacting vectors in space. Estimation of the relative order tensors and reconstruction of the interacting vectors can be invaluable in a number of endeavors. An example application has been presented where the reconstructed vectors have been used to quantify the fitness of a template protein structure to the unknown protein structure. This work has other important direct applications such as verification of the novelty of an unknown protein and validation of the accuracy of an available protein structure model in drug design. More importantly, the presented work has the potential to bridge the gap between experimental and computational methods of structure determination.
Collapse
Affiliation(s)
- Xijiang Miao
- Computer Science and Engineering, Swearingen Engineering Center, University of South Carolina, Columbia, SC 29308, USA
| | | | | |
Collapse
|
4
|
Rothweiler U, Czarna A, Weber L, Popowicz GM, Brongel K, Kowalska K, Orth M, Stemmann O, Holak TA. NMR Screening for Lead Compounds Using Tryptophan-Mutated Proteins. J Med Chem 2008; 51:5035-42. [DOI: 10.1021/jm8002813] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Ulli Rothweiler
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Anna Czarna
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Lutz Weber
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Grzegorz M. Popowicz
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Kinga Brongel
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Kaja Kowalska
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Michael Orth
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Olaf Stemmann
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| | - Tad A. Holak
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany, NexusPharma Inc., 253-13 Summit Square Center, Langhorne, Pennsylvania 19047-1098
| |
Collapse
|
5
|
Bryson M, Tian F, Prestegard JH, Valafar H. REDCRAFT: a tool for simultaneous characterization of protein backbone structure and motion from RDC data. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2008; 191:322-34. [PMID: 18258464 PMCID: PMC2728087 DOI: 10.1016/j.jmr.2008.01.007] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2007] [Revised: 12/23/2007] [Accepted: 01/04/2008] [Indexed: 05/09/2023]
Abstract
REDCRAFT, a new open source software tool that accommodates the analysis of RDC data for simultaneous structure and dynamics characterization of proteins is presented in this article. Simultaneous consideration of structure and motion is believed to be necessary for accurate representation of the solution-state of a protein. REDCRAFT is designed to primarily utilize RDC data from multiple alignment media in two stages. During Stage-I, a list of possible torsion angles joining any two neighboring peptide planes is ranked based on their fitness to experimental constraints; in Stage-II, a dipeptide fragment is extended by addition of one peptide plane at a time. The algorithm adopted by REDCRAFT is very efficient and can produce a structure for an 80 residue protein within two hours on a typical desktop computer. REDCRAFT exhibits robustness with respect to noise and missing data. REDCRAFT describes the overall alignment of the molecule in the form of an order tensor matrix and is capable of identifying peptide fragments with internal dynamics. Identification of the location of internal motion will permit a more accurate structural representation. Experimental data from two proteins as well as simulated data are presented to illustrate the capabilities of REDCRAFT in both structure determination and identification of the dynamical regions.
Collapse
Affiliation(s)
- Michael Bryson
- University of South Carolina, Department of Computer Science and Engineering, 315 Main Street, Columbia, SC 29208, USA
| | | | | | | |
Collapse
|
6
|
Picomole-level mapping of protein disulfides by mass spectrometry following partial reduction and alkylation. Anal Biochem 2008; 377:95-104. [PMID: 18358819 DOI: 10.1016/j.ab.2008.02.025] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2008] [Revised: 02/19/2008] [Accepted: 02/21/2008] [Indexed: 10/22/2022]
Abstract
We have deduced the disulfide bond linkage patterns, at very low protein levels (<0.5 nmol), in two cysteine-rich polypeptide domains using a new strategy involving partial reduction/alkylation of the protein, followed by peptide mapping and tanden mass spectrometry (MS/MS) sequencing on a nanoflow liquid chromatography-MS/MS system. The substrates for our work were the cysteine-rich ectodomain of human Fn14, a member of the tumor necrosis factor receptor family, and the IgV domain of murine TIM-1 (T-cell, Ig domain, and mucin domain-1). We have successfully determined the disulfide linkages for Fn14 and independently confirmed those of the IgV domain of TIM-1, whose crystal structure was published recently. The procedures that we describe here can be used to determine the disulfide structures for proteins with complex characteristics. They will also provide a means to obtain important information for structure-function studies and to ensure correct protein folding and batch-to-batch consistency in commercially produced recombinant proteins.
Collapse
|
7
|
|
8
|
Affiliation(s)
- David D Boehr
- Department of Molecular Biology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, USA
| | | | | |
Collapse
|
9
|
Krajewski M, Rothweiler U, D'Silva L, Majumdar S, Klein C, Holak TA. An NMR-based antagonist induced dissociation assay for targeting the ligand-protein and protein-protein interactions in competition binding experiments. J Med Chem 2007; 50:4382-7. [PMID: 17696513 DOI: 10.1021/jm070365v] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We present an NMR-based antagonist induced dissociation assay (AIDA) for validation of inhibitor action on protein-protein interactions. As opposed to many standard NMR methods, AIDA directly validates the inhibitor potency in an in vitro NMR competition binding experiment. AIDA requires a large protein fragment (larger than 30 kDa) to bind to a small reporter protein (less than 20 kDa). We show here that a small fragment of a protein fused to glutathione S-transferase (GST) can effectively substitute the large protein component. We successfully used a GST-tagged N-terminal 73-residue p53 domain for binding studies with the human MDM2 protein. Other interactions we studied involved complexes of CDK2, cyclin A, p27, and the retinoblastoma protein. All these proteins play a key role in the cell division cycle, are associated with tumorigenesis, and are thus the subject of anticancer therapy strategies.
Collapse
Affiliation(s)
- Marcin Krajewski
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany
| | | | | | | | | | | |
Collapse
|
10
|
Chen K, Tjandra N. Top-down approach in protein RDC data analysis: de novo estimation of the alignment tensor. JOURNAL OF BIOMOLECULAR NMR 2007; 38:303-13. [PMID: 17593526 DOI: 10.1007/s10858-007-9168-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2007] [Accepted: 05/18/2007] [Indexed: 05/16/2023]
Abstract
In solution NMR spectroscopy the residual dipolar coupling (RDC) is invaluable in improving both the precision and accuracy of NMR structures during their structural refinement. The RDC also provides a potential to determine protein structure de novo. These procedures are only effective when an accurate estimate of the alignment tensor has already been made. Here we present a top-down approach, starting from the secondary structure elements and finishing at the residue level, for RDC data analysis in order to obtain a better estimate of the alignment tensor. Using only the RDCs from N-H bonds of residues in alpha-helices and CA-CO bonds in beta-strands, we are able to determine the offset and the approximate amplitude of the RDC modulation-curve for each secondary structure element, which are subsequently used as targets for global minimization. The alignment order parameters and the orientation of the major principal axis of individual helix or strand, with respect to the alignment frame, can be determined in each of the eight quadrants of a sphere. The following minimization against RDC of all residues within the helix or strand segment can be carried out with fixed alignment order parameters to improve the accuracy of the orientation. For a helical protein Bax, the three components A(xx), A(yy) and A(zz), of the alignment order can be determined with this method in average to within 2.3% deviation from the values calculated with the available atomic coordinates. Similarly for beta-sheet protein Ubiquitin they agree in average to within 8.5%. The larger discrepancy in beta-strand parameters comes from both the diversity of the beta-sheet structure and the lower precision of CA-CO RDCs. This top-down approach is a robust method for alignment tensor estimation and also holds a promise for providing a protein topological fold using limited sets of RDCs.
Collapse
Affiliation(s)
- Kang Chen
- Laboratory of Molecular Biophysics, National Heart, Lung, and Blood Institute, National Institutes of Health, Building 50, Room 3503, Bethesda, MD 20892, USA
| | | |
Collapse
|
11
|
Schanda P, Forge V, Brutscher B. HET-SOFAST NMR for fast detection of structural compactness and heterogeneity along polypeptide chains. MAGNETIC RESONANCE IN CHEMISTRY : MRC 2006; 44 Spec No:S177-84. [PMID: 16823898 DOI: 10.1002/mrc.1825] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Structure elucidation of proteins by either NMR or X-ray crystallography often requires the screening of a large number of samples for promising protein constructs and optimum solution conditions. For large-scale screening of protein samples in solution, robust methods are needed that allow a rapid assessment of the folding of a polypeptide under diverse sample conditions. Here we present HET-SOFAST NMR, a highly sensitive new method for semi-quantitative characterization of the structural compactness and heterogeneity of polypeptide chains in solution. On the basis of one-dimensional 1H HET-SOFAST NMR data, obtained on well-folded, molten globular, partially- and completely unfolded proteins, we define empirical thresholds that can be used as quantitative benchmarks for protein compactness. For 15N-enriched protein samples, two-dimensional 1H-15N HET-SOFAST correlation spectra provide site-specific information about the structural heterogeneity along the polypeptide chain.
Collapse
Affiliation(s)
- Paul Schanda
- Institut de Biologie Structurale, Jean-Pierre Ebel C.N.R.S.-C.E.A.-UJF, 41, rue Jules Horowitz, 38027 Grenoble Cedex 1, France
| | | | | |
Collapse
|
12
|
Moreau VH, Valente AP, Almeida FC. Prediction of the amount of secondary structure of proteins using unassigned NMR spectra: a tool for target selection in structural proteomics. Genet Mol Biol 2006. [DOI: 10.1590/s1415-47572006000400030] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Affiliation(s)
- Vitor Hugo Moreau
- Faculdade de Tecnologia e Ciências, Brazil; Universidade Federal do Rio de Janeiro, Brazil
| | | | | |
Collapse
|
13
|
D'Silva L, Ozdowy P, Krajewski M, Rothweiler U, Singh M, Holak TA. Monitoring the effects of antagonists on protein-protein interactions with NMR spectroscopy. J Am Chem Soc 2005; 127:13220-6. [PMID: 16173750 DOI: 10.1021/ja052143x] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
We describe an NMR method that directly monitors the influence of ligands on protein-protein interactions. For a two-protein interaction complex, the size of one component should be small enough (less than ca. 15 kDa) to provide a good quality (15)N((13)C) HSQC spectrum after (15)N((13)C) labeling. The size of the second unlabeled component should be large enough so that the molecular weight of the preformed complex is larger than ca. 40 kDa. When the smaller protein binds to a larger one, broadening of NMR resonances results in the disappearance of most of its cross-peaks in the HSQC spectrum. Addition of an antagonist that can dissociate the complex would restore the HSQC spectrum of the smaller component. The method directly shows whether an antagonist releases proteins in their wild-type folded states or whether it induces their denaturation, partial unfolding, or precipitation. We illustrate the method by studying lead compounds that have recently been reported to block the MDM2-p53 interaction. Activation of p53 in tumor cells by inhibiting its interaction with MDM2 offers new strategy for cancer therapy.
Collapse
Affiliation(s)
- Loyola D'Silva
- Max Planck Institute for Biochemistry, D-82152 Martinsried, Germany
| | | | | | | | | | | |
Collapse
|
14
|
Peti W, Page R, Moy K, O'Neil-Johnson M, Wilson IA, Stevens RC, Wüthrich K. Towards miniaturization of a structural genomics pipeline using micro-expression and microcoil NMR. ACTA ACUST UNITED AC 2005; 6:259-67. [PMID: 16283429 DOI: 10.1007/s10969-005-9000-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2005] [Revised: 05/18/2005] [Accepted: 05/20/2005] [Indexed: 10/25/2022]
Abstract
In structural genomics centers, nuclear magnetic resonance (NMR) screening is in increasing use as a tool to identify folded proteins that are promising targets for three-dimensional structure determination by X-ray crystallography or NMR spectroscopy. The use of 1D 1H NMR spectra or 2D [1H,15N]-correlation spectroscopy (COSY) typically requires milligram quantities of unlabeled or isotope-labeled protein, respectively. Here, we outline ways towards miniaturization of a structural genomics pipeline with NMR screening for folded globular proteins, using a high-density micro-fermentation device and a microcoil NMR probe. The proteins are micro-expressed in unlabeled or isotope-labeled media, purified, and then subjected to 1D 1H NMR and/or 2D [1H,15N]-COSY screening. To demonstrate that the miniaturization is functioning effectively, we processed nine mouse homologue protein targets and compared the results with those from the "macro-scale" Joint Center of Structural Genomics (JCSG) high-throughput pipeline. The results from the two pipelines were comparable, illustrating that the data were not compromised in the miniaturized approach.
Collapse
Affiliation(s)
- Wolfgang Peti
- The Scripps Research Institute, Department of Molecular Biology and the Joint Center of Structural Genomics, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | | | | | | | |
Collapse
|
15
|
Vijayan V, Zweckstetter M. Simultaneous measurement of protein one-bond residual dipolar couplings without increased resonance overlap. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2005; 174:245-253. [PMID: 15862241 DOI: 10.1016/j.jmr.2005.02.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/27/2004] [Revised: 01/17/2005] [Accepted: 02/12/2005] [Indexed: 05/24/2023]
Abstract
A NMR strategy designed to measure simultaneously and without increased resonance overlap scalar and dipolar couplings (RDCs) in (13)C-, (15)N-labeled proteins is presented. Contrary to common schemes for simultaneous measurement of RDCs, a single reference experiment is used for the extraction of more than one type of coupling, thereby reducing the required measurement time. This is accomplished by a common reference spectrum followed by a series of interleaved experiments, in which a particular coupling dependent parameter is varied according to the quantitative J-correlation method or using accordion spectroscopy. To illustrate this idea, we have modified the 3D TROSY-HNCO and the 3D CBCA(CO)NH experiment allowing efficient measurement of one-bond (1)D(NH), (1)D(C'N), (1)D(CalphaHalpha), (1)D(CbetaHbeta), and (1)D(CalphaC') couplings in small to medium sized proteins. In addition, the experiments are expected to be useful for largely unfolded proteins, which show strong resonance overlap but have very favorable relaxation properties. Measurement of RDCs is demonstrated on uniformly (15)N-(13)C-labeled ubiquitin and on the sensory domain of the membraneous two-component fumarate sensor DcuS of Escherichia coli (17 kDa). DcuS was found to be unstable and to precipitate in one to two weeks. RDCs obtained from these experiments are in good agreement with the 1.8A X-ray structure of ubiquitin.
Collapse
Affiliation(s)
- Vinesh Vijayan
- Department for NMR-based Structural Biology, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077 Göttingen, Germany
| | | |
Collapse
|
16
|
Todd AE, Marsden RL, Thornton JM, Orengo CA. Progress of Structural Genomics Initiatives: An Analysis of Solved Target Structures. J Mol Biol 2005; 348:1235-60. [PMID: 15854658 DOI: 10.1016/j.jmb.2005.03.037] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2004] [Revised: 02/28/2005] [Accepted: 03/15/2005] [Indexed: 11/27/2022]
Abstract
The explosion in gene sequence data and technological breakthroughs in protein structure determination inspired the launch of structural genomics (SG) initiatives. An often stated goal of structural genomics is the high-throughput structural characterisation of all protein sequence families, with the long-term hope of significantly impacting on the life sciences, biotechnology and drug discovery. Here, we present a comprehensive analysis of solved SG targets to assess progress of these initiatives. Eleven consortia have contributed 316 non-redundant entries and 323 protein chains to the Protein Data Bank (PDB), and 459 and 393 domains to the CATH and SCOP structure classifications, respectively. The quality and size of these proteins are comparable to those solved in traditional structural biology and, despite huge scope for duplicated efforts, only 14% of targets have a close homologue (>/=30% sequence identity) solved by another consortium. Analysis of CATH and SCOP revealed the significant contribution that structural genomics is making to the coverage of superfamilies and folds. A total of 67% of SG domains in CATH are unique, lacking an already characterised close homologue in the PDB, whereas only 21% of non-SG domains are unique. For 29% of domains, structure determination revealed a remote evolutionary relationship not apparent from sequence, and 19% and 11% contributed new superfamilies and folds. The secondary structure class, fold and superfamily distributions of this dataset reflect those of the genomes. The domains fall into 172 different folds and 259 superfamilies in CATH but the distribution is highly skewed. The most populous of these are those that recur most frequently in the genomes. Whilst 11% of superfamilies are bacteria-specific, most are common to all three superkingdoms of life and together the 316 PDB entries have provided new and reliable homology models for 9287 non-redundant gene sequences in 206 completely sequenced genomes. From the perspective of this analysis, it appears that structural genomics is on track to be a success, and it is hoped that this work will inform future directions of the field.
Collapse
Affiliation(s)
- Annabel E Todd
- Department of Biochemistry and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK.
| | | | | | | |
Collapse
|
17
|
Peti W, Etezady-Esfarjani T, Herrmann T, Klock HE, Lesley SA, Wüthrich K. NMR for structural proteomics of Thermotoga maritima: screening and structure determination. ACTA ACUST UNITED AC 2005; 5:205-15. [PMID: 15263836 DOI: 10.1023/b:jsfg.0000029055.84242.9f] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
This paper describes the NMR screening of 141 small (<15 kDa) recombinant Thermotoga maritima proteins for globular folding. The experimental data shows that approximately 25% of the screened proteins are folded under our screening conditions, which makes this procedure an important step for selecting those proteins that are suitable for structure determination. A comparison of screening based either on 1D 1H NMR with unlabeled proteins or on 2D [1H,15N]-COSY with uniformly 15N-labeled proteins is presented, and a comprehensive analysis of the 1D 1H NMR screening data is described. As an illustration of the utility of these methods to structural proteomics, the NMR structure determination of TM1492 (ribosomal protein L29) is presented. This 66-residue protein consists of a N-terminal 3(10)-helix and two long alpha-helices connected by a tight turn centered about glycine 35, where conserved leucine and isoleucine residues in the two alpha-helices form a small hydrophobic core.
Collapse
Affiliation(s)
- Wolfgang Peti
- The Scripps Research Institute, Department of Molecular Biology and Joint Center of Structural Genomics, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA.
| | | | | | | | | | | |
Collapse
|
18
|
Page R, Peti W, Wilson IA, Stevens RC, Wüthrich K. NMR screening and crystal quality of bacterially expressed prokaryotic and eukaryotic proteins in a structural genomics pipeline. Proc Natl Acad Sci U S A 2005; 102:1901-5. [PMID: 15677718 PMCID: PMC548552 DOI: 10.1073/pnas.0408490102] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In the Joint Center for Structural Genomics, one-dimensional (1D) 1H NMR spectroscopy is routinely used to characterize the folded state of protein targets and, thus, serves to guide subsequent crystallization efforts and to identify proteins for NMR structure determination. Here, we describe 1D 1H NMR screening of a group of 79 mouse homologue proteins, which correlates the NMR data with the outcome of subsequent crystallization experiments and crystallographic structure determination. Based on the 1D 1H NMR spectra, the proteins are classified into four groups, "A" to "D." A-type proteins are candidates for structure determination by NMR or crystallography; "B"-type are earmarked for crystallography; "C" indicates folded globular proteins with broadened line shapes; and "D" are nonglobular, "unfolded" polypeptides. The results obtained from coarse- and fine-screen crystallization trials imply that only A- and B-type proteins should be used for extensive crystallization trials in the future, with C and D proteins subjected only to coarse-screen crystallization trials. Of the presently studied 79 soluble protein targets, 63% yielded A- or B-quality 1D 1H NMR spectra. Although similar yields of crystallization hits were obtained for all four groups, A to D, crystals from A- and B-type proteins diffracted on average to significantly higher resolution than crystals produced from C- or D-type proteins. Furthermore, the output of refined crystal structures from this test set of proteins was 4-fold higher for A- and B-type than for C- and D-type proteins.
Collapse
Affiliation(s)
- Rebecca Page
- Department of Molecular Biology and Joint Center for Structural Genomics, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | | | |
Collapse
|
19
|
Liu PH, Ding S. Application of Liquid Crystalline NMR Solvents to a Mixture of Ketones. J CHIN CHEM SOC-TAIP 2005. [DOI: 10.1002/jccs.200500001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
20
|
Hoffmann B, Eichmüller C, Steinhauser O, Konrat R. Rapid Assessment of Protein Structural Stability and Fold Validation via NMR. Methods Enzymol 2005; 394:142-75. [PMID: 15808220 DOI: 10.1016/s0076-6879(05)94006-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
Abstract
In structural proteomics, it is necessary to efficiently screen in a high-throughput manner for the presence of stable structures in proteins that can be subjected to subsequent structure determination by X-ray or NMR spectroscopy. Here we illustrate that the (1)H chemical distribution in a protein as detected by (1)H NMR spectroscopy can be used to probe protein structural stability (e.g., the presence of stable protein structures) of proteins in solution. Based on experimental data obtained on well-structured proteins and proteins that exist in a molten globule state or a partially folded alpha-helical state, a well-defined threshold exists that can be used as a quantitative benchmark for protein structural stability (e.g., foldedness) in solution. Additionally, in this chapter we describe a largely automated strategy for rapid fold validation and structure-based backbone signal assignment. Our methodology is based on a limited number of NMR experiments (e.g., HNCA and 3D NOESY-HSQC) and performs a Monte Carlo-type optimization. The novel feature of the method is the opportunity to screen for structural fragments (e.g., template scanning). The performance of this new validation tool is demonstrated with applications to a diverse set of proteins.
Collapse
Affiliation(s)
- Bernd Hoffmann
- Institute of Theoretical Chemistry and Molecular Structural Biology, University of Vienna, Austria
| | | | | | | |
Collapse
|
21
|
Yamniuk AP, Vogel HJ. Calmodulin's flexibility allows for promiscuity in its interactions with target proteins and peptides. Mol Biotechnol 2004; 27:33-57. [PMID: 15122046 DOI: 10.1385/mb:27:1:33] [Citation(s) in RCA: 245] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The small bilobal calcium regulatory protein calmodulin (CaM) activates numerous target enzymes in response to transient changes in intracellular calcium concentrations. Binding of calcium to the two helix-loop-helix calcium-binding motifs in each of the globular domains induces conformational changes that expose a methionine-rich hydrophobic patch on the surface of each domain of the protein, which it uses to bind to peptide sequences in its target enzymes. Although these CaM-binding domains typically have little sequence identity, the positions of several bulky hydrophobic residues are often conserved, allowing for classification of CaM-binding domains into recognition motifs, such as the 1-14 and 1-10 motifs. For calcium-independent binding of CaM, a third motif known as the IQ motif is also common. Many CaM-peptide complexes have globular conformations, where CaM's central linker connecting the two domains unwinds, allowing the protein to wrap around a single predominantly alpha-helical target peptide sequence. However, novel structures have recently been reported where the conformation of CaM is highly dissimilar to these globular complexes, in some instances with less than a full compliment of bound calcium ions, as well as novel stoichiometries. Furthermore, many divergent CaM isoforms from yeast and plant species have been discovered with unique calcium-binding and enzymatic activation characteristics compared to the single CaM isoform found in mammals.
Collapse
Affiliation(s)
- Aaron P Yamniuk
- Structural Biology Research Group, Department of Biological Sciences, University of Calgary, 2500 University Drive NW, Calgary, AB, Canada
| | | |
Collapse
|
22
|
Tate SI, Shimahara H, Utsunomiya-Tate N. Molecular-orientation analysis based on alignment-induced TROSY chemical shift changes. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2004; 171:284-292. [PMID: 15546755 DOI: 10.1016/j.jmr.2004.09.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2004] [Revised: 09/03/2004] [Indexed: 05/24/2023]
Abstract
We present a new NMR technique for determining the alignment tensor of a weakly aligned protein using only alignment-induced 15N transverse relaxation optimized spectroscopy (TROSY) chemical shift changes. Alignment-induced TROSY chemical shift changes reflect the combined contributions from two different anisotropic spin interactions including the residual dipolar couplings (RDCs) and the residual chemical shift anisotropy effects (RCSAs). We show here that these two residual anisotropic spin interactions' values, encoded in the TROSY chemical shift changes, can be used to determine a weakly aligned protein's alignment tensor. To prove the significance of this method, we show that our TROSY-based analysis gives the consistent alignment angles with those determined using RDCs for 15N-labeled ubiquitin (8.6 kDa) in an aligned medium, within an uncertainty range estimated by considering experimental and structural noises, being 5 degrees at most. Because our approach requires a pre-determined 15N CSA tensor value, we also estimated the uncertainties associated with the resultant alignment tensor values caused by variation in 15N CSA tensors. In spite of the significant variations in literature-reported 15N CSA tensors, they gave consistent orientation angles within an uncertainty range. These results ensure that our TROSY-based approach is a useful alternative to the RDC-based method to determine the alignment angles especially for large proteins in a weakly aligned state.
Collapse
Affiliation(s)
- Shin-ichi Tate
- Department of structural biology, Biomolecular Engineering Research Institute, 6-2-3 Furuedai, Suita, Osaka 565-0874, Japan.
| | | | | |
Collapse
|
23
|
Moseley HNB, Riaz N, Aramini JM, Szyperski T, Montelione GT. A generalized approach to automated NMR peak list editing: application to reduced dimensionality triple resonance spectra. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2004; 170:263-277. [PMID: 15388090 DOI: 10.1016/j.jmr.2004.06.015] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2004] [Revised: 06/02/2004] [Indexed: 05/24/2023]
Abstract
We present an algorithm and program called Pattern Picker that performs editing of raw peak lists derived from multidimensional NMR experiments with characteristic peak patterns. Pattern Picker detects groups of correlated peaks within peak lists from reduced dimensionality triple resonance (RD-TR) NMR spectra, with high fidelity and high yield. With typical quality RD-TR NMR data sets, Pattern Picker performs almost as well as human analysis, and is very robust in discriminating real peak sets from noise and other artifacts in unedited peak lists. The program uses a depth-first search algorithm with short-circuiting to efficiently explore a search tree representing every possible combination of peaks forming a group. The Pattern Picker program is particularly valuable for creating an automated peak picking/editing process. The Pattern Picker algorithm can be applied to a broad range of experiments with distinct peak patterns including RD, G-matrix Fourier transformation (GFT) NMR spectra, and experiments to measure scalar and residual dipolar coupling, thus promoting the use of experiments that are typically harder for a human to analyze. Since the complexity of peak patterns becomes a benefit rather than a drawback, Pattern Picker opens new opportunities in NMR experiment design.
Collapse
Affiliation(s)
- Hunter N B Moseley
- Department of Molecular Biology and Biochemistry, Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ 08854, USA
| | | | | | | | | |
Collapse
|
24
|
Lichtenecker R, Ludwiczek ML, Schmid W, Konrat R. Simplification of protein NOESY spectra using bioorganic precursor synthesis and NMR spectral editing. J Am Chem Soc 2004; 126:5348-9. [PMID: 15113192 DOI: 10.1021/ja049679n] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A novel method is proposed for the analysis of protein NOEs in solution. In this approach, chemically synthesized precursor compounds for the amino acids valine, leucine, and isoleucine are used for amino acid specific labeling of these hydrophobic residues. The methodology is based on a novel synthetic route to 12C,1H,2H Val, Leu, and Ile side chains selectively labeled with 13CH3 only at the terminal methyl group. In an otherwise 12C,1H labeled protein, discrimination between protons bound to 12C and 13C (or 15N) can be achieved using standard isotope-editing NMR pulse schemes. This strategy significantly relieves problems with spectral overlap through selective observation of interresidue methyl NOEs and will thus be a powerful extension of existing biomolecular NMR methodology.
Collapse
Affiliation(s)
- Roman Lichtenecker
- Institute of Organical Chemistry, Währingerstrasse 38, A-1090 Vienna, Austria
| | | | | | | |
Collapse
|
25
|
Jee J, Güntert P. Influence of the completeness of chemical shift assignments on NMR structures obtained with automated NOE assignment. ACTA ACUST UNITED AC 2004; 4:179-89. [PMID: 14649302 DOI: 10.1023/a:1026122726574] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Reliable automated NOE assignment and structure calculation on the basis of a largely complete, assigned input chemical shift list and a list of unassigned NOESY cross peaks has recently become feasible for routine NMR protein structure calculation and has been shown to yield results that are equivalent to those of the conventional, manual approach. However, these algorithms rely on the availability of a virtually complete list of the chemical shifts. This paper investigates the influence of incomplete chemical shift assignments on the reliability of NMR structures obtained with automated NOESY cross peak assignment. The program CYANA was used for combined automated NOESY assignment with the CANDID algorithm and structure calculations with torsion angle dynamics at various degrees of completeness of the chemical shift assignment which was simulated by random omission of entries in the experimental 1H chemical shift lists that had been used for the earlier, conventional structure determinations of two proteins. Sets of structure calculations were performed choosing the omitted chemical shifts randomly among all assigned hydrogen atoms, or among aromatic hydrogen atoms. For comparison, automated NOESY assignment and structure calculations were performed with the complete experimental chemical shift but under random omission of NOESY cross peaks. When heteronuclear-resolved three-dimensional NOESY spectra are available the current CANDID algorithm yields in the absence of up to about 10% of the experimental 1H chemical shifts reliable NOE assignments and three-dimensional structures that deviate by less than 2 A from the reference structure obtained using all experimental chemical shift assignments. In contrast, the algorithm can accommodate the omission of up to 50% of the cross peaks in heteronuclear- resolved NOESY spectra without producing structures with a RMSD of more than 2 A to the reference structure. When only homonuclear NOESY spectra are available, the algorithm is slightly more susceptible to missing data and can tolerate the absence of up to about 7% of the experimental 1H chemical shifts or of up to 30% of the NOESY peaks.
Collapse
Affiliation(s)
- JunGoo Jee
- Genomic Sciences Center, RIKEN Yokohama Institute, 1-7-22 Suehiro, Tsurumi, Yokohama 230-0045, Japan
| | | |
Collapse
|
26
|
Prestegard JH, Bougault CM, Kishore AI. Residual Dipolar Couplings in Structure Determination of Biomolecules. Chem Rev 2004; 104:3519-40. [PMID: 15303825 DOI: 10.1021/cr030419i] [Citation(s) in RCA: 340] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- J H Prestegard
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA 30602, USA.
| | | | | |
Collapse
|
27
|
Meiler J, Baker D. Rapid protein fold determination using unassigned NMR data. Proc Natl Acad Sci U S A 2003; 100:15404-9. [PMID: 14668443 PMCID: PMC307580 DOI: 10.1073/pnas.2434121100] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2003] [Indexed: 11/18/2022] Open
Abstract
Experimental structure determination by x-ray crystallography and NMR spectroscopy is slow and time-consuming compared with the rate at which new protein sequences are being identified. NMR spectroscopy has the advantage of rapidly providing the structurally relevant information in the form of unassigned chemical shifts (CSs), intensities of NOESY crosspeaks [nuclear Overhauser effects (NOEs)], and residual dipolar couplings (RDCs), but use of these data are limited by the time and effort needed to assign individual resonances to specific atoms. Here, we develop a method for generating low-resolution protein structures by using unassigned NMR data that relies on the de novo protein structure prediction algorithm, rosetta [Simons, K. T., Kooperberg, C., Huang, E. & Baker, D. (1997) J. Mol. Biol. 268, 209-225] and a Monte Carlo procedure that searches for the assignment of resonances to atoms that produces the best fit of the experimental NMR data to a candidate 3D structure. A large ensemble of models is generated from sequence information alone by using rosetta, an optimal assignment is identified for each model, and the models are then ranked based on their fit with the NMR data assuming the identified assignments. The method was tested on nine protein sequences between 56 and 140 amino acids and published CS, NOE, and RDC data. The procedure yielded models with rms deviations between 3 and 6 A, and, in four of the nine cases, the partial assignments obtained by the method could be used to refine the structures to high resolution (0.6-1.8 A) by repeated cycles of structure generation guided by the partial assignments, followed by reassignment using the newly generated models.
Collapse
Affiliation(s)
- Jens Meiler
- Department of Biochemistry and Howard Hughes Medical Institute, University of Washington, PO Box 357350, Seattle, WA 98195-7350, USA
| | | |
Collapse
|
28
|
Galvão-Botton LMP, Katsuyama AM, Guzzo CR, Almeida FCL, Farah CS, Valente AP. High-throughput screening of structural proteomics targets using NMR. FEBS Lett 2003; 552:207-13. [PMID: 14527688 DOI: 10.1016/s0014-5793(03)00926-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We applied a high-throughput strategy for the screening of targets for structural proteomics of Xanthomonas axonopodis pv citri. This strategy is based on the rapid (1)H-(15)N HSQC NMR analysis of bacterial lysates containing selectively (15)N-labelled heterologous proteins. Our analysis permitted us to classify the 19 soluble candidates in terms of 'foldedness', that is, the extent to which they present a well-folded solution structure, as reflected by the quality of their NMR spectra. This classification allowed us to define a priority list to be used as a guide to select protein candidates for further structural studies.
Collapse
Affiliation(s)
- Leonor M P Galvão-Botton
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, CEP 05508-900, SP, São Paulo, Brazil
| | | | | | | | | | | |
Collapse
|
29
|
Berthault P, Jeannerat D, Camerel F, Alvarez Salgado F, Boulard Y, Gabriel JCP, Desvaux H. Dilute liquid crystals used to enhance residual dipolar couplings may alter conformational equilibrium in oligosaccharides. Carbohydr Res 2003; 338:1771-85. [PMID: 12892944 DOI: 10.1016/s0008-6215(03)00243-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
The solution structures of a trisaccharide and a pentasaccharide containing the Lewis(x) motif were determined by two independent approaches using either dipolar cross-relaxation (NOE) or residual dipolar coupling (RDC) data. For the latter, one-bond 13C[bond](1)H RDC enhanced by two different mineral liquid crystals were used alone. Home-written programs were employed firstly for measuring accurately the coupling constants in the direct dimension of non-decoupled HSQC experiments, secondly for transforming each RDC data set into geometrical restraints. In this second program, the complete molecular structure was expressed in a unique frame where the alignment tensor is diagonal. Assuming that the pyranose rings are rigid, their relative orientation is defined by optimizing the glycosidic torsion angles. For the trisaccharide, a good agreement was observed between the results of both approaches (NOE and RDC). In contrast, for the pentasaccharide, strong discrepancies appeared, which seem to result from interactions between the pentasaccharide and the mesogens, affecting conformational equilibrium. This observation is of importance, as it reveals that using simultaneously NOE and RDC can be hazardous as the former represent 99% of the molecules free in solution, whereas the latter correspond to less than 1% of the structure bound to the mesogen.
Collapse
Affiliation(s)
- Patrick Berthault
- Laboratoire Commun de R.M.N., DSM/DRECAM/Service de Chimie Moléculaire, URA 331 CNRS, C.E.A./Saclay, F-91191 Gif sur Yvette, France
| | | | | | | | | | | | | |
Collapse
|
30
|
Kennedy MA, Montelione GT, Arrowsmith CH, Markley JL. Role for NMR in structural genomics. JOURNAL OF STRUCTURAL AND FUNCTIONAL GENOMICS 2003; 2:155-69. [PMID: 12836706 DOI: 10.1023/a:1021261026670] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The 2nd EMSL Workshop on Structural Genomics was held on 28th and 29th July 2000 at the Environmental Molecular Sciences Laboratory at the Department of Energy's Pacific Northwest National Laboratory in Richland, WA. The workshop focused on four topics: 1. The role for NMR in structural and functional genomics; 2. The technical challenges NMR faces for structural and functional genomics; 3. The potential need for a national NMR center for structural and functional genomics in the United States; and 4. Organization of the NMR community. This report summarizes the workshop proceedings and conclusions reached regarding the role of NMR in the emerging fields of structural and functional genomics.
Collapse
Affiliation(s)
- Michael A Kennedy
- Pacific Northwest National Laboratory, Environmental Molecular Sciences Laboratory, Richland, WA 99352, USA.
| | | | | | | |
Collapse
|
31
|
Verdier L, Sakhaii P, Zweckstetter M, Griesinger C. Measurement of long range H,C couplings in natural products in orienting media: a tool for structure elucidation of natural products. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2003; 163:353-359. [PMID: 12914852 DOI: 10.1016/s1090-7807(03)00063-6] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In this paper we show that water insoluble compounds dissolved in poly-gamma-benzyl-glutamate are amenable to the measurement of a number of homo- and heteronuclear dipolar couplings. The sensitivity and experimental precision of dipolar couplings are sufficient to obtain a good match with the structure. In order to achieve the necessary precision for H,C dipolar couplings between protons and carbons that are not directly bound a new method for the measurement of heteronuclear long range couplings is introduced that allows a one-parameter fit to a HSQC-based experiment as reference experiment. The methodology is applied to menthol (1R, 3S, 4R).
Collapse
Affiliation(s)
- Laurent Verdier
- Max Planck Institute for Biophysical Chemistry, Abt. 030, Am Fassberg 11, 37077 Göttingen, Germany
| | | | | | | |
Collapse
|
32
|
Zheng D, Huang YJ, Moseley HNB, Xiao R, Aramini J, Swapna GVT, Montelione GT. Automated protein fold determination using a minimal NMR constraint strategy. Protein Sci 2003; 12:1232-46. [PMID: 12761394 PMCID: PMC2323888 DOI: 10.1110/ps.0300203] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2003] [Revised: 03/11/2003] [Accepted: 03/12/2003] [Indexed: 10/27/2022]
Abstract
Determination of precise and accurate protein structures by NMR generally requires weeks or even months to acquire and interpret all the necessary NMR data. However, even medium-accuracy fold information can often provide key clues about protein evolution and biochemical function(s). In this article we describe a largely automatic strategy for rapid determination of medium-accuracy protein backbone structures. Our strategy derives from ideas originally introduced by other groups for determining medium-accuracy NMR structures of large proteins using deuterated, (13)C-, (15)N-enriched protein samples with selective protonation of side-chain methyl groups ((13)CH(3)). Data collection includes acquiring NMR spectra for automatically determining assignments of backbone and side-chain (15)N, H(N) resonances, and side-chain (13)CH(3) methyl resonances. These assignments are determined automatically by the program AutoAssign using backbone triple resonance NMR data, together with Spin System Type Assignment Constraints (STACs) derived from side-chain triple-resonance experiments. The program AutoStructure then derives conformational constraints using these chemical shifts, amide (1)H/(2)H exchange, nuclear Overhauser effect spectroscopy (NOESY), and residual dipolar coupling data. The total time required for collecting such NMR data can potentially be as short as a few days. Here we demonstrate an integrated set of NMR software which can process these NMR spectra, carry out resonance assignments, interpret NOESY data, and generate medium-accuracy structures within a few days. The feasibility of this combined data collection and analysis strategy starting from raw NMR time domain data was illustrated by automatic analysis of a medium accuracy structure of the Z domain of Staphylococcal protein A.
Collapse
Affiliation(s)
- Deyou Zheng
- Center for Advanced Biotechnology and Medicine (CABM), Northeast Structural Genomics Consortium, and Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, New Jersey 08854, USA
| | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
The role of NMR in structural genomics is outlined, with particular emphasis on using protein domains as targets. Strategies for domain expression, characterization, and labeling are presented.
Collapse
Affiliation(s)
- David Staunton
- Department of Biochemistry, University of Oxford, South Parks Road, UK
| | | | | |
Collapse
|
34
|
Valafar H, Prestegard JH, Valafar F. Datamining protein structure databanks for crystallization patterns of proteins. Ann N Y Acad Sci 2002; 980:13-22. [PMID: 12594078 DOI: 10.1111/j.1749-6632.2002.tb04885.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
A study of 345 protein structures selected among 1,500 structures determined by nuclear magnetic resonance (NMR) methods, revealed useful correlations between crystallization properties and several parameters for the studied proteins. NMR methods of structure determination do not require the growth of protein crystals, and hence allow comparison of properties of proteins that have or have not been the subject of crystallographic approaches. One- and two-dimensional statistical analyses of the data confirmed a hypothesized relation between the size of the molecule and its crystallization potential. Furthermore, two-dimensional Bayesian analysis revealed a significant relationship between relative ratio of different secondary structures and the likelihood of success for crystallization trials. The most immediate result is an apparent correlation of crystallization potential with protein size. Further analysis of the data revealed a relationship between the unstructured fraction of proteins and the success of its crystallization. Utilization of Bayesian analysis on the latter correlation resulted in a prediction performance of about 64%, whereas a two-dimensional Bayesian analysis succeeded with a performance of about 75%.
Collapse
Affiliation(s)
- Homayoun Valafar
- Southeast Collaboratory for Structural Genomics, Athens, Georgia 30602, USA.
| | | | | |
Collapse
|
35
|
Rehm T, Huber R, Holak TA. Application of NMR in structural proteomics: screening for proteins amenable to structural analysis. Structure 2002; 10:1613-8. [PMID: 12467568 DOI: 10.1016/s0969-2126(02)00894-8] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
In the time of structural proteomics when protein structures are targeted on a genome-wide scale, the detection of "well-behaved" proteins that would yield good quality NMR spectra or X-ray images is the key to high-throughput structure determination. Already, simple one-dimensional proton NMR spectra provide enough information for assessing the folding properties of proteins. Heteronuclear two-dimensional spectra are routinely used for screenings that reveal structural, as well as binding, properties of proteins. NMR can thus provide important information for optimizing conditions for protein constructs that are amenable to structural studies.
Collapse
Affiliation(s)
- Till Rehm
- Max-Planck-Institut für Biochemie, Am Klopferspitz 18a, D-82152 München, Germany
| | | | | |
Collapse
|
36
|
Ortiz AR, Strauss CEM, Olmea O. MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci 2002; 11:2606-21. [PMID: 12381844 PMCID: PMC2373724 DOI: 10.1110/ps.0215902] [Citation(s) in RCA: 320] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Advances in structural genomics and protein structure prediction require the design of automatic, fast, objective, and well benchmarked methods capable of comparing and assessing the similarity of low-resolution three-dimensional structures, via experimental or theoretical approaches. Here, a new method for sequence-independent structural alignment is presented that allows comparison of an experimental protein structure with an arbitrary low-resolution protein tertiary model. The heuristic algorithm is given and then used to show that it can describe random structural alignments of proteins with different folds with good accuracy by an extreme value distribution. From this observation, a structural similarity score between two proteins or two different conformations of the same protein is derived from the likelihood of obtaining a given structural alignment by chance. The performance of the derived score is then compared with well established, consensus manual-based scores and data sets. We found that the new approach correlates better than other tools with the gold standard provided by a human evaluator. Timings indicate that the algorithm is fast enough for routine use with large databases of protein models. Overall, our results indicate that the new program (MAMMOTH) will be a good tool for protein structure comparisons in structural genomics applications. MAMMOTH is available from our web site at http://physbio.mssm.edu/~ortizg/.
Collapse
Affiliation(s)
- Angel R Ortiz
- Department of Physiology and Biophysics, Mount Sinai School of Medicine, New York University, New York, New York 10029, USA.
| | | | | |
Collapse
|
37
|
Betz SF, Baxter SM, Fetrow JS. Function first: a powerful approach to post-genomic drug discovery. Drug Discov Today 2002; 7:865-71. [PMID: 12546953 DOI: 10.1016/s1359-6446(02)02398-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
In the post-genomic era, pharmaceutical researchers must evaluate vast numbers of protein sequences and formulate novel, intelligent strategies for identifying valid targets and discovering leads against them. The identification of small molecules that selectively target proteins or protein families will be aided by knowing the function and/or the structure of the target(s). By identifying protein function first, efficiencies are gained that allow subsequent focus of resources on particular protein families of interest. This article reviews current proteomic-scale approaches to identifying function as a way of accelerating lead discovery.
Collapse
Affiliation(s)
- Stephen F Betz
- GeneFormatics, 5830 Oberlin Drive, Suite 200, San Diego, CA 92121, USA
| | | | | |
Collapse
|
38
|
Barbieri R, Bertini I, Cavallaro G, Lee YM, Luchinat C, Rosato A. Paramagnetically induced residual dipolar couplings for solution structure determination of lanthanide binding proteins. J Am Chem Soc 2002; 124:5581-7. [PMID: 11996601 DOI: 10.1021/ja025528d] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Lanthanides may substitute calcium in calcium-binding proteins, such as, for instance, EF-hand proteins. Paramagnetic lanthanides are capable of orienting the protein in high magnetic fields to an extent similar to that obtained by using orienting devices, and each lanthanide orients according to its magnetic susceptibility tensor. Here, Ce(3+), Tb(3+), Dy(3+), Ho(3+), Er(3+), Tm(3+), Yb(3+) in the C-terminal site of calbindin D(9k) have been investigated. Such systems provide (1)H-(15)N residual dipolar couplings (rdc) which can be used for solution structure determinations. Within the frame of optimizing the use of residual dipolar couplings for efficient solution structure determination, it is proposed here to use a number of lanthanides (e.g., >2) to obtain the orientations of the internuclear vectors with respect to an arbitrary reference system. This is facilitated by the independent knowledge of the magnetic susceptibility anisotropy tensor of each metal, obtained from the analysis of the pseudocontact shifts. A further module of the program PARAMAGNETIC-DYANA, called RDCDYANA-ANGLES, is developed to efficiently incorporate such rdc-derived orientations, instead of the rdc themselves, as constraints in the solution structure calculation. This strategy is absolutely general and can be extended to any other pair of dipole-dipole coupled nuclei. The effect of mobility is also assessed. In principle, information on the mobility can be obtained with a number of lanthanide ions >5, or by combining a smaller number of lanthanide ions with a few orienting devices.
Collapse
Affiliation(s)
- Renato Barbieri
- Magnetic Resonance Center and Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Italy
| | | | | | | | | | | |
Collapse
|
39
|
Al-Hashimi HM, Gorin A, Majumdar A, Gosser Y, Patel DJ. Towards structural genomics of RNA: rapid NMR resonance assignment and simultaneous RNA tertiary structure determination using residual dipolar couplings. J Mol Biol 2002; 318:637-49. [PMID: 12054812 DOI: 10.1016/s0022-2836(02)00160-2] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
We report a new residual dipolar couplings (RDCs) based NMR procedure for rapidly determining RNA tertiary structure demonstrated on a uniformly (15)N/(13)C-labeled 27 nt variant of the trans-activation response element (TAR) RNA from HIV-I. In this procedure, the time-consuming nuclear Overhauser enhancement (NOE)-based sequential assignment step is replaced by a fully automated RDC-based assignment strategy. This approach involves examination of all allowed sequence-specific resonance assignment permutations for best-fit agreement between measured RDCs and coordinates for sub-structures in a target RNA. Using idealized A-form geometries to model Watson-Crick helices and coordinates from a previous X-ray structure to model a hairpin loop in TAR, the best-fit RDC assignment solutions are determined very rapidly (<five minutes of computational time) and are in complete agreement with corresponding NOE-based assignments. Orientational constraints derived from RDCs are used simultaneously to assemble sub-structures into an RNA tertiary conformation. Through enhanced speeds of application and reduced reliance on chemical shift dispersion, this RDC-based approach lays the foundation for rapidly determining RNA conformations in a structural genomics context, and may increase the size limit of RNAs that can be examined by NMR.
Collapse
Affiliation(s)
- Hashim M Al-Hashimi
- Cellular Biochemistry and Biophysics Program, Memorial Sloan-Kettering Cancer Center, New York, NY 10021, USA.
| | | | | | | | | |
Collapse
|
40
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2002. [PMCID: PMC2447231 DOI: 10.1002/cfg.116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
41
|
Montelione GT. Structural genomics: an approach to the protein folding problem. Proc Natl Acad Sci U S A 2001; 98:13488-9. [PMID: 11717420 PMCID: PMC61067 DOI: 10.1073/pnas.261549098] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- G T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers University, Piscataway, NJ 08854-5638, USA.
| |
Collapse
|
42
|
Abstract
The introduction of residual dipolar coupling methodology has increased the scope of structural biological problems that can be addressed by NMR spectroscopy. Conformational changes, the relative orientation of domains, and intermolecular complexes can now be characterized accurately and rapidly using NMR. The development of residual dipolar coupling methodology for the rapid recognition of homologous protein folds and for studies of submillisecond timescale dynamics has also seen considerable progress.
Collapse
Affiliation(s)
- J R Tolman
- Section de Chimie, BCH, Université de Lausanne, Switzerland.
| |
Collapse
|
43
|
Abstract
Partial alignment of biomolecules in solution has added a new dimension to structural investigation by high-resolution NMR methods. Applications to proteins, nucleic acids and carbohydrates now abound. Limitations initially associated with compatibility of biomolecules with the liquid-crystal media commonly used to achieve alignment have begun to disappear. This is, in part, a result of the introduction of a wide variety of new media. Future applications to biologically important problems such as the structural organization of multi-domain proteins and multi-protein assemblies look very promising.
Collapse
Affiliation(s)
- J H Prestegard
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, USA.
| | | |
Collapse
|
44
|
Abstract
Structural genomics projects aim to provide an experimental or computational three-dimensional model structure for all of the tractable macromolecules that are encoded by complete genomes. To this end, pilot centres worldwide are now exploring the feasibility of large-scale structure determination. Their experimental structures and computational models are expected to yield insight into the molecular function and mechanism of thousands of proteins. The pervasiveness of this information is likely to change the use of structure in molecular biology and biochemistry.
Collapse
Affiliation(s)
- S E Brenner
- Department of Plant and Microbial Biology, University of California, 461A Koshland Hall, Berkeley, California 94720-3102, USA.
| |
Collapse
|