1
|
Takaba K, Friedman AJ, Cavender CE, Behara PK, Pulido I, Henry MM, MacDermott-Opeskin H, Iacovella CR, Nagle AM, Payne AM, Shirts MR, Mobley DL, Chodera JD, Wang Y. Machine-learned molecular mechanics force fields from large-scale quantum chemical data. Chem Sci 2024; 15:12861-12878. [PMID: 39148808 PMCID: PMC11322960 DOI: 10.1039/d4sc00690a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 06/17/2024] [Indexed: 08/17/2024] Open
Abstract
The development of reliable and extensible molecular mechanics (MM) force fields-fast, empirical models characterizing the potential energy surface of molecular systems-is indispensable for biomolecular simulation and computer-aided drug design. Here, we introduce a generalized and extensible machine-learned MM force field, espaloma-0.3, and an end-to-end differentiable framework using graph neural networks to overcome the limitations of traditional rule-based methods. Trained in a single GPU-day to fit a large and diverse quantum chemical dataset of over 1.1 M energy and force calculations, espaloma-0.3 reproduces quantum chemical energetic properties of chemical domains highly relevant to drug discovery, including small molecules, peptides, and nucleic acids. Moreover, this force field maintains the quantum chemical energy-minimized geometries of small molecules and preserves the condensed phase properties of peptides and folded proteins, self-consistently parametrizing proteins and ligands to produce stable simulations leading to highly accurate predictions of binding free energies. This methodology demonstrates significant promise as a path forward for systematically building more accurate force fields that are easily extensible to new chemical domains of interest.
Collapse
Affiliation(s)
- Kenichiro Takaba
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
- Pharmaceuticals Research Center, Advanced Drug Discovery, Asahi Kasei Pharma Corporation Shizuoka 410-2321 Japan
| | - Anika J Friedman
- Department of Chemical and Biological Engineering, University of Colorado Boulder Boulder CO 80309 USA
| | - Chapin E Cavender
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego 9500 Gilman Drive La Jolla CA 92093 USA
| | - Pavan Kumar Behara
- Center for Neurotherapeutics, Department of Pathology and Laboratory Medicine, University of California Irvine CA 92697 USA
| | - Iván Pulido
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| | - Michael M Henry
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| | | | - Christopher R Iacovella
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| | - Arnav M Nagle
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
- Department of Bioengineering, University of California, Berkeley Berkeley CA 94720 USA
| | - Alexander Matthew Payne
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center New York 10065 USA
| | - Michael R Shirts
- Department of Chemical and Biological Engineering, University of Colorado Boulder Boulder CO 80309 USA
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California Irvine California 92697 USA
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| | - Yuanqing Wang
- Simons Center for Computational Physical Chemistry and Center for Data Science, New York University New York NY 10004 USA
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center New York NY 10065 USA
| |
Collapse
|
2
|
Chiliveri SC, Robertson AJ, Shen Y, Torchia DA, Bax A. Advances in NMR Spectroscopy of Weakly Aligned Biomolecular Systems. Chem Rev 2021; 122:9307-9330. [PMID: 34766756 DOI: 10.1021/acs.chemrev.1c00730] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The measurement and application of residual dipolar couplings (RDCs) in solution NMR studies of biological macromolecules has become well established over the past quarter of a century. Numerous methods for generating the requisite anisotropic orientational molecular distribution have been demonstrated, each with its specific strengths and weaknesses. In parallel, an enormous number of pulse schemes have been introduced to measure the many different types of RDCs, ranging from the most widely measured backbone amide 15N-1H RDCs, to 1H-1H RDCs and couplings between low-γ nuclei. Applications of RDCs range from structure validation and refinement to the determination of relative domain orientations, the measurement of backbone and domain motions, and de novo structure determination. Nevertheless, it appears that the power of the RDC methodology remains underutilized. This review aims to highlight the practical aspects of sample preparation and RDC measurement while describing some of the most straightforward applications that take advantage of the exceptionally precise information contained in such data. Some emphasis will be placed on more recent developments that enable the accurate measurement of RDCs in larger systems, which is key to the ongoing shift in focus of biological NMR spectroscopy from structure determination toward gaining improved understanding of how molecular flexibility drives protein function.
Collapse
Affiliation(s)
- Sai Chaitanya Chiliveri
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Angus J Robertson
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Yang Shen
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Dennis A Torchia
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| | - Ad Bax
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
| |
Collapse
|
3
|
Vögeli B, Olsson S, Güntert P, Riek R. The Exact NOE as an Alternative in Ensemble Structure Determination. Biophys J 2016; 110:113-26. [PMID: 26745415 DOI: 10.1016/j.bpj.2015.11.031] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Revised: 11/22/2015] [Accepted: 11/23/2015] [Indexed: 10/22/2022] Open
Abstract
The structure-function paradigm is increasingly replaced by the structure-dynamics-function paradigm. All protein activity is steered by the interplay between enthalpy and entropy. Conformational dynamics serves as a proxy of conformational entropy. Therefore, it is essential to study not only the average conformation but also the spatial sampling of a protein on all timescales. To this purpose, we have established a protocol for determining multiple-state ensembles of proteins based on exact nuclear Overhauser effects (eNOEs). We have recently extended our previously reported eNOE data set for the protein GB3 by a very large set of backbone and side-chain residual dipolar couplings and three-bond J couplings. Here, we demonstrate that at least four structural states are required to represent the complete data set by dissecting the contributions to the CYANA target function, which quantifies restraint violations in structure calculation. We present a four-state ensemble of GB3, which largely preserves the characteristics obtained from eNOEs only. Due to the abundance of the input data, the ensemble and χ(1) angles in particular are well suited for cross-validation of the input data and comparison to x-ray structures. Principal component analysis is used to automatically identify and validate relevant states of the ensembles. Overall, our findings suggest that eNOEs are a valuable alternative to traditional NMR probes in spatial elucidation of proteins.
Collapse
Affiliation(s)
- Beat Vögeli
- Laboratory of Physical Chemistry, Vladimir-Prelog-Weg 2, Swiss Federal Institute of Technology, ETH-Hönggerberg, Zürich, Switzerland.
| | - Simon Olsson
- Laboratory of Physical Chemistry, Vladimir-Prelog-Weg 2, Swiss Federal Institute of Technology, ETH-Hönggerberg, Zürich, Switzerland; Institute for Research in Biomedicine, Bellinzona, Switzerland
| | - Peter Güntert
- Laboratory of Physical Chemistry, Vladimir-Prelog-Weg 2, Swiss Federal Institute of Technology, ETH-Hönggerberg, Zürich, Switzerland; Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance and Frankfurt Institute for Advanced Studies, J.W. Goethe-Universität, Frankfurt am Main, Germany; Graduate School of Science, Tokyo Metropolitan University, Hachioji, Tokyo, Japan
| | - Roland Riek
- Laboratory of Physical Chemistry, Vladimir-Prelog-Weg 2, Swiss Federal Institute of Technology, ETH-Hönggerberg, Zürich, Switzerland
| |
Collapse
|
4
|
Pang YP. FF12MC: A revised AMBER forcefield and new protein simulation protocol. Proteins 2016; 84:1490-516. [PMID: 27348292 PMCID: PMC5129589 DOI: 10.1002/prot.25094] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Revised: 06/16/2016] [Accepted: 06/18/2016] [Indexed: 12/25/2022]
Abstract
Specialized to simulate proteins in molecular dynamics (MD) simulations with explicit solvation, FF12MC is a combination of a new protein simulation protocol employing uniformly reduced atomic masses by tenfold and a revised AMBER forcefield FF99 with (i) shortened CH bonds, (ii) removal of torsions involving a nonperipheral sp(3) atom, and (iii) reduced 1-4 interaction scaling factors of torsions ϕ and ψ. This article reports that in multiple, distinct, independent, unrestricted, unbiased, isobaric-isothermal, and classical MD simulations FF12MC can (i) simulate the experimentally observed flipping between left- and right-handed configurations for C14-C38 of BPTI in solution, (ii) autonomously fold chignolin, CLN025, and Trp-cage with folding times that agree with the experimental values, (iii) simulate subsequent unfolding and refolding of these miniproteins, and (iv) achieve a robust Z score of 1.33 for refining protein models TMR01, TMR04, and TMR07. By comparison, the latest general-purpose AMBER forcefield FF14SB locks the C14-C38 bond to the right-handed configuration in solution under the same protein simulation conditions. Statistical survival analysis shows that FF12MC folds chignolin and CLN025 in isobaric-isothermal MD simulations 2-4 times faster than FF14SB under the same protein simulation conditions. These results suggest that FF12MC may be used for protein simulations to study kinetics and thermodynamics of miniprotein folding as well as protein structure and dynamics. Proteins 2016; 84:1490-1516. © 2016 The Authors Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Yuan-Ping Pang
- Computer-Aided Molecular Design Laboratory, Mayo Clinic, Rochester, MN, 55905, USA.
| |
Collapse
|
5
|
Compiled data set of exact NOE distance limits, residual dipolar couplings and scalar couplings for the protein GB3. Data Brief 2015; 5:99-106. [PMID: 26504890 PMCID: PMC4576366 DOI: 10.1016/j.dib.2015.08.020] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Accepted: 08/25/2015] [Indexed: 01/26/2023] Open
Abstract
We compiled an NMR data set consisting of exact nuclear Overhauser enhancement (eNOE) distance limits, residual dipolar couplings (RDCs) and scalar (J) couplings for GB3, which forms one of the largest and most diverse data set for structural characterization of a protein to date. All data have small experimental errors, which are carefully estimated. We use the data in the research article Vogeli et al., 2015, Complementarity and congruence between exact NOEs and traditional NMR probes for spatial decoding of protein dynamics, J. Struct. Biol., 191, 3, 306–317, doi:10.1016/j.jsb.2015.07.008 [1] for cross-validation in multiple-state structural ensemble calculation. We advocate this set to be an ideal test case for molecular dynamics simulations and structure calculations.
Collapse
|
6
|
Vögeli B, Olsson S, Riek R, Güntert P. Complementarity and congruence between exact NOEs and traditional NMR probes for spatial decoding of protein dynamics. J Struct Biol 2015. [DOI: 10.1016/j.jsb.2015.07.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
7
|
Maier JA, Martinez C, Kasavajhala K, Wickstrom L, Hauser KE, Simmerling C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J Chem Theory Comput 2015; 11:3696-713. [PMID: 26574453 DOI: 10.1021/acs.jctc.5b00255] [Citation(s) in RCA: 6714] [Impact Index Per Article: 746.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Molecular mechanics is powerful for its speed in atomistic simulations, but an accurate force field is required. The Amber ff99SB force field improved protein secondary structure balance and dynamics from earlier force fields like ff99, but weaknesses in side chain rotamer and backbone secondary structure preferences have been identified. Here, we performed a complete refit of all amino acid side chain dihedral parameters, which had been carried over from ff94. The training set of conformations included multidimensional dihedral scans designed to improve transferability of the parameters. Improvement in all amino acids was obtained as compared to ff99SB. Parameters were also generated for alternate protonation states of ionizable side chains. Average errors in relative energies of pairs of conformations were under 1.0 kcal/mol as compared to QM, reduced 35% from ff99SB. We also took the opportunity to make empirical adjustments to the protein backbone dihedral parameters as compared to ff99SB. Multiple small adjustments of φ and ψ parameters were tested against NMR scalar coupling data and secondary structure content for short peptides. The best results were obtained from a physically motivated adjustment to the φ rotational profile that compensates for lack of ff99SB QM training data in the β-ppII transition region. Together, these backbone and side chain modifications (hereafter called ff14SB) not only better reproduced their benchmarks, but also improved secondary structure content in small peptides and reproduction of NMR χ1 scalar coupling measurements for proteins in solution. We also discuss the Amber ff12SB parameter set, a preliminary version of ff14SB that includes most of its improvements.
Collapse
Affiliation(s)
- James A Maier
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| | - Carmenza Martinez
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| | - Koushik Kasavajhala
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| | - Lauren Wickstrom
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| | - Kevin E Hauser
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| | - Carlos Simmerling
- Graduate Program in Biochemistry and Structural Biology, ‡Department of Chemistry, and §Laufer Center for Physical and Quantitative Biology, Stony Brook University , Stony Brook, New York 11794, United States
| |
Collapse
|
8
|
Orts J, Vögeli B, Riek R, Güntert P. Stereospecific assignments in proteins using exact NOEs. JOURNAL OF BIOMOLECULAR NMR 2013; 57:211-8. [PMID: 24136114 DOI: 10.1007/s10858-013-9780-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 09/04/2013] [Indexed: 05/27/2023]
Abstract
Recently developed methods to measure distances in proteins with high accuracy by "exact" nuclear Overhauser effects (eNOEs) make it possible to determine stereospecific assignments, which are particularly important to fully exploit the accuracy of the eNOE distance measurements. Stereospecific assignments are determined by comparing the eNOE-derived distances to protein structure bundles calculated without stereospecific assignments, or an independently determined crystal structure. The absolute and relative CYANA target function difference upon swapping the stereospecific assignment of a diastereotopic group yields the respective stereospecific assignment. We applied the method to the eNOE data set that has recently been obtained for the third immunoglobulin-binding domain of protein G (GB3). The 884 eNOEs provide relevant data for 47 of the total of 75 diastereotopic groups. Stereospecific assignments could be established for 45 diastereotopic groups (96 %) using the X-ray structure, or for 27 diastereotopic groups (57 %) using structures calculated with the eNOE data set without stereospecific assignments, all of which are in agreement with those determined previously. The latter case is relevant for structure determinations based on eNOEs. The accuracy of the eNOE distance measurements is crucial for making stereospecific assignments because applying the same method to the traditional NOE data set for GB3 with imprecise upper distance bounds yields only 13 correct stereospecific assignments using the X-ray structure or 2 correct stereospecific assignments using NMR structures calculated without stereospecific assignments.
Collapse
Affiliation(s)
- Julien Orts
- Laboratory of Physical Chemistry, Swiss Federal Institute of Technology, 8093, Zurich, Switzerland
| | | | | | | |
Collapse
|
9
|
Shi Y, Xia Z, Zhang J, Best R, Wu C, Ponder JW, Ren P. The Polarizable Atomic Multipole-based AMOEBA Force Field for Proteins. J Chem Theory Comput 2013; 9:4046-4063. [PMID: 24163642 DOI: 10.1021/ct4003702] [Citation(s) in RCA: 466] [Impact Index Per Article: 42.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Development of the AMOEBA (Atomic Multipole Optimized Energetics for Biomolecular Simulation) force field for proteins is presented. The current version (AMOEBA-2013) utilizes permanent electrostatic multipole moments through the quadrupole at each atom, and explicitly treats polarization effects in various chemical and physical environments. The atomic multipole electrostatic parameters for each amino acid residue type are derived from high-level gas phase quantum mechanical calculations via a consistent and extensible protocol. Molecular polarizability is modeled via a Thole-style damped interactive induction model based upon distributed atomic polarizabilities. Inter- and intramolecular polarization is treated in a consistent fashion via the Thole model. The intramolecular polarization model ensures transferability of electrostatic parameters among different conformations, as demonstrated by the agreement between QM and AMOEBA electrostatic potentials, and dipole moments of dipeptides. The backbone and side chain torsional parameters were determined by comparing to gas-phase QM (RI-TRIM MP2/CBS) conformational energies of dipeptides and to statistical distributions from the Protein Data Bank. Molecular dynamics simulations are reported for short peptides in explicit water to examine their conformational properties in solution. Overall the calculated conformational free energies and J-coupling constants are consistent with PDB statistics and experimental NMR results, respectively. In addition, the experimental crystal structures of a number of proteins are well maintained during molecular dynamics (MD) simulation. While further calculations are necessary to fully validate the force field, initial results suggest the AMOEBA polarizable multipole force field is able to describe the structure and energetics of peptides and proteins, in both gas-phase and solution environments.
Collapse
Affiliation(s)
- Yue Shi
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, TX 78712
| | | | | | | | | | | | | |
Collapse
|
10
|
Shen Y, Bax A. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks. JOURNAL OF BIOMOLECULAR NMR 2013; 56:227-41. [PMID: 23728592 PMCID: PMC3701756 DOI: 10.1007/s10858-013-9741-y] [Citation(s) in RCA: 825] [Impact Index Per Article: 75.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Accepted: 05/03/2013] [Indexed: 05/05/2023]
Abstract
A new program, TALOS-N, is introduced for predicting protein backbone torsion angles from NMR chemical shifts. The program relies far more extensively on the use of trained artificial neural networks than its predecessor, TALOS+. Validation on an independent set of proteins indicates that backbone torsion angles can be predicted for a larger, ≥90 % fraction of the residues, with an error rate smaller than ca 3.5 %, using an acceptance criterion that is nearly two-fold tighter than that used previously, and a root mean square difference between predicted and crystallographically observed (ϕ, ψ) torsion angles of ca 12º. TALOS-N also reports sidechain χ(1) rotameric states for about 50 % of the residues, and a consistency with reference structures of 89 %. The program includes a neural network trained to identify secondary structure from residue sequence and chemical shifts.
Collapse
Affiliation(s)
- Yang Shen
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Building 5, Room 126 NIH, Bethesda, MD 20892-0520, USA
| | | |
Collapse
|
11
|
Vögeli B, Güntert P, Riek R. Multiple-state ensemble structure determination from eNOE spectroscopy. Mol Phys 2013. [DOI: 10.1080/00268976.2012.728257] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
12
|
Feytens D, Chaume G, Chassaing G, Lavielle S, Brigaud T, Byun BJ, Kang YK, Miclet E. Local Control of the Cis–Trans Isomerization and Backbone Dihedral Angles in Peptides Using Trifluoromethylated Pseudoprolines. J Phys Chem B 2012; 116:4069-79. [DOI: 10.1021/jp300284u] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Debby Feytens
- Laboratoire des BioMolécules, UPMC Paris 06, 4, Place Jussieu, 75005 Paris, France
- Laboratoire des BioMolécules, Departement de Chimie, Ecole Normale Superieure, 24, rue Lhomond, 75005 Paris, France
- UMR 7203, FR 2569, 4, Place Jussieu, 75005 Paris, France
| | - Grégory Chaume
- Laboratoire SOSCO, Université de Cergy-Pontoise, EA 4505, 5 mail Gay Lussac, 95000 Cergy-Pontoise, France
| | - Gérard Chassaing
- Laboratoire des BioMolécules, UPMC Paris 06, 4, Place Jussieu, 75005 Paris, France
- Laboratoire des BioMolécules, Departement de Chimie, Ecole Normale Superieure, 24, rue Lhomond, 75005 Paris, France
- UMR 7203, FR 2569, 4, Place Jussieu, 75005 Paris, France
| | - Solange Lavielle
- Laboratoire des BioMolécules, UPMC Paris 06, 4, Place Jussieu, 75005 Paris, France
- Laboratoire des BioMolécules, Departement de Chimie, Ecole Normale Superieure, 24, rue Lhomond, 75005 Paris, France
- UMR 7203, FR 2569, 4, Place Jussieu, 75005 Paris, France
| | - Thierry Brigaud
- Laboratoire SOSCO, Université de Cergy-Pontoise, EA 4505, 5 mail Gay Lussac, 95000 Cergy-Pontoise, France
| | - Byung Jin Byun
- Department of Chemistry, Chungbuk National University, Cheongju, Chungbuk 361-763, Republic of Korea
| | - Young Kee Kang
- Department of Chemistry, Chungbuk National University, Cheongju, Chungbuk 361-763, Republic of Korea
| | - Emeric Miclet
- Laboratoire des BioMolécules, UPMC Paris 06, 4, Place Jussieu, 75005 Paris, France
- Laboratoire des BioMolécules, Departement de Chimie, Ecole Normale Superieure, 24, rue Lhomond, 75005 Paris, France
- UMR 7203, FR 2569, 4, Place Jussieu, 75005 Paris, France
| |
Collapse
|
13
|
Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, Shaw DE. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins 2010; 78:1950-8. [PMID: 20408171 PMCID: PMC2970904 DOI: 10.1002/prot.22711] [Citation(s) in RCA: 4198] [Impact Index Per Article: 299.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Recent advances in hardware and software have enabled increasingly long molecular dynamics (MD) simulations of biomolecules, exposing certain limitations in the accuracy of the force fields used for such simulations and spurring efforts to refine these force fields. Recent modifications to the Amber and CHARMM protein force fields, for example, have improved the backbone torsion potentials, remedying deficiencies in earlier versions. Here, we further advance simulation accuracy by improving the amino acid side-chain torsion potentials of the Amber ff99SB force field. First, we used simulations of model alpha-helical systems to identify the four residue types whose rotamer distribution differed the most from expectations based on Protein Data Bank statistics. Second, we optimized the side-chain torsion potentials of these residues to match new, high-level quantum-mechanical calculations. Finally, we used microsecond-timescale MD simulations in explicit solvent to validate the resulting force field against a large set of experimental NMR measurements that directly probe side-chain conformations. The new force field, which we have termed Amber ff99SB-ILDN, exhibits considerably better agreement with the NMR data. Proteins 2010. © 2010 Wiley-Liss, Inc.
Collapse
|
14
|
Plevin MJ, Bryce DL, Boisbouvier J. Direct detection of CH/π interactions in proteins. Nat Chem 2010; 2:466-71. [DOI: 10.1038/nchem.650] [Citation(s) in RCA: 225] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2009] [Accepted: 03/23/2010] [Indexed: 11/09/2022]
|
15
|
Vögeli B, Riek R. Side chain: backbone projections in aromatic and ASX residues from NMR cross-correlated relaxation. JOURNAL OF BIOMOLECULAR NMR 2010; 46:135-147. [PMID: 19904498 DOI: 10.1007/s10858-009-9387-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2009] [Accepted: 10/22/2009] [Indexed: 05/28/2023]
Abstract
The measurements of cross-correlated relaxation rates between H(N)-N and C(beta)-C(gamma) intraresidual and sequential dipolar interactions is demonstrated in ASN, ASP and aromatic residues. The experiment can be used for deuterated samples and no additional knowledge such as Karplus parametrizations is required for the analysis. The data constitutes a new type of information since no other method relates the C(beta)-C(gamma) bond to H(N)-N. Using this method the dominant populations of rotamer states of chi 1 can be readily cross checked provided that phi or psi are known. In addition, dynamics on all timescales can be probed. As opposed to standard dynamics analysis of isolated bonds, the presented observables depend on relative dynamics with an interesting prospect to analyze correlated fluctuations of the two torsion angles phi or psi with chi 1. Experimental rates are compared to single conformer and ensemble representations of GB3 and ubiquitin. In particular, it is found that the recently published ubiquitin ensemble 2k39 improves the agreement obtained for 1UBQ. In general, however, input data restricting ASX and aromatic side chains in structure calculation is sparse highlighting the need for new NMR observables.
Collapse
Affiliation(s)
- Beat Vögeli
- Laboratory of Physical Chemistry, Swiss Federal Institute of Technology, ETH-Hönggerberg, 8093, Zurich, Switzerland.
| | | |
Collapse
|
16
|
Ayala I, Sounier R, Usé N, Gans P, Boisbouvier J. An efficient protocol for the complete incorporation of methyl-protonated alanine in perdeuterated protein. JOURNAL OF BIOMOLECULAR NMR 2009; 43:111-9. [PMID: 19115043 DOI: 10.1007/s10858-008-9294-7] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Revised: 11/28/2008] [Accepted: 12/01/2008] [Indexed: 05/20/2023]
Abstract
A strategy for the introduction of ((1)H,(13)C-methyl)-alanine into perdeuterated proteins is described. Specific protonation of alanine methyl groups to a level of 95% can be achieved by overexpressing proteins in M9/D(2)O based bacterial growth medium supplemented with 800 mg/l of 2-[(2)H], 3-[(13)C] L: -alanine. However, though simple, this approach results in undesired, non-specific background labeling due to isotope scrambling via different amino acid metabolic pathways. Following a careful analysis of known metabolic pathways we found that co-addition of perdeuterated forms of alpha-ketoisovalerate-d(7), succinate-d(4) and L: -isoleucine-d(10) with labeled L: -alanine, reduces undesired background labeling to <1%. When combined with recently developed methyl TROSY experiments, this methyl-specific labeling protocol permits the acquisition of excellent quality correlation spectra of alanine methyl groups in high molecular weight proteins. Our cost effective strategy offers a significant enhancement in the level of incorporation of methyl-labeled alanine in overexpressed proteins over previously reported methods.
Collapse
Affiliation(s)
- Isabel Ayala
- Institut de Biologie Structurale Jean-Pierre Ebel, UMR 5075, Commissariat à l'Energie Atomique, Centre National de la Recherche Scientifique, Université Joseph-Fourier, 41 rue Jules Horowitz, 38027, Grenoble Cedex, France
| | | | | | | | | |
Collapse
|
17
|
Guichard G, Violette A, Chassaing G, Miclet E. Solution structure determination of oligoureas using methylene spin state selective NMR at 13C natural abundance. MAGNETIC RESONANCE IN CHEMISTRY : MRC 2008; 46:918-924. [PMID: 18720449 DOI: 10.1002/mrc.2271] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Ability of N,N'-linked oligoureas containing proteinogenic side chains to adopt a stable helix conformation in solution has been described recently. NMR as well as circular dichroism (CD) spectroscopies were employed to gain insight into their specific fold. It is herein proposed to extend the structural information available on these peptidomimetics by an advantageous use of a methylene spin state selective NMR experiment. Homodecoupling provided by the pulse scheme made it possible to readily measure conformation-dependent (3)J(HH) constants that are difficult if not impossible to obtain with standard NMR experiments. Adding those couplings to the NMR restraints improved the quality of the structure calculations significantly, as judged by a ca 30% decrease of the root mean square deviation (RMSD) obtained over an ensemble of 20 structures. Moreover, accurate determination of individual (1)J(CH) couplings within each methylene group revealed uniform values throughout the oligourea sequence, with (1)J(CH) systematically slightly larger for the pro-S hydrogen than for the pro-R. As shown in this study, the methylene spin state selective NMR experiment displays a good intrinsic sensitivity and could therefore provide valuable structural information at (13)C natural abundance for peptidomimetic molecules and foldamers bearing diastereotopic methylene protons.
Collapse
Affiliation(s)
- Gilles Guichard
- CNRS, Institut de Biologie Moléculaire et Cellulaire, Immunologie et Chimie Thérapeutique, 15, Rue Descartes, Strasbourg F-67084, France
| | | | | | | |
Collapse
|
18
|
Liquid-crystal NMR structure of HIV TAR RNA bound to its SELEX RNA aptamer reveals the origins of the high stability of the complex. Proc Natl Acad Sci U S A 2008; 105:9210-5. [PMID: 18607001 DOI: 10.1073/pnas.0712121105] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Transactivation-response element (TAR) is a stable stem-loop structure of HIV RNA, which plays a crucial role during the life cycle of the virus. The apical loop of TAR acts as a binding site for essential cellular cofactors required for the replication of HIV. High-affinity aptamers directed against the apical loop of TAR have been identified by the SELEX approach. The RNA aptamers with the highest affinity for TAR fold as hairpins and form kissing complexes with the targeted RNA through loop-loop interactions. The aptamers with the strongest binding properties all possess a GA base pair combination at the loop-closing position. Using liquid-crystal NMR methodology, we have obtained a structural model in solution of a TAR-aptamer kissing complex with an unprecedented accuracy. This high-resolution structure reveals that the GA base pair is unilaterally shifted toward the 5' strand and is stabilized by a network of intersugar hydrogen bonds. This specific conformation of the GA base pair allows for the formation of two supplementary stable base-pair interactions. By systematic permutations of the loop-closing base pair, we establish that the identified atomic interactions, which form the basis for the high stability of the complex, are maintained in several other kissing complexes. This study rationalizes the stabilizing role of the loop-closing GA base pairs in kissing complexes and may help the development or improvement of drugs against RNA loops of viruses or pathogens as well as the conception of biochemical tools targeting RNA hairpins involved in important biological functions.
Collapse
|
19
|
Getz M, Sun X, Casiano-Negroni A, Zhang Q, Al-Hashimi HM. NMR studies of RNA dynamics and structural plasticity using NMR residual dipolar couplings. Biopolymers 2007; 86:384-402. [PMID: 17594140 DOI: 10.1002/bip.20765] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
An increasing number of RNAs are being discovered that perform their functions by undergoing large changes in conformation in response to a variety of cellular signals, including recognition of proteins and small molecular targets, changes in temperature, and RNA synthesis itself. The measurement of NMR residual dipolar couplings (RDCs) in partially aligned systems is providing new insights into the structural plasticity of RNA through combined characterization of large-amplitude collective helix motions and local flexibility in noncanonical regions over a wide window of biologically relevant timescales (<milliseconds). Here, we review RDC methodology for studying RNA structural dynamics and survey what has been learnt thus far from application of these methods. Future methodological challenges are also identified.
Collapse
Affiliation(s)
- Melissa Getz
- Department of Chemistry, The University of Michigan, Ann Arbor, MI 48109, USA
| | | | | | | | | |
Collapse
|
20
|
Bailor MH, Musselman C, Hansen AL, Gulati K, Patel DJ, Al-Hashimi HM. Characterizing the relative orientation and dynamics of RNA A-form helices using NMR residual dipolar couplings. Nat Protoc 2007; 2:1536-46. [PMID: 17571061 PMCID: PMC4707013 DOI: 10.1038/nprot.2007.221] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
We present a protocol for determining the relative orientation and dynamics of A-form helices in 13C/15N isotopically enriched RNA samples using NMR residual dipolar couplings (RDCs). Non-terminal Watson-Crick base pairs in helical stems are experimentally identified using NOE and trans-hydrogen bond connectivity and modeled using the idealized A-form helix geometry. RDCs measured in the partially aligned RNA are used to compute order tensors describing average alignment of each helix relative to the applied magnetic field. The order tensors are translated into Euler angles defining the average relative orientation of helices and order parameters describing the amplitude and asymmetry of interhelix motions. The protocol does not require complete resonance assignments and therefore can be implemented rapidly to RNAs much larger than those for which complete high-resolution NMR structure determination is feasible. The protocol is particularly valuable for exploring adaptive changes in RNA conformation that occur in response to biologically relevant signals. Following resonance assignments, the procedure is expected to take no more than 2 weeks of acquisition and data analysis time.
Collapse
Affiliation(s)
- Maximillian H Bailor
- Department of Chemistry & Biophysics Research Division, The University of Michigan, Ann Arbor, Michigan 48109, USA
| | | | | | | | | | | |
Collapse
|
21
|
Tzvetkova P, Simova S, Luy B. P.E.HSQC: a simple experiment for simultaneous and sign-sensitive measurement of (1JCH+DCH) and (2JHH+DHH) couplings. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2007; 186:193-200. [PMID: 17347001 DOI: 10.1016/j.jmr.2007.02.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2006] [Revised: 02/09/2007] [Accepted: 02/12/2007] [Indexed: 05/14/2023]
Abstract
The angular information content of residual dipolar couplings between nuclei of fixed distance makes the accurate and sign-sensitive measurement of (1JCH+DCH) and (2JHH+DHH) couplings highly desirable. Experiments published so far are typically highly specialized for the effective measurement of a subset of couplings. The P.E.HSQC presented here, is an E.COSY based experiment which allows the simultaneous measurement of all heteronuclear and homonuclear couplings within CH, CH2, and CH3 groups in a single spectrum with the necessary precision and sign information. The simplicity of the approach and the absence of artefacts like phase distortions due to antiphase evolution make it ideally suited for coupling determination of organic molecules at natural abundance.
Collapse
Affiliation(s)
- Pavleta Tzvetkova
- Department Chemie, Organische Chemie II, Technische Universität München, Lichtenbergstrasse 4, D-85747 Garching, Germany
| | | | | |
Collapse
|
22
|
Balayssac S, Bertini I, Luchinat C, Parigi G, Piccioli M. 13C direct detected NMR increases the detectability of residual dipolar couplings. J Am Chem Soc 2007; 128:15042-3. [PMID: 17117827 DOI: 10.1021/ja0645436] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
13C direct detection is becoming an increasingly efficient approach to identify signals of residues that escape detection in 1H detected experiments. Pulse sequences have been developed to obtain 1H partially recoupled experiments for the measurement of the 1JHalphaCalpha and 1JHN couplings with the same resolution available in conventional 1H detected experiments. A consistent set of backbone rdc obtained without any 1H-based experiment has been obtained and shown to be effective for protein solution structure determination.
Collapse
Affiliation(s)
- Stéphane Balayssac
- Magnetic Resonance Center, University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Italy
| | | | | | | | | |
Collapse
|
23
|
Puttonen E, Tossavainen H, Permi P. Simultaneous determination of one- and two-bond scalar and residual dipolar couplings between 13C', 13Calpha and 15N spins in proteins. MAGNETIC RESONANCE IN CHEMISTRY : MRC 2006; 44 Spec No:S168-76. [PMID: 16823899 DOI: 10.1002/mrc.1836] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Four simple and sensitive HNCO-based methods for measurement of 1J(C'Calpha), 1J(NCalpha) and 2J(NCalpha) coupling constants in protein main chains are presented. Three of these experiments enable the simultaneous measurement of 1J(C'Calpha), 1J(NCalpha) and 2J(NCalpha) couplings. Exploitation of the E.COSY principle provides excellent dispersion of cross peaks in the resulting 3D spectra. The couplings can be retrieved with good accuracy from peak-to-peak separations. Karplus parameterizations are provided for 1J(NCalpha) and 2J(NCalpha), obtained from a nearly complete set of couplings of human ubiquitin. In addition, feasibility of the proposed methodology for measuring several residual dipolar couplings (RDCs) simultaneously is assessed.
Collapse
Affiliation(s)
- Eetu Puttonen
- NMR Laboratory, Program in Structural Biology and Biophysics, Institute of Biotechnology, P.O. Box 65, FI-00014, University of Helsinki, Helsinki, Finland
| | | | | |
Collapse
|
24
|
Nolis P, Espinosa JF, Parella T. Optimum spin-state selection for all multiplicities in the acquisition dimension of the HSQC experiment. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2006; 180:39-50. [PMID: 16448830 DOI: 10.1016/j.jmr.2006.01.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2005] [Revised: 12/29/2005] [Accepted: 01/04/2006] [Indexed: 05/06/2023]
Abstract
Most conventional heteronuclear spin-state-selective (S(3)) NMR experiments only work for a specific multiplicity, typically IS spin systems. Here, we introduce a general and efficient IPAP strategy to achieve S(3) editing simultaneously for all multiplicities in the acquisition dimension of the HSQC experiment. Complementary in-phase (HSQC-IP) and anti-phase (HSQC-AP) data are separately recorded with a simple phase exchange of two 90 degrees proton pulses involved in the mixing process of the F2-coupled sensitivity-improved HSQC pulse sequence. Additive and subtractive linear combination of these IP/AP data generates simplified F2-alpha/beta-spin-edited HSQC subspectra for all IS, I(2)S, and I(3)S spin systems and combines enhanced and optimized sensitivity with excellent tolerance to unwanted cross-talk contributions over a considerable range of coupling constants. Practical aspects such as pulse phase settings, transfer efficiency dependence, inter-pulse delay optimization, and percentage of cross-talk are theoretically analyzed and discussed as a function of each I(n)S multiplicity. Particular emphasis on the features associated to spin-editing in diastereotopic I(2)S spin systems and application to the measurement of long-range proton-carbon coupling constants are also provided.
Collapse
Affiliation(s)
- Pau Nolis
- Servei de Ressonància Magnètica Nuclear, Universitat Autònoma de Barcelona, E-08193 Bellaterra, Barcelona, Spain
| | | | | |
Collapse
|
25
|
Ball G, Meenan N, Bromek K, Smith BO, Bella J, Uhrín D. Measurement of one-bond 13Calpha-1Halpha residual dipolar coupling constants in proteins by selective manipulation of CalphaHalpha spins. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2006; 180:127-36. [PMID: 16495100 DOI: 10.1016/j.jmr.2006.01.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2005] [Revised: 01/24/2006] [Accepted: 01/31/2006] [Indexed: 05/06/2023]
Abstract
We have developed new 2D and 3D experiments for the measurement of C(alpha)-H(alpha) residual dipolar coupling constants in (13)C and (15)N labelled proteins. Two experiments, 2D (HNCO)-(J-CA)NH and 3D (HN)CO-(J-CA)NH, sample the C(alpha)-H(alpha) splitting by means of C(alpha) magnetization, while 2D (J-HACACO)NH and 3D J-HA(CACO)NH use H(alpha) magnetization to achieve a similar result. In the 2D experiments the coupling evolution is superimposed on the evolution of the (15)N chemical shifts and the IPAP principle is used to obtain (1)H-(15)N HSQC-like spectra from which the splitting is determined. The use of a third dimension in 3D experiments reduces spectral overlap to the point where use of an IPAP scheme may not be necessary. The length of the sampling interval in the J-dimension of these experiments is dictated solely by the relaxation properties of C(alpha) or H(alpha) nuclei. This was made possible by the use of C(alpha) selective pulses in combination with either a DPFGSE or modified BIRD pulses. Inclusion of these pulse sequence elements in the J-evolution periods removes unwanted spin-spin interactions. This allows prolonged sampling periods ( approximately 25 ms) yielding higher precision C(alpha)-H(alpha) splitting determination than is achievable with existing frequency based methods.
Collapse
Affiliation(s)
- Graeme Ball
- School of Chemistry, University of Edinburgh, West Mains Road, Edinburgh EH9 3JJ, UK
| | | | | | | | | | | |
Collapse
|
26
|
Bax A, Grishaev A. Weak alignment NMR: a hawk-eyed view of biomolecular structure. Curr Opin Struct Biol 2006; 15:563-70. [PMID: 16140525 DOI: 10.1016/j.sbi.2005.08.006] [Citation(s) in RCA: 196] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2005] [Revised: 08/04/2005] [Accepted: 08/24/2005] [Indexed: 10/25/2022]
Abstract
Imposing a very slight deviation from the isotropic random distribution of macromolecules in solution in an NMR sample tube permits the measurement of residual internuclear dipolar couplings (RDCs). Such interactions are very sensitive functions of the time-averaged orientation of the corresponding internuclear vectors and thereby offer highly precise structural information. In recent years, advances have been made both in the technology to measure RDCs and in the computational procedures that integrate this information in the structure determination process. The exceptional precision with which RDCs can be measured under weakly aligned conditions is also starting to reveal the mostly, but not universally, subtle effects of internal protein dynamics. Importantly, RDCs potentially can reveal motions taking place on a timescale slower than rotational diffusion and analysis is uniquely sensitive to the direction of motion, not just its amplitude.
Collapse
Affiliation(s)
- Ad Bax
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0520, USA.
| | | |
Collapse
|