1
|
Dingeldein L, Cossio P, Covino R. Simulation-based inference of single-molecule experiments. Curr Opin Struct Biol 2025; 91:102988. [PMID: 39921963 DOI: 10.1016/j.sbi.2025.102988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 12/30/2024] [Accepted: 01/03/2025] [Indexed: 02/10/2025]
Abstract
Single-molecule experiments are a unique tool to characterize the structural dynamics of biomolecules. However, reconstructing molecular details from noisy single-molecule data is challenging. Simulation-based inference (SBI) is a powerful framework for analyzing complex experimental data, integrating statistical inference, physics-based simulators, and machine learning. Recent advances in deep learning have accelerated the development of new SBI methods, enabling the application of Bayesian inference to an ever-increasing number of scientific problems. Here, we review the nascent application of SBI to the analysis of single-molecule experiments. We introduce parametric Bayesian inference and discuss its limitations. We then overview emerging deep learning-based SBI methods to perform Bayesian inference for complex models encoded in computer simulators. We illustrate the first applications of SBI to single-molecule force spectroscopy and cryo-electron microscopy experiments. SBI allows us to leverage powerful computer algorithms modeling complex biomolecular phenomena to connect scientific models and experiments in a principled way.
Collapse
Affiliation(s)
- Lars Dingeldein
- Institute of Physics, Goethe University Frankfurt, Frankfurt am Main, Germany; Frankfurt Institute for Advanced Studies, Frankfurt am Main, Germany
| | - Pilar Cossio
- Center for Computational Mathematics, Flatiron Institute, New York, United States; Center for Computational Biology, Flatiron Institute, New York, United States
| | - Roberto Covino
- Frankfurt Institute for Advanced Studies, Frankfurt am Main, Germany; Institute of Computer Science, Goethe University Frankfurt, Frankfurt am Main, Germany.
| |
Collapse
|
2
|
Borthakur K, Sisk TR, Panei FP, Bonomi M, Robustelli P. Determining accurate conformational ensembles of intrinsically disordered proteins at atomic resolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.04.616700. [PMID: 39651234 PMCID: PMC11623552 DOI: 10.1101/2024.10.04.616700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Determining accurate atomic resolution conformational ensembles of intrinsically disordered proteins (IDPs) is extremely challenging. Molecular dynamics (MD) simulations provide atomistic conformational ensembles of IDPs, but their accuracy is highly dependent on the quality of physical models, or force fields, used. Here, we demonstrate how to determine accurate atomic resolution conformational ensembles of IDPs by integrating all-atom MD simulations with experimental data from nuclear magnetic resonance (NMR) spectroscopy and small-angle x-ray scattering (SAXS) with a simple, robust and fully automated maximum entropy reweighting procedure. We demonstrate that when this approach is applied with sufficient experimental data, IDP ensembles derived from different MD force fields converge to highly similar conformational distributions. The maximum entropy reweighting procedure presented here facilitates the integration of MD simulations with extensive experimental datasets and enables the calculation of accurate, force-field independent atomic resolution conformational ensembles of IDPs.
Collapse
|
3
|
Tiemann JKS, Szczuka M, Bouarroudj L, Oussaren M, Garcia S, Howard RJ, Delemotte L, Lindahl E, Baaden M, Lindorff-Larsen K, Chavent M, Poulain P. MDverse, shedding light on the dark matter of molecular dynamics simulations. eLife 2024; 12:RP90061. [PMID: 39212001 PMCID: PMC11364437 DOI: 10.7554/elife.90061] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024] Open
Abstract
The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or easily searchable. Leveraging an original search strategy, we found and indexed about 250,000 files and 2000 datasets from Zenodo, Figshare and Open Science Framework. With a focus on files produced by the Gromacs MD software, we illustrate the potential offered by the mining of publicly available MD data. We identified systems with specific molecular composition and were able to characterize essential parameters of MD simulation such as temperature and simulation length, and could identify model resolution, such as all-atom and coarse-grain. Based on this analysis, we inferred metadata to propose a search engine prototype to explore the MD data. To continue in this direction, we call on the community to pursue the effort of sharing MD data, and to report and standardize metadata to reuse this valuable matter.
Collapse
Affiliation(s)
- Johanna KS Tiemann
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Magdalena Szczuka
- Institut de Pharmacologie et Biologie Structurale, CNRS, Université de ToulouseToulouseFrance
| | - Lisa Bouarroudj
- Université Paris Cité, CNRS, Institut Jacques MonodParisFrance
| | | | | | - Rebecca J Howard
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm UniversityStockholmSweden
| | - Lucie Delemotte
- Department of applied physics, Science for Life Laboratory, KTH Royal Institute of TechnologyStockholmSweden
| | - Erik Lindahl
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm UniversityStockholmSweden
- Department of applied physics, Science for Life Laboratory, KTH Royal Institute of TechnologyStockholmSweden
| | - Marc Baaden
- Laboratoire de Biochimie Théorique, CNRS, Université Paris CitéParisFrance
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Matthieu Chavent
- Institut de Pharmacologie et Biologie Structurale, CNRS, Université de ToulouseToulouseFrance
| | - Pierre Poulain
- Université Paris Cité, CNRS, Institut Jacques MonodParisFrance
| |
Collapse
|
4
|
Tiemann JKS, Szczuka M, Bouarroudj L, Oussaren M, Garcia S, Howard RJ, Delemotte L, Lindahl E, Baaden M, Lindorff-Larsen K, Chavent M, Poulain P. MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.02.538537. [PMID: 37205542 PMCID: PMC10187166 DOI: 10.1101/2023.05.02.538537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or easily searchable. Leveraging an original search strategy, we found and indexed about 250,000 files and 2,000 datasets from Zenodo, Figshare and Open Science Framework. With a focus on files produced by the Gromacs MD software, we illustrate the potential offered by the mining of publicly available MD data. We identified systems with specific molecular composition and were able to characterize essential parameters of MD simulation such as temperature and simulation length, and could identify model resolution, such as all-atom and coarse-grain. Based on this analysis, we inferred metadata to propose a search engine prototype to explore the MD data. To continue in this direction, we call on the community to pursue the effort of sharing MD data, and to report and standardize metadata to reuse this valuable matter.
Collapse
|
5
|
Champion C, Lehner M, Smith AA, Ferrage F, Bolik-Coulon N, Riniker S. Unraveling motion in proteins by combining NMR relaxometry and molecular dynamics simulations: A case study on ubiquitin. J Chem Phys 2024; 160:104105. [PMID: 38465679 DOI: 10.1063/5.0188416] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 02/20/2024] [Indexed: 03/12/2024] Open
Abstract
Nuclear magnetic resonance (NMR) relaxation experiments shine light onto the dynamics of molecular systems in the picosecond to millisecond timescales. As these methods cannot provide an atomically resolved view of the motion of atoms, functional groups, or domains giving rise to such signals, relaxation techniques have been combined with molecular dynamics (MD) simulations to obtain mechanistic descriptions and gain insights into the functional role of side chain or domain motion. In this work, we present a comparison of five computational methods that permit the joint analysis of MD simulations and NMR relaxation experiments. We discuss their relative strengths and areas of applicability and demonstrate how they may be utilized to interpret the dynamics in MD simulations with the small protein ubiquitin as a test system. We focus on the aliphatic side chains given the rigidity of the backbone of this protein. We find encouraging agreement between experiment, Markov state models built in the χ1/χ2 rotamer space of isoleucine residues, explicit rotamer jump models, and a decomposition of the motion using ROMANCE. These methods allow us to ascribe the dynamics to specific rotamer jumps. Simulations with eight different combinations of force field and water model highlight how the different metrics may be employed to pinpoint force field deficiencies. Furthermore, the presented comparison offers a perspective on the utility of NMR relaxation to serve as validation data for the prediction of kinetics by state-of-the-art biomolecular force fields.
Collapse
Affiliation(s)
- Candide Champion
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Marc Lehner
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Albert A Smith
- Institute for Medical Physics and Biophysics, Leipzig University, Härtelstrasse 16-18, 04107 Leipzig, Germany
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de Chimie, École normale supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Nicolas Bolik-Coulon
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
6
|
Shukla VK, Heller GT, Hansen DF. Biomolecular NMR spectroscopy in the era of artificial intelligence. Structure 2023; 31:1360-1374. [PMID: 37848030 DOI: 10.1016/j.str.2023.09.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 09/15/2023] [Accepted: 09/21/2023] [Indexed: 10/19/2023]
Abstract
Biomolecular nuclear magnetic resonance (NMR) spectroscopy and artificial intelligence (AI) have a burgeoning synergy. Deep learning-based structural predictors have forever changed structural biology, yet these tools currently face limitations in accurately characterizing protein dynamics, allostery, and conformational heterogeneity. We begin by highlighting the unique abilities of biomolecular NMR spectroscopy to complement AI-based structural predictions toward addressing these knowledge gaps. We then highlight the direct integration of deep learning approaches into biomolecular NMR methods. AI-based tools can dramatically improve the acquisition and analysis of NMR spectra, enhancing the accuracy and reliability of NMR measurements, thus streamlining experimental processes. Additionally, deep learning enables the development of novel types of NMR experiments that were previously unattainable, expanding the scope and potential of biomolecular NMR spectroscopy. Ultimately, a combination of AI and NMR promises to further revolutionize structural biology on several levels, advance our understanding of complex biomolecular systems, and accelerate drug discovery efforts.
Collapse
Affiliation(s)
- Vaibhav Kumar Shukla
- Department of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK
| | - Gabriella T Heller
- Department of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK.
| | - D Flemming Hansen
- Department of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK.
| |
Collapse
|
7
|
Bolik-Coulon N, Zachrdla M, Bouvignies G, Pelupessy P, Ferrage F. Comprehensive analysis of relaxation decays from high-resolution relaxometry. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2023; 355:107555. [PMID: 37797558 DOI: 10.1016/j.jmr.2023.107555] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 09/06/2023] [Accepted: 09/08/2023] [Indexed: 10/07/2023]
Abstract
Relaxometry consists in measuring relaxation rates over orders of magnitude of magnetic fields to probe motions of complex systems. High-resolution relaxometry (HRR) experiments can be performed on conventional high-field NMR magnets equipped with a sample shuttle. During the experiment, the sample shuttle transfers the sample between the high-field magnetic center and a chosen position in the stray field for relaxation during a variable delay, thus using the stray field as a variable field. As the relaxation delay occurs outside of the probe, HRR experiments cannot rely on the control of cross-relaxation pathways, which is standard in high-field relaxation pulse sequences. Thus, decay rates are not pure relaxation rates, which may impair a reliable description of the dynamics. Previously, we took into account cross-relaxation effects in the analysis of high-resolution relaxometry data by applying a correction factor to relaxometry decay rates in order to estimate relaxation rates. These correction factors were obtained from the iterative simulation of the relaxation decay while the sample lies outside of the probe and a preceding analysis of relaxation rates which relies on the approximation of a priori multi-exponential decays by mono-exponential functions. However, an analysis protocol matching directly experimental and simulated relaxometry decays should be more self consistent and more generally applicable as it can accommodate deviations from mono-exponential decays. Here, we introduce Matching INtensities for the Optimization of Timescales and Amplitudes of motions Under Relaxometry (MINOTAUR), a framework for the analysis of high-resolution relaxometry that takes as input the intensity decays at all fields. This approach uses the full relaxation matrix to calculate intensity decays, allowing complex relaxation pathways to be taken into account. Therefore, it eliminates the need for a correction of decay rates and for fitting multi-exponential decays with mono-exponential functions. The MINOTAUR software is designed as a flexible framework where relaxation matrices and spectral density functions corresponding to various models of motions can be defined on a case-by-case basis. The agreement with our previous analyses of protein side-chain dynamics from carbon-13 relaxation is excellent, while providing a more robust analysis tool. We expect MINOTAUR to become the tool of choice for the analysis of high-resolution relaxometry.
Collapse
Affiliation(s)
- Nicolas Bolik-Coulon
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France.
| | - Milan Zachrdla
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Guillaume Bouvignies
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Philippe Pelupessy
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France.
| |
Collapse
|
8
|
Kümmerer F, Orioli S, Lindorff-Larsen K. Fitting Force Field Parameters to NMR Relaxation Data. J Chem Theory Comput 2023. [PMID: 37276045 DOI: 10.1021/acs.jctc.3c00174] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We present an approach to optimize force field parameters using time-dependent data from NMR relaxation experiments. To do so, we scan parameters in the dihedral angle potential energy terms describing the rotation of the methyl groups in proteins and compare NMR relaxation rates calculated from molecular dynamics simulations with the modified force fields to deuterium relaxation measurements of T4 lysozyme. We find that a small modification of Cγ methyl groups improves the agreement with experiments both for the protein used to optimize the force field and when validating using simulations of CI2 and ubiquitin. We also show that these improvements enable a more effective a posteriori reweighting of the MD trajectories. The resulting force field thus enables more direct comparison between simulations and side-chain NMR relaxation data and makes it possible to construct ensembles that better represent the dynamics of proteins in solution.
Collapse
Affiliation(s)
- Felix Kümmerer
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Simone Orioli
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, DK-2100 Copenhagen Ø, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
9
|
Negroni M, Kurzbach D. Missing Pieces in Structure Puzzles: How Hyperpolarized NMR Spectroscopy Can Complement Structural Biology and Biochemistry. Chembiochem 2023; 24:e202200703. [PMID: 36624049 DOI: 10.1002/cbic.202200703] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/08/2023] [Accepted: 01/09/2023] [Indexed: 01/11/2023]
Abstract
Structure determination lies at the heart of many biochemical research programs. However, the "giants": X-ray diffraction, electron microscopy, molecular dynamics simulations, and nuclear magnetic resonance, among others, leave quite a few dark spots on the structural pictures drawn of proteins, nucleic acids, membranes, and other biomacromolecules. For example, structural models under physiological conditions or of short-lived intermediates often remain out of reach of the established experimental methods. This account frames the possibility of including hyperpolarized, that is, dramatically signal-enhanced NMR in existing workflows to fill these spots with detailed depictions. We highlight how integrating methods based on dissolution dynamic nuclear polarization can provide valuable complementary information about formerly inaccessible conformational spaces for many systems. A particular focus will be on hyperpolarized buffers to facilitate the NMR structure determination of challenging systems.
Collapse
Affiliation(s)
- Mattia Negroni
- Faculty of Chemistry, Institute of Biological Chemistry, University of Vienna, Währinger Str. 38, 1090, Vienna, Austria
| | - Dennis Kurzbach
- Faculty of Chemistry, Institute of Biological Chemistry, University of Vienna, Währinger Str. 38, 1090, Vienna, Austria
| |
Collapse
|
10
|
Söldner B, Grohe K, Neidig P, Auch J, Blach S, Klein A, Vasa SK, Schäfer LV, Linser R. Integrated Assessment of the Structure and Dynamics of Solid Proteins. J Phys Chem Lett 2023; 14:1725-1731. [PMID: 36757335 DOI: 10.1021/acs.jpclett.2c03398] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Understanding macromolecular function, interactions, and stability hinges on detailed assessment of conformational ensembles. For solid proteins, accurate elucidation of the spatial aspects of dynamics at physiological temperatures is limited by the qualitative character or low abundance of solid-state nuclear magnetic resonance internuclear distance information. Here, we demonstrate access to abundant proton-proton internuclear distances for integrated structural biology and chemistry with unprecedented accuracy. Apart from highest-resolution single-state structures, the exact distances enable molecular dynamics (MD) ensemble simulations orchestrated by a dense network of experimental interproton distance boundaries gathered in the context of their physical lattices. This direct embedding of experimental ensemble distances into MD will provide access to representative, atomic-level spatial details of conformational dynamics in supramolecular assemblies, crystalline and lipid-embedded proteins, and beyond.
Collapse
Affiliation(s)
- Benedikt Söldner
- Department of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Straße 4a, 44227 Dortmund, Germany
| | - Kristof Grohe
- Bruker BioSpin GmbH, Rudolf-Plank-Straße 23, 76275 Ettlingen, Germany
| | - Peter Neidig
- Bruker BioSpin GmbH, Rudolf-Plank-Straße 23, 76275 Ettlingen, Germany
| | - Jelena Auch
- Department of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Straße 4a, 44227 Dortmund, Germany
| | - Sebastian Blach
- Center for Theoretical Chemistry, Ruhr University Bochum, Universitätsstraße 150, 44801 Bochum, Germany
| | - Alexander Klein
- Department of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Straße 4a, 44227 Dortmund, Germany
| | - Suresh K Vasa
- Department of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Straße 4a, 44227 Dortmund, Germany
| | - Lars V Schäfer
- Center for Theoretical Chemistry, Ruhr University Bochum, Universitätsstraße 150, 44801 Bochum, Germany
| | - Rasmus Linser
- Department of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Straße 4a, 44227 Dortmund, Germany
| |
Collapse
|
11
|
How does it really move? Recent progress in the investigation of protein nanosecond dynamics by NMR and simulation. Curr Opin Struct Biol 2022; 77:102459. [PMID: 36148743 DOI: 10.1016/j.sbi.2022.102459] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2022] [Revised: 07/29/2022] [Accepted: 08/05/2022] [Indexed: 12/14/2022]
Abstract
Nuclear magnetic resonance (NMR) spin relaxation experiments currently probe molecular motions on timescales from picoseconds to nanoseconds. The detailed interpretation of these motions in atomic detail benefits from complementarity with the results from molecular dynamics (MD) simulations. In this mini-review, we describe the recent developments in experimental techniques to study the backbone dynamics from 15N relaxation and side-chain dynamics from 13C relaxation, discuss the different analysis approaches from model-free to dynamics detectors, and highlight the many ways that NMR relaxation experiments and MD simulations can be used together to improve the interpretation and gain insights into protein dynamics.
Collapse
|
12
|
Bolik-Coulon N, Languin-Cattoën O, Carnevale D, Zachrdla M, Laage D, Sterpone F, Stirnemann G, Ferrage F. Explicit Models of Motion to Understand Protein Side-Chain Dynamics. PHYSICAL REVIEW LETTERS 2022; 129:203001. [PMID: 36462011 DOI: 10.1103/physrevlett.129.203001] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 09/04/2022] [Indexed: 06/17/2023]
Abstract
Nuclear magnetic relaxation is widely used to probe protein dynamics. For decades, most analyses of relaxation in proteins have relied successfully on the model-free approach, forgoing mechanistic descriptions of motion. Model-free types of correlation functions cannot describe a large carbon-13 relaxation dataset in protein side chains. Here, we use molecular dynamics simulations to design explicit models of motion and solve Fokker-Planck diffusion equations. These models of motion provide better agreement with relaxation data, mechanistic insight, and a direct link to configuration entropy.
Collapse
Affiliation(s)
- Nicolas Bolik-Coulon
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Olivier Languin-Cattoën
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologique Physico-Chimique, Université Paris Cité, PSL University, 13 rue Pierre et Marie Curie, 75005 Paris, France
| | - Diego Carnevale
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Milan Zachrdla
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologique Physico-Chimique, Université Paris Cité, PSL University, 13 rue Pierre et Marie Curie, 75005 Paris, France
| | - Damien Laage
- PASTEUR, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| | - Fabio Sterpone
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologique Physico-Chimique, Université Paris Cité, PSL University, 13 rue Pierre et Marie Curie, 75005 Paris, France
| | - Guillaume Stirnemann
- CNRS Laboratoire de Biochimie Théorique, Institut de Biologique Physico-Chimique, Université Paris Cité, PSL University, 13 rue Pierre et Marie Curie, 75005 Paris, France
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de chimie, École Normale Supérieure, PSL University, Sorbonne Université, CNRS, 24 rue Lhomond, 75005 Paris, France
| |
Collapse
|
13
|
Chowdhury UD, Bhargava BL. Understanding the conformational changes in the influenza B M2 ion channel at various protonation states. Biophys Chem 2022; 289:106859. [PMID: 35905599 DOI: 10.1016/j.bpc.2022.106859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 07/06/2022] [Accepted: 07/19/2022] [Indexed: 11/17/2022]
Abstract
The characterization of influenza (A/B M2) ion channels is very important as they are potential binding sites for the drugs. We report the all-atom molecular dynamics study of the influenza B M2 ion channel in the presence of explicit solvent and lipid bilayers using the high resolution solid-state NMR structures. The importance of the various protonation states of histidine in the activation of the ion channel is discussed. The conformational changes at the closed and the open structures clearly show that the increase in tilt angle is necessary for the activation of the ion channel. Additionally, the free energy surfaces of the eight systems show the importance of the protonation state of the histidine residues in the activation of the influenza B M2 ion channel. The protonation of the histidine residues increases the tilt angle and the intra-helix distance which is evident from the superimposition of the structures corresponding to the maxima and the minima in the free energy landscape. The findings imply differences in the singly protonated and double protonated conformational states of BM2 ion channel and provide insights to help further studies of these ion channels as the drug targets for the influenza virus.
Collapse
Affiliation(s)
- Unmesh D Chowdhury
- School of Chemical Sciences, National Institute of Science Education & Research - Bhubaneswar, an OCC of Homi Bhabha National Institute, P.O.Jatni, Khurda, Odisha 752050, India
| | - B L Bhargava
- School of Chemical Sciences, National Institute of Science Education & Research - Bhubaneswar, an OCC of Homi Bhabha National Institute, P.O.Jatni, Khurda, Odisha 752050, India.
| |
Collapse
|
14
|
Bolik-Coulon N, Ferrage F. Explicit models of motions to analyze NMR relaxation data in proteins. J Chem Phys 2022; 157:125102. [DOI: 10.1063/5.0095910] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Nuclear Magnetic Resonance (NMR) is a tool of choice to characterize molecular motions. In biological macromolecules, pico- to nano-second motions, in particular, can be probed by nuclear spin relaxation rates which depend on the time fluctuations of the orientations of spin interaction frames. For the past 40 years, relaxation rates have been successfully analyzed using the Model Free (MF) approach which makes no assumption on the nature of motions and reports on the effective amplitude and time-scale of the motions. However, obtaining a mechanistic picture of motions from this type of analysis is difficult at best, unless complemented with molecular dynamics (MD) simulations. In spite of their limited accuracy, such simulations can be used to obtain the information necessary to build explicit models of motions designed to analyze NMR relaxation data. Here, we present how to build such models, suited in particular to describe motions of methyl-bearing protein side-chains and compare them with the MF approach. We show on synthetic data that explicit models of motions are more robust in the presence of rotamer jumps which dominate the relaxation in methyl groups of protein side-chains. We expect this work to motivate the use of explicit models of motion to analyze MD and NMR data.
Collapse
Affiliation(s)
| | - Fabien Ferrage
- Departement de chimie, Ecole Normale Superieure Departement de Chimie, France
| |
Collapse
|
15
|
Fernández-Quintero ML, DeRose EF, Gabel SA, Mueller GA, Liedl KR. Nanobody Paratope Ensembles in Solution Characterized by MD Simulations and NMR. Int J Mol Sci 2022; 23:5419. [PMID: 35628231 PMCID: PMC9141556 DOI: 10.3390/ijms23105419] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/02/2022] [Accepted: 05/10/2022] [Indexed: 01/27/2023] Open
Abstract
Variable domains of camelid antibodies (so-called nanobodies or VHH) are the smallest antibody fragments that retain complete functionality and therapeutic potential. Understanding of the nanobody-binding interface has become a pre-requisite for rational antibody design and engineering. The nanobody-binding interface consists of up to three hypervariable loops, known as the CDR loops. Here, we structurally and dynamically characterize the conformational diversity of an anti-GFP-binding nanobody by using molecular dynamics simulations in combination with experimentally derived data from nuclear magnetic resonance (NMR) spectroscopy. The NMR data contain both structural and dynamic information resolved at various timescales, which allows an assessment of the quality of protein MD simulations. Thus, in this study, we compared the ensembles for the anti-GFP-binding nanobody obtained from MD simulations with results from NMR. We find excellent agreement of the NOE-derived distance maps obtained from NMR and MD simulations and observe similar conformational spaces for the simulations with and without NOE time-averaged restraints. We also compare the measured and calculated order parameters and find generally good agreement for the motions observed in the ps-ns timescale, in particular for the CDR3 loop. Understanding of the CDR3 loop dynamics is especially critical for nanobodies, as this loop is typically critical for antigen recognition.
Collapse
Affiliation(s)
- Monica L. Fernández-Quintero
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innrain 80-82, A-6020 Innsbruck, Austria;
| | - Eugene F. DeRose
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, 111 T.W. Alexander Dr. MD-MR-01, Research Triangle Park, NC 27709, USA; (E.F.D.); (S.A.G.)
| | - Scott A. Gabel
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, 111 T.W. Alexander Dr. MD-MR-01, Research Triangle Park, NC 27709, USA; (E.F.D.); (S.A.G.)
| | - Geoffrey A. Mueller
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences, 111 T.W. Alexander Dr. MD-MR-01, Research Triangle Park, NC 27709, USA; (E.F.D.); (S.A.G.)
| | - Klaus R. Liedl
- Department of General, Inorganic and Theoretical Chemistry, Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innrain 80-82, A-6020 Innsbruck, Austria;
| |
Collapse
|
16
|
Polêto MD, Lemkul JA. Integration of Experimental Data and Use of Automated Fitting Methods in Developing Protein Force Fields. Commun Chem 2022; 5:10.1038/s42004-022-00653-z. [PMID: 35382231 PMCID: PMC8979544 DOI: 10.1038/s42004-022-00653-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 02/21/2022] [Indexed: 01/27/2023] Open
Abstract
The development of accurate protein force fields has been the cornerstone of molecular simulations for the past 50 years. During this period, many lessons have been learned regarding the use of experimental target data and parameter fitting procedures. Here, we review recent advances in protein force field development. We discuss the recent emergence of polarizable force fields and the role of electronic polarization and areas in which additive force fields fall short. The use of automated fitting methods and the inclusion of additional experimental solution data during parametrization is discussed as a means to highlight possible routes to improve the accuracy of force fields even further.
Collapse
Affiliation(s)
- Marcelo D. Polêto
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061 United States
| | - Justin A. Lemkul
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061 United States
- Center for Drug Discovery, Virginia Tech, Blacksburg, VA 24061 United States
| |
Collapse
|
17
|
Conformational ensembles of intrinsically disordered proteins and flexible multidomain proteins. Biochem Soc Trans 2022; 50:541-554. [PMID: 35129612 DOI: 10.1042/bst20210499] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 01/13/2022] [Accepted: 01/17/2022] [Indexed: 12/29/2022]
Abstract
Intrinsically disordered proteins (IDPs) and multidomain proteins with flexible linkers show a high level of structural heterogeneity and are best described by ensembles consisting of multiple conformations with associated thermodynamic weights. Determining conformational ensembles usually involves the integration of biophysical experiments and computational models. In this review, we discuss current approaches to determine conformational ensembles of IDPs and multidomain proteins, including the choice of biophysical experiments, computational models used to sample protein conformations, models to calculate experimental observables from protein structure, and methods to refine ensembles against experimental data. We also provide examples of recent applications of integrative conformational ensemble determination to study IDPs and multidomain proteins and suggest future directions for research in the field.
Collapse
|
18
|
Gavrilov Y, Kümmerer F, Orioli S, Prestel A, Lindorff-Larsen K, Teilum K. Double Mutant of Chymotrypsin Inhibitor 2 Stabilized through Increased Conformational Entropy. Biochemistry 2022; 61:160-170. [PMID: 35019273 DOI: 10.1021/acs.biochem.1c00749] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The conformational heterogeneity of a folded protein can affect not only its function but also stability and folding. We recently discovered and characterized a stabilized double mutant (L49I/I57V) of the protein CI2 and showed that state-of-the-art prediction methods could not predict the increased stability relative to the wild-type protein. Here, we have examined whether changed native-state dynamics, and resulting entropy changes, can explain the stability changes in the double mutant protein, as well as the two single mutant forms. We have combined NMR relaxation measurements of the ps-ns dynamics of amide groups in the backbone and the methyl groups in the side chains with molecular dynamics simulations to quantify the native-state dynamics. The NMR experiments reveal that the mutations have different effects on the conformational flexibility of CI2: a reduction in conformational dynamics (and entropy estimated from this) of the native state of the L49I variant correlates with its decreased stability, while increased dynamics of the I57V and L49I/I57V variants correlates with their increased stability. These findings suggest that explicitly accounting for changes in native-state entropy might be needed to improve the predictions of the effect of mutations on protein stability.
Collapse
Affiliation(s)
- Yulian Gavrilov
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark
| | - Felix Kümmerer
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark
| | - Simone Orioli
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark.,Structural Biophysics, Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100 Copenhagen Ø, Denmark
| | - Andreas Prestel
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark
| | - Kaare Teilum
- Structural Biology and NMR Laboratory and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark
| |
Collapse
|
19
|
Czaplewski C, Gong Z, Lubecka EA, Xue K, Tang C, Liwo A. Recent Developments in Data-Assisted Modeling of Flexible Proteins. Front Mol Biosci 2022; 8:765562. [PMID: 35004845 PMCID: PMC8740120 DOI: 10.3389/fmolb.2021.765562] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 12/06/2021] [Indexed: 11/13/2022] Open
Abstract
Many proteins can fold into well-defined conformations. However, intrinsically-disordered proteins (IDPs) do not possess a defined structure. Moreover, folded multi-domain proteins often digress into alternative conformations. Collectively, the conformational dynamics enables these proteins to fulfill specific functions. Thus, most experimental observables are averaged over the conformations that constitute an ensemble. In this article, we review the recent developments in the concept and methods for the determination of the dynamic structures of flexible peptides and proteins. In particular, we describe ways to extract information from nuclear magnetic resonance small-angle X-ray scattering (SAXS), and chemical cross-linking coupled with mass spectroscopy (XL-MS) measurements. All these techniques can be used to obtain ensemble-averaged restraints or to re-weight the simulated conformational ensembles.
Collapse
Affiliation(s)
| | - Zhou Gong
- Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan, China
| | - Emilia A Lubecka
- Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Gdańsk, Poland
| | - Kai Xue
- PKU-Tsinghua Center for Life Sciences, Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Chun Tang
- PKU-Tsinghua Center for Life Sciences, Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
| | - Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Gdańsk, Poland
| |
Collapse
|
20
|
Hoffmann F, Mulder FAA, Schäfer LV. How Much Entropy Is Contained in NMR Relaxation Parameters? J Phys Chem B 2021; 126:54-68. [PMID: 34936366 DOI: 10.1021/acs.jpcb.1c07786] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Solution-state NMR relaxation experiments are the cornerstone to study internal protein dynamics at an atomic resolution on time scales that are faster than the overall rotational tumbling time τR. Since the motions described by NMR relaxation parameters are connected to thermodynamic quantities like conformational entropies, the question arises how much of the total entropy is contained within this tumbling time. Using all-atom molecular dynamics simulations of the T4 lysozyme, we found that entropy buildup is rather fast for the backbone, such that the majority of the entropy is indeed contained in the short-time dynamics. In contrast, the contribution of the slow dynamics of side chains on time scales beyond τR on the side-chain conformational entropy is significant and should be taken into account for the extraction of accurate thermodynamic properties.
Collapse
Affiliation(s)
- Falk Hoffmann
- Center for Theoretical Chemistry, Ruhr University Bochum, D-44 780 Bochum, Germany
| | - Frans A A Mulder
- Interdisciplinary Nanoscience Center and Department of Chemistry, University of Aarhus, DK-8000 Aarhus, Denmark
| | - Lars V Schäfer
- Center for Theoretical Chemistry, Ruhr University Bochum, D-44 780 Bochum, Germany
| |
Collapse
|