1
|
Bakker MJ, Mládek A, Semrád H, Zapletal V, Pavlíková Přecechtělová J. Improving IDP theoretical chemical shift accuracy and efficiency through a combined MD/ADMA/DFT and machine learning approach. Phys Chem Chem Phys 2022; 24:27678-27692. [PMID: 36373847 DOI: 10.1039/d2cp01638a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
This work extends the multi-scale computational scheme for the quantum mechanics (QM) calculations of Nuclear Magnetic Resonance (NMR) chemical shifts (CSs) in proteins that lack a well-defined 3D structure. The scheme couples the sampling of an intrinsically disordered protein (IDP) by classical molecular dynamics (MD) with protein fragmentation using the adjustable density matrix assembler (ADMA) and density functional theory (DFT) calculations. In contrast to our early investigation on IDPs (Pavlíková Přecechtělová et al., J. Chem. Theory Comput., 2019, 15, 5642-5658) and the state-of-the art NMR calculations for structured proteins, a partial re-optimization was implemented on the raw MD geometries in vibrational normal mode coordinates to enhance the accuracy of the MD/ADMA/DFT computational scheme. In addition, machine-learning based cluster analysis was performed on the scheme to explore its potential in producing protein structure ensembles (CLUSTER ensembles) that yield accurate CSs at a reduced computational cost. The performance of the cluster-based calculations is validated against results obtained with conventional structural ensembles consisting of MD snapshots extracted from the MD trajectory at regular time intervals (REGULAR ensembles). CS calculations performed with the refined MD/ADMA/DFT framework employed the 6-311++G(d,p) basis set that outperformed IGLO-III calculations with the same density functional approximation (B3LYP) and both explicit and implicit solvation. The partial geometry optimization did not universally improve the agreement of computed CSs with the experiment but substantially decreased errors associated with the ensemble averaging. A CLUSTER ensemble with 50 structures yielded ensemble averages close to those obtained with a REGULAR ensemble consisting of 500 MD frames. The cluster based calculations thus required only a fraction of the computational time.
Collapse
Affiliation(s)
- Michael J Bakker
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic.
| | - Arnošt Mládek
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic.
| | - Hugo Semrád
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic. .,Department of Chemistry, Faculty of Science, Masaryk University, Kotlářská 267/2, 611 37 Brno, Czech Republic
| | - Vojtěch Zapletal
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic.
| | - Jana Pavlíková Přecechtělová
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic.
| |
Collapse
|
2
|
DIPEND: An Open-Source Pipeline to Generate Ensembles of Disordered Segments Using Neighbor-Dependent Backbone Preferences. Biomolecules 2021; 11:biom11101505. [PMID: 34680137 PMCID: PMC8534045 DOI: 10.3390/biom11101505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/26/2021] [Accepted: 10/06/2021] [Indexed: 11/17/2022] Open
Abstract
Ensemble-based structural modeling of flexible protein segments such as intrinsically disordered regions is a complex task often solved by selection of conformers from an initial pool based on their conformity to experimental data. However, the properties of the conformational pool are crucial, as the sampling of the conformational space should be sufficient and, in the optimal case, relatively uniform. In other words, the ideal sampling is both efficient and exhaustive. To achieve this, specialized tools are usually necessary, which might not be maintained in the long term, available on all platforms or flexible enough to be tweaked to individual needs. Here, we present an open-source and extendable pipeline to generate initial protein structure pools for use with selection-based tools to obtain ensemble models of flexible protein segments. Our method is implemented in Python and uses ChimeraX, Scwrl4, Gromacs and neighbor-dependent backbone distributions compiled and published previously by the Dunbrack lab. All these tools and data are publicly available and maintained. Our basic premise is that by using residue-specific, neighbor-dependent Ramachandran distributions, we can enhance the efficient exploration of the relevant region of the conformational space. We have also provided a straightforward way to bias the sampling towards specific conformations for selected residues by combining different conformational distributions. This allows the consideration of a priori known conformational preferences such as in the case of preformed structural elements. The open-source and modular nature of the pipeline allows easy adaptation for specific problems. We tested the pipeline on an intrinsically disordered segment of the protein Cd3ϵ and also a single-alpha helical (SAH) region by generating conformational pools and selecting ensembles matching experimental data using the CoNSEnsX+ server.
Collapse
|
3
|
Zhang K, Frank AT. Probabilistic Modeling of RNA Ensembles Using NMR Chemical Shifts. J Phys Chem B 2021; 125:9970-9978. [PMID: 34449236 DOI: 10.1021/acs.jpcb.1c05651] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
NMR-derived chemical shifts are structural fingerprints that are sensitive to the underlying conformational distributions of molecules. Thus, chemical shift data are now routinely used to infer the dynamical or conformational ensembles of peptides and proteins. However, for RNAs, techniques for inferring their conformational ensembles from chemical shift data have received less attention. Here, we used chemical shift data and the Bayesian/maximum entropy (BME) approach to model the secondary structure ensembles of several single-stranded RNAs. Inspection of the resulting ensembles indicates that the secondary structure of the highest weighted (most probable) conformer in the ensemble typically resembled the known NMR structure. Furthermore, using apo chemical shifts measured for the HIV-1 TAR RNA, we found that our framework reproduces the expected structure yet predicts the existence of a previously unobserved base pair, which we speculate may be sampled transiently. We expect that the chemical shift-based BME (CS-BME) framework we describe here should find utility as a general strategy for modeling RNA ensembles using chemical shift data.
Collapse
Affiliation(s)
- Kexin Zhang
- Chemistry Department, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| | - Aaron T Frank
- Biophysics Program, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
4
|
Salient Features of Monomeric Alpha-Synuclein Revealed by NMR Spectroscopy. Biomolecules 2020; 10:biom10030428. [PMID: 32164323 PMCID: PMC7175124 DOI: 10.3390/biom10030428] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Revised: 02/09/2020] [Accepted: 03/04/2020] [Indexed: 12/17/2022] Open
Abstract
Elucidating the structural details of proteins is highly valuable and important for the proper understanding of protein function. In the case of intrinsically disordered proteins (IDPs), however, obtaining the structural details is quite challenging, as the traditional structural biology tools have only limited use. Nuclear magnetic resonance (NMR) is a unique experimental tool that provides ensemble conformations of IDPs at atomic resolution, and when studying IDPs, a slightly different experimental strategy needs to be employed than the one used for globular proteins. We address this point by reviewing many NMR investigations carried out on the α-synuclein protein, the aggregation of which is strongly correlated with Parkinson’s disease.
Collapse
|
5
|
Pavlíková Přecechtělová J, Mládek A, Zapletal V, Hritz J. Quantum Chemical Calculations of NMR Chemical Shifts in Phosphorylated Intrinsically Disordered Proteins. J Chem Theory Comput 2019; 15:5642-5658. [DOI: 10.1021/acs.jctc.8b00257] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Jana Pavlíková Přecechtělová
- Faculty of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203, 500 05 Hradec Králové, Czech Republic
| | | | | | | |
Collapse
|
6
|
Generation of the configurational ensemble of an intrinsically disordered protein from unbiased molecular dynamics simulation. Proc Natl Acad Sci U S A 2019; 116:20446-20452. [PMID: 31548393 DOI: 10.1073/pnas.1907251116] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are abundant in eukaryotic proteomes, play a major role in cell signaling, and are associated with human diseases. To understand IDP function it is critical to determine their configurational ensemble, i.e., the collection of 3-dimensional structures they adopt, and this remains an immense challenge in structural biology. Attempts to determine this ensemble computationally have been hitherto hampered by the necessity of reweighting molecular dynamics (MD) results or biasing simulation in order to match ensemble-averaged experimental observables, operations that reduce the precision of the generated model because different structural ensembles may yield the same experimental observable. Here, by employing enhanced sampling MD we reproduce the experimental small-angle neutron and X-ray scattering profiles and the NMR chemical shifts of the disordered N terminal (SH4UD) of c-Src kinase without reweighting or constraining the simulations. The unbiased simulation results reveal a weakly funneled and rugged free energy landscape of SH4UD, which gives rise to a heterogeneous ensemble of structures that cannot be described by simple polymer theory. SH4UD adopts transient helices, which are found away from known phosphorylation sites and could play a key role in the stabilization of structural regions necessary for phosphorylation. Our findings indicate that adequately sampled molecular simulations can be performed to provide accurate physical models of flexible biosystems, thus rationalizing their biological function.
Collapse
|
7
|
Uluca B, Viennet T, Petrović D, Shaykhalishahi H, Weirich F, Gönülalan A, Strodel B, Etzkorn M, Hoyer W, Heise H. DNP-Enhanced MAS NMR: A Tool to Snapshot Conformational Ensembles of α-Synuclein in Different States. Biophys J 2019; 114:1614-1623. [PMID: 29642031 PMCID: PMC5954275 DOI: 10.1016/j.bpj.2018.02.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Revised: 02/04/2018] [Accepted: 02/07/2018] [Indexed: 11/06/2022] Open
Abstract
Intrinsically disordered proteins dynamically sample a wide conformational space and therefore do not adopt a stable and defined three-dimensional conformation. The structural heterogeneity is related to their proper functioning in physiological processes. Knowledge of the conformational ensemble is crucial for a complete comprehension of this kind of proteins. We here present an approach that utilizes dynamic nuclear polarization-enhanced solid-state NMR spectroscopy of sparsely isotope-labeled proteins in frozen solution to take snapshots of the complete structural ensembles by exploiting the inhomogeneously broadened line-shapes. We investigated the intrinsically disordered protein α-synuclein (α-syn), which plays a key role in the etiology of Parkinson’s disease, in three different physiologically relevant states. For the free monomer in frozen solution we could see that the so-called “random coil conformation” consists of α-helical and β-sheet-like conformations, and that secondary chemical shifts of neighboring amino acids tend to be correlated, indicative of frequent formation of secondary structure elements. Based on these results, we could estimate the number of disordered regions in fibrillar α-syn as well as in α-syn bound to membranes in different protein-to-lipid ratios. Our approach thus provides quantitative information on the propensity to sample transient secondary structures in different functional states. Molecular dynamics simulations rationalize the results.
Collapse
Affiliation(s)
- Boran Uluca
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Thibault Viennet
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Dušan Petrović
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany
| | - Hamed Shaykhalishahi
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Franziska Weirich
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Ayşenur Gönülalan
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany
| | - Birgit Strodel
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Theoretical and Computational Chemistry, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Manuel Etzkorn
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Wolfgang Hoyer
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Henrike Heise
- Institute of Complex Systems, Structural Biochemistry, Research Center Jülich, Jülich, Germany; Institute of Physical Biology, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany.
| |
Collapse
|
8
|
Milles S, Salvi N, Blackledge M, Jensen MR. Characterization of intrinsically disordered proteins and their dynamic complexes: From in vitro to cell-like environments. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2018; 109:79-100. [PMID: 30527137 DOI: 10.1016/j.pnmrs.2018.07.001] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2018] [Revised: 07/16/2018] [Accepted: 07/16/2018] [Indexed: 05/08/2023]
Abstract
Over the last two decades, it has become increasingly clear that a large fraction of the human proteome is intrinsically disordered or contains disordered segments of significant length. These intrinsically disordered proteins (IDPs) play important regulatory roles throughout biology, underlining the importance of understanding their conformational behavior and interaction mechanisms at the molecular level. Here we review recent progress in the NMR characterization of the structure and dynamics of IDPs in various functional states and environments. We describe the complementarity of different NMR parameters for quantifying the conformational propensities of IDPs in their isolated and phosphorylated states, and we discuss the challenges associated with obtaining structural models of dynamic protein-protein complexes involving IDPs. In addition, we review recent progress in understanding the conformational behavior of IDPs in cell-like environments such as in the presence of crowding agents, in membrane-less organelles and in the complex environment of the human cell.
Collapse
Affiliation(s)
- Sigrid Milles
- Univ. Grenoble Alpes, CNRS, CEA, IBS, F-38000 Grenoble, France
| | - Nicola Salvi
- Univ. Grenoble Alpes, CNRS, CEA, IBS, F-38000 Grenoble, France
| | | | | |
Collapse
|