1
|
González-Delgado J, Bernadó P, Neuvial P, Cortés J. Weighted families of contact maps to characterize conformational ensembles of (highly-)flexible proteins. Bioinformatics 2024; 40:btae627. [PMID: 39432675 PMCID: PMC11530230 DOI: 10.1093/bioinformatics/btae627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 09/17/2024] [Accepted: 10/16/2024] [Indexed: 10/23/2024] Open
Abstract
MOTIVATION Characterizing the structure of flexible proteins, particularly within the realm of intrinsic disorder, presents a formidable challenge due to their high conformational variability. Currently, their structural representation relies on (possibly large) conformational ensembles derived from a combination of experimental and computational methods. The detailed structural analysis of these ensembles is a difficult task, for which existing tools have limited effectiveness. RESULTS This study proposes an innovative extension of the concept of contact maps to the ensemble framework, incorporating the intrinsic probabilistic nature of disordered proteins. Within this framework, a conformational ensemble is characterized through a weighted family of contact maps. To achieve this, conformations are first described using a refined definition of contact that appropriately accounts for the geometry of the inter-residue interactions and the sequence context. Representative structural features of the ensemble naturally emerge from the subsequent clustering of the resulting contact-based descriptors. Importantly, transiently populated structural features are readily identified within large ensembles. The performance of the method is illustrated by several use cases and compared with other existing approaches, highlighting its superiority in capturing relevant structural features of highly flexible proteins. AVAILABILITY AND IMPLEMENTATION An open-source implementation of the method is provided together with an easy-to-use Jupyter notebook, available at https://gitlab.laas.fr/moma/WARIO.
Collapse
Affiliation(s)
- Javier González-Delgado
- LAAS-CNRS, Université de Toulouse, CNRS, 31400 Toulouse, France
- Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, 31400 Toulouse, France
| | - Pau Bernadó
- Centre de Biologie Structurale, Université de Montpellier, INSERM, CNRS, 34090 Montpellier, France
| | - Pierre Neuvial
- Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, 31400 Toulouse, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, 31400 Toulouse, France
| |
Collapse
|
2
|
Bastida A, Zúñiga J, Fogolari F, Soler MA. Statistical accuracy of molecular dynamics-based methods for sampling conformational ensembles of disordered proteins. Phys Chem Chem Phys 2024; 26:23213-23227. [PMID: 39190324 DOI: 10.1039/d4cp02564d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/28/2024]
Abstract
The characterization of the statistical ensemble of conformations of intrinsically disordered regions (IDRs) is a great challenge both from experimental and computational points of view. In this respect, a number of protocols have been developed using molecular dynamics (MD) simulations to sample the huge conformational space of the molecule. In this work, we consider one of the best methods available, replica exchange solute tempering (REST), as a reference to compare the results obtained using this method with the results obtained using other methods, in terms of experimentally measurable quantities. Along with the methods assessed, we propose here a novel protocol called probabilistic MD chain growth (PMD-CG), which combines the flexible-meccano and hierarchical chain growth methods with the statistical data obtained from tripeptide MD trajectories as the starting point. The system chosen for testing is a 20-residue region from the C-terminal domain of the p53 tumor suppressor protein (p53-CTD). Our results show that PMD-CG provides an ensemble of conformations extremely quickly, after suitable computation of the conformational pool for all peptide triplets of the IDR sequence. The measurable quantities computed on the ensemble of conformations agree well with those based on the REST conformational ensemble.
Collapse
Affiliation(s)
- Adolfo Bastida
- Departamento de Química Física, Universidad de Murcia, 30100 Murcia, Spain.
| | - José Zúñiga
- Departamento de Química Física, Universidad de Murcia, 30100 Murcia, Spain.
| | - Federico Fogolari
- Dipartimento di Scienze Matematiche, Informatiche e Fisiche, Università di Udine, 33100 Udine, Italy.
| | - Miguel A Soler
- Dipartimento di Scienze Matematiche, Informatiche e Fisiche, Università di Udine, 33100 Udine, Italy.
| |
Collapse
|
3
|
Fiorucci L, Schiavina M, Felli IC, Pierattelli R, Ravera E. Are Protein Conformational Ensembles in Agreement with Experimental Data? A Geometrical Interpretation of the Problem. J Chem Inf Model 2024; 64:5392-5401. [PMID: 38959217 DOI: 10.1021/acs.jcim.4c00582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
The conformational variability of biological macromolecules can play an important role in their biological function. Therefore, understanding conformational variability is expected to be key for predicting the behavior of a particular molecule in the context of organism-wide studies. Several experimental methods have been developed and deployed for accessing this information, and computational methods are continuously updated for the profitable integration of different experimental sources. The outcome of this endeavor is conformational ensembles, which may vary significantly in properties and composition when different ensemble reconstruction methods are used, and this raises the issue of comparing the predicted ensembles against experimental data. In this article, we discuss a geometrical formulation to provide a framework for understanding the agreement of an ensemble prediction to the experimental observations.
Collapse
Affiliation(s)
- Letizia Fiorucci
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Marco Schiavina
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Isabella C Felli
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Roberta Pierattelli
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | - Enrico Ravera
- Department of Chemistry "Ugo Schiff" and Magnetic Resonance Center (CERM), University of Florence, Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Consorzio Interuniversitario Risonanze Magnetiche di Metalloproteine (CIRMMP), Via Luigi Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
- Florence Data Science, University of Florence, Viale G.B. Morgagni 59, 50134 Florence, Italy
| |
Collapse
|
4
|
Waszkiewicz R, Michaś A, Białobrzewski MK, Klepka BP, Cieplak-Rotowska MK, Staszałek Z, Cichocki B, Lisicki M, Szymczak P, Niedzwiecka A. Hydrodynamic Radii of Intrinsically Disordered Proteins: Fast Prediction by Minimum Dissipation Approximation and Experimental Validation. J Phys Chem Lett 2024; 15:5024-5033. [PMID: 38696815 PMCID: PMC11103702 DOI: 10.1021/acs.jpclett.4c00312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/12/2024] [Accepted: 04/26/2024] [Indexed: 05/04/2024]
Abstract
The diffusion coefficients of globular and fully unfolded proteins can be predicted with high accuracy solely from their mass or chain length. However, this approach fails for intrinsically disordered proteins (IDPs) containing structural domains. We propose a rapid predictive methodology for estimating the diffusion coefficients of IDPs. The methodology uses accelerated conformational sampling based on self-avoiding random walks and includes hydrodynamic interactions between coarse-grained protein subunits, modeled using the generalized Rotne-Prager-Yamakawa approximation. To estimate the hydrodynamic radius, we rely on the minimum dissipation approximation recently introduced by Cichocki et al. Using a large set of experimentally measured hydrodynamic radii of IDPs over a wide range of chain lengths and domain contributions, we demonstrate that our predictions are more accurate than the Kirkwood approximation and phenomenological approaches. Our technique may prove to be valuable in predicting the hydrodynamic properties of both fully unstructured and multidomain disordered proteins.
Collapse
Affiliation(s)
- Radost Waszkiewicz
- Institute
of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
| | - Agnieszka Michaś
- Institute
of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
| | - Michał K. Białobrzewski
- Institute
of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
| | - Barbara P. Klepka
- Institute
of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
| | | | - Zuzanna Staszałek
- Institute
of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
| | - Bogdan Cichocki
- Institute
of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
| | - Maciej Lisicki
- Institute
of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
| | - Piotr Szymczak
- Institute
of Theoretical Physics, Faculty of Physics, University of Warsaw, L. Pasteura 5, 02-093 Warsaw, Poland
| | - Anna Niedzwiecka
- Institute
of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668 Warsaw, Poland
| |
Collapse
|
5
|
Shahrajabian MH, Sun W. Characterization of Intrinsically Disordered Proteins in Healthy and Diseased States by Nuclear Magnetic Resonance. Rev Recent Clin Trials 2024; 19:176-188. [PMID: 38409704 DOI: 10.2174/0115748871271420240213064251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/10/2023] [Accepted: 12/13/2023] [Indexed: 02/28/2024]
Abstract
INTRODUCTION Intrinsically Disordered Proteins (IDPs) are active in different cellular procedures like ordered assembly of chromatin and ribosomes, interaction with membrane, protein, and ligand binding, molecular recognition, binding, and transportation via nuclear pores, microfilaments and microtubules process and disassembly, protein functions, RNA chaperone, and nucleic acid binding, modulation of the central dogma, cell cycle, and other cellular activities, post-translational qualification and substitute splicing, and flexible entropic linker and management of signaling pathways. METHODS The intrinsic disorder is a precise structural characteristic that permits IDPs/IDPRs to be involved in both one-to-many and many-to-one signaling. IDPs/IDPRs also exert some dynamical and structural ordering, being much less constrained in their activities than folded proteins. Nuclear magnetic resonance (NMR) spectroscopy is a major technique for the characterization of IDPs, and it can be used for dynamic and structural studies of IDPs. RESULTS AND CONCLUSION This review was carried out to discuss intrinsically disordered proteins and their different goals, as well as the importance and effectiveness of NMR in characterizing intrinsically disordered proteins in healthy and diseased states.
Collapse
Affiliation(s)
- Mohamad Hesam Shahrajabian
- National Key Laboratory of Agricultural Microbiology, Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Wenli Sun
- National Key Laboratory of Agricultural Microbiology, Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| |
Collapse
|
6
|
Białobrzewski MK, Klepka BP, Michaś A, Cieplak-Rotowska MK, Staszałek Z, Niedźwiecka A. Diversity of hydrodynamic radii of intrinsically disordered proteins. EUROPEAN BIOPHYSICS JOURNAL : EBJ 2023; 52:607-618. [PMID: 37831084 PMCID: PMC10618399 DOI: 10.1007/s00249-023-01683-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 08/08/2023] [Accepted: 09/06/2023] [Indexed: 10/14/2023]
Abstract
Intrinsically disordered proteins (IDPs) form an important class of biomolecules regulating biological processes in higher organisms. The lack of a fixed spatial structure facilitates them to perform their regulatory functions and allows the efficiency of biochemical reactions to be controlled by temperature and the cellular environment. From the biophysical point of view, IDPs are biopolymers with a broad configuration state space and their actual conformation depends on non-covalent interactions of its amino acid side chain groups at given temperature and chemical conditions. Thus, the hydrodynamic radius (Rh) of an IDP of a given polymer length (N) is a sequence- and environment-dependent variable. We have reviewed the literature values of hydrodynamic radii of IDPs determined experimentally by SEC, AUC, PFG NMR, DLS, and FCS, and complement them with our FCS results obtained for a series of protein fragments involved in the regulation of human gene expression. The data collected herein show that the values of hydrodynamic radii of IDPs can span the full space between the folded globular and denatured proteins in the Rh(N) diagram.
Collapse
Affiliation(s)
- Michał K Białobrzewski
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland
| | - Barbara P Klepka
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland
| | - Agnieszka Michaś
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland
| | - Maja K Cieplak-Rotowska
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland
- Division of Biophysics, Institute of Experimental Physics, Faculty of Physics, University of Warsaw, Pasteura 5, PL-02093, Warsaw, Poland
- The International Institute of Molecular Mechanisms and Machines, Polish Academy of Sciences, Flisa 6, PL-02247, Warsaw, Poland
| | - Zuzanna Staszałek
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland
| | - Anna Niedźwiecka
- Laboratory of Biological Physics, Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, PL-02668, Warsaw, Poland.
| |
Collapse
|
7
|
Liu ZH, Zhang O, Teixeira JMC, Li J, Head-Gordon T, Forman-Kay JD. SPyCi-PDB: A modular command-line interface for back-calculating experimental datatypes of protein structures. JOURNAL OF OPEN SOURCE SOFTWARE 2023; 8:4861. [PMID: 38726305 PMCID: PMC11081106 DOI: 10.21105/joss.04861] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/12/2024]
Affiliation(s)
- Zi Hao Liu
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5S 1A8, Canada
| | - Oufan Zhang
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
| | - João M C Teixeira
- Department of Biomedical Sciences, University of Padova, Padova 35131, Italy
| | - Jie Li
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
| | - Teresa Head-Gordon
- Pitzer Center for Theoretical Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemistry, University of California, Berkeley, California 94720-1460, USA
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720-1462, USA
- Department of Bioengineering, University of California, Berkeley, California 94720-1762, USA
| | - Julie D Forman-Kay
- Molecular Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5S 1A8, Canada
| |
Collapse
|
8
|
Elena-Real CA, Urbanek A, Lund XL, Morató A, Sagar A, Fournet A, Estaña A, Bellande T, Allemand F, Cortés J, Sibille N, Melki R, Bernadó P. Multi-site-specific isotopic labeling accelerates high-resolution structural investigations of pathogenic huntingtin exon-1. Structure 2023:S0969-2126(23)00126-0. [PMID: 37119819 DOI: 10.1016/j.str.2023.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 03/10/2023] [Accepted: 04/04/2023] [Indexed: 05/01/2023]
Abstract
Huntington's disease neurodegeneration occurs when the number of consecutive glutamines in the huntingtin exon-1 (HTTExon1) exceeds a pathological threshold of 35. The sequence homogeneity of HTTExon1 reduces the signal dispersion in NMR spectra, hampering its structural characterization. By simultaneously introducing three isotopically labeled glutamines in a site-specific manner in multiple concatenated samples, 18 glutamines of a pathogenic HTTExon1 with 36 glutamines were unambiguously assigned. Chemical shift analyses indicate the α-helical persistence in the homorepeat and the absence of an emerging toxic conformation around the pathological threshold. Using the same type of samples, the recognition mechanism of Hsc70 molecular chaperone has been investigated, indicating that it binds to the N17 region of HTTExon1, inducing the partial unfolding of the poly-Q. The proposed strategy facilitates high-resolution structural and functional studies in low-complexity regions.
Collapse
Affiliation(s)
- Carlos A Elena-Real
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Annika Urbanek
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Xamuel L Lund
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France; Institut Laue Langevin, 38000 Grenoble, France
| | - Anna Morató
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Amin Sagar
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Aurélie Fournet
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Alejandro Estaña
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France; LAAS-CNRS, Université de Toulouse, CNRS, 31400, Toulouse, France
| | - Tracy Bellande
- Institut François Jacob, Molecular Imaging Center (MIRCen), Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) and Laboratory of Neurodegenerative Diseases, Centre National de la Recherche Scientifique (CNRS), Université Paris-Saclay, CEA-Fontenay-aux-Roses Bâtiment 61, 18, route du Panorama, 92265 Fontenay-aux-Rses cedex, France
| | - Frédéric Allemand
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, 31400, Toulouse, France
| | - Nathalie Sibille
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France
| | - Ronald Melki
- Institut François Jacob, Molecular Imaging Center (MIRCen), Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA) and Laboratory of Neurodegenerative Diseases, Centre National de la Recherche Scientifique (CNRS), Université Paris-Saclay, CEA-Fontenay-aux-Roses Bâtiment 61, 18, route du Panorama, 92265 Fontenay-aux-Rses cedex, France
| | - Pau Bernadó
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS, 29, rue de Navacelles, 34090 Montpellier, France.
| |
Collapse
|
9
|
González-Delgado J, Sagar A, Zanon C, Lindorff-Larsen K, Bernadó P, Neuvial P, Cortés J. WASCO: A Wasserstein-based statistical tool to compare conformational ensembles of intrinsically disordered proteins. J Mol Biol 2023:168053. [PMID: 36934808 DOI: 10.1016/j.jmb.2023.168053] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 02/10/2023] [Accepted: 03/14/2023] [Indexed: 03/19/2023]
Abstract
The structural investigation of intrinsically disordered proteins (IDPs) requires ensemble models describing the diversity of the conformational states of the molecule. Due to their probabilistic nature, there is a need for new paradigms that understand and treat IDPs from a purely statistical point of view, considering their conformational ensembles as well-defined probability distributions. In this work, we define a conformational ensemble as an ordered set of probability distributions and provide a suitable metric to detect differences between two given ensembles at the residue level, both locally and globally. The underlying geometry of the conformational space is properly integrated, one ensemble being characterized by a set of probability distributions supported on the three-dimensional Euclidean space (for global-scale comparisons) and on the two-dimensional flat torus (for local-scale comparisons). The inherent uncertainty of the data is also taken into account to provide finer estimations of the differences between ensembles. Additionally, an overall distance between ensembles is defined from the differences at the residue level. We illustrate the interest of the approach with several examples of applications for the comparison of conformational ensembles: (i) produced from molecular dynamics (MD) simulations using different force fields, and (ii) before and after refinement with experimental data. We also show the usefulness of the method to assess the convergence of MD simulations, and discuss other potential applications such as in machine-learning-based approaches. The numerical tool has been implemented in Python through easy-to-use Jupyter Notebooks available at https://gitlab.laas.fr/moma/WASCO.
Collapse
Affiliation(s)
- Javier González-Delgado
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France; Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, Toulouse, France
| | - Amin Sagar
- Centre de Biologie Structurale, Université de Montpellier, INSERM, CNRS, Montpellier, France
| | | | - Kresten Lindorff-Larsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Denmark
| | - Pau Bernadó
- Centre de Biologie Structurale, Université de Montpellier, INSERM, CNRS, Montpellier, France
| | - Pierre Neuvial
- Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, Toulouse, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
10
|
Elena-Real CA, Sagar A, Urbanek A, Popovic M, Morató A, Estaña A, Fournet A, Doucet C, Lund XL, Shi ZD, Costa L, Thureau A, Allemand F, Swenson RE, Milhiet PE, Crehuet R, Barducci A, Cortés J, Sinnaeve D, Sibille N, Bernadó P. The structure of pathogenic huntingtin exon 1 defines the bases of its aggregation propensity. Nat Struct Mol Biol 2023; 30:309-320. [PMID: 36864173 DOI: 10.1038/s41594-023-00920-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 01/05/2023] [Indexed: 03/04/2023]
Abstract
Huntington's disease is a neurodegenerative disorder caused by a CAG expansion in the first exon of the HTT gene, resulting in an extended polyglutamine (poly-Q) tract in huntingtin (httex1). The structural changes occurring to the poly-Q when increasing its length remain poorly understood due to its intrinsic flexibility and the strong compositional bias. The systematic application of site-specific isotopic labeling has enabled residue-specific NMR investigations of the poly-Q tract of pathogenic httex1 variants with 46 and 66 consecutive glutamines. Integrative data analysis reveals that the poly-Q tract adopts long α-helical conformations propagated and stabilized by glutamine side chain to backbone hydrogen bonds. We show that α-helical stability is a stronger signature in defining aggregation kinetics and the structure of the resulting fibrils than the number of glutamines. Our observations provide a structural perspective of the pathogenicity of expanded httex1 and pave the way to a deeper understanding of poly-Q-related diseases.
Collapse
Affiliation(s)
- Carlos A Elena-Real
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Amin Sagar
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Annika Urbanek
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Matija Popovic
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Anna Morató
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Alejandro Estaña
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
- LAAS-CNRS, University of Toulouse, CNRS, Toulouse, France
| | - Aurélie Fournet
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Christine Doucet
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Xamuel L Lund
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
- Institute of Laue Langevin, Grenoble, France
| | - Zhen-Dan Shi
- The Chemistry and Synthesis Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Rockville, MD, USA
| | - Luca Costa
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | | | - Frédéric Allemand
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Rolf E Swenson
- The Chemistry and Synthesis Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Rockville, MD, USA
| | | | - Ramon Crehuet
- Institute for Advanced Chemistry of Catalonia (IQAC), CSIC, Barcelona, Spain
| | - Alessandro Barducci
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, University of Toulouse, CNRS, Toulouse, France
| | - Davy Sinnaeve
- Univ. Lille, INSERM, CHU Lille, Institut Pasteur de Lille, U1167 - RID-AGE - Risk Factors and Molecular Determinants of Aging-Related Diseases, Lille, France
- CNRS, EMR9002, Integrative Structural Biology, Lille, France
| | - Nathalie Sibille
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France
| | - Pau Bernadó
- Centre for Structural Biology, University of Montpellier, INSERM, CNRS, Montpellier, France.
| |
Collapse
|
11
|
Structural ensembles of disordered proteins from hierarchical chain growth and simulation. Curr Opin Struct Biol 2023; 78:102501. [PMID: 36463772 DOI: 10.1016/j.sbi.2022.102501] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 10/26/2022] [Accepted: 10/28/2022] [Indexed: 12/03/2022]
Abstract
Disordered proteins and nucleic acids play key roles in cellular function and disease. Here, we review recent advances in the computational exploration of the conformational dynamics of flexible biomolecules. While atomistic molecular dynamics (MD) simulation has seen a lot of improvement in recent years, large-scale computing resources and careful validation are required to simulate full-length disordered biopolymers in solution. As a computationally efficient alternative, hierarchical chain growth (HCG) combines pre-sampled chain fragments in a statistically reproducible manner into ensembles of full-length atomically detailed biomolecular structures. Experimental data can be integrated during and after chain assembly. Applications to the neurodegeneration-linked proteins α-synuclein, tau, and TDP-43, including as condensate, illustrate the use of HCG. We conclude by highlighting the emerging connections to AI-based structural modeling including AlphaFold2.
Collapse
|
12
|
González-Delgado J, Bernadó P, Neuvial P, Cortés J. Statistical proofs of the interdependence between nearest neighbor effects on polypeptide backbone conformations. J Struct Biol 2022; 214:107907. [PMID: 36272694 DOI: 10.1016/j.jsb.2022.107907] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 10/03/2022] [Accepted: 10/09/2022] [Indexed: 11/06/2022]
Abstract
Backbone dihedral angles ϕ and ψ are the main structural descriptors of proteins and peptides. The distribution of these angles has been investigated over decades as they are essential for the validation and refinement of experimental measurements, as well as for structure prediction and design methods. The dependence of these distributions, not only on the nature of each amino acid but also on that of the closest neighbors, has been the subject of numerous studies. Although neighbor-dependent distributions are nowadays generally accepted as a good model, there is still some controversy about the combined effects of left and right neighbors. We have investigated this question using rigorous methods based on recently-developed statistical techniques. Our results unambiguously demonstrate that the influence of left and right neighbors cannot be considered independently. Consequently, three-residue fragments should be considered as the minimal building blocks to investigate polypeptide sequence-structure relationships.
Collapse
Affiliation(s)
- Javier González-Delgado
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France; Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, France
| | - Pau Bernadó
- Centre de Biologie Structurale, Université de Montpellier, INSERM, CNRS, France
| | - Pierre Neuvial
- Institut de Mathématiques de Toulouse, Université de Toulouse, CNRS, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France.
| |
Collapse
|
13
|
Chaves-Arquero B, Martínez-Lumbreras S, Sibille N, Camero S, Bernadó P, Jiménez MÁ, Zorrilla S, Pérez-Cañadillas JM. eIF4G1 N-terminal intrinsically disordered domain is a multi-docking station for RNA, Pab1, Pub1, and self-assembly. Front Mol Biosci 2022; 9:986121. [PMID: 36213119 PMCID: PMC9537944 DOI: 10.3389/fmolb.2022.986121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open
Abstract
Yeast eIF4G1 interacts with RNA binding proteins (RBPs) like Pab1 and Pub1 affecting its function in translation initiation and stress granules formation. We present an NMR and SAXS study of the N-terminal intrinsically disordered region of eIF4G1 (residues 1-249) and its interactions with Pub1, Pab1 and RNA. The conformational ensemble of eIF4G11-249 shows an α-helix within the BOX3 conserved element and a dynamic network of fuzzy π-π and π-cation interactions involving arginine and aromatic residues. The Pab1 RRM2 domain interacts with eIF4G1 BOX3, the canonical interaction site, but also with BOX2, a conserved element of unknown function to date. The RNA1 region interacts with RNA through a new RNA interaction motif and with the Pub1 RRM3 domain. This later also interacts with eIF4G1 BOX1 modulating its intrinsic self-assembly properties. The description of the biomolecular interactions involving eIF4G1 to the residue detail increases our knowledge about biological processes involving this key translation initiation factor.
Collapse
Affiliation(s)
- Belén Chaves-Arquero
- Department of Biological Physical Chemistry, Institute of Physical-Chemistry “Rocasolano”, CSIC, Madrid, Spain
| | - Santiago Martínez-Lumbreras
- Department of Biological Physical Chemistry, Institute of Physical-Chemistry “Rocasolano”, CSIC, Madrid, Spain
| | - Nathalie Sibille
- Centre de Biochimie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France
| | - Sergio Camero
- Department of Biological Physical Chemistry, Institute of Physical-Chemistry “Rocasolano”, CSIC, Madrid, Spain
| | - Pau Bernadó
- Centre de Biochimie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France
| | - M. Ángeles Jiménez
- Department of Biological Physical Chemistry, Institute of Physical-Chemistry “Rocasolano”, CSIC, Madrid, Spain
| | - Silvia Zorrilla
- Department of Cellular and Molecular Biology, Biological Research Center, CSIC, Madrid, Spain
| | | |
Collapse
|
14
|
Teixeira JMC, Liu ZH, Namini A, Li J, Vernon RM, Krzeminski M, Shamandy AA, Zhang O, Haghighatlari M, Yu L, Head-Gordon T, Forman-Kay JD. IDPConformerGenerator: A Flexible Software Suite for Sampling the Conformational Space of Disordered Protein States. J Phys Chem A 2022; 126:5985-6003. [PMID: 36030416 PMCID: PMC9465686 DOI: 10.1021/acs.jpca.2c03726] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 08/08/2022] [Indexed: 11/29/2022]
Abstract
The power of structural information for informing biological mechanisms is clear for stable folded macromolecules, but similar structure-function insight is more difficult to obtain for highly dynamic systems such as intrinsically disordered proteins (IDPs) which must be described as structural ensembles. Here, we present IDPConformerGenerator, a flexible, modular open-source software platform for generating large and diverse ensembles of disordered protein states that builds conformers that obey geometric, steric, and other physical restraints on the input sequence. IDPConformerGenerator samples backbone phi (φ), psi (ψ), and omega (ω) torsion angles of relevant sequence fragments from loops and secondary structure elements extracted from folded protein structures in the RCSB Protein Data Bank and builds side chains from robust Monte Carlo algorithms using expanded rotamer libraries. IDPConformerGenerator has many user-defined options enabling variable fractional sampling of secondary structures, supports Bayesian models for assessing the agreement of IDP ensembles for consistency with experimental data, and introduces a machine learning approach to transform between internal and Cartesian coordinates with reduced error. IDPConformerGenerator will facilitate the characterization of disordered proteins to ultimately provide structural insights into these states that have key biological functions.
Collapse
Affiliation(s)
- João M. C. Teixeira
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Zi Hao Liu
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Ashley Namini
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Jie Li
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Robert M. Vernon
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Mickaël Krzeminski
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Alaa A. Shamandy
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
| | - Oufan Zhang
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Mojtaba Haghighatlari
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Lei Yu
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Teresa Head-Gordon
- Pitzer Center for Theoretical Chemistry, Department of Chemistry, Department of Chemical
and Biomolecular Engineering, and Department of Bioengineering, University of California, Berkeley, California 94720, United States
| | - Julie D. Forman-Kay
- Molecular
Medicine Program, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department
of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
15
|
Low Complexity Induces Structure in Protein Regions Predicted as Intrinsically Disordered. Biomolecules 2022; 12:biom12081098. [PMID: 36008992 PMCID: PMC9405754 DOI: 10.3390/biom12081098] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 08/02/2022] [Accepted: 08/06/2022] [Indexed: 01/02/2023] Open
Abstract
There is increasing evidence that many intrinsically disordered regions (IDRs) in proteins play key functional roles through interactions with other proteins or nucleic acids. These interactions often exhibit a context-dependent structural behavior. We hypothesize that low complexity regions (LCRs), often found within IDRs, could have a role in inducing local structure in IDRs. To test this, we predicted IDRs in the human proteome and analyzed their structures or those of homologous sequences in the Protein Data Bank (PDB). We then identified two types of simple LCRs within IDRs: regions with only one (polyX or homorepeats) or with only two types of amino acids (polyXY). We were able to assign structural information from the PDB more often to these LCRs than to the surrounding IDRs (polyX 61.8% > polyXY 50.5% > IDRs 39.7%). The most frequently observed polyX and polyXY within IDRs contained E (Glu) or G (Gly). Structural analyses of these sequences and of homologs indicate that polyEK regions induce helical conformations, while the other most frequent LCRs induce coil structures. Our work proposes bioinformatics methods to help in the study of the structural behavior of IDRs and provides a solid basis suggesting a structuring role of LCRs within them.
Collapse
|
16
|
Conformational buffering underlies functional selection in intrinsically disordered protein regions. Nat Struct Mol Biol 2022; 29:781-790. [PMID: 35948766 DOI: 10.1038/s41594-022-00811-w] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 06/23/2022] [Indexed: 02/02/2023]
Abstract
Many disordered proteins conserve essential functions in the face of extensive sequence variation, making it challenging to identify the mechanisms responsible for functional selection. Here we identify the molecular mechanism of functional selection for the disordered adenovirus early gene 1A (E1A) protein. E1A competes with host factors to bind the retinoblastoma (Rb) protein, subverting cell cycle regulation. We show that two binding motifs tethered by a hypervariable disordered linker drive picomolar affinity Rb binding and host factor displacement. Compensatory changes in amino acid sequence composition and sequence length lead to conservation of optimal tethering across a large family of E1A linkers. We refer to this compensatory mechanism as conformational buffering. We also detect coevolution of the motifs and linker, which can preserve or eliminate the tethering mechanism. Conformational buffering and motif-linker coevolution explain robust functional encoding within hypervariable disordered linkers and could underlie functional selection of many disordered protein regions.
Collapse
|
17
|
Conformational ensemble of the full-length SARS-CoV-2 nucleocapsid (N) protein based on molecular simulations and SAXS data. Biophys Chem 2022; 288:106843. [PMID: 35696898 PMCID: PMC9172258 DOI: 10.1016/j.bpc.2022.106843] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Revised: 05/10/2022] [Accepted: 06/02/2022] [Indexed: 11/02/2022]
Abstract
The nucleocapsid protein of the SARS-CoV-2 virus comprises two RNA-binding domains and three regions that are intrinsically disordered. While the structures of the RNA-binding domains have been solved using protein crystallography and NMR, current knowledge of the conformations of the full-length nucleocapsid protein is rather limited. To fill in this knowledge gap, we combined coarse-grained molecular simulations with data from small-angle X-ray scattering (SAXS) experiments using the ensemble refinement of SAXS (EROS) method. Our results show that the dimer of the full-length nucleocapsid protein exhibits large conformational fluctuations with its radius of gyration ranging from about 4 to 8 nm. The RNA-binding domains do not make direct contacts. The disordered region that links these two domains comprises a hydrophobic α-helix which makes frequent and nonspecific contacts with the RNA-binding domains. Each of the intrinsically disordered regions adopts conformations that are locally compact, yet on average, much more extended than Gaussian chains of equivalent lengths. We offer a detailed picture of the conformational ensemble of the nucleocapsid protein dimer under near-physiological conditions, which will be important for understanding the nucleocapsid assembly process.
Collapse
|
18
|
Stelzl L, Pietrek LM, Holla A, Oroz J, Sikora M, Köfinger J, Schuler B, Zweckstetter M, Hummer G. Global Structure of the Intrinsically Disordered Protein Tau Emerges from Its Local Structure. JACS AU 2022; 2:673-686. [PMID: 35373198 PMCID: PMC8970000 DOI: 10.1021/jacsau.1c00536] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Indexed: 05/13/2023]
Abstract
The paradigmatic disordered protein tau plays an important role in neuronal function and neurodegenerative diseases. To disentangle the factors controlling the balance between functional and disease-associated conformational states, we build a structural ensemble of the tau K18 fragment containing the four pseudorepeat domains involved in both microtubule binding and amyloid fibril formation. We assemble 129-residue-long tau K18 chains with atomic detail from an extensive fragment library constructed with molecular dynamics simulations. We introduce a reweighted hierarchical chain growth (RHCG) algorithm that integrates experimental data reporting on the local structure into the assembly process in a systematic manner. By combining Bayesian ensemble refinement with importance sampling, we obtain well-defined ensembles and overcome the problem of exponentially varying weights in the integrative modeling of long-chain polymeric molecules. The resulting tau K18 ensembles capture nuclear magnetic resonance (NMR) chemical shift and J-coupling measurements. Without further fitting, we achieve very good agreement with measurements of NMR residual dipolar couplings. The good agreement with experimental measures of global structure such as single-molecule Förster resonance energy transfer (FRET) efficiencies is improved further by ensemble refinement. By comparing wild-type and mutant ensembles, we show that pathogenic single-point P301L, P301S, and P301T mutations shift the population from the turn-like conformations of the functional microtubule-bound state to the extended conformations of disease-associated tau fibrils. RHCG thus provides us with an atomically detailed view of the population equilibrium between functional and aggregation-prone states of tau K18, and demonstrates that global structural characteristics of this intrinsically disordered protein emerge from its local structure.
Collapse
Affiliation(s)
- Lukas
S. Stelzl
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
- Faculty
of Biology, Johannes Gutenberg University
Mainz, Gresemundweg 2, 55128 Mainz, Germany
- KOMET 1, Institute of Physics, Johannes
Gutenberg University Mainz, 55099 Mainz, Germany
- Institute of Molecular Biology (IMB), 55128 Mainz, Germany
| | - Lisa M. Pietrek
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Andrea Holla
- Department
of Biochemistry, University of Zurich, 8057 Zurich, Switzerland
| | - Javier Oroz
- German
Center for Neurodegenerative Diseases (DZNE), von-Siebold-Str. 3a, 37075 Göttingen, Germany
- Rocasolano
Institute for Physical Chemistry, CSIC, Serrano 119, 28006 Madrid, Spain
| | - Mateusz Sikora
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
- Faculty
of Physics, University of Vienna, Kolingasse 14-16, 1090 Vienna, Austria
| | - Jürgen Köfinger
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
| | - Benjamin Schuler
- Department
of Biochemistry, University of Zurich, 8057 Zurich, Switzerland
- Department
of Physics, University of Zurich, 8057 Zurich, Switzerland
| | - Markus Zweckstetter
- German
Center for Neurodegenerative Diseases (DZNE), von-Siebold-Str. 3a, 37075 Göttingen, Germany
- Department
for NMR-based Structural Biology, Max Planck
Institute for Multidisciplinary Sciences, Am Faßberg 11, 37077 Göttingen, Germany
| | - Gerhard Hummer
- Department
of Theoretical Biophysics, Max Planck Institute
of Biophysics, Max-von-Laue-Straße 3, 60438 Frankfurt am Main, Germany
- Institute
for Biophysics, Goethe University Frankfurt, Max-von-Laue-Straße 9, 60438 Frankfurt am Main, Germany
| |
Collapse
|
19
|
Gehi BR, Gadhave K, Uversky VN, Giri R. Intrinsic disorder in proteins associated with oxidative stress-induced JNK signaling. Cell Mol Life Sci 2022; 79:202. [PMID: 35325330 PMCID: PMC11073203 DOI: 10.1007/s00018-022-04230-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 03/01/2022] [Accepted: 03/03/2022] [Indexed: 01/02/2023]
Abstract
The c-Jun N-terminal kinase (JNK) signaling cascade is a mitogen-activated protein kinase (MAPK) signaling pathway that can be activated in response to a wide range of environmental stimuli. Based on the type, degree, and duration of the stimulus, the JNK signaling cascade dictates the fate of the cell by influencing gene expression through its substrate transcription factors. Oxidative stress is a result of a disturbance in the pro-oxidant/antioxidant homeostasis of the cell and is associated with a large number of diseases, such as neurodegenerative disorders, cancer, diabetes, cardiovascular diseases, and disorders of the immune system, where it activates the JNK signaling pathway. Among different biological roles ascribed to the intrinsically disordered proteins (IDPs) and hybrid proteins containing ordered domains and intrinsically disordered protein regions (IDPRs) are signaling hub functions, as intrinsic disorder allows proteins to undertake multiple interactions, each with a different consequence. In order to ensure precise signaling, the cellular abundance of IDPs is highly regulated, and mutations or changes in abundance of IDPs/IDPRs are often associated with disease. In this study, we have used a combination of six disorder predictors to evaluate the presence of intrinsic disorder in proteins of the oxidative stress-induced JNK signaling cascade, and as per our findings, none of the 18 proteins involved in this pathway are ordered. The highest level of intrinsic disorder was observed in the scaffold proteins, JIP1, JIP2, JIP3; dual specificity phosphatases, MKP5, MKP7; 14-3-3ζ and transcription factor c-Jun. The MAP3Ks, MAP2Ks, MAPKs, TRAFs, and thioredoxin were the proteins that were predicted to be moderately disordered. Furthermore, to characterize the predicted IDPs/IDPRs in the proteins of the JNK signaling cascade, we identified the molecular recognition features (MoRFs), posttranslational modification (PTM) sites, and short linear motifs (SLiMs) associated with the disordered regions. These findings will serve as a foundation for experimental characterization of disordered regions in these proteins, which represents a crucial step for a better understanding of the roles of IDPRs in diseases associated with this important pathway.
Collapse
Affiliation(s)
- Bhuvaneshwari R Gehi
- School of Basic Sciences, Indian Institute of Technology Mandi, VPO Kamand, Mandi, Himachal Pradesh, 175005, India
- Molecular Biophysics Unit (MBU), Indian Institute of Science, Bengaluru, 560012, India
| | - Kundlik Gadhave
- School of Basic Sciences, Indian Institute of Technology Mandi, VPO Kamand, Mandi, Himachal Pradesh, 175005, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Moscow region, 142290, Russia.
| | - Rajanish Giri
- School of Basic Sciences, Indian Institute of Technology Mandi, VPO Kamand, Mandi, Himachal Pradesh, 175005, India.
| |
Collapse
|
20
|
Pesce F, Lindorff-Larsen K. Refining conformational ensembles of flexible proteins against small-angle x-ray scattering data. Biophys J 2021; 120:5124-5135. [PMID: 34627764 PMCID: PMC8633713 DOI: 10.1016/j.bpj.2021.10.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 09/09/2021] [Accepted: 10/04/2021] [Indexed: 01/30/2023] Open
Abstract
Intrinsically disordered proteins and flexible regions in multidomain proteins display substantial conformational heterogeneity. Characterizing the conformational ensembles of these proteins in solution typically requires combining one or more biophysical techniques with computational modeling or simulations. Experimental data can either be used to assess the accuracy of a computational model or to refine the computational model to get a better agreement with the experimental data. In both cases, one generally needs a so-called forward model (i.e., an algorithm to calculate experimental observables from individual conformations or ensembles). In many cases, this involves one or more parameters that need to be set, and it is not always trivial to determine the optimal values or to understand the impact on the choice of parameters. For example, in the case of small-angle x-ray scattering (SAXS) experiments, many forward models include parameters that describe the contribution of the hydration layer and displaced solvent to the background-subtracted experimental data. Often, one also needs to fit a scale factor and a constant background for the SAXS data but across the entire ensemble. Here, we present a protocol to dissect the effect of the free parameters on the calculated SAXS intensities and to identify a reliable set of values. We have implemented this procedure in our Bayesian/maximum entropy framework for ensemble refinement and demonstrate the results on four intrinsically disordered proteins and a protein with three domains connected by flexible linkers. Our results show that the resulting ensembles can depend on the parameters used for solvent effects and suggest that these should be chosen carefully. We also find a set of parameters that work robustly across all proteins.
Collapse
Affiliation(s)
- Francesco Pesce
- Structural Biology and NMR Laboratory, The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
21
|
Pelus A, Bordes G, Barbe S, Bouchiba Y, Burnard C, Cortés J, Enjalbert B, Esque J, Estaña A, Fauré R, Henras AK, Heux S, Le Men C, Millard P, Nouaille S, Pérochon J, Toanen M, Truan G, Verdier A, Wagner C, Romeo Y, Montanier CY. A tripartite carbohydrate-binding module to functionalize cellulose nanocrystals. Biomater Sci 2021; 9:7444-7455. [PMID: 34647546 DOI: 10.1039/d1bm01156a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The development of protein and microorganism engineering have led to rising expectations of biotechnology in the design of emerging biomaterials, putatively of high interest to reduce our dependence on fossil carbon resources. In this way, cellulose, a renewable carbon based polysaccharide and derived products, displays unique properties used in many industrial applications. Although the functionalization of cellulose is common, it is however limited in terms of number and type of functions. In this work, a Carbohydrate-Binding Module (CBM) was used as a central core to provide a versatile strategy to bring a large diversity of functions to cellulose surfaces. CBM3a from Clostridium thermocellum, which has a high affinity for crystalline cellulose, was flanked through linkers with a streptavidin domain and an azide group introduced through a non-canonical amino acid. Each of these two extra domains was effectively produced and functionalized with a variety of biological and chemical molecules. Structural properties of the resulting tripartite chimeric protein were investigated using molecular modelling approaches, and its potential for the multi-functionalization of cellulose was confirmed experimentally. As a proof of concept, we show that cellulose can be labelled with a fluorescent version of the tripartite protein grafted to magnetic beads and captured using a magnet.
Collapse
Affiliation(s)
- Angeline Pelus
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Gaëlle Bordes
- Molecular, Cellular and Developmental biology department (MCD), Centre de Biologie Integrative (CBI), Université de Toulouse, CNRS, UPS, 31062 Toulouse, France.
| | - Sophie Barbe
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Younes Bouchiba
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Callum Burnard
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Brice Enjalbert
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Jeremy Esque
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | | | - Régis Fauré
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Anthony K Henras
- Molecular, Cellular and Developmental biology department (MCD), Centre de Biologie Integrative (CBI), Université de Toulouse, CNRS, UPS, 31062 Toulouse, France.
| | - Stéphanie Heux
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Claude Le Men
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Pierre Millard
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | | | - Julien Pérochon
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Marion Toanen
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Gilles Truan
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Amandine Verdier
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Camille Wagner
- TBI, Université de Toulouse, CNRS, INRAE, INSA, Toulouse, France.
| | - Yves Romeo
- Molecular, Cellular and Developmental biology department (MCD), Centre de Biologie Integrative (CBI), Université de Toulouse, CNRS, UPS, 31062 Toulouse, France.
| | | |
Collapse
|
22
|
DIPEND: An Open-Source Pipeline to Generate Ensembles of Disordered Segments Using Neighbor-Dependent Backbone Preferences. Biomolecules 2021; 11:biom11101505. [PMID: 34680137 PMCID: PMC8534045 DOI: 10.3390/biom11101505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 09/26/2021] [Accepted: 10/06/2021] [Indexed: 11/17/2022] Open
Abstract
Ensemble-based structural modeling of flexible protein segments such as intrinsically disordered regions is a complex task often solved by selection of conformers from an initial pool based on their conformity to experimental data. However, the properties of the conformational pool are crucial, as the sampling of the conformational space should be sufficient and, in the optimal case, relatively uniform. In other words, the ideal sampling is both efficient and exhaustive. To achieve this, specialized tools are usually necessary, which might not be maintained in the long term, available on all platforms or flexible enough to be tweaked to individual needs. Here, we present an open-source and extendable pipeline to generate initial protein structure pools for use with selection-based tools to obtain ensemble models of flexible protein segments. Our method is implemented in Python and uses ChimeraX, Scwrl4, Gromacs and neighbor-dependent backbone distributions compiled and published previously by the Dunbrack lab. All these tools and data are publicly available and maintained. Our basic premise is that by using residue-specific, neighbor-dependent Ramachandran distributions, we can enhance the efficient exploration of the relevant region of the conformational space. We have also provided a straightforward way to bias the sampling towards specific conformations for selected residues by combining different conformational distributions. This allows the consideration of a priori known conformational preferences such as in the case of preformed structural elements. The open-source and modular nature of the pipeline allows easy adaptation for specific problems. We tested the pipeline on an intrinsically disordered segment of the protein Cd3ϵ and also a single-alpha helical (SAH) region by generating conformational pools and selecting ensembles matching experimental data using the CoNSEnsX+ server.
Collapse
|
23
|
Clerc I, Sagar A, Barducci A, Sibille N, Bernadó P, Cortés J. The diversity of molecular interactions involving intrinsically disordered proteins: A molecular modeling perspective. Comput Struct Biotechnol J 2021; 19:3817-3828. [PMID: 34285781 PMCID: PMC8273358 DOI: 10.1016/j.csbj.2021.06.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/17/2021] [Accepted: 06/21/2021] [Indexed: 01/15/2023] Open
Abstract
Intrinsically Disordered Proteins and Regions (IDPs/IDRs) are key components of a multitude of biological processes. Conformational malleability enables IDPs/IDRs to perform very specialized functions that cannot be accomplished by globular proteins. The functional role for most of these proteins is related to the recognition of other biomolecules to regulate biological processes or as a part of signaling pathways. Depending on the extent of disorder, the number of interacting sites and the type of partner, very different architectures for the resulting assemblies are possible. More recently, molecular condensates with liquid-like properties composed of multiple copies of IDPs and nucleic acids have been proven to regulate key processes in eukaryotic cells. The structural and kinetic details of disordered biomolecular complexes are difficult to unveil experimentally due to their inherent conformational heterogeneity. Computational approaches, alone or in combination with experimental data, have emerged as unavoidable tools to understand the functional mechanisms of this elusive type of assemblies. The level of description used, all-atom or coarse-grained, strongly depends on the size of the molecular systems and on the timescale of the investigated mechanism. In this mini-review, we describe the most relevant architectures found for molecular interactions involving IDPs/IDRs and the computational strategies applied for their investigation.
Collapse
Affiliation(s)
- Ilinka Clerc
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Amin Sagar
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Alessandro Barducci
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Nathalie Sibille
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Pau Bernadó
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| |
Collapse
|
24
|
Sagar A, Jeffries CM, Petoukhov MV, Svergun DI, Bernadó P. Comment on the Optimal Parameters to Derive Intrinsically Disordered Protein Conformational Ensembles from Small-Angle X-ray Scattering Data Using the Ensemble Optimization Method. J Chem Theory Comput 2021; 17:2014-2021. [PMID: 33725442 DOI: 10.1021/acs.jctc.1c00014] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The Ensemble Optimization Method (EOM) is a popular approach to describe small-angle X-ray scattering (SAXS) data from highly disordered proteins. The EOM algorithm selects subensembles of coexisting states from large pools of randomized conformations to fit the SAXS data. Based on the unphysical bimodal radius of gyration (Rg) distribution of conformations resulting from the EOM analysis, a recent article (Fagerberg et al. J. Chem. Theory Comput. 2019, 15 (12), 6968-6983) concluded that this approach inadequately described the SAXS data measured for human Histatin 5 (Hst5), a peptide with antifungal properties. Using extensive experimental and synthetic data, we explored the origin of this observation. We found that the one-bead-per-residue coarse-grained representation with averaged scattering form factors (provided in the EOM as an add-on to represent disordered missing loops or domains) may not be appropriate for EOM analyses of scattering data from short (below 50 residues) proteins/peptides. The method of choice for these proteins is to employ atomistic models (e.g., from molecular dynamics simulations) to sample the protein conformational landscape. As a convenient alternative, we have also improved the coarse-grained approach by introducing amino acid specific form factors in the calculations. We also found that, for small proteins, the search for relatively large subensembles of 20-50 conformers (as implemented in the original EOM version) more adequately describes the conformational space sampled in solution than the procedures optimizing the ensemble size. Our observations have been added as recommendations into the information for EOM users to promote the proper utilization of the program for ensemble-based modeling of SAXS data for all types of disordered systems.
Collapse
Affiliation(s)
- Amin Sagar
- Centre de Biologie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 29, rue de Navacelles, 34090 Montpellier, France
| | - Cy M Jeffries
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Maxim V Petoukhov
- A.V. Shubnikov Institute of Crystallography of Federal Scientific Research Centre "Crystallography and Photonics" of Russian Academy of Sciences, 119333 Moscow, Russia
| | - Dmitri I Svergun
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Pau Bernadó
- Centre de Biologie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 29, rue de Navacelles, 34090 Montpellier, France
| |
Collapse
|
25
|
Joshi T, Garg S, Estaña A, Cortés J, Bernadó P, Das S, Kammath AR, Sagar A, Rakshit S. Interdomain linkers tailor the stability of immunoglobulin repeats in polyproteins. Biochem Biophys Res Commun 2021; 550:43-48. [PMID: 33684619 DOI: 10.1016/j.bbrc.2021.02.114] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 02/24/2021] [Indexed: 11/28/2022]
Abstract
Linkers in polyproteins are considered as mere spacers between two adjacent domains. However, a series of studies using single-molecule force spectroscopy have recently reported distinct thermodynamic stability of I27 in polyproteins with varying linkers and indicated the vital role of linkers in domain stability. A flexible glycine rich linker (-(GGG)n, n ≥ 3) featured unfolding at lower forces than the regularly used arg-ser (RS) based linker. Interdomain interactions among I27 domains in Gly-rich linkers were suggested to lead to reduced domain stability. However, the negative impact of inter domain interactions on domain stability is thermodynamically counter-intuitive and demanded thorough investigations. Here, using an array of ensemble equilibrium experiments and in-silico measurements with I27 singlet and doublets with two aforementioned linkers, we delineate that the inter-domain interactions in fact raise the stability of the polyprotein with RS linker. More surprisingly, a highly flexible Gly-rich linker has no interference on the stability of polyprotein. Overall, we conclude that flexible linkers are preferred in a polyprotein for maintaining domain's independence.
Collapse
Affiliation(s)
- Tanuja Joshi
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India
| | - Surbhi Garg
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India
| | - Alejandro Estaña
- Centre de BiochimieStructurale, INSERM, CNRS, Université de Montpellier, Montpellier, France; LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
| | - Pau Bernadó
- Centre de BiochimieStructurale, INSERM, CNRS, Université de Montpellier, Montpellier, France
| | - Sayan Das
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India
| | - Anjana R Kammath
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India
| | - Amin Sagar
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India; Centre de BiochimieStructurale, INSERM, CNRS, Université de Montpellier, Montpellier, France.
| | - Sabyasachi Rakshit
- Department of Chemical Sciences, Indian Institute of Science Education and Research, Mohali, Punjab, India; Centre for Protein Science Design and Engineering, Indian Institute of Science Education and Research, Mohali, Punjab, India.
| |
Collapse
|
26
|
Jagannathan NS, Hogue CWV, Tucker-Kellogg L. Computational modeling suggests binding-induced expansion of Epsin disordered regions upon association with AP2. PLoS Comput Biol 2021; 17:e1008474. [PMID: 33406091 PMCID: PMC7787433 DOI: 10.1371/journal.pcbi.1008474] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 10/27/2020] [Indexed: 11/22/2022] Open
Abstract
Intrinsically disordered regions (IDRs) are prevalent in the eukaryotic proteome. Common functional roles of IDRs include forming flexible linkers or undergoing allosteric folding-upon-binding. Recent studies have suggested an additional functional role for IDRs: generating steric pressure on the plasma membrane during endocytosis, via molecular crowding. However, in order to accomplish useful functions, such crowding needs to be regulated in space (e.g., endocytic hotspots) and time (e.g., during vesicle formation). In this work, we explore binding-induced regulation of IDR steric volume. We simulate the IDRs of two proteins from Clathrin-mediated endocytosis (CME) to see if their conformational spaces are regulated via binding-induced expansion. Using Monte-Carlo computational modeling of excluded volumes, we generate large conformational ensembles (3 million) for the IDRs of Epsin and Eps15 and dock the conformers to the alpha subunit of Adaptor Protein 2 (AP2α), their CME binding partner. Our results show that as more molecules of AP2α are bound, the Epsin-derived ensemble shows a significant increase in global dimensions, measured as the radius of Gyration (RG) and the end-to-end distance (EED). Unlike Epsin, Eps15-derived conformers that permit AP2α binding at one motif were found to be more likely to accommodate binding of AP2α at other motifs, suggesting a tendency toward co-accessibility of binding motifs. Co-accessibility was not observed for any pair of binding motifs in Epsin. Thus, we speculate that the disordered regions of Epsin and Eps15 perform different roles during CME, with accessibility in Eps15 allowing it to act as a recruiter of AP2α molecules, while binding-induced expansion of the Epsin disordered region could impose steric pressure and remodel the plasma membrane during vesicle formation. Protein functions were originally believed to arise from ordered protein structures. This dogma was later challenged by the identification of intrinsically disordered proteins that lack specific structure. The functional roles of such proteins usually fell in two categories–exploiting the disorder for flexibility (like floppy connector), or imposing order upon binding to an external partner. In this study we explore the possibility of an alternative mechanism that harnesses disorder for function through regulated molecular crowding. Specifically, we use modeling to study two proteins involved in reshaping the cell membrane, Epsin and Eps15. We ask if they undergo binding-induced expansion, where binding of an external partner AP2 causes not a transition toward order, but rather an energetically favorable increase in propensity to occupy larger volumes. Our results show that Epsin tends to occupy a larger volume when bound to AP2, consistent with increased molecular crowding, which could help reshape the cell membrane. Such regulation of disorder via binding (without folding) opens hitherto unexplored avenues that cells might employ to harness disorder.
Collapse
Affiliation(s)
- N. Suhas Jagannathan
- Cancer & Stem Cell Biology, and Centre for Computational Biology, Duke-NUS Medical School, 8 College Road, Singapore
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
| | - Christopher W. V. Hogue
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
- Mechanobiology Institute, National University of Singapore, Singapore
| | - Lisa Tucker-Kellogg
- Cancer & Stem Cell Biology, and Centre for Computational Biology, Duke-NUS Medical School, 8 College Road, Singapore
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
- * E-mail:
| |
Collapse
|
27
|
Predicting Secondary Structure Propensities in IDPs Using Simple Statistics from Three-Residue Fragments. J Mol Biol 2020; 432:5447-5459. [DOI: 10.1016/j.jmb.2020.07.026] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/30/2020] [Accepted: 07/31/2020] [Indexed: 01/21/2023]
|
28
|
Urbanek A, Popovic M, Morató A, Estaña A, Elena-Real CA, Mier P, Fournet A, Allemand F, Delbecq S, Andrade-Navarro MA, Cortés J, Sibille N, Bernadó P. Flanking Regions Determine the Structure of the Poly-Glutamine in Huntingtin through Mechanisms Common among Glutamine-Rich Human Proteins. Structure 2020; 28:733-746.e5. [PMID: 32402249 DOI: 10.1016/j.str.2020.04.008] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 02/18/2020] [Accepted: 04/11/2020] [Indexed: 10/24/2022]
Abstract
The causative agent of Huntington's disease, the poly-Q homo-repeat in the N-terminal region of huntingtin (httex1), is flanked by a 17-residue-long fragment (N17) and a proline-rich region (PRR), which promote and inhibit the aggregation propensity of the protein, respectively, by poorly understood mechanisms. Based on experimental data obtained from site-specifically labeled NMR samples, we derived an ensemble model of httex1 that identified both flanking regions as opposing poly-Q secondary structure promoters. While N17 triggers helicity through a promiscuous hydrogen bond network involving the side chains of the first glutamines in the poly-Q tract, the PRR promotes extended conformations in neighboring glutamines. Furthermore, a bioinformatics analysis of the human proteome showed that these structural traits are present in many human glutamine-rich proteins and that they are more prevalent in proteins with longer poly-Q tracts. Taken together, these observations provide the structural bases to understand previous biophysical and functional data on httex1.
Collapse
Affiliation(s)
- Annika Urbanek
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Matija Popovic
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Anna Morató
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Alejandro Estaña
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France; LAAS-CNRS, Université de Toulouse, CNRS, 31400 Toulouse, France
| | - Carlos A Elena-Real
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, 55128 Mainz, Germany
| | - Aurélie Fournet
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Frédéric Allemand
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Stephane Delbecq
- Laboratoire de Biologie Cellulaire et Moléculaire (LBCM-EA4558 Vaccination Antiparasitaire), UFR Pharmacie, Université de Montpellier, 34090 Montpellier, France
| | - Miguel A Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, 55128 Mainz, Germany
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS, 31400 Toulouse, France
| | - Nathalie Sibille
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France
| | - Pau Bernadó
- Centre de Biochimie Structurale (CBS), INSERM, CNRS, Université de Montpellier, 34090 Montpellier, France.
| |
Collapse
|
29
|
Mittal A, Changani AM, Taparia S. Unique and exclusive peptide signatures directly identify intrinsically disordered proteins from sequences without structural information. J Biomol Struct Dyn 2020; 39:2885-2893. [PMID: 32295482 DOI: 10.1080/07391102.2020.1756410] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Intrinsically disordered proteins are now widely accepted to play crucial roles in biological functions. Identification of signatures of intrinsic disorder is one of the key steps towards building a proper repertoire for their occurrence in proteomes. In this work, systematic computational synthesis of a library of all possible (3368400) dipeptides, tripeptides, tetrapeptides and pentapeptides using the natural 20 amino acids allowed us to identify 36 unique tetrapeptides present exclusively in intrinsically disordered proteins and absent in the complete primary sequence space of naturally occurring structured proteins. Further, out of more than 530000 known naturally occurring primary sequences without any structural information, 1349 sequences contain the above identified unique signatures of intrinsic disorder. These sequences, having cellular functions varying from housekeeping to metabolic to transport, more than double the number of the currently known intrinsically disordered proteins. On similar lines, we report that 26577 pentapeptide signatures exclusive to intrinsically disordered proteins, and absent in naturally occurring structured proteins, identify ∼50% of more than half-a-million curated protein sequences without structural information to be intrinsically disordered. The results reported are a major leap forward in exploring functional manifestations of intrinsically disordered proteins.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Aditya Mittal
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi (IIT Delhi), New Delhi, India.,Supercomputing Facility for Bioinformatics & Computational Biology, Indian Institute of Technology Delhi (IIT Delhi), New Delhi, India
| | | | - Sakshi Taparia
- Department of Mathematics (Bachelors Program in Mathematics & Computing), Indian Institute of Technology Delhi (IIT Delhi), New Delhi, India
| |
Collapse
|
30
|
Culka M, Rulíšek L. Interplay between Conformational Strain and Intramolecular Interaction in Protein Structures: Which of Them Is Evolutionarily Conserved? J Phys Chem B 2020; 124:3252-3260. [PMID: 32237747 DOI: 10.1021/acs.jpcb.9b11784] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
By computing strain energies of peptide fragments within protein structures and their intramolecular interaction energies, we attempt to reveal general biophysical trends behind the secondary structure formation in the context of protein evolution. Our "protein basis set" consisted of 1143 representatives of different folds obtained from curated SCOPe database, and for each member of the set, the strain and intramolecular energy was calculated on the "rolling tripeptide" basis, employing the DFT-D3/COSMO-RS method for the former and the QM-calibrated force field method (MM) for the latter. The calculated data, strain and interactions, were correlated with the conservation of amino acid residues in secondary structure elements and also with the level of the residue burial within the protein three-dimensional structure. It allowed us to formulate several observations concerning fundamental differences between two main secondary structure motifs: α-helices and β-strands. We have shown that a strong interaction is one of the determining characteristics of the β-sheet formation, at least at the level of tripeptides (and likely penta- or heptapeptides, too), and that the β-strand is a prevailing secondary structure in the strongly-interacting regions of the protein folds conserved by evolution. On the other hand, low strain was neither proven to be an important physicochemical property conserved by evolution nor does it correlate with the propensity for the α-helix and β-strand. Finally, it has been demonstrated that the strong interaction has a certain level of connection with residue burial; however, we demonstrate that these two characteristics should be rather regarded as two complementary factors. These findings represent an important contribution to understanding protein folding from first principles, which is a complementary approach to ongoing efforts to solve the protein folding problem by knowledge-based approaches and machine-learning.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| |
Collapse
|
31
|
Sagar A, Svergun D, Bernadó P. Structural Analyses of Intrinsically Disordered Proteins by Small-Angle X-Ray Scattering. Methods Mol Biol 2020; 2141:249-269. [PMID: 32696361 DOI: 10.1007/978-1-0716-0524-0_12] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Small-angle X-ray scattering (SAXS) is a low-resolution method for the structural characterization of biological macromolecules in solution. Information about the overall structural features provided by SAXS is highly complementary to X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy, which are high-resolution methods. SAXS not only provides the shape, oligomeric state, and quaternary structure of folded proteins and protein complexes but also allows for quantitative analysis of flexible biomolecules. In this chapter, the most relevant SAXS procedures for structural characterization of flexible macromolecules, including intrinsically disordered proteins (IDPs), are presented. The sample requirements for SAXS experiments on protein solutions and the sequence of steps in data collection and processing are described. The use of the advanced data analysis tools to quantitatively characterize flexible proteins is presented in detail. Typical experimental issues and potential problems encountered during SAXS data measurements and analyses are discussed.
Collapse
Affiliation(s)
- Amin Sagar
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, Montpellier, France.
| | - Dmitri Svergun
- European Molecular Biology Laboratory, Hamburg Unit, EMBL c/o DESY, Hamburg, Germany
| | - Pau Bernadó
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, Montpellier, France.
| |
Collapse
|
32
|
Pietrek LM, Stelzl LS, Hummer G. Hierarchical Ensembles of Intrinsically Disordered Proteins at Atomic Resolution in Molecular Dynamics Simulations. J Chem Theory Comput 2019; 16:725-737. [PMID: 31809054 DOI: 10.1021/acs.jctc.9b00809] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Intrinsically disordered proteins (IDPs) constitute a large fraction of the human proteome and are critical in the regulation of cellular processes. A detailed understanding of the conformational dynamics of IDPs could help to elucidate their roles in health and disease. However, the inherent flexibility of IDPs makes structural studies and their interpretation challenging. Molecular dynamics (MD) simulations could address this challenge in principle, but inaccuracies in the simulation models and the need for long simulations have stymied progress. To overcome these limitations, we adopt a hierarchical approach that builds on the "flexible-meccano" model reported by Bernadó et al. (J. Am. Chem. Soc. 2005, 127, 17968-17969). First, we exhaustively sample small IDP fragments in all-atom simulations to capture their local structures. Then, we assemble the fragments into full-length IDPs to explore the stereochemically possible global structures of IDPs. The resulting ensembles of three-dimensional structures of full-length IDPs are highly diverse, much more so than in standard MD simulation. For the paradigmatic IDP α-synuclein, our ensemble captures both the local structure, as probed by nuclear magnetic resonance spectroscopy, and its overall dimension, as obtained from small-angle X-ray scattering in solution. By generating representative and meaningful starting ensembles, we can begin to exploit the massive parallelism afforded by current and future high-performance computing resources for atomic-resolution characterization of IDPs.
Collapse
Affiliation(s)
- Lisa M Pietrek
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany
| | - Lukas S Stelzl
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany
| | - Gerhard Hummer
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany.,Institute for Biophysics , Goethe University Frankfurt , 60438 Frankfurt am Main , Germany
| |
Collapse
|
33
|
Malliavin TE, Mucherino A, Lavor C, Liberti L. Systematic Exploration of Protein Conformational Space Using a Distance Geometry Approach. J Chem Inf Model 2019; 59:4486-4503. [PMID: 31442036 DOI: 10.1021/acs.jcim.9b00215] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The optimization approaches classically used during the determination of protein structure encounter various difficulties, especially when the size of the conformational space is large. Indeed, in such a case, algorithmic convergence criteria are more difficult to set up. Moreover, the size of the search space makes it difficult to achieve a complete exploration. The interval branch-and-prune (iBP) approach, based on the reformulation of the distance geometry problem (DGP) provides a theoretical frame for the generation of protein conformations, by systematically sampling the conformational space. When an appropriate subset of interatomic distances is known exactly, this worst-case exponential-time algorithm is provably complete and fixed-parameter tractable. These guarantees, however, immediately disappear as distance measurement errors are introduced. Here we propose an improvement of this approach: threading-augmented interval branch-and-prune (TAiBP), where the combinatorial explosion of the original iBP approach arising from its exponential complexity is alleviated by partitioning the input instances into consecutive peptide fragments and by using self-organizing maps (SOMs) to obtain clusters of similar solutions. A validation of the TAiBP approach is presented here on a set of proteins of various sizes and structures. The calculation inputs are a uniform covalent geometry extracted from force field covalent terms, the backbone dihedral angles with error intervals, and a few long-range distances. For most of the proteins smaller than 50 residues and interval widths of 20°, the TAiBP approach yielded solutions with RMSD values smaller than 3 Å with respect to the initial protein conformation. The efficiency of the TAiBP approach for proteins larger than 50 residues will require the use of nonuniform covalent geometry and may have benefits from the recent development of residue-specific force-fields.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, UMR 3528, CNRS, and Departement de Bioinformatique, Biostatistique et Biologie Intégrative, USR 3756, CNRS , Institut Pasteur , 75015 Paris , France
| | | | - Carlile Lavor
- Applied Math Department , IMECC-University of Campinas , Campinas , SP 13083-970 , Brazil
| | - Leo Liberti
- LIX CNRS, Ecole Polytechnique , Institut Polytechnique de Paris , Route de Saclay , 91128 Palaiseau , France
| |
Collapse
|
34
|
Abstract
Bayesian and Maximum Entropy approaches allow for a statistically sound and systematic fitting of experimental and computational data. Unfortunately, assessing the relative confidence in these two types of data remains difficult as several steps add unknown error. Here we propose the use of a validation-set method to determine the balance, and thus the amount of fitting. We apply the method to synthetic NMR chemical shift data of an intrinsically disordered protein. We show that the method gives consistent results even when other methods to assess the amount of fitting cannot be applied. Finally, we also describe how the errors in the chemical shift predictor can lead to an incorrect fitting and how using secondary chemical shifts could alleviate this problem.
Collapse
|
35
|
Neira JL, Palomino-Schätzlein M, Ricci C, Ortore MG, Rizzuti B, Iovanna JL. Dynamics of the intrinsically disordered protein NUPR1 in isolation and in its fuzzy complexes with DNA and prothymosin α. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2019; 1867:140252. [PMID: 31325636 DOI: 10.1016/j.bbapap.2019.07.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Revised: 07/09/2019] [Accepted: 07/15/2019] [Indexed: 12/17/2022]
Abstract
Intrinsically disordered proteins (IDPs) explore diverse conformations in their free states and, a few of them, also in their molecular complexes. This functional plasticity is essential for the function of IDPs, although their dynamics in both free and bound states is poorly understood. NUPR1 is a protumoral multifunctional IDP, activated during the acute phases of pancreatitis. It interacts with DNA and other IDPs, such as prothymosin α (ProTα), with dissociation constants of ~0.5 μM, and a 1:1 stoichiometry. We studied the structure and picosecond-to-nanosecond (ps-ns) dynamics by using both NMR and SAXS in: (i) isolated NUPR1; (ii) the NUPR1/ProTα complex; and (iii) the NUPR1/double stranded (ds) GGGCGCGCCC complex. Our SAXS findings show that NUPR1 remained disordered when bound to either partner, adopting a worm-like conformation; the fuzziness of bound NUPR1 was also pinpointed by NMR. Residues with the largest values of the relaxation rates (R1, R1ρ, R2 and ηxy), in the free and bound species, were mainly clustered around the 30s region of the sequence, which agree with one of the protein hot-spots already identified by site-directed mutagenesis. Not only residues in this region had larger relaxation rates, but they also moved slower than the rest of the molecule, as indicated by the reduced spectral density approach (RSDA). Upon binding, the energy landscape of NUPR1 was not funneled down to a specific, well-folded conformation, but rather its backbone flexibility was kept, with distinct motions occurring at the hot-spot region.
Collapse
Affiliation(s)
- José L Neira
- Instituto de Biología Molecular y Celular, Universidad Miguel Hernández, Elche (Alicante), Spain; Instituto de Biocomputación y Física de Sistemas Complejos (BIFI), Joint Units IQFR-CSIC-BIFI, and GBsC-CSIC-BIFI, Universidad de Zaragoza, Zaragoza, Spain.
| | | | - Caterina Ricci
- Department of Life and Environmental Sciences, Marche Polytechnic University, via Brecce Bianche, 60131 Ancona, Italy
| | - Maria Grazia Ortore
- Department of Life and Environmental Sciences, Marche Polytechnic University, via Brecce Bianche, 60131 Ancona, Italy
| | - Bruno Rizzuti
- CNR-NANOTEC, Licryl-UOS Cosenza and CEMIF.Cal, Department of Physics, University of Calabria, Via P. Bucci, Cubo 31 C, 87036 Arcavacata di Rende, Cosenza, Italy
| | - Juan L Iovanna
- Centre de Recherche en Cancérologie de Marseille (CRCM), INSERM U1068, CNRS UMR 7258, Aix-Marseille Université and Institut Paoli-Calmettes, Parc Scientifique et Technologique de Luminy, 163 Avenue de Luminy, 13288 Marseille, France
| |
Collapse
|
36
|
Hartlmüller C, Spreitzer E, Göbl C, Falsone F, Madl T. NMR characterization of solvent accessibility and transient structure in intrinsically disordered proteins. JOURNAL OF BIOMOLECULAR NMR 2019; 73:305-317. [PMID: 31297688 PMCID: PMC6692294 DOI: 10.1007/s10858-019-00248-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 04/11/2019] [Indexed: 05/12/2023]
Abstract
In order to understand the conformational behavior of intrinsically disordered proteins (IDPs) and their biological interaction networks, the detection of residual structure and long-range interactions is required. However, the large number of degrees of conformational freedom of disordered proteins require the integration of extensive sets of experimental data, which are difficult to obtain. Here, we provide a straightforward approach for the detection of residual structure and long-range interactions in IDPs under near-native conditions using solvent paramagnetic relaxation enhancement (sPRE). Our data indicate that for the general case of an unfolded chain, with a local flexibility described by the overwhelming majority of available combinations, sPREs of non-exchangeable protons can be accurately predicted through an ensemble-based fragment approach. We show for the disordered protein α-synuclein and disordered regions of the proteins FOXO4 and p53 that deviation from random coil behavior can be interpreted in terms of intrinsic propensity to populate local structure in interaction sites of these proteins and to adopt transient long-range structure. The presented modification-free approach promises to be applicable to study conformational dynamics of IDPs and other dynamic biomolecules in an integrative approach.
Collapse
Affiliation(s)
- Christoph Hartlmüller
- Center for Integrated Protein Science Munich (CIPSM) at the Department of Chemistry, Technische Universität München, Lichtenbergstrasse 4, 87548, Garching, Germany
| | - Emil Spreitzer
- Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Institute of Molecular Biology & Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6, 8010, Graz, Austria
| | - Christoph Göbl
- The Campbell Family Institute for Breast Cancer Research at Princess Margaret Cancer Centre, 610 University Avenue, Toronto, ON, M5G 2M9, Canada
| | - Fabio Falsone
- Institute of Pharmaceutical Sciences, University of Graz, Schubertstrasse 1, 8010, Graz, Austria
| | - Tobias Madl
- Gottfried Schatz Research Center for Cell Signaling, Metabolism and Aging, Institute of Molecular Biology & Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6, 8010, Graz, Austria.
- BioTechMed-Graz, Graz, Austria.
| |
Collapse
|
37
|
Investigating the Formation of Structural Elements in Proteins Using Local Sequence-Dependent Information and a Heuristic Search Algorithm. Molecules 2019; 24:molecules24061150. [PMID: 30909488 PMCID: PMC6471799 DOI: 10.3390/molecules24061150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 03/14/2019] [Accepted: 03/19/2019] [Indexed: 11/22/2022] Open
Abstract
Structural elements inserted in proteins are essential to define folding/unfolding mechanisms and partner recognition events governing signaling processes in living organisms. Here, we present an original approach to model the folding mechanism of these structural elements. Our approach is based on the exploitation of local, sequence-dependent structural information encoded in a database of three-residue fragments extracted from a large set of high-resolution experimentally determined protein structures. The computation of conformational transitions leading to the formation of the structural elements is formulated as a discrete path search problem using this database. To solve this problem, we propose a heuristically-guided depth-first search algorithm. The domain-dependent heuristic function aims at minimizing the length of the path in terms of angular distances, while maximizing the local density of the intermediate states, which is related to their probability of existence. We have applied the strategy to two small synthetic polypeptides mimicking two common structural motifs in proteins. The folding mechanisms extracted are very similar to those obtained when using traditional, computationally expensive approaches. These results show that the proposed approach, thanks to its simplicity and computational efficiency, is a promising research direction.
Collapse
|
38
|
Chan-Yao-Chong M, Durand D, Ha-Duong T. Molecular Dynamics Simulations Combined with Nuclear Magnetic Resonance and/or Small-Angle X-ray Scattering Data for Characterizing Intrinsically Disordered Protein Conformational Ensembles. J Chem Inf Model 2019; 59:1743-1758. [PMID: 30840442 DOI: 10.1021/acs.jcim.8b00928] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The concept of intrinsically disordered proteins (IDPs) has emerged relatively slowly, but over the past 20 years, it has become an intense research area in structural biology. Indeed, because of their considerable flexibility and structural heterogeneity, the determination of IDP conformational ensemble is particularly challenging and often requires a combination of experimental measurements and computational approaches. With the improved accuracy of all-atom force fields and the increasing computing performances, molecular dynamics (MD) simulations have become more and more reliable to generate realistic conformational ensembles. And the combination of MD simulations with experimental approaches, such as nuclear magnetic resonance (NMR) and/or small-angle X-ray scattering (SAXS) allows one to converge toward a more accurate and exhaustive description of IDP structures. In this Review, we discuss the state of the art of MD simulations of IDP conformational ensembles, with a special focus on studies that back-calculated and directly compared theoretical and experimental NMR or SAXS observables, such as chemical shifts (CS), 3J-couplings (3Jc), residual dipolar couplings (RDC), or SAXS intensities. We organize the review in three parts. In the first section, we discuss the studies which used NMR and/or SAXS data to test and validate the development of force fields or enhanced sampling techniques. In the second part, we explore different methods for the refinement of MD-derived structural ensembles, such as NMR or SAXS data-restrained MD simulations or ensemble reweighting to better fit experiments. Finally, we survey some recent studies combining MD simulations with NMR and/or SAXS measurements to investigate the relationship between IDP conformational ensemble and biological activity, as well as their implication in human diseases. From this review, we noticed that quite a few studies compared MD-generated conformational ensembles with both NMR and SAXS measurements to validate IDP structures at both local and global levels. Yet, beside the IDP propensity to form local secondary structures, their dynamic extension or compactness also appears important for their activity. Thus, we believe that a close synergy between MD simulations, NMR, and SAXS experiments would be greatly appropriate to address the challenges of characterizing the disordered structures of proteins and their complexes, relative to their biological functions.
Collapse
Affiliation(s)
- Maud Chan-Yao-Chong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France.,Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Dominique Durand
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud , Université Paris-Saclay , 91198 , Gif-sur-Yvette cedex, France
| | - Tâp Ha-Duong
- BioCIS, Université Paris-Sud, CNRS , Université Paris-Saclay , 92290 Châtenay-Malabry , France
| |
Collapse
|
39
|
Ciemny MP, Badaczewska-Dawid AE, Pikuzinska M, Kolinski A, Kmiecik S. Modeling of Disordered Protein Structures Using Monte Carlo Simulations and Knowledge-Based Statistical Force Fields. Int J Mol Sci 2019; 20:E606. [PMID: 30708941 PMCID: PMC6386871 DOI: 10.3390/ijms20030606] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/23/2019] [Accepted: 01/29/2019] [Indexed: 12/20/2022] Open
Abstract
The description of protein disordered states is important for understanding protein folding mechanisms and their functions. In this short review, we briefly describe a simulation approach to modeling protein interactions, which involve disordered peptide partners or intrinsically disordered protein regions, and unfolded states of globular proteins. It is based on the CABS coarse-grained protein model that uses a Monte Carlo (MC) sampling scheme and a knowledge-based statistical force field. We review several case studies showing that description of protein disordered states resulting from CABS simulations is consistent with experimental data. The case studies comprise investigations of protein⁻peptide binding and protein folding processes. The CABS model has been recently made available as the simulation engine of multiscale modeling tools enabling studies of protein⁻peptide docking and protein flexibility. Those tools offer customization of the modeling process, driving the conformational search using distance restraints, reconstruction of selected models to all-atom resolution, and simulation of large protein systems in a reasonable computational time. Therefore, CABS can be combined in integrative modeling pipelines incorporating experimental data and other modeling tools of various resolution.
Collapse
Affiliation(s)
- Maciej Pawel Ciemny
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
- Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warsaw, Poland.
| | | | - Monika Pikuzinska
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| |
Collapse
|
40
|
|