1
|
Ruzmetov T, Hung TI, Jonnalagedda SP, Chen SH, Fasihianifard P, Guo Z, Bhanu B, Chang CEA. Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.05.592587. [PMID: 38979147 PMCID: PMC11230202 DOI: 10.1101/2024.05.05.592587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure-function relationship, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand regulations and disease-related aggregations of IDPs is challenging both experimentally and computationally. In this paper we first introduced an unsupervised deep learning-based model, termed Internal Coordinate Net (ICoN), which learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, we selected interpolating data points in the learned latent space that rapidly identify novel synthetic conformations with sophisticated and large-scale sidechains and backbone arrangements. Third, with the highly dynamic amyloid-β 1-42 (Aβ42) monomer, our deep learning model provided a comprehensive sampling of Aβ42's conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations with important interactions in atomistic details that are not included in the training data. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. This approach is highly transferable and can be used for any available data for training. The work also demonstrated the ability for deep learning to utilize learned natural atomistic motions in protein conformation sampling.
Collapse
|
2
|
Ruzmetov T, Hung TI, Jonnalagedda SP, Chen SH, Fasihianifard P, Guo Z, Bhanu B, Chang CEA. Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning. RESEARCH SQUARE 2024:rs.3.rs-4301803. [PMID: 38978607 PMCID: PMC11230488 DOI: 10.21203/rs.3.rs-4301803/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure-function relationship, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand regulations and disease-related aggregations of IDPs is challenging both experimentally and computationally. In this paper first an unsupervised deep learning-based model, termed Internal Coordinate Net (ICoN), is developed that learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, interpolating data points in the learned latent space are selected that rapidly identify novel synthetic conformations with sophisticated and large-scale sidechains and backbone arrangements. Third, with the highly dynamic amyloid-β1-42 (Aβ42) monomer, our deep learning model provided a comprehensive sampling of Aβ42's conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations with important interactions in atomistic details that are not included in the training data. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. The proposed approach is highly transferable and can be used for any available data for training. The work also demonstrated the ability for deep learning to utilize learned natural atomistic motions in protein conformation sampling.
Collapse
Affiliation(s)
- Talant Ruzmetov
- Department of Chemistry, University of California, Riverside, CA92521
| | - Ta I Hung
- Department of Chemistry, University of California, Riverside, CA92521
- Department of Bioengineering, University of California, Riverside, CA92521
| | | | - Si-Han Chen
- Department of Chemistry, University of California, Riverside, CA92521
| | | | - Zhefeng Guo
- Department of Neurology, Brain Research Institute, University of California, Los Angeles, CA 90095
| | - Bir Bhanu
- Department of Bioengineering, University of California, Riverside, CA92521
- Department of Electrical and Computer Engineering, University of California, Riverside, CA92521
| | - Chia-En A Chang
- Department of Chemistry, University of California, Riverside, CA92521
- Department of Bioengineering, University of California, Riverside, CA92521
| |
Collapse
|
3
|
Smardz P, Anila MM, Rogowski P, Li MS, Różycki B, Krupa P. A Practical Guide to All-Atom and Coarse-Grained Molecular Dynamics Simulations Using Amber and Gromacs: A Case Study of Disulfide-Bond Impact on the Intrinsically Disordered Amyloid Beta. Int J Mol Sci 2024; 25:6698. [PMID: 38928405 PMCID: PMC11204378 DOI: 10.3390/ijms25126698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/12/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
Intrinsically disordered proteins (IDPs) pose challenges to conventional experimental techniques due to their large-scale conformational fluctuations and transient structural elements. This work presents computational methods for studying IDPs at various resolutions using the Amber and Gromacs packages with both all-atom (Amber ff19SB with the OPC water model) and coarse-grained (Martini 3 and SIRAH) approaches. The effectiveness of these methodologies is demonstrated by examining the monomeric form of amyloid-β (Aβ42), an IDP, with and without disulfide bonds at different resolutions. Our results clearly show that the addition of a disulfide bond decreases the β-content of Aβ42; however, it increases the tendency of the monomeric Aβ42 to form fibril-like conformations, explaining the various aggregation rates observed in experiments. Moreover, analysis of the monomeric Aβ42 compactness, secondary structure content, and comparison between calculated and experimental chemical shifts demonstrates that all three methods provide a reasonable choice to study IDPs; however, coarse-grained approaches may lack some atomistic details, such as secondary structure recognition, due to the simplifications used. In general, this study not only explains the role of disulfide bonds in Aβ42 but also provides a step-by-step protocol for setting up, conducting, and analyzing molecular dynamics (MD) simulations, which is adaptable for studying other biomacromolecules, including folded and disordered proteins and peptides.
Collapse
Affiliation(s)
| | | | | | | | | | - Pawel Krupa
- Institute of Physics Polish Academy of Sciences, Al. Lotników 32/46, 02-668 Warsaw, Poland; (P.S.); (M.M.A.); (P.R.); (M.S.L.); (B.R.)
| |
Collapse
|
4
|
Manchev YT, Popelier PLA. FFLUX molecular simulations driven by atomic Gaussian process regression models. J Comput Chem 2024; 45:1235-1246. [PMID: 38345165 DOI: 10.1002/jcc.27323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/14/2023] [Accepted: 01/16/2024] [Indexed: 04/19/2024]
Abstract
Machine learning (ML) force fields are revolutionizing molecular dynamics (MD) simulations as they bypass the computational cost associated with ab initio methods but do not sacrifice accuracy in the process. In this work, the GPyTorch library is used to create Gaussian process regression (GPR) models that are interfaced with the next-generation ML force field FFLUX. These models predict atomic properties of different molecular configurations that appear in a progressing MD simulation. An improved kernel function is utilized to correctly capture the periodicity of the input descriptors. The first FFLUX molecular simulations of ammonia, methanol, and malondialdehyde with the updated kernel are performed. Geometry optimizations with the GPR models result in highly accurate final structures with a maximum root-mean-squared deviation of 0.064 Å and sub-kJ mol-1 total energy predictions. Additionally, the models are tested in 298 K MD simulations with FFLUX to benchmark for robustness. The resulting energy and force predictions throughout the simulation are in excellent agreement with ab initio data for ammonia and methanol but decrease in quality for malondialdehyde due to the increased system complexity. GPR model improvements are discussed, which will ensure the future scalability to larger systems.
Collapse
Affiliation(s)
- Yulian T Manchev
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| | - Paul L A Popelier
- Department of Chemistry, The University of Manchester, Manchester, Great Britain
| |
Collapse
|
5
|
Viegas RG, Martins IBS, Leite VBP. Understanding the Energy Landscape of Intrinsically Disordered Protein Ensembles. J Chem Inf Model 2024; 64:4149-4157. [PMID: 38713459 DOI: 10.1021/acs.jcim.4c00080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
A substantial portion of various organisms' proteomes comprises intrinsically disordered proteins (IDPs) that lack a defined three-dimensional structure. These IDPs exhibit a diverse array of conformations, displaying remarkable spatiotemporal heterogeneity and exceptional conformational flexibility. Characterizing the structure or structural ensemble of IDPs presents significant conceptual and methodological challenges owing to the absence of a well-defined native structure. While databases such as the Protein Ensemble Database (PED) provide IDP ensembles obtained through a combination of experimental data and molecular modeling, the absence of reaction coordinates poses challenges in comprehensively understanding pertinent aspects of the system. In this study, we leverage the energy landscape visualization method (JCTC, 6482, 2019) to scrutinize four IDP ensembles sourced from PED. ELViM, a methodology that circumvents the need for a priori reaction coordinates, aids in analyzing the ensembles. The specific IDP ensembles investigated are as follows: two fragments of nucleoporin (NUL: 884-993 and NUS: 1313-1390), yeast sic 1 N-terminal (1-90), and the N-terminal SH3 domain of Drk (1-59). Utilizing ELViM enables the comprehensive validation of ensembles, facilitating the detection of potential inconsistencies in the sampling process. Additionally, it allows for identifying and characterizing the most prevalent conformations within an ensemble. Moreover, ELViM facilitates the comparative analysis of ensembles obtained under diverse conditions, thereby providing a powerful tool for investigating the functional mechanisms of IDPs.
Collapse
Affiliation(s)
- Rafael G Viegas
- Federal Institute of Education, Science and Technology of São Paulo (IFSP), Catanduva, São Paulo 15.808-305, Brazil
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Ingrid B S Martins
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Vitor B P Leite
- Department of Physics, São Paulo State University (UNESP), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, São Paulo 15054-000, Brazil
| |
Collapse
|
6
|
Champion C, Lehner M, Smith AA, Ferrage F, Bolik-Coulon N, Riniker S. Unraveling motion in proteins by combining NMR relaxometry and molecular dynamics simulations: A case study on ubiquitin. J Chem Phys 2024; 160:104105. [PMID: 38465679 DOI: 10.1063/5.0188416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 02/20/2024] [Indexed: 03/12/2024] Open
Abstract
Nuclear magnetic resonance (NMR) relaxation experiments shine light onto the dynamics of molecular systems in the picosecond to millisecond timescales. As these methods cannot provide an atomically resolved view of the motion of atoms, functional groups, or domains giving rise to such signals, relaxation techniques have been combined with molecular dynamics (MD) simulations to obtain mechanistic descriptions and gain insights into the functional role of side chain or domain motion. In this work, we present a comparison of five computational methods that permit the joint analysis of MD simulations and NMR relaxation experiments. We discuss their relative strengths and areas of applicability and demonstrate how they may be utilized to interpret the dynamics in MD simulations with the small protein ubiquitin as a test system. We focus on the aliphatic side chains given the rigidity of the backbone of this protein. We find encouraging agreement between experiment, Markov state models built in the χ1/χ2 rotamer space of isoleucine residues, explicit rotamer jump models, and a decomposition of the motion using ROMANCE. These methods allow us to ascribe the dynamics to specific rotamer jumps. Simulations with eight different combinations of force field and water model highlight how the different metrics may be employed to pinpoint force field deficiencies. Furthermore, the presented comparison offers a perspective on the utility of NMR relaxation to serve as validation data for the prediction of kinetics by state-of-the-art biomolecular force fields.
Collapse
Affiliation(s)
- Candide Champion
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Marc Lehner
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| | - Albert A Smith
- Institute for Medical Physics and Biophysics, Leipzig University, Härtelstrasse 16-18, 04107 Leipzig, Germany
| | - Fabien Ferrage
- Laboratoire des Biomolécules, LBM, Département de Chimie, École normale supérieure, PSL University, Sorbonne Université, CNRS, 75005 Paris, France
| | - Nicolas Bolik-Coulon
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland
| |
Collapse
|
7
|
Bakker M, Sørensen HV, Skepö M. Exploring the Role of Globular Domain Locations on an Intrinsically Disordered Region of p53: A Molecular Dynamics Investigation. J Chem Theory Comput 2024; 20:1423-1433. [PMID: 38230670 PMCID: PMC10867847 DOI: 10.1021/acs.jctc.3c00971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 12/28/2023] [Accepted: 12/29/2023] [Indexed: 01/18/2024]
Abstract
The pre-tetramerization loop (PTL) of the human tumor suppressor protein p53 is an intrinsically disordered region (IDR) necessary for the tetramerization process, and its flexibility contributes to the essential conformational changes needed. Although the IDR can be accurately simulated in the traditional manner of molecular dynamics (MD) with the end-to-end distance (EEdist) unhindered, we sought to explore the effects of restraining the EEdist to the values predicted by electron microscopy (EM) and other distances. Simulating the PTL trajectory with a restrained EEdist , we found an increased agreement of nuclear magnetic resonance (NMR) chemical shifts with experiments. Additionally, we observed a plethora of secondary structures and contacts that only appear when the trajectory is restrained. Our findings expand the understanding of the tetramerization of p53 and provide insight into how mutations could make the protein impotent. In particular, our findings demonstrate the importance of restraining the EEdist in studying IDRs and how their conformations change under different conditions. Our results provide a better understanding of the PTL and the conformational dynamics of IDRs in general, which are useful for further studies regarding mutations and their effects on the activity of p53.
Collapse
Affiliation(s)
- Michael
J. Bakker
- Faculty
of Pharmacy in Hradec Králové, Charles University, Akademika Heyrovského 1203/8, 500 05 Hradec Králové, Czech Republic
- Division
of Computational Chemistry, Department of Chemistry, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
| | - Henrik V. Sørensen
- Division
of Computational Chemistry, Department of Chemistry, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
- MAX
IV Laboratory, Lund Institute of Advanced
Neutron and X-ray Science, Scheelevägen 19, SE-223 770 Lund, Sweden
| | - Marie Skepö
- Division
of Computational Chemistry, Department of Chemistry, Lund University, P.O. Box 124, SE-221 00 Lund, Sweden
- LINXS
- Institute of Advanced Neutron and X-ray Science, Scheelevägen 19, SE-233 70 Lund, Sweden
| |
Collapse
|
8
|
Tolstova AP, Makarov AA, Adzhubei AA. Structure Comparison of Beta Amyloid Peptide Aβ 1-42 Isoforms. Molecular Dynamics Modeling. J Chem Inf Model 2024; 64:918-932. [PMID: 38241093 DOI: 10.1021/acs.jcim.3c01624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2024]
Abstract
Beta amyloid peptide Aβ 1-42 (Aβ42) has a unique dual role in the human organism, as both the peptide with an important physiological function and one of the most toxic biological compounds provoking Alzheimer's disease (AD). There are several known Aβ42 isoforms that we discuss here that are highly neurotoxic and lead to the early onset of AD. Aβ42 is an intrinsically disordered protein with no experimentally solved structure under physiological conditions. The objective of this research was to establish the appropriate molecular dynamics (MD) methodology and model a uniform set of structures for the Aβ42 isoforms that form the core of this study. For that purpose, force field selection and verification including convergence testing for MD simulations was made. Replica exchange MD and conventional MD modeling of several Aβ42 and Aβ16 isoforms that have neurotoxic and amyloidogenic effects impacting the severity of Alzheimer's disease were carried out with the optimal force field and solvent parameters. A standardized ensemble of structures for the Aβ42 and Aβ16 isoforms covering 30-50% of the conformational ensembles extracted from the free energy minima was calculated from MD trajectories. The resulting data set of modeled structures includes Aβ42 wild type, isoD7, pS8, D7H, and H6R-Aβ42 and Aβ16 wild type, isoD7, pS8, D7H, and H6R-Aβ16. The representative structures are given in the Supporting Information; they are open for public access. In the study, we also evaluated the differences between the structures of Aβ42 isoforms and speculate on their possible relevance to the known functions. Utilizing several representative structures for a single disordered protein for docking, with their subsequent averaging by conformations, would markedly increase the reliability of docking results.
Collapse
Affiliation(s)
- Anna P Tolstova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Alexander A Makarov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Alexei A Adzhubei
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
- Washington University School of Medicine and Health Sciences, Washington 20052, D.C., United States
| |
Collapse
|
9
|
Ghosh C, Nagpal S, Muñoz V. Molecular simulations integrated with experiments for probing the interaction dynamics and binding mechanisms of intrinsically disordered proteins. Curr Opin Struct Biol 2024; 84:102756. [PMID: 38118365 DOI: 10.1016/j.sbi.2023.102756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Revised: 11/28/2023] [Accepted: 11/29/2023] [Indexed: 12/22/2023]
Abstract
Intrinsically disordered proteins (IDPs) exploit their plasticity to deploy a rich panoply of soft interactions and binding phenomena. Advances in tailoring molecular simulations for IDPs combined with experimental cross-validation offer an atomistic view of the mechanisms that control IDP binding, function, and dysfunction. The emerging theme is that unbound IDPs autonomously form transient local structures and self-interactions that determine their binding behavior. Recent results have shed light on whether and how IDPs fold, stay disordered or drive condensation upon binding; how they achieve binding specificity and select among competing partners. The disorder-binding paradigm is now being proactively used by researchers to target IDPs for rational drug design and engineer molecular responsive elements for biosensing applications.
Collapse
Affiliation(s)
- Catherine Ghosh
- NSF-CREST Center for Cellular and Biomolecular Machines (CCBM), University of California at Merced, Merced, 95343 CA, USA; Department of Bioengineering, University of California at Merced, Merced, 95343 CA, USA. https://twitter.com/cat_ghosh
| | - Suhani Nagpal
- NSF-CREST Center for Cellular and Biomolecular Machines (CCBM), University of California at Merced, Merced, 95343 CA, USA; Department of Bioengineering, University of California at Merced, Merced, 95343 CA, USA; OpenEye, Cadence Molecular Sciences, Boston, 02114 MA, USA
| | - Victor Muñoz
- NSF-CREST Center for Cellular and Biomolecular Machines (CCBM), University of California at Merced, Merced, 95343 CA, USA; Department of Bioengineering, University of California at Merced, Merced, 95343 CA, USA.
| |
Collapse
|
10
|
de Raffele D, Ilie IM. Unlocking novel therapies: cyclic peptide design for amyloidogenic targets through synergies of experiments, simulations, and machine learning. Chem Commun (Camb) 2024; 60:632-645. [PMID: 38131333 DOI: 10.1039/d3cc04630c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Existing therapies for neurodegenerative diseases like Parkinson's and Alzheimer's address only their symptoms and do not prevent disease onset. Common therapeutic agents, such as small molecules and antibodies struggle with insufficient selectivity, stability and bioavailability, leading to poor performance in clinical trials. Peptide-based therapeutics are emerging as promising candidates, with successful applications for cardiovascular diseases and cancers due to their high bioavailability, good efficacy and specificity. In particular, cyclic peptides have a long in vivo stability, while maintaining a robust antibody-like binding affinity. However, the de novo design of cyclic peptides is challenging due to the lack of long-lived druggable pockets of the target polypeptide, absence of exhaustive conformational distributions of the target and/or the binder, unknown binding site, methodological limitations, associated constraints (failed trials, time, money) and the vast combinatorial sequence space. Hence, efficient alignment and cooperation between disciplines, and synergies between experiments and simulations complemented by popular techniques like machine-learning can significantly speed up the therapeutic cyclic-peptide development for neurodegenerative diseases. We review the latest advancements in cyclic peptide design against amyloidogenic targets from a computational perspective in light of recent advancements and potential of machine learning to optimize the design process. We discuss the difficulties encountered when designing novel peptide-based inhibitors and we propose new strategies incorporating experiments, simulations and machine learning to design cyclic peptides to inhibit the toxic propagation of amyloidogenic polypeptides. Importantly, these strategies extend beyond the mere design of cyclic peptides and serve as template for the de novo generation of (bio)materials with programmable properties.
Collapse
Affiliation(s)
- Daria de Raffele
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| | - Ioana M Ilie
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| |
Collapse
|
11
|
Patel KN, Chavda D, Manna M. Molecular Docking of Intrinsically Disordered Proteins: Challenges and Strategies. Methods Mol Biol 2024; 2780:165-201. [PMID: 38987470 DOI: 10.1007/978-1-0716-3985-6_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Intrinsically disordered proteins (IDPs) are a novel class of proteins that have established a significant importance and attention within a very short period of time. These proteins are essentially characterized by their inherent structural disorder, encoded mainly by their amino acid sequences. The profound abundance of IDPs and intrinsically disordered regions (IDRs) in the biological world delineates their deep-rooted functionality. IDPs and IDRs convey such extensive functionality through their unique dynamic nature, which enables them to carry out huge number of multifaceted biomolecular interactions and make them "interaction hub" of the cellular systems. Additionally, with such widespread functions, their misfunctioning is also intimately associated with multiple diseases. Thus, understanding the dynamic heterogeneity of various IDPs along with their interactions with respective binding partners is an important field with immense potentials in biomolecular research. In this context, molecular docking-based computational approaches have proven to be remarkable in case of ordered proteins. Molecular docking methods essentially model the biomolecular interactions in both structural and energetic terms and use this information to characterize the putative interactions between the two participant molecules. However, direct applications of the conventional docking methods to study IDPs are largely limited by their structural heterogeneity and demands for unique IDP-centric strategies. Thus, in this chapter, we have presented an overview of current methodologies for successful docking operations involving IDPs and IDRs. These specialized methods majorly include the ensemble-based and fragment-based approaches with their own benefits and limitations. More recently, artificial intelligence and machine learning-assisted approaches are also used to significantly reduce the complexity and computational burden associated with various docking applications. Thus, this chapter aims to provide a comprehensive summary of major challenges and recent advancements of molecular docking approaches in the IDP field for their better utilization and greater applicability.Asp (D).
Collapse
Affiliation(s)
- Keyur N Patel
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India
| | - Dhruvil Chavda
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India
| | - Moutusi Manna
- Applied Phycology and Biotechnology Division, CSIR Central Salt and Marine Chemicals Research Institute, Bhavnagar, Gujarat, India.
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India.
| |
Collapse
|
12
|
Zhong H, Liu H, Liu H. Molecular Mechanism of Tau Misfolding and Aggregation: Insights from Molecular Dynamics Simulation. Curr Med Chem 2024; 31:2855-2871. [PMID: 37031392 DOI: 10.2174/0929867330666230409145247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 02/01/2023] [Accepted: 02/27/2023] [Indexed: 04/10/2023]
Abstract
Tau dysfunction has a close association with many neurodegenerative diseases, which are collectively referred to as tauopathies. Neurofibrillary tangles (NFTs) formed by misfolding and aggregation of tau are the main pathological process of tauopathy. Therefore, uncovering the misfolding and aggregation mechanism of tau protein will help to reveal the pathogenic mechanism of tauopathies. Molecular dynamics (MD) simulation is well suited for studying the dynamic process of protein structure changes. It provides detailed information on protein structure changes over time at the atomic resolution. At the same time, MD simulation can also simulate various conditions conveniently. Based on these advantages, MD simulations are widely used to study conformational transition problems such as protein misfolding and aggregation. Here, we summarized the structural features of tau, the factors affecting its misfolding and aggregation, and the applications of MD simulations in the study of tau misfolding and aggregation.
Collapse
Affiliation(s)
- Haiyang Zhong
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Hongli Liu
- Jiangsu Key Laboratory of New Drug Research and Clinical Pharmacy, Xuzhou Medical University, Xuzhou, 221004, Jiangsu, China
| | - Huanxiang Liu
- Faculty of Applied Science, Macao Polytechnic University, Macao, SAR, 999078, China
| |
Collapse
|
13
|
Yamada T, Miyazaki Y, Harada S, Kumar A, Vanni S, Shinoda W. Improved Protein Model in SPICA Force Field. J Chem Theory Comput 2023; 19:8967-8977. [PMID: 37989551 DOI: 10.1021/acs.jctc.3c01016] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
The previous version of the SPICA coarse-grained (CG) force field (FF) protein model focused primarily on membrane proteins and successfully reproduced the dimerization free energies of several transmembrane helices and the stable structures of various membrane protein assemblies. However, that model had limited accuracy when applied to other proteins, such as intrinsically disordered proteins (IDPs) and peripheral proteins, because the dimensions of the IDPs in an aqueous solution were too compact, and protein binding on the lipid membrane surface was overstabilized. To improve the accuracy of the SPICA FF model for the simulation of such systems, in this study, we introduce protein secondary structure-dependent nonbonded interaction parameters to the backbone segments and reoptimize almost all nonbonded parameters for amino acids. The improved FF proposed here successfully reproduces the radii of gyration of various IDPs, the binding sensitivity of several peripheral membrane proteins, and the dimerization free energies of several transmembrane helices. The new model also shows improved agreement with experiments on the free energy of peptide association in water. In addition, an extensive library of nonbonded interactions between proteins and lipids, including various glycerophospholipids, sphingolipids, and cholesterol, allows the study of specific interactions between lipids and peripheral and transmembrane proteins. Hence, the new SPICA FF (version 2) proposed herein is applicable with high accuracy for simulating a wide range of protein systems.
Collapse
Affiliation(s)
- Teppei Yamada
- Graduate School of Natural Science and Technology, Okayama University, 3-1-1 Tsushima-naka, Kita-ku, Okayama 700-8530, Japan
| | - Yusuke Miyazaki
- Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-naka, Kita-ku, Okayama 700-8530, Japan
| | - Shogo Harada
- Department of Materials Chemistry, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Ashutosh Kumar
- Department of Biology and National Center of Competence in Research Bio-inspired Materials, University of Fribourg, Chemin du Musée 10, 1700 Fribourg, Switzerland
| | - Stefano Vanni
- Department of Biology and National Center of Competence in Research Bio-inspired Materials, University of Fribourg, Chemin du Musée 10, 1700 Fribourg, Switzerland
| | - Wataru Shinoda
- Research Institute for Interdisciplinary Science, Okayama University, 3-1-1 Tsushima-naka, Kita-ku, Okayama 700-8530, Japan
| |
Collapse
|
14
|
Zhu J, Li Z, Tong H, Lu Z, Zhang N, Wei T, Chen HF. Phanto-IDP: compact model for precise intrinsically disordered protein backbone generation and enhanced sampling. Brief Bioinform 2023; 25:bbad429. [PMID: 38018910 PMCID: PMC10783862 DOI: 10.1093/bib/bbad429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 09/21/2023] [Accepted: 11/05/2023] [Indexed: 11/30/2023] Open
Abstract
The biological function of proteins is determined not only by their static structures but also by the dynamic properties of their conformational ensembles. Numerous high-accuracy static structure prediction tools have been recently developed based on deep learning; however, there remains a lack of efficient and accurate methods for exploring protein dynamic conformations. Traditionally, studies concerning protein dynamics have relied on molecular dynamics (MD) simulations, which incur significant computational costs for all-atom precision and struggle to adequately sample conformational spaces with high energy barriers. To overcome these limitations, various enhanced sampling techniques have been developed to accelerate sampling in MD. Traditional enhanced sampling approaches like replica exchange molecular dynamics (REMD) and frontier expansion sampling (FEXS) often follow the MD simulation approach and still cost a lot of computational resources and time. Variational autoencoders (VAEs), as a classic deep generative model, are not restricted by potential energy landscapes and can explore conformational spaces more efficiently than traditional methods. However, VAEs often face challenges in generating reasonable conformations for complex proteins, especially intrinsically disordered proteins (IDPs), which limits their application as an enhanced sampling method. In this study, we presented a novel deep learning model (named Phanto-IDP) that utilizes a graph-based encoder to extract protein features and a transformer-based decoder combined with variational sampling to generate highly accurate protein backbones. Ten IDPs and four structured proteins were used to evaluate the sampling ability of Phanto-IDP. The results demonstrate that Phanto-IDP has high fidelity and diversity in the generated conformation ensembles, making it a suitable tool for enhancing the efficiency of MD simulation, generating broader protein conformational space and a continuous protein transition path.
Collapse
Affiliation(s)
- Junjie Zhu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhengxin Li
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Haowei Tong
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Zhouyu Lu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Ningjie Zhang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Ting Wei
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
15
|
Sun W, Lebedenko OO, Salguero NG, Shannon MD, Zandian M, Poirier MG, Skrynnikov NR, Jaroniec CP. Conformational and Interaction Landscape of Histone H4 Tails in Nucleosomes Probed by Paramagnetic NMR Spectroscopy. J Am Chem Soc 2023; 145:25478-25485. [PMID: 37943892 PMCID: PMC10719895 DOI: 10.1021/jacs.3c10340] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
The fundamental repeat unit of chromatin, the nucleosome, consists of approximately 147 base pairs of double-stranded DNA and a histone protein octamer containing two copies each of histones H2A, H2B, H3, and H4. Each histone possesses a dynamically disordered N-terminal tail domain, and it is well-established that the tails of histones H3 and H4 play key roles in chromatin compaction and regulation. Here we investigate the conformational ensemble and interactions of the H4 tail in nucleosomes by means of solution NMR measurements of paramagnetic relaxation enhancements (PREs) in recombinant samples reconstituted with 15N-enriched H4 and nitroxide spin-label tagged H3. The experimental PREs, which report on the proximities of individual H4 tail residues to the different H3 spin-label sites, are interpreted by using microsecond time-scale molecular dynamics simulations of the nucleosome core particle. Collectively, these data enable improved localization of histone H4 tails in nucleosomes and support the notion that H4 tails engage in a fuzzy complex interaction with nucleosomal DNA.
Collapse
Affiliation(s)
- Wenjun Sun
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| | - Olga O. Lebedenko
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg 199034, Russia
| | - Nicole Gonzalez Salguero
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| | - Matthew D. Shannon
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| | - Mohamad Zandian
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| | - Michael G. Poirier
- Department of Physics, The Ohio State University, Columbus, Ohio 43210, United States
| | - Nikolai R. Skrynnikov
- Laboratory of Biomolecular NMR, St. Petersburg State University, St. Petersburg 199034, Russia
- Department of Chemistry, Purdue University, West Lafayette 47907, United States
| | - Christopher P. Jaroniec
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210, United States
| |
Collapse
|
16
|
Wang X, Wang Y, Guo M, Wang X, Li Y, Zhang JZH. Assessment of an Electrostatic Energy-Based Charge Model for Modeling the Electrostatic Interactions in Water Solvent. J Chem Theory Comput 2023; 19:6294-6312. [PMID: 37656610 DOI: 10.1021/acs.jctc.3c00467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/03/2023]
Abstract
The protein force field based on the restrained electrostatic potential (RESP) charges has limitations in accurately describing hydrogen bonding interactions in proteins. To address this issue, we propose an alternative approach called the electrostatic energy-based charges (EEC) model, which shows improved performance in describing electrostatic interactions (EIs) of hydrogen bonds in proteins. In this study, we further investigate the performance of the EEC model in modeling EIs in water solvent. Our findings demonstrate that the fixed EEC model can effectively reproduce the quantum mechanics/molecular mechanics (QM/MM)-calculated EIs between a water molecule and various water solvent environments. However, to achieve the same level of computational accuracy, the electrostatic potential (ESP) charge model needs to fluctuate according to the electrostatic environment. Our analysis indicates that the requirement for charge adjustments depends on the specific mathematical and physical representation of EIs as a function of the environment for deriving charges. By comparing with widely used empirical water models calibrated to reproduce experimental properties, we confirm that the performance of the EEC model in reproducing QM/MM EIs is similar to that of general purpose TIP4P-like water models such as TIP4P-Ew and TIP4P/2005. When comparing the computed 10,000 distinct EI values within the range of -40 to 0 kcal/mol with the QM/MM results calculated at the MP2/aug-cc-pVQZ/TIP3P level, we noticed that the mean unsigned error (MUE) for the EEC model is merely 0.487 kcal/mol, which is remarkably similar to the MUE values of the TIP4P-Ew (0.63 kcal/mol) and TIP4P/2005 (0.579 kcal/mol) models. However, both the RESP method and the TIP3P model exhibit a tendency to overestimate the EIs, as evidenced by their higher MUE values of 1.761 and 1.293 kcal/mol, respectively. EEC-based molecular dynamics simulations have demonstrated that, when combined with appropriate van der Waals parameters, the EEC model can closely reproduce oxygen-oxygen radial distribution function and density of water, showing a remarkable similarity to the well-established TIP4P-like empirical water models. Our results demonstrate that the EEC model has the potential to build force fields with comparable accuracy to more sophisticated empirical TIP4P-like water models.
Collapse
Affiliation(s)
- Xianwei Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Yiying Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Man Guo
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Xuechao Wang
- College of Science, Zhejiang University of Technology, Hangzhou, Zhejiang 310023, China
| | - Yang Li
- College of Information Science and Engineering, Shandong Agricultural University, Tai'an, Shandong 271018, China
| | - John Z H Zhang
- Shenzhen Institute of Synthetic Biology, Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan, Shanxi 030006, China
| |
Collapse
|
17
|
Chan-Yao-Chong M, Chan J, Kono H. Benchmarking of force fields to characterize the intrinsically disordered R2-FUS-LC region. Sci Rep 2023; 13:14226. [PMID: 37648703 PMCID: PMC10468508 DOI: 10.1038/s41598-023-40801-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Accepted: 08/16/2023] [Indexed: 09/01/2023] Open
Abstract
Intrinsically Disordered Proteins (IDPs) play crucial roles in numerous diseases like Alzheimer's and ALS by forming irreversible amyloid fibrils. The effectiveness of force fields (FFs) developed for globular proteins and their modified versions for IDPs varies depending on the specific protein. This study assesses 13 FFs, including AMBER and CHARMM, by simulating the R2 region of the FUS-LC domain (R2-FUS-LC region), an IDP implicated in ALS. Due to the flexibility of the region, we show that utilizing multiple measures, which evaluate the local and global conformations, and combining them together into a final score are important for a comprehensive evaluation of force fields. The results suggest c36m2021s3p with mTIP3p water model is the most balanced FF, capable of generating various conformations compatible with known ones. In addition, the mTIP3P water model is computationally more efficient than those of top-ranked AMBER FFs with four-site water models. The evaluation also reveals that AMBER FFs tend to generate more compact conformations compared to CHARMM FFs but also more non-native contacts. The top-ranking AMBER and CHARMM FFs can reproduce intra-peptide contacts but underperform for inter-peptide contacts, indicating there is room for improvement.
Collapse
Affiliation(s)
- Maud Chan-Yao-Chong
- Molecular Modeling and Simulation (MMS) Team, Institute for Quantum Life Science, National Institutes for Quantum Science and Technology (QST), 4-9-1, Anagawa, Inage Ward, Chiba City, Chiba, 263-8555, Japan
- Toulouse Biotechnology Institute, TBI, Université de Toulouse, CNRS, INRAE, INSA, 135, Avenue de Rangueil, 31077, Toulouse Cedex 04, France
| | - Justin Chan
- Molecular Modeling and Simulation (MMS) Team, Institute for Quantum Life Science, National Institutes for Quantum Science and Technology (QST), 4-9-1, Anagawa, Inage Ward, Chiba City, Chiba, 263-8555, Japan
| | - Hidetoshi Kono
- Molecular Modeling and Simulation (MMS) Team, Institute for Quantum Life Science, National Institutes for Quantum Science and Technology (QST), 4-9-1, Anagawa, Inage Ward, Chiba City, Chiba, 263-8555, Japan.
| |
Collapse
|
18
|
Pan Z, Mu J, Chen HF. Balanced Three-Point Water Model OPC3-B for Intrinsically Disordered and Ordered Proteins. J Chem Theory Comput 2023; 19:4837-4850. [PMID: 37452752 DOI: 10.1021/acs.jctc.3c00297] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2023]
Abstract
Intrinsically disordered proteins (IDPs) play a critical role in many biological processes. Due to the inherent structural flexibility of IDPs, experimental methods present significant challenges for sampling their conformational information at the atomic level. Therefore, molecular dynamics (MD) simulations have emerged as the primary tools for modeling IDPs whose accuracy depend on force field and water model. To enhance the accuracy of physical modeling of IDPs, several force fields have been developed. However, current water models lack precision and underestimate the interaction between water molecules and proteins. Here, we used Monte-Carlo re-weighting method to re-parameterize a three-point water model based on OPC3 for IDPs (named OPC3-B). We benchmarked the performance of OPC3-B compared with nine different water models for 10 IDPs and three ordered proteins. The results indicate that the performance of OPC3-B is better than other water models for both IDPs and ordered proteins. At the same time, OPC3-B possess the power of transferability with other force field to simulate IDPs. This newly developed water model can be used to insight into the research of sequence-disordered-function paradigm for IDPs.
Collapse
Affiliation(s)
- Zhengsong Pan
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- 4+4 Medical Doctor Program, Chinese Academy of Medical Science and Peking Union Medical College, Beijing 100730, China
| | - Junxi Mu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai Center for Bioinformation Technology, Shanghai 200235, China
| |
Collapse
|
19
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
20
|
Zhang P, Yang W. Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein. J Chem Phys 2023; 159:024118. [PMID: 37431910 PMCID: PMC10481389 DOI: 10.1063/5.0142280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Molecular dynamics (MD) is an extremely powerful, highly effective, and widely used approach to understanding the nature of chemical processes in atomic details for proteins. The accuracy of results from MD simulations is highly dependent on force fields. Currently, molecular mechanical (MM) force fields are mainly utilized in MD simulations because of their low computational cost. Quantum mechanical (QM) calculation has high accuracy, but it is exceedingly time consuming for protein simulations. Machine learning (ML) provides the capability for generating accurate potential at the QM level without increasing much computational effort for specific systems that can be studied at the QM level. However, the construction of general machine learned force fields, needed for broad applications and large and complex systems, is still challenging. Here, general and transferable neural network (NN) force fields based on CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments partitioned from the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that are similar to MM inputs, including bonds, angles, dihedrals, and non-bonded terms, which enhance the compatibility of CHARMM-NN to MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the energy of the protein is based on rSMF and NN, the nonbonded interactions between the fragments and with water are taken from the CHARMM force field through mechanical embedding. The validations of the method for dipeptides on geometric data, relative potential energies, and structural reorganization energies demonstrate that the CHARMM-NN local minima on the potential energy surface are very accurate approximations to QM, showing the success of CHARMM-NN for bonded interactions. However, the MD simulations on peptides and proteins indicate that more accurate methods to represent protein-water interactions in fragments and non-bonded interactions between fragments should be considered in the future improvement of CHARMM-NN, which can increase the accuracy of approximation beyond the current mechanical embedding QM/MM level.
Collapse
Affiliation(s)
- Pan Zhang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
21
|
Kaur U, Kihn KC, Ke H, Kuo W, Gierasch LM, Hebert DN, Wintrode PL, Deredge D, Gershenson A. The conformational landscape of a serpin N-terminal subdomain facilitates folding and in-cell quality control. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.24.537978. [PMID: 37163105 PMCID: PMC10168285 DOI: 10.1101/2023.04.24.537978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Many multi-domain proteins including the serpin family of serine protease inhibitors contain non-sequential domains composed of regions that are far apart in sequence. Because proteins are translated vectorially from N- to C-terminus, such domains pose a particular challenge: how to balance the conformational lability necessary to form productive interactions between early and late translated regions while avoiding aggregation. This balance is mediated by the protein sequence properties and the interactions of the folding protein with the cellular quality control machinery. For serpins, particularly α 1 -antitrypsin (AAT), mutations often lead to polymer accumulation in cells and consequent disease suggesting that the lability/aggregation balance is especially precarious. Therefore, we investigated the properties of progressively longer AAT N-terminal fragments in solution and in cells. The N-terminal subdomain, residues 1-190 (AAT190), is monomeric in solution and efficiently degraded in cells. More β -rich fragments, 1-290 and 1-323, form small oligomers in solution, but are still efficiently degraded, and even the polymerization promoting Siiyama (S53F) mutation did not significantly affect fragment degradation. In vitro, the AAT190 region is among the last regions incorporated into the final structure. Hydrogen-deuterium exchange mass spectrometry and enhanced sampling molecular dynamics simulations show that AAT190 has a broad, dynamic conformational ensemble that helps protect one particularly aggregation prone β -strand from solvent. These AAT190 dynamics result in transient exposure of sequences that are buried in folded, full-length AAT, which may provide important recognition sites for the cellular quality control machinery and facilitate degradation and, under favorable conditions, reduce the likelihood of polymerization.
Collapse
Affiliation(s)
- Upneet Kaur
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
| | - Kyle C. Kihn
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201
| | - Haiping Ke
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
| | - Weiwei Kuo
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
| | - Lila M. Gierasch
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, MA 01003
- Department of Chemistry, University of Massachusetts, Amherst, MA 01003
| | - Daniel N. Hebert
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, MA 01003
| | - Patrick L. Wintrode
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201
| | - Daniel Deredge
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201
| | - Anne Gershenson
- Department of Biochemistry & Molecular Biology, University of Massachusetts, Amherst, MA 01003
- Program in Molecular and Cellular Biology, University of Massachusetts, Amherst, MA 01003
| |
Collapse
|
22
|
Amundarain MJ, Vietri A, Dodero VI, Costabel MD. IDP Force Fields Applied to Model PPII-Rich 33-mer Gliadin Peptides. J Phys Chem B 2023; 127:2407-2417. [PMID: 36884001 DOI: 10.1021/acs.jpcb.3c00200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
The 33-mer gliadin peptide and its deamidated metabolite, 33-mer DGP, are the immunodominant peptides responsible for the adaptive immune response in celiac disease (CD). CD is a complex autoimmune chronic disorder triggered by gluten ingestion that affects the small intestine and affects ∼1% of the global population. The 33-mers are polyproline II-rich (PPII) and intrinsically disordered peptides (IDPs), whose structures remain elusive. We sampled the conformational ensembles of both 33-mer peptides via molecular dynamics simulations employing two force fields (FFs) (Amber ff03ws and Amber ff99SB-disp) specifically validated for other IDPs. Our results show that both FFs allow the extensive exploration of the conformational landscape, which was not possible with the standard FF GROMOS53A6 reported before. Clustering analysis of the trajectories showed that the five largest clusters (78-88% of the total structures) present elongated, semielongated, and curved conformations in both FFs. Large average radius of gyration and solvent-exposed surfaces characterized these structures. While the structures sampled are similar, the Amber ff99SB-disp trajectories explored folded conformations with a higher probability. In addition, PPII secondary structure was preserved throughout the trajectories (58-73%) together with a non-negligible content of β structures (11-23%), in agreement with previous experimental results. This work represents the initial step in studying further the interaction of these peptides with other biologically relevant molecules, which could lead to finally disclose the molecular events that lead to CD.
Collapse
Affiliation(s)
- María J Amundarain
- Departamento de Física, Instituto de Física del Sur (IFISUR), Universidad Nacional del Sur (UNS), CONICET, Avenida Leandro N. Alem 1253, B8000CPB Bahía Blanca, Argentina
- Department of Chemistry, Organic Chemistry III, Bielefeld University, Universitätsstraße 25, 33615 Bielefeld, Germany
| | - Agustín Vietri
- Departamento de Física, Instituto de Física del Sur (IFISUR), Universidad Nacional del Sur (UNS), CONICET, Avenida Leandro N. Alem 1253, B8000CPB Bahía Blanca, Argentina
| | - Verónica I Dodero
- Department of Chemistry, Organic Chemistry III, Bielefeld University, Universitätsstraße 25, 33615 Bielefeld, Germany
| | - Marcelo D Costabel
- Departamento de Física, Instituto de Física del Sur (IFISUR), Universidad Nacional del Sur (UNS), CONICET, Avenida Leandro N. Alem 1253, B8000CPB Bahía Blanca, Argentina
| |
Collapse
|
23
|
Li T, Hendrix E, He Y. Simple and Effective Conformational Sampling Strategy for Intrinsically Disordered Proteins Using the UNRES Web Server. J Phys Chem B 2023; 127:2177-2186. [PMID: 36827446 DOI: 10.1021/acs.jpcb.2c08945] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
Intrinsically disordered proteins (IDPs) contain more charged amino acids than folded proteins, resulting in a lack of hydrophobic core(s) and a tendency to adopt rapidly interconverting structures rather than well-defined structures. The structural heterogeneity of IDPs, encoded by the amino acid sequence, is closely related to their unique roles in biological pathways, which require them to interact with different binding partners. Recently, Robustelli and co-workers have demonstrated that a balanced all-atom force field can be used to sample heterogeneous structures of disordered proteins ( Proc. Natl. Acad. Sci. U.S.A. 2018, 115, E4758-E4766). However, such a solution requires extensive computational resources, such as Anton supercomputers. Here, we propose a simple and effective solution to sample the conformational space of IDPs using a publicly available web server, namely, the UNited-RESidue (UNRES) web server. Our proposed solution requires no investment in computational resources and no prior knowledge of UNRES. UNRES Replica Exchange Molecular Dynamics (REMD) simulations were carried out on a set of eight disordered proteins at temperatures spanning from 270 to 430 K. Utilizing the latest UNRES force field designed for structured proteins, with proper selections of temperatures, we were able to produce comparable results to all-atom force fields as reported in work done by Robustelli and co-workers. In addition, NMR observables and the radius of gyration calculated from UNRES ensembles were directly compared with the experimental data to further evaluate the accuracy of the UNRES model at all temperatures. Our results suggest that carrying out the UNRES simulations at optimal temperatures using the UNRES web server can be a good alternative to sample heterogeneous structures of IDPs.
Collapse
Affiliation(s)
- Tongtong Li
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Emily Hendrix
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Yi He
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States.,Translational Informatics Division, Department of Internal Medicine, University of New Mexico, Albuquerque, New Mexico 87131, United States
| |
Collapse
|
24
|
Walczewska-Szewc K, Nowak W. Structural Insights into ATP-Sensitive Potassium Channel Mechanics: A Role of Intrinsically Disordered Regions. J Chem Inf Model 2023; 63:1806-1818. [PMID: 36746748 PMCID: PMC10052335 DOI: 10.1021/acs.jcim.2c01196] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Commonly used techniques, such as CryoEM or X-ray, are not able to capture the structural reorganizations of disordered regions of proteins (IDR); therefore, it is difficult to assess their functions in proteins based exclusively on experiments. To fill this gap, we used computational molecular dynamics (MD) simulation methods to capture IDR dynamics and trace biological function-related interactions in the Kir6.2/SUR1 potassium channel. This ATP-sensitive octameric complex, one of the critical elements in the insulin secretion process in human pancreatic β-cells, has four to five large, disordered fragments. Using unique MD simulations of the full Kir6.2/SUR1 channel complex, we present an in-depth analysis of the dynamics of the disordered regions and discuss the possible functions they could have in this system. Our MD results confirmed the crucial role of the N-terminus of the Kir6.2 fragment and the L0-loop of the SUR1 protein in the transfer of mechanical signals between domains that trigger insulin release. Moreover, we show that the presence of IDRs affects natural ligand binding. Our research takes us one step further toward understanding the action of this vital complex.
Collapse
Affiliation(s)
- Katarzyna Walczewska-Szewc
- Institute of Physics, Faculty of Physics, Astronomy and Informatics, Nicolaus Copernicus University in Toruń, ul. Grudziądzka 5, 87-100 Toruń, Poland
| | - Wiesław Nowak
- Institute of Physics, Faculty of Physics, Astronomy and Informatics, Nicolaus Copernicus University in Toruń, ul. Grudziądzka 5, 87-100 Toruń, Poland
| |
Collapse
|
25
|
Zhao D, Zhao Y, He X, Ayers PW, Liu S. Efficient and accurate density-based prediction of macromolecular polarizabilities. Phys Chem Chem Phys 2023; 25:2131-2141. [PMID: 36562468 DOI: 10.1039/d2cp04690c] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Accurately and efficiently predicting macromolecules' polarizabilities is an open problem. In this work, we employ a few simple density-based quantities from the information-theoretic approach (ITA) to predict polarizability of proteins. We first build quantitative structure/property relationships between molecular polarizabilities and ITA quantities. We then verify the broad applicability of ITA quantities for polarizability prediction for inorganic, organic, and biological systems with both localized and delocalized electronic structure. As a proof-of-concept application, we predict the molecular polarizabilities of complex proteins. Based on the linear regression equations for 20 natural amino acid residues, 400 dipeptides, and 8000 tripeptides, one then predicts the molecular polarizability of a larger peptide or even a protein once the molecular wavefunction is obtained. Because it is extremely costly to determine the wavefunction for a macromolecule like a protein, we propose to combine the ITA with the linear-scaling generalized energy-based fragmentation (GEBF) method to predict the macromolecular polarizability. In GEBF, the total molecular polarizability is obtained as a linear combination of the corresponding quantities from a series of small subsystems. We can predict them based on the subsystem wavefunction and linear regression equations rather than compute them from the nearly-intractable coupled-perturbed Hartree-Fock or Kohn-Sham equations for the whole macromolecule. Computational results showcase that the GEBF-ITA protocol should be an inexpensive yet accurate theoretical tool for predicting macromolecular polarizabilities.
Collapse
Affiliation(s)
- Dongbo Zhao
- Institute of Biomedical Research, Yunnan University, Kunming 650500, Yunnan, P. R. China
| | - Yilin Zhao
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ONL8S 4M1, Canada.
| | - Xin He
- Qingdao Institute for Theoretical and Computational Sciences, Shandong University, Qingdao 266237, Shandong, P. R. China
| | - Paul W Ayers
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ONL8S 4M1, Canada.
| | - Shubin Liu
- Research Computing Center, University of North Carolina, Chapel Hill, North Carolina 27599-3420, USA. .,Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, USA
| |
Collapse
|
26
|
Sun B, Kekenes-Huskey PM. Myofilament-associated proteins with intrinsic disorder (MAPIDs) and their resolution by computational modeling. Q Rev Biophys 2023; 56:e2. [PMID: 36628457 PMCID: PMC11070111 DOI: 10.1017/s003358352300001x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The cardiac sarcomere is a cellular structure in the heart that enables muscle cells to contract. Dozens of proteins belong to the cardiac sarcomere, which work in tandem to generate force and adapt to demands on cardiac output. Intriguingly, the majority of these proteins have significant intrinsic disorder that contributes to their functions, yet the biophysics of these intrinsically disordered regions (IDRs) have been characterized in limited detail. In this review, we first enumerate these myofilament-associated proteins with intrinsic disorder (MAPIDs) and recent biophysical studies to characterize their IDRs. We secondly summarize the biophysics governing IDR properties and the state-of-the-art in computational tools toward MAPID identification and characterization of their conformation ensembles. We conclude with an overview of future computational approaches toward broadening the understanding of intrinsic disorder in the cardiac sarcomere.
Collapse
Affiliation(s)
- Bin Sun
- Research Center for Pharmacoinformatics (The State-Province Key Laboratories of Biomedicine-Pharmaceutics of China), Department of Medicinal Chemistry and Natural Medicine Chemistry, College of Pharmacy, Harbin Medical University, Harbin 150081, China
| | | |
Collapse
|
27
|
Zhu Q, Ge Y, Li W, Ma J. Treating Polarization Effects in Charged and Polar Bio-Molecules Through Variable Electrostatic Parameters. J Chem Theory Comput 2023; 19:396-411. [PMID: 36592097 DOI: 10.1021/acs.jctc.2c01130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Polarization plays important roles in charged and hydrogen bonding containing systems. Much effort ranging from the construction of physics-based models to quantum mechanism (QM)-based and machine learning (ML)-assisted models have been devoted to incorporating the polarization effect into the conventional force fields at different levels, such as atomic and coarse grained (CG). The application of polarizable force fields or polarization models was limited by two aspects, namely, computational cost and transferability. Different from physics-based models, no predetermining parameters were required in the QM-based approaches. Taking advantage of both the accuracy of QM calculations and efficiency of molecular mechanism (MM) and ML, polarization effects could be treated more efficiently while maintaining the QM accuracy. The computational cost could be reduced with variable electrostatic parameters, such as the charge, dipole, and electronic dielectric constant with the help of linear scaling fragmentation-based QM calculations and ML models. Polarization and entropy effects on the prediction of partition coefficient of druglike molecules are demonstrated by using both explicit or implicit all-atom molecular dynamics simulations and machine learning-assisted models. Directions and challenges for future development are also envisioned.
Collapse
Affiliation(s)
- Qiang Zhu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Yang Ge
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Wei Li
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education, Institute of Theoretical and Computational Chemistry, School of Chemistry and Chemical Engineering, Nanjing University, Nanjing210023, P. R. China
| |
Collapse
|
28
|
Lenton S, Fagerberg E, Tully M, Skepö M. From dilute to concentrated solutions of intrinsically disordered proteins: Interpretation and analysis of collected data. Methods Enzymol 2022; 678:299-330. [PMID: 36641212 DOI: 10.1016/bs.mie.2022.09.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Intrinsically disordered proteins (IDPs) have a broad energy landscape and consequently sample many different conformations in solution. The innate flexibility of IDPs is exploited in their biological function, and in many instances allows a single IDP to regulate a range of processes in vivo. Due to their highly flexible nature, characterizing the structural properties of IDPs is not straightforward. Often solution-based methods such as Nuclear Magnetic Resonance (NMR), Förster Resonance Energy Transfer (FRET), and Small-Angle X-ray Scattering (SAXS) are used. SAXS is indeed a powerful technique to study the structural and conformational properties of IDPs in solution, and from the obtained SAXS spectra, information about the average size, shape, and extent of oligomerization can be determined. In this chapter, we will introduce model-free methods that can be used to interpret SAXS data and introduce methods that can be used to interpret SAXS data beyond analytical models, for example, by using atomistic and different levels of coarse-grained models in combination with molecular dynamics (MD) and Monte Carlo simulations.
Collapse
Affiliation(s)
- Samuel Lenton
- Drug Delivery and Biophysics of Biopharmaceuticals, Department of Pharmacy, University of Copenhagen, Copenhagen, Denmark; Division of Theoretical Chemistry, Lund University, Lund, Sweden
| | - Eric Fagerberg
- Division of Theoretical Chemistry, Lund University, Lund, Sweden
| | - Mark Tully
- BioSAXS beamline, BM29, European Synchrotron Radiation Facility, ESRF, Grenoble, France
| | - Marie Skepö
- Division of Theoretical Chemistry, Lund University, Lund, Sweden; LINXS-Institute of Advanced Neutron and X-ray Science, Lund, Sweden.
| |
Collapse
|
29
|
Cui X, Liu H, Chen HF. Polarizable Force Field of Intrinsically Disordered Proteins with CMAP and Reweighting Optimization. J Chem Inf Model 2022; 62:4970-4982. [PMID: 36178373 DOI: 10.1021/acs.jcim.2c00835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Intrinsically disordered proteins (IDPs) are highly structurally heterogeneous without a specific tertiary structure under physiology conditions and play key roles in the development of human diseases. Due to the characteristics of diverse conformations, as one of the important methods, molecular dynamics simulation can complement information for experimental methods. Because of the enrichment for charged amino acids for IDPs, polarizable force fields should be a good choice for the simulation of IDPs. However, current polarizable force fields are limited in sampling conformer features of IDPs. Therefore, a polarizable force field was released and named Drude2019IDP based on Drude2019 with reweighting and grid-based potential energy correction map optimization. In order to evaluate the performance of Drude2019IDP, 16 dipeptides, 18 short peptides, 3 representative IDPs, and 5 structural proteins were simulated. The results show that the NMR observables driven by Drude2019IDP are in better agreement with the experiment data than those by Drude2019 on short peptides and IDPs. Drude2019IDP can sample more diverse conformations than Drude2019. Furthermore, the performances of the two force fields are similar to the sample ordered proteins. These results confirm that the developed Drude2019IDP can improve the reproduction of conformers for intrinsically disordered proteins and can be used to gain insight into the paradigm of sequence-disorder for IDPs.
Collapse
Affiliation(s)
- Xiaochen Cui
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai200240, China
| | - Hao Liu
- Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai200240, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai200240, China.,Shanghai Center for Bioinformation Technology, Shanghai200235, China
| |
Collapse
|
30
|
Roca-Martinez J, Lazar T, Gavalda-Garcia J, Bickel D, Pancsa R, Dixit B, Tzavella K, Ramasamy P, Sanchez-Fornaris M, Grau I, Vranken WF. Challenges in describing the conformation and dynamics of proteins with ambiguous behavior. Front Mol Biosci 2022; 9:959956. [PMID: 35992270 PMCID: PMC9382080 DOI: 10.3389/fmolb.2022.959956] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
Traditionally, our understanding of how proteins operate and how evolution shapes them is based on two main data sources: the overall protein fold and the protein amino acid sequence. However, a significant part of the proteome shows highly dynamic and/or structurally ambiguous behavior, which cannot be correctly represented by the traditional fixed set of static coordinates. Representing such protein behaviors remains challenging and necessarily involves a complex interpretation of conformational states, including probabilistic descriptions. Relating protein dynamics and multiple conformations to their function as well as their physiological context (e.g., post-translational modifications and subcellular localization), therefore, remains elusive for much of the proteome, with studies to investigate the effect of protein dynamics relying heavily on computational models. We here investigate the possibility of delineating three classes of protein conformational behavior: order, disorder, and ambiguity. These definitions are explored based on three different datasets, using interpretable machine learning from a set of features, from AlphaFold2 to sequence-based predictions, to understand the overlap and differences between these datasets. This forms the basis for a discussion on the current limitations in describing the behavior of dynamic and ambiguous proteins.
Collapse
Affiliation(s)
- Joel Roca-Martinez
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Tamas Lazar
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- VIB-VUB Center for Structural Biology, Brussels, Belgium
| | - Jose Gavalda-Garcia
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - David Bickel
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Rita Pancsa
- Research Centre for Natural Sciences, Institute of Enzymology, Budapest, Hungary
| | - Bhawna Dixit
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- IBiTech-Biommeda, Universiteit Gent, Gent, Belgium
| | - Konstantina Tzavella
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Pathmanaban Ramasamy
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- VIB-UGent Center for Medical Biotechnology, Universiteit Gent, Gent, Belgium
| | - Maite Sanchez-Fornaris
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- Department of Computer Sciences, University of Camagüey, Camagüey, Cuba
| | - Isel Grau
- Information Systems, Eindhoven University of Technology, Eindhoven, Netherlands
| | - Wim F. Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- *Correspondence: Wim F. Vranken,
| |
Collapse
|
31
|
Fadda E. Molecular simulations of complex carbohydrates and glycoconjugates. Curr Opin Chem Biol 2022; 69:102175. [PMID: 35728307 DOI: 10.1016/j.cbpa.2022.102175] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Revised: 05/17/2022] [Accepted: 05/19/2022] [Indexed: 11/17/2022]
Abstract
Complex carbohydrates (glycans) are the most abundant and versatile biopolymers in nature. The broad diversity of biochemical functions that carbohydrates cover is a direct consequence of the variety of 3D architectures they can adopt, displaying branched or linear arrangements, widely ranging in sizes, and with the highest diversity of building blocks of any other natural biopolymer. Despite this unparalleled complexity, a common denominator can be found in the glycans' inherent flexibility, which hinders experimental characterization, but that can be addressed by high-performance computing (HPC)-based molecular simulations. In this short review, I present and discuss the state-of-the-art of molecular simulations of complex carbohydrates and glycoconjugates, highlighting methodological strengths and weaknesses, important insights through emblematic case studies, and suggesting perspectives for future developments.
Collapse
Affiliation(s)
- Elisa Fadda
- Department of Chemistry and Hamilton Institute, Maynooth University, Ireland.
| |
Collapse
|
32
|
Mendoza-Martinez C, Papadourakis M, Llabrés S, Gupta AA, Barlow PN, Michel J. Energetics of a protein disorder-order transition in small molecule recognition. Chem Sci 2022; 13:5220-5229. [PMID: 35655546 PMCID: PMC9093188 DOI: 10.1039/d2sc00028h] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 04/01/2022] [Indexed: 12/17/2022] Open
Abstract
Many proteins recognise other proteins via mechanisms that involve the folding of intrinsically disordered regions upon complex formation. Here we investigate how the selectivity of a drug-like small molecule arises from its modulation of a protein disorder-to-order transition. Binding of the compound AM-7209 has been reported to confer order upon an intrinsically disordered ‘lid’ region of the oncoprotein MDM2. Calorimetric measurements revealed that truncation of the lid region of MDM2 increases the apparent dissociation constant of AM-7209 250-fold. By contrast, lid truncation has little effect on the binding of the ligand Nutlin-3a. Insights into these differential binding energetics were obtained via a complete thermodynamic analysis that featured adaptive absolute alchemical free energy of binding calculations with enhanced-sampling molecular dynamics simulations. The simulations reveal that in apo MDM2 the ordered lid state is energetically disfavoured. AM-7209, but not Nutlin-3a, shows a significant energetic preference for ordered lid conformations, thus shifting the balance towards ordering of the lid in the AM-7209/MDM2 complex. The methodology reported herein should facilitate broader targeting of intrinsically disordered regions in medicinal chemistry. Molecular simulations and biophysical measurements elucidate why the ligand AM-7209 orders a disordered region of the protein MDM2 on binding. This work expands strategies available to medicinal chemists for targeting disordered proteins.![]()
Collapse
Affiliation(s)
- Cesar Mendoza-Martinez
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Michail Papadourakis
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Salomé Llabrés
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Arun A Gupta
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Paul N Barlow
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| | - Julien Michel
- EaStCHEM School of Chemistry, University of Edinburgh David Brewster Road Edinburgh EH9 3FJ UK
| |
Collapse
|
33
|
Duan C, Jiang Q, Jiang X, Zeng H, Wu Q, Yu Y, Yang X. Discovery of a Novel Inhibitor Structure of Mycobacterium tuberculosis Isocitrate Lyase. Molecules 2022; 27:molecules27082447. [PMID: 35458645 PMCID: PMC9026967 DOI: 10.3390/molecules27082447] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 03/30/2022] [Accepted: 03/31/2022] [Indexed: 02/06/2023] Open
Abstract
Tuberculosis remains a global threat to public health, and dormant Mycobacterium tuberculosis leads to long-term medication that is harmful to the human body. M. tuberculosis isocitrate lyase (MtICL), which is absent in host cells, is a key rate-limiting enzyme of the glyoxylic acid cycle and is essential for the survival of dormant M. tuberculosis. The aim of this study was to evaluate natural compounds as potential MtICL inhibitors through docking and experimental verification. Screening of the TCMSP database library was done using Discovery Studio 2019 for molecular docking and interaction analysis, with the putative inhibitors of MtICL, 3-BP, and IA as reference ligands. Daphnetin (MOL005118), with a docking score of 94.8 and -CDOCKER interaction energy of 56 kcal/mol, was selected and verified on MtICL in vitro and M. smegmatis; daphnetin gave an IC50 of 4.34 μg/mL for the MtICL enzyme and an MIC value of 128 μg/mL against M. smegmatis, showing enhanced potential in comparison with 3-BP and IA. The interactions and essential amino acid residues of the protein were analyzed. In summary, natural daphnetin may be a promising new skeleton for the design of inhibitors of MtICL to combat dormant M. tuberculosis.
Collapse
Affiliation(s)
- Changyuan Duan
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
| | - Qihua Jiang
- College of Pharmacy, Chongqing Medical University, Chongqing 400016, China;
| | - Xue Jiang
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
| | - Hongwei Zeng
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
| | - Qiaomin Wu
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
| | - Yang Yu
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
| | - Xiaolan Yang
- Key Laboratory of Medical Laboratory Diagnostics of the Education Ministry, College of Laboratory Medicine, Chongqing Medical University, No. 1, Yixueyuan Road, Yuzhong Dist, Chongqing 400016, China; (C.D.); (X.J.); (H.Z.); (Q.W.); (Y.Y.)
- Correspondence: ; Tel.: +86-23-6848-5240
| |
Collapse
|
34
|
Kulkarni P, Bhattacharya S, Achuthan S, Behal A, Jolly MK, Kotnala S, Mohanty A, Rangarajan G, Salgia R, Uversky V. Intrinsically Disordered Proteins: Critical Components of the Wetware. Chem Rev 2022; 122:6614-6633. [PMID: 35170314 PMCID: PMC9250291 DOI: 10.1021/acs.chemrev.1c00848] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Despite the wealth of knowledge gained about intrinsically disordered proteins (IDPs) since their discovery, there are several aspects that remain unexplored and, hence, poorly understood. A living cell is a complex adaptive system that can be described as a wetware─a metaphor used to describe the cell as a computer comprising both hardware and software and attuned to logic gates─capable of "making" decisions. In this focused Review, we discuss how IDPs, as critical components of the wetware, influence cell-fate decisions by wiring protein interaction networks to keep them minimally frustrated. Because IDPs lie between order and chaos, we explore the possibility that they can be modeled as attractors. Further, we discuss how the conformational dynamics of IDPs manifests itself as conformational noise, which can potentially amplify transcriptional noise to stochastically switch cellular phenotypes. Finally, we explore the potential role of IDPs in prebiotic evolution, in forming proteinaceous membrane-less organelles, in the origin of multicellularity, and in protein conformation-based transgenerational inheritance of acquired characteristics. Together, these ideas provide a new conceptual framework to discern how IDPs may perform critical biological functions despite their lack of structure.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
- Address for correspondence: Prakash Kulkarni, Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010, , Vladimir N. Uversky, Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida 33612,
| | - Supriyo Bhattacharya
- Integrative Genomics Core, City of Hope National Medical Center, Duarte, CA, USA
| | - Srisairam Achuthan
- Division of Research Informatics, Center for Informatics, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Amita Behal
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Sourabh Kotnala
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
- Center for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA, USA
| | - Vladimir Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Institutskiy pereulok, 9, Dolgoprudny, Moscow region 141700, Russia
- Address for correspondence: Prakash Kulkarni, Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010, , Vladimir N. Uversky, Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida 33612,
| |
Collapse
|
35
|
Kulkarni P, Leite VBP, Roy S, Bhattacharyya S, Mohanty A, Achuthan S, Singh D, Appadurai R, Rangarajan G, Weninger K, Orban J, Srivastava A, Jolly MK, Onuchic JN, Uversky VN, Salgia R. Intrinsically disordered proteins: Ensembles at the limits of Anfinsen's dogma. BIOPHYSICS REVIEWS 2022; 3:011306. [PMID: 38505224 PMCID: PMC10903413 DOI: 10.1063/5.0080512] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 02/17/2022] [Indexed: 03/21/2024]
Abstract
Intrinsically disordered proteins (IDPs) are proteins that lack rigid 3D structure. Hence, they are often misconceived to present a challenge to Anfinsen's dogma. However, IDPs exist as ensembles that sample a quasi-continuum of rapidly interconverting conformations and, as such, may represent proteins at the extreme limit of the Anfinsen postulate. IDPs play important biological roles and are key components of the cellular protein interaction network (PIN). Many IDPs can interconvert between disordered and ordered states as they bind to appropriate partners. Conformational dynamics of IDPs contribute to conformational noise in the cell. Thus, the dysregulation of IDPs contributes to increased noise and "promiscuous" interactions. This leads to PIN rewiring to output an appropriate response underscoring the critical role of IDPs in cellular decision making. Nonetheless, IDPs are not easily tractable experimentally. Furthermore, in the absence of a reference conformation, discerning the energy landscape representation of the weakly funneled IDPs in terms of reaction coordinates is challenging. To understand conformational dynamics in real time and decipher how IDPs recognize multiple binding partners with high specificity, several sophisticated knowledge-based and physics-based in silico sampling techniques have been developed. Here, using specific examples, we highlight recent advances in energy landscape visualization and molecular dynamics simulations to discern conformational dynamics and discuss how the conformational preferences of IDPs modulate their function, especially in phenotypic switching. Finally, we discuss recent progress in identifying small molecules targeting IDPs underscoring the potential therapeutic value of IDPs. Understanding structure and function of IDPs can not only provide new insight on cellular decision making but may also help to refine and extend Anfinsen's structure/function paradigm.
Collapse
Affiliation(s)
- Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Vitor B. P. Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista (UNESP), São José do Rio Preto, São Paulo 15054-000, Brazil
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal 741246, India
| | - Supriyo Bhattacharyya
- Translational Bioinformatics, Center for Informatics, Department of Computational and Quantitative Medicine, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Atish Mohanty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Srisairam Achuthan
- Center for Informatics, Division of Research Informatics, City of Hope National Medical Center, Duarte, California 91010, USA
| | - Divyoj Singh
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Rajeswari Appadurai
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, North Carolina 27695, USA
| | | | - Anand Srivastava
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Mohit Kumar Jolly
- Center for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - Jose N. Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, USA
| | | | - Ravi Salgia
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, USA
| |
Collapse
|
36
|
Latham AP, Zhang B. Unifying coarse-grained force fields for folded and disordered proteins. Curr Opin Struct Biol 2022; 72:63-70. [PMID: 34536913 PMCID: PMC9057422 DOI: 10.1016/j.sbi.2021.08.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 08/08/2021] [Accepted: 08/17/2021] [Indexed: 12/22/2022]
Abstract
Liquid-liquid phase separation drives the formation of biological condensates that play essential roles in transcriptional regulation and signal sensing. Computational modeling could provide high-resolution structural characterizations of these condensates and help uncover physicochemical interactions that dictate their stability. However, many protein molecules involved in phase separation often contain multiple ordered domains connected with flexible, structureless linkers. Simulating such proteins necessitates force fields with consistent accuracy for both folded and disordered proteins. We provide a critical review of existing coarse-grained force fields for disordered proteins and highlight the challenges in their application to folded proteins. After discussing existing algorithms for force field parameterization, we propose an optimization strategy that should lead to computer models with improved transferability across protein types.
Collapse
Affiliation(s)
- Andrew P Latham
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
37
|
Chen J, Liu H, Cui X, Li Z, Chen HF. RNA-Specific Force Field Optimization with CMAP and Reweighting. J Chem Inf Model 2022; 62:372-385. [PMID: 35021622 DOI: 10.1021/acs.jcim.1c01148] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
RNA plays a key role in a variety of cell activities. However, it is difficult to capture its structure dynamics by the traditional experimental methods because of the inherent limitations. Molecular dynamics simulation has become a valuable complement to the experimental methods. Previous studies have indicated that the current force fields cannot accurately reproduce the conformations and structural dynamics of RNA. Therefore, an RNA-specific force field was developed to improve the conformation sampling of RNA. The distribution of ζ/α dihedrals of tetranucleotides was optimized by a reweighting method, and the grid-based energy correction map (CMAP) term was first introduced into the Amber RNA force field of ff99bsc0χOL3, named ff99OL3_CMAP1. Extensive validations of tetranucleotides and tetraloops show that ff99OL3_CMAP1 can significantly decrease the population of an incorrect structure, increase the consistency between the simulation results and experimental values for tetranucleotides, and improve the stability of tetraloops. ff99OL3_CMAP1 can also precisely reproduce the conformation of a duplex and riboswitches. These findings confirm that the newly developed force field ff99OL3_CMAP1 can improve the conformer sampling of RNA.
Collapse
Affiliation(s)
- Jun Chen
- State Key Laboratory of Microbial Metabolism, Department of Bioinformatics and Biostatistics, SJTU-Yale Joint Center for Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 20024 Shanghai, China
| | - Hao Liu
- Institute of Natural Sciences, Shanghai Jiao Tong University, 200240 Shanghai, China
| | - Xiaochen Cui
- State Key Laboratory of Microbial Metabolism, Department of Bioinformatics and Biostatistics, SJTU-Yale Joint Center for Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 20024 Shanghai, China
| | - Zhengxin Li
- State Key Laboratory of Microbial Metabolism, Department of Bioinformatics and Biostatistics, SJTU-Yale Joint Center for Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 20024 Shanghai, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Department of Bioinformatics and Biostatistics, SJTU-Yale Joint Center for Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 20024 Shanghai, China.,Shanghai Center for Bioinformation Technology, 200240 Shanghai, China
| |
Collapse
|
38
|
Extended ensemble simulations of a SARS-CoV-2 nsp1-5'-UTR complex. PLoS Comput Biol 2022; 18:e1009804. [PMID: 35045069 PMCID: PMC8803185 DOI: 10.1371/journal.pcbi.1009804] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 01/31/2022] [Accepted: 01/04/2022] [Indexed: 11/19/2022] Open
Abstract
Nonstructural protein 1 (nsp1) of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a 180-residue protein that blocks translation of host mRNAs in SARS-CoV-2-infected cells. Although it is known that SARS-CoV-2’s own RNA evades nsp1’s host translation shutoff, the molecular mechanism underlying the evasion was poorly understood. We performed an extended ensemble molecular dynamics simulation to investigate the mechanism of the viral RNA evasion. Simulation results suggested that the stem loop structure of the SARS-CoV-2 RNA 5’-untranslated region (SL1) binds to both nsp1’s N-terminal globular region and intrinsically disordered region. The consistency of the results was assessed by modeling nsp1-40S ribosome structure based on reported nsp1 experiments, including the X-ray crystallographic structure analysis, the cryo-EM electron density map, and cross-linking experiments. The SL1 binding region predicted from the simulation was open to the solvent, yet the ribosome could interact with SL1. Cluster analysis of the binding mode and detailed analysis of the binding poses suggest residues Arg124, Lys47, Arg43, and Asn126 may be involved in the SL1 recognition mechanism, consistent with the existing mutational analysis. The pandemic of COVID-19 is still rampant all over the world as of 2021 June. SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2), the causative pathogen of COVID-19, encodes a protein called nsp1 (nonstructural protein 1), which modulates and hijacks the ribosome of the infected host cells. With nsp1, infected human cells selectively translate SARS-CoV-2’s RNA, which increases the virus reproduction efficiency while evading the host immunity. Though it has been known that nsp1 recognizes characteristic stem-loop structure at 5’-end of SARS-CoV-2’s RNA (called SL1), the molecular mechanism underlying the recognition has been poorly understood. We investigated the mechanism of selective translation using the all-atom molecular dynamics simulation of nsp1-SL1 complex. Our simulation results suggest that the binding between nsp1 and SL1 is multi-modal. The results also imply that both the N-terminal globular part and the C-terminal flexible tail of nsp1 are involved in the binding. The residues involved in nsp1-SL1 binding coincides with the known mutant analyses of SARS-CoV-1 and SARS-CoV-2, as well as experimental evidence about nsp1-ribosome interactions.
Collapse
|
39
|
Mu J, Pan Z, Chen HF. Balanced Solvent Model for Intrinsically Disordered and Ordered Proteins. J Chem Inf Model 2021; 61:5141-5151. [PMID: 34546059 DOI: 10.1021/acs.jcim.1c00407] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Intrinsically disordered proteins (IDPs) have no fixed three-dimensional (3D) structures under physiological conditions, with the content being about 51% in human proteomics. IDPs are associated with many human diseases, such as cancer, diabetes, and neurodegenerative diseases. Because IDPs do not crystallize and have diverse conformers, traditional experimental methods such as crystallization and NMR can hardly capture their conformation ensemble and just provide average structural characters of IDPs. Therefore, molecular dynamics (MD) simulations become a valuable complement to the experimental data. However, the accuracy of molecular dynamics simulation for IDPs depends on the combination of force fields and solvent models. Recently, we released an environment-specific force field (ESFF1) for IDPs, which can well reproduce the local structural properties (such as J-coupling and secondary chemical shifts). However, there is still a large deviation for the radius of gyration (Rg). Therefore, a solvent model combined with ESFF1 is necessary to capture the local and global characters for IDPs and ordered proteins. Here, we investigated the underestimation or overestimation of the solvent interaction for four solvent models (TIP3P, TIP4P-Ew, TIP4P-D, OPC) under ESFF1 and found the important ε parameter of the solvent model to play a key role in scaling Rg. A near-linear relationship between the simulation Rg and the ε parameter was used to develop the new solvent model, named TIP4P-B. The results indicate that the simulated Rg with TIP4P-B is in better agreement with the experimental observations than the other four solvent models. Simultaneously, TIP4P-B can also maintain the advantages of the ESFF1 force field for the local structural properties. Additionally, TIP4P-B can successfully sample the conformation of ordered proteins. These findings confirm that TIP4P-B is a balanced solvent model and can improve sampling Rg performance for folded proteins and IDPs.
Collapse
Affiliation(s)
- Junxi Mu
- State Key Laboratory of Microbial metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Zhengsong Pan
- State Key Laboratory of Microbial metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.,MD Program, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.,Shanghai Center for Bioinformation Technology, Shanghai 200235, China
| |
Collapse
|
40
|
Sacquin-Mora S, Prévost C. When Order Meets Disorder: Modeling and Function of the Protein Interface in Fuzzy Complexes. Biomolecules 2021; 11:1529. [PMID: 34680162 PMCID: PMC8533853 DOI: 10.3390/biom11101529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/11/2021] [Accepted: 10/12/2021] [Indexed: 11/30/2022] Open
Abstract
The degree of proteins structural organization ranges from highly structured, compact folding to intrinsic disorder, where each degree of self-organization corresponds to specific functions: well-organized structural motifs in enzymes offer a proper environment for precisely positioned functional groups to participate in catalytic reactions; at the other end of the self-organization spectrum, intrinsically disordered proteins act as binding hubs via the formation of multiple, transient and often non-specific interactions. This review focusses on cases where structurally organized proteins or domains associate with highly disordered protein chains, leading to the formation of interfaces with varying degrees of fuzziness. We present a review of the computational methods developed to provide us with information on such fuzzy interfaces, and how they integrate experimental information. The discussion focusses on two specific cases, microtubules and homologous recombination nucleoprotein filaments, where a network of intrinsically disordered tails exerts regulatory function in recruiting partner macromolecules, proteins or DNA and tuning the atomic level association. Notably, we show how computational approaches such as molecular dynamics simulations can bring new knowledge to help bridging the gap between experimental analysis, that mostly concerns ensemble properties, and the behavior of individual disordered protein chains that contribute to regulation functions.
Collapse
Affiliation(s)
- Sophie Sacquin-Mora
- CNRS, Laboratoire de Biochimie Théorique, UPR9080, Université de Paris, 13 Rue Pierre et Marie Curie, 75005 Paris, France;
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, PSL Research University, 75006 Paris, France
| | - Chantal Prévost
- CNRS, Laboratoire de Biochimie Théorique, UPR9080, Université de Paris, 13 Rue Pierre et Marie Curie, 75005 Paris, France;
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, PSL Research University, 75006 Paris, France
| |
Collapse
|
41
|
Boopathi S, Poma AB, Garduño-Juárez R. An Overview of Several Inhibitors for Alzheimer's Disease: Characterization and Failure. Int J Mol Sci 2021; 22:10798. [PMID: 34639140 PMCID: PMC8509255 DOI: 10.3390/ijms221910798] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2021] [Revised: 10/01/2021] [Accepted: 10/03/2021] [Indexed: 01/04/2023] Open
Abstract
Amyloid beta (Aβ) oligomers are the most neurotoxic aggregates causing neuronal death and cognitive damage. A detailed elucidation of the aggregation pathways from oligomers to fibril formation is crucial to develop therapeutic strategies for Alzheimer's disease (AD). Although experimental techniques rely on the measure of time- and space-average properties, they face severe difficulties in the investigation of Aβ peptide aggregation due to their intrinsically disorder character. Computer simulation is a tool that allows tracing the molecular motion of molecules; hence it complements Aβ experiments, as it allows to explore the binding mechanism between metal ions and Aβ oligomers close to the cellular membrane at the atomic resolution. In this context, integrated studies of experiments and computer simulations can assist in mapping the complete pathways of aggregation and toxicity of Aβ peptides. Aβ oligomers are disordered proteins, and due to a rapid exploration of their intrinsic conformational space in real-time, they are challenging therapeutic targets. Therefore, no good drug candidate could have been identified for clinical use. Our previous investigations identified two small molecules, M30 (2-Octahydroisoquinolin-2(1H)-ylethanamine) and Gabapentin, capable of Aβ binding and inhibiting molecular aggregation, synaptotoxicity, intracellular calcium signaling, cellular toxicity and memory losses induced by Aβ. Thus, we recommend these molecules as novel candidates to assist anti-AD drug discovery in the near future. This review discusses the most recent research investigations about the Aβ dynamics in water, close contact with cell membranes, and several therapeutic strategies to remove plaque formation.
Collapse
Affiliation(s)
- Subramanian Boopathi
- Instituto de Ciencias Físicas, Universidad Nacional Autónoma de México, Cuernavaca 62210, Mexico;
| | - Adolfo B. Poma
- Department of Biosystems and Soft Matter, Institute of Fundamental Technological Research Polish Academy of Science, Pawińskiego 5B, 02-106 Warsaw, Poland
- International Center for Research on Innovative Biobased Materials (ICRI-BioM)—International Research Agenda, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland;
| | - Ramón Garduño-Juárez
- Instituto de Ciencias Físicas, Universidad Nacional Autónoma de México, Cuernavaca 62210, Mexico;
| |
Collapse
|
42
|
Gong X, Zhang Y, Chen J. Advanced Sampling Methods for Multiscale Simulation of Disordered Proteins and Dynamic Interactions. Biomolecules 2021; 11:1416. [PMID: 34680048 PMCID: PMC8533332 DOI: 10.3390/biom11101416] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 09/22/2021] [Accepted: 09/24/2021] [Indexed: 11/16/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are highly prevalent and play important roles in biology and human diseases. It is now also recognized that many IDPs remain dynamic even in specific complexes and functional assemblies. Computer simulations are essential for deriving a molecular description of the disordered protein ensembles and dynamic interactions for a mechanistic understanding of IDPs in biology, diseases, and therapeutics. Here, we provide an in-depth review of recent advances in the multi-scale simulation of disordered protein states, with a particular emphasis on the development and application of advanced sampling techniques for studying IDPs. These techniques are critical for adequate sampling of the manifold functionally relevant conformational spaces of IDPs. Together with dramatically improved protein force fields, these advanced simulation approaches have achieved substantial success and demonstrated significant promise towards the quantitative and predictive modeling of IDPs and their dynamic interactions. We will also discuss important challenges remaining in the atomistic simulation of larger systems and how various coarse-grained approaches may help to bridge the remaining gaps in the accessible time- and length-scales of IDP simulations.
Collapse
Affiliation(s)
- Xiping Gong
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
| | - Yumeng Zhang
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts Amherst, Amherst, MA 01003, USA; (X.G.); (Y.Z.)
- Department of Biochemistry and Molecular Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA
| |
Collapse
|
43
|
Lindorff-Larsen K, Kragelund BB. On the potential of machine learning to examine the relationship between sequence, structure, dynamics and function of intrinsically disordered proteins. J Mol Biol 2021; 433:167196. [PMID: 34390736 DOI: 10.1016/j.jmb.2021.167196] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 08/03/2021] [Accepted: 08/04/2021] [Indexed: 11/29/2022]
Abstract
Intrinsically disordered proteins (IDPs) constitute a broad set of proteins with few uniting and many diverging properties. IDPs-and intrinsically disordered regions (IDRs) interspersed between folded domains-are generally characterized as having no persistent tertiary structure; instead they interconvert between a large number of different and often expanded structures. IDPs and IDRs are involved in an enormously wide range of biological functions and reveal novel mechanisms of interactions, and while they defy the common structure-function paradigm of folded proteins, their structural preferences and dynamics are important for their function. We here discuss open questions in the field of IDPs and IDRs, focusing on areas where machine learning and other computational methods play a role. We discuss computational methods aimed to predict transiently formed local and long-range structure, including methods for integrative structural biology. We discuss the many different ways in which IDPs and IDRs can bind to other molecules, both via short linear motifs, as well as in the formation of larger dynamic complexes such as biomolecular condensates. We discuss how experiments are providing insight into such complexes and may enable more accurate predictions. Finally, we discuss the role of IDPs in disease and how new methods are needed to interpret the mechanistic effects of genomic variants in IDPs.
Collapse
Affiliation(s)
- Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen. Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark.
| | - Birthe B Kragelund
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen. Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark.
| |
Collapse
|
44
|
Zhang Y, Jiang Y, Peng J, Zhang H. Rational Design of Nonbonded Point Charge Models for Divalent Metal Cations with Lennard-Jones 12-6 Potential. J Chem Inf Model 2021; 61:4031-4044. [PMID: 34313132 DOI: 10.1021/acs.jcim.1c00580] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Exploring a metal-involved biochemical process at a molecular level often requires a reliable description of metal properties in aqueous solution by classical nonbonded models. An additional C4 term for considering ion-induced dipole interactions was previously proposed to supplement the widely used Lennard-Jones 12-6 potential (known as the 12-6-4 LJ-type model) with good accuracy. Here, we demonstrate an alternative to modeling divalent metal cations (M2+) with the traditional 12-6 LJ potential by developing nonbonded point charge models for use with 11 water models: TIP3P, SPC/E, SPC/Eb, TIP4P-Ew, TIP4P-D, and TIP4P/2005 and the more recent OPC3, TIP3P-FB, OPC, TIP4P-FB, and a99SB-disp. Our designed models simultaneously reproduce the experimental hydration free energy, ion-oxygen distance, and coordination number in the first hydration shell accurately for most of the metal cations, an accuracy equivalent to that of the complex 12-6-4 LJ-type and double exponential potential models. A systematic comparison with the existing M2+ models is presented as well in terms of effective ion radii, diffusion constants, water exchange rates, and ion-water interactions. Molecular dynamics simulations of metal substitution in Escherichia coli glyoxalase I variants show the great potential of our new models for metalloproteins.
Collapse
Affiliation(s)
- Yongguang Zhang
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Yang Jiang
- Department of Chemistry, Pennsylvania State University, University Park 16802, Pennsylvania, United States
| | - Jiarong Peng
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Haiyang Zhang
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, Beijing 100083, China
| |
Collapse
|
45
|
Qiu Y, Shan W, Zhang H. Force Field Benchmark of Amino Acids. 3. Hydration with Scaled Lennard-Jones Interactions. J Chem Inf Model 2021; 61:3571-3582. [PMID: 34185520 DOI: 10.1021/acs.jcim.1c00339] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Classical protein force fields were reported with too weak protein-water interactions relative to protein-protein interactions, leading to more compact structures and artificial protein aggregation. Here we investigated the impacts of scaled Lennard-Jones (LJ) interactions on the hydration of amino acids and the simulation of folded and intrinsically disordered proteins (IDPs). The obtained optimal scaling parameters reproduce accurately hydration free energies of neutral amino acid side chain analogues and do not affect the compactness and structural stability of folded proteins significantly. The scaling leads to less compact IDPs and varies from case to case. Strengthening the interactions between protein and water oxygen or hydrogen atoms by increasing the interacting LJ well depth (ε) appears more effective than weakening protein-protein interactions by reducing the interacting dispersion coefficients (C6). We demonstrate that weakening water-water interactions is a solution as well to obtaining more favorable protein-water interactions in an indirect way, although modern force fields like Amber ff19SB and a99SB-disp tend to use water models with strong water-water interactions. This is likely a compromise between strong protein-protein interactions and strong water-water interactions. Independent optimization of protein force fields and water models is therefore needed to make both interactions more close to reality, leading to good accuracy without bias or scaling.
Collapse
Affiliation(s)
- Yejie Qiu
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, 100083 Beijing, China
| | - Wenjie Shan
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, 100083 Beijing, China
| | - Haiyang Zhang
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering, University of Science and Technology Beijing, 100083 Beijing, China
| |
Collapse
|
46
|
Spoel D, Zhang J, Zhang H. Quantitative predictions from molecular simulations using explicit or implicit interactions. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1560] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- David Spoel
- Uppsala Center for Computational Chemistry, Science for Life Laboratory, Department of Cell and Molecular Biology Uppsala University Uppsala Sweden
| | - Jin Zhang
- Department of Chemistry Southern University of Science and Technology Shenzhen China
| | - Haiyang Zhang
- Department of Biological Science and Engineering, School of Chemistry and Biological Engineering University of Science and Technology Beijing Beijing China
| |
Collapse
|
47
|
Cui X, Liu H, Rehman AU, Chen HF. Extensive evaluation of environment-specific force field for ordered and disordered proteins. Phys Chem Chem Phys 2021; 23:12127-12136. [PMID: 34032235 DOI: 10.1039/d1cp01385h] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Intrinsically disordered proteins (IDPs) have no fixed tertiary structure under physiological conditions and are associated with many human diseases. Because IDPs have the characteristic of possessing diverse conformations, current experimental methods cannot capture all the conformations of IDPs. However, molecular dynamics simulation can sample these atomistically diverse conformations as a valuable complement to experimental data. To accurately describe the properties of IDPs, the environment-specific precise force field (ESFF1) was successfully released to reproduce the conformer character of ordered and disordered proteins. Here, three typical IDPs and thirteen folded proteins were used to further evaluate the performance of this force field. The results indicate that the NMR observables of ESFF1 better approach experimental data than do those of ff14SB for IDPs. The sampling conformations by ESFF1 are more diverse than those of ff14SB. For folded proteins, these force fields have comparable performances for reproducing conformers. Therefore, ESFF1 can be used to reveal the model of sequence-disorder-function for IDPs.
Collapse
Affiliation(s)
- Xiaochen Cui
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Hao Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Ashfaq Ur Rehman
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China. and Shanghai Center for Bioinformation Technology, Shanghai, 200235, China
| |
Collapse
|
48
|
Assessment of transferable forcefields for protein simulations attests improved description of disordered states and secondary structure propensities, and hints at multi-protein systems as the next challenge for optimization. Comput Struct Biotechnol J 2021; 19:2626-2636. [PMID: 34025949 PMCID: PMC8120800 DOI: 10.1016/j.csbj.2021.04.050] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 04/20/2021] [Accepted: 04/22/2021] [Indexed: 01/02/2023] Open
Abstract
Continuous assessment of transferable forcefields for molecular simulations is essential to identify their weaknesses and direct improvement efforts. The latest efforts focused on better describing disordered proteins while retaining proper description of folded domains, important because forcefields of the previous generations produce overly compact disordered states. Such improvements should additionally alleviate the related problem of over-stabilized protein–protein interactions, which has been largely overlooked. Here we evaluated three state-of-the-art forcefields, current flagships of their respective developers, optimized for ordered and disordered proteins: CHARMM36m with its recommended corrected TIP3P* water, ff19SB with the recommended OPC water, and the 2019 a99SBdisp forcefield by D. E. Shaw Research with its modified TIP4P water; plus ff14SB with TIP3P as an example of the former generation of forcefields. Our evaluation entailed simulations of (i) multiple copies of a protein that is highly soluble yet undergoes weak dimerization, (ii) a disordered peptide with low, well-characterized alpha helical propensity, and (iii) a peptide known to form insoluble β-aggregates. Our results recapitulate ff14SB-TIP3P over-stabilizing aggregates and secondary structures and place a99SBdisp-TIP4PD at the other end i.e. predicting overly weak intermolecular interactions despite reasonably predicting secondary structure propensities. In-between, CHARMM36m-TIP3P* still over-stabilizes aggregates but predicts residue-wise alpha helical propensities in solution slightly better than ff19SB-OPC, while ff19SB-OPC poses the best prediction of weak dimerization of the soluble protein still predicting aggregation of the β-peptides. This independent assessment shows that the claimed forcefield improvements are real, but also that a right balance between noncovalent attraction and repulsion has not yet been reached. We thus propose developers to consider systems like those tested here in their forcefield tuning protocols. Last, the good performance of CHARMM36m-TIP3P* further shows that tuning 3-point water models might still be an alternative to the more costly 4-point models like OPC and TIP4PD.
Collapse
|
49
|
Fatafta H, Samantray S, Sayyed-Ahmad A, Coskuner-Weber O, Strodel B. Molecular simulations of IDPs: From ensemble generation to IDP interactions leading to disorder-to-order transitions. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2021; 183:135-185. [PMID: 34656328 DOI: 10.1016/bs.pmbts.2021.06.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/26/2023]
Abstract
Intrinsically disordered proteins (IDPs) lack a well-defined three-dimensional structure but do exhibit some dynamical and structural ordering. The structural plasticity of IDPs indicates that entropy-driven motions are crucial for their function. Many IDPs undergo function-related disorder-to-order transitions upon by their interaction with specific binding partners. Approaches that are based on both experimental and theoretical tools enable the biophysical characterization of IDPs. Molecular simulations provide insights into IDP structural ensembles and disorder-to-order transition mechanisms. However, such studies depend strongly on the chosen force field parameters and simulation techniques. In this chapter, we provide an overview of IDP characteristics, review all-atom force fields recently developed for IDPs, and present molecular dynamics-based simulation methods that allow IDP ensemble generation as well as the characterization of disorder-to-order transitions. In particular, we introduce metadynamics, replica exchange molecular dynamics simulations, and also kinetic models resulting from Markov State modeling, and provide various examples for the successful application of these simulation methods to IDPs.
Collapse
Affiliation(s)
- Hebah Fatafta
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany
| | - Suman Samantray
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany; AICES Graduate School, RWTH Aachen University, Aachen, Germany
| | | | - Orkid Coskuner-Weber
- Molecular Biotechnology, Turkish-German University, Sahinkaya Caddesi, Istanbul, Turkey
| | - Birgit Strodel
- Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich, Jülich, Germany; Institute of Theoretical and Computational Chemistry, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
| |
Collapse
|