1
|
Tyler S, Laforge C, Guzzo A, Nicolaï A, Maisuradze GG, Senet P. Einstein Model of a Graph to Characterize Protein Folded/Unfolded States. Molecules 2023; 28:6659. [PMID: 37764437 PMCID: PMC10536427 DOI: 10.3390/molecules28186659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/11/2023] [Accepted: 09/14/2023] [Indexed: 09/29/2023] Open
Abstract
The folded structures of proteins can be accurately predicted by deep learning algorithms from their amino-acid sequences. By contrast, in spite of decades of research studies, the prediction of folding pathways and the unfolded and misfolded states of proteins, which are intimately related to diseases, remains challenging. A two-state (folded/unfolded) description of protein folding dynamics hides the complexity of the unfolded and misfolded microstates. Here, we focus on the development of simplified order parameters to decipher the complexity of disordered protein structures. First, we show that any connected, undirected, and simple graph can be associated with a linear chain of atoms in thermal equilibrium. This analogy provides an interpretation of the usual topological descriptors of a graph, namely the Kirchhoff index and Randić resistance, in terms of effective force constants of a linear chain. We derive an exact relation between the Kirchhoff index and the average shortest path length for a linear graph and define the free energies of a graph using an Einstein model. Second, we represent the three-dimensional protein structures by connected, undirected, and simple graphs. As a proof of concept, we compute the topological descriptors and the graph free energies for an all-atom molecular dynamics trajectory of folding/unfolding events of the proteins Trp-cage and HP-36 and for the ensemble of experimental NMR models of Trp-cage. The present work shows that the local, nonlocal, and global force constants and free energies of a graph are promising tools to quantify unfolded/disordered protein states and folding/unfolding dynamics. In particular, they allow the detection of transient misfolded rigid states.
Collapse
Affiliation(s)
- Steve Tyler
- Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR CNRS 6303, Université de Bourgogne, 21078 Dijon CEDEX, France
| | - Christophe Laforge
- Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR CNRS 6303, Université de Bourgogne, 21078 Dijon CEDEX, France
| | - Adrien Guzzo
- Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR CNRS 6303, Université de Bourgogne, 21078 Dijon CEDEX, France
| | - Adrien Nicolaï
- Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR CNRS 6303, Université de Bourgogne, 21078 Dijon CEDEX, France
| | - Gia G. Maisuradze
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853, USA
| | - Patrick Senet
- Laboratoire Interdisciplinaire Carnot de Bourgogne, UMR CNRS 6303, Université de Bourgogne, 21078 Dijon CEDEX, France
| |
Collapse
|
2
|
Bitran A, Park K, Serebryany E, Shakhnovich EI. Co-translational formation of disulfides guides folding of the SARS-CoV-2 receptor binding domain. Biophys J 2023; 122:3238-3253. [PMID: 37422697 PMCID: PMC10465708 DOI: 10.1016/j.bpj.2023.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 05/27/2023] [Accepted: 07/03/2023] [Indexed: 07/10/2023] Open
Abstract
Many secreted proteins, including viral proteins, contain multiple disulfide bonds. How disulfide formation is coupled to protein folding in the cell remains poorly understood at the molecular level. Here, we combine experiment and simulation to address this question as it pertains to the SARS-CoV-2 receptor binding domain (RBD). We show that the RBD can only refold reversibly if its native disulfides are present before folding. But in their absence, the RBD spontaneously misfolds into a nonnative, molten-globule-like state that is structurally incompatible with complete disulfide formation and that is highly prone to aggregation. Thus, the RBD native structure represents a metastable state on the protein's energy landscape with reduced disulfides, indicating that nonequilibrium mechanisms are needed to ensure native disulfides form before folding. Our atomistic simulations suggest that this may be achieved via co-translational folding during RBD secretion into the endoplasmic reticulum. Namely, at intermediate translation lengths, native disulfide pairs are predicted to come together with high probability, and thus, under suitable kinetic conditions, this process may lock the protein into its native state and circumvent highly aggregation-prone nonnative intermediates. This detailed molecular picture of the RBD folding landscape may shed light on SARS-CoV-2 pathology and molecular constraints governing SARS-CoV-2 evolution.
Collapse
Affiliation(s)
- Amir Bitran
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; PhD Program in Biophysics, Harvard University, Cambridge, Massachusetts.
| | - Kibum Park
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene Serebryany
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
3
|
Bitran A, Park K, Serebryany E, Shakhnovich EI. Cotranslational formation of disulfides guides folding of the SARS COV-2 receptor binding domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.11.10.516025. [PMID: 36380756 PMCID: PMC9665344 DOI: 10.1101/2022.11.10.516025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Many secreted proteins contain multiple disulfide bonds. How disulfide formation is coupled to protein folding in the cell remains poorly understood at the molecular level. Here, we combine experiment and simulation to address this question as it pertains to the SARS-CoV-2 receptor binding domain (RBD). We show that, whereas RBD can refold reversibly when its disulfides are intact, their disruption causes misfolding into a nonnative molten-globule state that is highly prone to aggregation and disulfide scrambling. Thus, non-equilibrium mechanisms are needed to ensure disulfides form prior to folding in vivo. Our simulations suggest that co-translational folding may accomplish this, as native disulfide pairs are predicted to form with high probability at intermediate lengths, ultimately committing the RBD to its metastable native state and circumventing nonnative intermediates. This detailed molecular picture of the RBD folding landscape may shed light on SARS-CoV-2 pathology and molecular constraints governing SARS-CoV-2 evolution.
Collapse
|
4
|
Newaz K, Piland J, Clark PL, Emrich SJ, Li J, Milenković T. Multi-layer sequential network analysis improves protein 3D structural classification. Proteins 2022; 90:1721-1731. [PMID: 35441395 PMCID: PMC9356989 DOI: 10.1002/prot.26349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 03/04/2022] [Accepted: 03/30/2022] [Indexed: 11/08/2022]
Abstract
Protein structural classification (PSC) is a supervised problem of assigning proteins into pre-defined structural (e.g., CATH or SCOPe) classes based on the proteins' sequence or 3D structural features. We recently proposed PSC approaches that model protein 3D structures as protein structure networks (PSNs) and analyze PSN-based protein features, which performed better than or comparable to state-of-the-art sequence or other 3D structure-based PSC approaches. However, existing PSN-based PSC approaches model the whole 3D structure of a protein as a static (i.e., single-layer) PSN. Because folding of a protein is a dynamic process, where some parts (i.e., sub-structures) of a protein fold before others, modeling the 3D structure of a protein as a PSN that captures the sub-structures might further help improve the existing PSC performance. Here, we propose to model 3D structures of proteins as multi-layer sequential PSNs that approximate 3D sub-structures of proteins, with the hypothesis that this will improve upon the current state-of-the-art PSC approaches that are based on single-layer PSNs (and thus upon the existing state-of-the-art sequence and other 3D structural approaches). Indeed, we confirm this on 72 datasets spanning ~44 000 CATH and SCOPe protein domains.
Collapse
Affiliation(s)
- Khalique Newaz
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA,Center for Data and Computing in Natural Sciences (CDCS), Institute for Computational Systems Biology, Universität Hamburg, Hamburg, 20146, Germany
| | - Jacob Piland
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Patricia L. Clark
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Scott J. Emrich
- Department of Electrical Engineering and Computer Science; University of Tennessee, Knoxville, TN 37996, USA
| | - Jun Li
- Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Tijana Milenković
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
| |
Collapse
|
5
|
McBride JM, Tlusty T. Slowest-first protein translation scheme: Structural asymmetry and co-translational folding. Biophys J 2021; 120:5466-5477. [PMID: 34813729 PMCID: PMC8715247 DOI: 10.1016/j.bpj.2021.11.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 09/30/2021] [Accepted: 11/17/2021] [Indexed: 11/19/2022] Open
Abstract
Proteins are translated from the N to the C terminus, raising the basic question of how this innate directionality affects their evolution. To explore this question, we analyze 16,200 structures from the Protein Data Bank (PDB). We find remarkable enrichment of α helices at the C terminus and β strands at the N terminus. Furthermore, this α-β asymmetry correlates with sequence length and contact order, both determinants of folding rate, hinting at possible links to co-translational folding (CTF). Hence, we propose the "slowest-first" scheme, whereby protein sequences evolved structural asymmetry to accelerate CTF: the slowest of the cooperatively folding segments are positioned near the N terminus so they have more time to fold during translation. A phenomenological model predicts that CTF can be accelerated by asymmetry in folding rate, up to double the rate, when folding time is commensurate with translation time; analysis of the PDB predicts that structural asymmetry is indeed maximal in this regime. This correspondence is greater in prokaryotes, which generally require faster protein production. Altogether, this indicates that accelerating CTF is a substantial evolutionary force whose interplay with stability and functionality is encoded in secondary structure asymmetry.
Collapse
Affiliation(s)
- John M McBride
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan, South Korea.
| | - Tsvi Tlusty
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan, South Korea; Departments of Physics and Chemistry, Ulsan National Institute of Science and Technology, Ulsan, South Korea.
| |
Collapse
|
6
|
Validation of DBFOLD: An efficient algorithm for computing folding pathways of complex proteins. PLoS Comput Biol 2020; 16:e1008323. [PMID: 33196646 PMCID: PMC7704049 DOI: 10.1371/journal.pcbi.1008323] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 11/30/2020] [Accepted: 10/17/2020] [Indexed: 11/19/2022] Open
Abstract
Atomistic simulations can provide valuable, experimentally-verifiable insights into protein folding mechanisms, but existing ab initio simulation methods are restricted to only the smallest proteins due to severe computational speed limits. The folding of larger proteins has been studied using native-centric potential functions, but such models omit the potentially crucial role of non-native interactions. Here, we present an algorithm, entitled DBFOLD, which can predict folding pathways for a wide range of proteins while accounting for the effects of non-native contacts. In addition, DBFOLD can predict the relative rates of different transitions within a protein’s folding pathway. To accomplish this, rather than directly simulating folding, our method combines equilibrium Monte-Carlo simulations, which deploy enhanced sampling, with unfolding simulations at high temperatures. We show that under certain conditions, trajectories from these two types of simulations can be jointly analyzed to compute unknown folding rates from detailed balance. This requires inferring free energies from the equilibrium simulations, and extrapolating transition rates from the unfolding simulations to lower, physiologically-reasonable temperatures at which the native state is marginally stable. As a proof of principle, we show that our method can accurately predict folding pathways and Monte-Carlo rates for the well-characterized Streptococcal protein G. We then show that our method significantly reduces the amount of computation time required to compute the folding pathways of large, misfolding-prone proteins that lie beyond the reach of existing direct simulation. Our algorithm, which is available online, can generate detailed atomistic models of protein folding mechanisms while shedding light on the role of non-native intermediates which may crucially affect organismal fitness and are frequently implicated in disease. Many proteins must adopt a specific structure in order to function. Computational simulations have been used to shed light on the mechanisms of protein folding, but unfortunately, realistic simulations can typically only be run for small proteins, due to severe limits in computational speed. Here, we present a method to solve this problem, whereby instead of directly simulating folding from an unfolded state, we run simulations that allow for computation of equilibrium folding free energies, alongside high temperature simulations to compute unfolding rates. From these quantities, folding rates can be computed using detailed balance. Importantly, our method can account for the effects of nonnative contacts which transiently form during folding and must be broken prior to adoption of the native state. Such contacts, which are often excluded from simple models of folding, may crucially affect real protein folding pathways and are often observed in folding intermediates implicated in disease.
Collapse
|
7
|
Wolfe AD, Li S, Goedderz C, Chen XS. The structure of APOBEC1 and insights into its RNA and DNA substrate selectivity. NAR Cancer 2020; 2:zcaa027. [PMID: 33094286 PMCID: PMC7556403 DOI: 10.1093/narcan/zcaa027] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 09/05/2020] [Accepted: 09/10/2020] [Indexed: 02/06/2023] Open
Abstract
APOBEC1 (APO1), a member of AID/APOBEC nucleic acid cytosine deaminase family, can edit apolipoprotein B mRNA to regulate cholesterol metabolism. This APO1 RNA editing activity requires a cellular cofactor to achieve tight regulation. However, no cofactors are required for deamination on DNA by APO1 and other AID/APOBEC members, and aberrant deamination on genomic DNA by AID/APOBEC deaminases has been linked to cancer. Here, we present the crystal structure of APO1, which reveals a typical APOBEC deaminase core structure, plus a unique well-folded C-terminal domain that is highly hydrophobic. This APO1 C-terminal hydrophobic domain (A1HD) interacts to form a stable dimer mainly through hydrophobic interactions within the dimer interface to create a four-stranded β-sheet positively charged surface. Structure-guided mutagenesis within this and other regions of APO1 clarified the importance of the A1HD in directing RNA and cofactor interactions, providing insights into the structural basis of selectivity on DNA or RNA substrates.
Collapse
Affiliation(s)
- Aaron D Wolfe
- Genetics, Molecular and Cellular Biology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089, USA
| | - Shuxing Li
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Cody Goedderz
- Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Xiaojiang S Chen
- Genetics, Molecular and Cellular Biology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
8
|
Effects of Single Mutations on Protein Stability Are Gaussian Distributed. Biophys J 2020; 118:2872-2878. [PMID: 32416078 DOI: 10.1016/j.bpj.2020.04.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 04/14/2020] [Accepted: 04/24/2020] [Indexed: 12/16/2022] Open
Abstract
The distribution of protein stability effects is known to be well approximated by a Gaussian distribution from previous empirical fits. Starting from first-principles statistical mechanics, we more rigorously motivate this empirical observation by deriving per-residue-position protein stability effects to be Gaussian. Our derivation requires the number of amino acids to be large, which is satisfied by the standard set of 20 amino acids found in nature. No assumption is needed on the number of residues in close proximity in space, in contrast to previous applications of the central limit theorem to protein energetics. We support our derivation results with computational and experimental data on mutant protein stabilities across all types of protein residues.
Collapse
|
9
|
Cotranslational folding allows misfolding-prone proteins to circumvent deep kinetic traps. Proc Natl Acad Sci U S A 2020; 117:1485-1495. [PMID: 31911473 DOI: 10.1073/pnas.1913207117] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Many large proteins suffer from slow or inefficient folding in vitro. It has long been known that this problem can be alleviated in vivo if proteins start folding cotranslationally. However, the molecular mechanisms underlying this improvement have not been well established. To address this question, we use an all-atom simulation-based algorithm to compute the folding properties of various large protein domains as a function of nascent chain length. We find that for certain proteins, there exists a narrow window of lengths that confers both thermodynamic stability and fast folding kinetics. Beyond these lengths, folding is drastically slowed by nonnative interactions involving C-terminal residues. Thus, cotranslational folding is predicted to be beneficial because it allows proteins to take advantage of this optimal window of lengths and thus avoid kinetic traps. Interestingly, many of these proteins' sequences contain conserved rare codons that may slow down synthesis at this optimal window, suggesting that synthesis rates may be evolutionarily tuned to optimize folding. Using kinetic modeling, we show that under certain conditions, such a slowdown indeed improves cotranslational folding efficiency by giving these nascent chains more time to fold. In contrast, other proteins are predicted not to benefit from cotranslational folding due to a lack of significant nonnative interactions, and indeed these proteins' sequences lack conserved C-terminal rare codons. Together, these results shed light on the factors that promote proper protein folding in the cell and how biomolecular self-assembly may be optimized evolutionarily.
Collapse
|
10
|
Hoffer NQ, Woodside MT. Probing microscopic conformational dynamics in folding reactions by measuring transition paths. Curr Opin Chem Biol 2019; 53:68-74. [PMID: 31479831 DOI: 10.1016/j.cbpa.2019.07.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 07/08/2019] [Accepted: 07/20/2019] [Indexed: 12/20/2022]
Abstract
Transition paths comprise those parts of a folding trajectory where the molecule passes through the high-energy transition states separating folded and unfolded conformations. The transition states determine the folding kinetics and mechanism but are difficult to observe because of their brief duration. Single-molecule experiments have in recent years begun to characterize transition paths in folding reactions, allowing the microscopic conformational dynamics that occur as a molecule traverses the energy barriers to be probed directly. Here we review single-molecule fluorescence and force spectroscopy measurements of transition-path properties, including the time taken to traverse the paths, the local velocity along them, the path shapes, and the variability within these measurements reflecting differences between individual barrier crossings. We discuss how these measurements have been related to theories of folding as diffusion over an energy landscape to deduce properties such as the diffusion coefficient, and how they are being combined with simulations to obtain enhanced atomistic understanding of folding. The richly detailed information available from transition path measurements holds great promise for improved understanding of microscopic mechanisms in folding.
Collapse
Affiliation(s)
- Noel Q Hoffer
- Department of Physics, University of Alberta, Edmonton, AB T6G 2E1, Canada
| | - Michael T Woodside
- Department of Physics, University of Alberta, Edmonton, AB T6G 2E1, Canada.
| |
Collapse
|
11
|
Jacobs WM, Shakhnovich EI. Accurate Protein-Folding Transition-Path Statistics from a Simple Free-Energy Landscape. J Phys Chem B 2018; 122:11126-11136. [PMID: 30091592 DOI: 10.1021/acs.jpcb.8b05842] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A central goal of protein-folding theory is to predict the stochastic dynamics of transition paths-the rare trajectories that transit between the folded and unfolded ensembles-using only thermodynamic information, such as a low-dimensional equilibrium free-energy landscape. However, commonly used one-dimensional landscapes typically fall short of this aim, because an empirical coordinate-dependent diffusion coefficient has to be fit to transition-path trajectory data in order to reproduce the transition-path dynamics. We show that an alternative, first-principles free-energy landscape predicts transition-path statistics that agree well with simulations and single-molecule experiments without requiring dynamical data as an input. This "topological configuration" model assumes that distinct, native-like substructures assemble on a time scale that is slower than native-contact formation but faster than the folding of the entire protein. Using only equilibrium simulation data to determine the free energies of these coarse-grained intermediate states, we predict a broad distribution of transition-path transit times that agrees well with the transition-path durations observed in simulations. We further show that both the distribution of finite-time displacements on a one-dimensional order parameter and the ensemble of transition-path trajectories generated by the model are consistent with the simulated transition paths. These results indicate that a landscape based on transient folding intermediates, which are often hidden by one-dimensional projections, can form the basis of a predictive model of protein-folding transition-path dynamics.
Collapse
Affiliation(s)
- William M Jacobs
- Department of Chemistry and Chemical Biology , Harvard University , 12 Oxford Street , Cambridge , Massachusetts 02138 , United States
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology , Harvard University , 12 Oxford Street , Cambridge , Massachusetts 02138 , United States
| |
Collapse
|
12
|
Sato A, Menez A. External release of entropy by synchronized movements of local secondary structures drives folding of a small, disulfide-bonded protein. PLoS One 2018; 13:e0198276. [PMID: 29894484 PMCID: PMC5997310 DOI: 10.1371/journal.pone.0198276] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 05/16/2018] [Indexed: 11/22/2022] Open
Abstract
A crucial mechanism to the formation of native, fully functional, 3D structures from local secondary structures is unraveled in this study. Through the introduction of various amino acid substitutions at four canonical β-turns in a three-fingered protein, Toxin α from Naja nigricollis, we found that the release of internal entropy to the external environment through the globally synchronized movements of local substructures plays a crucial role. Throughout the folding process, the folding species were saturated with internal entropy so that intermediates accumulated at the equilibrium state. Their relief from the equilibrium state was accomplished by the formation of a critical disulfide bridge, which could guide the synchronized movement of one of the peripheral secondary structure. This secondary structure collided with a core central structure, which flanked another peripheral secondary structure. This collision displaced the internal thermal fluctuations from the first peripheral structure to the second peripheral structure, where the displaced thermal fluctuations were ultimately released as entropy. Two protein folding processes that acted in succession were identified as the means to establish the flow of thermal fluctuations. The first process was the time-consuming assembly process, where stochastic combinations of colliding, native-like, secondary structures provided candidate structures for the folded protein. The second process was the activation process to establish the global mutual relationships of the native protein in the selected candidate. This activation process was initiated and propagated by a positive feedback process between efficient entropy release and well-packed local structures, which moved in synchronization. The molecular mechanism suggested by this experiment was assessed with a well-defined 3D structure of erabutoxin b because one of the turns that played a critical role in folding was shared with erabutoxin b.
Collapse
Affiliation(s)
- Atsushi Sato
- Department of Information Science, Faculty of Liberal Arts, Tohoku Gakuin University, Sendai, Japan
- * E-mail:
| | | |
Collapse
|
13
|
Oyarzún B, Mognetti BM. Efficient sampling of reversible cross-linking polymers: Self-assembly of single-chain polymeric nanoparticles. J Chem Phys 2018; 148:114110. [PMID: 29566497 DOI: 10.1063/1.5020158] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
We present a new simulation technique to study systems of polymers functionalized by reactive sites that bind/unbind forming reversible linkages. Functionalized polymers feature self-assembly and responsive properties that are unmatched by the systems lacking selective interactions. The scales at which the functional properties of these materials emerge are difficult to model, especially in the reversible regime where such properties result from many binding/unbinding events. This difficulty is related to large entropic barriers associated with the formation of intra-molecular loops. In this work, we present a simulation scheme that sidesteps configurational costs by dedicated Monte Carlo moves capable of binding/unbinding reactive sites in a single step. Cross-linking reactions are implemented by trial moves that reconstruct chain sections attempting, at the same time, a dimerization reaction between pairs of reactive sites. The model is parametrized by the reaction equilibrium constant of the reactive species free in solution. This quantity can be obtained by means of experiments or atomistic/quantum simulations. We use the proposed methodology to study the self-assembly of single-chain polymeric nanoparticles, starting from flexible precursors carrying regularly or randomly distributed reactive sites. We focus on understanding differences in the morphology of chain nanoparticles when linkages are reversible as compared to the well-studied case of irreversible reactions. Intriguingly, we find that the size of regularly functionalized chains, in good solvent conditions, is non-monotonous as a function of the degree of functionalization. We clarify how this result follows from excluded volume interactions and is peculiar of reversible linkages and regular functionalizations.
Collapse
Affiliation(s)
- Bernardo Oyarzún
- Interdisciplinary Center for Nonlinear Phenomena and Complex Systems, Université Libre de Bruxelles (ULB), Campus Plaine, CP 231, Blvd. du Triomphe, B-1050 Brussels, Belgium
| | - Bortolo Matteo Mognetti
- Interdisciplinary Center for Nonlinear Phenomena and Complex Systems, Université Libre de Bruxelles (ULB), Campus Plaine, CP 231, Blvd. du Triomphe, B-1050 Brussels, Belgium
| |
Collapse
|
14
|
Hu J, Chen T, Wang M, Chan HS, Zhang Z. A critical comparison of coarse-grained structure-based approaches and atomic models of protein folding. Phys Chem Chem Phys 2018; 19:13629-13639. [PMID: 28530269 DOI: 10.1039/c7cp01532a] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Structure-based coarse-grained Gō-like models have been used extensively in deciphering protein folding mechanisms because of their simplicity and tractability. Meanwhile, explicit-solvent molecular dynamics (MD) simulations with physics-based all-atom force fields have been applied successfully to simulate folding/unfolding transitions for several small, fast-folding proteins. To explore the degree to which coarse-grained Gō-like models and their extensions to incorporate nonnative interactions are capable of producing folding processes similar to those in all-atom MD simulations, here we systematically compare the computed unfolded states, transition states, and transition paths obtained using coarse-grained models and all-atom explicit-solvent MD simulations. The conformations in the unfolded state in common Gō models are more extended, and are thus more in line with experiment, than those from all-atom MD simulations. Nevertheless, the structural features of transition states obtained by the two types of models are largely similar. In contrast, the folding transition paths are significantly more sensitive to modeling details. In particular, when common Gō-like models are augmented with nonnative interactions, the predicted dimensions of the unfolded conformations become similar to those computed using all-atom MD. With this connection, the large deviations of all-atom MD from simple diffusion theory are likely caused in part by the presence of significant nonnative effects in folding processes modelled by current atomic force fields. The ramifications of our findings to the application of coarse-grained modeling to more complex biomolecular systems are discussed.
Collapse
Affiliation(s)
- Jie Hu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| | | | | | | | | |
Collapse
|
15
|
Gopi S, Paul S, Ranu S, Naganathan AN. Extracting the Hidden Distributions Underlying the Mean Transition State Structures in Protein Folding. J Phys Chem Lett 2018; 9:1771-1777. [PMID: 29565127 DOI: 10.1021/acs.jpclett.8b00538] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The inherent conflict between noncovalent interactions and the large conformational entropy of the polypeptide chain forces folding reactions and their mechanisms to deviate significantly from chemical reactions. Accordingly, measures of structure in the transition state ensemble (TSE) are strongly influenced by the underlying distributions of microscopic folding pathways that are challenging to discern experimentally. Here, we present a detailed analysis of 150,000 folding transition paths of five proteins at three different thermodynamic conditions from an experimentally consistent statistical mechanical model. We find that the underlying TSE structural distributions are rarely unimodal, and the average experimental measures arise from complex underlying distributions. Unfolding pathways also exhibit subtle differences from folding counterparts due to a combination of Hammond behavior and native-state movements. Local interactions and topological complexity, to a lesser extent, are found to determine pathway heterogeneity, underscoring the importance of the balance between local and nonlocal energetics in protein folding.
Collapse
|
16
|
Chung HS, Eaton WA. Protein folding transition path times from single molecule FRET. Curr Opin Struct Biol 2017; 48:30-39. [PMID: 29080467 DOI: 10.1016/j.sbi.2017.10.007] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Revised: 10/03/2017] [Accepted: 10/05/2017] [Indexed: 11/28/2022]
Abstract
The transition path is the tiny segment of a single molecule trajectory when the free energy barrier between states is crossed and for protein folding contains all of the information about the self-assembly mechanism. As a first step toward obtaining structural information during the transition path from experiments, single molecule FRET spectroscopy has been used to determine average transition path times from a photon-by-photon analysis of fluorescence trajectories. These results, obtained for several different proteins, have already provided new and demanding tests that support both the accuracy of all-atom molecular dynamics simulations and the basic postulates of energy landscape theory of protein folding.
Collapse
Affiliation(s)
- Hoi Sung Chung
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0520, United States.
| | - William A Eaton
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0520, United States.
| |
Collapse
|
17
|
Abstract
Recent experiments and simulations have demonstrated that proteins can fold on the ribosome. However, the extent and generality of fitness effects resulting from cotranslational folding remain open questions. Here we report a genome-wide analysis that uncovers evidence of evolutionary selection for cotranslational folding. We describe a robust statistical approach to identify loci within genes that are both significantly enriched in slowly translated codons and evolutionarily conserved. Surprisingly, we find that domain boundaries can explain only a small fraction of these conserved loci. Instead, we propose that regions enriched in slowly translated codons are associated with cotranslational folding intermediates, which may be smaller than a single domain. We show that the intermediates predicted by a native-centric model of cotranslational folding account for the majority of these loci across more than 500 Escherichia coli proteins. By making a direct connection to protein folding, this analysis provides strong evidence that many synonymous substitutions have been selected to optimize translation rates at specific locations within genes. More generally, our results indicate that kinetics, and not just thermodynamics, can significantly alter the efficiency of self-assembly in a biological context.
Collapse
|
18
|
Frenkel D. Folding Proteins One Loop at a Time. Biophys J 2016; 111:893-4. [PMID: 27602714 DOI: 10.1016/j.bpj.2016.07.032] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 07/26/2016] [Indexed: 10/21/2022] Open
|